Table 2 Statistics of protein sequence corpora

From: ELECTRA-DTA: a new compound-protein binding affinity prediction model based on the contextualized sequence encoding

| No. of corpus | Average length of the corpus | Minimum length of the corpus | No. of vocabulary |
| --- | --- | --- | --- |
| 1868198 | 382 | 2 | 30 |
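
As a rough illustration of how statistics like these could be computed, the sketch below summarizes a list of protein sequences. It rests on assumptions not confirmed by the source: that "No. of corpus" counts sequences, that lengths are measured in characters (residues), and that the vocabulary is the set of distinct characters. The paper's tokenizer may count special tokens as well. The function name `corpus_stats` and the toy sequences are hypothetical.

```python
from statistics import mean


def corpus_stats(sequences):
    """Summarize a corpus given as a list of sequence strings.

    Assumption: vocabulary means the distinct characters seen in the corpus;
    a real tokenizer may add special tokens on top of this.
    """
    if not sequences:
        raise ValueError("corpus is empty")
    lengths = [len(s) for s in sequences]
    # Union of the characters of every sequence.
    vocabulary = set().union(*sequences)
    return {
        "num_sequences": len(sequences),
        "average_length": mean(lengths),
        "minimum_length": min(lengths),
        "vocabulary_size": len(vocabulary),
    }


# Toy data for illustration only, not taken from the paper's corpus.
toy = ["MKT", "AV", "MKTAYIAK"]
stats = corpus_stats(toy)
print(stats)
```

On a full corpus the same function would be applied to the sequences read from the corpus file, one string per sequence.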