Table 2 Statistics of protein sequence corpora

From: ELECTRA-DTA: a new compound-protein binding affinity prediction model based on the contextualized sequence encoding

No. of sequences in the corpus | Average sequence length | Minimum sequence length | Vocabulary size
1868198 | 382 | 2 | 30