Skip to main content

Table 1 Statistics of the compound SMILES corpora

From: ELECTRA-DTA: a new compound-protein binding affinity prediction model based on the contextualized sequence encoding

No. of corpus Average length of the corpus Minimum length of the corpus No. of vocabulary
1114424 47 3 72