Skip to main content

Table 1 Statistics of the compound SMILES corpora

From: ELECTRA-DTA: a new compound-protein binding affinity prediction model based on the contextualized sequence encoding

No. of corpus

Average length of the corpus

Minimum length of the corpus

No. of vocabulary

1114424

47

3

72