Table 1 Statistics of the training and test sets

From: Improving chemical similarity ensemble approach in target prediction

| Data set | Targets | Molecules | Ligand-target pairs | Active |
|---|---|---|---|---|
| Training set (5) | 2,809 | 393,090 | 666,313 | All |
| Training set (6) | 2,297 | 294,877 | 407,296 | All |
| Training set (7) | 1,711 | 179,710 | 246,651 | All |
| Kinase training set (5) | 429 | 42,164 | 101,502 | All |
| Test set | 1,190 | 26,498 | 80,066 | 37,138 |
| Kinase test set | 259 | 2,225 | 3,010 | 2,192 |
  1. Sizes of the four training sets and the two test sets. Numbers in brackets denote activity thresholds.