Fig. 9
From: SIMPD: an algorithm for generating simulated time splits for validating machine learning approaches
Summary of the 99 data sets extracted from ChEMBL32. (Top): Number of compounds in each data set. (Bottom): Distribution of median AUC values for random forest models built using MFP2 and random splitting for these data sets.