Table 2. To ensure that the accuracies achieved for the ECHA dataset were not due to a favorable train-test split, we also performed 5-fold cross-validation on the entire dataset. The classification accuracy and weighted F1 score for each fold are summarized here (higher is better).
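For readers who wish to reproduce this kind of check, the following is a minimal sketch of 5-fold cross-validation reporting accuracy and support-weighted F1 per fold. It is not the authors' code: the helper names (`k_fold_indices`, `weighted_f1`, `cross_validate`) and the `fit_predict` callback are hypothetical, and the sketch assumes a plain shuffled split rather than a stratified one, since the splitting strategy is not stated here.

```python
import random
from collections import Counter


def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle the sample indices once and deal them into k near-equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def weighted_f1(y_true, y_pred):
    """Per-class F1, averaged with weights equal to each class's support."""
    support = Counter(y_true)
    total = 0.0
    for label, n in support.items():
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = n - tp
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        total += n * f1
    return total / len(y_true)


def cross_validate(X, y, fit_predict, k=5, seed=0):
    """Return one (accuracy, weighted F1) pair per fold.

    fit_predict(X_train, y_train, X_test) is a caller-supplied (hypothetical)
    function that trains a model and returns predictions for X_test.
    """
    folds = k_fold_indices(len(y), k, seed)
    scores = []
    for i, test_idx in enumerate(folds):
        # Every fold except the i-th is used for training.
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        y_pred = fit_predict(
            [X[j] for j in train_idx],
            [y[j] for j in train_idx],
            [X[j] for j in test_idx],
        )
        y_test = [y[j] for j in test_idx]
        scores.append((accuracy(y_test, y_pred), weighted_f1(y_test, y_pred)))
    return scores
```

Because every sample appears in exactly one test fold, a small spread of scores across the folds suggests that a single train-test split was not unusually favorable.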