Fig. 1From: Evaluating parameters for ligand-based modeling with random forest on sparse data setsCollisions per feature. Number of collisions per feature for different hash sizes averaged over the four datasets using the Morgan fingerprint descriptor. Error bars are standard deviation of the meanBack to article page