Fig. 1 | Journal of Cheminformatics

Fig. 1

From: Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach

An illustrative example of the effects of related fingerprints on similarity measures. A hypothetical fingerprint scheme with nine bit keys (\(F_1\) to \(F_9\)) is used to represent small molecules in a hypothetical compound dataset. The fingerprint matrix of this dataset exhibits perfect multicollinearity among the first four features, with \(2\,F_1 = F_2 + F_3 + F_4\). The similarity of a query compound to three compounds is computed using the Tanimoto coefficient (Tc) with and without this collinearity. For the results without the collinearity, the Tanimoto coefficient computed without the first four features (\(F_1\) to \(F_4\)) is shown.
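The comparison described in the caption can be sketched in a few lines of Python. The bit vectors below are invented for illustration and are not the values shown in the figure; they are only chosen to satisfy the stated constraint \(2\,F_1 = F_2 + F_3 + F_4\). The helper names `tanimoto` and `satisfies_collinearity` are likewise hypothetical.

```python
def tanimoto(a, b):
    """Tanimoto coefficient of two equal-length bit lists."""
    both = sum(x & y for x, y in zip(a, b))    # bits set in both
    either = sum(x | y for x, y in zip(a, b))  # bits set in at least one
    return both / either if either else 0.0


def satisfies_collinearity(fp):
    """Check the caption's constraint 2*F1 = F2 + F3 + F4."""
    return 2 * fp[0] == fp[1] + fp[2] + fp[3]


# Hypothetical 9-bit fingerprints (F1..F9); NOT the data from the figure.
query = [1, 1, 1, 0, 1, 0, 1, 0, 0]
compound = [1, 1, 0, 1, 0, 1, 1, 0, 0]

# Tc with the collinear block F1..F4 included.
tc_full = tanimoto(query, compound)

# Tc with the first four features dropped, as in the caption.
tc_reduced = tanimoto(query[4:], compound[4:])

print(f"Tc with F1-F4:    {tc_full:.3f}")
print(f"Tc without F1-F4: {tc_reduced:.3f}")
```

For this made-up pair, the two values differ because the collinear block contributes shared and unshared bits that are fully determined by one another, which is the kind of effect the figure illustrates.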
