Fig. 1From: Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approachAn illustrative example for the effects of related fingerprints on similarity measures. A hypothetical fingerprint scheme with nine bit keys (\(F_1\) to \(F_9\)) is used to represent small molecules in a hypothetical compound dataset. The fingerprint matrix of this dataset is found to have a perfect multicollinearity in the first four features with \(2\,F_1 = F_2 + F_3 + F_4\). The similarity of a query compound against three compounds is computed using Tanomoto coefficient (Tc) with and without this collinearity. For the results without the collinearity, the Tanimoto coefficient without the first four features (\(F_1\) to \(F_4\)) is shownBack to article page