Searching substructures in fragment spaces
Journal of Cheminformatics volume 3, Article number: P11 (2011)
Fragment spaces (FSs) are an elegant way to model a large or even infinite number of chemical compounds together with their synthetic accessibility. An FS consists of molecular fragments and a set of rules defining how fragments can be combined into products. In virtual screening experiments, FSs might include products with undesired functional groups or inadequate central building blocks. Recognizing such products, especially when the offending substructure spans multiple fragments, would require constructing them explicitly from the FS. Because the number of possible products in an FS is generally huge, complete enumeration is undesirable or even impossible. Therefore, algorithms that perform substructure search in FSs must be able to process fragments and joining rules rather than complete molecules. Even though some algorithms that work in FSs exist [1, 2], a method that excludes undesired products from an FS based on a substructure definition is still missing.
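To make the notion of fragments plus joining rules concrete, the following minimal sketch models a toy fragment space and lists which attachment sites may be joined. The fragment names, the link types `L1` and `L2`, and the helper `joinable_site_pairs` are illustrative assumptions, not the data model of any published FS implementation.

```python
from dataclasses import dataclass
from itertools import combinations_with_replacement


@dataclass(frozen=True)
class Fragment:
    name: str
    link_types: tuple  # one link type per open attachment site


# Hypothetical toy space: three fragments and one joining rule (L1 bonds to L2).
fragments = [
    Fragment("amine", ("L1",)),
    Fragment("acid", ("L2",)),
    Fragment("linker", ("L1", "L2")),
]
rules = {frozenset(("L1", "L2"))}


def joinable_site_pairs(fragments, rules):
    """Return all unordered pairs of attachment sites whose link types are compatible.

    A site may pair with itself, which stands for joining two copies of
    the same fragment at the same site.
    """
    sites = [(f.name, i, t) for f in fragments for i, t in enumerate(f.link_types)]
    return [
        (a, b)
        for a, b in combinations_with_replacement(sites, 2)
        if frozenset((a[2], b[2])) in rules
    ]


pairs = joinable_site_pairs(fragments, rules)

# A fragment that can bond to a copy of itself through two different sites
# can be chained to arbitrary length, so the product set is unbounded.
self_extending = [a[0] for a, b in pairs if a[0] == b[0] and a[1] != b[1]]

print(len(pairs), self_extending)
```

In this toy space the two-site linker can bond to a copy of itself, so even three fragments and one rule describe an unbounded set of products, which is why explicit enumeration does not scale.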
We present and compare two algorithms to modify an FS such that no possible product can include a given functional group or substructure. The methods use a search procedure based on the Ullmann [3] and the VF2 [4] algorithm for subgraph isomorphism, respectively. In this way, we find substructures that are present inside fragments or would be formed by joining two fragments. Once such fragments are identified, they are either removed from the FS or their joining rules are altered so that formation of the substructure becomes impossible.
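The overall idea can be illustrated with a small sketch. It is not the authors' implementation: the matcher is a plain backtracking subgraph search rather than Ullmann or VF2, bond orders are ignored, only pairwise joins are checked, and the fragment encoding (element labels, `"R"` for a dummy link atom, a `links` map from atom index to link type) as well as the names `contains`, `join` and `filter_space` are assumptions made for this example.

```python
from itertools import combinations_with_replacement


def graph(atoms, bonds):
    """Build (labels, adjacency) from a label list and a list of index pairs."""
    adj = {i: set() for i in range(len(atoms))}
    for i, j in bonds:
        adj[i].add(j)
        adj[j].add(i)
    return list(atoms), adj


def contains(target, pattern):
    """Plain backtracking subgraph search (stand-in for Ullmann or VF2)."""
    t_atoms, t_adj = target
    p_atoms, p_adj = pattern

    def extend(mapping):  # mapping[k] is the target atom matched to pattern atom k
        k = len(mapping)
        if k == len(p_atoms):
            return True
        for cand in range(len(t_atoms)):
            if cand in mapping or t_atoms[cand] != p_atoms[k]:
                continue
            # Already-mapped pattern neighbours of k must be neighbours of cand.
            if all(mapping[n] in t_adj[cand] for n in p_adj[k] if n < k):
                if extend(mapping + [cand]):
                    return True
        return False

    return extend([])


def join(fa, la, fb, lb):
    """Join two fragments: drop both dummy link atoms and bond their neighbours."""
    atoms, index, bonds, anchors = [], {}, [], []
    for tag, frag, link in (("a", fa, la), ("b", fb, lb)):
        for i, element in enumerate(frag["atoms"]):
            if i != link:
                index[tag, i] = len(atoms)
                atoms.append(element)
        for i, j in frag["bonds"]:
            if link in (i, j):
                anchors.append(index[tag, j if i == link else i])
            else:
                bonds.append((index[tag, i], index[tag, j]))
    bonds.append(tuple(anchors))
    return graph(atoms, bonds)


def filter_space(fragments, rules, pattern):
    """Remove fragments containing the pattern; collect joins that would create it."""
    kept = {
        name: f
        for name, f in fragments.items()
        if not contains(graph(f["atoms"], f["bonds"]), pattern)
    }
    forbidden = set()
    for na, nb in combinations_with_replacement(sorted(kept), 2):
        for la, ta in kept[na]["links"].items():
            for lb, tb in kept[nb]["links"].items():
                allowed = frozenset((ta, tb)) in rules
                if allowed and contains(join(kept[na], la, kept[nb], lb), pattern):
                    forbidden.add((na, la, nb, lb))
    return kept, forbidden


# Hypothetical toy data: exclude the peroxide pattern O-O.
fragments = {
    "methoxy": {"atoms": ["C", "O", "R"], "bonds": [(0, 1), (1, 2)], "links": {2: "L1"}},
    "ethyl": {"atoms": ["C", "C", "R"], "bonds": [(0, 1), (1, 2)], "links": {2: "L1"}},
    "peroxide": {
        "atoms": ["R", "C", "O", "O", "C"],
        "bonds": [(0, 1), (1, 2), (2, 3), (3, 4)],
        "links": {0: "L1"},
    },
}
rules = {frozenset(("L1", "L1"))}  # L1 may bond to L1
pattern = graph(["O", "O"], [(0, 1)])

kept, forbidden = filter_space(fragments, rules, pattern)
print(sorted(kept), forbidden)
```

Here the `peroxide` fragment is dropped because it contains O-O internally, while `methoxy` is kept but its join with a second `methoxy` is recorded as forbidden, since that join would create the O-O bond. A pattern spanning three or more fragments would need a more careful treatment than this pairwise check.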
1. Degen J, Wegscheid-Gerlach C, Zaliani A, Rarey M: On the art of compiling and using 'drug-like' chemical fragment spaces. ChemMedChem. 2008, 3 (10): 1503-1507. 10.1002/cmdc.200800178.
2. Rarey M, Stahl M: Similarity searching in large combinatorial chemistry spaces. J Comput Aided Mol Des. 2001, 15 (6): 497-520. 10.1023/A:1011144622059.
3. Ullmann JR: An algorithm for subgraph isomorphism. J Assoc Comput Mach. 1976, 23: 31-42.
4. Cordella LP, Foggia P, Sansone C, Vento M: A (sub)graph isomorphism algorithm for matching large graphs. IEEE T-PAMI. 2004, 26 (10): 1367-1372.
5. Schomburg K, Ehrlich H-C, Stierand K, Rarey M: From structure diagrams to visual chemical patterns. J Chem Inf Model. 2010,
Cite this article
Ehrlich, HC., Rarey, M. Searching substructures in fragment spaces. J Cheminform 3 (Suppl 1), P11 (2011). https://doi.org/10.1186/1758-2946-3-S1-P11
- Virtual Screening
- Explicit Construction
- Screening Experiment
- Molecular Fragment
- Subgraph Isomorphism