Database | Open | Published:
TCMSP: a database of systems pharmacology for drug discovery from herbal medicines
Journal of Cheminformaticsvolume 6, Article number: 13 (2014)
Modern medicine often clashes with traditional medicine such as Chinese herbal medicine because of the little understanding of the underlying mechanisms of action of the herbs. In an effort to promote integration of both sides and to accelerate the drug discovery from herbal medicines, an efficient systems pharmacology platform that represents ideal information convergence of pharmacochemistry, ADME properties, drug-likeness, drug targets, associated diseases and interaction networks, are urgently needed.
The traditional Chinese medicine systems pharmacology database and analysis platform (TCMSP) was built based on the framework of systems pharmacology for herbal medicines. It consists of all the 499 Chinese herbs registered in the Chinese pharmacopoeia with 29,384 ingredients, 3,311 targets and 837 associated diseases. Twelve important ADME-related properties like human oral bioavailability, half-life, drug-likeness, Caco-2 permeability, blood-brain barrier and Lipinski’s rule of five are provided for drug screening and evaluation. TCMSP also provides drug targets and diseases of each active compound, which can automatically establish the compound-target and target-disease networks that let users view and analyze the drug action mechanisms. It is designed to fuel the development of herbal medicines and to promote integration of modern medicine and traditional medicine for drug discovery and development.
The particular strengths of TCMSP are the composition of the large number of herbal entries, and the ability to identify drug-target networks and drug-disease networks, which will help revealing the mechanisms of action of Chinese herbs, uncovering the nature of TCM theory and developing new herb-oriented drugs. TCMSP is freely available at http://sm.nwsuaf.edu.cn/lsp/tcmsp.php.
Traditional herbal medicine with the longest history in Asia, is a cost-effective system of medical practice that differs in substance, methodology, and philosophy from modern medicine, and plays an important role in health maintenance for the peoples of the world . The increasing popularity of herbal products has seen the monetary value of the industry increase to hundreds of millions of dollars per annum, concomitantly, there is increasing interests and need to dissect and evaluate the complex physiological effects of herbal products rigorously.
Herbal medicines formula often combines different botanicals, sometimes containing even up to 50 species and thousands of chemical compounds. However, only a part of them exhibit favorable pharmacokinetics (the absorption, distribution, metabolism, and excretion (ADME) properties of a drug) with potential of a biological effect . Moreover, the therapeutic effects of these herbal products might arise from cooperate actions of the herbal ingredients. All these resist the conventional analytical chemistry and pharmacology technologies which intend to isolate and identify chemical constituents possessing possible pharmacological effects.
Corresponding to the complexity of the components in diverse herbs or even in one herb, herbal medicines hit multiple biological targets involved in various pathogenesis. Clearly, in a systems level to search potential compound and target interactions, the ‘dry’ experiment (computational method) should be the first choice, owing to the shortages of the ‘wet’ experiment as time-consuming, expensive and also being limited in small scale . Alternatively, a comprehensive systems-based approach, which could simultaneously prioritize all the active ingredients and their targets in the crude drugs, is necessary.
More importantly, multi-component and multi-function features in herbal concoctions make their pharmacological and toxicological effects difficult to be evaluated independently. It might be more suitable to view through the lens of systems-based approaches. By considering drug actions and side effects in the context of the regulatory networks within which the drug targets and disease gene products function, systems analysis promises to greatly increase our knowledge of the mechanisms underlying the multiple actions of drugs. Thus, the application of systems pharmacology to herbal medicine affords new possibilities for investigating the explicit targets of medicinal herbs’ active ingredients and their interactions in the context of molecular networks [4–7].
In our previous work, we have proposed a novel integrated herbal medicine systems pharmacology (HmSP) platform for the purpose of investigating how herbs interact with the human body from a molecular level to the organism level . This systems/network pharmacology methodology has been successfully applied to dissect basic TCM theories such as yin-yang theory , qi-blood , herbal synergy [10, 11], as well as to develop new drugs . However, systems pharmacology, as a novel holistic, a multi-disciplinary, integrative field, is still difficult to be widely applied. An accessible systems pharmacology platform of Chinese herbal medicines that captures the relationships between drugs, targets and diseases is urgently needed to help understand basic TCM theories, illustrate the mechanisms of action and develop new drugs.
Presently, several databases have provided useful tools in different aspects for TCM investigations. For example, TCM-ID  and TCM Database@Taiwan  provide the largest number of herbal ingredients with 3D structures and functional properties. Chem-TCM  and HIT  focus on herbal compounds and their corresponding targets. TCMID  comprises TCM formulae, herbs, ingredients and the targets and diseases. CVDHD  collects those natural products related to cardiovascular diseases and targets. Comparisons among these databases are listed on the TCMSP website.
Here, we constructed a unique systems pharmacology platform of Chinese herbal medicines, which is different from the above-mentioned databases. The newly developed TCMSP provides up-to-date, quantitative and systems information about TCM ingredients, ADME-related properties, targets and diseases. TCMSP is unique in three key ways: (1) Integration of a large scale structural data (29,384 chemicals in total with 13,144 unique molecules) with manually curated information for all registered herbs in Chinese pharmacopoeia; (2) Incorporation of 12 key ADME-related properties from diverse sources for active compound screening; (3) Establishment of the compound-target, target-disease networks for deep study of TCM theory, mechanisms of action and discovery of new drugs. In total, TCMSP contains more than 84260 compound-target pairs (CT pairs) and 2387 target-disease pairs (TD pairs).
In addition, the TCMSP website is more than a data repository. It contains tools for visualization and analysis of TCM results on the network level. Such approach to systematic and multi-target drug discovery could lead to a new generation of candidates with improved physicochemical and pharmacokinetics properties. Unexpected associations can also be revealed thereby furthering the understanding of the mechanisms of diverse interactions and potentially indicating novel treatments. Therefore, TCMSP is a powerful knowledge repository and analysis platform for chemists, biologists and pharmacologists.
Construction and content
TCMSP is divided into three major categories: (1) Compounds, targets and diseases information (Figure 1 B1, B2 and B3); (2) Herbal ingredients with their ADME-related properties (Figure 1 C1); (3) Compounds-Targets relationships (Figure 1 C2) and Targets-Diseases relationships (Figure 1 C3).
In order to gather all available information about ingredients of herbal medicines, we performed an extensive literature search for each herbal medicine. Structure files of molecules were downloaded from PubChem  Compound database, ChEMBL  and ChemSpider , or produced by ISIS Draw 2.5 (MDL Information Systems, Inc.) and further optimized by Sybyl 6.9 (Tripos, Inc.) with Sybyl force field and default parameters [2, 21]. Different format types of the chemical files were converted to SDF format by Open Babel . The duplicates were removed according to InChIKey.
To analyze the druggability of herbs on molecular level, the database was structured to incorporate several important ADME-related properties such as human oral bioavailability (OB) , half-life (HL) Yao Y, Wei Z, Yonghua W: A novel Systems Pharmacology model for herbal medicine injection: a case using Reduning Injection. submitted, drug-likeness (DL) , FASA- , Caco-2 permeability (Caco-2), blood-brain barrier (BBB) and Lipinski’s rule of five (MW, AlogP, TPSA, Hdon, Hacc) . Detailed parameters’ information, screening criteria and calculation can be obtained from TCMSP website (http://sm.nwsuaf.edu.cn/lsp/load_intro.php?id=29).
Drug targeting and disease association
Target information was obtained from DrugBank database . Drug-Target mappings were obtained from two sources. Experimental validated drug-target pairs were retrieved from HIT database . For those compounds without validated targets, the SysDT model constructed in our previous work  was used to predict the potential targets of a compound. SysDT shows impressive performance of prediction for drug-target interactions, with a concordance of 82.83%, a sensitivity of 81.33%, and a specificity of 93.62%, respectively. The disease information was obtained from TTD database  and PharmGKB (https://www.pharmgkb.org/).
Network building and analysis
In order to analyze the CT and TD relationships, we have developed a visualization interface by Cytoscape Web , from which the network can be displayed within webpage and downloaded as XGMML format. Further topological analysis can be implemented with the NetworkAnalyzer  plugin in Cytoscape software .
Website and server
TCMSP is freely available at: http://sm.nwsuaf.edu.cn/lsp/tcmsp.php. It is designed as a relational database and implemented in MySQL 5.1.63 with Apache 2.2.22 as the web server. The website is built with PHP, HTML and CSS.
Utility and discussion
There are six major sections in TCMSP website (Homepage, How to search, TCMSP User Guide, Browse, Download and Parameter information). Users can search herbal name, ingredient’s chemical name, InChIKey, CAS number, target name or disease name in the search box at the TCMSP homepage. Querying principles and database structure are illustrated in Figure 1. A movie tutorial on the “How to search” page gives users a brief scope of TCMSP database. The “TCMSP User Guide” page offers a detailed case study. From the “Browse” page, users can browse all the herbal medicines, herbal ingredients, targets and diseases. “Parameters information” page introduces each ADME-related properties with the criteria for screening. All the data in TCMSP can be freely downloaded at the “Download” page.
Drug discovery and drug combination
ADME evaluations of drugs are critical procedures in drug discovery and development . Unfavorable pharmacokinetics properties were the primary causes of costly late-stage failures in drug development . To estimate the possibility of converting a compound into a drug, the TCMSP database incorporated a series of key ADME-related properties including compound OB, DL, FASA-, Caco-2 permeability, BBB, HL and Lipinski’s rule of five. This database can easily screen out the molecules which obey these rules or other customized thresholds. For example, in our case study of Licorice, 69 bioactive compounds of licorice were obtained by ADME screening with the criteria OB ≥ 40% and DL ≥ 0.18.
Investigate mechanisms of action of herbal medicines and TCM formula
Understanding how the diverse chemical components in medicinal herbs contribute to the overall pharmacological effect is a major challenge for current studies. TCMSP provides information on the ability of herbs to overcome biological barriers and their associated drug targets. The key techniques in the TCMSP platform have been successfully applied in the previous work to explore the mechanisms of action of herbal medicines and TCM formula in the treatment of cardiovascular diseases and virus diseases [2, 7, 34, 35]. For instance, with this model, two representative herbs Lonicera japonica and Fructus Forsythiae were analyzed regarding their pharmacological effect on influenza, inflammation and other diseases. Janus-function of these chemical compounds in both herbs was uncovered: directly inhibiting virus replications and simultaneously promoting host immune response . With the help of TCMSP, researches could uncover the mechanism of pharmacological action of herbal medicines more comprehensively.
Uncover the nature of TCM theory
The selection of those compound formula, or fufang, is based on the holistic philosophy of traditional Chinese medicine and follows traditional TCM theory, including the holistic philosophy, qi-enriching and blood-tonifying natures or the rule of “Jun–Chen–Zuo–Shi”, known as the Four Responsible Roles. However, the molecular basis of these basic theory and the mechanisms of action are still a mystery. Our previous research shows that systems pharmacology-based study of TCM may open up the possibility to understand the TCM theory in the context of a molecular network. For example, we have applied systems pharmacology to dissect the rule of drug combination for TCM , which is exemplified by Ma huang Decoction (also known as Ephedra Decoction, MHD). For the first time, by this methodology, we have revealed the chemical features of the qi-enriching and blood-tonifying compounds, and have uncovered the targets, leading to the deep understanding of the nature of qi-blood theory .
Licorice is one of the oldest and most popular herbal medicines in the world. It has been broadly used in traditional Chinese medicine as a cough reliever, anti-inflammatory, anti-anabrosis, immunomodulatory, anti-platelet, antiviral (hepatitis) and detoxifying agent. However, due to its extreme complexity in both chemical components and mechanisms of action, deep understanding of licorice is still difficult. This case will show us how to use TCMSP for screening active ingredients, identifying drug targets and diseases. Here we will introduce the process and results of this study briefly (Figure 2), detailed information about the biological basis of pharmacology of licorice can be reached at our previous work .
We retrieved 280 known licorice compounds from TCMSP. The ADME screening was applied with the criteria OB ≥ 40% and DL ≥ 0.18. Under these criteria, 69 ingredients in licorice are identified as active substances. These compounds are then mapped to the Compound-Target network and Target-Disease network. The networks can be downloaded as a XGMML file from TCMSP, or imported into Cytoscape software and analyzed with the NetworkAnalyzer plugin. We calculated two key topological parameters, degree and betweenness, to specify the importance of a node (a compound or a target) and how this node influences the communication between two nodes.
Finally, we obtained 91 targets related with different diseases, which are critical for understanding the pharmacological mechanisms of licorice. The generated drug–target network suggests that glyasperins C, licoagrocarpin, glycyrrhizic acid and the target proteins PTPN1, HRH1, F2 with high degree or betweenness are the key components playing important roles in the drug–target interactions. For instance, compounds liquiritin and licochalcone G can destroy bacteria by targeting the metalloelastase and strengthen the tissue macrophages to defense against external invasions. Additionally, details of utilizing systems pharmacology methods in TCM can be referred to our previous work .
The particular strengths of TCMSP are the composition of the large number of herbal entries with ADME properties, and the ability to identify drug-target networks and drug-disease networks, which will reveal the mechanisms of action of Chinese herbs, uncover nature of TCM theory and develop new herbal-oriented drugs. In the future version, more medicinal and pharmacological data will be added, such as the drug action mode: stimulation and inhibition, drug combination for various diseases etc. Particularly, we are planning to implement the physiologically based pharmacokinetics (PBPK) method to provide a more realistic description of the behavior of the substance in various tissues and organs.
Traditional Chinese medicine
Absorption, distribution, metabolism, and excretion
Blood brain barrier
Physiologically based pharmacokinetics
Cheung F: TCM: made in China. Nature. 2011, 480: S82-S83. 10.1038/480S82a.
Li X, Xu X, Wang J, Yu H, Wang X, Yang H, Xu H, Tang S, Li Y, Yang L, Huang L, Wang Y, Yang S: A System-Level Investigation into the Mechanisms of Chinese Traditional Medicine: compound Danshen Formula for Cardiovascular Disease Treatment. PLoS One. 2012, 7: e43918-10.1371/journal.pone.0043918.
Kuruvilla FG, Shamji AF, Sternson SM, Hergenrother PJ, Schreiber SL: Dissecting glucose signalling with diversity-oriented synthesis and small-molecule microarrays. Nature. 2002, 416: 653-657. 10.1038/416653a.
Tao W, Xu X, Wang X, Li B, Wang Y, Li Y, Yang L: Network pharmacology-based prediction of the active ingredients and potential targets of Chinese herbal Radix Curcumae formula for application to cardiovascular disease. J Ethnopharmacol. 2013, 145: 1-10. 10.1016/j.jep.2012.09.051.
Liu J, Pei M, Zheng C, Li Y, Wang Y, Lu A, Yang L: A Systems-Pharmacology Analysis of Herbal Medicines Used in Health Improvement Treatment: predicting Potential New Drugs and Targets. Evid Based Complement Alternat Med. 2013, 2013: 1-17.
Liu H, Wang J, Zhou W, Wang Y, Yang L: Systems approaches and polypharmacology for drug discovery from herbal medicines: an example using licorice. J Ethnopharmacol. 2013, 146: 773-793. 10.1016/j.jep.2013.02.004.
Li P, Chen J, Wang J, Zhou W, Wang X, Li B, Tao W, Wang W, Wang Y, Yang L: Systems pharmacology strategies for drug discovery and combination with applications to cardiovascular diseases. J Ethnopharmacol. 2014, 151: 93-107. 10.1016/j.jep.2013.07.001.
Huang C, Zheng C, Li Y, Wang Y, Lu A, Yang L: Systems pharmacology in drug discovery and therapeutic insight for herbal medicines. Brief Bioinform. 2013, in press
Ma T, Tan C, Zhang H, Wang M, Ding W, Li S: Bridging the gap between traditional Chinese medicine and systems biology: the connection of Cold Syndrome and NEI network. Mol Biosyst. 2010, 6: 613-10.1039/b914024g.
Li S, Zhang B, Zhang N: Network target for screening synergistic drug combinations with application to traditional Chinese medicine. BMC Syst Biol. 2011, 5: S10-
Wang X, Xu X, Tao W, Li Y, Wang Y, Yang L: A Systems Biology Approach to Uncovering Pharmacological Synergy in Herbal Medicines with Applications to Cardiovascular Disease. Evid Based Complement Alternat Med. 2012, 2012: 1-15.
Chen X, Zhou H, Liu YB, Wang JF, Li H, Ung CY, Han LY, Cao ZW, Chen YZ: Database of traditional Chinese medicine and its application to studies of mechanism and to prescription validation. Br J Pharmacol. 2006, 149: 1092-1103.
Chen CY-C: TCM Database@Taiwan: The World’s Largest Traditional Chinese Medicine Database for Drug Screening In Silico. PLoS One. 2011, 6: e15939-10.1371/journal.pone.0015939.
Ehrman TM, Barlow DJ, Hylands PJ: Phytochemical Databases of Chinese Herbal Constituents and Bioactive Plant Compounds with Known Target Specificities. J Chem Inf Model. 2007, 47: 254-263. 10.1021/ci600288m.
Ye H, Ye L, Kang H, Zhang D, Tao L, Tang K, Liu X, Zhu R, Liu Q, Chen YZ, Li Y, Cao Z: HIT: linking herbal active ingredients to targets. Nucleic Acids Res. 2010, 39: D1055-D1059.
Xue R, Fang Z, Zhang M, Yi Z, Wen C, Shi T: TCMID: Traditional Chinese Medicine integrative database for herb molecular mechanism analysis. Nucleic Acids Res. 2013, 41: D1089-D1095. 10.1093/nar/gks1100.
Gu J, Gui Y, Chen L, Yuan G, Xu X: CVDHD: a cardiovascular disease herbal database for drug discovery and network pharmacology. J Cheminformatics. 2013, 5: 51-10.1186/1758-2946-5-51.
Li Q, Cheng T, Wang Y, Bryant SH: PubChem as a public resource for drug discovery. Drug Discov Today. 2010, 15: 1052-1057. 10.1016/j.drudis.2010.10.003.
Gaulton A, Bellis LJ, Bento AP, Chambers J, Davies M, Hersey A, Light Y, McGlinchey S, Michalovich D, Al-Lazikani B, Overington JP: ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res. 2011, 40: D1100-D1107.
Pence HE, Williams A: ChemSpider: An Online Chemical Information Resource. J Chem Educ. 2010, 87: 1123-1124. 10.1021/ed100697w.
Zhang L, Liu T, Wang X, Wang J, Li G, Li Y, Yang L, Wang Y: Insight into the binding mode and the structural features of the pyrimidine derivatives as human A2A adenosine receptor antagonists. Biosystems. 2014, 115: 13-22.
O’Boyle NM, Banck M, James CA, Morley C, Vandermeersch T, Hutchison GR: Open Babel: An open chemical toolbox. J Cheminformatics. 2011, 3: 33-10.1186/1758-2946-3-33.
Xu X, Zhang W, Huang C, Li Y, Yu H, Wang Y, Duan J, Ling Y: A Novel Chemometric Method for the Prediction of Human Oral Bioavailability. Int J Mol Sci. 2012, 13: 6964-6982. 10.3390/ijms13066964.
Shen M, Tian S, Li Y, Li Q, Xu X, Wang J, Hou T: Drug-likeness analysis of traditional Chinese medicines: 1. property distributions of drug-like compounds, non-drug-like compounds and natural compounds from traditional Chinese medicines. J Cheminformatics. 2012, 4: 1-13. 10.1186/1758-2946-4-1.
CA L: Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev. 1997, 23: 3-25. 10.1016/S0169-409X(96)00423-1.
Knox C, Law V, Jewison T, Liu P, Ly S, Frolkis A, Pon A, Banco K, Mak C, Neveu V, Djoumbou Y, Eisner R, Guo AC, Wishart DS: DrugBank 3.0: a comprehensive resource for “omics” research on drugs. Nucleic Acids Res. 2011, 39: D1035-D1041. 10.1093/nar/gkq1126.
Yu H, Chen J, Xu X, Li Y, Zhao H, Fang Y, Li X, Zhou W, Wang W, Wang Y: A Systematic Prediction of Multiple Drug-Target Interactions from Chemical, Genomic, and Pharmacological Data. PLoS One. 2012, 7: e37608-10.1371/journal.pone.0037608.
Zhu F, Shi Z, Qin C, Tao L, Liu X, Xu F, Zhang L, Song Y, Liu X, Zhang J, Han B, Zhang P, Chen Y: Therapeutic target database update 2012: a resource for facilitating target-oriented drug discovery. Nucleic Acids Res. 2011, 40: D1128-D1136.
Lopes CT, Franz M, Kazi F, Donaldson SL, Morris Q, Bader GD: Cytoscape Web: an interactive web-based network browser. Bioinformatics. 2010, 26: 2347-2348. 10.1093/bioinformatics/btq430.
Assenov Y, Ramirez F, Schelhorn S-E, Lengauer T, Albrecht M: Computing topological parameters of biological networks. Bioinformatics. 2007, 24: 282-284.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Res. 2003, 13: 2498-2504. 10.1101/gr.1239303.
Su X, Kong L, Lei X, Hu L, Ye M, Zou H: Biological Fingerprinting Analysis of Traditional Chinese Medicines with Targeting ADME/Tox Property for Screening of Bioactive Compounds by Chromatographic and MS Methods. Mini-Rev Med Chem. 2007, 7: 87-98. 10.2174/138955707779317830.
Van de Waterbeemd H, Gifford E: ADMET in silico modelling: towards prediction paradise?. Nat Rev Drug Discov. 2003, 2: 192-204. 10.1038/nrd1032.
Li Y, Han C, Wang J, Xiao W, Wang Z, Zhang J, Yang Y, Zhang S, Ai C: Investigation into the mechanism of Eucommia ulmoides Oliv. based on a systems pharmacology approach. J Ethnopharmacol. 2014, 151: 452-460. 10.1016/j.jep.2013.10.067.
Wang X, Xu X, Li Y, Li X, Tao W, Li B, Wang Y, Yang L: Systems pharmacology uncovers Janus functions of botanical drugs: activation of host defense system and inhibition of influenza virus replication. Integr Biol. 2013, 5: 351-10.1039/c2ib20204b.
Yao Y, Zhang X, Wang Z, Zheng C, Li P, Huang C, Tao W, Xiao W, Wang Y, Huang L, Yang L: Deciphering the combination principles of Traditional Chinese Medicine from a systems pharmacology perspective based on Ma-huang Decoction. J Ethnopharmacol. 2013, 150: 619-638. 10.1016/j.jep.2013.09.018.
Funding: The research is supported by the Fund of Northwest A & F University and is financially supported by the National Natural Science Foundation of China (Grant No. 31170796 and 81373892).
All the authors declare that they have no competing interests.
YHW and LY conceived the study. JLR, PL and JNW constructed the database and drafted the manuscript. WZ performed the ADME properties calculation with the help of CH and ZHG. BHL designed and wrote the user manual of the database and the description of the website. PDL and JLR designed and developed the website. WYT, YFY, XX and YL participated in dataset collecting and processing. All authors read and agreed to the final manuscript.