- Software
- Open Access
- Published:
Probing the chemical–biological relationship space with the Drug Target Explorer
Journal of Cheminformatics volume 10, Article number: 41 (2018)
Abstract
Modern phenotypic high-throughput screens (HTS) present several challenges including identifying the target(s) that mediate the effect seen in the screen, characterizing ‘hits’ with a polypharmacologic target profile, and contextualizing screen data within the large space of drugs and screening models. To address these challenges, we developed the Drug–Target Explorer. This tool allows users to query molecules within a database of experimentally-derived and curated compound-target interactions to identify structurally similar molecules and their targets. It enables network-based visualizations of the compound-target interaction space, and incorporates comparisons to publicly-available in vitro HTS datasets. Furthermore, users can identify molecules using a query target or set of targets. The Drug Target Explorer is a multifunctional platform for exploring chemical space as it relates to biological targets, and may be useful at several steps along the drug development pipeline including target discovery, structure–activity relationship, and lead compound identification studies.
Background
In the modern drug discovery and development process, high-throughput screens (HTS) of drugs have become a common and important step in the identification of novel treatments for disease. In the past decade, studies describing or citing high throughput drug screening are increasingly prevalent, topping 1000 per year for the past 5 years (Fig. 1) and span many disease domains such as cancer, neurodegenerative disease, and cardiopulmonary diseases [1,2,3]. These screens are often phenotypic in nature whereby a large panel of compounds of known, presumed known, and/or unknown mechanisms of action are tested in a biological model of interest and generate phenotypic readouts such as apoptosis or proliferation. While these types of screens facilitate the rapid identification of biologically active drugs or chemical probes, they also present several challenges.
One prevailing challenge is the identification of the specific biological mechanisms within a cell that determine the response in a screen. The search for novel drugs constantly pushes the pharmaceutical researchers to include novel chemical sets in phenotypic screens, with the caveat that the underlying mechanism of action (MoA) of a particular compound cannot usually be gleaned from the phenotypic screens [4]. Most of the time, identifying the MoA requires additional experimentation, particularly if the molecule represents a novel or understudied chemical entity. Another challenge is that the polypharmacologic nature of many small molecules can make it difficult to interpret HTS results as a given drug may affect multiple targets with a range of efficacy. This, in turn, presents the difficulty of consolidating multiple targets into a unified biological mechanism or set of mechanisms leading to poorly annotated targets, misunderstood MoAs [5], and unknown or ambiguous off-targets with potential deadly side effects [6, 7]. A final challenge is that identification of related molecules and their targets is not always straightforward; in the context of HTS analysis, structurally and functionally related molecules that are not contained in a screening library might be useful to explore.
Multiple tools and databases have attempted to address various aspects of the challenges outlined above (see Table 1). These tools allow the user to explore known polypharmacology of small molecules. Many also allow users to explore compound-target relationships by querying either by molecule or by target: DGIdb, DT-Web, BindingDB, Probes and Drugs, CarlsbadOne, Polypharmacology Browser, STITCH, and SuperTarget allow users to identify MoAs/targets of a given compound by evaluating a query drug [8,9,10,11,12,13,14,15], while DT-Web, BindingDB, Polypharmacology Browser, and STITCH allow users to search by chemical similarity using any query molecule (Table 1). Probe Miner, alternatively, is designed primarily to handle target-based queries [16]. All tools listed in Table 1 allow users to identify molecules with known polypharmacology, but only three, STITCH, SuperTarget, and Probes and Drugs, provide the ability to summarize these targets into biological pathways/mechanisms using a gene list enrichment approach [12, 13, 15]. The final challenge—identifying structurally or functionally related molecules—is addressed by DT-Web, BindingDB, Probes and Drugs, CarlsbadOne, Polypharmacology Browser, and STITCH [9,10,11,12, 14, 15].
While several of the tools listed address one or more of these challenges, there are some gaps (Table 1). For example, ChEMBLSpace does not have a web interface and therefore requires installation on a compatible system before use [17]. In addition, not all of these tools are open-source (STITCH, SuperTarget, BindingDB, Probes and Drugs, CarlsbadOne). An easy to modify open-source application could enable users to create features that are helpful for their specific analyses. While most tools allow both drug-based and target-based queries, none appear to facilitate queries for molecules that affect several targets, which may be useful for users who want to leverage polypharmacology by employing drugs that inhibit multiple biological mechanisms. While multiple targets can be queried at one time in STITCH, it is not straightforward to identify single molecules that affect all query targets. In addition, DGIdb and ChEMBLSpace cannot be used to explore similar chemical space to the query molecule. These two, plus SuperTarget and CarlsbadOne, cannot be queried using molecules that are not in the database; a feature that might help users with novel preclinical candidate drugs. With the exception of DT-Web, STITCH and CarlsbadOne, these tools do not allow visualization of drug–target networks, which may help users address the challenge of identifying structurally or functionally related drugs. No tools other than STITCH and Probes & Drugs perform gene list enrichment, which may help users interpret the biological MoAs of polypharmacologic molecules. Finally, these tools do not allow users to evaluate query drugs in the context of publically available high throughput drug screening datasets.
To address these gaps, we developed the Drug–Target Explorer. Specifically, the Drug–Target Explorer enables the user to [1] look up targets for individual molecules and groups of molecules, [2] explore networks of targets and drugs, [3] perform gene list enrichment of targets to assess target pathways of compounds, [4] compare query molecules to cancer cell line screening datasets, and [5] discover bioactive molecules using a query target and exploration of these networks. We anticipate that the users will include biologists and chemists involved in drug discovery who are interested in performing hypothesis generation of human targets for novel molecules, identifying off-targets for bioactive small molecules of interest, and exploring of the polypharmacologic nature of small molecules.
Implementation
To build the database of known compound-target interactions, we aggregated five data sources containing qualitative and quantitative interactions (Fig. 2). We defined a target of a compound as any human protein with a measurable change in activity when directly exposed to that compound, or any human protein qualitatively reported to be directly modulated by that compound in the source datasets. We considered qualitative interactions to be curated compound-target associations with no associated numeric value. These associations are curated and evaluated by experts and are thus a source of high-confidence drug–target information. Quantitative interactions were defined as compound-target information with a numeric value indicating potency of compound-target binding or functional changes. Qualitative compound-target associations were retrieved from the DrugBank 5.1.0 XML database, the DGIdb v3.0.2 interactions.tsv file, and ChemicalProbes.org (acc. July 7 2018) [8, 18, 19]. pChEMBL, IC50, C50, EC50, AC50, Ki, Kd, and potency values for Homo sapiens targets were retrieved from the ChEMBL v24.1 MySQL database [20]. Kd values were also obtained from Klaeger et al. 2017, in which the authors determined the Kd of 244 kinase inhibitors against 343 kinases [21]. For all quantitative and qualitative data sources, compound structural information (SMILES) was retrieved when available. When not available, it was batch annotated using the Pubchem Identifier Exchange Service, or, in some cases, manually annotated via PubChem and ChemSpider search [22, 23].
Process for developing the Drug–Target Explorer. Molecule–target and chemical structure data were collected from public sources. In the case of DGIdb, chemical structures were assigned using the PubChem Chemical Identifier Exchange, manually assigned using ChemSpider and PubChem, or mapped to ChEMBL structures by ChEMBL ID. Chemical structures from all source databases were standardized, aggregated, and assigned internal Drug–Target Explorer identifiers. Qualitative and quantitative data were summarized by calculating several summary statistics, and these data were stored together with the internal identifiers to form the Drug–Target Explorer database
To consolidate data for identical molecules within and across multiple databases, structural strings (SMILES) were standardized using the standardize_smiles function from the MolVS v0.1.1 Python package [24]. Each standardized structure and all external IDs associated with that structure were then assigned an internal identifier, so that groups of molecules with identical structures were assigned to the same internal ID to permit integration of the different datasets. All datasets were combined and summaries were generated for each compound-target comparison using functions from the R ‘tidyverse’ [25]. The number of unique molecules, targets, and molecule–target associations after structural standardization of each source database is described in Table 2.
In addition, chemical fingerprints were generated using the R interface (rcdk) to the Java Chemical Development Kit (CDK) [26,27,28]. The package was modified to use the latest version of the CDK (2.1.1), which enables perception of chiral centers, enabling differentiation between isomeric molecules in circular fingerprints. Circular (functional connectivity fingerprint (FCFP6)-like), MACCS, and extended fingerprints were generated.
The summary metrics described in Table 3 were calculated. One of these metrics, pChEMBL, is used to convey the potency of a given molecule. It is calculated from one of several semi-comparable values in the ChEMBL database, and is defined as the negative log 10 molar of the IC50, XC50, EC50, AC50, Ki, Kd, or potency [20]. Therefore, pChEMBL permits a rough comparison of these values. For example, a pChEMBL value of 7 would indicate that there is a measurable effect on a given target in the presence of 100 nM of molecule. To harmonize the data from Klaeger et al. with ChEMBL data, the Kd values were converted to pChEMBLs. The mean pChEMBL was calculated for every molecule–target combination, as well as the number of quantitative and qualitative associations found in the source databases.
We devised a known selectivity index (KSI) for each molecule–target interaction:
where KSIdt is the known selectivity index for a given molecule–target association, pChEMBLdt is the mean pChEMBL for a specific molecule–target association between drug d and target t, and ΣpChEMBLdt is the sum of all target (T) pChEMBLs for drug d.
To quantify confidence in each of the interactions, we calculated a confidence score (c):
where cab is the confidence of an interaction between molecule a and target b, nab is the number of quantitative measurements for an interaction between molecule a and target b, lab is the number of qualitative associations found between molecule a and target b, µab is the mean number of associations across all molecule–target associations, and σall is the standard deviation of all molecule–target associations; subtracting the mean and dividing by the standard deviation converts the confidence score to a z-score. A larger confidence score indicates greater confidence in the relationship between molecule a and target b.
This resulted in a database containing 3650 human targets (represented by HUGO gene symbols), 304,790 small molecules, and 507,059 molecule–target associations summarized from 673,439 quantitative interactions and 18,918 qualitative interactions. Finally, this database as well as fingerprints and chemical aliases for each molecule were saved as R binary files and stored on Synapse. All of the data, as well as snapshots of the source databases used to build the Drug Target Explorer database (with the exception of DrugBank, which requires a license to access) are accessible at www.synapse.org/dtexplorer. The Drug–Target Database is licensed under CC BY-SA 4.0.
We developed a Shiny application to permit exploration of the database [29, 30]. For chemical queries, users can search for molecules in the database by one of three methods: from a list of aliases obtained from the source databases, retrieving the chemical structure using the ‘webchem’ interface to the Chemical Identifier Resolver, or by directly inputting the SMILES string [31]. A Tanimoto similarity threshold allows the user to narrow or widen the chemical space of the results. After querying, the input molecule is converted to a fingerprint and it’s similarity calculated relative to all molecules in the database, using ‘extended’ fingerprints. The user then can view the resulting set of molecules as well as the molecule–target relationships in interactive tables and graphs (Fig. 3). In addition, the user can remove or include molecules on an a-la-carte basis, view the 2D structural representation of the input molecule, and perform target list enrichment analysis [32, 33]. Furthermore, the query molecule can be compared against molecules in the CTRP and Sanger cancer cell line drug-screening datasets to identify identical or similar structures in these datasets, and compare the relationship between chemical structure and correlations in drug response.
For target queries, users can input one or more query HUGO gene(s) and identify molecules that are reported to bind those targets, and view these data in an interactive table. Users can also view these drugs in an interactive graph format to view their association with the query target and their other targets. The Drug–Target Explorer is available at www.synapse.org/dtexplorer. The source code for the Drug–Target Explorer app is available at https://github.com/Sage-Bionetworks/polypharmacology-db. The source code is licensed under Apache 2.0.
Results
The Drug Target Explorer was designed to facilitate the following use-cases: hypothesis generation of targets for newly-discovered molecules, identification of off-targets for existing bioactive research molecules, and exploration of polypharmacology, and identification of molecules for known targets. Below, we include vignettes highlighting how the Drug–Target Explorer can facilitate analysis in these areas.
Hypothesis generation of targets for newly-discovered molecules
To highlight the use of this app to find potential off-targets of a novel molecule, we queried the Drug–Target Explorer for C21, a recently-published Polo like kinase (PLK) inhibitor that is not captured in our database [34]. This molecule inhibits Plk2 and Plk1 in the low nM range, and Plk3 in the low uM range [34]. Starting at a similarity of 1, we decreased the similarity cutoff until we identified the most similar molecule in the database at a similarity of 0.74 (CHEMBL3609309), a BRD4-binding molecule. We then continued to decrease the similarity threshold to identify other molecules and targets in the database. At a similarity cutoff of 0.65 we identified 15 molecules (Fig. 4a, Additional file 1: Supplemental Table 1). PLK1 and PLK2 are known targets of several of these molecules, such as BI 2536 and volasertib. Curiously, CAMKK2, BRD4, BRDT, PLK3, PDXK, and PTK2 are also targeted by molecules in this chemical set, with pChEMBL values > 6–8. A plausible hypothesis could be that these targets are affected by this family of molecules, including the query molecule, in the 10–1000 nM range, which would indicate that further research is needed to determine the selectivity of C21 or other structurally related molecules.
Molecule–target networks highlight targets within chemical families. Using the novel Plk inhibitor C21 as a query with a Tanimoto cutoff of 0.65 (SMILES: CCNC(=O)C1=CC2=C(C=C1)N(C=C2)C1=NC=C2N(C)C(=O)[C@@H](CC)N(C3CCCC3)C2=N1), we identify 15 related molecules (blue vertices), and observe several targets (green vertices) common to multiple members of this group of structurally related molecules, including PLK1, PLK2, BRD4, CAMKK2, PTK2, and PDXK
Identification of targets for existing bioactive research molecules
This app may also be useful in identifying alternate targets of existing molecules in a preclinical or exploratory research setting. In order to confidently interrogate the role of cellular targets, one must use compounds with specificity for those targets. A well-known example of a non-specific inhibitor is imatinib. This molecule, developed for use in the treatment of chronic myelogenous leukemia, was initially considered a selective inhibitor of Abl [35]. More recently, several other targets have been identified for imatinib such as KIT, PDGFRA, and PDFGRB [36]. Querying the Drug Target Explorer indicates that there is evidence for 59 targets of imatinib, several of which have pChEMBL values within a reasonable range of Abl, PDGFRA, and PDFGRB (Additional file 2: Supplemental Table 2). These targets must all be considered when evaluating imatinib in human model systems.
A more recent example is the tool compound G-5555, a selective PAK1 inhibitor [37]. This compound has been used to demonstrate the role of PAK1 in cellular processes such as invasion [38]. A search of the Drug–Target Explorer database showed that this molecule not only binds PAK1 (mean pChEMBL = 7.79, Table 4), but there is qualitative evidence for effects on PAK2/3, and quantitative evidence suggesting an effect on SIK2, MAP4K5, and PAK2 at similar concentrations of G-5555 (mean pChEMBLs 8.05, 8, and 7.69 respectively, Table 4). G-5555 also may have an effect on STK family proteins (STK24, STK25, STK26) and LCK. Therefore, any findings with G-5555 with regards to PAK1 inhibition must be validated with other selective inhibitors or genetic approaches, as Jeannott and colleagues did (using other PAK inhibitors such as FRAX597 and FRAX1036, as well as PAK1 silencing RNA), to confirm that the effects observed are PAK1 specific [38].
Exploration of polypharmacology and related molecules in biological data
In order to provide biological context, this app allows the user to aggregate multiple targets from compounds into functional categories. Using the previous example of G-5555, we performed enrichment analysis on the list of targets to identify potential biological pathways that this molecule may disrupt. In doing so, we observed that G-5555 targets are enriched in several Gene Ontology terms and KEGG Pathways like T cell receptor signaling (p = 1.62e-6), Ras/MAPK signaling (p = 3.13e-4), and Golgi-localized proteins (p = 2.86e-2), among several others (Additional file 3: Supplemental Table 3). The app also allows the user to compare the query molecule to drugs in the Cancer Cell Line/CTRP and GDSC/Sanger cell line screening datasets. Specifically, the app identifies the most similar molecule (Tanimoto similarity) available in these datasets and uses that molecule as a reference to calculate the Spearman correlation of drug response across all drugs within the dataset. For example, a search for ABT-737, a BCL family inhibitor, (Fig. 5a) tests the Spearman correlation of AUCs in all cell lines treated with ABT-737 to the AUCs across all cell lines for all other drugs in the CCLE dataset. The app plots these correlations with respect to the Tanimoto similarity of each CCLE molecule to the input molecule. The plot shows several molecules with highly and significantly correlated biological activity to ABT-737 (such as other BCL family inhibitors, navitoclax, ABT-199, and combinations with navitoclax) that appear to be more chemically similar to ABT-737 than the majority of the CCLE-tested drugs. Interestingly, this also reveals that SZ4TA2, a BCL-XL inhibitor, while structurally related to ABT-737, does not have a correlated biological response in the CCLE dataset.
Exploration of similar molecules in biological datasets. a A search for ABT-737 identifies similarly-acting BCL family inhibitors in the CCLE dataset, including ABT-199, navitoclax, and combinations of navitoclax with other drugs. b A search for 2′-deoxycytidine identifies the most structurally similar molecule in the CCLE dataset (cytarabine HCl) and identifies several molecules with highly and significantly correlated drug responses, including clofarabine, gemcitabine, and decitabine
This approach can also be used to explore the landscape of chemically similar drugs even when the query drug is not in the drug screening datasets. For example, in Fig. 5b, a query for 2′-deoxycytidine, which is not in the CCLE dataset, identifies cytarabine hydrochloride as the most structurally similar molecule in the CCLE screening set. Using this molecule as a reference, the app calculates the drug response Spearman correlation with the other drugs in the dataset and identified several other molecules that are structurally similar to the input molecule. For example, gemcitabine is quite structurally similar to the input molecule (2′-deoxycytidine) (> 0.8) and has a highly correlated drug response to the reference molecule (cytarabine HCl) (> 0.75). We can also find examples of drugs that are structurally distinct but functionally similar (clofarabine) as well as drugs that are functionally distinct but structurally similar (zebularine).
Identification of molecules for known targets
Finally, the tool allows users to perform a reverse search, i.e. identify molecules that have an association with a query target or targets and assess the known selectivity of these molecules. For example, Petrilli et al. identified LIM domain kinases as targets of interest in tumors caused by the genetic disease neurofibromatosis type 2 (NF2) [39]. They found that pharmacologic (LIMK1/2 inhibitor BMS-5) and genetic modulation of LIMK1 and LIMK2 caused cell-cycle inhibition and reduced viability in merlin (Nf2) deficient Schwann cells [39]. In the context of follow-up studies, it may be beneficial to test alternate molecules that target LIMK1/2 with greater potency than BMS-5. We used the Drug–Target Explorer to find molecules that target LIMK1 and LIMK2 (Additional file 4: Supplemental Table 4, Fig. 6a). For example, BMS-5 (CHEMBL2141887 in the Drug–Target Explorer) has mean pChEMBLs of 7.33 and 7.07 for LIMK1 and LIMK2 respectively, while CHEMBL3623442 has pChEMBLs of 9 and 8.52 for LIMK1 and LIMK2 respectively. Another interesting possibility is the identification of multiple molecules with overlapping desired targets and non-overlapping off-targets to reduce off-target effects, or to identify single-target, multi-drug combinations as outlined by Fitzgerald et al. 2006 [40]. Using the above scenario with LIMK1/2, it may be possible to use structurally distinct molecules in combination or in sequence, like CHEMBL3356433 and others that do not have identical known off-targets (Fig. 6a) to reduce off-target effects while targeting LIMK1/2. The opposite approach could also be taken by finding a single molecule that binds multiple desired targets. In the case of merlin-deficient cells, focal adhesion kinases (FAKs) such as PTK2 (FAK2) and PTK2B, as well as Aurora kinase A (AURKA) have been highlighted as potential targets of interest [39, 41, 42]. Using the Drug–Target Explorer, we can identify molecules that target LIMK1/2, PTK2/2B, and AURKA (Additional file 5: Supplemental Table 5, Fig. 6b). Using this information, a rational hypothesis might be that CYC116 or danusertib could be effective and selective for NF2-deficient tumor cells; to our knowledge, the use of these molecules in this setting has yet not been explored.
Target-based queries identify molecules that target a gene of interest. a A gene-based query for two targets (green vertices), LIMK1 and LIMK2, identifies 10 molecules (blue vertices), as well as other targets affected by these molecules. b A query for multiple targets relevant to tumors caused by neurofibromatosis type 2 identifies three molecules that have associations with these targets
Conclusions
In the present study, we demonstrate that the Drug–Target Explorer enables the user to look up targets for novel and known molecules such as C21, G-5555, and imatinib, as well as explore networks of these drugs and their targets. Users can perform target enrichment to consolidate multiple targets to into pathways, compare query molecules to screening datasets, and identify bioactive molecules given a query target.
Several future directions are envisioned for this application. The code and database has been designed in such a way that any database with structural information and drug–gene target information (qualitative associations, or quantitative associations that can be coerced to pChEMBL values) can be harmonized and integrated into the database. Therefore, as new datasets become available, such as the recently-published Drug–Target Commons [43], they can be integrated and released. We also envision occasional errors being identified as the database is explored and vetted by users and have included a feedback form for users to suggest new data to integrate, as well as to highlight necessary corrections to the dataset. Currently, the query molecule to full database similarity calculation is computationally intensive. One solution to speed up calculation times may be to implement a locality sensitive hashing method in future versions of the database and web app, such as the method devised by Cao et al. 2010 [44]. An additional planned feature for this app is the implementation of a bulk annotation feature to allow users to annotate HTS data with targets and/or putative targets of identical or structurally related molecules. Finally, the integration of a predictive framework for identifying targets of query drugs based on drug and target feature data would enable users to quantitatively predict targets of novel molecular entities rather than manually exploring structurally similar molecules.
The Drug–Target Explorer enables users to explore known molecule–human target relationships as they relate to chemical similarity rapidly and with minimal effort. We anticipate that users such as biologists and chemists using chemical probes or studying preclinical therapeutics will find this tool useful in several areas. Specifically, this tool may aid drug discovery efforts by accelerating hypothesis generation, simplifying the transition from phenotypic HTS results to mechanistic studies, and streamlining the identification of candidate molecules that target a protein or mechanism of interest.
Availability and requirements
-
Project name: Drug–Target Explorer
-
Project home page: http://www.synapse.org/dtexplorer
-
Operating system(s): Platform independent
-
Programming language: R
-
Other requirements: Chrome, Safari, or Firefox
-
License: Apache 2.0
References
Cao L, Weetall M, Bombard J, Qi H, Arasu T, Lennox W et al (2016) Discovery of novel small molecule inhibitors of VEGF expression in tumor cells using a cell-based high throughput screening platform. PLoS ONE 11(12):e0168366
Finan GM, Realubit R, Chung S, Lütjohann D, Wang N, Cirrito JR et al (2016) Bioactive compound screen for pharmacological enhancers of apolipoprotein e in primary human astrocytes. Cell Chem Biol 23(12):1526–1538
Kirby RJ, Divlianska DB, Whig K, Bryan N, Morfa CJ, Koo A et al (2018) Discovery of novel small-molecule inducers of heme oxygenase-1 that protect human iPSC-derived cardiomyocytes from oxidative stress. J Pharmacol Exp Ther. 364(1):87–96
Wagner BK (2016) The resurgence of phenotypic screening in drug discovery and development. Expert Opin Drug Discov 11(2):121–125
Santos R, Ursu O, Gaulton A, Bento AP, Donadi RS, Bologa CG et al (2017) A comprehensive map of molecular drug targets. Nat Rev Drug Discov. 16(1):19–34
van Esbroeck ACM, Janssen APA, Cognetta AB, Ogasawara D, Shpak G, van der Kroeg M et al (2017) Activity-based protein profiling reveals off-target proteins of the FAAH inhibitor BIA 10-2474. Science 356(6342):1084–1087
Roy M, Dumaine R, Brown AM (1996) HERG, a primary human ventricular target of the nonsedating antihistamine terfenadine. Circulation 94(4):817–823
Cotto KC, Wagner AH, Feng Y-Y, Kiwala S, Coffman AC, Spies G et al (2017) DGIdb 3.0: a redesign and expansion of the drug–gene interaction database. Nucleic Acids Res. https://doi.org/10.1093/nar/gkx1143/4634012
Alaimo S, Bonnici V, Cancemi D, Ferro A, Giugno R, Pulvirenti A (2015) DT-Web: a web-based application for drug–target interaction and drug combination prediction through domain-tuned network-based inference. BMC Syst Biol 9(Suppl 3):S4
Gilson MK, Liu T, Baitaluk M, Nicola G, Hwang L, Chong J (2016) BindingDB in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Res 44(D1):D1045–D1053
Awale M, Reymond J-L (2017) The polypharmacology browser: a web-based multi-fingerprint target prediction tool using ChEMBL bioactivity data. J Cheminform 9(1):11
Szklarczyk D, Santos A, von Mering C, Jensen LJ, Bork P, Kuhn M (2016) STITCH 5: augmenting protein-chemical interaction networks with tissue and affinity data. Nucleic Acids Res 44(D1):D380–D384
Günther S, Kuhn M, Dunkel M, Campillos M, Senger C, Petsalaki E et al (2008) SuperTarget and Matador: resources for exploring drug–target relationships. Nucleic Acids Res 36(Database issue):D919–D922
Mathias SL, Hines-Kay J, Yang JJ, Zahoransky-Kohalmi G, Bologa CG, Ursu O et al (2013) The CARLSBAD database: a confederated database of chemical bioactivities. Database 2013:bat044
Skuta C, Popr M, Muller T, Jindrich J, Kahle M, Sedlak D et al (2017) Probes and drugs portal: an interactive, open data resource for chemical biology. Nat Methods 14(8):759–760
Antolin AA, Tym JE, Komianou A, Collins I, Workman P, Al-Lazikani B (2018) Objective, quantitative, data-driven assessment of chemical probes. Cell Chem Biol 25(2):194–205.e5
Fechner N, Papadatos G, Evans D, Morphy JR, Brewerton SC, Thorner D et al (2013) ChEMBLSpace—a graphical explorer of the chemogenomic space covered by the ChEMBL database. Bioinformatics 29(4):523–524
Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A, Grant JR et al (2018) DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res 46(D1):D1074–D1082
ChemicalProbes.org [Internet]. 2018 [cited 2018 Jan 17]. www.chemicalprobes.org
Gaulton A, Hersey A, Nowotka M, Bento AP, Chambers J, Mendez D et al (2017) The ChEMBL database in 2017. Nucleic Acids Res 45(D1):D945–D954
Klaeger S, Heinzlmeir S, Wilhelm M, Polzer H, Vick B, Koenig P-A et al (2017) The target landscape of clinical kinase drugs. Science 358(6367):eaan4368
Kim S, Thiessen PA, Bolton EE, Chen J, Fu G, Gindulyte A et al (2016) PubChem substance and compound databases. Nucleic Acids Res. 44(D1):D1202–D1213
Pence HE, Williams A (2010) ChemSpider: an online chemical information resource. J Chem Educ 87(11):1123–1124
Swain M (2016) MolVS: molecule validation and standardization. https://github.com/mcs07/MolVS. Accessed 24 July 2018
Wickham H (2017) Tidyverse: easily install and load the “Tidyverse”. https://cran.r-project.org/package=tidyverse. Accessed 24 July 2018
Guha R (2007) Chemical informatics functionality in R. J Stat Softw 18(5):1–16. https://doi.org/10.18637/jss.v018.i05
Willighagen EL, Mayfield JW, Alvarsson J, Berg A, Carlsson L, Jeliazkova N et al (2017) The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching. J Cheminform 9(1):33
Guha R (2017) Fingerprint: functions to operate on binary fingerprint data. https://cran.r-project.org/package=fingerprint. Accessed 24 July 2018
Chang W, Cheng J, Allaire JJ, Xie Y, McPherson J (2017) Shiny: web application framework for R. https://cran.r-project.org/package=shiny. Accessed 9 July 2018
R Core Team (2018) R: a language and environment for statistical computing. Vienna, Austria. https://www.r-project.org/. Accessed 24 July 2018
Szöcs E (2015) {webchem}: retrieve chemical information from the web. Zenodo. https://doi.org/10.5281/zenodo.33823
Kuleshov MV, Jones MR, Rouillard AD, Fernandez NF, Duan Q, Wang Z et al (2016) Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res 44(W1):W90–W97
Jawaid W (2017) Enrichr: provides an R interface to “Enrichr”. https://cran.r-project.org/package=enrichR. Accessed 24 July 2018
Zhan M-M, Yang Y, Luo J, Zhang X-X, Xiao X, Li S et al (2018) Design, synthesis, and biological evaluation of novel highly selective polo-like kinase 2 inhibitors based on the tetrahydropteridin chemical scaffold. Eur J Med Chem 143:724–731
Druker BJ, Tamura S, Buchdunger E, Ohno S, Segal GM, Fanning S et al (1996) Effects of a selective inhibitor of the Abl tyrosine kinase on the growth of Bcr-Abl positive cells. Nat Med 2(5):561–566
Pardanani A, Tefferi A (2004) Imatinib targets other than bcr/abl and their clinical relevance in myeloid disorders. Blood 104(7):1931–1939
Ndubaku CO, Crawford JJ, Drobnick J, Aliagas I, Campbell D, Dong P et al (2015) Design of selective PAK1 inhibitor G-5555: improving properties by employing an unorthodox low-p K a polar moiety. ACS Med Chem Lett 6(12):1241–1246
Jeannot P, Nowosad A, Perchey RT, Callot C, Bennana E, Katsube T et al (2017) p27Kip1 promotes invadopodia turnover and invasion through the regulation of the PAK1/Cortactin pathway. eLife 6:e22207
Petrilli A, Copik A, Posadas M, Chang L-S, Welling DB, Giovannini M et al (2014) LIM domain kinases as potential therapeutic targets for neurofibromatosis type 2. Oncogene 33(27):3571–3582
Fitzgerald JB, Schoeberl B, Nielsen UB, Sorger PK (2006) Systems biology and combination therapy in the quest for clinical efficacy. Nat Chem Biol 2(9):458–466
Poulikakos PI, Xiao G-H, Gallagher R, Jablonski S, Jhanwar SC, Testa JR (2006) Re-expression of the tumor suppressor NF2/merlin inhibits invasiveness in mesothelioma cells and negatively regulates FAK. Oncogene 25(44):5960–5968
Shapiro IM, Kolev VN, Vidal CM, Kadariya Y, Ring JE, Wright Q et al (2014) Merlin deficiency predicts FAK inhibitor sensitivity: a synthetic lethal relationship. Sci Transl Med 6(237):237ra68
Tang J, Tanoli Z-U-R, Ravikumar B, Alam Z, Rebane A, Vähä-Koskela M et al (2018) Drug target commons: a community effort to build a consensus knowledge base for Drug–Target interactions. Cell Chem Biol 25(2):224–229.e2
Cao Y, Jiang T, Girke T (2010) Accelerated similarity searching and clustering of large compound sets by geometric embedding and locality sensitive hashing. Bioinformatics 26(7):953–959
Authors’ contributions
RA, SLR, JG and SG developed the concept and features of the app. RA developed the database and application. JG and SG supervised the development of the app. RA, SLR, JG and SG wrote the manuscript. All authors read and approved the final manuscript.
Acknowledgements
We thank Darcy Bates, Stephanie Bouley, Heidi Chapman, Jennifer Ditano, William Feng, John Hinds, David Mallick, and Nicholas Warren for alpha testing and feedback.
Competing interests
The authors declare that they have no competing interests.
Availability
The app, source data, and Drug–Target Explorer data are available at www.synapse.org/dtexplorer. The data are licensed under CC-BY 4.0. The latest release of the Drug–Target Explorer database can be acquired without registration in the RData binary format from the polypharmacology-db GitHub or at https://doi.org/10.5281/zenodo.1324549. The latest source code is available at https://github.com/Sage-Bionetworks/polypharmacology-db and is licensed under Apache 2.0.
Funding
This work was funded by the Children’s Tumor Foundation.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Author information
Authors and Affiliations
Corresponding author
Additional files
Additional file 1: Supplemental Table 1.
Targets of C21-like compounds in the Drug–Target Explorer Database. Related to Fig. 2.
Additional file 2: Supplemental Table 2.
Targets of imatinib in the Drug–Target Explorer Database. Related to Table 2.
Additional file 4: Supplemental Table 4.
Molecules targeting LIMK1/2. The database was queried for molecules that may modulate LIMK1 and LIMK2; this analysis revealed a large set of putative tool compounds. Related to Fig. 2.
Additional file 5: Supplemental Table 5.
Identification of multi-kinase-targeting molecules for NF2. A query of the database for molecules that target several kinases of interest in NF2 (AURKA, LIMK1/2, PTK2/2B) identified 3 polypharmacologic compounds. Related to Fig. 2.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Allaway, R.J., La Rosa, S., Guinney, J. et al. Probing the chemical–biological relationship space with the Drug Target Explorer. J Cheminform 10, 41 (2018). https://doi.org/10.1186/s13321-018-0297-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13321-018-0297-4
Keywords
- Drug targets
- Polypharmacology
- Webapp
- Phenotypic drug screen
- Compound-target network