Data driven polypharmacological drug design for lung cancer: analyses for targeting ALK, MET, and EGFR

Narayanan, Dilip; Gani, Osman A. B. S. M.; Gruber, Franz X. E.; Engh, Richard A.

doi:10.1186/s13321-017-0229-8

Research article
Open access
Published: 04 July 2017

Data driven polypharmacological drug design for lung cancer: analyses for targeting ALK, MET, and EGFR

Dilip Narayanan¹^na1,
Osman A. B. S. M. Gani¹^na1,
Franz X. E. Gruber¹ &
…
Richard A. Engh¹

Journal of Cheminformatics volume 9, Article number: 43 (2017) Cite this article

3996 Accesses
4 Citations
3 Altmetric
Metrics details

Abstract

Drug design of protein kinase inhibitors is now greatly enabled by thousands of publicly available X-ray structures, extensive ligand binding data, and optimized scaffolds coming off patent. The extensive data begin to enable design against a spectrum of targets (polypharmacology); however, the data also reveal heterogeneities of structure, subtleties of chemical interactions, and apparent inconsistencies between diverse data types. As a result, incorporation of all relevant data requires expert choices to combine computational and informatics methods, along with human insight. Here we consider polypharmacological targeting of protein kinases ALK, MET, and EGFR (and its drug resistant mutant T790M) in non small cell lung cancer as an example. Both EGFR and ALK represent sources of primary oncogenic lesions, while drug resistance arises from MET amplification and EGFR mutation. A drug which inhibits these targets will expand relevant patient populations and forestall drug resistance. Crizotinib co-targets ALK and MET. Analysis of the crystal structures reveals few shared interaction types, highlighting proton-arene and key CH–O hydrogen bonding interactions. These are not typically encoded into molecular mechanics force fields. Cheminformatics analyses of binding data show EGFR to be dissimilar to ALK and MET, but its structure shows how it may be co-targeted with the addition of a covalent trap. This suggests a strategy for the design of a focussed chemical library based on a pan-kinome scaffold. Tests of model compounds show these to be compatible with the goal of ALK, MET, and EGFR polypharmacology.

Background

The importance and proven druggability of protein kinases as targets in cancer [1, 2], inflammation [3], and other disease areas have transformed antikinase drug discovery into an information driven research area of unprecedented scale [4]. Public and proprietary databases contain binding data for hundreds of thousands of active compounds [5]. Crystal structures are publicly available for some 3000 protein kinase inhibitor complexes in the Protein Database (www.rcsb.org) [6]. This data begins to enable “polypharmacological” targeting of multiple kinases [7,8,9], which may more effectively modify network behavior [10], or forestall drug resistance [11, 12], or provide broader applicability against heterogeneous cancers or larger patient groups. Such approaches [13] may involve “retargeting” via modification of known compounds, or simply “repurposing” known compounds to new applications when target profiles are suitable. Practical approaches to polypharmacological design include both experimental and computational methods [14,15,16,17,18]. There is however no single strategic approach to modify such starting compounds to achieve the final selectivity profile; this depends on the availability, identification and understanding of the essential selectivity determinants of the relevant targets, as we examine with example of this paper.

Non-small-cell lung cancer (NSCLC) represents a collection of diverse molecular pathologies. Most types are relatively insensitive to chemotherapy, but the identification of genomic abnormalities in subpopulations of NSCLC patients [19,20,21,22] have led to the development of protein kinase inhibitors against EGFR [23, 24] (gefitinib, 2003; erlotinib, 2004; afatinib, 2013) and ALK (crizotinib, 2011; ceritinib, 2014; alectinib, 2015), see Fig. 1. These inhibitors specifically target either EGFR or ALK, but not both; cross-reactive inhibitors are under investigation however [25]. Analogous to the results of imatinib and ABL inhibition as therapy for CML, treatment with gefitinib and erlotinib is associated with acquired resistance [26]. Unlike ABL inhibition, resistance development to EGFR inhibitors seems universal. The most common resistance mechanism is a secondary mutation of the gatekeeper residue (for EGFR, predominantly T790M); afatinib appears less [27] but not completely [28] insensitive to this mutation. Additional mechanisms of acquired resistance include the amplification of MET [29], HGF [30], or HER2 [31]. The universal appearance of drug resistance via diverse mechanisms following EGFR inhibition therapy has generated widespread interest in polypharmacological or combinatorial treatment strategies [32,33,34]. In this paper we examine the potential of polypharmacological targeting EGFR, ALK, and MET.

As is typical for protein kinase inhibitors, compounds known to inhibit ALK, MET, and EGFR most potently bind at the ATP site, where they are anchored to the interdomain hinge via one or more hydrogen bonds [35]. Together, they represent a small subset of the known scaffolds for hinge/ATP site binders [36], which are already a restricted set [37] compared to proposed extents of possible scaffold diversity [38]. The EGFR inhibitor lapatinib was one of the first nearly monospecific kinase inhibitors [39]; others may have much broader selectivity profiles [40]. Knowledge of the determinants of selectivity for specific protein kinases or subfamilies [41, 42] assists target specific or polypharmacological drug design. These include the “gatekeeper” residue at the hinge, and infrequent occurrences of unique residues, such as glycine [43] or cysteine [44, 45]. The ongoing expansion of public [40, 46,47,48,49] and proprietary [50] target-ligand binding data begins to enable “machine learning” prediction of target inhibition profiles [51,52,53,54,55,56]. Such methods, similar to the more common structure based methods [57, 58], generally do not allow precise (e.g. binding constant error <10-fold) prediction of binding properties of individual compound-target interactions, but they do provide guidance to focus efforts on compound classes or libraries with the best chances for success [59,60,61,62,63,64].

In addition to the currently approved drugs and known inhibitors, many crystal structures are available to support drug design against NSCLC targets [6]. For EGFR, ALK, and MET, truncated kinase domain structures have been determined many times, uncomplexed, in complexes with ATP, ATP analogs, and inhibitors, including mutants and truncation variants. These structures show considerable diversity of active and inactive conformations [11, 13], and increasingly enable sophisticated drug design approaches. The target EGFR has become the primary example for targeting cysteine residues for irreversible inhibition [44, 45, 65] and, along with ABL, for the design of broadened selectivity profiles to overcome or forestall drug resistance [66]. A growing catalog of resistance mutants appearing in ALK [67,68,69] and MET also belong to the set of targets to be considered in general polypharmacological targeting strategies against NSCLC [70,71,72,73,74,75,76,77,78].

In this paper, we examine combinations of structure, binding, and target validation data to suggest a strategy for polypharmacological targeting of ALK, MET, and EGFR. We use cheminformatics methods to analyse the similarities of the targets. Inhibitor binding data shows a high degree of correlation between ALK and MET with respect to inhibition profiles, but also an essential dissimilarity with EGFR. The drug resistant mutant EGFR-T790M is intermediate between the two groups. We compare crystal structures of ALK and MET, considering especially the cross-inhibitory compound crizotinib, in an attempt to identify the structural origins of the similarities. Despite the dissimilarity of EGFR, the availability of a cysteine residue in the ATP pocket provides an orthogonal approach for polypharmacological optimization: the ALK-MET similarity may be exploited for optimization, while the addition of a covalent trap to inhibitors may add effective EGFR inhibition to the profile. Finally, we test compounds synthesized with these properties to verify the approach.

Results and discussion

Cheminformatics show similarities of ALK and MET, dissimilarity of EGFR, and intermediate similarity of EGFR mutant T790M

Several measures are available to evaluate the similarities of kinase drug targets [15, 46, 79,80,81,82], including sequence, structure, and inhibitor properties. For drug discovery purposes, experimental binding data regarding cross-reactivity of inhibitors may be the most relevant, although this data may be generated in different ways, with diversity arising from choice of binding or activity assays, conditions, target protein form, etc. Significant discrepancies between assay formats are to be expected [80, 83, 84] which are more critically reflected in disparities between in vitro measurement conditions and their applicability in vivo. Based on single concentration measurement data from Ambit BioSciences, including estimated IC50 values for the binding of >20,000 compounds to 300–400 protein kinases, researchers at Bristol-Myers Squibb evaluated inhibitor selectivity profiles with an “activity homology” (AH) score [80] (see “Methods”). By this measure, ALK and MET are the most similar of the kinases considered in this manuscript, while EGFR is distinctly different (Fig. 2). Of the 400 protein kinases in the test set, some 80 kinases are more similar to ALK than MET, including the gatekeeper mutant M1250T of MET (approximately at position 50 of the 400 kinases). About 35% of the potent ALK inhibitors are shared with the MET inhibitor set (and about 43% with MET-M1250T). Less than 5% of the ALK inhibitors bind potently to EGFR and its mutants, with the notable exception of the double mutant EGFR-(L858R, T790M). This combination of the cancer primary mutation L858R and drug resistance gatekeeper mutation T790M is potently inhibited by 30% of the ALK inhibitors. (Figure 2 shows ALK4 as similar to EGFR, sensitive to over 50% of the EGFR inhibitors. This kinase “Activin-Like receptor Kinase” belongs to the Tyrosine Kinase Like (TKL) subfamily of the kinome, and is not related to ALK “Anaplastic Lymphoma Kinase” studied in this work).

A related measure of similarity that also uses inhibitor binding data is the correlations of inhibitor binding profiles between pairs of kinases. Highly correlated targets share similar sensitivities to changes in the inhibitors. Unlike the “activity homology” described above, correlation compares the pattern of variation of inhibition strengths, and not the absolute values. Thus, two kinases may have highly correlated inhibition profiles even if the inhibition pattern is significantly weaker for one kinase. This may occur, for example, if the overall shapes of the inhibitor binding sites are similar, but one of the kinases may lack one important binding feature. For drug polypharmacology design purposes, it may be advantageous to enhance recognition of correlated sensitivities to ligand variation. Using the binding data of the Ambit study of 72 inhibitors and their interactions with 442 kinases [40], correlation analysis highlights the similarity of ALK and MET, and the dissimilarity of EGFR (Fig. 3). The inhibitor set of the study shows a large number of protein kinases, widely distributed across the kinome, with moderate similarity to ALK. The kinases with the most correlated inhibition profiles are, like ALK, tyrosine kinases, and include the closely related LTK, INSR, IRR and IGF1R kinases, but also FAK, PYK2, FER, FES, MER, and AXL. MET is only moderately correlated, and EGFR has low correlation. Indeed, very few kinases are correlated with EGFR; of the test panel, only HER4 and HER2 are strongly correlated, while HER3, a few TKs, and the less related RIPK2 and GAK kinases show moderate correlation similarity.

A third measure of similarity is principle component analysis [85, 86] of multiple target-multiple inhibitor binding matrices (see “Methods”). Applied to the 2011 Ambit study [40], the protein kinase targets form a broad cluster, extended along the dimension of the first principal component (Fig. 4). Considering the PCA axes to represent “pseudo-inhibitors” as described above, there is a roughly Gaussian distribution for a majority of kinases around a value representing weak to moderate binding to “pseudo-inhibitor 1”, with some kinases in a skewed distribution toward tight binding. In order to maximize the variance in the first coordinate, the PCA method has constructed a “pseudo-inhibitor” that combines tight binding for the largest number of kinases possible (the skewed distribution). This favors the selection of targets which may be inhibited tightly by many inhibitors in the dataset, i.e. “generic” targets with high propensity for inhibition. ALK, MET, and EGFR are near the middle of the distribution in PC coordinate #1. The second PCA dimension similarly spreads the kinases into a moderately inhibited cluster, skewed toward a smaller set of kinases that are tightly inhibited; here,this includes EGFR and several of its variants, but not the two T790M mutants. The third PC dimension clearly separates EGFR and all of its mutants from the rest of the kinases, including ALK and MET. Thus, the activity homology (AH) data, the distributions of inhibitor correlation data on kinome plots, and PCA analysis of the inhibition data all show the statistical similarity of ALK and MET, the dissimilarity of EGFR, and an intermediate similarity for EGFR-T790M. The PCA analysis also identifies the inhibitors responsible for the clustering of EGFR and mutants away from the other kinases (see discussion below).

Heterogeneity of crystal structures obscures polypharmacology prediction

Because the structures of the target proteins determine the binding strengths of the ligands, it is natural to expect crystal structures to reveal target similarities and to aid the formulation of polypharmacological targeting strategies. Accordingly, considering the previous section, the ALK and MET ATP binding sites would be expected to appear similar to each other than to EGFR, with the EGFR-T790M mutants somewhere in between. However, examination of the crystal structures available for these targets reveals more the difficulties of structure based prediction of comparative binding strengths than mechanisms for doing so. This is due in large part to the structural flexibility and plasticity of proteins, leading to signficant ligand induced structural changes, but also derives from structural distributions that are affected by the crystallization constructs, conditions and crystal packing arrangements. Considering all structures available for the targets illustrates this.

The ALK structures are most homogeneous set; they superimpose with an average Cα RMS of 0.11Å, have the same essential crystal packing arrangement and, with one exception (4FNY), share an active “DFG-in” conformation of the activation loop (with the DFG phenylalanine in its hydrophobic spine position [87]). The activation loop (A-loop) adopts a unique conformation, however: The A-loop phosphorylation site Tyr1278 is anchored below the “C-helix” on the side opposite to the ATP pocket, analogous to active TK structures first observed for insulin receptor kinase [88, 89], but in ALK with a unique alpha-helical secondary structure. The exceptional ALK DFG-out conformation structure shares the crystal packing arrangement of the other structures, but at a lower symmetry, with the asymmetric unit comprising what was a crystallographic symmetry related pair in the other structures.

The greatest number of structures is available for EGFR (more than 100 PDB entries when including disease-related and other mutations). The largest group of these structures share an active conformation, whereby pairs of monomers related by a crystallographic three-fold symmetry operation (space group I23) form an “asymmetric dimer” that represents a structural model of the active form [90]. Other EGFR structures show variations in C-helix positions (in = active, out = inactive); one set of structures (e.g. 2JIU) has an asymmetric unit consisting of a dimer with an apparently active geometry. There are no observations of a “DFG-out” geometry among the EGFR structures. However, there are two clusters of conformations: one with the usual active DFG-in conformation that places the DFG Phe into the hydrophobic spine, and another conformation with altered main chain angles and position of the activation loop.

The MET structures show the greatest conformational diversity. Of the three kinases studied here, MET has crystallized in the largest number of space groups, and the N- and C-lobe “open-closed” variations are largest. The DFG states observed include DFG-in, DFG-out, and intermediate states. The C-helix is seen in both “in” and “out” geometries. The activation loop conformations are highly varied, including many that could not be resolved in the crystal structures. Many of the variations also involve inhibitor interactions, and one common inhibitor binding surface is quite conserved as aromatic, but is seen formed variously by three different residues with four or more distinct geometries.

Do crizotinib co-crystal structures explain cross-reactivity and reveal ALK and MET similarities?

Even if flexibility prevents reliable prediction, it seems reasonable to expect that the structures of cross-reactive inhibitors in their different targets would identify the basis for the cross-reactivity. This would enable structure based design of e.g. an inhibitor library focussed on the likelihood of cross-reactivity. For ALK and MET cross reactivity, the low nanomolar inhibition of both ALK and MET by crizotinib is likely the clearest and best known measure of similarity between the two targets [91] Besides ALK (3 nM) and MET (2 nM), crizotinib also shows single digit nM binding (KD) to protein kinases ROS1 (4 nM), MER (4 nM), EPHB6 (6 nM), and AXL (8 nM) [40], depending of course on assay conditions. Crystal structures of crizotinib in protein kinases in the PDB comprise 2WGJ (c-MET KD), 2XP2 (ALK-KD), 2YFX (L1196M), 3ZBF (ROS1), 4ANQ (G1269A), 4ANS (L1196M, G1269A), and 4C9W (NUDT1). (The S-stereoisomer of crizotinib is co-crystallized with NUDT1 also). Superposition of co-crystal structures of crizotinib with ALK (2XP2 [92]) and MET (2WGJ [91]) reveals more how the binding energies that correspond to the highly selective and nanomolar ALK and MET co-inhibition depend on interactions that are not readily identified with standard structural biology or bioinformatic methods (Fig. 5). Interacting side chains differ at many key sites, including: the residues that sandwich the adenine binding site (MET vs. Leu from the C-lobe, and Leu vs. Ile from the glycine-rich loop), the gatekeeper+2 (the site two residues C-terminal to the gatekeeper residue) aromatic/hydrophobic side chain at the hinge (Tyr vs Leu), and a key pi–pi stacking interaction with the activation loop phosphorylation site tyrosine that is observed only in the MET structure. Amino acid type specific crizotinib interactions that are shared between ALK and MET comprise the gatekeeper (Leu), the C-terminal ATP site anchor of the glycine-rich loop (Val), an alanine residue two positions N-terminal to the active site lysine, and a pyrazole–proton interaction at a gatekeeper+6 glycine residue. The importance of the gatekeeper and pyrazole-glycine interactions are highlighted by the occurrence of resistance via mutation at these sites [93]. While the shared interactions are consistent with the observed co-inhibition, these residues are highly conserved in the kinome, so that these interactions are not predictive of the selectivity of the co-inhibition. Other shared but non-residue specific interactions include the anchoring hydrogen bonds at the hinge, and a key interaction with a main chain carbonyl group. The latter forms a CH–O hydrogen bond from an aromatic ring hydrogen that has been polarized by the fluorine substituent at the adjacent site on the ring [92]. The pi–pi stacking interaction with Tyr1230 appears important for MET inhibition, but no comparable interaction is seen in the ALK structure. This difference was proposed to account in part for the tighter binding of crizotinib to MET vs ALK, along with differences of the backbone peptide orientation at G1269 (ALK) and A1221 (MET) [92]. Crizotinib binding to the MET mutant Y1230C is weakened 15-fold in a cellular assay [94], supporting the view that the pi stacking interaction in important in vivo. Taken together, these details indicate that the crystal structures would not have allowed the prediction of high affinity cross-reactivity, but that prior knowledge of the cross-reactivity enables the study of the structures to identify the key candidate interactions responsible for it.

Do the crystal structures in general reveal polypharmacological potential?

The importance of the CH–O, CH-arene, and arene–arene interactions for the total binding energetics (and, ultimately, therapeutic properties) of crizotinib is not entirely clear, but they highlight the scoring and searching dilemma that prevents in silico methods to predict ligand binding properties from target structures: The level of theory and concomitant CPU power required for computation of such binding features to provide effective scoring prevents their prediction a priori if binding geometries are unknown. (For example, the strength of the pyrazole–proton interaction described above depends on the electron-richness of the pyrazole and on energetic penalties of rotation away from the energy minimum, phenomena requiring quantum mechanics level calculations for evaluation [92]). Comparisons across all PDB structures provide some measure of the range of structural variability, but do not enable the calculation of binding energies, while inhibitor binding studies provide averaged binding energies averaged over structural distributions in the assay environmental conditions. In the PDB, there are currently 51 MET and 36 ALK structures; of these, only 3 MET structures are of the phosphorylated protein. For crizotinib, both MET and ALK were nonphosphorylated forms of truncated kinase domains, and the co-crystal structures showed inactive geometries. (MET has significant residual activity when unphosphorylated, and is activated 160-fold upon autophosphorylation [94]). For receptor TKs, phosphorylation often occurs autocatalytically in trans as a consquence of ligand binding to extracellular domains and oligomerization [95]. The mechanism of activation of tyrosine kinases by phosphorylation of the activation loop is usually considered to involve destabilization of possibly rigid inactive states of the kinase.

Oncogenic activation via mutation or fusion often disrupts inactivating geometries. Partially as a result of this, and partially due to their inherent plasticity, protein kinase cancer drug target structures are highly flexible; prioritization for in silico drug discovery purposes may be difficult [96]. Superposition of ALK and MET structures (Fig. 6) from the PDB illustrate this. The MET structures show great diversity in conformations of the activation loop, and the tyrosine that is involved in pi–pi stacking interactions with crizotinib (Tyr1035) is found distributed across the entire space accessible to the activation loop (Fig. 6a). The structures for ALK are more homogeneous, and cluster into one major group and one minor group. Most ALK structures show a helical conformation for the activation loop following the DFG sequence, anchored to the C-helix by two arginine residues flanking the conserved glutamic acid of the C-helix, and by hydrophobic and aromatic interactions involving Tyr1278 (Fig. 6a). For the exceptional geometry, the activation loop retains a helical conformation and salt bridge interactions with helix C, but the Tyr1278 anchoring is lost, and the helix is rotated away from the C-helix. Prediction of ATP site binding will usually depend critically on the choice of the “correct” target structure (Fig. 6b), but if the “correct” structure is induced by inhibitor, it will obviously not be available for a priori searches. Statistical analysis of the protein kinases currently in the PDB provides an excellent illustration of the structural diversity currently observed to date [97]. PLS analysis [98] to identify the geometric measures that most strongly differentiate active and inactive geometries [97], and cluster crystal structures accordingly, shows how the structures for ALK, MET, and EGFR are distributed between apparent activity states. The MET structures are mostly inactive and broadly distributed, the ALK structures are mostly active (or close to it), and the EGFR structures are both active and inactive (Fig. 6c). For ALK, the position on the horizontal axis (with the coordinate definition dominated by DFG related geometries) does not clearly mark them as active. However, the vertical axis and its inclusion of helix C position parameters, coupled with “nearly active” DFG geometries, clusters ALK together with the active group.

A closer look at the structural diversity reveals several interesting phenomena. One is the clustering of aromatic side chains at the ATP pocket (Fig. 7). Many of the MET structures show nearly identical positions for Tyr1230, similar to the geometry seen in the crizotinib complex. These are compatible with DFG-in geometries. Standard DFG-out geometries do not allow Tyr1230 to occupy that space, but replace it nearly exactly with Phe1223 of DFG, in place for inhibitor packing interactions. As a third alternative, this space may be occupied by the aromatic Phe1089 from the glycine-rich loop, represented by a small cluster of three structures in this superposition. There is also a unique positions for the DFG and glycine-rich loop positions. Inhibitor types are associated with the clusters of arenes at this site (Fig. 7b). Interactions with the DFG Phe in the DFG-out configurations are dominated by single aromatic rings from Type II inhibitors that occupy the deep pocket (vacated by the DFG-out Phe). Interactions with Tyr1230 of the activation loop involve a small variety of arenes: several halogenated or nitrated phenyl groups and a larger number of fused 5- and 6-membered heteroatomic aromatic ring systems, mostly in the same space. (One structure is displaced (3ZZE), but pi interactions may be maintained via resonance across nitrogen and amide bond linkages). One structure is unique: the complex with ARQ197 (PDBID: 3RHK [99]) shows a fused three-ring system sandwiched between Phe1089 and Phe1223 in unique positions. Finally, significant structural variation may be observed within a single crystal structure. The two monomers of the MET kinase domain in a crystal structure of a complex with a pyrimidone containing type II inhibitor (PDBID: 3EFJ [100]) shows that group to interact with the protein via pi–pi stacking interactions, whereby the interacting partner is the DFG Phe1223 for one monomer, but is the glycine-rich loop Phe1089 for the other (Fig. 7c).

Study of the variabilities shown by the crystal structures reveal physiologically relevant properties of the individual proteins and ligands studied, but their interpretation with respect to therapeutic properties requires much more experimental information and careful analyses of the differing environments in crystallo and in vivo. For protein kinases, crystal structures are commonly truncated single domain proteins with specific phosphorylation states, and the structures of flexible elements such as the activation loop may be determined by crystal packing interactions. In contrast, the disease targets are usually larger, multidomain proteins, often in larger assemblies, and with heterogeneities of chemical modifications and cellular environments. Highly potent inhibitors can compete with many of these effects, such that co-crystal structures generally reveal the key target-inhibitor interactions faithfully. But the crystal structures may also capture both the protein and ligand in states that are unimportant for therapeutic properties.

One uncertainty for protein kinase co-crystallography is the activation state of the enzyme. Crizotinib binding in MET described above involves pi–pi stacking with Tyr1230. However, the three activated MET structures in the PDB, phosphorylated on Tyr1235, show an activation loop structure with Tyr1230 far removed from the ATP binding pocket. The apparent structural homogeneity of ALK is also deceptive. The oncologically relevant forms of ALK are most commonly fusion proteins [76, 101] that remove membrane attachment and render ALK constitutively active. Dimer- or oligomerization is thereby essential for cell transformation, but the details of the structures and mechanism of activation are not known [76]. Mutations that confer resistance to crizotinib [102] include several that may destabilize the intramolecular A-loop helix packing that is apparently part of the inactivation mechanism for ALK [103]. The helix packs most prominently against the C-helix, with an Arg + Glu + Arg triplet, likely to stabilize a helical conformation, slotting into a space between two Glu residues extending from adjacent turns of the C-helix (of these, one is the usual partner to the active site lysine of active protein kinase structures. In addition, the phosphorylation site Tyr1278, which is adjacent to the anchoring Arg at the terminus of the activation loop helix, contributes to anchoring the helix via an edge-face pi–pi stacking interaction with Tyr1096.T his residue is found in a sequence N-terminal to the ALK kinase domain. The proximity of this anchor to the fusion position for the activation fusions with e.g. EML4- or NPM-suggest that the fusion may activate the protein by weakening the inactivating AL-helix interactions. For MET as a target in NSCLC, it is the wt protein which is of greatest relevance, although NSCLC related MET mutations have also been observed [104].

Summarizing the structures analysed here, we have seen most significantly: (1) that the diversity of activation forms of ALK, MET, and EGFR show how crystal structures cluster according to successful crystallization conditions, which is difficult to relate to the distribution of structures that are most relevant physiologically or biochemically (for in vitro binding measurements), (2) that knowledge of the cross-reactivity of crizotinib is a prerequisite for identifying key binding interactions, due to the divergence of sequences at the binding site, and (3) that these are most likely special interactions of arene groups and polarized bonds which would be overlooked by simplified molecular mechanics methods that are developed for rapid in silico approaches.

How can cheminformatics inform crystallography?

It is clear that structures need to be interpreted with respect to binding data. Inhibition and/or binding data [47, 49] (including variation of ATP concentration and by single site mutation) are available from a variety of protein forms and assay formats [40, 82, 91, 92, 105,106,107,108,109,110,111,112,113,114,115,116]. Crizotinib binds to an inactive conformation of MET [94]. These data show insensitivity to phosphorylation in Abl, expected for type I inhibitors [117]). It is however affected by resistance mutations, whereby the T315I mutant of Abl is most tightly bound (ca 10 nM), and the gatekeeper+2 mutations F317I or F317L are most weakly inhibited (3000 and 600 nM, respectively). This sensitivity is interesting for considerations of ALK and MET binding, as described above. It should be readily appreciated that prediction of the ALK-MET cross-reactivity by automated bioinformatics or structural analysis methods seems highly unlikely. Whether more general machine-learning approaches could do so, presumably by identifying underlying and subtly interlinked selectivity determinants without recourse to model assumptions, remains to be seen. In any case, it will not be a competition between cheminformatics and crystallography, but will involve the integration of structural data into informatics techniques.

Many statistical questions simply identify correlations, which may be recognized even with relatively sparse datasets. These may then generate hypotheses for more detailed study. The principal component analysis of the Ambit kinase and inhibitor panel of 2011 [40] generated transformed coordinates that indicated unique clustering for both EGFR and its drug resistant mutant T790M (Figs. 4, 8). The coordinate transformation highlights the inhibitors especially responsible for the unique position of EGFR. Early profiling data already indicated that EGFR could be targeted with unique selectivity [39]. The PCA transform of the 2011 data, and in particular the 2nd and 3rd dimensions, identifies inhibitors with particularly notable EGFR inhibition properties. PC dimension #2 (corresponding to a pseudo-inhibitor, see “Methods”), which separates EGFR T790M variants from the other EGFR mutants, also “bundles” selectivity determinants into the corresponding pseudo-inhibitor that have broad applicability to the rest of the kinome. One of these is most obviously the occurrence of methionine as gatekeeper, and the PCA plot shows the enrichment of protein kinase targets with this gatekeeper in the direction of the displacement of EGFR T790M variants relative to the remaining EGFR cluster. PC dimension #3, with less total variance than PC #2, has its clearest source of variance with the separation of all the EGFR targets. The “loading plot” for PC #2 and #3 (Fig. 8b) shows the inhibitors mostly responsible for the identification of these properties, and include highly selective EGFR inhibitors such as HKI-272, BIBW-2992, etc. (upper quadrants of Fig. 8b), and also antiselective inhibitors such as sorafenib (lower quadrants). Similarly, inhibitors that bind the native EGFR sequence preferentially (right-most quadrants, e.g. dasatinib) or the T790M mutant (left quadrants, e.g. staurosporine) are identified by PC dimension #2.

Focussing libraries toward ALK + MET + EGFR polypharmacological inhibition

The analyses of this work reveal statistical target similarities between Alk and Met, along with a fundamental dissimilarity of EGFR, and an intermediate position for the drug resistant EGFR-T790M mutant. They also show however how distributions of structural variations, the importance of subtle chemical interactions, and mismatches between the systems used to generate experimental binding data prevent direct design of an inhibitor with the desired polypharmacological selectivity profile. As a consequence, the design goal is to create a library of test compounds with the greatest likelihood of having the target properties. For ALK, MET, and EGFR, the challenge is to achieve cross-reactivity despite the dissimilarity of EGFR. (The dissimilarity is shown statistically, as in Fig. 2, which also shows the existence of some overlap between potent inhibitor sets for ALK or MET and EGFR. One example inhibitor is brigatinib [116]). The solution is simple: use the known cross-reactivity of ALK and MET, which we may consider to be based on the “shape” of the respective ATP pockets, and add covalent trapping groups at sites with a good potential for reaction with the cysteine at the gatekeeper+7 site in EGFR.

Although the unusual nature of EGFR-inhibitor interactions revealed by the cheminformatics above does not depend on its cysteine at the gatekeeper+7 position, this cysteine does provide an ideal mechanism [45] for a high affinity interaction type that is essentially decoupled or “orthogonal” to shape-based similarity that characterizes ALK and MET. Thus, the detailed strategy to focus a library for ALK + MET + EGFR polypharmacological inhibition is first to identify compound classes that provide ALK + MET co-inhibition, and then to select scaffolds from these that in addition allow modification to target covalent inhibition of EGFR. The strength of the EGFR interaction need not be high, but would have to satisfy geometric and dynamic requirements for covalent binding. Many examples of suitable scaffolds have been published, and superpositions of the targets and relevant inhibitors (Fig. 9) show the viability of the approach.

Crizotinib itself may be considered for this purpose, but it does not inhibit EGFR [40] (although is does bind the EGFR G719C mutant at micromolar levels). Other candidate scaffolds include for example (Fig. 10) staurosporine, bisindolylmaleimides, 4‐{2‐phenylimidazo [1,2‐a] pyrazin‐3‐yl} pyrimidine, and 3‐phenyl‐1‐(4‐{thieno[3,2‐c]pyridin‐3‐yl} phenyl) urea. Examination of the literature on candidate scaffolds and their suitability for ALK + MET + EGFR polypharmacology highlights especially the tricyclic scaffold found in the covalent EGFR inhibitor WZ4002 [118] (PDBID: 3IKA) and other derivatives known for covalent EGFR inhibition [119]. WZ4002 is known to bind both ALK [118] and MET [120], and the core dianilino-pyrimidine kinase binding scaffold also shows ALK and MET inhibition (Fig. 10) in other compounds [46]. This scaffold is well known in industry, including use as in connection with acrylamide groups for covalent binding, and a substructure search returns well over 104 compounds from patent literature. This need not hinder further research, however, because the earliest patents have expired or are due to expire soon (for example, methoxylated forms of (2,4) dianilino 5-chloropyrimidine were patented with a priority date of 1995 [121]).

To test the suitability of dianilino-pyrimidine kinase binding scaffolds for ALK + MET + EGFR polypharmacology, we profiled three compounds representing the basic scaffold (including chlorine as the gatekeeper interacting atom: 5-chloro-N2,N4-diphenylpyrimidine-2,4-diamine), with additional single acrylamide functional groups as substituents on each of the candidate aromatic rings. Acrylamide substitution on the N4 phenyl group (at the meta position) places the covalent trap analogous to its position in WZ4002. Acrylamide substitution on the N2 phenyl moiety (also at the meta position) places the covalent trap at a position potentially suitable for a covalent trap to the gatekeeper+7 site in EGFR; varying the linker to the acrylamide function allows for uncertainties regarding optimal geometries and protein plasticity (Fig. 11; Additional file 1).

Tests of the compounds confirm suitability for ALK + MET + EGFR polypharmacology optimization (Table 1). For the N2 phenyl ring substituted compounds 2a ^{Footnote 1} and 2b, ^{Footnote 2} K_d values as measured by KdELECT assays (DiscoverX) show submicromolar binding for ALK, MET and both tested forms of EGFR. For compound 1, ^{Footnote 3} with the acrylamide function at the site corresponding to that of WZ4002, both ALK and MET binding were inhibited more weakly compared to EGFR binding. Retesting the compounds under scanKINETIC assay conditions for the two EGFR targets showed K_d values that were considerably tighter than in the KdELECT assays, approximately 2-fold for compound 2a, and 4-7-fold for compounds 1 and 2b. The sensitivity to the assay conditions seen for both forms of EGFR and uniformly tighter binding under the new assay conditions may indicate tighter binding for ALK and MET under these altered assay conditions as well.

Table 1 Alk, Met, and EGFR binding properties of the test tricyclic compounds

Full size table

The kinetics testing results (Table 1) support several further important conclusions. First, comparison of the the apparent binding constants comparing 1-h and 6-h incubation times show that compounds 1 and 2a are relatively slow-binding, while binding of 2b is complete within 1 h. Second, comparison of the apparent binding constants after 30-fold dilution following one-hour of incubation times showed slow dissociation behavior for compounds 1 and 2a, and complete dissociation of 2b within 5 h. The simultaneous appearance of slow association and slow dissociation complicates the interpretation of the results, but the percentage values shown in Table 1 show the apparent residual amounts of compounds relative to bound values at 1 h. These values indicate retention of roughly half of compound 1, and nearly all of compound 2a, after dilution. Covalent binding is the most obvious interpretation of these data. In contrast, compound 2b follows fast association and dissociation kinetics, with no evidence of covalent binding. The similarity of compound 2a to 2b and 1 with respect to structure and binding strength, differing only in the linker to the acrylamide binding group, is further evidence that that it is the covalent binding property that determines the difference in binding kinetics, rather than e.g. a slow conformational change of the target enzymes.

The variations in properties of compounds 1, 2a and 2b indicate diversity of their potential application as basic scaffolds for generating libraries suitable for polypharmacological targeting of Alk, Met, and EGFR. Compound 1, with analog WZ4002 known to bind covalently to EGFR, has an intrinsic selectivity for EGFR, but still may be suitable for Alk-Met-EGFR polypharmacology, with suitable decoration. Compounds 2a and 2b, on the other hand, show greater affinity for Alk and Met, and good affinities for both EGFR variants. Both 2a and 2b are thus good candidates for optimization. However, because 2a and apparently not 2b is able to bind covalently to EGFR (both forms), the 2a scaffold seems most likely to provide good chances for polypharmacological optimization, as substituents may be chosen to optimize Alk and Met binding, using their intrinsically higher similarities, while maintaining EGFR binding via the covalent trap (providing high potency without the need for precise shape matching).

Conclusions

The chain of reasoning of this work began with the identification of a set of relevant key targets for a disease, here non small cell lung cancer (NSCLC), and narrowing the selection based on the extent of available data on compounds and targets. Targets validated with approved inhibitors include EGFR, its drug resistant gatekeeper mutant of T190M, as well as ALK and MET. Mutations or fusions of both EGFR and ALK may be primary causes of cancer, whereas both the EGFR gatekeeper mutant T190M and MET may generate drug resistance. Analysis of the relevant inhibitor binding data identified similarities between ALK and MET, and highlighted the essential dissimilarity of EGFR, with an intermediate position for the T790M mutant. A result of this analysis is the choice of a strategy for polypharmacology to optimize ALK and MET binding via shape and surface complementarity, while maintaining covalent binding to EGFR. A suitable inhibitor scaffold was chosen, based solely on binding data. An accompanying analysis of the published crystal structures for the targets, aiming to guide optimization, highlighted mostly the intractability of mapping the inhibitor binding similarities to structural properites. This is due both to great structural variability of the targets and to the subtle nature of the essential interactions, which generally are not recognizable by molecular mechanics methods. However, the structures did highlight key details, such as the unusual binding interactions of crizotinib that are conserved between ALK and MET, despite differences in sequence, and also the binding mode of the dianilino-pyrimidine inhibitor scaffold. Thus, crystallography informs cheminformatics. This polypharmacology targeting example, while highlighting specific characteristics of ALK, MET, and EGFR, may be generalizable to other kinase target profiles in several ways. The flexibility and large numbes of kinases means that similar intractabilities of structural analysis will occur, but also that other combinations of diverse selectivity mechanisms may be found for appropriate targeting. Automation of such approaches seem highly unlikely until these mechanisms may be catalogued in some machine-analysable form. Other non-kinase polypharmacology approaches will likely be quite different, possibly involving target profiles more diverse, more rigid, or otherwise distinct from protein kinases.

Methods

Activity homology data were taken from Posy et al. [80], with the data for Alk, Met, EGFR and EGFR-L858R, T790M plotted with Excel (Fig. 2a), and as a heat map (Fig. 2b) with tree clustering performed by the Clustermap function of the Seaborn python package (DOI: 10.5281/zenodo.45133). This measure is defined as the percentage of potent inhibitors (those with an estimated IC50 < 150 nM) of a reference kinase “A” that are also potent inhibitors of the comparison kinase “B”. As defined, this is “the prior probability that a compound will be active for kinase B given that it is active for kinase A” [80]. (Note that this defines a matrix which is not symmetric with respect to swapping the reference and compared kinases, because each will have a unique set of potent inhibitors.)

Kinome profiles of target similarity (Fig. 3) were evaluated using the data of Davis et al. [40], with the nM inhibition values x_i converted into logarithms to be proportional to binding energies for inhibition values tighter than the upper measurement limit of 10 micromolar, and arbitrarily set uniformly to 5 (corresponding to 100 micromolar) for values above the upper measurement limit. These values were used to calculate target similarity as Pearson’s correlation coefficient for the pairs of vectors of binding energy equivalents and plotted as disks at the kinase positions of the kinome plot of Manning et al. [122]. Hues and radii reflect the correlation values.

Principal components (Figs. 4, 8) were determined using Karhunen–Loeve Decomposition as implemented in Mathematica (version 10.3) with the binding energy equivalent values. This method of data analysis can be applied to an t · n matrix T _ij that represents how the t targets are inhibited by n inhibitors (the targets {i} are thus t positions in an n-dimensional space of inhibitors {j}). The analysis determines the orthonormal linear transformation of the coordinate system that diagonalizes the t · t covariance or correlation matrices of the targets with respect to their inhibition profiles. This transformation into a new n dimensional space of virtual or pseudo-inhibitors {k} is done (usually with the {T_i,1, T_i,2,…} vectors normalized to unit variance and zero average) such that the variance is maximal for k = 1 (PCA dimension 1), second highest for k = 2 (PCA dimension 2), and so on. This reduces the redundancy of similarities between inhibitors, and clusters the targets in spaces whose dimensionality may be reduced according to the degree of variance desired “to explain the data”. As the PCA transformation maximizes the spread in inhibition values by a “pseudo-inhibitor” created by the recombination of inhibitors of the dataset, inhibitors that are selective for subgroups of targets are most strongly represented in the relevant transformed coordinates, with the (mutually orthogonal) coordinates ranked (PC #1, PC #2, etc.) according to the total variance in that coordinate. The “loading plots” show the coefficients of the individual inhibitors in the “pseudo-inhibitor” principal components created by the transformation.

Superpositions of the ALK, MET, and EGFR protein kinase structures from the Protein Data Bank (PDB) (Additional file 2) were performed using PYMOL (version 1.7) and scripts that extracted protein kinase monomers from the entries. The scripts aligned the CA atoms from the gatekeeper+3 residue as hinge anchor position (1196–1199—ALK, 1158–1161—MET, 766–769 or 790–793—EGFR), along with aF helix atoms as the core of the C-lobe (1308–1324—ALK, 1262–1278—MET, 869-885 or 893–909—EGFR) residues.

SIMCA (v.13, Umetrics AB, Umeå, Sweden) was used to build a PLS regression plot [123] (Fib 6c) by taking 233 parameters (distances, angles, dihedrals etc.) from the annotated dataset of Möbitz [97] as independent variables and the designated active/inactive state of the kinase domains as the dependent variable. Thus, the PLS method transforms the dimensions of the individual structural parameters [97] into orthogonal dimensions that maximize covariance of the transformed dimensions with the activity state of the kinase. In the SIMCA implementation, a 7-fold cross validation technique prevents overfitting in the estimate of the number of significant components. The variables were scaled by unit variance. The PLS model built from this dataset has 4-components with both R2 (goodness of fit, maximum 1) and Q2 (predictive ability, maximum 1) values more than 0.9, where 1st and 2nd components account for about 80 and 6% of the variations respectively.

The three tricyclic compounds (1, 2a and 2b) were purchased from the company ARTTSynthesis (www.arttsynthesis.com). Tests for binding properties were performed by the company DiscoverX (www.discoverx.com), using a concentration of 1 µM against ALK, MET and EGFR variants L858R and L858R_T790M) using KdELECT and scanKINETIC from DiscoverX [124, 125].

Notes

N‐(3‐{[5‐chloro‐4‐(phenylamino)pyrimidin‐2‐yl]amino}phenyl)prop‐2‐enamide.
N‐[(3‐{[5‐chloro‐4‐(phenylamino)pyrimidin‐2‐yl]amino}phenyl)methyl]prop‐2‐enamide.
N‐(3‐{[5‐chloro‐2‐(phenylamino)pyrimidin‐4‐yl]amino}phenyl)prop‐2‐enamide.

Abbreviations

PK:: protein kinases; this work cites ABL, ALK, AXL, EGFR, EPHB2, FAK, FER, FES, GAK, HER2, IGF1R, INSR, IRR, LTK, MER, MET, PYK2, ROS1, and RIPK2
EML4:: echinoderm microtubule-associated protein-like4
HGF:: hepatocyte growth factor
NPM:: nucleophosmin
NUDT1:: hydix hydrolase 1
TK:: tyrosine kinase (protein kinase group)
TKL:: tyrosine kinase like (protein kinase group)
AH:: activity homology
ATP:: adenosine triphosphate
CML:: chronic myelogenous leukemia
DFG:: Asp-Phe-Gly initiating sequence of the activation loop
NSCLC:: nonsmall cell lung carcinoma
PC:: principal component
PCA:: principal component analysis
AL or A-loop:: activation loop
PLS:: projection on to latent spaces, or equivalently partial least squares
PDBID:: Protein Data Bank ID code

References

Hunter T (2007) Treatment for chronic myelogenous leukemia: the long road to imatinib. J Clin Invest 117:2036–2043. doi:10.1172/JCI31691
Article CAS Google Scholar
Zhang J, Yang PL, Gray NS (2009) Targeting cancer with small molecule kinase inhibitors. Nat Rev Cancer 9:28–39
Article CAS Google Scholar
Patterson H, Nibbs R, McInnes I, Siebert S (2014) Protein kinase inhibitors in the treatment of inflammatory and autoimmune diseases. Clin Exp Immunol 176:1–10. doi:10.1111/cei.12248
Article CAS Google Scholar
Rask-Andersen M, Zhang J, Fabbro D, Schiöth HB (2014) Advances in kinase targeting: current clinical use and clinical trials. Trends Pharmacol Sci 35:604–620. doi:10.1016/j.tips.2014.09.007
Article CAS Google Scholar
Swamidass SJ, Schillebeeckx CN, Matlock M, Hurle MR, Agarwal P (2014) Combined analysis of phenotypic and target-based screening in assay networks. J Biomol Screen 19:782–790. doi:10.1177/1087057114523068
Article CAS Google Scholar
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H et al (2000) The Protein Data Bank. Nucleic Acids Res 28:235–242. doi:10.1093/nar/28.1.235
Article CAS Google Scholar
Dar AC, Das TK, Shokat KM, Cagan RL (2012) Chemical genetic discovery of targets and anti-targets for cancer polypharmacology. Nature 486:80–84. doi:10.1038/nature11127
Article CAS Google Scholar
Garuti L, Roberti M, Bottegoni G (2015) Multi-kinase inhibitors. Curr Med Chem 22:695–712
Article CAS Google Scholar
Lavecchia A, Cerchia C (2015) In silico methods to address polypharmacology: current status, applications and future perspectives. Drug Discov Today. doi:10.1016/j.drudis.2015.12.007
Google Scholar
Tang J, Aittokallio T (2014) Network pharmacology strategies toward multi-target anticancer therapies: from computational models to experimental design principles. Curr Pharm Des 20:23–36
Article CAS Google Scholar
Hampton T (2004) “Promiscuous” anticancer drugs that hit multiple targets may thwart resistance. JAMA 292:419–422. doi:10.1001/jama.292.4.419
Article CAS Google Scholar
von Bubnoff N, Barwisch S, Speicher MR, Peschel C, Duyster J (2005) A cell-based screening strategy that predicts mutations in oncogenic tyrosine kinases: implications for clinical resistance in targeted cancer treatment. Cell Cycle Georget Tex 4:400–406
Article Google Scholar
Medina-Franco JL, Giulianotti MA, Welmaker GS, Houghten RA (2013) Shifting from the single to the multitarget paradigm in drug discovery. Drug Discov Today 18:495–501. doi:10.1016/j.drudis.2013.01.008
Article Google Scholar
Kuyoc-Carrillo VF, Medina-Franco JL (2014) Progress in the analysis of multiple activity profile of screening data using computational approaches. Drug Dev Res 75:313–323. doi:10.1002/ddr.21209
Article CAS Google Scholar
Milletti F, Vulpetti A (2010) Predicting polypharmacology by binding site similarity: from kinases to the protein universe. J Chem Inf Model 50:1418–1431. doi:10.1021/ci1001263
Article CAS Google Scholar
Morphy R (2010) Selectively nonselective kinase inhibition: striking the right balance. J Med Chem 53:1413–1437. doi:10.1021/jm901132v
Article CAS Google Scholar
Bamborough P, Drewry D, Harper G, Smith GK, Schneider K (2008) Assessment of chemical coverage of kinome space and its implications for kinase drug discovery. J Med Chem 51:7898–7914. doi:10.1021/jm8011036
Article CAS Google Scholar
Aronov AM, McClain B, Moody CS, Murcko MA (2008) Kinase-likeness and kinase-privileged fragments: toward virtual polypharmacology. J Med Chem 51:1214–1222. doi:10.1021/jm701021b
Article CAS Google Scholar
Nguyen K-SH (2014) Review of the current targeted therapies for non-small-cell lung cancer. World J Clin Oncol 5:576. doi:10.5306/wjco.v5.i4.576
Article Google Scholar
Parums DV (1998) Current status of targeted therapy in non-small cell lung cancer. Drugs Today Barc Spain 2014(50):503–525. doi:10.1358/dot.2014.50.7.2185913
Google Scholar
Chen Z, Fillmore CM, Hammerman PS, Kim CF, Wong K-K (2014) Non-small-cell lung cancers: a heterogeneous set of diseases. Nat Rev Cancer 14:535–546. doi:10.1038/nrc3775
Article CAS Google Scholar
Reungwetwattana T, Dy GK (2013) Targeted therapies in development for non-small cell lung cancer. J Carcinog 12:22. doi:10.4103/1477-3163.123972
Article CAS Google Scholar
Minuti G, D’Incecco A, Landi L, Cappuzzo F (2014) Protein kinase inhibitors to treat non-small-cell lung cancer. Expert Opin Pharmacother 15:1203–1213. doi:10.1517/14656566.2014.909412
Article CAS Google Scholar
Liu SV, Subramaniam D, Cyriac GC, Abdul-Khalek FJ, Giaccone G (2014) Emerging protein kinase inhibitors for non-small cell lung cancer. Expert Opin Emerg Drugs 19:51–65. doi:10.1517/14728214.2014.873403
Article CAS Google Scholar
Yamaguchi N, Lucena-Araujo AR, Nakayama S, de Figueiredo-Pontes LL, Gonzalez DA, Yasuda H et al (2014) Dual ALK and EGFR inhibition targets a mechanism of acquired resistance to the tyrosine kinase inhibitor crizotinib in ALK rearranged lung cancer. Lung Cancer Amst Neth 83:37–43. doi:10.1016/j.lungcan.2013.09.019
Article Google Scholar
Becker K (2014) Management of tyrosine kinase inhibitor resistance in lung cancer with EGFR mutation. World J Clin Oncol 5:560. doi:10.5306/wjco.v5.i4.560
Article Google Scholar
Engle JA, Kolesar JM (2014) Afatinib: A first-line treatment for selected patients with metastatic non-small-cell lung cancer. Am J Health-Syst Pharm AJHP Off J Am Soc Health-Syst Pharm 71:1933–1938. doi:10.2146/ajhp130654
Article CAS Google Scholar
Kim Y, Ko J, Cui Z, Abolhoda A, Ahn JS, Ou S-H et al (2012) The EGFR T790M mutation in acquired resistance to an irreversible second-generation EGFR inhibitor. Mol Cancer Ther 11:784–791. doi:10.1158/1535-7163.MCT-11-0750
Article CAS Google Scholar
Sequist LV, Waltman BA, Dias-Santagata D, Digumarthy S, Turke AB, Fidias P et al (2011) Genotypic and histological evolution of lung cancers acquiring resistance to EGFR inhibitors. Sci Transl Med 3:75ra26. doi:10.1126/scitranslmed.3002003
Article Google Scholar
Yano S, Wang W, Li Q, Matsumoto K, Sakurama H, Nakamura T et al (2008) Hepatocyte growth factor induces gefitinib resistance of lung adenocarcinoma with epidermal growth factor receptor-activating mutations. Cancer Res 68:9479–9487. doi:10.1158/0008-5472.CAN-08-1643
Article CAS Google Scholar
Yu HA, Arcila ME, Rekhtman N, Sima CS, Zakowski MF, Pao W et al (2013) Analysis of tumor specimens at the time of acquired resistance to EGFR-TKI therapy in 155 patients with EGFR-mutant lung cancers. Clin Cancer Res Off J Am Assoc Cancer Res 19:2240–2247. doi:10.1158/1078-0432.CCR-12-2246
Article CAS Google Scholar
Lin Y, Wang X, Jin H (2014) EGFR-TKI resistance in NSCLC patients: mechanisms and strategies. Am J Cancer Res 4:411–435
Google Scholar
Fumarola C, Bonelli MA, Petronini PG, Alfieri RR (2014) Targeting PI3K/AKT/mTOR pathway in non small cell lung cancer. Biochem Pharmacol 90:197–207. doi:10.1016/j.bcp.2014.05.011
Article CAS Google Scholar
Goffin JR, Zbuk K (2013) Epidermal growth factor receptor: pathway, therapies, and pipeline. Clin Ther 35:1282–1303. doi:10.1016/j.clinthera.2013.08.007
Article CAS Google Scholar
Xing L, Klug-Mcleod J, Rai B, Lunney EA (2015) Kinase hinge binding scaffolds and their hydrogen bond patterns. Bioorg Med Chem 23:6520–6527. doi:10.1016/j.bmc.2015.08.006
Article CAS Google Scholar
Hu Y, Bajorath J (2015) Exploring the scaffold universe of kinase inhibitors. J Med Chem 58:315–332. doi:10.1021/jm501237k
Article CAS Google Scholar
Xing L, Rai B, Lunney EA (2014) Scaffold mining of kinase hinge binders in crystal structure database. J Comput Aided Mol Des 28:13–23. doi:10.1007/s10822-013-9700-4
Article CAS Google Scholar
Zhao H, Caflisch A (2015) Current kinase inhibitors cover a tiny fraction of fragment space. Bioorg Med Chem Lett 25:2372–2376. doi:10.1016/j.bmcl.2015.04.005
Article CAS Google Scholar
Fabian MA, Biggs WH, Treiber DK, Atteridge CE, Azimioara MD, Benedetti MG et al (2005) A small molecule–kinase interaction map for clinical kinase inhibitors. Nat Biotechnol 23:329–336. doi:10.1038/nbt1068
Article CAS Google Scholar
Davis MI, Hunt JP, Herrgard S, Ciceri P, Wodicka LM, Pallares G et al (2011) Comprehensive analysis of kinase inhibitor selectivity. Nat Biotechnol 29:1046–1051. doi:10.1038/nbt.1990
Article CAS Google Scholar
Åberg E, Lund B, Pflug A, Gani OABSM, Rothweiler U, de Oliveira TM et al (2012) Structural origins of AGC protein kinase inhibitor selectivities: PKA as a drug discovery tool. Biol Chem 393:1121–1129. doi:10.1515/hsz-2012-0248
Article CAS Google Scholar
Gangwal RP, Bhadauriya A, Damre MV, Dhoke GV, Sangamwar AT (2013) p38 Mitogen-activated protein kinase inhibitors: a review on pharmacophore mapping and QSAR studies. Curr Top Med Chem 13:1015–1035
Article CAS Google Scholar
Fitzgerald CE, Patel SB, Becker JW, Cameron PM, Zaller D, Pikounis VB et al (2003) Structural basis for p38alpha MAP kinase quinazolinone and pyridol-pyrimidine inhibitor specificity. Nat Struct Biol 10:764–769. doi:10.1038/nsb949
Article CAS Google Scholar
Leproult E, Barluenga S, Moras D, Wurtz J-M, Winssinger N (2011) Cysteine mapping in conformationally distinct kinase nucleotide binding sites: application to the design of selective covalent inhibitors. J Med Chem 54:1347–1355. doi:10.1021/jm101396q
Article CAS Google Scholar
Liu Q, Sabnis Y, Zhao Z, Zhang T, Buhrlage SJ, Jones LH et al (2013) Developing irreversible inhibitors of the protein kinase cysteinome. Chem Biol 20:146–159. doi:10.1016/j.chembiol.2012.12.006
Article CAS Google Scholar
Metz JT, Johnson EF, Soni NB, Merta PJ, Kifle L, Hajduk PJ (2011) Navigating the kinome. Nat Chem Biol 7:200–202. doi:10.1038/nchembio.530
Article CAS Google Scholar
Kim S, Thiessen PA, Bolton EE, Chen J, Fu G, Gindulyte A et al (2015) PubChem substance and compound databases. Nucleic Acids Res. doi:10.1093/nar/gkv951
Google Scholar
Wang Y, Suzek T, Zhang J, Wang J, He S, Cheng T et al (2014) PubChem bioassay: 2014 update. Nucleic Acids Res 42:D1075–D1082. doi:10.1093/nar/gkt978
Article CAS Google Scholar
Davies M, Nowotka M, Papadatos G, Dedman N, Gaulton A, Atkinson F et al (2015) ChEMBL web services: streamlining access to drug discovery data and utilities. Nucleic Acids Res 43:W612–W620. doi:10.1093/nar/gkv352
Article CAS Google Scholar
Jacoby E, Tresadern G, Bembenek S, Wroblowski B, Buyck C, Neefs J-M et al (2015) Extending kinome coverage by analysis of kinase inhibitor broad profiling data. Drug Discov Today 20:652–658. doi:10.1016/j.drudis.2015.01.002
Article CAS Google Scholar
Urich R, Wishart G, Kiczun M, Richters A, Tidten-Luksch N, Rauh D et al (2013) De novo design of protein kinase inhibitors by in silico identification of hinge region-binding fragments. ACS Chem Biol 8:1044–1052. doi:10.1021/cb300729y
Article CAS Google Scholar
Bender A (2011) Bayesian methods in virtual screening and chemical biology. Methods Mol Biol Clifton NJ 672:175–196. doi:10.1007/978-1-60761-839-3_7
Article CAS Google Scholar
Clark AM, Ekins S (2015) Open source Bayesian models. 2. Mining a “big dataset” to create and validate models with ChEMBL. J Chem Inf Model 55:1246–1260. doi:10.1021/acs.jcim.5b00144
Article CAS Google Scholar
Mervin LH, Afzal AM, Drakakis G, Lewis R, Engkvist O, Bender A (2015) Target prediction utilising negative bioactivity data covering large chemical space. J Cheminform 7:51. doi:10.1186/s13321-015-0098-y
Article CAS Google Scholar
Hsin K-Y, Ghosh S, Kitano H (2013) Combining machine learning systems and multiple docking simulation packages to improve docking prediction reliability for network pharmacology. PLoS ONE 8:e83922. doi:10.1371/journal.pone.0083922
Article CAS Google Scholar
Erickson JA, Mader MM, Watson IA, Webster YW, Higgs RE, Bell MA et al (2010) Structure-guided expansion of kinase fragment libraries driven by support vector machine models. Biochim Biophys Acta 1804:642–652. doi:10.1016/j.bbapap.2009.12.002
Article CAS Google Scholar
Spyrakis F, Cavasotto CN (2015) Open challenges in structure-based virtual screening: receptor modeling, target flexibility consideration and active site water molecules description. Arch Biochem Biophys 583:105–119. doi:10.1016/j.abb.2015.08.002
Article CAS Google Scholar
Chen Y-C (2015) Beware of docking! Trends Pharmacol Sci 36:78–95. doi:10.1016/j.tips.2014.12.001
Article CAS Google Scholar
Kauvar LM, Higgins DL, Villar HO, Sportsman JR, Engqvist-Goldstein Å, Bukar R et al (1995) Predicting ligand binding to proteins by affinity fingerprinting. Chem Biol 2:107–118. doi:10.1016/1074-5521(95)90283-X
Article CAS Google Scholar
Wassermann AM, Lounkine E, Davies JW, Glick M, Camargo LM (2014) The opportunities of mining historical and collective data in drug discovery. Drug Discov Today. doi:10.1016/j.drudis.2014.11.004
Google Scholar
Ballester PJ, Mitchell JBO (2010) A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking. Bioinforma Oxf Engl 26:1169–1175. doi:10.1093/bioinformatics/btq112
Article CAS Google Scholar
Ashtawy HM, Mahapatra NR (2012) A comparative assessment of ranking accuracies of conventional and machine-learning-based scoring functions for protein–ligand binding affinity prediction. IEEEACM Trans Comput Biol Bioinforma IEEE ACM 9:1301–1313. doi:10.1109/TCBB.2012.36
Article Google Scholar
Li H, Leung K-S, Wong M-H, Ballester PJ (2015) Low-quality structural and interaction data improves binding affinity prediction via random forest. Mol Basel Switz 20:10947–10962. doi:10.3390/molecules200610947
CAS Google Scholar
Gabel J, Desaphy J, Rognan D (2014) Beware of machine learning-based scoring functions-on the danger of developing black boxes. J Chem Inf Model 54:2807–2815. doi:10.1021/ci500406k
Article CAS Google Scholar
Garuti L, Roberti M, Bottegoni G (2011) Irreversible protein kinase inhibitors. Curr Med Chem 18:2981–2994
Article CAS Google Scholar
Miller GD, Bruno BJ, Lim CS (2014) Resistant mutations in CML and Ph(+)ALL—role of ponatinib. Biol Targets Ther 8:243–254. doi:10.2147/BTT.S50734
CAS Google Scholar
Viala M, Brosseau S, Planchard D, Besse B, Soria J-C (2015) Second generation ALK inhibitors in non-small cell lung cancer: systemic review. Bull Cancer (Paris). doi:10.1016/j.bulcan.2015.02.016
Google Scholar
Rolfo C, Passiglia F, Castiglia M, Raez LE, Germonpre P, Gil-Bazo I et al (2014) ALK and crizotinib: after the honeymoon…what else? Resistance mechanisms and new therapies to overcome it. Transl Lung Cancer Res 3:250–261. doi:10.3978/j.issn.2218-6751.2014.03.01
CAS Google Scholar
Gainor J (2015) O10.1Next generation ALK inhibitors and mechanisms of resistance to therapy. Ann Oncol Off J Eur Soc Med Oncol ESMO 26(Suppl 2):ii14. doi:10.1093/annonc/mdv088.1
Article Google Scholar
Ai X, Shen S, Shen L, Lu S (2015) An interaction map of small-molecule kinase inhibitors with anaplastic lymphoma kinase (ALK) mutants in ALK-positive non-small cell lung cancer. Biochimie. doi:10.1016/j.biochi.2015.03.003
Google Scholar
Wilson FH, Johannessen CM, Piccioni F, Tamayo P, Kim JW, Van Allen EM et al (2015) A functional landscape of resistance to ALK inhibition in lung cancer. Cancer Cell 27:397–408. doi:10.1016/j.ccell.2015.02.005
Article CAS Google Scholar
Mologni L, Ceccon M, Pirola A, Chiriano G, Piazza R, Scapozza L et al. (2015) NPM/ALK mutants resistant to ASP3026 display variable sensitivity to alternative ALK inhibitors but succumb to the novel compound PF-06463922. Oncotarget 6:5720–5734. doi:10.18632/oncotarget.3122
Article Google Scholar
Zou HY, Li Q, Engstrom LD, West M, Appleman V, Wong KA et al (2015) PF-06463922 is a potent and selective next-generation ROS1/ALK inhibitor capable of blocking crizotinib-resistant ROS1 mutations. Proc Natl Acad Sci USA 112:3493–3498. doi:10.1073/pnas.1420785112
Article CAS Google Scholar
Iacono D, Chiari R, Metro G, Bennati C, Bellezza G, Cenci M et al (2015) Future options for ALK-positive non-small cell lung cancer. Lung Cancer Amst Neth 87:211–219. doi:10.1016/j.lungcan.2014.12.017
Article Google Scholar
Crystal AS, Shaw AT, Sequist LV, Friboulet L, Niederst MJ, Lockerman EL et al (2014) Patient-derived models of acquired resistance can identify effective drug combinations for cancer. Science 346:1480–1486. doi:10.1126/science.1254721
Article CAS Google Scholar
Bayliss R, Choi J, Fennell DA, Fry AM, Richards MW (2016) Molecular mechanisms that underpin EML4-ALK driven cancers and their response to targeted drugs. Cell Mol Life Sci CMLS. doi:10.1007/s00018-015-2117-6
Google Scholar
Watermann I, Schmitt B, Stellmacher F, Müller J, Gaber R, Kugler C et al (2015) Improved diagnostics targeting c-MET in non-small cell lung cancer: expression, amplification and activation? Diagn Pathol 10:130. doi:10.1186/s13000-015-0362-5
Article CAS Google Scholar
Nakagawa T, Takeuchi S, Yamada T, Nanjo S, Ishikawa D, Sano T et al (2012) Combined therapy with mutant-selective EGFR inhibitor and Met kinase inhibitor for overcoming erlotinib resistance in EGFR-mutant lung cancer. Mol Cancer Ther 11:2149–2157. doi:10.1158/1535-7163.MCT-12-0195
Article CAS Google Scholar
Gani OA, Thakkar B, Narayanan D, Alam KA, Kyomuhendo P, Rothweiler U et al. (2015) Assessing protein kinase target similarity: examples comparing sequence, structure, and cheminformatics approaches. Biochim Biophys Acta 1854:1605–1616. doi:10.1016/j.bbapap.2015.05.004
Article CAS Google Scholar
Posy SL, Hermsmeier MA, Vaccaro W, Ott K-H, Todderud G, Lippy JS et al (2011) Trends in kinase selectivity: insights for target class-focused library screening. J Med Chem 54:54–66. doi:10.1021/jm101195a
Article CAS Google Scholar
Gao Y, Davies SP, Augustin M, Woodward A, Patel UA, Kovelman R et al (2013) A broad activity screen in support of a chemogenomic map for kinase signalling research and drug discovery. Biochem J 451:313–328. doi:10.1042/BJ20121418
Article CAS Google Scholar
Gao C, Cahya S, Nicolaou CA, Wang J, Watson IA, Cummins DJ et al (2013) Selectivity data: assessment, predictions, concordance, and implications. J Med Chem 56:6991–7002. doi:10.1021/jm400798j
Article CAS Google Scholar
Bantscheff M, Eberhard D, Abraham Y, Bastuck S, Boesche M, Hobson S et al (2007) Quantitative chemical proteomics reveals mechanisms of action of clinical ABL kinase inhibitors. Nat Biotechnol 25:1035–1044. doi:10.1038/nbt1328
Article CAS Google Scholar
Sutherland JJ, Gao C, Cahya S, Vieth M (2013) What general conclusions can we draw from kinase profiling data sets? Biochim Biophys Acta 1834:1425–1433. doi:10.1016/j.bbapap.2012.12.023
Article CAS Google Scholar
Pearson K (1901) On lines and planes of closest fit to systems of points in space. Philos Mag Ser 6(2):559–572
Article Google Scholar
Hotelling H (1933) Analysis of a complex of statistical variables into principal components. J Educ Psychol 24:417–520
Article Google Scholar
Azam M, Seeliger MA, Gray NS, Kuriyan J, Daley GQ (2008) Activation of tyrosine kinases by mutation of the gatekeeper threonine. Nat Struct Mol Biol 15:1109–1118. doi:10.1038/nsmb.1486
Article CAS Google Scholar
Knowles PP, Murray-Rust J, Kjær S, Scott RP, Hanrahan S, Santoro M et al (2006) Structure and chemical inhibition of the RET tyrosine kinase domain. J Biol Chem 281:33577–33587. doi:10.1074/jbc.M605604200
Article CAS Google Scholar
Schlessinger J (2003) Autoinhibition control. Science 300:750–752. doi:10.1126/science.1082024
Article CAS Google Scholar
Zhang X, Gureasko J, Shen K, Cole PA, Kuriyan J (2006) An allosteric mechanism for activation of the kinase domain of epidermal growth factor receptor. Cell 125:1137–1149. doi:10.1016/j.cell.2006.05.013
Article CAS Google Scholar
Cui JJ, Tran-Dubé M, Shen H, Nambu M, Kung P-P, Pairish M et al (2011) Structure based drug design of crizotinib (PF-02341066), a potent and selective dual inhibitor of mesenchymal-epithelial transition factor (c-MET) kinase and anaplastic lymphoma kinase (ALK). J Med Chem 54:6342–6363. doi:10.1021/jm2007613
Article CAS Google Scholar
Huang Q, Johnson TW, Bailey S, Brooun A, Bunker KD, Burke BJ et al (2014) Design of potent and selective inhibitors to overcome clinical anaplastic lymphoma kinase mutations resistant to crizotinib. J Med Chem 57:1170–1187. doi:10.1021/jm401805h
Article CAS Google Scholar
Maione P, Sacco PC, Sgambato A, Casaluce F, Rossi A, Gridelli C (2015) Overcoming resistance to targeted therapies in NSCLC: current approaches and clinical application. Ther Adv Med Oncol 7:263–273. doi:10.1177/1758834015595048
Article Google Scholar
Timofeevski SL, McTigue MA, Ryan K, Cui J, Zou HY, Zhu JX et al (2009) Enzymatic characterization of c-Met receptor tyrosine kinase oncogenic mutants and kinetic studies with aminopyridine and triazolopyrazine inhibitors. Biochemistry (Mosc) 48:5339–5349. doi:10.1021/bi900438w
Article CAS Google Scholar
Hubbard SR, Miller WT (2007) Receptor tyrosine kinases: mechanisms of activation and signaling. Curr Opin Cell Biol 19:117–123. doi:10.1016/j.ceb.2007.02.010
Article CAS Google Scholar
Gani OABSM, Narayanan D, Engh RA (2013) Evaluating the predictivity of virtual screening for ABL kinase inhibitors to hinder drug resistance. Chem Biol Drug Des 82:506–519. doi:10.1111/cbdd.12170
Article CAS Google Scholar
Möbitz H (2015) The ABC of protein kinase conformations. Biochim Biophys Acta. doi:10.1016/j.bbapap.2015.03.009
Google Scholar
Wold S, Sjöström M, Eriksson L (2001) PLS-regression: a basic tool of chemometrics. Chemom Intell Lab Syst 58:109–130. doi:10.1016/S0169-7439(01)00155-1
Article CAS Google Scholar
Eathiraj S, Palma R, Volckova E, Hirschi M, France DS, Ashwell MA et al (2011) Discovery of a novel mode of protein kinase inhibition characterized by the mechanism of inhibition of human mesenchymal-epithelial transition factor (c-Met) protein autophosphorylation by ARQ 197. J Biol Chem 286:20666–20676. doi:10.1074/jbc.M110.213801
Article CAS Google Scholar
D’Angelo ND, Bellon SF, Booker SK, Cheng Y, Coxon A, Dominguez C et al (2008) Design, synthesis, and biological evaluation of potent c-Met inhibitors. J Med Chem 51:5766–5779. doi:10.1021/jm8006189
Article CAS Google Scholar
Pearson JD, Lee JKH, Bacani JTC, Lai R, Ingham RJ, Pearson JD et al (2012) NPM-ALK: the prototypic member of a family of oncogenic fusion tyrosine kinases. J Signal Transduct 2012(2012):e123253. doi:10.1155/2012/123253
Google Scholar
Hallberg B, Palmer RH (2013) Mechanistic insight into ALK receptor tyrosine kinase in human cancer biology. Nat Rev Cancer 13:685–700. doi:10.1038/nrc3580
Article CAS Google Scholar
Sasaki T, Okuda K, Zheng W, Butrynski J, Capelletti M, Wang L et al (2010) The neuroblastoma-associated F1174L ALK mutation causes resistance to an ALK kinase inhibitor in ALK-translocated cancers. Cancer Res 70:10038–10043. doi:10.1158/0008-5472.CAN-10-2956
Article CAS Google Scholar
Gelsomino F, Facchinetti F, Haspinger ER, Garassino MC, Trusolino L, De Braud F et al (2014) Targeting the MET gene for the treatment of non-small-cell lung cancer. Crit Rev Oncol Hematol 89:284–299. doi:10.1016/j.critrevonc.2013.11.006
Article CAS Google Scholar
Deng X, Wang J, Zhang J, Sim T, Kim ND, Sasaki T et al (2011) Discovery of 3,5-diamino-1,2,4-triazole ureas as potent anaplastic lymphoma kinase inhibitors. ACS Med Chem Lett 2:379–384. doi:10.1021/ml200002a
Article CAS Google Scholar
Slavish PJ, Price JE, Jiang Q, Cui X, Morris SW, Webb TR (2011) Synthesis of an aryloxy oxo pyrimidinone library that displays ALK-selective inhibition. Bioorg Med Chem Lett 21:4592–4596. doi:10.1016/j.bmcl.2011.05.103
Article CAS Google Scholar
Marsilje TH, Pei W, Chen B, Lu W, Uno T, Jin Y et al (2013) Synthesis, structure–activity relationships, and in vivo efficacy of the novel potent and selective anaplastic lymphoma kinase (ALK) inhibitor 5-chloro-N2-(2-isopropoxy-5-methyl-4-(piperidin-4-yl)phenyl)-N4-(2-(isopropylsulfonyl)phenyl)pyrimidine-2,4-diamine (LDK378) currently in phase 1 and phase 2 clinical trials. J Med Chem 56:5675–5690. doi:10.1021/jm400402q
Article CAS Google Scholar
Li J, Wu N, Tian Y, Zhang J, Wu S (2013) Aminopyridyl/pyrazinyl spiro[indoline-3,4′-piperidine]-2-ones as highly selective and efficacious c-Met/ALK inhibitors. ACS Med Chem Lett 4:806–810. doi:10.1021/ml400203d
Article CAS Google Scholar
Tardy S, Orsato A, Mologni L, Bisson WH, Donadoni C, Gambacorti-Passerini C et al (2014) Synthesis and biological evaluation of benzo[4, 5]imidazo[1,2-c]pyrimidine and benzo[4, 5]imidazo[1,2-a]pyrazine derivatives as anaplastic lymphoma kinase inhibitors. Bioorg Med Chem 22:1303–1312. doi:10.1016/j.bmc.2014.01.007
Article CAS Google Scholar
Nishii H, Chiba T, Morikami K, Fukami TA, Sakamoto H, Ko K et al (2010) Discovery of 6-benzyloxyquinolines as c-Met selective kinase inhibitors. Bioorg Med Chem Lett 20:1405–1409. doi:10.1016/j.bmcl.2009.12.109
Article CAS Google Scholar
Johnson TW, Richardson PF, Bailey S, Brooun A, Burke BJ, Collins MR et al (2014) Discovery of (10R)-7-amino-12-fluoro-2,10,16-trimethyl-15-oxo-10,15,16,17-tetrahydro-2H-8,4-(metheno)pyrazolo[4,3-h][2, 5, 11]-benzoxadiazacyclotetradecine-3-carbonitrile (PF-06463922), a macrocyclic inhibitor of anaplastic lymphoma kinase (ALK) and c-ros oncogene 1 (ROS1) with preclinical brain exposure and broad-spectrum potency against ALK-resistant mutations. J Med Chem 57:4720–4744. doi:10.1021/jm500261q
Article CAS Google Scholar
Xie Q-Q, Zhong L, Pan Y-L, Wang X-Y, Zhou J-P, Di-Wu L et al (2011) Combined SVM-based and docking-based virtual screening for retrieving novel inhibitors of c-Met. Eur J Med Chem 46:3675–3680. doi:10.1016/j.ejmech.2011.05.031
Article CAS Google Scholar
Zhang D, Ai J, Liang Z, Li C, Peng X, Ji Y et al (2012) Discovery of novel 2-aminopyridine-3-carboxamides as c-Met kinase inhibitors. Bioorg Med Chem 20:5169–5180. doi:10.1016/j.bmc.2012.07.007
Article CAS Google Scholar
Zhang D, Zhang X, Ai J, Zhai Y, Liang Z, Wang Y et al (2013) Synthesis and biological evaluation of 2-amino-5-aryl-3-benzylthiopyridine scaffold based potent c-Met inhibitors. Bioorg Med Chem 21:6804–6820. doi:10.1016/j.bmc.2013.07.032
Article CAS Google Scholar
Liu Z, Ai J, Peng X, Song Z, Wu K, Zhang J et al (2014) Novel 2,4-diarylaminopyrimidine analogues (DAAPalogues) showing potent c-Met/ALK multikinase inhibitory activities. ACS Med Chem Lett 5:304–308. doi:10.1021/ml400373j
Article CAS Google Scholar
Szokol B, Gyulavári P, Kurkó I, Baska F, Szántai-Kis C, Greff Z et al (2014) Discovery and biological evaluation of novel dual EGFR/c-Met inhibitors. ACS Med Chem Lett 5:298–303. doi:10.1021/ml4003309
Article CAS Google Scholar
Wodicka LM, Ciceri P, Davis MI, Hunt JP, Floyd M, Salerno S et al (2010) Activation state-dependent binding of small molecule kinase inhibitors: structural insights from biochemistry. Chem Biol 17:1241–1249. doi:10.1016/j.chembiol.2010.09.010
Article CAS Google Scholar
Zhou W, Ercan D, Chen L, Yun C-H, Li D, Capelletti M et al (2009) Novel mutant-selective EGFR kinase inhibitors against EGFR T790M. Nature 462:1070–1074. doi:10.1038/nature08622
Article CAS Google Scholar
Basu D, Richters A, Rauh D (2015) Structure-based design and synthesis of covalent-reversible inhibitors to overcome drug resistance in EGFR. Bioorg Med Chem 23:2767–2780. doi:10.1016/j.bmc.2015.04.038
Article CAS Google Scholar
Nanjo S, Yamada T, Nishihara H, Takeuchi S, Sano T, Nakagawa T et al (2013) Ability of the Met kinase inhibitor crizotinib and new generation EGFR inhibitors to overcome resistance to EGFR inhibitors. PLoS ONE 8:e84700. doi:10.1371/journal.pone.0084700
Article CAS Google Scholar
Davis JM, Davis PD, Hutchings MC, Moffat DFC. Substituted 2-anilinopyrimidines useful as protein kinase inhibitors [Internet]. WO1997019065 A1, 1997. http://www.google.com/patents/WO1997019065A1
Manning G, Whyte DB, Martinez R, Hunter T, Sudarsanam S (2002) The protein kinase complement of the human genome. Science 298:1912–1934. doi:10.1126/science.1075762
Article CAS Google Scholar
Eriksson L, Byrne T, Johansson E, Trygg J, Vikström C (2013) Multi- and megavariate data analysis basic principles and applications. Umetrics Academy
Fabian MA et al (2005) A small molecule–kinase interaction map for clinical kinase inhibitors. Nat Biotechnol 23(3):329–336. doi:10.1126/10.1038/nbt1068
Article CAS Google Scholar
Wodicka LM et al (2010) Activation state-dependent binding of small molecule kinase inhibitors: structural insights from biochemistry. Chem Biol 17(11):1241–1249. doi:10.1016/j.chembiol.2010.09.010
Article CAS Google Scholar

Download references

Authors’ contributions

The initial polypharmacology targeting priority was made by FG. OA, DN, and RAE analysed binding and structural data, and proposed chemical syntheses. All authors wrote portions of this manuscript. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

The datasets supporting the conclusions of this article are available as cited above, including the crystal structures from the Protein Data Bank repository [http://www.rcsb.org], and in the cited ligand binding publications [40, 46, 97]. The PDB codes for the structures used in the figures may be found in the figure captions. The superpositions may be viewed interactively with PYMOL by loading the session files (with .pse extension), deposited as supplementary information.

Funding

Grants from the Norwegian Research Council (191303, OG & DN) and the Norwegian Cancer Society (OG & FG) supported this work.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Dilip Narayanan and Osman A. B. S. M. Gani contributed equally to this work

Authors and Affiliations

The Norwegian Structural Biology Center, Department of Chemistry, Faculty of Science, UiT The Arctic University of Norway, Tromsø, Norway
Dilip Narayanan, Osman A. B. S. M. Gani, Franz X. E. Gruber & Richard A. Engh

Authors

Dilip Narayanan
View author publications
You can also search for this author in PubMed Google Scholar
Osman A. B. S. M. Gani
View author publications
You can also search for this author in PubMed Google Scholar
Franz X. E. Gruber
View author publications
You can also search for this author in PubMed Google Scholar
Richard A. Engh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Richard A. Engh.

Additional file

Additional file 1: Compound_SMILES_strings.rtf. SMILES strings for compounds tested in this manuscript (Figure 11).

Additional file 2: PDBIDs-Used-in_analysis. Summary of PDB codes used in the figures of this manuscript.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Narayanan, D., Gani, O.A.B.S.M., Gruber, F.X.E. et al. Data driven polypharmacological drug design for lung cancer: analyses for targeting ALK, MET, and EGFR. J Cheminform 9, 43 (2017). https://doi.org/10.1186/s13321-017-0229-8

Download citation

Received: 30 October 2016
Accepted: 18 June 2017
Published: 04 July 2017
DOI: https://doi.org/10.1186/s13321-017-0229-8

Data driven polypharmacological drug design for lung cancer: analyses for targeting ALK, MET, and EGFR

Abstract

Background

Results and discussion

Cheminformatics show similarities of ALK and MET, dissimilarity of EGFR, and intermediate similarity of EGFR mutant T790M

Heterogeneity of crystal structures obscures polypharmacology prediction

Do crizotinib co-crystal structures explain cross-reactivity and reveal ALK and MET similarities?

Do the crystal structures in general reveal polypharmacological potential?

How can cheminformatics inform crystallography?

Focussing libraries toward ALK + MET + EGFR polypharmacological inhibition

Conclusions

Methods

Notes

Abbreviations

References

Authors’ contributions

Competing interests

Availability of data and materials

Funding

Publisher’s Note

Author information

Authors and Affiliations

Corresponding author

Additional file

Additional file 1: Compound_SMILES_strings.rtf. SMILES strings for compounds tested in this manuscript (Figure 11).

Additional file 2: PDBIDs-Used-in_analysis. Summary of PDB codes used in the figures of this manuscript.

Rights and permissions

About this article

Cite this article

Keywords

Journal of Cheminformatics

Contact us

Data driven polypharmacological drug design for lung cancer: analyses for targeting ALK, MET, and EGFR

Abstract

Background

Results and discussion

Cheminformatics show similarities of ALK and MET, dissimilarity of EGFR, and intermediate similarity of EGFR mutant T790M

Heterogeneity of crystal structures obscures polypharmacology prediction

Do crizotinib co-crystal structures explain cross-reactivity and reveal ALK and MET similarities?

Do the crystal structures in general reveal polypharmacological potential?

How can cheminformatics inform crystallography?

Focussing libraries toward ALK + MET + EGFR polypharmacological inhibition

Conclusions

Methods

Notes

Abbreviations

References

Authors’ contributions

Competing interests

Availability of data and materials

Funding

Publisher’s Note

Author information

Authors and Affiliations

Corresponding author

Additional file

Additional file 1: Compound_SMILES_strings.rtf. SMILES strings for compounds tested in this manuscript (Figure 11).

Additional file 2: PDBIDs-Used-in_analysis. Summary of PDB codes used in the figures of this manuscript.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Journal of Cheminformatics

Contact us