Skip to main content

Advertisement

Molecular dynamics simulations and in silico peptide ligand screening of the Elk-1 ETS domain

Article metrics

Abstract

Background

The Elk-1 transcription factor is a member of a group of proteins called ternary complex factors, which serve as a paradigm for gene regulation in response to extracellular signals. Its deregulation has been linked to multiple human diseases including the development of tumours. The work herein aims to inform the design of potential peptidomimetic compounds that can inhibit the formation of the Elk-1 dimer, which is key to Elk-1 stability. We have conducted molecular dynamics simulations of the Elk-1 ETS domain followed by virtual screening.

Results

We show the ETS dimerisation site undergoes conformational reorganisation at the α1β1 loop. Through exhaustive screening of di- and tri-peptide libraries against a collection of ETS domain conformations representing the dynamics of the loop, we identified a series of potential binders for the Elk-1 dimer interface. The di-peptides showed no particular preference toward the binding site; however, the tri-peptides made specific interactions with residues: Glu17, Gln18 and Arg49 that are pivotal to the dimer interface.

Conclusions

We have shown molecular dynamics simulations can be combined with virtual peptide screening to obtain an exhaustive docking protocol that incorporates dynamic fluctuations in a receptor. Based on our findings, we suggest experimental binding studies to be performed on the 12 SILE ranked tri-peptides as possible compounds for the design of inhibitors of Elk-1 dimerisation. It would also be reasonable to consider the score-ranked tri-peptides as a comparative test to establish whether peptide size is a determinant factor of binding to the ETS domain.

Background

Regulation of gene expression is essential for the development of all living organisms through processes such as cell proliferation, differentiation and morphogenesis. Key to these processes are mitogen activated protein kinases (MAPK), which target nuclear transcription factors, in response to extracellular signals, to elicit the required genetic response. One such transcription factor is Elk-1. Elk-1 (Ets-like protein 1) is a member of a group of proteins called ternary complex factors (TCF), which are targeted by MAPKs for phosphorylation [13] to regulate the transcription of immediate early genes (IEG) [4, 5]. This event involves the formation of a ternary complex, induced by the cooperative binding of TCFs with serum response factor (SRF) dimers [6] on serum response elements found in IEG promoters [79]. TCFs are a subfamily of ETS (E twenty-six) domain proteins. ETS proteins share a ~85 residue DNA-binding domain (ETS domain) located at the N-terminus of TCFs, which comprises a 'winged helix-turn-helix' motif [10] that binds to a 10-bp ETS binding site containing a 5'-GGA-3' core sequence. Since ETS domains are highly conserved across ETS proteins, ETS binding sites are differentiated by the cooperation of other transcription factors [7, 11, 12] combined with base-specific interaction with variable bases flanking the central core sequence. Whilst TCFs naturally form a complex with SRF, they are also able to bind to DNA containing high-affinity, autonomous ETS binding motifs independent of a SRF [6, 13]. ETS domain proteins are involved in cellular development, growth and differentiation [1416]. Their deregulation has been linked to multiple human diseases [17].

The current X-ray crystal structure of the Elk-1 ETS domain is that of a dimer, with each unit bound to an autonomous 13-bp DNA double helix (PDB code 1DUX) [18] composed of a high affinity ETS binding site motif. Like other ETS domain proteins, the structure reveals three α-helices packed against four anti-parallel β-strands, giving an αββααββ secondary structure (Figure 1). The α3 helix forms the recognition helix, which slots into the major groove of the DNA target with a GGA core (Figure 2a). The dimer interface involves the carboxy-end of α1 and the α1β1 loop (Figure 2b). Contrary to the aforementioned structure, unequivocal experimental evidence has indicated that ETS dimers exist only in solution, [19, 20] whilst monomers occur predominantly in the nucleus, where they target DNA [21, 22]. To date, the structure of an unbound ETS domain is yet to be reported. However, Saven et al. [23] performed molecular dynamics (MD) simulations of a single Elk-1 ETS domain taken from the dimeric structure. They discerned regions within the simulated monomeric structure which showed large structural deviation with respect to the structure of the domain in the dimeric conformation. These regions include residues at the α1β1 loop involved in the ETS dimer interface and residues at the α2α3 loop involved in protein-DNA contacts.

Figure 1
figure1

ETS domain secondary structure. Amino acid sequence of the Elk-1 ETS domain, showing the locations of α-helices and β-strands.

Figure 2
figure2

ETS domain DNA and dimer complexes. (a) An Elk-1 ETS domain bound to its DNA recognition sequence. α-helices are in purple and β-strands in yellow. (b) An Elk-1 ETS domain dimer complex, showing the α1β1 loops providing the interface. The images were generated using VMD [63] and PovRay (http://www.povray.org/), using coordinates taken from the 1DUX crystal structure [18].

Thus far, work on characterising the mechanism for protein-DNA recognition in TCFs has been abundant [18, 2327]. However, there has been little on understanding the basis of Elk-1 dimerisation for transcriptional activity. Shaw and colleagues [21] have identified a region of the Elk-1 ETS domain encompassing the α1β1 loop which distinctly contributes to Elk-1 stability in the cytoplasm by directing Elk-1 dimer formation. Also, dimerisation in the cytoplasm appears to prevent rapid degradation and plays a role in translocation of the protein to the nucleus and its subsequent accumulation therein.

In the current work, we identify a series of peptides that can serve as leads for the design of potential peptidomimetic inhibitors of Elk-1 dimerisation. Using a docking-based approach, we screened entire libraries of all possible di- and tri-peptides against the Elk-1 ETS domain, targeting the stability region of the domain identified by Shaw et al [21]. Given the findings of Saven et al., [23] it was essential to consider possible structural deviations or fluctuations in the α1β1 loop region that may affect binding of such inhibitors. Therefore, we performed MD simulations for an Elk-1 ETS monomer, to generate an ensemble of monomeric ETS conformations to use as docking targets. Herein, we show that tri-peptides appear to be good candidates for the design of inhibitors/binders of the Elk-1 dimer interface, based on size and binding specificity; di-peptides, on the other hand, appeared to behave as generic protein surface binders. We have also identified a set of tri-peptides, which may bind competitively to the ETS dimer interface.

Computational Methods

Molecular Dynamics Simulations

All stages of the MD simulations were carried out using CHARMM version 34b1 [28, 29] with the all-atom CHARMM22 force field [30] and CMAP extensions [3133]. Our initial structure of a representative ETS domain monomer was chain C from the 1DUX crystal structure [18]. For residues with alternative positions, the pose with the highest occupancy was retained. Hydrogen atoms were assigned using the HBUILD module [34]. The system underwent three rounds of energy minimisation using the conjugated gradient method to remove any unphysical contacts until the system had converged. During the minimization all non-hydrogen atoms were harmonically restrained with a force constant of 30 kcal mol-1 Å-1, which was reduced by 10 kcal mol-1 Å-1 at each successive round. The system was solvated in a cubic solvation box (62.2 Å × 62.2 Å × 62.2 Å), containing 7460 TIP3P water molecules, [35] using periodic boundary conditions. The fully solvated system was minimised using the conjugated gradient method. First, the protein was fixed to allow the water molecules to minimise and then harmonically restrained with a force constant of 30 kcal mol-1 Å-1. A switched cut-off was used at an atom-pair distance of 10 Å for calculations of non-bonded interactions with a 2.0 Å switching region. The Particle Mesh Ewald algorithm was used for calculating long-range electrostatic interactions [36]. The system was gradually heated from 0 K to 300 K and allowed to equilibrate for 100 ps. The SHAKE algorithm [37] was applied to constrain all hydrogen-heavy atom bonds to remove the need to sample the high frequency vibrations. Simulations were performed with a 1 fs timestep with the Leapfrog integrator. Following equilibration, the simulation continued for a further 4 ns in the isobaric-isothermal (constant pressure and temperature, NPT) ensemble for the production run. During this phase, structural coordinates of the system were taken at 0.1 ps intervals to build a trajectory of the system dynamics. Time-dependent properties were calculated from the production trajectory. In preparation for this, the C α atoms from each frame of the trajectory were aligned using least-squares fitting to the coordinates of the starting conformation. The root mean square deviation (RMSD) from the initial conformation and radius of gyration were calculated to survey any structural fluctuations over the time-series. To evaluate local structural deviations between the simulated ETS monomer conformations and the initial dimmer conformation, a residue-specific RMSD of main-chain atoms (N, C α , C, O) was calculated, averaged for the entire conformational ensemble. To complement this, we examined the changes in the backbone dihedral angles for structural fluctuations at residues around the α1β1 loop region (16-23).

Several snapshots were extracted from the trajectory to represent the various conformations for an Elk-1 ETS monomer. This was done by clustering the trajectory using backbone dihedral angles for residues 20-22 and selecting the conformation closest to the centre of each cluster as a representative conformation. The threshold defining the size of each cluster was the average of the standard deviation for the six chosen angles, over the time-series.

Automated Peptide Docking

Libraries of all possible di-and tri-peptide were built using all 20 standard, genetically encoded amino acids (400 di-peptides and 8,000 tri-peptides). The first step was to generate a SMILES string [38] from the raw peptide sequences, using ChemAxon's MolConverter program [39]. For each peptide, tautomers at physiological pH (7.4) were produced using ChemAxon's Calculator Plugins [40]. Any unreasonable peptide structures were removed from each library, including any structures with protonated carbonyl groups, de-protonated amines, structures without formally charged termini, and structures with anionic amides. Each peptide library was docked to the ETS monomer conformations obtained from the clustering. The dockings were carried out using OpenEye's docking program FRED, [41] a rigid docking algorithm, which requires a pre-computed conformer ensemble for screening the conformational space of the ligands. The conformer ensembles were created using Omega version 2.3.2 (OpenEye Scientific Software) [42]. A maximum of 500 low energy conformers were constructed for each peptide, in vacuo, using the MMFF94s force field [43, 44]. The Coulombic and attractive part of the van der Waals terms were excluded from the force field, to reduce the effects of strong intermolecular interactions (e.g. hydrogen bonds) that can result in folded (peptide) conformations. Conformers with an energy difference greater than 25 kcal mol-1 from the lowest energy conformer were rejected and conformers in the final ensemble were required to have a heavy atom RMSD greater than the duplicate removal threshold (0.4 Å). These settings were in line with the "high quality screening" settings of Kirchmair et al [45]. All remaining parameters were the default values.

The docking site for each receptor was delineated by a grid box encasing residues at the Elk-1 dimer interface site. A protein contact constraint, which all successful dockings were require to satisfy, was defined on Leu45, which is a key pharmacophoric contact for the dimer interface. The di- and tri-peptide libraries (with conformers) were separately docked, using FRED version 2.2.5, to each of the Elk-1 ETS domain conformations. Each multi-conformer peptide-ligand was exhaustively docked to a receptor using default step-sizes and the ChemGauss2 scoring function (a propriety function of OpenEye) [41].

ChemGauss2 is a chemically aware shape-fitting scoring function, which uses Gaussian functions to describe the shape and chemistry of molecules. The best scoring poses for each compound were optimised in their docked state by half a rotation and translation step in each direction using the OEChemScore scoring function. OEChemscore is an OpenEye variant of the Chemscore [46] scoring function, but lacks a component for an entropy penalty upon complex formation.

On completion of the docking simulations, the single highest-scoring tautomeric state of each peptide was taken to give 400 unique di-peptides and 8,000 unique tri-peptides. Results from both libraries were analysed similarly but separately. The peptides were initially ranked by docking score, where rank 1 corresponded to the highest scoring peptide. Due to the variability in the size of peptides in both libraries, where size is a simple heavy atom count (HAC), and a systematic bias in the scoring functions (including OEChemscore), [47, 48] we employed a simple size-independent metric to rank peptides and select the best binders. We used the size-independent ligand efficiency (SILE) metric [49]:

S I L E = a f f i n i t y H A C 1 - x
(1)

where affinity can be any binding measurement, in our case the docking score; x is derived by fitting the maximal ligand efficiency (LE max ) values from all 12 docking screens against HAC, to a logarithmic function of the form:

ln ( L E m a x ) = k - x ln ( H A C )
(2)

Docking data from the di- and tri-peptide sets were fitted and examined separately.

Docked complexes between the highest-ranked peptides and the 12 protein conformations were analysed using HBPLUS, using default parameters [50]. Only hydrogen bonds between protein and peptide ligands were considered. The number of ETS residues participating in interactions with the top SILE-ranked peptide in each complex was counted. This count was also dissected into the number of specific contacts made, where specificity is defined as interactions between peptide side-chains and ETS residues.

Results and Discussion

Analysis of Elk-1 dimer interface

In order to aid the identification of possible peptide binders for the Elk-1 dimer interface, it was important to identify structural features contributing the dimerisation. Interactions between two Elk-1 ETS domains were calculated using the LIGPLOT program [51]. The minimum and maximum interatomic bond distances for non-bonded contacts were 2.90 Å and 3.90 Å, respectively, and for hydrogen bonds: 2.70 Å and 3.35 Å. The LIGPLOT diagram for chains C and F from the X-ray crystal structure of the ETS dimer (Figure 3) reveals a homodimeric interaction between the two ETS domains. Key to the interface were residues 17, 18 and 49, where Gln18 and Arg49 of one domain donate three hydrogen bonds to Glu17 of the partnering domain. Accompanying these hydrogen bond interactions, several residues make large steric contributions to the interface; these are listed in Table 1 together with a percentage accessible surface area of the interface, calculated using NACCESS [52]. The schematic depicting the secondary structure of the ETS domain in Figure 1 shows the relative positions of these residues in the domain.

Figure 3
figure3

ETS domain dimer interface. LIGPLOT representation of intermolecular interactions between two Elk-1 ETS domains according to the X-ray crystal structure (1DUX) of the dimer complex. Non-bonded interactions are indicated by spokes and hydrogen bonds by dashed green lines, with lengths given in Å. Residues from chain C are shown with purple bonds and chain F in orange.

Table 1 Residue contribution to the dimer interface accessible surface area (ASA), calculated using NACCESS [52]

MD simulations of an Elk-1 ETS domain

Over the course of the MD simulation, the radius of gyration (RoG) and the RMSD of the backbone atoms relative to the minimised (initial) structure of each frame in the trajectory remained stable. The mean values for the RMSD and the RoG were 1.64 ± 0.24 Å and 12.17 ± 0.08 Å, respectively. The latter was, in fact, identical to the RoG of the initial structure. This indicated that the overall shape and size (packing) of both the monomeric and dimeric conformation of the Elk-1 ETS domain is conserved. To focus on localised structural deviations, we calculated the time-averaged RMSD for each residue, with respect to the main-chain atoms of the initial conformation. This revealed substantial structural deviations for residues 20-22 compared to the dimeric conformation (Figure 4). These residues are situated at the centre of the α1β1 loop, which was identified by Shaw et al. [21] as the region accountable for Elk-1 stability. We also measured the backbone dihedral angles for residues in the loop across the entire trajectory. Residues 16 to 19 and residue 23 showed dihedral angle fluctuations within range of typical thermal fluctuations for proteins, with an average standard deviation about the mean of ±19° across the trajectory for the 10 angles; fluctuations of the backbone dihedrals for residues 21 and 22 were considerably larger, with the lowest standard deviation value of ±59° and the highest of ±88°. The high fluctuation of residues 21 and 22 are consistent with the high RMSD values seen in Figure 4.

Figure 4
figure4

Residue specific ETS monomer fluctuations. Time-averaged RMSD for the main-chain atoms of each residue over 4 ns of simulation of an Elk-1 ETS domain. The bars signify fluctuations about the mean and correspond to one standard deviation.

Since the structure fluctuates in the region coinciding with the α1β1 loop, which was our proposed docking binding site, it would have been unreasonable to dock to the single domain conformation taken from the crystal structure of the dimer, or to dock to the averaged structure of the MD trajectory. Instead, we clustered the trajectory, based on the backbone dihedral angles of residues 20-22, to extract several conformations representative of an Elk-1 ETS domain monomer. Using a clustering threshold of 49.2°, which was the average of the standard deviations of the six angles, 12 clusters were obtained. From each cluster, a single conformation was taken (Table 2) and used for the docking study. (see Additional file 1 for an alignment of the minimised structure and the 12 conformations).

Table 2 ETS monomer conformations obtained by clustering the MD trajectory on the backbone dihedral angles of residues 20-22 (cluster threshold = 49.2°)

Peptide Docking

Peptide screening

Libraries of all possible di- and tri-peptides, together with possible tautomers of each peptide were constructed. The final libraries (including protonation and tautomeric states) were made up of 1,128 di-peptides and 33,367 tri-peptides. The two peptide libraries were individually screened against the 12 monomer conformations of the Elk-1 ETS domain. Each multi-conformer peptide-ligand was exhaustively docked to the receptors, i.e., all rigid-body translations and rotations of a conformer were enumerated within the docking site, centred on residues for the Elk-1 dimer interface. Although with different affinities, all peptides bound to the docking site with favourable scores.

The docked peptide-ligands for each library were ranked according to docking score, retaining only the highest scoring tautomer of each peptide. Using this simple ranking scheme, particularly for the di-peptides, peptide-ligands with a larger heavy atom count (HAC) were ranked higher than those with a smaller one. Although this effect is apparent in experimental ligand binding data, [53] unfortunately it is unduly amplified in computational docking studies. The problem stems from the additive nature of scoring functions. The scoring function tends to favour larger ligands, as they contribute towards a greater number of intermolecular interactions with the target. This phenomenon is inherent to several docking scoring functions, including OEChemscore, which lack other terms in the function, such as a desolvation penalty term, that can counter-balance the favoured interaction terms. For our peptide ligands, the effect is seen in Figures 5a and 5b for the highest scoring di- and tri-peptides, respectively, taken at each HAC from the 12 docking screens.

Figure 5
figure5

Highest scoring peptides by size. Highest scoring (a) di-peptides and (b) tri-peptides taken at each HAC from all 12 docking screens. Docking scores have been plotted as non-negative values for convenience.

Because the bias towards larger ligands is counter to the rules for drug bioavailability, [54] a simple metric, called ligand efficiency has been developed to assess the binding of a compound, with respect to the number of atoms, and its potential for lead optimisation [53, 55]. Ligand efficiency (LE) is the binding affinity (potency) divided by a measure of the size of a ligand, often the HAC, as defined by Kuntz et al [53]. Compounds that can provide the desired binding affinity with fewer atoms are considered efficient. However, in large screening studies of ligands spanning a wide range of molecular sizes, ligand efficiency is non-linearly related to HAC, and appears to fall as size increases [56, 57]. This trend can be illustrated by plotting the LE versus HAC (Figure 6), for the peptides used in Figure 5. The trend may be related to the increased complexity of larger compounds. More complex compounds can bind a target with a less than optimal geometry, due to binding constraints and structural compromises [58]. They also offer a smaller surface area per atom to make favourable interactions compared to smaller, less complex compounds [56]. LE over-corrects for the size dependence in docking scores. Therefore, a size-normalised efficiency scale was needed. We used the size-independent ligand efficiency (SILE) [49] scale to rank peptides in the docked libraries. In order to apply a SILE metric for our data, a value for x, for Equation (1), was obtained by fitting the maximal LE (LE max ) values taken from Figure 6 to the function in Equation (2) (see Additional file 1). LE max is the highest LE value at each HAC. The x values for di- and tri-peptides were 0.649 and 0.665, respectively, which were close to the generic value of 0.7 suggested by Nissink [49].

Figure 6
figure6

Peptide binding efficiencies by size. Highest ligand efficiency values for (a) di-peptides and (b) tri-peptides taken at each HAC from all 12 docking screens.

By mapping the two sequence positions for ranked di-peptides on to a 20 × 20 matrix, where each square is graded according to the associated rank, we can see the difference in size-dependence between score- and SILE-ranked results (Figures 7a and 7b). Score-ranked matrices clearly show peptides consisting of heavier amino acid residues such as tryptophan and tyrosine ranked higher, whilst those of smaller residues such as alanine and glycine ranked lower. The SILE-ranked matrices reduce this bias (Figure 7b). Similarly, plots of the distribution of LE and SILE values for the di- and tri-peptide dockings as a function of HAC reveal a reduced size-dependency for SILE values compared to LE values (compare Figure 8b with 8a and 8d with 8c). However, the SILE values for di-peptides maintain some size dependence (Figure 8b) compared to the tri-peptides (Figure 8d). It may be that the binding site readily accommodates the di-peptides, due to their smaller size and low structural complexity, and thus the size bias remains dominant. Therefore, di-peptides with a lower HAC bind and fit the binding site more completely, where a greater number of atoms participate equally in protein-peptide interactions compared to tri-peptides and di-peptides with a higher HAC. A similar result was observed in a peptide docking study to the Fv fragment of a monoclonal IgM cryoglobulin [59]. In that study, docking results were skewed towards di-peptides composed of larger residues. It was suggested that the di-peptides were too small to discriminate between different binding cavities, which is consistent to the hypothesis of 'a small ball in a large hole'. Thus, the size-independent metric is less effective for compounds of lower complexity. This also suggests that di-peptides are fairly promiscuous protein surface binders and may not offer a specific binding preference for the dimer interface site had the docking site definition been larger.

Figure 7
figure7

Docked di-peptide ranking maps. 2D-maps representing the peptide rank by (a) docking score and (b) SILE values according to the positions occupied by each residue for di-peptides docked to ETS conformation 9 (ETS9). The ranks are represented as squares shaded from black (highest rank) to white (lowest rank).

Figure 8
figure8

Peptide efficiency distribution. Distribution of LE ((a) and (c)) and SILE ((b) and (d)) values for docked di- and tri-peptides as a function of the number of heavy atoms. (a) LE and (b) SILE values for di-peptides; (c) LE and (d) SILE values for tri-peptides.

Tables 3 and 4 list the highest score- and SILE-ranked di- and tri-peptides, respectively. These tables again reveal the preference for peptides consisting of large aromatic residues for the score-ranked results, particularly for the di-peptides. In addition to the factors discussed above, it is possible such residues may behave as anchors to aid the binding of the complete peptides. However, given that a minority (38%) of residues in the dimer interface are non-polar, it is perhaps unlikely that these hydrophobic residues, especially tryptophan and tyrosine, would show particular affinity for the binding site in experimental assays.

Table 3 Highest score- and SILE-ranked di-peptides
Table 4 Highest score- and SILE-ranked tri-peptides

Structural analysis of docked complexes

Interactions between the top SILE-ranked peptide-protein complexes were calculated for each Elk-1 ETS domain conformation. Given the systematic bias in the score-ranked results, they were not considered for interaction analysis. Overall, both sets of peptides interact with residues in the dimer interface, namely the regions at sequence positions 10-20 and 40-50. To investigate the specificity of binding, the number of ETS domain residues hydrogen bonded with a peptide were counted for each of the docked complexes. On average, di-peptide ligands interacted with fewer ETS domain residues compared to tri-peptides (column 2, Tables 5 and 6), although some of these interactions did include those made to ETS domain residues Glu17, Gln18 and Arg49, which were identified as key hydrogen-bond contacts at the dimer interface (see Figure 3). In addition, the highest SILE-ranked tri-peptides make more specific contacts to the protein compared to the highest-ranked di-peptides (column 3, Tables 5 and 6). Here, we measure specificity as interactions between peptide side-chains and ETS residues. Figure 9 shows an "Interaction fingerprint" of the hydrogen bonds between the highest SILE-ranked peptide and the corresponding ETS conformations. The Elk-1 ETS domain dimer fingerprint is given at the top of the figure as a reference.

Table 5 Number of ETS residues participating in hydrogen bond interactions with highest SILE-ranked di-peptides
Table 6 Number of ETS residues participating in hydrogen bond interactions with highest SILE-ranked tri-peptides
Figure 9
figure9

Peptide hydrogen bond fingerprints. Hydrogen bond "Interaction fingerprints" for docked complexes between Elk-1 ETS domain conformations and highest SILE-ranked (a) di- and (b) tri-peptides. Specific contacts, as described in the main text, are given in red and peptide main-chain contacts in blue.

Perhaps naively we may have expected a peptide corresponding to a contiguous sequence of residues involved in the Elk-1 dimer interface would have been ranked high in the docking simulation, but this was not the case. The most obvious such peptides were, the tri-peptide Arg-Glu-Gln, which corresponds to residues 16-18 in the ETS domain, and the di-peptide Glu-Gln corresponding to residues 17-18 (as seen in Figure 3). The best SILE-ranked Glu-Gln di-peptide was ranked 52 out of 400 in complex with ETS7 and had an average ranking of 133 for all 12 docked complexes. Whilst the best SILE-ranked Arg-Glu-Gln tri-peptide was ranked 1423 out of 8000 in complex with ETS8 and an average ranking of 3729 (Table 7). This is largely because these peptides, although capable of providing some of the hydrogen bond interactions found at the dimer interface, are unable to mimic the interactions of other residues involved in dimer interface (see Table 1 and Figure 3), particularly van der Waals contacts. This has been recognised in other efforts to discover small molecules that disrupt protein-protein interactions [60]. For this reason, the binding of the two aforementioned peptides may be weaker than the higher ranked peptides, which satisfy more of the pharmacophoric constraints of the dimer.

Table 7 SILE-rank of Glu-Gln di-peptide and Arg-Glu-Gln tri-peptide in complex with each ETS conformation

Conclusions

It is well-established that TCFs, such as Elk-1, play a critical role in transcriptional activation in response to extracellular signals and a consequent role in the growth and development of cells. Using MD simulations we have identified possible conformations for an Elk-1 ETS domain monomer and observed a structural variation from the dimeric form at the α1β1 loop, where two Elk-1 proteins dimerise. Against these monomeric conformations we screened all possible di- and tri-peptides and have identified several peptides with potential to mimic and possibly inhibit Elk-1 dimerisation. The size and binding specificity of the tri-peptides make them ideal candidates for the design of peptidomimetics of the Elk-1 dimer interface. The di-peptides, on the other hand, appear to be a generic set of protein surface binders and are unlikely to produce experimental binding affinity for the ETS dimer interface site that would correlate with the docking data. The notion of using tri-peptides as potential candidates for peptidomimetic design has also been supported in a recent review by Ung and Winkler [61].

Since docking scoring functions are based on a number of simplifications and assumptions, their predictions for binding free energies for a protein-ligand complex are not quantitative. This also makes it very difficult to discriminate between strong/weak binders and non-binders, particularly for a relatively at and exposed binding site, as investigated here. Although this is a major limitation in a docking protocol, the exhaustive search algorithm of docking programs has been successful in predicting correct binding geometries of known hits [48]. As with all docking protocols, true validation can only be achieved through experimental binding measurements correlating with the docking results. For an experimental binding study, it would be reasonable to test the binding affinity of the top SILE-ranked tri-peptides listed in Table 4. The score-ranked tri-peptides may also be worth considering as a comparative test to establish whether size of the peptide is a determinant factor of binding to the ETS domain or if it is, indeed, just an artefact of docking. Binding data for the Arg-Glu-Gln peptide may also be useful in explaining the poor predicted binding by the docking simulations.

It is quite clear that complex formation of a protein and ligand is a dynamic mechanism. Here we have shown a combination of MD and docking simulations can be used to provide an understanding of the effects on ligand binding to a dynamic representation of the receptor, which a single configuration crystal structure would fail to reveal. Thus, computer simulations on protein-ligand complexes can enhance crystal structure data in this respect. We plan to extend the current work by performing all-atom MD simulations of selected peptides complexed with an Elk-1 ETS domain to assess the stability of the complexes, whilst incorporating any induced fitting of the peptides and obtain accurate binding data for use in designing future docking studies of optimised peptides. We also plan to apply free energy perturbation methods [62] to a set of the best peptides to calculate relative binding free energy of alchemic transformations of the peptides in complex with the Elk-1 ETS domain. This may also go as far as identifying tetra-peptides with potentially superior binding affinities compared to the tri-peptides we have considered here.

References

  1. 1.

    Gille H, Kortenjann M, Thomae O, Moomaw C, Slaughter C, Cobb MH, Shaw PE: ERK phosphorylation potentiates Elk-1-mediated ternary complex formation and transactivation. EMBO J. 1995, 14: 951-962.

  2. 2.

    Janknecht R, Ernst WH, Pingoud V, Nordheim A: Activation of ternary complex factor Elk-1 by MAP kinases. EMBO J. 1993, 12: 5097-5104.

  3. 3.

    Janknecht R, Hunter T: Activation of the Sap-1a transcription factor by the c-Jun N-terminal kinase (JNK) mitogen-activated protein kinase. J Biol Chem. 1997, 272: 4219-4224. 10.1074/jbc.272.7.4219.

  4. 4.

    Buchwalter G, Gross C, Wasylyk B: Ets ternary complex transcription factors. Gene. 2004, 324: 1-14.

  5. 5.

    Shaw PE, Saxton J: Ternary complex factors: prime nuclear targets for mitogen-activated protein kinases. Int J Biochem Cell Biol. 2003, 35: 1210-1226. 10.1016/S1357-2725(03)00031-1.

  6. 6.

    Janknecht R, Nordheim A: Elk-1 protein domains required for direct and SRF-assisted DNA-binding. Nucleic Acids Res. 1992, 20: 3317-3324. 10.1093/nar/20.13.3317.

  7. 7.

    Cahill MA, Janknecht R, Nordheim A: Signalling pathways: Jack of all cascades. Curr Biol. 1996, 6: 16-19. 10.1016/S0960-9822(02)00410-4.

  8. 8.

    Shaw PE, Schrter H, Nordheim A: The ability of a ternary complex to form over the serum response element correlates with serum inducibility of the human c-fos promoter. Cell. 1989, 56: 563-572. 10.1016/0092-8674(89)90579-5.

  9. 9.

    Treisman R: Ternary complex factors: growth factor regulated transcriptional activators. Curr Opin Genet Dev. 1994, 4: 96-101. 10.1016/0959-437X(94)90097-3.

  10. 10.

    Brennan RG: The winged-helix DNA-binding motif: Another helix-turn-helix takeoff. Cell. 1993, 74: 773-776. 10.1016/0092-8674(93)90456-Z.

  11. 11.

    Verger A, Duterque-Coquillaud M: When Ets transcription factors meet their partners. Bioessays. 2002, 24: 362-370. 10.1002/bies.10068.

  12. 12.

    Coffer P, de Jonge M, Mettouchi A, Binetruy B, Ghysdael J, Kruijer W: junB promoter regulation: Ras mediated transactivation by c-Ets-1 and c-Ets-2. Oncogene. 1994, 9: 911-921.

  13. 13.

    Sharrocks AD: ERK2/p42 MAP kinase stimulates both autonomous and SRF-dependent DNA binding by Elk-1. FEBS Lett. 1995, 368: 77-80. 10.1016/0014-5793(95)00604-8.

  14. 14.

    Oikawa T, Yamada T: Molecular biology of the Ets family of transcription factors. Gene. 2003, 303: 11-34.

  15. 15.

    Sharrocks AD, Brown AL, Ling Y, Yates PR: The ETS-domain transcription factor family. Int J Biochem Cell Biol. 1997, 29: 1371-1387. 10.1016/S1357-2725(97)00086-1.

  16. 16.

    Wasylyk B, Hahn SL, Giovane A: The Ets family of transcription factors. Eur J Biochem. 1993, 211: 7-18. 10.1111/j.1432-1033.1993.tb19864.x.

  17. 17.

    Dittmer J, Nordheim A: Ets transcription factors and human disease. Biochim Biophys Acta. 1998, 1377: F1-11.

  18. 18.

    Mo Y, Vaessen B, Johnston K, Marmorstein R: Structure of the Elk-1-DNA complex reveals how DNA-distal residues affect ETS domain recognition of DNA. Nat Struct Biol. 2000, 7: 292-297. 10.1038/74055.

  19. 19.

    Carrre S, Verger A, Flourens A, Stehelin D, Duterque-Coquillaud M: Erg proteins, transcription factors of the Ets family, form homo, heterodimers and ternary complexes via two distinct domains. Oncogene. 1998, 16: 3261-3268. 10.1038/sj.onc.1201868.

  20. 20.

    Drewett V, Muller S, Goodall J, Shaw PE: Dimer formation by ternary complex factor ELK-1. J Biol Chem. 2000, 275: 1757-1762. 10.1074/jbc.275.3.1757.

  21. 21.

    Evans EL, Saxton J, Shelton SJ, Begitt A, Holliday ND, Hipskind RA, Shaw PE: Dimer formation and conformational flexibility ensure cytoplasmic stability and nuclear accumulation of Elk-1. Nucleic Acids Res. 2011, 39: 6390-6402. 10.1093/nar/gkr266.

  22. 22.

    Janknecht R, Zinck R, Ernst WH, Nordheim A: Functional dissection of the transcription factor Elk-1. Oncogene. 1994, 9: 1273-1278.

  23. 23.

    Park S, Boder ET, Saven JG: Modulating the DNA affinity of Elk-1 with computationally selected mutations. J Mol Biol. 2005, 348: 75-83. 10.1016/j.jmb.2004.12.062.

  24. 24.

    Shore P, Bisset L, Lakey J, Waltho JP, Virden R, Sharrocks AD: Characterization of the Elk-1 ETS DNA-binding domain. J Biol Chem. 1995, 270: 5805-5811. 10.1074/jbc.270.11.5805.

  25. 25.

    Szymczyna BR, Arrowsmith CH: DNA binding specificity studies of four ETS proteins support an indirect read-out mechanism of protein-DNA recognition. J Biol Chem. 2000, 275: 28363-28370.

  26. 26.

    Shore P, Whitmarsh AJ, Bhaskaran R, Davis RJ, Waltho JP, Sharrocks AD: Determinants of DNA-binding specificity of ETS-domain transcription factors. Mol Cell Biol. 1996, 16: 3338-3349.

  27. 27.

    Mo Y, Vaessen B, Johnston K, Marmorstein R: Structures of SAP-1 bound to DNA targets from the E74 and c-fos promoters: Insights into DNA sequence discrimination by Ets proteins. Mol Cell. 1998, 2: 201-212. 10.1016/S1097-2765(00)80130-6.

  28. 28.

    Brooks BR, Brooks CL, Mackerell AD, Nilsson L, Petrella RJ, Roux B, Won Y, Archontis G, Bartels C, Boresch S, Caflisch A, Caves L, Cui Q, Dinner AR, Feig M, Fischer S, Gao J, Hodoscek M, Im W, Kuczera K, Lazaridis T, Ma J, Ovchinnikov V, Paci E, Pastor RW, Post CB, Pu JZ, Schaefer M, Tidor B, Venable RM, Woodcock HL, Wu X, Yang W, York DM, Karplus M: CHARMM: The biomolecular simulation program. J Comput Chem. 2009, 30: 1545-1614. 10.1002/jcc.21287.

  29. 29.

    Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M: CHARMM: A program for macromolecular energy, minimization, and dynamics calculations. J Comput Chem. 1983, 4: 187-217. 10.1002/jcc.540040211.

  30. 30.

    MacKerell AD, Bashford D, Dunbrack RL, Evanseck JD, Field MJ, Fischer S, Gao J, Guo H, Ha S, Joseph-McCarthy D, Kuchnir L, Kuczera K, Lau FTK, Mattos C, Michnick S, Ngo T, Nguyen DT, Prodhom B, Reiher WE, Roux B, Schlenkrich M, Smith JC, Stote R, Straub J, Watanabe M, Wiorkiewicz-Kuczera J, Yin D, Karplus M: All-Atom Empirical Potential for Molecular Modeling and Dynamics Studies of Proteins. J Phys Chem B. 1998, 102: 3586-3616. 10.1021/jp973084f.

  31. 31.

    Buck M, Bouguet-Bonnet S, Pastor RW, MacKerell AD: Importance of the CMAP correction to the CHARMM22 protein force field: Dynamics of hen lysozyme. Biophys J. 2006, 90: L36-L38. 10.1529/biophysj.105.078154.

  32. 32.

    Mackerell AD, Feig M, Brooks CL: Extending the treatment of backbone energetics in protein force fields: Limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. J Comput Chem. 2004, 25: 1400-1415. 10.1002/jcc.20065.

  33. 33.

    MacKerell AD, Feig M, Brooks CL: Improved treatment of the protein backbone in empirical force fields. J Am Chem Soc. 2004, 126: 698-699. 10.1021/ja036959e.

  34. 34.

    Brünger AT, Karplus M: Polar hydrogen positions in proteins: empirical energy placement and neutron diffraction comparison. Proteins. 1988, 4: 148-156. 10.1002/prot.340040208.

  35. 35.

    Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML: Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983, 79: 926-935. 10.1063/1.445869.

  36. 36.

    Essmann U, Perera L, Berkowitz ML, Darden T, Lee H, Pedersen LG: A smooth particle mesh Ewald method. J Chem Phys. 1995, 103: 8577-8593. 10.1063/1.470117.

  37. 37.

    Ryckaert JP, Ciccotti G, Berendsen HJC: Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J Comput Phys. 1977, 23: 327-341. 10.1016/0021-9991(77)90098-5.

  38. 38.

    Weininger D: SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J Chem Inf Comp Sci. 1988, 28: 31-36. 10.1021/ci00057a005.

  39. 39.

    Marvin, version 5.3.1: MolConverter was used for converting peptide sequences to SMILES strings. 2010, [http://www.chemaxon.com]

  40. 40.

    Marvin, version 5.3.1: Calculator Plugins were used for tautomer and protonation state calculations. 2010, [http://www.chemaxon.com]

  41. 41.

    FRED, version 2.2.5: OpenEye Scientific Software Inc. 2010, Santa Fe, NM, USA, [http://www.eyesopen.com]

  42. 42.

    Omega, version 2.3.2: OpenEye Scientific Software Inc. 2010, Santa Fe, NM, USA, [http://www.eyesopen.com]

  43. 43.

    Halgren TA: MMFF VI. MMFF94s option for energy minimization studies. J Comput Chem. 1999, 20: 720-729. 10.1002/(SICI)1096-987X(199905)20:7<720::AID-JCC7>3.0.CO;2-X.

  44. 44.

    Halgren TA: MMFF VII. Characterization of MMFF94, MMFF94s, and other widely available force fields for conformational energies and for intermolecular-interaction energies and geometries. J Comput Chem. 1999, 20: 730-748. 10.1002/(SICI)1096-987X(199905)20:7<730::AID-JCC8>3.0.CO;2-T.

  45. 45.

    Kirchmair J, Wolber G, Laggner C, Langer T: Comparative performance assessment of the conformational model generators Omega and Catalyst: a large-scale survey on the retrieval of protein-bound ligand conformations. J Chem Inf Model. 2006, 46: 1848-1861. 10.1021/ci060084g.

  46. 46.

    Eldridge MD, Murray CW, Auton TR, Paolini GV, Mee RP: Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes. J Comput Aided Mol Des. 1997, 11: 425-445. 10.1023/A:1007996124545.

  47. 47.

    Kitchen DB, Decornez H, Furr JR, Bajorath J: Docking and scoring in virtual screening for drug discovery: methods and applications. Nat Rev Drug Discov. 2004, 3: 935-949. 10.1038/nrd1549.

  48. 48.

    Sousa SF, Fernandes PA, Ramos MJ: Protein-ligand docking: Current status and future challenges. Proteins. 2006, 65: 15-26. 10.1002/prot.21082.

  49. 49.

    Nissink JWM: Simple size-independent measure of ligand efficiency. J Chem Inf Model. 2009, 49: 1617-1622. 10.1021/ci900094m.

  50. 50.

    McDonald IK, Thornton JM: Satisfying hydrogen bonding potential in proteins. J Mol Biol. 1994, 238: 777-793. 10.1006/jmbi.1994.1334.

  51. 51.

    Wallace AC, Laskowski RA, Thornton JM: LIGPLOT: A program to generate schematic diagrams of protein-ligand interactions. Protein Eng. 1995, 8: 127-134. 10.1093/protein/8.2.127.

  52. 52.

    Hubbard SJ, Thornton JM: 'NACCESS' Computer Program. 1993, University College London: Department of Biochemistry and Molecular Biology

  53. 53.

    Kuntz ID, Chen K, Sharp KA, Kollman PA: The maximal affinity of ligands. Proc Natl Acad Sci USA. 1999, 96: 9997-10002. 10.1073/pnas.96.18.9997.

  54. 54.

    Lipinski CA, Lombardo F, Dominy BW, Feeney PJ: Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev. 2001, 46: 3-26. 10.1016/S0169-409X(00)00129-0.

  55. 55.

    Hopkins AL, Groom CR, Alex A: Ligand efficiency: a useful metric for lead selection. Drug Discov Today. 2004, 9: 430-431. 10.1016/S1359-6446(04)03069-7.

  56. 56.

    Reynolds CH, Tounge BA, Bembenek SD: Ligand binding efficiency: Trends, physical basis, and implications. J Med Chem. 2008, 51: 2432-2438. 10.1021/jm701255b.

  57. 57.

    Reynolds CH, Bembenek SD, Tounge BA: The role of molecular size in ligand efficiency. Bioorg Med Chem Lett. 2007, 17: 4258-4261. 10.1016/j.bmcl.2007.05.038.

  58. 58.

    Hann MM, Leach AR, Harper G: Molecular complexity and its impact on the probability of finding leads for drug discovery. J Chem Inf Comp Sci. 2001, 41: 856-864. 10.1021/ci000403i.

  59. 59.

    Yuriev E, Ramsland PA, Edmundson AB: Docking of combinatorial peptide libraries into a broadly cross-reactive human IgM. J Mol Recognit. 2001, 14: 172-184. 10.1002/jmr.533.

  60. 60.

    Wells JA, McClendon CL: Reaching for high-hanging fruit in drug discovery at protein-protein interfaces. Nature. 2007, 450: 1001-1009. 10.1038/nature06526.

  61. 61.

    Ung P, Winkler DA: Tripeptide motifs in biology: Targets for peptidomimetic design. J Med Chem. 2011, 54: 1111-1125. 10.1021/jm1012984.

  62. 62.

    Kollman P: Free energy calculations: Applications to chemical and biochemical phenomena. Chem Rev. 1993, 93: 2395-2417. 10.1021/cr00023a004.

  63. 63.

    Humphrey W, Dalke A, Schulten K: VMD: Visual molecular dynamics. J Mol Graph. 1996, 14: 33-38. 10.1016/0263-7855(96)00018-5.

Download references

Acknowledgements

We thank OpenEye Scientific Software for provision of academic licences for FRED and Omega. We also thank ChemAxon for providing a licence for Marvin, which was used to build the peptide libraries.

Author information

Correspondence to Jonathan D Hirst.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

PES and JDH together conceived the study. AH and JDH devised the strategy and analyses undertaken. AH performed the calculations, analysis and drafted the manuscript. JDH supervised the study and with PES participated in the discussion of the results. All authors have read and approved the final manuscript.

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Hussain, A., Shaw, P.E. & Hirst, J.D. Molecular dynamics simulations and in silico peptide ligand screening of the Elk-1 ETS domain. J Cheminform 3, 49 (2011) doi:10.1186/1758-2946-3-49

Download citation

Keywords

  • Dock
  • Root Mean Square Deviation
  • Serum Response Factor
  • Docking Score
  • Ligand Efficiency