Skip to main content
  • Poster presentation
  • Open access
  • Published:

Where are the boundaries? Automated pocket detection for druggability studies

Computer-based prediction of protein druggability is an essential task in the drug development process. Early identification of disease modifying targets that can be modulated by low-molecular weight compounds can help to speed up and reduce costs in drug discovery. Recently, first methods have been presented performing a druggability estimation solely based on the 3D structure of the protein [13]. The essential first step for such methods is the identification of the active site. A multitude of methods exist for automated active site prediction [46]. However, most methods developed for automated docking procedures do not explicitly focus on the definition of the boundary of the active site. Since druggability estimates are based on structural descriptors of the active site, a precise description of the active site boundaries is vital for correct predictions.

In this work, we present a method to predict protein binding pockets and split them into subpockets such that small molecules are mostly contained within one sub-pocket. The method is based on a novel strategy to geometrically detect narrow regions in pockets. For druggability predictions, such pocket descriptions result in more meaningful structural descriptors like active site surface or volume. Moreover, if several structures from one protein are known, sub-pockets can give hints about protein flexibility and induced fit conformational changes.

Our method was evaluated on 718 proteins from the PDBbind [7] data set, as well as 5419 proteins from the scPDB [8] data set. Binding pockets are correctly predicted in 94% and 93% of the datasets. 38%, respectively 45% of the proteins from the two datasets contain pockets which can be divided into more than one sub-pocket. In all cases one sub-pocket completely covers the co-crystallized ligand. Besides the classical overlap-measure of ligand versus predicted active site, we additionally considered the pocket coverage by the co-crystallized ligand. We found that the number of test cases with more than 30% pocket coverage rises from 30% to 74% (PDBbind) and from 28% to 63% (scPDB), respectively, when considering sub-pockets.


  1. Nayal M, Honig B: Proteins. 2006, 63 (4): 892-906. 10.1002/prot.20897.

    Article  CAS  Google Scholar 

  2. Cheng C, et al: Nat Biotechnol. 2007, 25 (1): 71-5. 10.1038/nbt1273.

    Article  Google Scholar 

  3. Halgren TA: J Chem Inf Model. 2009, 49 (2): 377-89. 10.1021/ci800324m.

    Article  CAS  Google Scholar 

  4. Hendlich M: J Mol Graph Model. 1997, 15 (6): 359-63. 10.1016/S1093-3263(98)00002-3.

    Article  CAS  Google Scholar 

  5. Weisel M, et al: Chem Cent J. 2007, 1: 7-10.1186/1752-153X-1-7.

    Article  Google Scholar 

  6. An J, et al: Mol Cell Proteomics. 2005, 4 (6): 752-61. 10.1074/mcp.M400159-MCP200.

    Article  CAS  Google Scholar 

  7. Wang R, et al: J Med Chem. 2004, 47 (12): 2977-80. 10.1021/jm030580l.

    Article  CAS  Google Scholar 

  8. Kellenberger E, et al: J Chem Inf Model. 2006, 46 (2): 717-27. 10.1021/ci050372x.

    Article  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations


Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License ( ), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

Volkamer, A., Grombacher, T. & Rarey, M. Where are the boundaries? Automated pocket detection for druggability studies. J Cheminform 2 (Suppl 1), P11 (2010).

Download citation

  • Published:

  • DOI: