International
Tables for
Crystallography
Volume F
Crystallography of biological macromolecules
Edited by M. G. Rossmann and E. Arnold

International Tables for Crystallography (2006). Vol. F. ch. 22.4, pp. 566-567   | 1 | 2 |

Section 22.4.5.10. Modelling applications that use CSD data

F. H. Allen,a* J. C. Colea and M. L. Verdonka

aCambridge Crystallographic Data Centre, 12 Union Road, Cambridge CB2 1EZ, England
Correspondence e-mail:  allen@ccdc.cam.ac.uk

22.4.5.10. Modelling applications that use CSD data

| top | pdf |

Predicting binding modes of ligands at protein binding sites is a problem of paramount importance in drug design. One approach to this problem is to attempt to dock the ligand directly into the binding site. There are several protein–ligand docking programs available, e.g. DOCK (see Kuntz et al., 1994[link]), GRID (Goodford, 1985[link]), FLExX and FLExS (Rarey et al., 1996[link]; Lemmen & Lengauer, 1997[link]), and GOLD (Jones et al., 1995[link], 1997[link]). The docking program GOLD, developed by the University of Sheffield, Glaxo Wellcome and the CCDC, and which has the high docking success rate of 73%, uses a small torsion library, based on the data from the CSD, to explore the conformational space of the ligand. Its hydrogen-bond geometries and fitness functions are also partly based on CSD data. In the future, we intend to create a more direct link between the crystallographic data and the docking program, via IsoStar and the developing torsion library.

Another approach to the prediction of binding modes is to calculate the energy fields for different probes at each position of the binding site, for instance using the GRID program (Goodford, 1985[link]). The resulting maps can be displayed as contoured surfaces which can assist in the prediction and understanding of binding modes of ligands. CCDC is developing a program called SuperStar (Verdonk et al., 1999[link]) which uses a similar approach to that of the X-SITE program (Singh et al., 1991[link]; Laskowski et al., 1996[link]). However, SuperStar uses non-bonded interaction data from the CSD rather than the protein side chain–side chain interaction data employed in X-SITE. Thus, for a given binding site and contact group (probe), SuperStar selects the appropriate scatter plots from the IsoStar library, superimposes the scatter plots on the relevant functional groups in the binding site, and transforms them into one composite probability map. Such maps can then, for example, be used to predict where certain functional groups are likely to interact with the binding site. The strength of SuperStar is that it is based entirely on experimental data (although this is also the cause of some limitations). The fields simply represent what has been observed in crystal strucures. We are currently verifying SuperStar on a test set of more than 100 protein–ligand complexes from the PDB and preliminary results are encouraging.

Finally, CSD data are used in several de novo design programs. These types of programs, e.g. LUDI (Böhm, 1992a[link],b[link]), predict novel ligands that will interact favourably with a given protein and use hydrogen-bond geometries from the CSD (indirectly) to position their structural fragments in the binding site.

References

First citation Böhm, H.-J. (1992a). The computer program LUDI: a new method for the de novo design of enzyme inhibitors. J. Comput.-Aided Mol. Des. 6, 61–78.Google Scholar
First citation Böhm, H.-J. (1992b). LUDI: rule-based automatic design of new substituents for enzyme inhibitors. J. Comput.-Aided Mol. Des. 6, 593–606.Google Scholar
First citation Goodford, P. J. (1985). A computational procedure for determining energetically favourable binding sites on biologically important molecules. J. Med. Chem. 28, 849–857.Google Scholar
First citation Jones, G., Willett, P. & Glen, R. C. (1995). Molecular recognition of receptor sites using a genetic algorithm with a description of solvation. J. Mol. Biol. 245, 43–53.Google Scholar
First citation Jones, G., Willett, P., Glen, R. C., Leach, A. R. & Taylor, R. (1997). Development and validation of a genetic algorithm for flexible docking. J. Mol. Biol. 267, 727–748.Google Scholar
First citation Kuntz, I. D., Meng, E. C. & Stoichet, B. K. (1994). Structure-based molecular design. Acc. Chem. Res. 27, 117–122.Google Scholar
First citation Laskowski, R. A., Thornton, J. M., Humblet, C. & Singh, J. (1996). X-SITE: use of empirically derived atomic packing preferences to identify favourable interaction regions in the binding sites of proteins. J. Mol. Biol. 259, 175–201.Google Scholar
First citation Lemmen, C. & Lengauer, T. (1997). Time-efficient flexible superposition of medium sized molecules. J. Comput.-Aided Mol. Des. 11, 357–368.Google Scholar
First citation Rarey, M., Kramer, B., Lengauer, T. & Klebe, G. (1996). Predicting receptor-ligand interactions by an incremental construction algorithm. J. Mol. Biol. 261, 470–481.Google Scholar
First citation Singh, J., Saldanha, J. & Thornton, J. M. (1991). A novel method for the modelling of peptide ligands to their receptors. Protein Eng. 4, 251–261.Google Scholar
First citation Verdonk, M. L., Cole, J. C. & Taylor, R. (1999). SuperStar: a knowledge-based approach for identifying interaction sites in proteins. J. Mol. Biol. 289, 1093–1108.Google Scholar








































to end of page
to top of page