International
Tables for
Crystallography
Volume F
Crystallography of biological macromolecules
Edited by M. G. Rossmann and E. Arnold

International Tables for Crystallography (2006). Vol. F. ch. 21.1, p. 497   | 1 | 2 |

Section 21.1.1. Introduction

G. J. Kleywegta*

aDepartment of Cell and Molecular Biology, Uppsala University, Biomedical Centre, Box 596, SE-751 24 Uppsala, Sweden
Correspondence e-mail: gerard@xray.bmc.uu.se

21.1.1. Introduction

| top | pdf |

Owing to the limited resolution and imperfect phase information that macromolecular crystallographers usually have to deal with, building and refining a protein model based on crystallographic data is not an exact science. Rather, it is a subjective process, governed by experience, prejudices, expectations and local practices (Brändén & Jones, 1990[link]; Kleywegt & Jones, 1995b[link], 1997[link]). This means that errors in this process are almost unavoidable, but it is the crystallographer's task to remove as many of these as possible prior to analysis, publication and deposition of the structure. With high-resolution data and good phases, the resulting model is probably more than 95% a consequence of the data, although even at atomic resolution, subjective choices must still be made: which refinement program to use, whether to include alternative conformations, whether to model explicit H atoms, how to model temperature factors, which restraints and constraints to apply, which peaks in the maps to interpret as solvent molecules and how to treat noncrystallographic symmetry (NCS). Once the resolution becomes worse than ∼2 Å, this balance shifts and some published protein models appear to have been determined more by some crystallographer's imagination than by any experimental data.

Subjectivity is not necessarily a problem, provided that the crystallographer is experienced, knows what he or she is doing and is aware of the limitations that the experimental data impose on the model. However, even inexperienced people can avoid many of the pitfalls of model building and refinement. Supervisors have a major responsibility in this respect: education is an important factor (Dodson et al., 1996[link]). Students who have built and refined a previously determined structure from scratch as a training exercise will have met most of the problems that can be encountered in real life (Jones & Kjeldgaard, 1997[link]). Apart from hands-on experience, there are many other methods to reduce or avoid errors. These include (1) the use of information derived from databases of well refined structures in model building (Kleywegt & Jones, 1998[link]) [e.g. to generate main-chain coordinates from a Cα trace (Jones & Thirup, 1986[link]) and side-chain coordinates from preferred rotamer conformations (Ponder & Richards, 1987[link])]; (2) the use of various sorts of local quality checks (to detect residues that for one or more reasons are deemed `unusual' and that require further scrutiny and perhaps adjustment; Kleywegt & Jones, 1996a[link], 1997[link]); and (3) the use of global quality indicators [e.g. the use of the free R value (Brünger, 1992a[link], 1993[link]) to signal major errors, to prevent over-fitting, and to monitor the progress of the rebuilding and refinement process (Kleywegt & Jones, 1995b[link]; Kleywegt & Brünger, 1996[link]; Brünger, 1997[link])].

References

First citation Brändén, C.-I. & Jones, T. A. (1990). Between objectivity and subjectivity. Nature (London), 343, 687–689.Google Scholar
First citation Brünger, A. T. (1992a). Free R value: a novel statistical quantity for assessing the accuracy of crystal structures. Nature (London), 355, 472–475.Google Scholar
First citation Brünger, A. T. (1993). Assessment of phase accuracy by cross validation: the free R value. Methods and applications. Acta Cryst. D49, 24–36.Google Scholar
First citation Brünger, A. T. (1997). The free R value: a more objective statistic for crystallography. Methods Enzymol. 277, 366–396.Google Scholar
First citation Dodson, E., Kleywegt, G. J. & Wilson, K. S. (1996). Report of a workshop on the use of statistical validators in protein X-ray crystallography. Acta Cryst. D52, 228–234.Google Scholar
First citation Jones, T. A. & Kjeldgaard, M. (1997). Electron density map interpretation. Methods Enzymol. 277, 173–208.Google Scholar
First citation Jones, T. A. & Thirup, S. (1986). Using known substructures in protein model building and crystallography. EMBO J. 5, 819–822.Google Scholar
First citation Kleywegt, G. J. & Brünger, A. T. (1996). Checking your imagination: applications of the free R value. Structure, 4, 897–904.Google Scholar
First citation Kleywegt, G. J. & Jones, T. A. (1995b). Where freedom is given, liberties are taken. Structure, 3, 535–540.Google Scholar
First citation Kleywegt, G. J. & Jones, T. A. (1996a). Efficient rebuilding of protein structures. Acta Cryst. D52, 829–832.Google Scholar
First citation Kleywegt, G. J. & Jones, T. A. (1997). Model-building and refinement practice. Methods Enzymol. 277, 208–230.Google Scholar
First citation Kleywegt, G. J. & Jones, T. A. (1998). Databases in protein crystallography. Acta Cryst. D54, 1119–1131.Google Scholar
First citation Ponder, J. W. & Richards, F. M. (1987). Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. J. Mol. Biol. 193, 775–791.Google Scholar








































to end of page
to top of page