Coordinate uncertainty

Cruickshank, D. W. J.

doi:10.1107/97809553602060000697

International Tables for Crystallography
Volume F
Crystallography of biological macromolecules
Edited by M. G. Rossmann and E. Arnold

International Tables for Crystallography (2006). Vol. F, ch. 18.5, pp. 403-418
https://doi.org/10.1107/97809553602060000697

Chapter 18.5. Coordinate uncertainty

D. W. J. Cruickshank^a ^*^‡

^a Chemistry Department, UMIST, Manchester M60 1QD, England
Correspondence e-mail: dwj_cruickshank@email.msn.com

Full-matrix least-squares is taken as the basis for an examination of protein-structure precision. A two-atom model is used to compare the precisions of unrestrained and restrained refinements. In this model, restrained refinement determines a bond length which is the weighted mean of the unrestrained diffraction-only length and the geometric-dictionary length. As a protein example, data with 0.94 Å resolution for concanavalin A are used in unrestrained and restrained full-matrix inversions to provide e.s.d.'s σ(r) for positions and σ(l) for bond lengths. σ(r) is as small as 0.01 Å for atoms with low Debye B values but increases strongly with B. The results emphasize the distinction between unrestrained and restrained refinements and also between σ(r) and σ(l). An unrestrained full-matrix inversion for an immunoglobulin with 1.7 Å data is also discussed. Several approximate methods are examined critically. These include Luzzati plots and the diffraction-component precision index (DPI). The DPI estimate of σ(r, B_avg) is given by a simple formula, which uses R or R_free and is based on a very rough approximation to the least-squares method. Examples show its usefulness as a precision comparator for high- and low-resolution structures.

Keywords: R factors; R_free; accuracy; atomic displacement parameters; block-matrix approximation; concanavalin A; coordinate uncertainty; DPI; diffraction-component precision index; errors; free R factor; full-matrix inversion; goodness of fit; least-squares methods; low-resolution structures; Luzzati plot; modified Fourier method for estimating coordinate uncertainty; normal equations; position error; precision; refinement; residual function; restrained full-matrix inversion for concanavalin A; restrained refinement; restraints; temperature factors; unrestrained full-matrix inversion; weighting.

18.5.1. Introduction

18.5.1.1. Background

Even in 1967 when the first few protein structures had been solved, it would have been hard to imagine a time when the best protein structures would be determined with a precision approaching that of small molecules. That time was reached during the 1990s. Consequently, the methods for the assessment of the precision of small molecules can be extended to good-quality protein structures.

The key idea is simply stated. At the conclusion and full convergence of a least-squares or equivalent refinement, the estimated variances and covariances of the parameters may be obtained through the inversion of the least-squares full matrix.

The inversion of the full matrix for a large protein is a gigantic computational task, but it is being accomplished in a rising number of cases. Alternatively, approximations may be sought. Often these can be no more than rough order-of-magnitude estimates. Some of these approximations are considered below.

Caveat. Quite apart from their large numbers of atoms, protein structures show features differing from those of well ordered small-molecule structures. Protein crystals contain large amounts of solvent, much of it not well ordered. Parts of the protein chain may be floppy or disordered. All natural protein crystals are noncentrosymmetric, hence the simplifications of error assessment for centrosymmetric structures are inapplicable. The effects of incomplete modelling of disorder on phase angles, and thus on parameter errors, are not addressed explicitly in the following analysis. Nor does this analysis address the quite different problem of possible gross errors or misplacements in a structure, other than by their indication through high B values or high coordinate standard uncertainties. These various difficulties are, of course, reflected in the values of $[\Delta |F|]$ used in the precision estimates.

On the problems of structure validation see Part 21 of this volume and Dodson (1998).

Some structure determinations do make a first-order correction for the effects of disordered solvent on phase angles by application of Babinet's principle of complementarity (Langridge et al., 1960; Moews & Kretsinger, 1975; Tronrud, 1997). Babinet's principle follows from the fact that if $[\rho ({\bf x})]$ is constant throughout the cell, then $[F({\bf h}) = 0]$, except for F(0). Consequently, if the cell is divided into two regions C and D, $[F_{C}({\bf h}) = -F_{D}({\bf h})]$ for $[{\bf h} \neq 0]$. Thus if D is a region of disordered solvent, $[F_{D}({\bf h})]$ can be estimated from $[-F_{C}({\bf h})]$. A first approximation to a disordered model may be obtained by placing negative point-atoms with very high Debye B values at all the ordered sites in region C. This procedure provides some correction for very low resolution planes. Alternatively, corrections are sometimes made by a mask bulk solvent model (Jiang & Brünger, 1994).
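Babinet's principle can be checked numerically on a toy example. The sketch below is an illustration only: it assumes a hypothetical one-dimensional cell sampled on 16 grid points with uniform unit density, split arbitrarily into an ordered region C and a solvent region D, and uses a plain discrete Fourier sum in place of a real structure-factor calculation.

```python
import cmath

def structure_factor(density, h):
    """Discrete 1D Fourier coefficient F(h) of a sampled density."""
    n = len(density)
    return sum(rho * cmath.exp(2j * cmath.pi * h * k / n)
               for k, rho in enumerate(density))

# Hypothetical cell: 16 grid points, uniform density 1.0 overall.
# Region C (ordered part) covers points 0-5; region D (solvent) is the rest.
n_grid = 16
region_c = [1.0 if k < 6 else 0.0 for k in range(n_grid)]
region_d = [1.0 - rho for rho in region_c]

for h in range(1, 4):
    f_c = structure_factor(region_c, h)
    f_d = structure_factor(region_d, h)
    # The sum vanishes (to rounding error), i.e. F_C(h) = -F_D(h) for h != 0.
    print(h, abs(f_c + f_d))
```

Only F(0) escapes the cancellation, since it simply counts the total density in the cell.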

The application of restraints in protein refinement does not affect the key idea about the method of error estimation. A simple model for restrained refinement is analysed in Section 18.5.3, and the effect of restraints is discussed in Section 18.5.4 and later.

Much of the material in this chapter is drawn from a Topical Review published in Acta Crystallographica, Section D (Cruickshank, 1999).

Protein structures exhibiting noncrystallographic symmetry are not considered in this chapter.

18.5.1.2. Accuracy and precision

A distinction should be made between the terms accuracy and precision. A single measurement of the magnitude of a quantity differs by error from its unknown true value λ. In statistical theory (Cruickshank, 1959 ), the fundamental supposition made about errors is that, for a given experimental procedure, the possible results of an experiment define the probability density function f(x) of a random variable. Both the true value λ and the probability density f(x) are unknown. The problem of assessing the accuracy of a measurement is thus the double problem of estimating f(x) and of assuming a relation between f(x) and λ.

Precision relates to the function f(x) and its spread.

The problem of what relationship to assume between f(x) and the true value λ is more subtle, involving particularly the question of systematic errors. The usual procedure, after correcting for known systematic errors, is to suppose that some typical property of f(x), often the mean, is the value of λ. No repetition of the same experiment will ever reveal the systematic errors, so statistical estimates of precision take into account only random errors. Empirically, systematic errors can be detected only by remeasuring the quantity with a different technique.

Care is needed in reading older papers. The word accuracy was sometimes intended to cover both random and systematic errors, or it may cover only random errors in the above sense of precision (known systematic errors having been corrected).

In recent years, the well established term estimated standard deviation (e.s.d.) has been replaced by the term standard uncertainty (s.u.). (See Section 18.5.2.3 on statistical descriptors.)

18.5.1.3. Effect of atomic displacement parameters (or `temperature factors')

It is useful to begin with a reminder that the Debye $[B = 8\pi^{2} \langle u^{2}\rangle]$ , where u is the atomic displacement parameter. If B = 80 Å², the r.m.s. amplitude is 1.01 Å. The centroid of an atom with such a B is unlikely to be precisely determined. For B = 40 Å², the 0.71 Å r.m.s. amplitude of an atom is approximately half a C—N bond length. For B = 20 Å², the amplitude is 0.50 Å. Even for B = 5 Å², the amplitude is 0.25 Å. The size of the atomic displacement amplitudes should always be borne in mind when considering the precision of the position of the centroid of an atom.
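The amplitudes quoted above follow directly from $[B = 8\pi^{2} \langle u^{2}\rangle]$. A minimal sketch of the conversion (the function name is an illustrative choice, not taken from any refinement program):

```python
import math

def rms_amplitude(b_factor):
    """R.m.s. displacement amplitude (in Å) for a Debye B (in Å^2),
    from B = 8 * pi^2 * <u^2>."""
    return math.sqrt(b_factor / (8.0 * math.pi ** 2))

for b in (80, 40, 20, 5):
    print(f"B = {b:2d} Å^2 -> r.m.s. amplitude = {rms_amplitude(b):.2f} Å")
```

Rounded to two decimals, this gives 1.01, 0.71, 0.50 and 0.25 Å for B = 80, 40, 20 and 5 Å², respectively.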

Scattering power depends on $[\exp [-2B(\sin \theta /\lambda)^{2}] = \exp [-B/(2d^{2})]]$ . For B = 20 Å² and d = 4, 2 or 1 Å, this factor is 0.54, 0.08 or 0.0001. For d = 2 Å and B = 5, 20 or 80 Å², the factor is again 0.54, 0.08 or 0.0001. The scattering power of an atom thus depends very strongly on B and on the resolution $[d = 1/s = \lambda /2 \sin \theta]$ . Scattering at high resolution (low d) is dominated by atoms with low B.
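The attenuation factor $[\exp [-B/(2d^{2})]]$ is easily tabulated. The sketch below simply evaluates that expression; the function name is an illustrative choice.

```python
import math

def attenuation(b_factor, d_spacing):
    """Factor exp[-B / (2 d^2)] by which B (Å^2) reduces scattering power
    at resolution d (Å)."""
    return math.exp(-b_factor / (2.0 * d_spacing ** 2))

# Fixed B = 20 Å^2, varying resolution.
for d in (4.0, 2.0, 1.0):
    print(f"B = 20, d = {d:.0f} Å: {attenuation(20.0, d):.5f}")

# Fixed d = 2 Å, varying B: the same exponents -0.625, -2.5 and -10 recur.
for b in (5.0, 20.0, 80.0):
    print(f"B = {b:.0f}, d = 2 Å: {attenuation(b, 2.0):.5f}")
```

The two series coincide because the factor depends only on the ratio $[B/d^{2}]$. The smallest value is exp(−10) ≈ 4.5 × 10⁻⁵, i.e. of order 10⁻⁴.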

An immediate consequence of the strong dependence of scattering power on B is that the standard uncertainties of atomic coordinates also depend very strongly on B, especially between atoms of different B within the same structure.

[An IUCr Subcommittee on Atomic Displacement Parameter Nomenclature (Trueblood et al., 1996 ) has recommended that the phrase `temperature factor', though widely used in the past, should be avoided on account of several ambiguities in its meaning and usage. The Subcommittee also discourages the use of B and the anisotropic tensor B in favour of $[\langle u^{2} \rangle]$ and U, on the grounds that the latter have a more direct physical significance. The present author concurs (Cruickshank, 1956 , 1965 ). However, as the use of B or $[B_{\rm eq}]$ is currently so widespread in biomolecular crystallography, this chapter has been written in terms of B.]

18.5.2. The least-squares method

18.5.2.1. The normal equations

In the unrestrained least-squares method, the residual $[R = \textstyle\sum\limits_{3}\displaystyle w(hkl)\Delta^{2} (hkl) \eqno(18.5.2.1)]$ is minimized, where Δ is either $[|F_{o}| - |F_{c}|]$ for $[R_{1}]$ or $[|F_{o}|^{2} - |F_{c}|^{2}]$ for $[R_{2}]$ , and w(hkl) is chosen appropriately. The summation is over crystallographically independent planes.

When R is a minimum with respect to the parameter $[u_{j}]$, $[\partial R/\partial u_{j} = 0]$, i.e., $[\textstyle\sum\limits_{3}\displaystyle w\Delta (\partial \Delta / \partial u_{j}) = 0. \eqno(18.5.2.2)]$ For $[R_{1}]$, $[\partial \Delta / \partial u_{j} = -\partial |F_{c}|/\partial u_{j}]$; for $[R_{2}]$, $[\partial \Delta / \partial u_{j} = -2|F_{c}|\partial |F_{c}|/ \partial u_{j}]$. The n parameters have to be varied until the n conditions (18.5.2.2) are satisfied. For a trial set of the $[u_{j}]$ close to the correct values, we may expand Δ as a function of the parameters by a Taylor series to the first order. Thus for $[R_{1}]$, $[\Delta ({\bf u} + {\bf e}) = \Delta ({\bf u}) - \textstyle\sum\limits_{i}\displaystyle \varepsilon_{i} (\partial |F_{c}|/ \partial u_{i}), \eqno(18.5.2.3)]$ where $[\varepsilon_{i}]$ is a small change in the parameter $[u_{i}]$, and u and e represent the whole sets of parameters and changes. The minus sign occurs before the summation, since $[\Delta = |F_{o}| - |F_{c}|]$, and the changes in $[|F_{c}|]$ are being considered.

Substituting (18.5.2.3) in (18.5.2.2), we get the normal equations for $[R_{1}]$, $[\textstyle\sum\limits_{i}\displaystyle \varepsilon_{i} \left[\textstyle\sum\limits_{3}\displaystyle w(\partial |F_{c}|/ \partial u_{i}) (\partial |F_{c}|/ \partial u_{j})\right] = \textstyle\sum\limits_{3}\displaystyle w\Delta (\partial |F_{c}|/ \partial u_{j}). \eqno(18.5.2.4)]$ There are n of these equations for $[j = 1,\ldots, n]$ to determine the n unknown $[\varepsilon_{j}]$.

For $[R_{2}]$ the normal equations are $[\textstyle\sum\limits_{i}\displaystyle \varepsilon_{i} \left[\textstyle\sum\limits_{3}\displaystyle w(\partial |F_{c}|^{2} / \partial u_{i}) (\partial |F_{c}|^{2} / \partial u_{j})\right] = \textstyle\sum\limits_{3}\displaystyle w\Delta (\partial |F_{c}|^{2} / \partial u_{j}). \eqno(18.5.2.5)]$ Both forms of the normal equations can be abbreviated to $[\textstyle\sum\limits_{i}\displaystyle \varepsilon_{i} a_{ij} = b_{j}. \eqno(18.5.2.6)]$

For the values of $[\partial |F_{c}| / \partial u_{j}]$ for common parameters see, e.g., Cruickshank (1970).

Some important points in the derivation of the standard uncertainties of the refined parameters can be most easily understood if we suppose that the matrix $[a_{ij}]$ can be approximated by its diagonal elements. Each parameter is then determined by a single equation of the form $[\varepsilon_{i} \textstyle\sum\limits_{3}\displaystyle wg^{2} = \textstyle\sum\limits_{3}\displaystyle wg\Delta, \eqno(18.5.2.7)]$ where $[g = \partial |F_{c}| / \partial u_{i}]$ or $[\partial |F_{c}|^{2} / \partial u_{i}]$ . Hence $[\varepsilon_{i} = \left(\textstyle\sum\limits_{3}\displaystyle wg\Delta \right)\bigg/ \left(\textstyle\sum\limits_{3}\displaystyle wg^{2}\right). \eqno(18.5.2.8)]$ At the conclusion of the refinement, when R is a minimum, the variance (square of the s.u.) of the parameter $[u_{i}]$ due to uncertainties in the Δ's is $[\sigma_{i}^{2} = \left[\textstyle\sum\limits_{3}\displaystyle w^{2}g^{2}\sigma^{2}(F)\right] \bigg/ \left(\textstyle\sum\limits_{3}\displaystyle wg^{2}\right)^{2}. \eqno(18.5.2.9)]$ If the weights have been chosen as $[w(hkl) = 1 / \sigma^{2}(|F_{hkl}|)]$ or $[1 / \sigma^{2} (|F_{hkl}|^{2})]$ , this simplifies to $[\sigma_{i}^{2} = 1 \bigg/ \left(\textstyle\sum\limits_{3}\displaystyle wg^{2}\right) = 1 / a_{ii}, \eqno(18.5.2.10)]$ which is appropriate for absolute weights. Equation (18.5.2.10) provides an s.u. for a parameter relative to the s.u.'s $[\sigma (|F|)]$ or $[\sigma (|F|^{2})]$ of the observations.

In general, with the full matrix $[a_{ij}]$ in the normal equations, $[\sigma_{i}^{2} = (a^{-1})_{ii}, \eqno(18.5.2.11)]$ where $[(a^{-1})_{ii}]$ is an element of the matrix inverse to $[a_{ij}]$ . The covariance of the parameters $[u_{i}]$ and $[u_{j}]$ is $[\hbox{cov} (i, j) \equiv \sigma_{i}\sigma_{j}\hbox{correl} (i, j) = (a^{-1})_{ij}. ]$
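The difference between the diagonal estimate (18.5.2.10) and the full-matrix estimate (18.5.2.11) can be seen on a two-parameter toy problem. The sketch below assumes a hypothetical 2 × 2 normal matrix in arbitrary units with absolute weights; the numbers are chosen only to make the arithmetic transparent.

```python
import math

def invert_2x2(matrix):
    """Inverse of a 2x2 matrix given as nested lists."""
    (a11, a12), (a21, a22) = matrix
    det = a11 * a22 - a12 * a21
    return [[a22 / det, -a12 / det], [-a21 / det, a11 / det]]

# Hypothetical normal matrix a_ij for two correlated parameters.
normal = [[400.0, 120.0], [120.0, 100.0]]
inverse = invert_2x2(normal)

# Diagonal approximation (18.5.2.10): sigma_i^2 = 1 / a_ii.
sigma_diag = [1.0 / math.sqrt(normal[i][i]) for i in range(2)]
# Full matrix (18.5.2.11): sigma_i^2 = (a^-1)_ii.
sigma_full = [math.sqrt(inverse[i][i]) for i in range(2)]
# Correlation coefficient from cov(i, j) = (a^-1)_ij.
correl = inverse[0][1] / (sigma_full[0] * sigma_full[1])

print(sigma_diag, sigma_full, correl)
```

For this matrix the diagonal estimates are 0.05 and 0.1, the full-matrix estimates are 0.0625 and 0.125, and the correlation coefficient is −0.6. Neglecting the off-diagonal terms therefore understates the standard uncertainties when the parameters are correlated.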

18.5.2.2. Weights

In the early stages of refinement, artificial weights may be chosen to accelerate refinement. In the final stages, the weights must be related to the precision of the structure factors if parameter variances are being sought. There are two distinct ways, covering two ranges of error, in which this may be done.

(1) The weights for $[R_{1}]$ , say, may reflect the precision of the $[|F_{o}|]$ , so that $[w(hkl) = 1 / \sigma^{2} (|F_{hkl}|)]$ , where $[\sigma^{2}]$ is the estimated variance of $[|F_{o}|]$ due to a specific class of experimental uncertainties. These absolute weights are derived from an analysis of the experiment. Weights chosen in this way lead to estimated parameter variances $[\sigma_{i}^{2} = (a^{-1})_{ii}]$ , (18.5.2.11), which cover only the specific class of experimental uncertainties.
(2) The weights may reflect the trends in the $[|\Delta| \equiv \bigl||F_{o}| - |F_{c}|\bigr|]$. A weighting function with a small number of parameters is chosen so that the averages of $[w \Delta^{2}]$ are constant when the set of $[w\Delta^{2}]$ values is analysed in any pertinent fashion (e.g. in bins of increasing $[|F_{o}|]$ and $[2\sin \theta/\lambda]$). Weights chosen in this way are relative weights, and the expression for the parameter variances needs a scaling factor, $[S^{2} = \left(\textstyle\sum\limits_{3}\displaystyle w\Delta^{2}\right) \bigg/ (n_{\rm obs} - n_{\rm params}). \eqno(18.5.2.12)]$ Hence, in the full-matrix case, $[\sigma_{i}^{2} = \left[\left(\textstyle\sum\limits_{3}\displaystyle w\Delta^{2}\right)\bigg / (n_{\rm obs} - n_{\rm params})\right] (a^{-1})_{ii}, \eqno(18.5.2.13)]$ which allows for all random experimental errors, such systematic experimental errors as cannot be simulated in the $[|F_{c}|]$ and imperfections in the calculated model.
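The scaling in method (2) amounts to one extra multiplication once the inverse matrix is available. The following sketch evaluates (18.5.2.12) and (18.5.2.13) for made-up data; the weights, residuals and inverse-matrix diagonal are hypothetical values, and the function name is an illustrative choice.

```python
def scaled_variances(weights, deltas, inverse_diagonal, n_params):
    """Return S^2 of (18.5.2.12) and the variances S^2 * (a^-1)_ii of (18.5.2.13)."""
    n_obs = len(deltas)
    if n_obs <= n_params:
        raise ValueError("S^2 is undefined unless n_obs exceeds n_params")
    s_squared = sum(w * d * d for w, d in zip(weights, deltas)) / (n_obs - n_params)
    return s_squared, [s_squared * v for v in inverse_diagonal]

# Hypothetical data: six reflections, two refined parameters.
weights = [1.0, 1.0, 2.0, 2.0, 1.0, 1.0]
deltas = [0.5, -0.5, 1.0, -1.0, 0.5, -0.5]
inverse_diagonal = [0.004, 0.016]  # (a^-1)_ii from the relative-weight matrix

s_squared, variances = scaled_variances(weights, deltas, inverse_diagonal, n_params=2)
print(s_squared, variances)
```

Here the weighted sum of squared residuals is 5.0 with four degrees of freedom, so S² = 1.25 and each variance is inflated by that factor.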

18.5.2.3. Statistical descriptors and goodness of fit

In recent years, there have been developments and changes in statistical nomenclature and usage. Many aspects are summarised in the reports of the IUCr Subcommittee on Statistical Descriptors in Crystallography (Schwarzenbach et al., 1989 , 1995 ). In the second report, inter alia, the Subcommittee emphasizes the terms uncertainty and standard uncertainty (s.u.). The latter is a replacement for the older term estimated standard deviation (e.s.d.). The Subcommittee classify uncertainty components in two categories, based on their method of evaluation: type A, estimated by the statistical analysis of a series of observations, and type B, estimated otherwise. As an example of the latter, a type B component could allow for doubts concerning the estimated shape and dimensions of the diffracting crystal and the subsequent corrections made for absorption.

The square root S of the expression S², (18.5.2.12) above, is called the goodness of fit when the weights are the reciprocals of the absolute variances of the observations.

One recommendation in the second report does call for comment here. While agreeing that formulae like (18.5.2.13) lead to conservative estimates of parameter variances, the report suggests that this practice is based on the questionable assumption that the variances of the observations by which the weights are assigned are relatively correct but uniformly underestimated. When the goodness of fit $[S\gt1]$ , then either the weights or the model or both are suspect.

Comment is needed. The account in Section 18.5.2.2 describes two distinct ways of estimating parameter variances, covering two ranges of error. The kind of weights envisaged in the reports (based on variances of type A and/or of type B) are of a class described for method (1). They are not the weights to be used in method (2) (though they may be a component in such weights). Method (2) implicitly assumes from the outset that there are experimental errors, some covered and others not covered by method (1), and that there are imperfections in the calculated model (as is obviously true for proteins). Method (2) avoids exploring the relative proportions and details of these error sources and aims to provide a realistic estimate of parameter uncertainties which can be used in external comparisons. It can be formally objected that method (2) does not conform to the criteria of random-variable theory, since clearly the Δ's are partially correlated through the remaining model errors and some systematic experimental errors. But it is a useful procedure. Method (1) on its own would present an optimistic view of the reliability of the overall investigation, the degree of optimism being indicated by the inverse of the goodness of fit (18.5.2.12). In method (2), if the weights are on an arbitrary scale, then $[S^{2}]$ can have an arbitrary value.

For an advanced-level treatment of many aspects of the refinement of structural parameters, see Part 8 of International Tables for Crystallography, Volume C (2004). The detection and treatment of systematic error are discussed in Chapter 8.5 therein.

18.5.3. Restrained refinement

18.5.3.1. Residual function

Protein structures are often refined by a restrained refinement program such as PROLSQ (Hendrickson & Konnert, 1980 ). Here, a function of the type $[R' = \textstyle\sum\displaystyle w_{h} (\Delta F)^{2} + \textstyle\sum\displaystyle w_{\rm geom} (\Delta Q)^{2} \eqno(18.5.3.1)]$ is minimized, where Q denotes a geometrical restraint such as a bond length. Formally, all one is doing is extending the list of observations. One is adding to the protein diffraction data geometrical data from a stereochemical dictionary such as that of Engh & Huber (1991). A chain C—N bond length may be known from the dictionary with much greater precision $[1 / w_{\rm geom}^{1/2}]$ , say 0.02 Å, than from an unrestrained diffraction-data-only protein refinement.

In a high-resolution unrestrained refinement of a small molecule, the standard uncertainty (s.u.) of a bond length A—B is often well approximated by $[\sigma (l) = (\sigma_{A}^{2} + \sigma_{B}^{2})^{1/2}. \eqno(18.5.3.2)]$ However, in a protein determination $[\sigma (l)]$ is often much smaller than either $[\sigma_{A}]$ or $[\sigma_{B}]$ because of the excellent information from the stereochemical dictionary, which correlates the positions of A and B.

Laying aside computational size and complexity, the protein precision problem is straightforward in principle. When a restrained refinement has converged to an acceptable structure and the shifts in successive rounds have become negligible, invert the full matrix. The inverse matrix immediately yields estimates of the variances and covariances of all parameters.

The dimensions of the matrix are the same whether or not the refinement is restrained. The full matrix will be rather sparse, but not nearly as sparse as in a small-molecule refinement. For the purposes of Section 18.5.3, it is irrelevant whether the residual for the diffraction data is based on $[|F|]$ or $[|F|^{2}]$. On the relative weighting of the diffraction and restraint terms, see Section 18.5.3.3.

18.5.3.2. A very simple protein model

Some aspects of restrained refinement are easily understood by considering a one-dimensional protein consisting of two like atoms in the asymmetric unit, with coordinates $[x_{1}]$ and $[x_{2}]$ relative to a fixed origin and bond length $[l = x_{2} - x_{1}]$ . In the refinement, the normal equations are of the type $[{\bf N}\Delta {\bf x} = {\bf e}]$ . For two non-overlapping like atoms, the diffraction data will yield a normal matrix $[{\bf N} = \left(\matrix{a &0\cr 0 &a\cr}\right), \eqno(18.5.3.3)]$ with inverse $[\left(\matrix{1/a &0\cr 0 &1/a\cr}\right), \eqno(18.5.3.4)]$ where $[a = \textstyle\sum w_{h} (\partial |F_{n}| / \partial x_{i})^{2}. \eqno(18.5.3.5)]$

A geometric restraint on the length will yield a normal matrix $[\left(\matrix{b &-b\cr -b &b\cr}\right) \eqno(18.5.3.6)]$ with no inverse, since its determinant is zero, where $[b = w_{\rm geom} (\partial l / \partial x_{i})^{2}. \eqno(18.5.3.7)]$ Note $[\partial l / \partial x_{2} = -\partial l / \partial x_{1} = 1]$, so that $[b = w_{\rm geom} = 1 / \sigma_{\rm geom}^{2} (l), \eqno(18.5.3.8)]$ where $[\sigma_{\rm geom}^{2} (l)]$ is the variance assigned to the length in the stereochemical dictionary.

Combining the diffraction data and the restraint, the normal matrix becomes $[\left(\matrix{a + b &-b\cr -b &a + b\cr}\right), \eqno(18.5.3.9)]$ with inverse $[\{1 / [a(a + 2b)]\} \left(\matrix{a + b &b\cr b &a + b\cr}\right). \eqno(18.5.3.10)]$ For the diffraction data alone, the variance of $[x_{i}]$ is $[\sigma_{\rm diff}^{2}(x_{i}) = 1 / a. \eqno(18.5.3.11)]$ For the diffraction data plus restraint, the variance of $[x_{i}]$ is $[\eqalignno{ \sigma_{\rm res}^{2}(x_{i}) &= (a + b) / [a(a + 2b)] &(18.5.3.12)\cr &\lt \sigma_{\rm diff}^{2}(x_{i}). &\cr}]$ Note that though the restraint says nothing about the position of $[x_{i}]$ , the variance of $[x_{i}]$ has been reduced because of the coupling to the position of the other atom. In the limit when $[a \ll b]$ , $[\sigma_{\rm res}^{2}(x_{i})]$ is only half $[\sigma_{\rm diff}^{2}(x_{i})]$ .

The general formula for the variance of the length $[l = x_{2} - x_{1}]$ is $[\sigma^{2}(l) = \sigma^{2}(x_{2}) - 2\hbox{cov}(x_{2}, x_{1}) + \sigma^{2}(x_{1}). \eqno(18.5.3.13)]$ For the diffraction data alone, this gives $[\sigma_{\rm diff}^{2}(l) = 1 / a + 0 + 1 / a = 2 / a = 2\sigma_{\rm diff}^{2}(x_{i}), \eqno(18.5.3.14)]$ as expected. For the diffraction data plus restraint, $[\eqalignno{ \sigma_{\rm res}^{2}(l) &= [1 / a(a + 2b)] [(a + b) - 2b + (a + b)] &\cr &= 1 / (a / 2 + b) &(18.5.3.15)\cr &\lt\sigma_{\rm diff}^{2}(l). &\cr}]$ For small a, $[\sigma_{\rm res}^{2}(l) \rightarrow 1 / b = \sigma_{\rm geom}^{2}(l)]$ , as expected. The variance of the restrained length, (18.5.3.15), can be re-expressed as $[1 / \sigma_{\rm res}^{2}(l) = 1 / \sigma_{\rm diff}^{2}(l) + 1 / \sigma_{\rm geom}^{2}(l). \eqno(18.5.3.16)]$

For the two-atom protein, it can be proved directly, as one would expect from (18.5.3.16), that restrained refinement determines a length which is the weighted mean of the diffraction-only length and the geometric dictionary length.
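The weighted-mean statement can be illustrated with inverse-variance weights, which is the form implied by (18.5.3.16). The sketch below is an illustration under that assumption; the two bond lengths (1.35 and 1.33 Å) are hypothetical values, not taken from any structure.

```python
def restrained_length(l_diff, sigma_diff_l, l_geom, sigma_geom_l):
    """Inverse-variance weighted mean of the diffraction-only and dictionary
    lengths, with its s.u. from 1/sigma_res^2 = 1/sigma_diff^2 + 1/sigma_geom^2."""
    w_diff = 1.0 / sigma_diff_l ** 2
    w_geom = 1.0 / sigma_geom_l ** 2
    length = (w_diff * l_diff + w_geom * l_geom) / (w_diff + w_geom)
    sigma = (w_diff + w_geom) ** -0.5
    return length, sigma

# Equal s.u.'s: the restrained length lies midway between the two inputs.
print(restrained_length(1.35, 0.02, 1.33, 0.02))

# Poor diffraction information: the result is pulled to the dictionary value.
print(restrained_length(1.35, 0.20, 1.33, 0.02))
```

With equal s.u.'s of 0.02 Å the restrained length is 1.34 Å with an s.u. of 0.02/√2 ≈ 0.014 Å; as the diffraction-only s.u. grows, the result approaches the dictionary length and its s.u. approaches 0.02 Å.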

The centroid has coordinate $[c = (x_{1} + x_{2})/2]$. It is easily found that $[\sigma_{\rm res}^{2}(c) = \sigma_{\rm diff}^{2}(c) = 1/(2a)]$. Thus, as expected, the restraint says nothing about the position of the molecule in the cell.

For numerical illustrations of the s.u.'s in restrained refinement, suppose the stereochemical length restraint has $[\sigma_{\rm geom}(l) = 0.02]$ Å. Equation (18.5.3.16) gives the length s.u. $[\sigma_{\rm res}(l)]$ in restrained refinement. If the diffraction-only $[\sigma_{\rm diff}(x_{i}) = 0.01]$ Å, the restrained $[\sigma_{\rm res}(l)]$ is 0.012 Å. If $[\sigma_{\rm diff}(x_{i}) = 0.05]$ Å, $[\sigma_{\rm res}(l)]$ is 0.019 Å. However large $[\sigma_{\rm diff}(x_{i})]$ , $[\sigma_{\rm res}(l)]$ never exceeds 0.02 Å.

Equation (18.5.3.12) gives the position s.u. $[\sigma_{\rm res}(x_{i})]$ in restrained refinement. If the diffraction-only $[\sigma_{\rm diff}(x_{i}) = 0.01]$ Å, the restrained $[\sigma_{\rm res}(x_{i})]$ is 0.009 Å. If $[\sigma_{\rm diff}(x_{i}) = 0.05]$ Å, $[\sigma_{\rm res}(x_{i}) = 0.037]$ Å. For large $[\sigma_{\rm diff}(x_{i})]$ , $[\sigma_{\rm res}(x_{i})]$ tends to $[\sigma_{\rm diff}(x_{i})/(2)^{1/2}]$ as the strong restraint couples the two atoms together. For very small $[\sigma_{\rm diff}(x_{i})]$ , the relatively weak restraint has no effect.
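The numerical illustrations above can be reproduced from (18.5.3.12) and (18.5.3.16) with $[a = 1/\sigma_{\rm diff}^{2}(x_{i})]$ and $[b = 1/\sigma_{\rm geom}^{2}(l)]$. A minimal sketch (function names are illustrative choices):

```python
import math

def sigma_res_length(sigma_diff_x, sigma_geom=0.02):
    """Restrained bond-length s.u. from (18.5.3.16),
    using sigma_diff^2(l) = 2 * sigma_diff^2(x_i) from (18.5.3.14)."""
    var_diff_l = 2.0 * sigma_diff_x ** 2
    return 1.0 / math.sqrt(1.0 / var_diff_l + 1.0 / sigma_geom ** 2)

def sigma_res_position(sigma_diff_x, sigma_geom=0.02):
    """Restrained position s.u. from (18.5.3.12)."""
    a = 1.0 / sigma_diff_x ** 2
    b = 1.0 / sigma_geom ** 2
    return math.sqrt((a + b) / (a * (a + 2.0 * b)))

for s in (0.01, 0.05):
    print(f"sigma_diff(x) = {s:.2f} Å: "
          f"sigma_res(l) = {sigma_res_length(s):.3f} Å, "
          f"sigma_res(x) = {sigma_res_position(s):.3f} Å")
```

This returns 0.012 and 0.009 Å for a diffraction-only position s.u. of 0.01 Å, and 0.019 and 0.037 Å for 0.05 Å, in agreement with the values quoted.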

18.5.3.3. Relative weighting of diffraction and restraint terms

When only relative diffraction weights are known, as in equation (18.5.2.13), it has been common (Rollett, 1970) to scale the geometric restraint terms against the diffraction terms by replacing the restraint weights $[w_{\rm geom} = 1/\sigma_{\rm geom}^{2}]$ by $[w_{\rm geom} = S^{2}/\sigma_{\rm geom}^{2}]$ , where $[S^{2} = (\sum w_{h} \Delta_{h}^{2})/(n_{\rm obs} - n_{\rm params})]$ . However, this scheme cannot be used for low-resolution structures if $[n_{\rm obs} \lt n_{\rm params}]$ .

The treatment by Tickle et al. (1998a) shows that the reduction $[n_{\rm params}]$ in the number of degrees of freedom has to be distributed among all the data, both diffraction observations and restraints. Since the geometric restraint weights are on an absolute scale (Å⁻²), they propose that the (absolute) scale of the diffraction weights should be determined by adjustment until the restrained residual R′ (18.5.3.1) is equal to its expected value $[(n_{\rm obs} + n_{\rm restraints} - n_{\rm params})]$ .

For a method of determining the scale of the diffraction weights based on $[R_{\rm free}]$ , see Brünger (1993).

The geometric restraint weights were classified by the IUCr Subcommittee (Schwarzenbach et al., 1995) as derived from observations supplementary to the diffraction data, with uncertainties of type B (Section 18.5.2.3).

18.5.4. Two examples of full-matrix inversion

18.5.4.1. Unrestrained and restrained inversions for concanavalin A

G. M. Sheldrick extended his SHELXL96 program (Sheldrick & Schneider, 1997) to provide extra information about protein precision through the inversion of least-squares full matrices. His programs have been used by Deacon et al. (1997) for the high-resolution refinement of native concanavalin A with 237 residues, using data at 110 K to 0.94 Å refined anisotropically. After the convergence and completion of full-matrix restrained refinement for the structure, the unrestrained full matrix (coordinates only) was computed and then inverted in a massive calculation. This led to s.u.'s $[\sigma (x)]$, $[\sigma (y)]$, $[\sigma (z)]$ and $[\sigma (r)]$ for all atoms, and to $[\sigma (l)]$ and $[\sigma (\theta)]$ for all bond lengths and angles. $[\sigma (r)]$ is defined as $[[\sigma^{2}(x) + \sigma^{2}(y) + \sigma^{2}(z)]^{1/2}]$. For concanavalin A the restrained full matrix was also inverted, thus allowing the comparison of restrained and unrestrained s.u.'s.

The results for concanavalin A from the inversion of the coordinate matrices of order 6402 (= 2134 × 3) are plotted in Figs. 18.5.4.1 and 18.5.4.2 . Fig. 18.5.4.1 shows $[\sigma (r)]$ versus $[B_{\rm eq}]$ for the fully occupied atoms of the protein (a few atoms with B > 60 Å² are off-scale). The points are colour-coded black for carbon, blue for nitrogen and red for oxygen. Fig. 18.5.4.1(a) shows the restrained results, and Fig. 18.5.4.1(b) shows the unrestrained diffraction-data-only results. Superposed on both sets of data points are least-squares quadratic fits determined with weights $[1/B^{2}]$ . At high B, the unrestrained $[\sigma_{\rm diff}(r)]$ can be at least double the restrained $[\sigma_{\rm res}(r)]$ , e.g., for carbon at B = 50 Å², the unrestrained $[\sigma_{\rm diff}(r)]$ is about 0.25 Å, whereas the restrained $[\sigma_{\rm res}(r)]$ is about 0.11 Å. For B < 10 Å², both $[\sigma (r)]$ 's fall below 0.02 Å and are around 0.01 Å at B = 6 Å².

Figure 18.5.4.1

Plots of $[\sigma (r)]$ versus $[B_{\rm eq}]$ for concanavalin A with 0.94 Å data, (a) restrained full-matrix $[\sigma_{\rm res}(r)]$ , (b) unrestrained full-matrix $[\sigma_{\rm diff}(r)]$ . Carbon black, nitrogen blue, oxygen red.

Figure 18.5.4.2

Plots of $[\sigma (l)]$ versus average $[B_{\rm eq}]$ for concanavalin A with 0.94 Å data, (a) restrained full-matrix $[\sigma_{\rm res}(l)]$ , (b) unrestrained full-matrix $[\sigma_{\rm diff}(l)]$ . C—C black, C—N blue, C—O red.

For B < 10 Å², the better precision of oxygen as compared with nitrogen, and of nitrogen as compared with carbon, can be clearly seen. At the lowest B, the unrestrained $[\sigma_{\rm diff}(r)]$ in Fig. 18.5.4.1(b) are almost as small as the restrained $[\sigma_{\rm res}(r)]$ in Fig. 18.5.4.1(a). [The quadratic fits of the restrained results in Fig. 18.5.4.1(a) are evidently slightly imperfect in making $[\sigma_{\rm res}(r)]$ tend almost to 0 as B tends to 0.]

Fig. 18.5.4.2 shows $[\sigma (l)]$ versus $[B_{\rm eq}]$ for the bond lengths in the protein. The points are colour-coded black for C—C, blue for C—N and red for C—O. The restrained and unrestrained distributions are very different for high B. The restrained distribution in Fig. 18.5.4.2(a) tends to about 0.02 Å, which is the standard uncertainty of the applied restraint for 1–2 bond lengths, whereas the unrestrained distribution in Fig. 18.5.4.2(b) goes off the scale of the diagram. But for B < 10 Å², both distributions fall to around 0.01 Å.

The differences between the restrained and unrestrained $[\sigma (r)]$ and $[\sigma (l)]$ can be understood through the two-atom model for restrained refinement described in Section 18.5.3 . For that model, the equation $[1 / \sigma_{\rm res}^{2} (l) = 1 / \sigma_{\rm diff}^{2} (l) + 1 / \sigma_{\rm geom}^{2} (l) \eqno(18.5.3.16)]$ relates the bond-length s.u. in the restrained refinement, $[\sigma_{\rm res}(l)]$ , to the $[\sigma_{\rm diff}(l)]$ of the unrestrained refinement and the s.u. $[\sigma_{\rm geom}(l)]$ assigned to the length in the stereochemical dictionary. In the refinements, $[\sigma_{\rm geom}(l)]$ was 0.02 Å for all bond lengths. When this is combined in (18.5.3.16) with the unrestrained $[\sigma_{\rm diff}(l)]$ of any bond, the predicted restrained $[\sigma_{\rm res}(l)]$ is close to that found in the restrained full matrix.

It can be seen from Fig. 18.5.4.2(b) that many bond lengths with average B < 10 Å² have $[\sigma_{\rm diff}(l)\lt 0.014]$ Å. For these bonds the diffraction data have greater weight than the stereochemical dictionary. Some bonds have $[\sigma_{\rm diff}(l)]$ as low as 0.0080 Å, with $[\sigma_{\rm res}(l)]$ around 0.0074 Å. This situation is one consequence of the availability of diffraction data to the high resolution of 0.94 Å. For large $[\sigma_{\rm diff}(l)]$ (i.e., high B), equation (18.5.3.16) predicts that $[\sigma_{\rm res}(l) = \sigma_{\rm geom}(l) = 0.02]$ Å, as is found in Fig. 18.5.4.2(a).

In an isotropic approximation, $[\sigma (r) = 3^{1/2}\sigma (x)]$. Equation (18.5.3.12) of the two-atom model can be recast to give $[\sigma_{\rm res}^{2} (r) = \sigma_{\rm diff}^{2} (r) \left\{\left[\sigma_{\rm diff}^{2} (r) + 3(0.02)^{2}\right]\bigg/\left[2\sigma_{\rm diff}^{2} (r) + 3(0.02)^{2}\right]\right\}. \eqno(18.5.4.1)]$ For low B, say $[B \leq 15\ \hbox{\AA}^{2}]$ in concanavalin, (18.5.4.1) gives quite good predictions of $[\sigma_{\rm res}(r)]$ from $[\sigma_{\rm diff}(r)]$. For instance, for a carbon atom with B = 15 Å², the quadratic curve for carbon in Fig. 18.5.4.1(b) shows $[\sigma_{\rm diff}(r) = 0.034]$ Å, and Fig. 18.5.4.1(a) shows $[\sigma_{\rm res}(r) = 0.029]$ Å. If $[\sigma_{\rm diff}(r) = 0.034]$ Å is used with (18.5.4.1), the resulting prediction for $[\sigma_{\rm res}(r)]$ is 0.028 Å, in close agreement.

However, for high B, say B = 50 Å², the quadratic curve for carbon in Fig. 18.5.4.1(b) shows $[\sigma_{\rm diff}(r) = 0.25]$ Å, and Fig. 18.5.4.1(a) shows $[\sigma_{\rm res}(r) = 0.11]$ Å, whereas (18.5.4.1) leads to the poor estimate $[\sigma_{\rm res}(r) = 0.18]$ Å.
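
The two predictions quoted from (18.5.4.1) can be reproduced with a few lines of Python. This is a sketch only: the function name is illustrative, and writing the constant 3(0.02)² as 3σ_geom² with an adjustable `sigma_geom_l` is an assumption made for convenience.

```python
import math

def two_atom_sigma_res(sigma_diff_r, sigma_geom_l=0.02):
    """Restrained position s.u. (Å) predicted by equation (18.5.4.1)."""
    d2 = sigma_diff_r**2
    g2 = 3.0 * sigma_geom_l**2  # the 3(0.02)^2 term of the equation
    return math.sqrt(d2 * (d2 + g2) / (2.0 * d2 + g2))

# Carbon at B = 15 and B = 50 Å^2: sigma_diff(r) values quoted in the text.
for sigma_diff in (0.034, 0.25):
    print(f"{sigma_diff:.3f} -> {two_atom_sigma_res(sigma_diff):.3f}")
# prints "0.034 -> 0.028" and "0.250 -> 0.178"
```

For large σ_diff(r) the ratio in braces tends to 1/2, so the two-atom model can never reduce σ(r) by more than a factor of 2^{1/2}; the restrained full matrix gives a larger reduction at B = 50 Å².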

Thus at high B, equation (18.5.4.1) from the two-atom model does not give a good description of the relationship between the restrained and unrestrained $[\sigma (r)]$ . The reason is obvious. Most atoms are linked by 1–2 bond restraints to two or three other atoms. Even a carbonyl oxygen atom linked to its carbon atom by a 0.02 Å restraint is also subject to 0.04 Å 1–3 restraints to chain $[\hbox{C}_{\alpha}]$ and N atoms. Consequently, for a high-B atom, when the restraints are applied it is coupled to several other atoms in a group, and its $[\sigma_{\rm res}(r)]$ is lower, compared with the diffraction-data-only $[\sigma_{\rm diff}(r)]$ , by a greater amount than would be expected from the two-atom model.

18.5.4.2. Unrestrained inversion for an immunoglobulin

Sheldrick has provided the results of the unrestrained lower-resolution refinement of a single-chain immunoglobulin mutant (T39K) with 218 amino-acid residues, with data to 1.70 Å refined isotropically (Usón et al., 1999 ). Fig. 18.5.4.3 shows $[\sigma_{\rm diff}(r)]$ versus $[B_{\rm eq}]$ for the fully occupied protein atoms. Superposed on the data points are least-squares quadratic fits. In a first very rough approximation for $[\sigma_{\rm diff}(x_{i})]$ suggested later by equation (18.5.6.3), the dependence on atom type is controlled by $[1/Z_{i}]$ , the reciprocal of the atomic number. Sheldrick found that a $[1/Z_{i}]$ dependence produced too little difference between C, N and O. The proportionalities between the quadratics for $[\sigma (r)]$ in Figs. 18.5.4.1 and 18.5.4.3 are based on the reciprocals of the scattering factors at $[\sin \theta /\lambda = 0.3\ \hbox{\AA}^{-1}]$ , symbolized by $[Z_{i}^{\# }]$ . For C, N and O, these are 2.494, 3.219 and 4.089, respectively. For potential use in later work, the least-squares fits to the $[\sigma (r_{i})Z_{i}^{\# }]$ in Å are recorded here as $[\eqalignno{ 0.11892 &+ 0.00891B + 0.0001462B^{2}, &(18.5.4.2a)\cr 0.01826 &+ 0.001043B + 0.0002230B^{2} \hbox{ and } &(18.5.4.2b)\cr 0.00115 &+ 0.004414B + 0.0000214B^{2} &(18.5.4.2c)\cr}]$ for the immunoglobulin (unrestrained), concanavalin A (unrestrained) and concanavalin A (restrained), respectively.

Figure 18.5.4.3

Plot of $[\sigma_{\rm diff}(r)]$ versus $[B_{\rm eq}]$ from an unrestrained full matrix for immunoglobulin mutant (T39K) with 1.70 Å data. Carbon black, nitrogen blue, oxygen red.

As might be expected from the lower resolution, the lowest $[\sigma_{\rm diff}(r)]$ 's in the immunoglobulin are about six times the lowest $[\sigma_{\rm diff}(r)]$ 's in concanavalin. But at B = 50 Å², the immunoglobulin curve for carbon gives $[\sigma_{\rm diff}(r) = 0.37]$ Å, which is only 50% larger than the concanavalin value of 0.25 Å.
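
For later use, the fits (18.5.4.2a–c) can be evaluated directly. The sketch below is an illustration only: the dictionary keys and the function name `sigma_r` are assumptions, while the coefficients and the scattering factors are those quoted in the text.

```python
# Coefficients (c0, c1, c2) of the fits to sigma(r) * Z#, equations (18.5.4.2a-c).
FITS = {
    "immunoglobulin_unrestrained": (0.11892, 0.00891, 0.0001462),
    "concanavalin_unrestrained": (0.01826, 0.001043, 0.0002230),
    "concanavalin_restrained": (0.00115, 0.004414, 0.0000214),
}
# Scattering factors Z# at sin(theta)/lambda = 0.3 Å^-1, as quoted in the text.
Z_SHARP = {"C": 2.494, "N": 3.219, "O": 4.089}

def sigma_r(fit, element, b):
    """Position s.u. (Å) for an atom of the given element and B (Å^2)."""
    c0, c1, c2 = FITS[fit]
    return (c0 + c1 * b + c2 * b * b) / Z_SHARP[element]

immuno = sigma_r("immunoglobulin_unrestrained", "C", 50.0)
concan = sigma_r("concanavalin_unrestrained", "C", 50.0)
print(f"{immuno:.2f} {concan:.2f}")  # prints "0.37 0.25"
```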

Fig. 18.5.4.4 shows $[\sigma_{\rm diff}(l)]$ versus $[B_{\rm eq}]$ for the immunoglobulin. Note that the lowest immunoglobulin unrestrained $[\sigma_{\rm diff}(l)]$ is about 0.06 Å, which is three times the 0.02 Å $[\sigma_{\rm geom}(l)]$ bond restraint.

Figure 18.5.4.4

Plot of $[\sigma_{\rm diff}(l)]$ versus average $[B_{\rm eq}]$ from an unrestrained full matrix for immunoglobulin mutant (T39K) with 1.70 Å data. C—C black, C—N blue, C—O red.

18.5.4.3. Comments on restrained refinement

Geometric restraint dictionaries typically use bond-length weights based on $[\sigma_{\rm geom}(l)]$ of around 0.02 or 0.03 Å. Tables 18.5.7.1–18.5.7.3 show that even 1.5 Å studies have diffraction-only errors $[\sigma_{\rm diff}(x, B_{\rm avg})]$ of 0.08 Å and upwards. Only for resolutions of 1.0 Å or so are the diffraction-only errors comparable with the dictionary weights. Of course, the dictionary offers no values for many of the configurational parameters of the protein structure, including the centroid and molecular orientation.

18.5.4.4. Full-matrix estimates of precision

The opening contention of this chapter in Section 18.5.1.1 is that the variances and covariances of the structural parameters of proteins can be found from the inverse of the least-squares normal matrix. But there is a caveat, chiefly that explicit account would not be taken of disorder of the solvent or of parts of the protein. Corrections by Babinet's principle of complementarity or by mask bulk solvent models are only first-order approximations. The consequences of such disorder problems, which make the variation of calculated structure factors nonlinear over the range of interest, may in future be better handled by maximum-likelihood methods (e.g. Read, 1990; Bricogne, 1993; Bricogne & Irwin, 1996; Murshudov et al., 1997). Pannu & Read (1996) have shown how the maximum-likelihood method can be cast computationally into a form akin to least-squares calculations. Full-matrix precision estimates along the lines of the present chapter are probably somewhat low.

It should also be noted that full-matrix estimates of coordinate precision are most reliably derived from matrices involving both coordinates and atomic displacement parameters. This is particularly important for lower-resolution analyses, in which atomic images overlap. The work on the high-resolution analysis of concanavalin A described in Section 18.5.4.1 was based on the very large coordinate matrix, of order 6402. The omission, because of computer limitations, of the anisotropic displacement parameters from the full matrix will have caused the coordinate s.u.'s of atoms with high $[B_{\rm eq}]$ to be underestimated.

Much information about the quality of a molecular model can be obtained from the eigenvalues and eigenvectors of the normal matrix (Cowtan & Ten Eyck, 2000).

18.5.5. Approximate methods

18.5.5.1. Block calculations

The full-matrix inversions described in the previous section require massive calculations. The length of the calculations is more a matter of the order of the matrix, i.e., the number of parameters, than of the number of observations. When restraints are applied, it is the diffraction-cum-restraints full matrix which should be inverted.

With the increasing power of computers and more efficient algorithms (e.g. Tronrud, 1999; Murshudov et al., 1999), a final full matrix should be computed and inverted much more regularly, and not just for high-resolution analyses. Low-resolution analyses have a need, beyond the indications given by B values, to identify through $[\sigma (x)]$ estimates their regions of tolerable and less tolerable precision.

If full-matrix calculations are impractical, partial schemes can be suggested. As far back as 1973, Watenpaugh et al. (1973), in a study of rubredoxin at 1.5 Å resolution, effectively inverted the diffraction full matrix in 200 parameter blocks to obtain individual s.u.'s. A similar scheme for restrained refinements could also use overlapping large blocks. A minimal block scheme in refinements of any resolution is to calculate blocks for each residue and for the block interactions between successive residues. The inversion process could then use the matrices in running groups of three successive residues, taking only the inverted elements for the central residue as the estimates of its variances and covariances.
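
The running three-residue scheme suggested above can be sketched as follows. This is a minimal pure-Python illustration, not code from any refinement program. The names `invert` and `running_block_variances`, the list-of-lists matrix layout and the `residue_slices` index ranges are all assumptions. The two terminal residues are handled with two-residue groups, and the overall scale factor that converts inverse-matrix elements into variances is taken as unity.

```python
def invert(matrix):
    """Gauss-Jordan inverse with partial pivoting, for small dense blocks."""
    n = len(matrix)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular block")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def running_block_variances(normal, residue_slices):
    """Invert each residue together with its neighbours; keep the central part.

    normal         -- normal matrix as a list of lists (hypothetical layout)
    residue_slices -- (start, stop) parameter index range for each residue
    """
    variances = [None] * len(normal)
    last = len(residue_slices) - 1
    for k, (start, stop) in enumerate(residue_slices):
        lo = residue_slices[max(k - 1, 0)][0]
        hi = residue_slices[min(k + 1, last)][1]
        block = [[normal[i][j] for j in range(lo, hi)] for i in range(lo, hi)]
        inverse = invert(block)
        for i in range(start, stop):
            variances[i] = inverse[i - lo][i - lo]  # central residue only
    return variances

# Toy example: three one-parameter "residues" coupled to their neighbours.
toy = [[2.0, 1.0, 0.0], [1.0, 2.0, 1.0], [0.0, 1.0, 2.0]]
print(running_block_variances(toy, [(0, 1), (1, 2), (2, 3)]))
```

In this toy case the central value equals the full-matrix result, whereas the terminal values are smaller than the full-matrix ones because couplings outside each group are ignored.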

For low-resolution analyses with very large numbers of atoms, it might be sufficient to gain a general idea of the behaviour of $[\sigma (x)]$ as a function of B by computing a limited number of blocks for representative or critical groups of residues. The parameters used in the blocks should include the B's, since atomic images overlap at low resolution, thus correlating the position of one atom with the displacement parameters of its neighbours.

18.5.5.2. The modified Fourier method

In the simplest form of the Fourier-map approach to centrosymmetric high-resolution structures, atomic positions are given by the maxima of the observed electron density. The uncertainty of such a position may be estimated as the uncertainty in the slope function (first derivative) divided by the curvature (second derivative) at the peak (Cruickshank, 1949a), i.e., $[\sigma (x) = \sigma (\hbox{slope})/(\hbox{atomic peak `curvature'}). \eqno(18.5.5.1)]$ However, atomic positions are affected by finite-series and peak-overlapping effects.

Hence, more generally, atomic positions may be determined by the requirement that the slope of the difference map at the position of atom r should be zero, or equivalently that the slopes at atom r of the observed and calculated electron densities should be equal. As a criterion this becomes the basis of the modified Fourier method (Cruickshank, 1952, 1959, 1999; Bricogne, 2001, Section 1.3.4.4.7.5), which, like the least-squares method, is applicable whether or not the atomic peaks are resolved and is applicable to noncentrosymmetric structures. For refinement, a set of n simultaneous linear equations is involved, analogous to the normal equations of least squares. Their right-hand sides are the slopes of the difference map at the trial atomic positions.

The diagonal elements of the matrix, for coordinate $[x_{r}]$ of an atom with Debye B value $[B_{r}]$, are approximately equal to $[\hbox{`curvature'} = (4\pi^{2}/a^{2}V) \left[\textstyle\sum\limits_{hkl}\displaystyle (m / 2)h^{2} f_{r} \exp (-B_{r} \sin^{2}\theta / \lambda^{2})\right], \eqno(18.5.5.2)]$ where $[m = 1]$ or 2 for acentric or centric reflections. The summation is over all independent planes and their symmetry equivalents. Strictly speaking, (18.5.5.2) is a curvature only for centrosymmetric structures.

In the modified Fourier method, $[\sigma (\hbox{slope}) = (2\pi/aV) \left[\textstyle\sum\limits_{hkl}\displaystyle h^{2} (\Delta |F|)^{2}\right]^{1/2}. \eqno(18.5.5.3)]$ This is simply an estimate of the r.m.s. uncertainty at a general position (Cruickshank & Rollett, 1953) in the slope of the difference map, i.e., the r.m.s. uncertainty on the right-hand side of the modified Fourier method.

$[\sigma (x)]$ is then given by (18.5.5.1), using (18.5.5.3) and (18.5.5.2).
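
As an illustration of how (18.5.5.1)–(18.5.5.3) fit together, the following Python sketch accumulates the two sums over a reflection list. The tuple layout `(h, m, f_r, s2, delta_f)` and the function name are hypothetical, the residual term is taken as the squared amplitude difference (Δ|F|)², and the list is assumed already to contain all independent planes and their symmetry equivalents.

```python
import math

def sigma_x_modified_fourier(reflections, a, volume, b_atom):
    """sigma(x) in Å from equations (18.5.5.1)-(18.5.5.3).

    reflections -- iterable of (h, m, f_r, s2, delta_f), where
                   h is the Miller index along the axis of length a,
                   m is 1 (acentric) or 2 (centric),
                   f_r is the scattering factor of atom r,
                   s2 is sin^2(theta)/lambda^2 in Å^-2,
                   delta_f is the residual |Fo| - |Fc|
    """
    curvature_sum = 0.0
    slope_sum = 0.0
    for h, m, f_r, s2, delta_f in reflections:
        curvature_sum += (m / 2.0) * h * h * f_r * math.exp(-b_atom * s2)
        slope_sum += h * h * delta_f**2
    curvature = (4.0 * math.pi**2 / (a * a * volume)) * curvature_sum
    sigma_slope = (2.0 * math.pi / (a * volume)) * math.sqrt(slope_sum)
    return sigma_slope / curvature
```

Because B enters only through the exponential in the curvature, a larger B lowers the curvature and so raises σ(x) for the same residuals.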

18.5.5.3. Application of the modified Fourier method

An extreme example of an apparently successful gross approximation to protein precision is represented by Daopin et al.'s (1994) treatment of two independent determinations (at 1.8 and 1.95 Å) of the structure of TGF-β2. They reported that the modified Fourier-map formulae given in Section 18.5.5.2 yielded a quite good description of the B dependence of the positional differences between the two independent determinations. However, there is a formal difficulty about this application. Equation (18.5.5.1) derives from a diffraction-data-only approach, whereas the two structures were determined from restrained refinements. Even though the TNT restraint parameters and weights may have been the same in both refinements, it is slightly surprising that (18.5.5.1) should have worked well.

Equation (18.5.5.1) requires the summation of various series over all (hkl) observations; such calculations are not customarily provided in protein programs. However, due to the fundamental similarities between Fourier and least-squares methods demonstrated by Cochran (1948), Cruickshank (1949b, 1952, 1959), and Cruickshank & Robertson (1953), closely similar estimates of the precision of individual atoms can be obtained from the reciprocal of the diagonal elements of the diffraction-data-only least-squares matrix. These elements will often have been calculated already within the protein refinement programs, but possibly never output. Such estimates could be routinely available.

Between approximations using largish blocks and those using only the reciprocals of diagonal terms, a whole variety of intermediate approximations involving some off-diagonal terms could be envisaged.

Whatever method is used to estimate uncertainties, it is essential to distinguish between coordinate uncertainty, e.g., $[\sigma (x)]$ , and position uncertainty $[\sigma (r) = [\sigma^{2} (x) + \sigma^{2} (y) + \sigma^{2} (z)]^{1/2}]$ .

The remainder of this chapter discusses two rough-and-ready indicators of structure precision: the diffraction-component precision index (DPI) and Luzzati plots.

18.5.6. The diffraction-component precision index

18.5.6.1. Statistical expectation of error dependence

From general statistical theory, one would expect the s.u. of an atomic coordinate determined from the diffraction data alone to show dependence on four factors: $[\sigma (x) \propto ({\cal R}) \left[(n_{\rm atoms}) / (n_{\rm obs} - n_{\rm params})\right]^{1/2} (1 / s_{\rm rms}). \eqno(18.5.6.1)]$ Here, $[{\cal R}]$ is some measure of the precision of the data; $[n_{\rm atoms}]$ is the recognition that the information content of the data has to be shared out; $[n_{\rm obs}]$ is the number of independent data, but to achieve the correct number of degrees of freedom this must be reduced by $[n_{\rm params}]$ , the number of parameters determined; and $[1/s_{\rm rms}]$ is a more specialized factor arising from the sensitivity $[\partial |F|/\partial x]$ of the data to the parameter x. Here $[s_{\rm rms}]$ is the r.m.s. reciprocal radius of the data. Any statistical error estimate must show some correspondence to these four factors.

18.5.6.2. A simple error formula

Cruickshank (1960) offered a simple order-of-magnitude formula for $[\sigma (x)]$ in small molecules. It was intended for use in experimental design: how many data of what precision are needed to achieve a given precision in the results? The formula, derived from a very rough estimate of a least-squares diagonal element in noncentrosymmetric space groups, was $[\sigma (x_{i}) = (1 / 2) (N_{i} / p)^{1/2} [R / s_{\rm rms}]. \eqno(18.5.6.2)]$ Here $[p = n_{\rm obs} - n_{\rm params}]$, R is the usual residual $[\sum |\Delta F|/\sum |F|]$ and $[N_{i}]$ is the number of atoms of type i needed to give scattering power at $[s_{\rm rms}]$ equal to that of the asymmetric unit of the structure, i.e., $[\sum_{j}f_{j}^{2} \equiv N_{i}f_{i}^{2}]$. [The formula has also proved very useful in a systematic study of coordinate precision in the many thousands of small-molecule structure analyses recorded in the Cambridge Structural Database (Allen et al., 1995a,b).]

For small molecules, the above definition of $[N_{i}]$ allowed the treatment of different types of atom with not-too-different B's. However, it is not suitable for individual atoms in proteins where there is a very large range of B values and some atoms have B's so large as to possess negligible scattering power at $[s_{\rm rms}]$ .

Often, as in isotropic refinement, $[n_{\rm params} \simeq 4n_{\rm atoms}]$ , where $[n_{\rm atoms}]$ is the total number of atoms in the asymmetric unit. For fully anisotropic refinement, $[n_{\rm params} \simeq 9n_{\rm atoms}]$ .

A first very rough extension of (18.5.6.2) for application in proteins to an atom with $[B = B_{i}]$ is $[\sigma (x_{i}) = k(N_{i} / p)^{1/2} \left[g(B_{i}) / g(B_{\rm avg})\right] C^{-1/3} Rd_{\min}, \eqno(18.5.6.3)]$ where k is about 1.0, $[N_{i} = \sum Z_{j}^{2}/Z_{i}^{2}]$, $[B_{\rm avg}]$ is the average B for fully occupied sites and C is the fractional completeness of the data to $[d_{\min}]$. In deriving (18.5.6.3) from (18.5.6.2), $[1/s_{\rm rms}]$ has been replaced by $[1.3d_{\min}]$, and the factor $[(1/2)(1.3) = 0.65]$ has been increased to 1.0 as a measure of caution in the replacement of a full matrix by a diagonal approximation. $[g(B) = 1 + a_{1}B+ a_{2}B^{2}]$ is an empirical function to allow for the dependence of $[\sigma (x)]$ on B. However, the results in Section 18.5.4.2 showed that the parameters $[a_{1}]$ and $[a_{2}]$ depend on the structure.

As also mentioned in Section 18.5.4.2 , Sheldrick has found that the $[Z_{i}]$ in $[N_{i}]$ is better replaced by $[Z_{i}^{\#}]$ , the scattering factor at $[\sin \theta /\lambda = 0.3\ \hbox{\AA }^{-1}]$ . Hence, $[N_{i}]$ may be taken as $[N_{i} = (\textstyle\sum\displaystyle Z_{j}^{\# 2} / Z_{i}^{\# 2}). \eqno(18.5.6.4)]$

A useful comparison of the relative precision of different structures may be obtained by comparing atoms with the respective $[B = B_{\rm avg}]$ in the different structures. (18.5.6.3) then reduces to $[\sigma (x, B_{\rm avg}) = 1.0 (N_{i} / p)^{1/2} C^{-1/3} Rd_{\min}. \eqno(18.5.6.5)]$ The smaller the $[d_{\min}]$ and the R, the better the precision of the structure. If the difference between oxygen, nitrogen and carbon atoms is ignored, $[N_{i}]$ may be taken simply as the number of fully occupied sites. For heavy atoms, (18.5.6.4) must be used for $[N_{i}]$ .

Equation (18.5.6.5) is not to be regarded as having absolute validity. It is a quick and rough guide for the diffraction-data-only error component for an atom with Debye B equal to the $[B_{\rm avg}]$ for the structure. It is named the diffraction-component precision index, or DPI. It contains none of the restraint data.
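
A minimal Python sketch of (18.5.6.5) is given here as an illustration. The function and argument names are assumptions. The factor $[C^{-1/3}]$ is passed directly as `c_inv_cbrt`, and the example uses the crambin inputs of Table 18.5.7.2 with nine parameters per atom assumed for the fully anisotropic refinement.

```python
import math

def dpi_sigma_x(n_i, n_obs, n_params, c_inv_cbrt, r_factor, d_min, k=1.0):
    """Coordinate DPI sigma(x, B_avg) in Å, equation (18.5.6.5)."""
    p = n_obs - n_params  # degrees of freedom
    if p <= 0:
        raise ValueError("n_obs - n_params must be positive for the R-based DPI")
    return k * math.sqrt(n_i / p) * c_inv_cbrt * r_factor * d_min

# Crambin (Table 18.5.7.2); 9 parameters per atom is an assumption.
n_atoms = 447
sigma_x = dpi_sigma_x(n_atoms, 23759, 9 * n_atoms, 1.074, 0.090, 0.83)
print(f"{sigma_x:.3f}")  # prints "0.012"
```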

18.5.6.3. Extension for low-resolution structures and use of R_free

For low-resolution structures, the number of parameters may exceed the number of diffraction data. In (18.5.6.3) and (18.5.6.5), $[p = n_{\rm obs} - n_{\rm params}]$ is then negative, so that $[\sigma (x)]$ is imaginary. This difficulty can be circumvented empirically by replacing p with $[n_{\rm obs}]$ and R with $[R_{\rm free}]$ (Brünger, 1992). The counterpart of the DPI (18.5.6.5) is then $[\sigma (x, B_{\rm avg}) = 1.0 (N_{i} / n_{\rm obs})^{1/2} C^{-1/3} R_{\rm free}d_{\min}. \eqno(18.5.6.6)]$ Here $[n_{\rm obs}]$ is the number of reflections included in the refinement, not the number in the $[R_{\rm free}]$ set.

It may be asked: how can there be any estimate for the precision of a coordinate from the diffraction data only when there are insufficient diffraction data to determine the structure? By following the line of argument of Cruickshank's (1960) analysis, (18.5.6.6) is a rough estimate of the square root of the reciprocal of one diagonal element of the diffraction-only least-squares matrix. All the other parameters can be regarded as having been determined from a diffraction-plus-restraints matrix.

Clearly, (18.5.6.6) can also be used as a general alternative to (18.5.6.5) as a DPI, irrespective of whether the number of degrees of freedom $[p = n_{\rm obs} - n_{\rm params}]$ is positive or negative.

Comment . When p is positive, (18.5.6.6) would be exactly equivalent to (18.5.6.5) only if $[R_{\rm free} = R[n_{\rm obs}/(n_{\rm obs} - n_{\rm params})]^{1/2}]$ . Tickle et al. (1998b) have shown that the expected relationship in a restrained refinement is actually $[{R_{\rm free} = R\{[n_{\rm obs} + (n_{\rm params} - h)] / [n_{\rm obs} - (n_{\rm params} - h)]\}^{1/2},} \eqno(18.5.6.7)]$ where $[h = n_{\rm restraints} - \sum w_{\rm geom}(\Delta Q)^{2}]$ , the latter term, as in (18.5.3.1), being the weighted sum of the squares of the restraint residuals.
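
The two ratios $[R_{\rm free}/R]$ in this comment can be compared numerically. In the sketch below the function names and the example counts (20 000 reflections, 8000 parameters, h = 5000) are purely hypothetical and are not taken from any structure in this chapter.

```python
import math

def equivalence_ratio(n_obs, n_params):
    """R_free/R for which (18.5.6.6) and (18.5.6.5) would coincide."""
    p = n_obs - n_params
    if p <= 0:
        raise ValueError("requires n_obs > n_params")
    return math.sqrt(n_obs / p)

def expected_ratio(n_obs, n_params, h):
    """Expected R_free/R in a restrained refinement, equation (18.5.6.7)."""
    effective = n_params - h  # parameters not accounted for by the restraints
    if n_obs - effective <= 0:
        raise ValueError("requires n_obs > n_params - h")
    return math.sqrt((n_obs + effective) / (n_obs - effective))

# Hypothetical counts, for illustration only.
print(f"{equivalence_ratio(20000, 8000):.3f}")     # prints "1.291"
print(f"{expected_ratio(20000, 8000, 5000):.3f}")  # prints "1.163"
```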

18.5.6.4. Position error

Often an estimate of a position error $[|\Delta {\bf r}|]$ , rather than a coordinate error $[|\Delta x|]$ , is required. In the isotropic approximation, $[\sigma (r, B_{\rm avg}) = 3^{1/2} \sigma (x, B_{\rm avg}). \eqno(18.5.6.8)]$ Consequently, the DPI formulae for the position errors are $[\sigma (r, B_{\rm avg}) = 3^{1/2} (N_{i} / p)^{1/2} C^{-1/3} Rd_{\min} \eqno(18.5.6.9)]$ with R and $[\sigma (r, B_{\rm avg}) = 3^{1/2} (N_{i} / n_{\rm obs})^{1/2} C^{-1/3} R_{\rm free}d_{\min} \eqno(18.5.6.10)]$ with $[R_{\rm free}]$ .

18.5.7. Examples of the diffraction-component precision index

18.5.7.1. Full-matrix comparison with the diffraction-component precision index

The DPI (18.5.6.9) with R was offered as a quick and rough guide for the diffraction-data-only error for an atom with $[B = B_{\rm avg}]$. The necessary data for the comparison with the two unrestrained full-matrix inversions of Section 18.5.4 are given in Table 18.5.7.1. For concanavalin A with $[B_{\rm avg} = 14.8\ \hbox{\AA}^{2}]$, the full-matrix quadratic (18.5.4.2b) gives 0.033 Å for a carbon atom and the DPI gives 0.034 Å for an unspecified atom. For the immunoglobulin with $[B_{\rm avg} = 26.8\ \hbox{\AA}^{2}]$, the full-matrix quadratic (18.5.4.2a) gives $[\sigma_{\rm diff}(r) = 0.19\ \hbox{\AA}]$ for a carbon atom, while the DPI gives 0.22 Å.

Table 18.5.7.1
Comparison of full-matrix $[\sigma (r, B_{\rm avg})]$ with the diffraction-component precision index (DPI)

Protein	$[(N_{i}/p)^{1/2}]$	R	$[d_{\min}\ {(\hbox{\AA})}]$	DPI $[\sigma (r, B_{\rm avg})\ {(\hbox{\AA})}]$	Full-matrix $[\sigma_{\rm diff}(r, B_{\rm avg})\ {(\hbox{\AA})}]$	Reference
Concanavalin A	0.148	0.128	0.94	0.034	0.033	(a)
Immunoglobulin	0.476	0.156	1.70	0.221	0.186	(b)

References: (a) Deacon et al. (1997); (b) Usón et al. (1999).

For these two structures, the simple DPI formula compares surprisingly well with the unrestrained full-matrix calculations at $[B_{\rm avg}]$ .

For the restrained full-matrix calculations on concanavalin A, the quadratic (18.5.4.2c) with $[B = B_{\rm avg}]$ gives $[\sigma_{\rm res}(r) = 0.028\ \hbox{\AA}]$ for a carbon atom, which is only 15% smaller than the unrestrained 0.033 Å. This small decrease matches the discussion of $[\sigma_{\rm res}(r)]$ and $[\sigma_{\rm diff}(r)]$ in Section 18.5.4.1 following equation (18.5.4.1). But that discussion also indicates that for the immunoglobulin, the restrained $[\sigma_{\rm res}(r, B_{\rm avg})]$, which was not computed, will be proportionally much lower than the unrestrained value of $[\sigma_{\rm diff}(r, B_{\rm avg}) = 0.19\ \hbox{\AA}]$, since the restraints are relatively more important in the immunoglobulin.

18.5.7.2. Further examples of the DPI using R

Table 18.5.7.2 shows a range of examples of the application of the DPI (18.5.6.9) using R to proteins of differing precision, starting with the smallest $[d_{\rm min}]$. In all the examples, $[N_{i}]$ has been set equal to $[n_{\rm atoms}]$, the total number of atoms. The ninth and tenth columns show $[\langle \Delta r\rangle]$ values derived from Luzzati (1952) and Read (1986) plots described later in Section 18.5.8.

Table 18.5.7.2
Examples of diffraction-component precision indices (DPIs)

Protein	$[N_{i}]$	$[n_{\rm obs}]$	$[(N_{i}/p)^{1/2}]$	$[C^{-1/3}]$	R	$[d_{\min}]$ (Å)	DPI $[\sigma (r, B_{\rm avg})]$ (Å)	Luzzati $[\langle \Delta r\rangle]$ (Å)	Read $[\langle \Delta r\rangle]$ (Å)	Reference
Crambin	447	23759	0.150	1.074	0.090	0.83	0.021	0.055		(a)
Ribonuclease MGMP	1958	62845	0.208	1.046	0.109	1.15	0.047		0.08	(b)
Ribonuclease MSA	1832	60670	0.204	1.016	0.106	1.20	0.045		0.05	(b)
TGF- $[\beta 2]$ 1TGI	948	∼14000	0.305	∼1.0	0.173	1.80	0.16	0.21	0.18	(c)
TGF- $[\beta 2]$ 1TFG	974	∼11000	0.370	∼1.0	0.188	1.95	0.24	0.23		(c)
Lactoferrin	5907	39113	0.618	1.036	0.179	2.20	0.43	0.25–0.30	0.35	(d)
Thaumatin C2	1552	4622	^†	1.10	0.184	2.60	—	0.25		(e)

References: (a) Stec et al. (1995); (b) Sevcik et al. (1996); (c) Daopin et al. (1994); (d) Haridas et al. (1995); (e) Ko et al. (1994).

^† $[(N_{i}/p)]$ negative.

The first entry is for crambin at 0.83 Å resolution and 130 K (Stec et al., 1995 ). Their results were obtained from an unrestrained full-matrix anisotropic refinement. Inversion of the full matrix gave s.u.'s $[\sigma_{\rm diff}(x) = 0.0096\ \hbox{\AA}]$ for backbone atoms, 0.0168 Å for side-chain atoms and 0.0409 Å for solvent atoms, with an average for all atoms of 0.022 Å. The DPI $[\sigma (r, B_{\rm avg}) = 0.021\ \hbox{\AA}]$ corresponds to $[\sigma (x) = 0.012\ \hbox{\AA}]$ , which is satisfactorily intermediate between the full-matrix values for the backbone and side-chain atoms.

Sevcik et al. (1996) carried out restrained anisotropic full-matrix refinements on data from two slightly different crystals of ribonuclease Sa, with $[d_{\min}]$ of 1.15 and 1.20 Å. They inverted full-matrix blocks containing parameters of 20 residues to estimate coordinate errors. The overall r.m.s. coordinate error for protein atoms is given as 0.03 Å, and for all atoms (including waters and ligands) as 0.07 Å for MGMP and 0.05 Å for MSA. The DPI gives $[\sigma (r, B_{\rm avg}) = 0.05\ \hbox{\AA}]$ for both structures.

The next entries concern the two lower-resolution (1.8 and 1.95 Å) studies of TGF-$[\beta 2]$ (Daopin et al., 1994). The DPI gives $[\sigma (r) = 0.16\ \hbox{\AA}]$ for 1TGI and 0.24 Å for 1TFG. This indicates an r.m.s. position difference between the structures for atoms with $[B_{i} = B_{\rm avg}]$ of $[(0.16^{2} + 0.24^{2})^{1/2} = 0.29\ \hbox{\AA}]$. Daopin et al. reported the differences between the two determinations, omitting poor parts, as $[\langle \Delta r\rangle_{\rm rms} = 0.15\ \hbox{\AA}]$ (main chain) and 0.29 Å (all atoms).

Human diferric lactoferrin (Haridas et al., 1995 ) is an example of a large protein at the lower resolution of 2.2 Å, with a high value of $[(N_{i}/p)^{1/2}]$ , leading to $[\sigma (r, B_{\rm avg}) = 0.43\ \hbox{\AA}]$ .

Three crystal forms of thaumatin were studied by Ko et al. (1994). The orthorhombic and tetragonal forms diffracted to 1.75 Å, but the monoclinic C2 form diffracted only to 2.6 Å. The structures with 1552 protein atoms were successfully refined with restraints by XPLOR and TNT. For the monoclinic form, the number of parameters exceeds the number of diffraction observations, so $[(N_{i}/p)]$ is negative and no estimate by (18.5.6.9) of the diffraction-data-only error is possible. The DPI (18.5.6.9) gives 0.17 and 0.16 Å for the orthorhombic and tetragonal forms, respectively.

18.5.7.3. Examples of the DPI using R_free

As in the case of monoclinic thaumatin, for low-resolution structures the number of parameters may exceed the number of diffraction data. To circumvent this difficulty, it was proposed in Section 18.5.6.3 to replace $[p = n_{\rm obs} - n_{\rm params}]$ by $[n_{\rm obs}]$ and R by $[R_{\rm free}]$ in a revised formula (18.5.6.10) for the DPI. Table 18.5.7.3 shows examples for some structures for which both R and $[R_{\rm free}]$ were available. The second row for each protein shows the alternative values for $[(N_{i}/n_{\rm obs})^{1/2}]$ , $[R_{\rm free}]$ and the DPI $[\sigma (r, B_{\rm avg})]$ from (18.5.6.10).

Table 18.5.7.3
Comparison of DPIs using R and R_free

The second row for each protein contains values appropriate to the DPI equation (18.5.6.10) using R_free.

Protein	$[N_{i}]$	$[n_{\rm obs}]$	$[(N_{i}/p)^{1/2}]$ , $[(N_{i}/n_{\rm obs})^{1/2}]$	$[C^{-1/3}]$	R, $[R_{\rm free}]$	$[d_{\min}]$ (Å)	DPI $[\sigma (r, B_{\rm avg})]$ (Å)	Luzzati $[\langle \Delta r\rangle]$ (Å)	Read $[\langle \Delta r\rangle]$ (Å)	Reference
Concanavalin A	2130	116712	0.148	1.099	0.128	0.94	0.034	0.06		(a)
Concanavalin A			0.135		0.148		0.036

γB-Crystallin	1708	26151	0.297	1.032	0.180	1.49	0.14	0.16	0.12	(b)
γB-Crystallin			0.256		0.204		0.14

βB2-Crystallin	1558	18583	0.356	∼1.032	0.184	2.10	0.25	0.21	0.17	(b)
βB2-Crystallin			0.290		0.200		0.22

Ribonuclease A with RI	4416	18859	1.922	1.145	0.194	2.50	1.85	0.32	0.57	(c)
Ribonuclease A with RI			0.484		0.286		0.69

Fab HyHEL-5 with HEWL	4333	11754	^†	1.111	0.196	2.65	—	0.30		(d)
Fab HyHEL-5 with HEWL			0.607		0.288		0.69

References: (a) Deacon et al. (1997); (b) Tickle et al. (1998a); (c) Kobe & Deisenhofer (1995); (d) Cohen et al. (1996).

^† $[(N_{i}/p)]$ negative.

For the structures with $[d_{\min} \leq 2.0\ \hbox{\AA}]$ , the DPI is much the same whether it is based on R or $[R_{\rm free}]$ .

Tickle et al. (1998a) have made full-matrix error estimates for isotropic restrained refinements of γB-crystallin with $[d_{\min} = 1.49\ \hbox{\AA}]$ and of βB2-crystallin with $[d_{\rm min} = 2.10\ \hbox{\AA}]$ . The DPI $[\sigma (r, B_{\rm avg})]$ calculated for the two structures is 0.14 and 0.25 Å with R in (18.5.6.9), and 0.14 and 0.22 Å with $[R_{\rm free}]$ in (18.5.6.10). The full-matrix weighted averages of $[\sigma_{\rm res}(r)]$ for all protein atoms were 0.10 and 0.15 Å, for only main-chain atoms 0.05 and 0.08 Å, for side-chain atoms 0.14 and 0.20 Å, and for water oxygens 0.27 and 0.35 Å. Again, the DPI gives reasonable overall indices for the quality of the structures.

For the complex of bovine ribonuclease A and porcine ribonuclease inhibitor (Kobe & Deisenhofer, 1995) with $[d_{\min} = 2.50\ \hbox{\AA}]$, the number of reflections is only just larger than the number of parameters, so that $[(N_{i}/p)^{1/2} = 1.922]$ is very large, and the DPI with R gives an unrealistic 1.85 Å. With $[R_{\rm free}]$, $[\sigma (r, B_{\rm avg}) = 0.69\ \hbox{\AA}]$.
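
The contrast between the two indices for this complex can be reproduced from the entries of Table 18.5.7.3. The Python sketch below is illustrative only: the function names are assumptions, four parameters per atom are assumed for the isotropic refinement, and `None` is returned when $[n_{\rm obs} - n_{\rm params}]$ is not positive.

```python
import math

SQRT3 = math.sqrt(3.0)

def dpi_position_r(n_i, n_obs, n_params, c_inv_cbrt, r_factor, d_min):
    """Position DPI with R, equation (18.5.6.9); None if p is not positive."""
    p = n_obs - n_params
    if p <= 0:
        return None
    return SQRT3 * math.sqrt(n_i / p) * c_inv_cbrt * r_factor * d_min

def dpi_position_rfree(n_i, n_obs, c_inv_cbrt, r_free, d_min):
    """Position DPI with R_free, equation (18.5.6.10)."""
    return SQRT3 * math.sqrt(n_i / n_obs) * c_inv_cbrt * r_free * d_min

# Ribonuclease A with RI (Table 18.5.7.3); 4 parameters per atom assumed.
n_atoms, n_obs = 4416, 18859
with_r = dpi_position_r(n_atoms, n_obs, 4 * n_atoms, 1.145, 0.194, 2.50)
with_rfree = dpi_position_rfree(n_atoms, n_obs, 1.145, 0.286, 2.50)
print(f"{with_r:.2f} {with_rfree:.2f}")  # prints "1.85 0.69"
```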

The HyHEL-5–lysozyme complex (Cohen et al., 1996) had $[d_{\rm min} = 2.65\ \hbox{\AA}]$. Here the number of reflections is less than the number of parameters, but the $[R_{\rm free}]$ formula gives $[\sigma (r, B_{\rm avg}) = 0.69\ \hbox{\AA}]$.

18.5.7.4. Comments on the diffraction-component precision index

The DPI (18.5.6.9) or (18.5.6.10) provides a very simple formula for $[\sigma (r, B_{\rm avg})]$ . It is based on a very rough approximation to a diagonal element of the diffraction-data-only matrix. Using a diagonal element is a reasonable approximation for atomic resolution structures, but for low-resolution structures there will be significant off-diagonal terms between overlapping atoms. The effect can be simulated in the two-atom protein model of Section 18.5.3.2 by introducing positive off-diagonal elements into the diffraction-data matrix (18.5.3.3). As expected, $[\sigma_{\rm diff}^{2} (x_{i})]$ is increased. So the DPI will be an underestimate of the diffraction component in low-resolution structures.

However, the true restrained variance $[\sigma_{\rm res}^{2} (x_{i})]$ in the new counterpart of (18.5.3.12) remains less than the diagonal diffraction result (18.5.3.11) $[\sigma_{\rm diff}^{2} (x_{i}) = 1/a]$. Thus for low-resolution structures, the DPI should be an overestimate of the true uncertainty given by a restrained full-matrix calculation (where the restraints act to hold the overlapping atoms apart). This is confirmed by the results for the 2.1 Å study of βB2-crystallin (Tickle et al., 1998a) discussed in Section 18.5.7.3 and Table 18.5.7.3. The restrained full-matrix average for all protein atoms was $[\sigma_{\rm res}(r) = 0.15]$ Å, compared with the DPI 0.25 Å (on R) or 0.22 Å (on $[R_{\rm free}]$). The ratio between the unrestrained DPI and the restrained full-matrix average is consistent with a view of a low-resolution protein as a chain of effectively rigid peptide groups. The ratio no doubt gets much worse for resolutions of 3 Å and above.

The DPI estimate of $[\sigma (r, B_{\rm avg})]$ is given by a formula of `back-of-an-envelope' simplicity. $[B_{\rm avg}]$ is taken to be the average B for fully occupied sites, but the weights implicit in the averaging are not well defined in the derivation of the DPI. Thus the DPI should perhaps be regarded as simply offering an estimate of a typical $[\sigma_{\rm diff}(r)]$ for a carbon or nitrogen atom with a mid-range B. From the evidence of the tables in this section, except at low resolution, it seems to give a useful overall indication of protein precision, even in restrained refinements.

The DPI evidently provides a method for the comparative ranking of different structure determinations. In this regard it is a complement to the general use of $[d_{\min}]$ as a quick indicator of possible structural quality.

Note that (18.5.6.3) and (18.5.6.4) offer scope for making individual error estimates for atoms of different B and Z.

18.5.8. Luzzati plots

18.5.8.1. Luzzati's theory

Luzzati (1952) provided a theory for estimating, at any stage of a refinement, the average positional shifts which would be needed in an idealized refinement to reach $[R = 0]$. He did not provide a theory for estimating positional errors at the end of a normal refinement.

(1) His theory assumed that the $[F_{\rm obs}]$ had no errors, and that the $[F_{\rm calc}]$ model (scattering factors, thermal parameters etc.) was perfect, apart from coordinate errors.
(2) The Gaussian probability distribution for these coordinate errors was assumed to be the same for all atoms, independent of Z or B.
(3) The atoms were not required to be identical, and the position errors were not required to be small.

Luzzati gave families of curves for R versus $[2\sin\theta/\lambda]$ for varying average positional errors $[\langle \Delta r\rangle]$ for both centrosymmetric and noncentrosymmetric structures. The curves do not depend on the number N of atoms in the cell. They all rise from $[R = 0]$ at $[2\sin\theta/\lambda = 0]$ to the Wilson (1950) values 0.828 and 0.586 for random structures at high $[2\sin\theta/\lambda]$. Table 18.5.8.1 gives $[R = \langle |\Delta F|\rangle/\langle |F|\rangle]$ as a function of $[s\langle \Delta r\rangle]$, where $[s = 2\sin\theta/\lambda]$, for three-dimensional noncentrosymmetric structures.

$[s\langle \Delta r\rangle]$	R	$[s\langle \Delta r\rangle]$	R
0.00	0.000	0.10	0.237
0.01	0.025	0.12	0.281
0.02	0.050	0.14	0.319
0.03	0.074	0.16	0.353
0.04	0.098	0.18	0.385
0.05	0.122	0.20	0.414
0.06	0.145	0.25	0.474
0.07	0.168	0.30	0.518
0.08	0.191	0.35	0.548
0.09	0.214	∞	0.586
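As an illustration of how the table is read, the following sketch interpolates linearly between the tabulated points to obtain the theoretical acentric R for a given $[s = 2\sin\theta/\lambda]$ and an assumed $[\langle \Delta r\rangle]$. Linear interpolation and the function name `luzzati_r` are choices made here for illustration. Values of $[s\langle \Delta r\rangle]$ beyond 0.35 are rejected because the table gives only the limiting value 0.586 there.

```python
from bisect import bisect_right

# Table 18.5.8.1: s<dr> and the corresponding acentric R.
_X = [0.00, 0.01, 0.02, 0.03, 0.04, 0.05, 0.06, 0.07, 0.08, 0.09,
      0.10, 0.12, 0.14, 0.16, 0.18, 0.20, 0.25, 0.30, 0.35]
_R = [0.000, 0.025, 0.050, 0.074, 0.098, 0.122, 0.145, 0.168, 0.191, 0.214,
      0.237, 0.281, 0.319, 0.353, 0.385, 0.414, 0.474, 0.518, 0.548]


def luzzati_r(s, mean_dr):
    """Theoretical acentric R at s = 2 sin(theta)/lambda for mean error <dr> (angstroms)."""
    x = s * mean_dr
    if not 0.0 <= x <= _X[-1]:
        raise ValueError("s<dr> lies outside the tabulated range 0 to 0.35")
    # Index of the upper bracketing point, clamped so that x == 0.35 is handled.
    i = min(bisect_right(_X, x), len(_X) - 1)
    x0, x1 = _X[i - 1], _X[i]
    r0, r1 = _R[i - 1], _R[i]
    return r0 + (r1 - r0) * (x - x0) / (x1 - x0)


# One theoretical curve: <dr> = 0.20 A sampled at a few values of s.
for s in (0.1, 0.3, 0.5, 0.7):
    print(f"s = {s:.1f}  R = {luzzati_r(s, 0.20):.3f}")
```

Evaluating the function over a range of s for several values of $[\langle \Delta r\rangle]$ reproduces a family of theoretical curves of the kind against which experimental shell R values are compared.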

In a footnote (p. 807), Luzzati suggested that at the end of a normal refinement (with R nonzero due to experimental and model errors, etc.), the curves would indicate an upper limit for $[\langle \Delta r\rangle]$. He noted that typical small-molecule $[\sigma (r)]$'s of 0.01–0.02 Å, if used as $[\langle \Delta r\rangle]$ in the plots, would give much smaller R's than are found at the end of a refinement.

As examples, the Luzzati plots for the two structures of TGF-β2 are shown in Fig. 18.5.8.1. Daopin et al. (1994) inferred average $[\langle \Delta r\rangle]$'s around 0.21 Å for 1TGI and 0.23 Å for 1TGF.

Figure 18.5.8.1. Luzzati plots showing the refined R factor as a function of resolution for 1TGI (solid squares) and 1TGF (open squares) (Daopin et al., 1994).

Of the three Luzzati assumptions summarized above, the most attractive is the third, which does not require the atoms to be identical nor the position errors to be small. For proteins, there are very obvious difficulties with assumption (2). Errors do depend very strongly on Z and B. In the high-angle data shells, atoms with large B's contribute neither to $[\Delta F]$ nor to $[|F|]$, and so have no effect on R in these shells. In their important paper on protein accuracy, Chambers & Stroud (1979) said `the [Luzzati] estimate derived from reflections in this range applies mainly to [the] best determined atoms.'

Thus a Luzzati plot seems to allow a cautious upper-limit statement about the precision of the best parts of a structure, but it gives little indication for the poor parts.

One reason for the past popularity of Luzzati plots has been that the R values for the middle and outer shells of a structure often roughly follow a Luzzati curve. Evidently, the effective average $[\langle \Delta r\rangle]$ for the structure must be decreasing as $[2\sin\theta/\lambda]$ increases, since atoms of high B are ceasing to contribute, whereas the proportionate experimental errors must be increasing. This also suggests that the upper limit for $[\langle \Delta r\rangle]$ for the low-B atoms could be estimated from the lowest Luzzati theoretical curve touched by the experimental R plot. Thus in Fig. 18.5.8.1 the upper limits for the low-B atoms could be taken as 0.18 and 0.21 Å, rather than the 0.21 and 0.23 Å chosen by Daopin et al.
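The idea of taking the lowest theoretical curve touched by the experimental plot can be sketched as follows: each shell's (s, R) point is converted to the $[\langle \Delta r\rangle]$ of the Luzzati curve passing through it, and the smallest of these values is taken as the upper limit for the low-B atoms. The inversion by linear interpolation in Table 18.5.8.1, the function names and the shell values are all assumptions made for illustration. The shell data are hypothetical and are not read from Fig. 18.5.8.1.

```python
from bisect import bisect_right

# Table 18.5.8.1: s<dr> and the corresponding acentric R.
_X = [0.00, 0.01, 0.02, 0.03, 0.04, 0.05, 0.06, 0.07, 0.08, 0.09,
      0.10, 0.12, 0.14, 0.16, 0.18, 0.20, 0.25, 0.30, 0.35]
_R = [0.000, 0.025, 0.050, 0.074, 0.098, 0.122, 0.145, 0.168, 0.191, 0.214,
      0.237, 0.281, 0.319, 0.353, 0.385, 0.414, 0.474, 0.518, 0.548]


def mean_dr_from_shell(s, r):
    """<dr> (angstroms) of the Luzzati curve passing through the point (s, R)."""
    if s <= 0:
        raise ValueError("s must be positive")
    if not 0.0 <= r <= _R[-1]:
        raise ValueError("R lies outside the tabulated range 0 to 0.548")
    # Invert the table: interpolate s<dr> as a function of R (R increases monotonically).
    i = min(bisect_right(_R, r), len(_R) - 1)
    r0, r1 = _R[i - 1], _R[i]
    x0, x1 = _X[i - 1], _X[i]
    return (x0 + (x1 - x0) * (r - r0) / (r1 - r0)) / s


def lowest_curve_touched(shells):
    """Smallest <dr> over all (s, R) shells: the lowest Luzzati curve the plot touches."""
    return min(mean_dr_from_shell(s, r) for s, r in shells)


# Hypothetical shell R values for the middle and outer shells.
shells = [(0.25, 0.16), (0.35, 0.17), (0.45, 0.19), (0.55, 0.24)]
print(f"upper limit for low-B atoms: {lowest_curve_touched(shells):.2f} A")
```

Because experimental errors increase in the outermost shells, the minimum is usually reached somewhat inside the resolution limit rather than at the last shell.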

From the introduction of $[R_{\rm free}]$ by Brünger (1992) and the discussion of $[R_{\rm free}]$ by Tickle et al. (1998b), it can be seen that Luzzati plots should be based on a residual more akin to $[R_{\rm free}]$ than R in order to avoid bias from the fitting of data.

The mean positional error $[\langle \Delta r\rangle]$ of atoms can also be estimated from the $[\sigma_{A}]$ plots of Read (1986, 1990). This method arose from Read's analysis of improved Fourier coefficients for maps using phases from partial structures with errors. It is preferable in several respects to the Luzzati method, but like the Luzzati method it assumes that the coordinate error distribution is the same for all atoms. Luzzati and/or Read estimates of $[\langle \Delta r\rangle]$ are available for some of the structures in Tables 18.5.7.2 and 18.5.7.3. Often, the two estimates are not greatly different.

18.5.8.2. Statistical reinterpretation of Luzzati plots

Luzzati plots are fundamentally different from other statistical estimates of error. The Luzzati theory applies to an idealized incomplete refinement and estimates the average shifts needed to reach $[R = 0]$. In the least-squares method, the equations for shifts are quite different from the equations for estimating variances in a converged refinement. However, Luzzati-style plots of R versus $[2\sin\theta/\lambda]$ can be reinterpreted to give statistically based estimates of $[\sigma (x)]$.

During Cruickshank's (1960) derivation of the approximate equation (18.5.6.2) for $[\sigma (x)]$ in diagonal least squares, he reached an intermediate equation $[\sigma^{2} (x) = N_{i}\bigg / \left[4 \textstyle\sum\limits_{\rm obs}\displaystyle (s^{2} / R^{2})\right]. \eqno(18.5.8.1)]$ He then assumed R to be independent of $[s\ (= 2\sin \theta/\lambda)]$ and took R outside the summation to reach (18.5.6.2) above.

Luzzati (1952) calculated the acentric residual R as a function of $[\langle \Delta r\rangle]$, the average radial error of the atomic positions. His analysis shows that R is a linear function of s and $[\langle \Delta r\rangle]$ for a substantial range of $[s\langle \Delta r\rangle]$, with $[R(s,\ \langle \Delta r\rangle) = (2\pi)^{1/2} s\langle \Delta r\rangle. \eqno(18.5.8.2)]$ The theoretical Luzzati plots of R are nearly linear for small-to-medium $[s = 2\sin \theta / \lambda]$ (see Fig. 18.5.8.1). If we substitute this R in the least-squares estimate (18.5.8.1) and use the three-dimensional-Gaussian relation $[\sigma (r) = 1.085 \langle \Delta r \rangle]$, some manipulation (Cruickshank, 1999) along the lines of Section 18.5.6 eventually yields a statistically based formula, $[\sigma_{\rm LS,Luzz} (r) = 1.33 (N_{i} / p)^{1/2} [R(s_{m}) / s_{m}], \eqno(18.5.8.3)]$ where $[p = n_{\rm obs} - n_{\rm params}]$ as in Section 18.5.6 and $[R(s_{m})]$ is the value of R at some value of $[s = s_{m}]$ on the selected Luzzati curve. Equation (18.5.8.3) provides a means of making a very rough statistical estimate of error for an atom with $[B = B_{\rm avg}]$ (the average B for fully occupied sites) from a plot of R versus $[2\sin \theta / \lambda]$.
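Two of the numbers used above can be checked directly. The sketch below compares the linear form (18.5.8.2) with a few entries of Table 18.5.8.1 to show over what range the linearity holds, and evaluates the factor 1.085 under the assumption that $[\sigma (r)]$ is the r.m.s. radial error and $[\langle \Delta r\rangle]$ the mean radial error of an isotropic three-dimensional Gaussian, for which the ratio is $[(3\pi/8)^{1/2}]$. The function and variable names are illustrative.

```python
import math

# A few (s<dr>, R) entries from Table 18.5.8.1.
TABLE_POINTS = [(0.01, 0.025), (0.05, 0.122), (0.10, 0.237), (0.20, 0.414), (0.30, 0.518)]


def linear_r(s_dr):
    """Linear Luzzati approximation (18.5.8.2): R = (2 pi)^(1/2) s <dr>."""
    return math.sqrt(2.0 * math.pi) * s_dr


def rms_over_mean_radial_error():
    """sigma(r)/<dr> for an isotropic 3D Gaussian.

    With per-coordinate standard deviation sx, the r.m.s. radial error is
    3^(1/2) sx and the mean radial error is (8/pi)^(1/2) sx.
    """
    return math.sqrt(3.0) / math.sqrt(8.0 / math.pi)


for s_dr, r_table in TABLE_POINTS:
    r_lin = linear_r(s_dr)
    excess = (r_lin - r_table) / r_table
    print(f"s<dr> = {s_dr:.2f}  table R = {r_table:.3f}  linear R = {r_lin:.3f}  excess = {excess:+.1%}")

print(f"sigma(r)/<dr> = {rms_over_mean_radial_error():.3f}")
```

The linear form stays within a few per cent of the tabulated values up to about $[s\langle \Delta r\rangle = 0.1]$ and overshoots increasingly beyond that, which suggests choosing $[s_{m}]$ in the near-linear part of the curve.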

The corresponding equation involving $[R_{\rm free}]$ is $[\sigma_{\rm LS,Luzz} (r) = 1.33 (N_{i} / n_{\rm obs})^{1/2} [R_{\rm free} (s_{m}) / s_{m}]. \eqno(18.5.8.4)]$
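Equations (18.5.8.3) and (18.5.8.4) are simple enough to evaluate by hand; the sketch below merely collects them in one function. The function name, argument names and numerical inputs are hypothetical, and p is taken as $[n_{\rm obs} - n_{\rm params}]$ on the assumption that it has the same meaning as in Section 18.5.6.

```python
import math


def sigma_ls_luzz(n_i, n_obs, n_params, r_at_sm, s_m, use_r_free=False):
    """Rough sigma(r) in angstroms from a Luzzati-style plot.

    Uses (18.5.8.3) with p = n_obs - n_params when r_at_sm is R, and
    (18.5.8.4) with n_obs alone when r_at_sm is R_free.
    s_m is the value of 2 sin(theta)/lambda at which R (or R_free) is read.
    """
    if s_m <= 0:
        raise ValueError("s_m must be positive")
    denom = n_obs if use_r_free else n_obs - n_params
    if denom <= 0:
        raise ValueError("n_obs - n_params <= 0: use the R_free form (18.5.8.4)")
    return 1.33 * math.sqrt(n_i / denom) * r_at_sm / s_m


# Hypothetical inputs chosen for easy arithmetic.
print(f"{sigma_ls_luzz(1500, 12000, 6000, 0.25, 0.5):.4f} A (on R)")
print(f"{sigma_ls_luzz(3000, 12000, 6000, 0.30, 0.5, use_r_free=True):.4f} A (on R_free)")
```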

18.5.8.3. Comments on Luzzati plots

Protein structures always show a great range of B values. The Luzzati theory effectively assumes that all atoms have the same B. Nonetheless, the Luzzati method applied to high-angle data shells does provide an upper limit for $[\langle \Delta r \rangle]$ for the atoms with low B. It is an upper limit since experimental errors and model imperfections are not allowed for in the theory.

Low-resolution structures can be determined validly by using restraints, even though the number of diffraction observations is less than the number of atomic coordinates. The Luzzati method, based preferably on $[R_{\rm free}]$ , can be applied to the atoms of low B in such structures. As the number of observations increases, and the resolution improves, the Luzzati $[\langle \Delta r \rangle]$ increasingly overestimates the true $[\sigma (r)]$ of the low-B atoms.

In the use of Luzzati plots, the method of refinement and its degree of convergence are irrelevant. A Luzzati plot is a statement for the low-B atoms about the maximum errors associated with a given structure, whether converged or not.

References

Allen, F. H., Cole, J. C. & Howard, J. A. K. (1995a). A systematic study of coordinate precision in X-ray structure analyses. I. Descriptive statistics and predictive estimates of e.s.d.'s for C atoms. Acta Cryst. A51, 95–111.

Allen, F. H., Cole, J. C. & Howard, J. A. K. (1995b). A systematic study of coordinate precision in X-ray structure analyses. II. Predictive estimates of e.s.d.'s for the general-atom case. Acta Cryst. A51, 112–121.

Bricogne, G. (1993). Direct phase determination by entropy maximization and likelihood ranking: status report and perspectives. Acta Cryst. D49, 37–60.

Bricogne, G. (2001). Fourier transforms in crystallography: theory, algorithms, and applications. In International tables for crystallography, Vol. B, edited by U. Shmueli, ch. 1.3. Dordrecht: Kluwer Academic Publishers.

Bricogne, G. & Irwin, J. (1996). Maximum-likelihood structure refinement: theory and implementation within BUSTER + TNT. In Proceedings of the CCP4 study weekend. Macromolecular refinement, edited by E. Dodson, M. Moore, A. Ralph & S. Bailey, pp. 85–92. Warrington: Daresbury Laboratory.

Brünger, A. T. (1992). Free R-value: a novel statistical quantity for assessing the accuracy of crystal structures. Nature (London), 355, 472–475.

Brünger, A. T. (1993). Assessment of phase accuracy by cross validation: the free R value. Methods and application. Acta Cryst. D49, 24–36.

Chambers, J. L. & Stroud, R. M. (1979). The accuracy of refined protein structures: comparison of two independently refined models of bovine trypsin. Acta Cryst. B35, 1861–1874.

Cochran, W. (1948). The Fourier method of crystal-structure analysis. Acta Cryst. 1, 138–142.

Cohen, G. H., Sheriff, S. & Davies, D. R. (1996). Refined structure of the monoclonal antibody HyHEL-5 with its antigen hen egg-white lysozyme. Acta Cryst. D52, 315–326.

Cowtan, K. & Ten Eyck, L. F. (2000). Eigensystem analysis of the refinement of a small metalloprotein. Acta Cryst. D56, 842–856.

Cruickshank, D. W. J. (1949a). The accuracy of electron-density maps in X-ray analysis with special reference to dibenzyl. Acta Cryst. 2, 65–82.

Cruickshank, D. W. J. (1949b). The accuracy of atomic coordinates derived by least-squares or Fourier methods. Acta Cryst. 2, 154–157.

Cruickshank, D. W. J. (1952). On the relations between Fourier and least-squares methods of structure determination. Acta Cryst. 5, 511–518.

Cruickshank, D. W. J. (1956). The determination of the anisotropic thermal motion of atoms in crystals. Acta Cryst. 9, 747–753.

Cruickshank, D. W. J. (1959). Statistics. In International tables for X-ray crystallography, Vol. 2, edited by J. S. Kasper & K. Lonsdale, pp. 84–98. Birmingham: Kynoch Press.

Cruickshank, D. W. J. (1960). The required precision of intensity measurements for single-crystal analysis. Acta Cryst. 13, 774–777.

Cruickshank, D. W. J. (1965). Notes for authors; anisotropic parameters. Acta Cryst. 19, 153.

Cruickshank, D. W. J. (1970). Least-squares refinement of atomic parameters. In Crystallographic computing, edited by F. R. Ahmed, S. R. Hall & C. P. Huber, pp. 187–196. Copenhagen: Munksgaard.

Cruickshank, D. W. J. (1999). Remarks about protein structure precision. Acta Cryst. D55, 583–601.

Cruickshank, D. W. J. & Robertson, A. P. (1953). The comparison of theoretical and experimental determinations of molecular structures, with applications to naphthalene and anthracene. Acta Cryst. 6, 698–705.

Cruickshank, D. W. J. & Rollett, J. S. (1953). Electron-density errors at special positions. Acta Cryst. 6, 705–707.

Daopin, S., Davies, D. R., Schlunegger, M. P. & Grütter, M. G. (1994). Comparison of two crystal structures of TGF-β2: the accuracy of refined protein structures. Acta Cryst. D50, 85–92.

Deacon, A., Gleichmann, T., Kalb (Gilboa), A. J., Price, H., Raftery, J., Bradbrook, G., Yariv, J. & Helliwell, J. R. (1997). The structure of concanavalin A and its bound solvent determined with small-molecule accuracy at 0.94 Å resolution. J. Chem. Soc. Faraday Trans. 93, 4305–4312.

Dodson, E. (1998). The role of validation in macromolecular crystallography. Acta Cryst. D54, 1109–1118.

Engh, R. A. & Huber, R. (1991). Accurate bond and angle parameters for X-ray protein structure refinement. Acta Cryst. A47, 392–400.

Haridas, M., Anderson, B. F. & Baker, E. N. (1995). Structure of human diferric lactoferrin refined at 2.2 Å resolution. Acta Cryst. D51, 629–646.

Hendrickson, W. A. & Konnert, J. H. (1980). Incorporation of stereochemical information into crystallographic refinement. In Computing in crystallography, edited by R. Diamond, S. Ramaseshan & K. Venkatesan, pp. 13.01–13.23. Bangalore: Indian Academy of Sciences.

International Tables for Crystallography (2004). Vol. C. Mathematical, physical and chemical tables, edited by E. Prince. Dordrecht: Kluwer Academic Publishers.

Jiang, J.-S. & Brünger, A. T. (1994). Protein hydration observed by X-ray diffraction. Solvation properties of penicillopepsin and neuraminidase crystal structures. J. Mol. Biol. 243, 100–115.

Ko, T.-P., Day, J., Greenwood, A. & McPherson, A. (1994). Structures of three crystal forms of the sweet protein thaumatin. Acta Cryst. D50, 813–825.

Kobe, B. & Deisenhofer, J. (1995). A structural basis of the interactions between leucine-rich repeats and protein ligands. Nature (London), 374, 183–186.

Langridge, R., Marvin, D. A., Seeds, W. E., Wilson, H. R., Hooper, C. W., Wilkins, M. H. F. & Hamilton, L. D. (1960). The molecular configuration of deoxyribonucleic acid. II. Molecular models and their Fourier transforms. J. Mol. Biol. 2, 38–64.

Luzzati, V. (1952). Traitement statistique des erreurs dans la determination des structures cristallines. Acta Cryst. 5, 802–810.

Moews, P. C. & Kretsinger, R. H. (1975). Refinement of the structure of carp muscle calcium-binding parvalbumin by model building and difference Fourier analysis. J. Mol. Biol. 91, 201–228.

Murshudov, G. N., Vagin, A. A. & Dodson, E. J. (1997). Refinement of macromolecular structures by the maximum-likelihood method. Acta Cryst. D53, 240–255.

Murshudov, G. N., Vagin, A. A., Lebedev, A., Wilson, K. S. & Dodson, E. J. (1999). Efficient anisotropic refinement of macromolecular structures using FFT. Acta Cryst. D55, 247–255.

Pannu, N. S. & Read, R. J. (1996). Improved structure refinement through maximum likelihood. Acta Cryst. A52, 659–668.

Read, R. J. (1986). Improved Fourier coefficients for maps using phases from partial structures with errors. Acta Cryst. A42, 140–149.

Read, R. J. (1990). Structure-factor probabilities for related structures. Acta Cryst. A46, 900–912.

Rollett, J. S. (1970). Least-squares procedures in crystal structure analysis. In Crystallographic computing, edited by F. R. Ahmed, S. R. Hall & C. P. Huber, pp. 167–181. Copenhagen: Munksgaard.

Schwarzenbach, D., Abrahams, S. C., Flack, H. D., Gonschorek, W., Hahn, Th., Huml, K., Marsh, R. E., Prince, E., Robertson, B. E., Rollett, J. S. & Wilson, A. J. C. (1989). Statistical descriptors in crystallography: Report of the IUCr subcommittee on statistical descriptors. Acta Cryst. A45, 63–75.

Schwarzenbach, D., Abrahams, S. C., Flack, H. D., Prince, E. & Wilson, A. J. C. (1995). Statistical descriptors in crystallography. II. Report of a working group on expression of uncertainty in measurement. Acta Cryst. A51, 565–569.

Sevcik, J., Dauter, Z., Lamzin, V. S. & Wilson, K. S. (1996). Ribonuclease from Streptomyces aureofaciens at atomic resolution. Acta Cryst. D52, 327–344.

Sheldrick, G. M. & Schneider, T. R. (1997). SHELXL: high resolution refinement. Methods Enzymol. 277, 319–343.

Stec, B., Zhou, R. & Teeter, M. M. (1995). Full-matrix refinement of the protein crambin at 0.83 Å and 130 K. Acta Cryst. D51, 663–681.

Tickle, I. J., Laskowski, R. A. & Moss, D. S. (1998a). Error estimates of protein structure coordinates and deviations from standard geometry by full-matrix refinement of γB- and βB2-crystallin. Acta Cryst. D54, 243–252.

Tickle, I. J., Laskowski, R. A. & Moss, D. S. (1998b). R_free and the R_free ratio. I. Derivation of expected values of cross-validation residuals used in macromolecular least-squares refinement. Acta Cryst. D54, 547–557.

Tronrud, D. E. (1997). TNT refinement package. Methods Enzymol. 277, 306–319.

Tronrud, D. E. (1999). The efficient calculation of the normal matrix in least-squares refinement of macromolecular structures. Acta Cryst. A55, 700–703.

Trueblood, K. N., Bürgi, H.-B., Burzlaff, H., Dunitz, J. D., Gramaccioli, C. M., Schulz, H. H., Shmueli, U. & Abrahams, S. C. (1996). Atomic displacement parameter nomenclature. Report of a subcommittee on atomic displacement parameter nomenclature. Acta Cryst. A52, 770–781.

Usón, I., Pohl, E., Schneider, T. R., Dauter, Z., Schmidt, A., Fritz, H.-J. & Sheldrick, G. M. (1999). 1.7 Å structure of the stabilized REI_V mutant T39K. Application of local NCS restraints. Acta Cryst. D55, 1158–1167.

Watenpaugh, K. D., Sieker, L. C., Herriott, J. R. & Jensen, L. H. (1973). Refinement of the model of a protein: rubredoxin at 1.5 Å resolution. Acta Cryst. B29, 943–956.

Wilson, A. J. C. (1950). Largest likely values for the reliability index. Acta Cryst. 3, 397–398.
