Averaging

Zhang, K. Y. J.; Cowtan, K. D.; Main, P.

doi:10.1107/97809553602060000687

International
Tables for
Crystallography
Volume F
Crystallography of biological macromolecules
Edited by M. G. Rossmann and E. Arnold

pdf | chapter contents | chapter index | related articles

International Tables for Crystallography (2006). Vol. F. ch. 15.1, pp. 316-317 | 1 | 2 |

Section 15.1.2.3. Averaging

K. Y. J. Zhang,^a K. D. Cowtan^b ^* and P. Main^c

^a Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N., Seattle, WA 90109, USA,^bDepartment of Chemistry, University of York, York YO1 5DD, England, and ^cDepartment of Physics, University of York, York YO1 5DD, England
Correspondence e-mail: cowtan+email@ysbl.york.ac.uk

15.1.2.3. Averaging

| top | pdf |

The averaging method enforces the equivalence of electron-density values between grid points in the map related by noncrystallographic symmetry. The averaging procedure can filter noise, correct systematic error and even determine the phases ab initio in favourable cases (Chapman et al., 1992; Tsao et al., 1992).

15.1.2.3.1. Introduction

| top | pdf |

Noncrystallographic symmetry (NCS) arises in crystals when there are two or more of the same molecules in one asymmetric unit. Such symmetries are local, since they only apply within a sub-region of a single unit cell. A fivefold axis, for example, must be noncrystallographic, since it is not possible to tessellate objects with fivefold symmetry. Since the symmetry does not map the crystal lattice back onto itself, the individual molecules that are related by the noncrystallographic symmetry will be in different environments; therefore, the symmetry relationships are only approximate.

Noncrystallographic symmetries provide phase information by the following means. Firstly, the related regions of the map may be averaged together, increasing the ratio of signal to noise in the map. Secondly, since the asymmetric unit must be proportionally larger to hold multiple copies of the molecule, the number of independent diffraction amplitudes available at any resolution is also proportionally larger. This redundancy in sampling the molecular transform leads to additional phase information which can be used for phase improvement.

15.1.2.3.2. The determination of noncrystallographic symmetry

| top | pdf |

The self-rotation symmetry is now routinely solved by the use of a Patterson rotation function (Rossmann & Blow, 1962). The translation symmetry can be determined by a translation function (Crowther & Blow, 1967) when a search model, either an approximate structure of the protein to be determined or the structure of a homologous protein, is available. The searches of the Patterson rotation and translation functions are achieved typically using fast automatic methods, such as X-PLOR (Brünger et al., 1987) or AMoRe (Navaza, 1994). In cases where no search model is available or the Patterson translation function is unsolvable, either the whole electron-density map, or a region which is expected to contain a molecule, may be rotated using the rotation solution and used as a search model in a phased translation function (Read & Schierbeek, 1988).

Once the averaging operators are determined, the mask can be determined using the local density correlation function as developed by Vellieux et al. (1995). This is achieved by a systematic search for extended peaks in the local density correlation, which must be carried out over a volume of several unit cells in order to guarantee finding the whole molecule. The local correlation function distinguishes those volumes of crystal space which map onto similar density under transformation by the averaging operator. Thus, in the case of improper NCS, a local correlation mask will cover only one monomer. In the case of a proper symmetry, a local correlation mask will cover the whole complex (Fig. 15.1.2.4a,b).

Figure 15.1.2.4| top | pdf |

Types of noncrystallographic symmetry and averaging calculation.

Special cases arise when there are combinations of crystallographic and noncrystallographic symmetries, of proper and improper symmetries, or when a noncrystallographic symmetry element maps a cell edge onto itself. In the latter case, the volume of matching density is infinite, and arbitrary limits must be placed upon the mask along one crystal axis.

15.1.2.3.3. The refinement of noncrystallographic symmetry

| top | pdf |

The initial NCS operation obtained from rotation and translation functions or heavy-atom positions can be fine-tuned by a density-space R-factor search in the six-dimensional rotation and translation space. The density-space R factor is defined as $[R = {\textstyle\sum\limits_{\bf r}} | \rho({\bf r}) - \rho({\bf r}') |\big/{\textstyle\sum\limits_{\bf r}} | \rho({\bf r}) + \rho({\bf r}')|, \eqno(15.1.2.24)]$ where $[{\bf r} = \{xyz\}]$ is the set of Cartesian coordinates, $[{\bf r}' = \Omega{\bf r}]$ is the NCS-related set of coordinates of r and Ω represents the NCS operator.

The six-dimensional search is very time-consuming. The search rate can be increased by using only a representative subset of grid points. The NCS operation is systematically altered to find the lowest density-space R factor for the selected subset of grid points.

The solution of the NCS operation from the six-dimensional search can be further refined by the following least-squares procedure. If $[\rho({\bf r})]$ is related to $[\rho({\bf r}')]$ by the NCS operation, Ω, $[\rho({\bf r}') = \rho(\Omega{\bf r}). \eqno(15.1.2.25)]$ Here, Ω is a function of $[\omega, \Omega = f(\omega)]$ , where $[\omega = \{\alpha, \beta, \gamma, t_{x},t_{y},t_{z}\}]$ represents the rotation and translation components of the NCS operation. The solution to the NCS parameters, ω, can be obtained by minimizing the density residual between the NCS-related molecules, $[\varepsilon ({\bf r}) = \rho({\bf r}) - \rho(\Omega{\bf r}), \eqno(15.1.2.26)]$ using a least-squares formula of the form $[\left({\partial \rho \over \partial \omega}\right)^{T} \left({\partial \rho \over \partial \omega}\right)\Delta \omega = \left({\partial \rho \over \partial \omega}\right)^{T} \varepsilon ({\bf r}), \eqno(15.1.2.27)]$ where Δω is the shift to the NCS parameters. Here, $[{\partial \rho \over \partial \omega} = {\partial \rho \over \partial {\bf r}} {\partial {\bf r} \over \partial \omega}. \eqno(15.1.2.28)]$ The partial derivatives, $[\partial \rho/\partial {\bf r} = \{\partial \rho/\partial x, \ \partial \rho/\partial y, \ \partial \rho/\partial z\}]$ , can be calculated by Fourier transforms, $[\eqalign{{\partial \rho \over \partial x} &= - {2\pi i \over V} {\sum\limits_{hkl}} hF_{hkl} \exp [- 2\pi i(hx + ky + lz)]\cr {\partial \rho \over \partial y} &= - {2\pi i \over V} {\sum\limits_{hkl}} kF_{hkl} \exp [- 2\pi i(hx + ky + lz)]\cr {\partial \rho \over \partial z} &= - {2\pi i \over V} {\sum\limits_{hkl}} lF_{hkl} \exp [- 2\pi i(hx + ky + lz)],} \eqno(5.1.2.29)]$ or more efficiently with a single Fourier transform by the use of spectral B-splines (Cowtan & Main, 1998). $[\partial {\bf r}/\partial \omega]$ is derived analytically based on the relationship between the Cartesian coordinates, r, and the rotational and translational coordinates of the NCS operation, ω, $[\left(\matrix{x'\cr y'\cr z'\cr}\right) = \left(\matrix{\cos \alpha \cos \beta \cos \gamma - \sin \alpha \sin \gamma &- \cos \alpha \cos \beta \sin \gamma - \sin \alpha \sin \gamma &\cos \alpha \sin \beta\cr \sin \alpha \cos \beta \cos \gamma + \cos \alpha \sin \gamma &- \sin \alpha \cos \beta \sin \gamma + \cos \alpha \cos \gamma &\sin \alpha \sin \beta\cr - \sin \beta \cos \gamma &\sin \beta \sin \gamma &\cos \beta\cr}\right) \left(\matrix{x\cr y\cr z\cr}\right) + \left(\matrix{t_{x}\cr t_{y}\cr t_{z}}\right). \eqno(15.1.2.30)]$

15.1.2.3.4. The averaging of NCS-related molecules

| top | pdf |

Once the mask and matrices are determined, the electron-density map may be modified by averaging. This can be achieved in one or two stages: The density for each copy of the molecule in the asymmetric unit may be replaced by the averaged density from every copy; however, this becomes slow for high-order NCS (Fig. 15.1.2.4c). Alternatively, a single averaged copy of the molecule may be created in an artificial cell [referred to by Rossmann et al. (1992) as an H-cell], and then each copy of the molecule may be reconstructed in the asymmetric unit from this copy (Fig. 15.1.2.4d). This is more efficient for high-order NCS, but additional errors are introduced in the second interpolation.

Interpolation of electron-density values at non-map grid sites is usually required, since the NCS operators will not normally map grid points onto each other. To obtain accurate interpolated values, either a fine grid or a complex interpolation function are required; suitable functions are described in Bricogne (1974) and Cowtan & Main (1998). Solvent flattening and histogram matching are frequently applied after averaging, since histogram matching tends to correct for any smoothing introduced by density interpolation.

In the case of flexible proteins, it may be necessary to average only part of the molecule, in which case the averaging mask will exclude some parts of the unit cell which are indicated as protein by the solvent mask. In other cases, it may be necessary to apply multi-domain averaging; in this case, the protein is divided into rigid domains which can appear in differing orientations. Each domain must then have a separate mask and set of averaging matrices.

Averaging may also be performed across similar molecules in multiple crystal forms (Schuller, 1996); in this case, density modification is performed on each crystal form simultaneously, with averaging of the molecular density across all copies of the molecule in all crystal forms. This is a powerful technique for phase improvement, even when no phasing is available in some crystal forms.

References

Bricogne, G. (1974). Geometric sources of redundancy in intensity data and their use for phase determination. Acta Cryst. A30, 395–405.Google Scholar

Brünger, A. T., Kuriyan, J. & Karplus, M. (1987). Crystallographic R factor refinement by molecular dynamics. Science, 235, 458–460.Google Scholar

Chapman, M. S., Tsao, J. & Rossmann, M. G. (1992). Ab initio phase determination for spherical viruses: parameter determination for spherical-shell models. Acta Cryst. A48, 301–312.Google Scholar

Cowtan, K. D. & Main, P. (1998). Miscellaneous algorithms for density modification. Acta Cryst. D54, 487–493.Google Scholar

Crowther, R. A. & Blow, D. M. (1967). A method of positioning a known molecule in an unknown crystal structure. Acta Cryst. 23, 544–548.Google Scholar

Navaza, J. (1994). AMoRe: an automated package for molecular replacement. Acta Cryst. A50, 157–163.Google Scholar

Read, R. J. & Schierbeek, A. J. (1988). A phased translation function. J. Appl. Cryst. 21, 490–495.Google Scholar

Rossmann, M. G. & Blow, D. M. (1962). The detection of sub-units within the crystallographic asymmetric unit. Acta Cryst. 15, 24–31.Google Scholar

Rossmann, M. G., McKenna, R., Tong, L., Xia, D., Dai, J.-B., Wu, H., Choi, H.-K. & Lynch, R. E. (1992). Molecular replacement real-space averaging. J. Appl. Cryst. 25, 166–180.Google Scholar

Schuller, D. J. (1996). MAGICSQUASH: more versatile non-crystallographic averaging with multiple constraints. Acta Cryst. D52, 425–434.Google Scholar

Tsao, J., Chapman, M. S. & Rossmann, M. G. (1992). Ab initio phase determination for viruses with high symmetry: a feasibility study. Acta Cryst. A48, 293–301.Google Scholar

Vellieux, F. M. D. A. P., Hunt, J. F., Roy, S. & Read, R. J. (1995). DEMON/ANGEL: a suite of programs to carry out density modification. J. Appl. Cryst. 28, 347–351.Google Scholar

International Tables for Crystallography (2006). Vol. F. ch. 15.1, pp. 316-317