International
Tables for
Crystallography
Volume B
Reciprocal space
Edited by U. Shmueli

International Tables for Crystallography (2006). Vol. B. ch. 4.5, pp. 478-479   | 1 | 2 |

Section 4.5.2.6.6. Multidimensional isomorphous replacement

R. P. Millanea*

4.5.2.6.6. Multidimensional isomorphous replacement

| top | pdf |

At low enough resolution, only one Fourier–Bessel structure factor contributes on each layer line of a fibre diffraction pattern, so that only the phase needs to be determined and the situation is no different to that in protein crystallography. If heavy-atom-derivative specimens can be prepared, the usual method of multiple isomorphous replacement (MIR) (Drenth, 1994[link]) can be applied, which in principle requires only two heavy-atom derivatives. At higher resolution, however, more than one Fourier–Bessel structure factor contributes on each layer line. A generalized form of isomorphous replacement which involves using diffraction data from several heavy-atom derivatives to determine the real and imaginary components of each contributing [G_{nl} (R)] is referred to as multidimensional isomorphous replacement (MDIR) (Namba & Stubbs, 1985[link]). MDIR was first described and used to determine the structure of TMV at 6.7 Å resolution (Stubbs & Diamond, 1975[link]; Holmes et al., 1975[link]), and has since been used to extend the resolution to 2.9 Å (Namba, Pattanayek & Stubbs, 1989[link]). A consequence of cylindrical averaging is that large numbers of heavy-atom derivatives are required: at least two for each Bessel term to be separated. The theory of MDIR is outlined here.

The first step in MDIR is location of the heavy atoms in the derivative structures. The radial coordinate of a heavy atom can be determined by analysis of the intensity distribution in the low-resolution region of the equator where only the [G_{00} (R)] Bessel term contributes. Since [G_{00} (R)] is real, and [I_{l} (R)] can be measured continuously in R, inspection of the positions of the minima and maxima in the low-resolution region of the equator generally allows the sign of [G_{00} (R)] to be assigned to [I_{0}^{1/2} (R)], i.e. [G_{00} (R)] can be determined from [I_{0} (R)]. If the sign is determined for both the native and a heavy-atom derivative, referring to equation (4.5.2.13)[link] shows that [G_{00}^{D} (R) - G_{00} (R) = o_{h} f_{h} J_{0} (2\pi Rr_{h}), \eqno(4.5.2.68)] where [G_{00}^{D} (R)] is the value derived from the derivative data, o denotes the occupancy and the subscript h denotes values for the heavy atom. The parameters [o_{h}] and [r_{h}] on the right-hand side of equation (4.5.2.68)[link] can be searched in a trial-and-error fashion to obtain the best agreement with the left-hand side (calculated from the data) to determine the radial coordinate [r_{h}] of the heavy atom (Mandelkow & Holmes, 1974[link]). Lobert et al. (1987)[link] applied the same method to cucumber green mottle mosaic virus (CGMMV), except that the sign of [G_{00} (R)] was taken from that of TMV.

Two approaches have been used to determine the angular and axial coordinates of the heavy atom. Mandelkow & Holmes (1974)[link] and Holmes et al. (1975)[link] used a search procedure in which the quantity [\Phi = -n\varphi_{h} + 2\pi lz_{h}/c] is varied and used to calculate the intensity of the Fourier–Bessel structure factor for the heavy atom alone. This is compared to [I_{l}^{D} (R) - I_{l} (R)] on each layer line, where only one Bessel order contributes, and Φ chosen to minimize the mean-square difference. The values of Φ found for each layer line can then be combined to determine [\varphi_{h}] and [z_{h}]. In the case of CGMMV, Lobert et al. (1987)[link] used the phases and Bessel-order separations from TMV to calculate Fourier–Bessel difference maps between the native and derivative data to determine the heavy-atom coordinates [(r_{h}, \varphi_{h}, z_{h})].

Consider a set of isomorphous heavy-atom derivatives indexed by j. Since the analysis is applied at any point [(l, R)] on the fibre diffraction pattern, the symbol [G_{n}] will be used for [G_{nl} (R)] where no confusion arises. Denote by [G_{n, \,  j}] the value of [G_{n}] for the jth derivative, so that [G_{n, \,  j} = G_{n} + g_{n, \,  j}, \eqno(4.5.2.69)] where [g_{n, \,  j}] denotes the Fourier–Bessel structure factor of a structure containing the heavy atom only. Denote by [A_{n}] and [B_{n}] the real and imaginary parts, respectively, of [G_{n}] (for the native structure), and by [a_{n, \,  j}] and [b_{n, \,  j}] the real and imaginary parts of [g_{n, \,  j}], i.e. for the jth heavy-atom structure alone. Equation (4.5.2.17)[link] can then be written as [I = {\textstyle\sum\limits_{n}} (A_{n}^{2} + B_{n}^{2}) \eqno(4.5.2.70)] for the native and [I_{j} = {\textstyle\sum\limits_{n}} [(A_{n} + a_{n, \,  j})^{2} + (B_{n} + b_{n, \,  j})^{2}] \eqno(4.5.2.71)] for the jth derivative. If intensity data are available from J heavy-atom derivatives, [a_{n, \,  j}] and [b_{n, \,  j}] can be calculated from the heavy-atom positions, and equations (4.5.2.70)[link] and (4.5.2.71)[link] represent a system of [J + 1] second-order equations for the m unknowns [A_{n}] and [B_{n}]. If [J + 1  \gt  m], then the system of equations is overdetermined and can be solved for the [A_{n}] and [B_{n}]. The solution of this nonlinear system can be eased by deriving a system of linear equations by substituting from (4.5.2.70)[link] into (4.5.2.71)[link], giving [{{\textstyle\sum\limits_{n}} (A_{n} a_{n, \,  j} + B_{n} b_{n, \,  j}) = (1/2) \left[I_{j} - I - {\textstyle\sum\limits_{n}} (a_{n, \,  j}^{2} + b_{n, \,  j}^{2})\right].} \eqno(4.5.2.72)] Equation (4.5.2.72)[link] is a system of linear equations for the unknowns [A_{n}] and [B_{n}], the solution being subject to the constraint equation (4.5.2.70)[link]. However, since the original problem is second-order, there may be up to m local minima. Stubbs & Diamond (1975)[link] describe a numerical procedure for locating all the local minima and selecting the best of these based on `continuity' of the [G_{nl} (R)]. This method was used to determine the structure of TMV at 6.7 Å resolution (Holmes et al., 1975[link]) and 4 Å resolution (Stubbs et al., 1977[link]). In current applications of MDIR a more direct solution technique is used in which the phase-determining equations (4.5.2.70)[link] and (4.5.2.71)[link] are solved by first solving the linear equations (4.5.2.72)[link] by linear least squares to obtain an approximate solution, which is then refined by solving the quadratic equations (4.5.2.70)[link] and (4.5.2.71)[link] directly using nonlinear least squares (Namba & Stubbs, 1985[link]).

The number of heavy-atom derivatives required can be quite demanding experimentally, although phasing with fewer heavy-atom derivatives is possible, particularly if additional information is available, such as from a related structure. The different Bessel terms may be assumed to contribute the same amplitude each, or, if the structure of a related molecule is known, the ratios of the amplitudes can be taken as being the same as those for the related molecule. Using the amplitude estimates derived using either of these two approaches, applied to both native and derivative data, the phases of the Bessel terms can be estimated using conventional MIR and data from at least two heavy-atom derivatives, allowing an initial electron-density map to be calculated. If only one heavy-atom derivative is available then two phase solutions are obtained, but the method of conventional single isomorphous replacement (SIR) (Drenth, 1994[link]) can be used to obtain an estimate of the electron density. The electron density obtained by MIR, and particularly by SIR, in this way tends to be noisy and low contrast as a result of inaccurate division of the intensities, as well as the usual sources of errors in MIR. The electron density can, however, be improved using solvent levelling. If no heavy-atom derivatives are available, both the relative amplitudes and the phases can be based on those of a related structure. Model bias can, however, be more serious than in conventional crystallography since both the phases and the relative amplitudes are based on the model.

The feasibility of structure determination with a limited number of heavy-atom derivatives was first demonstrated by Namba & Stubbs (1987b)[link] using data from TMV at 4 Å resolution. The structure of CGMMV has been determined at 5 Å resolution using data from two heavy-atom derivatives and the techniques described above (Lobert et al., 1987[link]; Lobert & Stubbs, 1990[link]). Structure determination at this resolution using MDIR would theoretically require six heavy-atom derivatives. Initial separation of the Bessel-term amplitudes was based on the equal-amplitude assumption and also on the relative amplitudes for (homologous) TMV.

In general, the equal-amplitude assumption appears to produce reliable electron-density maps where only two or three Bessel terms contribute. The corresponding resolution depends on the helix symmetry and the molecular diameter, but can be relatively high for molecules with high helix symmetry. At higher resolution where more Bessel terms contribute, use of related or partial structures can be used to calculate initial Bessel-term amplitudes and can lead to successful phasing.

If the molecule has only approximate helix symmetry, then layer-line splitting (Section 4.5.2.3.3[link]) can provide additional information which reduces the number of heavy-atom derivatives required. The degree of splitting is usually significantly less than the breadth of the layer lines so that the different Bessel terms within a (split) layer line overlap. The effect of splitting can be observed, however, since the centre of a layer line, at a particular value of R, is shifted towards the position of the stronger Bessel term contributing at that radius. The shift depends on the relative magnitudes of the contributing Bessel terms, and can be measured and used in phase determination as detailed by Stubbs & Makowski (1982)[link]. If P of the heavy-atom derivatives (in addition to the native) give accurate splitting information, then an additional P linear equations [analogous to equation (4.5.2.72)[link]] and one quadratic equation [analogous to equation (4.5.2.70)[link]] are available for solution of the phase problem, and the number of heavy-atom derivatives required is reduced by a factor of up to two. The value of layer-line splitting was first demonstrated by recalculating an electron-density map of TMV at 6.7 Å resolution using only two derivatives, rather than using six derivatives without the use of splitting data (Stubbs & Makowski, 1982[link]). Layer-line splitting was subsequently used in a structure determination of TMV at 3.6 Å resolution (Namba & Stubbs, 1985[link]).

Macromolecular fibre structures that have been built into an electron-density map have been refined using both restrained least-squares (RLS) and molecular-dynamics (MD) refinements. Restrained least squares has been used to refine the structure of TMV at 2.9 Å resolution (Namba, Pattanayek & Stubbs, 1989[link]); however, Wang & Stubbs (1993)[link] have shown that a larger radius of convergence is obtained using MD refinement (as in protein crystallography).

Molecular-dynamics refinement in fibre diffraction has been implemented by adding a fibre diffraction option (Wang & Stubbs, 1993[link]) to the X-PLOR program (Brünger, 1992[link]). This involves including the cylindrically averaged fibre diffraction intensities in the energy term and taking account of the inter-helical subunit contacts and covalent connections in the same way as described above for RLS refinement. The effective potential-energy function E used is [E = E_{e} + S {\textstyle\sum\limits_{l}} {\textstyle\sum\limits_{i}} w_{li} \{[I_{l}^{o} (R_{i})]^{1/2} - k [I_{l}^{c} (R_{i})]^{1/2}\}^{2}, \eqno(4.5.2.73)] where [E_{e}] is the empirical energy function (which typically includes bond-length, bond-angle and torsion-angle distortions, van der Waals and electrostatic interactions, and other terms such as ring planarity), [I_{l}^{o} (R_{i})] and [I_{l}^{c} (R_{i})] are the observed and calculated, respectively, cylindrically averaged diffraction intensities sampled at [R = R_{i}], the [w_{li}] are weights for the observed intensities [I_{l}^{o} (R_{i})] and k is a scale factor between the calculated and observed data. The quantity S is a weight to make the gradients of the two terms in equation (4.5.2.73)[link] comparable (Wang & Stubbs, 1993[link]), and can be estimated using the method of Brünger (1992)[link]. Molecular-dynamics refinement has been successfully used to refine the structure of CGMMV at 3.4 Å resolution (Wang & Stubbs, 1994[link]). In the case of ribgrass mosaic virus (RMV), the close isomorphism with TMV (identical helix symmetry, similar repeat distance, significant sequence homology and similar diffraction pattern) allowed an initial model to be built based on the TMV structure, and a solution obtained at 2.9 Å by alternating molecular-dynamics refinement with difference-map and omit-map calculations (Wang et al., 1997[link]).

References

First citation Brünger, A. T. (1992). X-PLOR. Version 3.1. New Haven: Yale University Press.Google Scholar
First citation Drenth, J. (1994). Principles of protein X-ray crystallography. New York: Springer-Verlag.Google Scholar
First citation Holmes, K. C., Stubbs, G. J., Mandelkow, E. & Gallwitz, U. (1975). Structure of tobacco mosaic virus at 6.7 Å resolution. Nature (London), 254, 192–196.Google Scholar
First citation Lobert, S., Heil, P. D., Namba, K. & Stubbs, G. (1987). Preliminary X-ray fibre diffraction studies of cucumber green mottle mosaic virus, watermelon strain. J. Mol. Biol. 196, 935–938.Google Scholar
First citation Lobert, S. & Stubbs, G. (1990). Fibre diffraction analysis of cucumber green mottle mosaic virus using limited numbers of heavy-atom derivatives. Acta Cryst. A46, 993–997.Google Scholar
First citation Mandelkow, E. & Holmes, K. C. (1974). The positions of the N-terminus and residue 68 in tobacco mosaic virus. J. Mol. Biol. 87, 265–273.Google Scholar
First citation Namba, K., Pattanayek, R. & Stubbs, G. J. (1989). Visualization of protein–nucleic acid interactions in a virus. Refined structure of intact tobacco mosaic virus at 2.9 Å resolution by X-ray fibre diffraction. J. Mol. Biol. 208, 307–325.Google Scholar
First citation Namba, K. & Stubbs, G. (1985). Solving the phase problem in fibre diffraction. Application to tobacco mosaic virus at 3.6 Å resolution. Acta Cryst. A41, 252–262.Google Scholar
First citation Namba, K. & Stubbs, G. (1987b). Isomorphous replacement in fibre diffraction using limited numbers of heavy-atom derivatives. Acta Cryst. A43, 64–69.Google Scholar
First citation Stubbs, G., Warren, S. & Holmes, K. (1977). Structure of RNA and RNA binding site in tobacco mosaic virus from a 4 Å map calculated from X-ray fibre diagrams. Nature (London), 267, 216–221.Google Scholar
First citation Stubbs, G. J. & Diamond, R. (1975). The phase problem for cylindrically averaged diffraction patterns. Solution by isomorphous replacement and application to tobacco mosaic virus. Acta Cryst. A31, 709–718.Google Scholar
First citation Stubbs, G. J. & Makowski, L. (1982). Coordinated use of isomorphous replacement and layer-line splitting in the phasing of fibre diffraction data. Acta Cryst. A38, 417–425.Google Scholar
First citation Wang, H., Culver, J. N. & Stubbs, G. (1997). Structure of ribgrass mosaic virus at 2.9 Å resolution: evolution and taxonomy of tobamoviruses. J. Mol. Biol. 269, 769–779.Google Scholar
First citation Wang, H. & Stubbs, G. (1993). Molecular dynamics refinement against fibre diffraction data. Acta Cryst. A49, 504–513.Google Scholar
First citation Wang, H. & Stubbs, G. J. (1994). Structure determination of cucumber green mottle mosaic virus by X-ray fibre diffraction. Significance for the evolution of tobamoviruses. J. Mol. Biol. 239, 371–384.Google Scholar








































to end of page
to top of page