International
Tables for Crystallography Volume F Crystallography of biological macromolecules Edited by M. G. Rossmann and E. Arnold © International Union of Crystallography 2006 |
International Tables for Crystallography (2006). Vol. F. ch. 15.1, pp. 321-323
Section 15.1.5. Combining constraints for phase improvement
a
Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N., Seattle, WA 90109, USA,bDepartment of Chemistry, University of York, York YO1 5DD, England, and cDepartment of Physics, University of York, York YO1 5DD, England |
The chemical and physical information of the underlying structure that the electron density represents serves as constraints on the phases. For small molecules, the constraints of positivity and atomicity are sufficient to solve the phase problem ab initio (Hauptman, 1986; Karle, 1986
; Woolfson, 1987
), because crystals of small molecules generally diffract to atomic resolution. However, no single constraint at our disposal is powerful enough to render the macromolecular phase problem determinable, because macromolecule crystals rarely diffract to atomic resolution. Therefore, individual constraints are combined to produce a more powerful density-modification protocol. This is because these constraints represent different characteristic features of the electron density and they contain independent phasing information.
The phasing power of a method increases with the number of independent constraints employed, the number of density points affected and the amplitude of changes imposed on the electron density. It also depends on the physical nature and accuracy of the constraints and how the constraints are applied. One obvious way of implementing several constraints is to apply them one after the other to the electron density. This sequential application, although easy to implement, suffers some drawbacks. The cyclic application of all constraints may not converge easily, since some constraints may contain contradicting information as to how the density should be modified. An alternative way of implementing various constraints is simultaneous application. The density solution that satisfies all the constraints is obtained by a global minimization procedure (Main, 1990b; Zhang & Main, 1990b
).
The constraints used in SQUASH/DM can be divided into three categories. The first category comprises the linear constraints, such as solvent flatness, density histogram and equal molecules. The second category comprises the nonlinear constraints, such as the local shape of electron density as expressed in Sayre's equation. The third category comprises the available structural data, such as the observed structure-factor amplitudes and the experimental phases. The first and second categories of constraints are used to solve new electron-density values. The third category of constraints is used as a means to filter the modified phases.
The modification to the density value at a grid point by a linear constraint is independent of the values at other grid points. These constraints include solvent flattening, histogram matching and molecular averaging. These density-modification methods construct an improved map directly from an initial density map as expressed by where
is the target electron density produced by these linear constraints.
The new electron density that satisfies both the linear constraints represented by equation (15.1.5.1) and the nonlinear constraints expressed by Sayre's equation (15.1.2.31)
can be obtained by solving the systems of simultaneous equations (Zhang & Main, 1990b
)
Equation (15.1.5.2) represents a system of nonlinear simultaneous equations with as many unknowns as the number of grid points in the asymmetric unit of the map and with twice as many equations as unknowns. The functions
and
are both known. The least-squares solution, using either the full matrix or the diagonal approximation, is obtained using the Newton–Raphson technique with fast Fourier transforms, as described in the next section
(Main, 1990b
).
For a system of nonlinear equations of electron density, where
0 is a null vector, n is the number of grid points and m is the number of equations, the Newton–Raphson method of solution is to find a set of shifts,
to
, through a system of linear equations,
where J is a matrix of partial derivatives of F with respect to
and is called the Jacobian matrix,
ɛ is a vector of residuals to equation (15.1.5.3)
for a trial solution,
, and
is a vector of shifts to the density. Hence, the solution for
is achieved in an iterative manner,
Therefore, the problem of solving a system of nonlinear equations (15.1.5.3)
is transformed into solving a system of linear equations (15.1.5.4)
, which forms one cycle of Newton–Raphson iteration.
If there are more equations than unknowns , the unknowns are obtained through a least-squares solution to equations (15.1.5.4)
,
Theoretically, the above system of equations could be solved by matrix multiplication and inversion, i.e.
However, the amount of calculation involved in setting up the normal matrix of least squares is huge for the problem presented by protein structures. This can be completely avoided by using the conjugate-gradient technique for solving the system of linear equations.
The conjugate-gradient method does not require the inversion of the normal matrix, and therefore the solution to a large system of linear equations can be achieved very quickly.
Starting from a trial solution to equations (15.1.5.4), such as a null vector,
the initial residual is
and the initial search step is
The iterative process is as follows. The new shift to the density is where
and
The new residual is
where
The next search step which conjugates with the residual is
where
The process is iterated by increasing k until convergence is reached, when
The number of iterations required for an exact solution is equal to the number of unknowns, because the search vector at each step is orthogonal with all the previous steps. However, a very satisfactory solution can normally be reached after very few iterations. This makes the conjugate-gradient method a very efficient and fast procedure for solving a system of equations. Note that the normal matrix never appears explicitly, although it is implicit in (15.1.5.10) and (15.1.5.16)
. The inversion of the normal matrix and matrix multiplication is completely avoided. Most of the calculation comes from the formation of the matrix-vector products in (15.1.5.10)
, (15.1.5.14)
, and (15.1.5.16)
. These can be expressed as convolutions and can be performed using FFTs, thus saving considerably more time.
The solution to at the end of conjugate-gradient iteration is substituted into equation (15.1.5.6)
to get a new solution for
. The solution to the system of nonlinear equations (15.1.5.3)
is obtained when the Newton–Raphson iteration has reached convergence.
The equations to be solved for the electron-density shifts, , are from the Jacobian of equation (15.1.5.2)
,
where
is the residual to Sayre's equation,
and
is the residual to the linear density-modification equations,
Starting from a trial solution of
, the initial residual vector is
where
and
Thus, only three FFTs are required to calculate the initial residual. The residual of Sayre's equation is given in equation (15.1.5.23)
.
The calculation of in equation (15.1.5.14)
is achieved in a similar manner using FFTs,
where the vector is partitioned as shown above, and
Similarly, vector in equation (15.1.5.16)
is obtained from
where
is defined in equation (15.1.5.26)
.
The remaining calculations in equations (15.1.5.12), (15.1.5.13)
, (15.1.5.15)
, (15.1.5.17)
and (15.1.5.18)
require either the inner product of a pair of vectors or a linear combination of vectors, both of which are very quick to calculate. Each iteration of the conjugate gradient requires four FFTs, as described in equations (15.1.5.26
–15.1.5.29
).
The full-matrix solution to equation (15.1.5.4) requires a significant amount of computing, although it can be achieved using FFTs. The diagonal approximation to the normal matrix has been used as an alternative method of solution to the electron-density shift in equation (15.1.5.4)
(Main, 1990b
). As with the full-matrix calculation, it can be done entirely by FFTs and a linear combination of vectors.
The diagonal element of the normal matrix, , in equation (15.1.5.7)
is
The right-hand side of equation (15.1.5.7)
,
, is identical to the residual vector,
, which can be calculated from equation (15.1.5.22)
. Therefore, the solution to the electron-density shift,
, can be calculated from
Compared with the full-matrix solution, all the calculations involved in between equations (15.1.5.12) and (15.1.5.18)
and the subsequent iterations are spared in the diagonal approximation. This makes calculation by the diagonal approximation much faster than by the full-matrix method.
References




