International
Tables for Crystallography Volume F Crystallography of biological macromolecules Edited by M. G. Rossmann and E. Arnold © International Union of Crystallography 2006 |
International Tables for Crystallography (2006). Vol. F. ch. 15.1, pp. 321-323
Section 15.1.5.2. Least-squares solution to the system of nonlinear constraint equations
a
Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N., Seattle, WA 90109, USA,bDepartment of Chemistry, University of York, York YO1 5DD, England, and cDepartment of Physics, University of York, York YO1 5DD, England |
For a system of nonlinear equations of electron density, where
0 is a null vector, n is the number of grid points and m is the number of equations, the Newton–Raphson method of solution is to find a set of shifts,
to
, through a system of linear equations,
where J is a matrix of partial derivatives of F with respect to
and is called the Jacobian matrix,
ɛ is a vector of residuals to equation (15.1.5.3)
for a trial solution,
, and
is a vector of shifts to the density. Hence, the solution for
is achieved in an iterative manner,
Therefore, the problem of solving a system of nonlinear equations (15.1.5.3)
is transformed into solving a system of linear equations (15.1.5.4)
, which forms one cycle of Newton–Raphson iteration.
If there are more equations than unknowns , the unknowns are obtained through a least-squares solution to equations (15.1.5.4)
,
Theoretically, the above system of equations could be solved by matrix multiplication and inversion, i.e.
However, the amount of calculation involved in setting up the normal matrix of least squares is huge for the problem presented by protein structures. This can be completely avoided by using the conjugate-gradient technique for solving the system of linear equations.
The conjugate-gradient method does not require the inversion of the normal matrix, and therefore the solution to a large system of linear equations can be achieved very quickly.
Starting from a trial solution to equations (15.1.5.4), such as a null vector,
the initial residual is
and the initial search step is
The iterative process is as follows. The new shift to the density is where
and
The new residual is
where
The next search step which conjugates with the residual is
where
The process is iterated by increasing k until convergence is reached, when
The number of iterations required for an exact solution is equal to the number of unknowns, because the search vector at each step is orthogonal with all the previous steps. However, a very satisfactory solution can normally be reached after very few iterations. This makes the conjugate-gradient method a very efficient and fast procedure for solving a system of equations. Note that the normal matrix never appears explicitly, although it is implicit in (15.1.5.10) and (15.1.5.16)
. The inversion of the normal matrix and matrix multiplication is completely avoided. Most of the calculation comes from the formation of the matrix-vector products in (15.1.5.10)
, (15.1.5.14)
, and (15.1.5.16)
. These can be expressed as convolutions and can be performed using FFTs, thus saving considerably more time.
The solution to at the end of conjugate-gradient iteration is substituted into equation (15.1.5.6)
to get a new solution for
. The solution to the system of nonlinear equations (15.1.5.3)
is obtained when the Newton–Raphson iteration has reached convergence.
The equations to be solved for the electron-density shifts, , are from the Jacobian of equation (15.1.5.2)
,
where
is the residual to Sayre's equation,
and
is the residual to the linear density-modification equations,
Starting from a trial solution of
, the initial residual vector is
where
and
Thus, only three FFTs are required to calculate the initial residual. The residual of Sayre's equation is given in equation (15.1.5.23)
.
The calculation of in equation (15.1.5.14)
is achieved in a similar manner using FFTs,
where the vector is partitioned as shown above, and
Similarly, vector in equation (15.1.5.16)
is obtained from
where
is defined in equation (15.1.5.26)
.
The remaining calculations in equations (15.1.5.12), (15.1.5.13)
, (15.1.5.15)
, (15.1.5.17)
and (15.1.5.18)
require either the inner product of a pair of vectors or a linear combination of vectors, both of which are very quick to calculate. Each iteration of the conjugate gradient requires four FFTs, as described in equations (15.1.5.26
–15.1.5.29
).
The full-matrix solution to equation (15.1.5.4) requires a significant amount of computing, although it can be achieved using FFTs. The diagonal approximation to the normal matrix has been used as an alternative method of solution to the electron-density shift in equation (15.1.5.4)
(Main, 1990b
). As with the full-matrix calculation, it can be done entirely by FFTs and a linear combination of vectors.
The diagonal element of the normal matrix, , in equation (15.1.5.7)
is
The right-hand side of equation (15.1.5.7)
,
, is identical to the residual vector,
, which can be calculated from equation (15.1.5.22)
. Therefore, the solution to the electron-density shift,
, can be calculated from
Compared with the full-matrix solution, all the calculations involved in between equations (15.1.5.12) and (15.1.5.18)
and the subsequent iterations are spared in the diagonal approximation. This makes calculation by the diagonal approximation much faster than by the full-matrix method.
References
