International
Tables for
Crystallography
Volume F
Crystallography of biological macromolecules
Edited by M. G. Rossmann and E. Arnold

International Tables for Crystallography (2006). Vol. F. ch. 25.2, pp. 698-699

Section 25.2.1.6. Parameter refinement

W. Fureya*

25.2.1.6. Parameter refinement

| top | pdf |

Several methods are provided for refinement of heavy-atom or anomalous-scatterer parameters and scaling parameters, depending on the desired function to be minimized. In all cases, the structure factor [F_{H}] corresponding to the heavy atom or anomalous scatterer is given by equation (25.2.1.5)[link]. The options available are briefly described below.

25.2.1.6.1. Against amplitude differences

| top | pdf |

The simplest procedure is to refine against the magnitudes of isomorphous or anomalous structure-factor amplitude differences, which can be carried out with the program GREF (batch mode). In this case, one minimizes [\textstyle\sum\limits_{j} W_{j} \left(|F_{PH_{j}} - F_{P_{j}}| - F_{H_{j}}\right)^{2} \eqno(25.2.1.11)] for isomorphous-replacement data or [\textstyle\sum\limits_{j} W_{j} \left(|F_{PH_{j}}^{+} - F_{PH_{j}}^{-}| - 2F_{H_{j}}\right)^{2} \eqno(25.2.1.12)] for anomalous-scattering data with respect to the desired parameters contributing to [F_{H}], where [W_{j}] is a weighting factor. For anomalous-scattering data, only the imaginary component of the scattering factors is used during the [F_{H}] structure-factor calculation. For isomorphous-replacement data, the summation is taken only over centric reflections, plus the strongest 25% of differences for acentric reflections if insufficient centric data are present. For anomalous-scattering data, the summation is taken only over the strongest 25% of Bijvoet differences. An advantage of these methods is that only data from the derivative being refined are used (plus the native with isomorphous data), hence there is no possibility of feedback between other derivatives which may not be truly independent. A disadvantage is that, apart for the centric reflections, the target value in the minimization is only an approximation to the true [F_{H}]. The accuracy of this approximation is improved by restricting the summations to the strongest differences.

25.2.1.6.2. By minimizing lack of closure

| top | pdf |

An alternative procedure available in the program PHASIT (batch) is to refine against the observed derivative amplitudes. In this case, one minimizes the `lack of closure' (now based on amplitudes instead of intensities) with respect to the desired parameters contributing to [F_{PH}], including the derivative-to-native scaling parameters. In all cases, the calculated derivative amplitudes [F_{PH_{\rm (calc)}}] are obtained from equation (25.2.1.9)[link]. To use this procedure, one must have an estimate of the protein phase φ. Several variations of this method, all available in PHASIT, are described below and are generally referred to as `phase refinement'.

25.2.1.6.2.1. `Classical' phase refinement

| top | pdf |

With this option, one minimizes [\textstyle\sum\limits_{j} W_{j} [F_{PH_{\rm (obs)}} - F_{PH_{\rm (calc)}} (\varphi)]^{2} \eqno(25.2.1.13)] for isomorphous-replacement data or [\textstyle\sum\limits_{j} W_{j} \left\{(F_{PH}^{+} - F_{PH}^{-})_{\rm obs} - [F_{PH}^{+} (\varphi) - F_{PH}^{-} (\varphi)]_{\rm calc}\right\}^{2} \eqno(25.2.1.14)] for anomalous-scattering data with respect to the desired parameters. Typically, the weights are taken as the reciprocal of the `standard error' (expected lack of closure) or its square. The summations are taken over all reflections for which the protein phase is thought to be reasonably valid, usually implied by a figure of merit of 0.4 or higher. The protein phase estimate usually comes from the centroid of the appropriate combined phase probability distribution given by equation (25.2.1.10)[link]; however, one has the option of including all data sets when combining the distributions, or including all except that for the derivative being refined. Once new heavy-atom and scaling parameters are obtained, new individual SIR or SAS phase probability distributions are computed and combined to provide new protein phases, and these phases are used to update the standard error estimates as described earlier. Then the individual distributions are recomputed once more using the new standard error estimates, and these distributions are combined again to give new protein phase estimates. The process is then iterated using the new phases and new heavy-atom parameters to start another round of refinement. After several iterations, the heavy-atom parameters, standard error estimates and protein phase estimates converge to their final values.

25.2.1.6.2.2. Approximate-likelihood method

| top | pdf |

This variation, also available in PHASIT, is similar to the classical phase refinement described above, except that instead of using only a single value for the protein phase φ during the calculation of [F_{PH}], all possible values are considered, with each contribution weighted by the corresponding protein phase probability (Otwinowski, 1991[link]). One minimizes [\textstyle\sum\limits_{j} W_{j} \textstyle\sum\limits_{i} P_{i} [F_{PH_{\rm (obs)}} - F_{PH_{\rm (calc)}} (\varphi_{i})]^{2} \eqno(25.2.1.15)] with respect to the desired parameters for isomorphous-replacement data, where [P_{i}] is the protein phase probability and the inner summation is over all allowed protein phase values, stepped in intervals of 5° (or 180° for centric reflections). For anomalous-scattering data, a similar modification is made to equation (25.2.1.14)[link]. The weights may be as in the classical phase refinement case or unity. Since each contribution is weighted by its phase probability regardless, there is no need to use a high figure-of-merit cutoff, as was done earlier. In fact, very good results are usually obtained using unit weights for [W_{j}] (that is, only the probability weighting) and a figure-of-merit cutoff of around 0.2 for inclusion of reflections in the summations. This variation has been found to increase stability in the refinement and works considerably better than conventional phase refinement when the phase probability distributions are strongly multimodal. Parameter refinement and phasing iterations proceed as described earlier. The combination of probability weighting during refinement with probability weighting during standard error estimation enables the key features of maximum-likelihood refinement to be carried out, although only approximately.

25.2.1.6.2.3. Using external phase information

| top | pdf |

When using either the conventional phase refinement or approximate-likelihood methods, protein phase estimates are required. In the former case, only a single value is used, whereas in the latter, information about all possibilities is provided by way of the phase probability distribution. Normally, this information comes from a prior phasing calculation; thus, the estimates are typically SIR, SAS, MIR etc. phases. However, in PHASIT, an option allows one to read in the protein phase information from an external source. This enables parameter refinement (by either conventional or approximate-likelihood methods) using protein phase estimates that are improvements over the initial ones. For example, one could get the best phases by one of the previously described methods, but then improve them by density-modification procedures, such as solvent flattening or negative-density truncation and/or NC symmetry averaging. Using these improved phases in the calculation of [F_{PH}] when refining should then lead to more accurate heavy-atom and scaling parameters, which in turn will produce still better protein phases. These new protein phases can either be treated as final and used to produce an electron-density map for interpretation, or be used to initiate another round of phase improvement by density modification. There are several cases where this type of refinement has been beneficial, and it is particularly useful for the refinement of derivative-to-native scaling parameters.

25.2.1.6.3. Rigid-group refinement

| top | pdf |

Although GREF can be used to refine individual heavy-atom or anomalous-scatterer parameters against isomorphous or anomalous structure-factor difference magnitudes, it is actually a group refinement program. Thus, all entities to be refined are treated as rigid bodies such that only group orientations, positions, scaling and temperature parameters can be refined. The groups, however, can be defined arbitrarily. For individual heavy-atom sites, they are simply defined as single atom `groups', and no orientation parameters are selected for refinement. This enables the program to serve two additional roles. In the case where the heavy-atom reagent is known to contain a rigid group, it can be properly treated. Also, if one chooses the target values to be native structure-factor amplitudes instead of difference magnitudes and inputs an entire protein molecule or domain, then conventional rigid-body or segmented rigid-body refinement can be carried out. The output consists of the refined parameters and a Fourier-coefficient file suitable for map or phase combination calculations.

References

First citation Otwinowski, Z. (1991). In Proceedings of the CCP4 study weekend. Isomorphous replacement and anomalous scattering, edited by W. Wolf, P. R. Evans & A. G. W. Leslie, pp. 80–86. Warrington: Daresbury Laboratory.Google Scholar








































to end of page
to top of page