International
Tables for Crystallography Volume F Crystallography of biological macromolecules Edited by M. G. Rossmann and E. Arnold © International Union of Crystallography 2006 |
International Tables for Crystallography (2006). Vol. F. ch. 16.2, p. 348
Section 16.2.3. Adaptation to crystallography
aLaboratory of Molecular Biology, Medical Research Council, Cambridge CB2 2QH, England |
The standard setting of probabilistic direct methods (Hauptman & Karle, 1953; Bertaut, 1955a
,b
; Klug, 1958
) uses implicitly as its starting point a source of random atomic positions. This can be described in the terms introduced in Section 16.2.2.1
by using a continuous alphabet
whose symbols s are fractional coordinates x in the asymmetric unit of the crystal, the uniform measure μ being the ordinary Lebesgue measure
. A message of length N generated by that source is then a random N-equal-atom structure.
The traditional theory of direct methods assumes a uniform distribution q(x) of random atoms and proceeds to derive joint distributions of structure factors belonging to an N-atom random structure, using the asymptotic expansions of Gram–Charlier and Edgeworth. These methods have been described in Section 1.3.4.5.2.2
of IT B (Bricogne, 2001
) as examples of applications of Fourier transforms. The reader is invited to consult this section for terminology and notation. These joint distributions of complex structure factors are subsequently used to derive conditional distributions of phases when the amplitudes are assigned their observed values, or of a subset of complex structure factors when the others are assigned certain values. In both cases, the largest structure-factor amplitudes are used as the conditioning information.
It was pointed out by the author (Bricogne, 1984) that this procedure can be problematic, as the Gram–Charlier and Edgeworth expansions have good convergence properties only in the vicinity of the expectation values of each structure factor: as the atoms are assumed to be uniformly distributed, these series afford an adequate approximation for the joint distribution
only near the origin of structure-factor space, i.e. for small values of all the structure amplitudes. It is therefore incorrect to use these local approximations to
near
as if they were the global functional form for that function `in the large' when forming conditional probability distributions involving large amplitudes.
These limitations can be overcome by recognizing that, if the locus (a high-dimensional torus) defined by the large structure-factor amplitudes to be used in the conditioning data is too extended in structure-factor space for a single asymptotic expansion of
to be accurate everywhere on it, then
should be broken up into sub-regions, and different local approximations to
should be constructed in each of them. Each of these sub-regions will consist of a `patch' of
surrounding a point
located on
. Such a point
is obtained by assigning `trial' phase values to the known moduli, but these trial values do not necessarily have to be viewed as `serious' assumptions concerning the true values of the phases: rather, they should be thought of as pointing to a patch of
and to a specialized asymptotic expansion of
designed to be the most accurate approximation possible to
on that patch. With a sufficiently rich collection of such constructs,
can be accurately calculated anywhere on
.
These considerations lead to the notion of recentring . Recentring the usual Gram–Charlier or Edgeworth asymptotic expansion for away from
, by making trial phase assignments that define a point
on
, is equivalent to using a non-uniform prior distribution of atoms q(x), reproducing the individual components of
among its Fourier coefficients. The latter constraint leaves q(x) highly indeterminate, but Jaynes' argument given in Section 16.2.2.3
shows that there is a uniquely defined `best' choice for it: it is that distribution
having maximum entropy relative to a uniform prior prejudice m(x), and having the corresponding values
of the unitary structure factors for its Fourier coefficients. This distribution has the unique property that it rules out as few random structures as possible on the basis of the limited information available in
.
In terms of the statistical mechanical language used in Section 16.2.1, the trial structure-factor values
used as constraints would be the macroscopic quantities that can be controlled externally; while the 3N atomic coordinates would be the internal degrees of freedom of the system, whose entropy should be a maximum under these macroscopic constraints.
It is possible to solve explicitly the maximum-entropy equations (ME1) to (ME3) derived in Section 16.2.2.4 for the crystallographic case that has motivated this study, i.e. for the purpose of constructing
from the knowledge of a set of trial structure-factor values
. These derivations are given in §3.4 and §3.5 of Bricogne (1984)
. Extensive relations with the algebraic formalism of traditional direct methods are exhibited in §4, and connections with the theory of determinantal inequalities and with the maximum-determinant rule of Tsoucaris (1970)
are studied in §6, of the same paper. The reader interested in these topics is invited to consult this paper, as space limitations preclude their discussion in the present chapter.
The saddlepoint method constitutes an alternative approach to the problem of evaluating the joint probability of structure factors when some of the moduli in
are large. It is shown in §5 of Bricogne (1984)
, and in more detail in Section 1.3.4.5.2.2
of Chapter 1.3 of IT B (Bricogne, 2001
), that there is complete equivalence between the maximum-entropy approach to the phase problem and the classical probabilistic approach by the method of joint distributions, provided the latter is enhanced by the adoption of the saddlepoint approximation.
References






