International
Tables for
Crystallography
Volume F
Crystallography of biological macromolecules
Edited by M. G. Rossmann and E. Arnold

International Tables for Crystallography (2006). Vol. F. ch. 16.2, p. 348   | 1 | 2 |

Section 16.2.3. Adaptation to crystallography

G. Bricognea*

aLaboratory of Molecular Biology, Medical Research Council, Cambridge CB2 2QH, England
Correspondence e-mail: gb10@mrc-lmb.cam.ac.uk

16.2.3. Adaptation to crystallography

| top | pdf |

16.2.3.1. The random-atom model

| top | pdf |

The standard setting of probabilistic direct methods (Hauptman & Karle, 1953[link]; Bertaut, 1955a[link],b[link]; Klug, 1958[link]) uses implicitly as its starting point a source of random atomic positions. This can be described in the terms introduced in Section 16.2.2.1[link] by using a continuous alphabet [{\cal A}] whose symbols s are fractional coordinates x in the asymmetric unit of the crystal, the uniform measure μ being the ordinary Lebesgue measure [\hbox{d}^{3}{\bf x}]. A message of length N generated by that source is then a random N-equal-atom structure.

16.2.3.2. Conventional direct methods and their limitations

| top | pdf |

The traditional theory of direct methods assumes a uniform distribution q(x) of random atoms and proceeds to derive joint distributions of structure factors belonging to an N-atom random structure, using the asymptotic expansions of Gram–Charlier and Edgeworth. These methods have been described in Section 1.3.4.5.2.2[link] of IT B (Bricogne, 2001[link]) as examples of applications of Fourier transforms. The reader is invited to consult this section for terminology and notation. These joint distributions of complex structure factors are subsequently used to derive conditional distributions of phases when the amplitudes are assigned their observed values, or of a subset of complex structure factors when the others are assigned certain values. In both cases, the largest structure-factor amplitudes are used as the conditioning information.

It was pointed out by the author (Bricogne, 1984[link]) that this procedure can be problematic, as the Gram–Charlier and Edgeworth expansions have good convergence properties only in the vicinity of the expectation values of each structure factor: as the atoms are assumed to be uniformly distributed, these series afford an adequate approximation for the joint distribution [{\cal P}({\bf F})] only near the origin of structure-factor space, i.e. for small values of all the structure amplitudes. It is therefore incorrect to use these local approximations to [{\cal P}({\bf F})] near [{\bf F} = {\bf 0}] as if they were the global functional form for that function `in the large' when forming conditional probability distributions involving large amplitudes.

16.2.3.3. The notion of recentring and the maximum-entropy criterion

| top | pdf |

These limitations can be overcome by recognizing that, if the locus [{\cal T}] (a high-dimensional torus) defined by the large structure-factor amplitudes to be used in the conditioning data is too extended in structure-factor space for a single asymptotic expansion of [{\cal P}({\bf F})] to be accurate everywhere on it, then [{\cal T}] should be broken up into sub-regions, and different local approximations to [{\cal P}({\bf F})] should be constructed in each of them. Each of these sub-regions will consist of a `patch' of [{\cal T}] surrounding a point [{\bf F^{*} \neq 0}] located on [{\cal T}]. Such a point [{\bf F^{*}}] is obtained by assigning `trial' phase values to the known moduli, but these trial values do not necessarily have to be viewed as `serious' assumptions concerning the true values of the phases: rather, they should be thought of as pointing to a patch of [{\cal T}] and to a specialized asymptotic expansion of [{\cal P}({\bf F})] designed to be the most accurate approximation possible to [{\cal P}({\bf F})] on that patch. With a sufficiently rich collection of such constructs, [{\cal P}({\bf F})] can be accurately calculated anywhere on [{\cal T}].

These considerations lead to the notion of recentring . Recentring the usual Gram–Charlier or Edgeworth asymptotic expansion for [{\cal P}({\bf F})] away from [{\bf F} = {\bf 0}], by making trial phase assignments that define a point [{\bf F^{*}}] on [{\cal T}], is equivalent to using a non-uniform prior distribution of atoms q(x), reproducing the individual components of [{\bf F^{*}}] among its Fourier coefficients. The latter constraint leaves q(x) highly indeterminate, but Jaynes' argument given in Section 16.2.2.3[link] shows that there is a uniquely defined `best' choice for it: it is that distribution [q^{\rm ME}({\bf x})] having maximum entropy relative to a uniform prior prejudice m(x), and having the corresponding values [{\bf U^{*}}] of the unitary structure factors for its Fourier coefficients. This distribution has the unique property that it rules out as few random structures as possible on the basis of the limited information available in [{\bf F^{*}}].

In terms of the statistical mechanical language used in Section 16.2.1[link], the trial structure-factor values [{\bf F^{*}}] used as constraints would be the macroscopic quantities that can be controlled externally; while the 3N atomic coordinates would be the internal degrees of freedom of the system, whose entropy should be a maximum under these macroscopic constraints.

16.2.3.4. The crystallographic maximum-entropy formalism

| top | pdf |

It is possible to solve explicitly the maximum-entropy equations (ME1) to (ME3) derived in Section 16.2.2.4[link] for the crystallographic case that has motivated this study, i.e. for the purpose of constructing [q^{\rm ME}({\bf x})] from the knowledge of a set of trial structure-factor values [{\bf F^{*}}]. These derivations are given in §3.4 and §3.5 of Bricogne (1984)[link]. Extensive relations with the algebraic formalism of traditional direct methods are exhibited in §4, and connections with the theory of determinantal inequalities and with the maximum-determinant rule of Tsoucaris (1970)[link] are studied in §6, of the same paper. The reader interested in these topics is invited to consult this paper, as space limitations preclude their discussion in the present chapter.

16.2.3.5. Connection with the saddlepoint method

| top | pdf |

The saddlepoint method constitutes an alternative approach to the problem of evaluating the joint probability [{\cal P}({\bf F^{*}})] of structure factors when some of the moduli in [{\bf F^{*}}] are large. It is shown in §5 of Bricogne (1984)[link], and in more detail in Section 1.3.4.5.2.2[link] of Chapter 1.3 of IT B (Bricogne, 2001[link]), that there is complete equivalence between the maximum-entropy approach to the phase problem and the classical probabilistic approach by the method of joint distributions, provided the latter is enhanced by the adoption of the saddlepoint approximation.

References

First citation Bertaut, E. F. (1955a). La méthode statistique en cristallographie. I. Acta Cryst. 8, 537–543.Google Scholar
First citation Bertaut, E. F. (1955b). La méthode statistique en cristallographie. II. Quelques applications. Acta Cryst. 8, 544–548.Google Scholar
First citation Bricogne, G. (1984). Maximum entropy and the foundations of direct methods. Acta Cryst. A40, 410–445.Google Scholar
First citation Bricogne, G. (2001). Fourier transforms in crystallography: theory, algorithms and applications. In International tables for crystallography, Vol. B. Reciprocal space, edited by U. Shmueli, 2nd ed., ch. 1.3. Dordrecht: Kluwer Academic Publishers.Google Scholar
First citation Hauptman, H. & Karle, J. (1953). The solution of the phase problem: I. The centrosymmetric crystal. ACA Monograph No. 3. Pittsburgh: Polycrystal Book Service.Google Scholar
First citation Klug, A. (1958). Joint probability distribution of structure factors and the phase problem. Acta Cryst. 11, 515–543.Google Scholar
First citation Tsoucaris, G. (1970). A new method of phase determination. The `maximum determinant rule'. Acta Cryst. A26, 492–499.Google Scholar








































to end of page
to top of page