International
Tables for
Crystallography
Volume F
Crystallography of biological macromolecules
Edited by M. G. Rossmann and E. Arnold

International Tables for Crystallography (2006). Vol. F. ch. 25.2, pp. 735-736   | 1 | 2 |

Section 25.2.10.3. Heavy-atom location using SHELXS and SHELXD

G. M. Sheldricku*

25.2.10.3. Heavy-atom location using SHELXS and SHELXD

| top | pdf |

One might expect that a small-molecule direct-methods program, such as SHELXS (Sheldrick, 1990[link]), that routinely solves structures with 20–100 unique atoms in a few minutes or even seconds of computer time would have no difficulty in locating a handful of heavy-atom sites from isomorphous or anomalous ΔF data. However, such data can be very noisy, and a single seriously aberrant reflection can invalidate a large number of probabilistic phase relations. The most important direct-methods formula is still the tangent formula of Karle & Hauptman (1956)[link]; most modern direct-methods programs (e.g. Busetta et al., 1980[link]; Debaerdemaeker et al., 1985[link]; Sheldrick, 1990[link]) use versions of the tangent formula that have been modified to incorporate information from weak reflections as well as strong reflections, which helps to avoid pseudo-solutions with translationally displaced molecules or a single dominant peak (the so-called uranium-atom solution). Isomorphous and anomalous ΔF values represent lower limits on the structure factors for the heavy-atom substructure and so do not give reliable estimates of weak reflections; thus, the improvements introduced into direct methods by the introduction of the weak reflections are largely irrelevant when they are applied to ΔF data. This does not apply when FA values are derived from a MAD experiment, since these are true estimates of the heavy-atom structure factors; however, aberrant large and small FA estimates are difficult to avoid and often upset the phase-determination process. A further problem in applying direct methods to ΔF data is that it is not always clear what the effective number of atoms in the cell should be for use in the probability formulae, especially when it is not known in advance how many heavy-atom sites are present.

25.2.10.3.1. The Patterson map interpretation algorithm in SHELXS

| top | pdf |

Space-group-general automatic Patterson map interpretation was introduced in the program SHELXS86 (Sheldrick, 1985[link]); completely different algorithms are employed in the current version of SHELXS, based on the Patterson superposition minimum function (Buerger, 1959[link], 1964[link]; Richardson & Jacobson, 1987[link]; Sheldrick, 1991[link], 1998a[link]; Sheldrick et al., 1993[link]). The algorithm used in SHELXS is as follows:

  • (1) A single Patterson peak, v, is selected automatically (or input by the user) and used as a superposition vector. A sharpened Patterson map [with coefficients [(E^{3}F)^{1/2}] instead of [F^{2}], where E is a normalized structure factor] is calculated twice, once with the origin shifted to −v/2 and once with the origin shifted to +v/2. At each grid point, the minimum of the two Patterson function values is stored, and this superposition minimum function is searched for peaks. If a true single-weight heavy atom-to-heavy atom vector has been chosen as the superposition vector, this function should consist ideally of one image of the heavy-atom structure and one inverted image, with two atoms (the ones corresponding to the superposition vector) in common. There are thus about 2N peaks in the map, compared with [N^{2}] in the original Patterson map, a considerable simplification. The only symmetry element of the superposition function is the inversion centre at the origin relating the two images.

  • (2) Possible origin shifts are found so that the full space-group symmetry is obeyed by one of the two images, i.e., for about half the peaks, most of the symmetry equivalents are present in the map. This enables the peaks belonging to the other image to be eliminated and, in principle, solves the heavy-atom substructure. In the space group P1, the double image cannot be resolved in this way.

  • (3) For each plausible origin shift, the potential atoms are displayed as a triangular table that gives the minimum distance and the Patterson superposition minimum function value for all vectors linking each pair of atoms, taking all symmetry equivalents into account. This table enables spurious atoms to be eliminated and occupancies to be estimated, and also in some cases reveals the presence of noncrystallographic symmetry.

  • (4) The whole procedure is then repeated for further superposition vectors as required. The program gives preference to general vectors (multiple vectors will lead to multiple images), and it is advisable to specify a minimum distance of (say) 8 Å for the superposition vector (3.5 Å for selenomethionine MAD data) to increase the chance of finding a true heavy atom-to-heavy atom vector.

25.2.10.3.2. Integrated Patterson and direct methods: SHELXD

| top | pdf |

The program SHELXD (Sheldrick & Gould, 1995[link]; Sheldrick, 1997[link], 1998b[link]) is now part of the SHELX system. It is designed both for the ab initio solution of macromolecular structures from atomic resolution native data alone and for the location of heavy-atom sites from ΔF or FA values at much lower resolution, in particular for the location of larger numbers of anomalous scatterers from MAD data. The dual-space approach of SHELXD was inspired by the Shake and Bake philosophy of Miller et al. (1993[link], 1994[link]) but differs in many details, in particular in the extensive use it makes of the Patterson function that proves very effective in applications involving ΔF or FA data. The ab initio applications of SHELXD have been described in Chapter 16.1[link] , so only the location of heavy atoms will be described here. An advantage of the Patterson function is that it provides a good noise filter for the ΔF or FA data: negative regions of the Patterson function can simply be ignored. On the other hand, the direct-methods approach is efficient at handling a large number of sites, whereas the number of Patterson peaks to analyse increases with the square of the number of atoms. Thus, for reasons of efficiency, the Patterson function is employed at two stages in SHELXD: at the beginning to obtain starting atom positions (otherwise random starting atoms would be employed) and at the end, in the form of the triangular table described above, to recognize which atoms are correct. In between, several cycles of real/reciprocal space alternation are employed as in the ab initio structure solution, alternating between tangent refinement, E-map calculation and peak search, and possibly random omit maps, in which a specified fraction of the potential atoms are left out at random.

25.2.10.3.3. Practical considerations

| top | pdf |

Since the input files for the direct and Patterson methods in SHELXS and the integrated method in SHELXD are almost identical (usually only one instruction needs to be changed), it is easy to try all three methods for difficult problems. The Patterson map interpretation in SHELXS is a good choice if the heavy atoms have variable occupancies and it is not known how many heavy-atom sites need to be found; the direct-methods approaches work best with equal atoms. In general, the conventional direct methods in SHELXS will tend to perform best in a non-polar space group that does not possess special positions; however, for more than about a dozen sites, only the integrated approach in SHELXD is likely to prove effective; the SHELXD algorithm works best when the number of sites is known. Especially for the MAD method, the quality of the data is decisive; it is essential to collect data with a high redundancy to optimize the signal-to-noise ratio and eliminate outliers. In general, a resolution of 3.5 Å is adequate for the location of heavy-atom sites. At the time of writing, SHELXD does not include facilities for the further calculations necessary to obtain maps. Experience indicates that it is only necessary to refine the B values of the heavy atoms using other programs; their coordinates are already rather precise.

Excellent accounts of the theory of direct and Patterson methods with extensive literature references have been presented in IT B Chapter 2.2[link] by Giacovazzo (2001)[link] and Chapter 2.3[link] by Rossmann & Arnold (2001)[link].

References

First citation Buerger, M. J. (1959). Vector space and its application in crystal structure investigation. New York: Wiley.Google Scholar
First citation Buerger, M. J. (1964). Image methods in crystal structure analysis. In Advanced methods of crystallography, edited by G. N. Ramachandran, pp. 1–24. Orlando, Florida: Academic Press.Google Scholar
First citation Busetta, B., Giacovazzo, C., Burla, M. C., Nunzi, A., Polidori, G. & Viterbo, D. (1980). The SIR program. I. Use of negative quartets. Acta Cryst. A36, 68–74.Google Scholar
First citation Debaerdemaeker, T., Tate, C. & Woolfson, M. M. (1985). On the application of phase relationships to complex structures. XXIV. The Sayre tangent formula. Acta Cryst. A41, 286–290.Google Scholar
First citation Giacovazzo, C. (2001). Direct methods. In International tables for crystallography, Vol. B. Reciprocal space, edited by U. Shmueli, ch. 2.2. Dordrecht: Kluwer Academic Publishers.Google Scholar
First citation Karle, J. & Hauptman, H. (1956). A theory of phase determination for the four types of non-centrosymmetric space groups 1P222, 2P22, 3P12, 3P22. Acta Cryst. 9, 635–651.Google Scholar
First citation Miller, R., DeTitta, G. T., Jones, R., Langs, D. A., Weeks, C. M. & Hauptman, H. A. (1993). On the application of the minimal principle to solve unknown structures. Science, 259, 1430–1433.Google Scholar
First citation Miller, R., Gallo, S. M., Khalak, H. G. & Weeks, C. M. (1994). SnB: crystal structure determination via Shake-and-Bake. J. Appl. Cryst. 27, 613–621.Google Scholar
First citation Richardson, J. W. & Jacobson, R. A. (1987). Computer-aided analysis of multi-solution Patterson superpositions. In Patterson and Pattersons, edited by J. P. Glusker, B. Patterson & M. Rossi, pp. 311–317. Oxford: IUCr and Oxford University Press.Google Scholar
First citation Rossmann, M. G. & Arnold, E. (2001). Patterson and molecular-replacement techniques. In International tables for crystallography, Vol. B. Reciprocal space, edited by U. Shmueli, ch. 2.3. Dordrecht: Kluwer Academic Publishers.Google Scholar
First citation Sheldrick, G. M. (1985). Computing aspects of crystal structure determination. J. Mol. Struct. 130, 9–16.Google Scholar
First citation Sheldrick, G. M. (1990). Phase annealing in SHELX-90: direct methods for larger structures. Acta Cryst. A46, 467–473.Google Scholar
First citation Sheldrick, G. M. (1991). Tutorial on automated Patterson interpretation to find heavy atoms. In Crystallographic computing 5. From chemistry to biology, edited by D. Moras, A. D. Podjarny & J. C. Thierry, pp. 145–157. Oxford: IUCr and Oxford University Press.Google Scholar
First citation Sheldrick, G. M. (1997). Direct methods based on real/reciprocal space iteration. In Proceedings of the CCP4 study weekend. Recent advances in phasing, edited by K. S. Wilson, G. Davies, A. W. Ashton & S. Bailey, pp. 147–157. Warrington: Daresbury Laboratory.Google Scholar
First citation Sheldrick, G. M. (1998a). Location of heavy atoms by automated Patterson interpretation. In Direct methods for solving macromolecular structures, edited by S. Fortier, pp. 131–141. Dordrecht: Kluwer Academic Publishers.Google Scholar
First citation Sheldrick, G. M. (1998b). SHELX: applications to macromolecules. In Direct methods for solving macromolecular structures, edited by S. Fortier, pp. 401–411. Dordrecht: Kluwer Academic Publishers.Google Scholar
First citation Sheldrick, G. M., Dauter, Z., Wilson, K. S., Hope, H. & Sieker, L. C. (1993). The application of direct methods and Patterson interpretation to high-resolution native protein data. Acta Cryst. D49, 18–23.Google Scholar
First citation Sheldrick, G. M. & Gould, R. O. (1995). Structure solution by iterative peaklist optimization and tangent expansion in space group P1. Acta Cryst. B51, 423–431.Google Scholar








































to end of page
to top of page