International
Tables for Crystallography Volume B Reciprocal space Edited by U. Shmueli © International Union of Crystallography 2006 |
International Tables for Crystallography (2006). Vol. B. ch. 4.2, pp. 407-442
https://doi.org/10.1107/97809553602060000564 Chapter 4.2. Disorder diffuse scattering of X-rays and neutronsThe scope of this chapter is outlined in Section 4.2.1 Keywords: diffuse scattering; disorder; X-ray scattering; neutron scattering; clustering; defects; local ordering; orientational disorder; short-range order; lamellar domains; density modulations; displacement modulations. |
Diffuse scattering of X-rays, neutrons and other particles is an accompanying effect in all diffraction experiments aimed at structure analysis with the aid of so-called elastic scattering. In this case the momentum exchange of the scattered photon (or particle) includes the crystal as a whole; the energy transfer involved becomes negligibly small and need not be considered in diffraction theory. Inelastic scattering processes, however, are due to excitation processes, such as ionization, phonon scattering etc. Distortions as a consequence of structural changes cause typical elastic or inelastic diffuse scattering. All these processes contribute to scattering, and a general theory has to include all of them. Hence, the exact treatment of diffuse scattering becomes very complex. Fortunately, approximations treating the phenomena independently are possible in most cases, but it should be kept in mind that difficulties may occasionally arise.
A separation of elastic from inelastic diffuse scattering may be made if detectors sensitive to the energy of radiation are used. Difficulties may sometimes result from small energy exchanges, which cannot be resolved for experimental reasons. The latter is true for scattering of X-rays by phonons which have energies of the order of 10−2−10−3 eV, a value which is considerably smaller than 10 keV, a typical value for X-ray quanta. Another equivalent explanation, frequently forwarded in the literature, is the high speed of X-ray photons, such that the rather slow motion of atoms cannot be `observed' by them during diffraction. Hence, all movements appear as static displacement waves of atoms, and temperature diffuse scattering is pseudo-elastic for X-rays. This is not true in the case of thermal neutrons, which have energies comparable to those of phonons. Since thermal diffuse scattering is discussed in Chapter 4.1
, this chapter is mainly concerned with the elastic (or pseudo-elastic other than thermal) part of diffuse scattering.
The full treatment of the complicated theoretical background for all other kinds of diffuse scattering lies beyond the scope of this article. It is also impossible to refer to all papers in this wide and complicated field. Different theoretical treatments of one and the same subject are often developed, but only some are given here, in most cases those which may be understood most easily – at least to the authors' feeling. As shown in this chapter, electron-density fluctuations and distribution functions of defects play an important role for the complete interpretation of diffraction patterns. Both quantities may best be studied in the low-angle scattering range, which occasionally represents the only Bragg peak dealing with the full information of the distribution function of the defects. Hence, many problems cannot be solved without a detailed interpretation of low-angle diffraction.
Disorder phenomena in magnetic structures are not specifically discussed here. Magnetic diffuse neutron scattering and special experimental techniques themselves constitute a large subject. Many aspects, however, may be analysed along similar lines as given here. For this particular topic the reader is referred to textbooks of neutron scattering, where the theory of diffraction by magnetic materials is generally included (see, e.g., Lovesey, 1984).
Glasses, liquids or liquid crystals show typical diffuse diffraction phenomena. Particle-size effects and strains have an important influence on the diffuse scattering. The same is true for dislocations and point defects such as interstitials or vacancies. These defects are mainly described by their strain field which influences the intensities of sharp reflections like an artificial temperature factor: the Bragg peaks diminish in intensity, while the diffuse scattering increases predominantly close to them. These phenomena are less important from a structural point of view, at least in the case of metals or other simple structures. This statement is true as long as the structure of the `kernel' of defects may be neglected when compared with the influence of the strain field. Whether dislocations in more complicated structures meet this condition is not yet known.
Radiation damage in crystals represents another field of diffuse scattering which cannot be treated here explicitly. As long as point defects only are generated, the strain field around these defects is the most important factor governing diffuse scattering. Particles with high energy, such as fast neutrons, protons and others, generate complicated defect structures which have to be treated with the aid of the cluster method described below, but no special reference is given here because of the complexity of these phenomena.
Diffuse scattering related to phase transitions, in particular the critical diffuse scattering observed at or close to the transition temperature, cannot be discussed here. In simple cases a satisfactory description may be given with the aid of a `soft phonon', which freezes at the critical temperature, thus generating typical temperature-dependent diffuse scattering. If the geometry of the lattice is maintained during the transformation (no breakdown into crystallites of different cell geometry), the diffuse scattering is very similar to diffraction phenomena described in this article. Sometimes, however, very complicated interim stages (ordered or disordered) are observed demanding a complicated theory for their full explanation (see, e.g., Dorner & Comes, 1977).
Commensurate and incommensurate modulated structures as well as quasicrystals are frequently accompanied by a typical diffuse scattering, demanding an extensive experimental and theoretical study in order to arrive at a satisfactory explanation. A reliable structure determination becomes very difficult in cases where the interpretation of diffuse scattering has not been incorporated. Many erroneous structural conclusions have been published in the past. The solution of problems of this kind needs careful thermodynamical consideration as to whether a plausible explanation of the structural data can be given.
Obviously, there is a close relationship between thermodynamics and diffuse scattering in disordered systems, representing a stable or metastable thermal equilibrium. From the thermodynamical point of view the system is then characterized by its grand partition function, which is intimately related to the correlation functions used in the interpretation of diffuse scattering. The latter is nothing other than a kind of `partial partition function' where two atoms, or two cell occupations, are fixed such that the sum of all partial partition functions represents the grand partition function. This fact yields the useful correlation between thermodynamics and diffuse scattering mentioned above, which may well be used for a determination of thermodynamical properties of the crystal. This subject could not be included here for the following reason: real three-dimensional crystals generally exhibit diffuse scattering by defects and/or disordering effects which are not in thermal equilibrium. They are created during crystal growth, or are frozen-in defects formed at higher temperatures. Hence, a thermodynamical interpretation of diffraction data needs a careful study of diffuse scattering as a function of temperature or some other thermodynamical parameters. This can be done in very rare cases only, so the omission of this subject seems justified.
For all of the reasons mentioned above, this article cannot be complete. It is hoped, however, that it will provide a useful guide for those who need the information for the full understanding of the crystal chemistry of a given structure.
There is no comprehensive treatment of all aspects of diffuse scattering. Essential parts are treated in the textbooks of James (1954), Wilson (1962)
, Wooster (1962)
and Schwartz & Cohen (1977)
; handbook articles are written by Jagodzinski (1963
, 1964a
,b
, 1987
), Schulz (1982)
, Welberry (1985)
; and a series of interesting papers is collected by Collongues et al. (1977)
.
Many differences are caused by different symbols and by different `languages' used in the various diffraction methods. Quite a few of the new symbols in use are not really necessary, but some are caused by differences in the experimental techniques. For example, the neutron scattering length b may usually be equated with the atomic form factor f in X-ray diffraction. The differential cross section introduced in neutron diffraction represents the intensity scattered into an angular range dΩ and an energy range dE. The famous scattering law in neutron work corresponds to the square of an (extended) structure factor; the `static structure factor', a term used by neutron diffractionists, is nothing other than the conventional Patterson function. The complicated resolution functions in neutron work correspond to the well known Lorentz factors in X-ray diffraction. These have to be derived in order to include all techniques used in diffuse-scattering work.
Diffuse scattering results from deviations from the identity of translational invariant scattering objects and from long-range correlations in space and time. Fluctuations of scattering amplitudes and/or phase shifts of the scattered wavetrains reduce the maximum capacity of interference (leading to Bragg reflections) and are responsible for the diffuse scattering, i.e. scattering parts which are not located in reciprocal space in distinct spots. Unfortunately, the terms `coherent' and `incoherent' scattering used in this context are not uniquely defined in the literature. Since all scattering processes are correlated in space and time, there is no incoherent scattering at all in its strict sense. (A similar relationship exists for `elastic' and `inelastic' scattering. Here pure inelastic scattering would take place if the momentum and the energy were transferred to a single scatterer; on the other hand, an elastic scattering process would demand a uniform exchange of momentum and energy with the whole crystal.) Obviously, both cases are idealized and the truth lies somewhere in between. In spite of this, a great many authors use the term `incoherent' systematically for the diffuse scattering away from the Bragg peaks, even if some diffuse maxima or minima, other than those due to structure factors of molecules or atoms, are observed. Although this definition is unequivocal as such, it is physically incorrect. Other authors use the term `coherent' for Bragg scattering only; all diffuse contributions are then called `incoherent'. This definition is clear and unique since it considers space and time, but it does not differentiate between incoherent and inelastic. In the case of neutron scattering both terms are essential and cannot be abandoned.
In neutron diffraction the term `incoherent' scattering is generally used in cases where no correlation between spin orientations or between isotopes of the same element exists. Hence, another definition of `incoherence' is proposed for scattering processes that are uncorrelated in space and time. In fact there may be correlations between the spins via their magnetic field, but the correlation length in space (and time) may be very small, such that the scattering process appears to be incoherent. Even in these cases the nuclei contribute to coherent (average structure) and incoherent scattering (diffuse background). Hence, the scattering process cannot really be understood by assuming nuclei which scatter independently. For this reason, it seems to be useful to restrict the term `incoherent' to cases where a random contribution to scattering is realized or, in other words, a continuous function exists in reciprocal space. This corresponds to a δ function in real four-dimensional space. The randomness may be attributed to a nucleus (neutron diffraction) or an atom (molecule). It follows from this definition that the scattering need not be continuous, but may be modulated by structure factors of molecules. In this sense we shall use the term `incoherent', remembering that it is incorrect from a physical point of view.
As mentioned in Chapter 4.1
the theory of thermal neutron scattering must be treated quantum mechanically. (In principle this is true also in the X-ray case.) In the classical limit, however, the final expressions have a simple physical interpretation. Generally, the quantum-mechanical nature of the scattering function of thermal neutrons is negligible at higher temperatures and in those cases where energy or momentum transfers are not too large. In almost all disorder problems this classical interpretation is sufficient for interpretation of diffuse scattering phenomena. This is not quite true in the case of orientational disorder (plastic crystals) where H atoms are involved.
The basic formulae given below are valid in either the X-ray or the neutron case: the atomic form factor f replaces the coherent scattering length (abbreviated to b). The formulation in the frame of the van Hove correlation function G(r, t) (classical interpretation, coherent part) corresponds to a treatment by a four-dimensional Patterson function P(r, t).
The basic equations for the differential cross sections are: (N = number of scattering nuclei of same chemical species;
,
= wavevectors after/before scattering).
The integrations over space may be replaced by summations in disordered crystals except for cases where structural elements exhibit a liquid-like behaviour. Then the van Hove correlation functions are:
gives the probability that if there is an atom j at
at time zero, there is an arbitrary atom j′ at
at arbitrary time t, while
refers to the same atom j at
at time t.
Equations (4.2.2.1) may be rewritten by use of the four-dimensional Fourier transforms of G, and
, respectively:
Incoherent scattering cross sections [(4.2.2.3b)
, (4.2.2.4b)
] refer to one and the same particle (at different times). In particular, plastic crystals (see Section 4.2.4.5)
may be studied by means of this incoherent scattering. It should be emphasized, however, that for reasons of intensity only disordered crystals with strong incoherent scatterers can be investigated by this technique. In practice, mostly samples with hydrogen atoms were investigated. This topic will not be treated further in this article (see, e.g., Springer, 1972
; Lechner & Riekel, 1983
). The following considerations are restricted to coherent scattering only.
Essentially the same formalism as given by equations (4.2.2.1a)–(4.2.2.4a)
may be described by the use of a generalized Patterson function, which is more familiar to crystallographers,
where τ denotes the time of observation. The only difference between
and
is the inclusion of the scattering weight (f or b) in
.
is an extension of the usual spatial Patterson function
.
One difficulty arises from neglecting the time of observation. Just as
is always proportional to the scattering volume V, in the frame of a kinematical theory or within Born's first approximation [cf. equation (4.2.2.1a)
], so
is proportional to volume and observation time. Generally one does not make S proportional to V, but one normalizes S to be independent of τ as
. Averaging over time τ gives therefore
Special cases (see, e.g., Cowley, 1981):
In most practical cases averaging over time is equivalent to averaging over space: the total diffracted intensity may be regarded as the sum of intensities from a large number of independent regions due to the limited coherence of a beam. At any time these regions take all possible configurations. Therefore, this sum of intensities is equivalent to the sum of intensities from any one region at different times: From the basic formulae one also derives the well known results of X-ray or neutron scattering by a periodic arrangement of particles in space [cf. equation (4.1.3.2) of Chapter 4.1
]:
denotes the Fourier transform of one cell (structure factor); the f's are assumed to be real.
The evaluation of the intensity expressions (4.2.2.6), (4.2.2.8)
or (4.2.2.9)
, (4.2.2.9a)
for a disordered crystal must be performed in terms of statistical relationships between scattering factors and/or atomic positions.
From these basic concepts the generally adopted method in a disorder problem is to try to separate the scattering intensity into two parts, namely one part from an average periodic structure where formulae (4.2.2.10)
, (4.2.2.11) apply and a second part
resulting from fluctuations from this average (see, e.g., Schwartz & Cohen, 1977
). One may write formally:
where
is defined to be time independent and periodic in space and
. Because cross terms
vanish by definition, the Patterson function is
Fourier transformation gives
Since
is periodic, the first term in (4.2.2.13)
describes Bragg scattering where
plays the normal role of a structure factor of one cell of the averaged structure. The second term corresponds to diffuse scattering. In many cases diffuse interferences are centred exactly at the positions of the Bragg reflections. It is then a serious experimental problem to decide whether the observed intensity distribution is Bragg scattering obscured by crystal-size limitations or other scattering phenomena.
If disordering is time dependent exclusively, represents the time average, whereas
gives the pure elastic scattering part [cf. equation (4.2.2.8)
]
and ΔF refers to inelastic scattering only.
Any structure analysis of disordered structures should start with a qualitative interpretation of diffuse scattering. This problem may be facilitated with the aid of Fourier transforms and their algebraic operations (see, e.g., Patterson, 1959). For simplicity the following modified notation is used in this section: functions in real space are represented by small letters, e.g. a(r), b(r), … except for F(r) and P(r) which are used as general symbols for a structure and the Patterson function, respectively; functions in reciprocal space are given by capital letters A(H), B(H); r and H are general vectors in real and reciprocal space, respectively,
is the scalar product H · r; dr and dH indicate integrations in three dimensions in real and reciprocal space, respectively. Even for X-rays the electron density
will generally be replaced by the scattering potential a(r). Consequently, anomalous contributions to scattering may be included if complex functions a(r) are admitted. In the neutron case a(r) refers to a quasi-potential. Using this notation we obtain the scattered amplitude and phase
(constant factors are omitted).
a(r) and A(H) are reversibly and uniquely determined by Fourier transformation. Consequently equations (4.2.3.1) may simply be replaced by
, where the double-headed arrow represents the two integrations given by (4.2.3.1)
and means: A(H) is the Fourier transform of a(r), and vice versa. The following relations may easily be derived from (4.2.3.1)
:
(
= scalar quantity).
On the other hand, the multiplication of two functions does not yield a relation of similar symmetrical simplicity: (laws of convolution and multiplication).
Since :
and vice versa. The convolution operation is commutative in either space.
For simplicity the symbol instead of the complete convolution integral is used. The distribution law
is valid for the convolution as well:
The associative law of multiplication does not hold if mixed products (convolution and multiplication) are used:
From equations (4.2.3.1)
one has:
(law of displacements).
Since symmetry operations are well known to crystallographers in reciprocal space also, the law of inversion is mentioned here only: Consequently, if
, then
. In order to calculate the intensity the complex conjugate
is needed:
Equations (4.2.3.9)
yield the relationship
(`Friedel's law') if a(r) is a real function. The multiplication of a function with its conjugate is given by:
with
Note that
is not valid if a(r) is complex. Consequently
. This is shown by evaluating
Equation (4.2.3.11)
is very useful for the determination of the contribution of anomalous scattering to diffuse reflections.
Most of the diffuse diffraction phenomena observed may be interpreted qualitatively or even semi-quantitatively in a very simple manner using a limited number of important Fourier transforms, given below.
Generally, structure determination of a disordered crystal should start in the usual way by solving the average structure. The effectiveness of this procedure strongly depends on the distribution of integrated intensities of sharp and diffuse reflections. In cases where the integrated intensities of Bragg peaks is predominant, the maximum information can be drawn from the averaged structure. The observations of fractional occupations of lattice sites, split positions and anomalous temperature factors are indications of the disorder involved. Since these aspects of disorder phenomena in the averaged structure may be interpreted very easily, a detailed discussion of this matter is not given here (see any modern textbook of X-ray crystallography).
Difficulties may arise from the intensity integration which should be carried out using a high-resolution diffraction method. The importance of this may be understood from the following argument. The averaged structure is determined by the coherent superposition of different structure factors. This interpretation is true if there is a strictly periodic subcell with long-range order which allows for a clear separation of sharp and diffuse scattering. There are important cases, however, where this procedure cannot be applied without loss of information.
|
The integrated intensity of any structure is independent of atomic positions if the atomic form factors remain unchanged by structural fluctuations. Small deviations of atomic form factors owing to electron-density changes of valence electrons are neglected. Consequently, the integrated diffuse intensities remain unchanged if the average structure is not altered by the degree of order. The latter condition is obeyed in cases where a geometrical long-range order of the lattice is independent of the degree of order, and no long-range order in the structure exists. This law is extremely useful for the interpretation of diffuse scattering. Unfortunately, intensity integration over coinciding sharp and diffuse maxima does not necessarily lead to a structure determination of the corresponding undistorted structure. This integration may be useful for antiphase domains without major structural changes at the boundaries. In all other cases the deviations of domains (or clusters) from the averaged structure determine the intensities of maxima which are no longer correlated with those of the average structure.
If the integrated intensity of diffuse scattering is comparable with, or even larger than, those of the Bragg peaks it is useful to begin the interpretation with a careful statistical study of the diffuse intensities. Intensity statistics can be applied in a way similar to the intensity statistics in classical structure determination. The following rules are briefly discussed in order to enable a semi-quantitative interpretation of the essential features of disorder to be realized.
|
Although it is highly improbable that exactly the same diffraction picture will really be found, the use of an atlas of optical transforms (Wooster, 1962; Harburn et al., 1975
; Welberry & Withers, 1987
) may be very helpful at the beginning of any study of diffuse scattering. The most important step is the separation of the distribution function from the molecular scattering. Since this information may be derived from a careful comparison of low-angle diffraction with the remaining sharp reflections, this task is not too difficult. If the influence of the distribution function is unknown, the reader is strongly advised to disregard the immediate neighbourhood of Bragg peaks in the first step of the interpretation. Obviously information may be lost in this way but, as has been shown in the past, much confusion caused by the attempt to interpret the scattering near the Bragg peaks with specific structural properties of a cluster or molecular model is avoided. The inclusion of this part of diffuse scattering can be made after the complete interpretation of the change of the influence of the distribution function on diffraction in the far-angle region.
In these sections quantitative interpretations of the elastic part of diffuse scattering (X-rays and neutrons) are outlined. Although similar relations are valid, magnetic scattering of neutrons is excluded. Obviously, all disorder phemomena are strongly temperature dependent if thermal equilibrium is reached. Consequently, the interpretation of diffuse scattering should include a statistical thermodynamical treatment. Unfortunately, no quantitative theory for the interpretation of structural phenomena is so far available: all quantitative solutions introduce formal order parameters such as correlation functions or distributions of defects. At low temperatures (low concentration of defects) the distribution function plays the dominant role in diffuse scattering. With increasing temperature the number of defects increases with corresponding strong interactions between them. Therefore, correlations become increasingly important, and phase transformations of first or higher order may occur which need a separate theoretical treatment. In many cases large fluctuations of structural properties occur which are closely related to the dynamical properties of the crystal. Theoretical approximations are possible but their presentation is far beyond the scope of this article. Hence we restrict ourselves to formal parameters in the following.
Point defects or limited structural units, such as molecules, clusters of finite size etc., may only be observed in diffraction for a sufficiently large number of defects. This statement is no longer true in high-resolution electron diffraction where single defects may be observed either by diffraction or by optical imaging if their contrast is high enough. Hence, electron microscopy and diffraction provide valuable methods for the interpretation of disorder phenomena.
The arrangement of a finite assembly of structural defects is described by its structure and its three-dimensional (3D) distribution function. Structures with a strict 1D periodicity (chain-like structures) need a 2D distribution function, while for structures with a 2D periodicity (layers) a 1D distribution function is sufficient. Since the distribution function is the dominant factor in statistics with correlations between defects, we define the dimensionality of disorder as that of the corresponding distribution function. This definition is more effective in diffraction problems because the dimension of the disorder problem determines the dimension of the diffuse scattering: 1D diffuse streaks, 2D diffuse layers, or a general 3D diffuse scattering.
Strictly speaking, completely random distributions cannot be realized as shown in Section 4.2.3. They occur approximately if the following conditions are satisfied.
|
As already mentioned, disorder phenomena may be observed in thermal equilibrium. Two completely different cases have to be considered.
|
In many cases the defects do not occur in thermal equilibrium. Nevertheless, their diffuse scattering is temperature dependent because of the anomalous thermal movements at the boundary of the defect. Hence, the observation of a temperature-dependent behaviour of diffuse scattering cannot be taken as a definite criterion of thermal equilibrium without further careful interpretation.
Ordering of defects may take place in a very anisotropic manner. This is demonstrated by the huge number of examples of 1D disorder. As shown by Jagodzinski (1963) this type of disorder cannot occur in thermal equilibrium for the infinite crystal. This type of disorder is generally formed during crystal growth or mechanical deformation. Similar arguments may be applied to 2D disorder. This is a further reason why the so-called Ising model can hardly be used in order to obtain interaction energies of structural defects. From these remarks it becomes clear that order parameters are more or less formal parameters without strict thermodynamical meaning.
The following section is organized as follows: first we discuss the simple case of 1D disorder where reliable solutions of the diffraction problem are available. Intensity calculations of diffuse scattering of 2D disorder by chain-like structures follow. Finally, the 3D case is treated, where formal solutions of the diffraction problem have been tried and applied successfully to metallic systems to some extent. A short concluding section concerns the special phenomenon of orientational disorder.
As has been pointed out above, it is often useful to start the interpretation of diffuse scattering by checking the diffraction pattern with respect to the dimensionality of the disorder concerned. Since each disordered direction in the crystal demands a violation of the corresponding Laue condition, this question may easily be answered by looking at the diffuse scattering. Diffuse streaks in reciprocal space are due to a one-dimensional violation of the Laue conditions, and will be called one-dimensional disorder. This kind of order is typical for layer structures, but it is frequently observed in cases where several sequences of layers do not differ in the interactions of next-nearest neighbours. Typical examples are structures which may be described in terms of close packing, e.g. hexagonal and cubic close packing.
For a quantitative interpretation of diffuse streaks we need one-dimensional correlation functions, which may uniquely be determined if a single independent correlation function is active. According to equation (4.2.3.28) Fourier transformation yields the information required. In all other cases a specific model has to be suggested for a full interpretation of diffuse streaks. Another comment seems to be necessary: disorder parameters can be defined uniquely only if the diffraction pattern allows for a differentiation between long-range and short-range order. This question can at least partly be answered by studying the line width of sharp reflections with a very good resolution. Since integrated intensities of sharp reflections have to be separated from the diffuse scattering, this question is of outstanding importance in most cases. Inclusion of diffuse parts in the diffraction pattern during intensity integration of sharp reflections may lead to serious errors in the interpretation of the average structure.
The existence of diffuse streaks in more than one direction of reciprocal space means that the diffraction problem is no longer one-dimensional. Sometimes the problem may be treated independently, if the streaks are sharp, and no interference effects may be observed in the diffraction pattern in areas where the diffuse streaks do overlap. In all other cases there are correlations between the various directions of one-dimensional disorder which may be determined with the aid of a model covering more than one of the pertinent directions of disorder.
Before starting the discussion of the quantitative solution of the one-dimensional problem, some remarks should be made on the usefulness of quantitative disorder parameters. It is well known from statistical thermodynamics that a one-dimensional system cannot show long-range order above . Obviously, this statement is in contradiction with many experimental observations where long-range order is realized even in layer structures. The reason for this behaviour is given by the following arguments which are valid for any structure. Let us assume a structure with strong interactions at least in two directions. From the theoretical treatment of the two-dimensional Ising model it is known that such a system shows long-range order below a critical temperature
. This statement is true even if the layer is finite, although the strict thermodynamic behaviour is not really critical in the thermodynamical sense. A three-dimensional crystal can be constructed by adding layer after layer. Since each layer has a typical two-dimensional free energy, the full statistics of the three-dimensional crystal may be calculated by introducing a specific free energy for the various stackings of layers. Obviously, this additional energy has to include terms describing potential and entropic energies as well. They may be formally developed into contributions of next, overnext etc. nearest neighbours. Apparently, the contribution to entropy must include configurational and vibrational parts which are strongly coupled. As long as the layers are finite, there is a finite probability of a fault in the stacking sequence of layers which approaches zero with increasing extension of the layers. Consequently, the free energy of a change in the favourite stacking sequence becomes infinite quadratically with the size of the layer. Therefore, the crystal should be either completely ordered or disordered; the latter case can only be realized if the free energies of one or more stacking sequences are exactly equal (very rare, but possible over a small temperature range of phase transformations). An additional positive entropy associated with a deviation from the periodic stacking sequence may lead to a kind of competition between entropy and potential energy, in such a way that periodic sequences of faults result. Obviously, this situation occurs in the transition range of two structures differing only in their stacking sequence. On the other hand, one must assume that defects in the stacking sequence may be realized if the size of the layers is small. This situation occurs during crystal growth, but one should remember that the number of stacking defects should decrease with increasing size of the growing crystal. Apparently, this rearrangement of layers may be suppressed as a consequence of relaxation effects. The growth process itself may influence the propagation of stacking defects and, consequently, the determination of stacking-fault probabilities, aiming at an interpretation of the chemical bonding seems to be irrelevant in most cases.
The quantitative solution of the diffraction problem of one-dimensional disorder follows a method similar to the Ising model. As long as next-nearest neighbours alone are considered, the solution is very simple only if two possibilities of structure factors are to be taken into account. Introducing the probability of equal pairs 1 and 2, α, one arrives at the known solution for the a priori probability and a posteriori probabilities
, respectively. In the one-dimensional Ising model with two spins and the interaction energies
, defining the pair probability
the full symmetry is
, and
.
Consequently: The scattered intensity is given by
where
,
in the c direction and
depends on
,
which are the eigenvalues of the matrix
From the characteristic equation
one has
describes a sharp Bragg reflection (average structure) which need not be calculated. Its intensity is simply proportional to
. The second characteristic value yields a diffuse reflection in the same position if the sign is positive
, and in a position displaced by
in reciprocal space if the sign is negative
. Because of the symmetry conditions
only is needed; it may be determined with the aid of the boundary conditions
and the general relation that
is given by
The final solution of our problem yields simply:
The calculation of the scattered intensity is now performed with the general formula
Evaluation of this expression yields
Since the characteristic solutions of the problem are real:
This particle size effect has been neglected in (4.2.4.5)
. This result confirms the fact mentioned above that the sharp Bragg peaks are determined by the averaged structure factor and the diffuse one by its mean-square deviation.
For the following reason there are no examples for quantitative applications: two different structures generally have different lattice constants; hence the original assumption of an undisturbed lattice geometry is no longer valid. The only case known to the authors is the typical lamellar structure of plagioclases, reported by Jagodzinski & Korekawa (1965). The authors interpret the well known `Schiller effect' as a consequence of optical diffraction. Hence, the size of the lamellae is of the order of 2000 Å. This long-period superstructure cannot be explained in terms of next-nearest-neighbour interactions. In principle, however, the diffraction effects are similar: instead of the diffuse peak as described by the second term in equation (4.2.4.5)
, satellites of first and second order, etc. accompanying the Bragg peaks are observed. The study of this phenomenon (Korekawa & Jagodzinski, 1967
) has not so far resulted in a quantitative interpretation.
Obviously, the symmetry relation used in the formulae discussed above is only valid if the structures described by the are related by symmetries such as translations, rotations or combinations of both. The type of symmetry has an important influence on the diffraction pattern.
|
From an historical point of view stacking disorder in close-packed systems is most important. The three relevant positions of ordered layers are represented by the atomic coordinates in the hexagonal setting of the unit cell, or simply by the figures 1, 2, 3 in the same sequence. Structure factors
refer to the corresponding positions of the same layer:
hence
According to the above discussion the said indices define the reciprocal-lattice rows exhibiting sharp reflections only, as long as the distances between the layers are exactly equal. The symmetry conditions caused by the translation are normally:
For the case of close packing of spheres and some other problems any configuration of m layers determining the a posteriori probability
, has a symmetrical counterpart where μ is replaced by
(if
).
In this particular case , and equivalent relations generated by translation.
Nearest-neighbour interactions do not lead to an ordered structure if the principle of close packing is obeyed (no pairs in equal positions) (Hendricks & Teller, 1942; Wilson, 1942
). Extension of the interactions to next-but-one or more neighbours may be carried out by introducing the method of matrix multiplication developed by Kakinoki & Komura (1954
, 1965
), or the method of overlapping clusters (Jagodzinski, 1954
). The latter procedure is outlined in the case of interactions between four layers. A given set of three layers may occur in the following 12 combinations:
Since three of them are equivalent by translation, only four representatives have to be introduced:
In the following the new indices 1, 2, 3, 4 are used for these four representatives for the sake of simplicity.
In order to construct the statistics layer by layer the next layer must belong to a triplet starting with the same two symbols with which the preceding one ended, e.g. 123 can only be followed by 231, or 232. In a similar way 132 can only be followed by 321 or 323. Since both cases are symmetrically equivalent the probabilities and
are introduced. In a similar way 121 may be followed by 212 or 213 etc. For these two groups the probabilities
and
are defined. The different translations of groups are considered by introducing the phase factors as described above. Hence, the matrix for the characteristic equation may be set up as follows. As representative cluster of each group is chosen that one having the number 1 at the centre, e.g. 312 is representative for the group 123, 231, 312; in a similar way 213, 212 and 313 are the remaining representatives. Since this arrangement of three layers is equivalent by translation, it may be assumed that the structure of the central layer is not influenced by the statistics to a first approximation. The same arguments hold for the remaining three groups. On the other hand, the groups 312 and 213 are equivalent by rotation only. Consequently their structure factors may differ if the influence of the two neighbours has to be taken into account. A different situation exists for the groups 212 and 313 which are correlated by a centre of symmetry, which causes different corresponding structure factors. It should be pointed out, however, that the structure factor is invariant as long as there is no influence of neighbouring layers on the structure of the central layer. The latter is often observed in close-packed metal structures, or in compounds like ZnS, SiC and others. For the calculation of intensities
is needed.
According to the following scheme of sequences any sequence of pairs is correlated with the same phase factor for due to translation, if both members of the pair belong to the same group. Consequently the phase factor may be attached to the sequence probability such that
remains unchanged, and the group may be treated as a single element in the statistics. In this way the reduced matrix for the solution of the characteristic equation is given by
There are three solutions of the diffraction problem:
|
In order to calculate the intensities one has to reconsider the symmetry of the clusters, which is different from the symmetry of the layers. Fortunately, a threefold rotation axis is invariant against the translations, but this is not true for the remaining symmetry operations in the layer if there are any more. Since we have two pairs of inequivalent clusters, namely 312, 213 and 212, 313, there are only two different a priori probabilities and
.
The symmetry conditions of the new clusters may be determined by means of the so-called `probability trees' described by Wilson (1942) and Jagodzinski (1949b
, pp. 208–214). For example:
,
,
,
etc.
It should be pointed out that clusters 1 and 3 describe a cubic arrangement of three layers in the case of simple close packing, while clusters 2 and 4 represent the hexagonal close packing. There may be a small change in the lattice constant c perpendicular to the layers. Additional phase factors then have to be introduced in the matrix for the characteristic equation, and a recalculation of the constants is necessary. As a consequence, the reciprocal-lattice rows become diffuse if
, and the diffuseness increases with l. A similar behaviour results for the remaining reciprocal-lattice rows.
The final solution of the diffraction problem results in the following general intensity formula: Here
and
represent the real and imaginary part of the constants to be calculated with the aid of the boundary conditions of the problem. The first term in equation (4.2.4.11)
determines the symmetrical part of a diffuse reflection with respect to the maximum, and is completely responsible for the integrated intensity. The second term causes an antisymmetrical contribution to intensity profiles but does not influence the integrated intensities. These general relations enable a semi-quantitative interpretation of the sharp and diffuse scattering in any case, without performing the time-consuming calculations of the constants which may only be done in more complicated disorder problems with the aid of a computer program evaluating the boundary conditions of the problem.
This can be carried out with the aid of the characteristic values and a linear system of equations (Jagodzinski, 1949a,b
,c
), or with the aid of matrix formalism (Kakinoki & Komura, 1954
; Takaki & Sakurai, 1976
). As long as only the line profiles and positions of the reflections are required, these quantities may be determined experimentally and fitted to characteristic values of a matrix. The size of this matrix is given by the number of sharp and diffuse maxima observed, while
and
may be found by evaluating the line width and the position of diffuse reflections. Once this matrix has been found, a semi-quantitative model of the disorder problem can be given. If a system of sharp reflections is available, the averaged structure can be solved as described in Section 4.2.3.2
. The determination of the constants of the diffraction problem is greatly facilitated by considering the intensity modulation of diffuse scattering, which enables a phase determination of structure factors to be made under certain conditions.
The theory of closed-packed structures with three equivalent translation vectors has been applied very frequently, even to systems which do not obey the principle of close-packing. The first quantitative explanation was published by Halla et al. (1953). It was shown there that single crystals of C18H24 from the same synthesis may have a completely different degree of order. This was true even within the same crystal. Similar results were found for C, Si, CdI2, CdS2, mica and many other compounds. Quantitative treatments are less abundant [e.g. CdI2: Martorana et al. (1986)
; MX3 structures: Conradi & Müller (1986)
]. Special attention has been paid to the quantitative study of polytypic phase transformations in order to gain information about the thermodynamical stability or the mechanism of layer displacements, e.g. Co (Edwards & Lipson, 1942
; Frey & Boysen, 1981
), SiC (Jagodzinski, 1972
; Pandey et al., 1980a
,b
,c
), ZnS (Müller, 1952
; Mardix & Steinberger, 1970
; Frey et al., 1986
) and others.
Certain laws may be derived for the reduced integrated intensities of diffuse reflections. `Reduction' in this context means a division of the diffuse scattering along l by the structure factor, or the difference structure factor if . This procedure is valuable if the number of stacking faults rather than the complete solution of the diffraction problem is required.
The discussion given above has been made under the assumption that the full symmetry of the layers is maintained in the statistics. Obviously, this is not necessarily true if external lower symmetries influence the disorder. An important example is the generation of stacking faults during plastic deformation. Problems of this kind need a complete reconsideration of symmetries. Furthermore, it should be pointed out that a treatment with the aid of an extended Ising model as described above is irrelevant in most cases. Simplified procedures describing the diffuse scattering of intrinsic, extrinsic, twin stacking faults and others have been described in the literature. Since their influence on structure determination can generally be neglected, the reader is referred to the literature for additional information.
In this section disorder phenomena are considered which are related to chain-like structural elements in crystals. This topic includes the so-called `1D crystals' where translational symmetry (in direct space) exists in one direction only – crystals in which highly anisotropic binding forces are responsible for chain-like atomic groups, e.g. compounds which exhibit a well ordered 3D framework structure with tunnels in a unique direction in which atoms, ions or molecules are embedded. Examples are compounds with platinum, iodine or mercury chains, urea inclusion compounds with columnar structures (organic or inorganic), 1D ionic conductors, polymers etc. Diffuse-scattering studies of 1D conductors have been carried out in connection with investigations of stability/instability problems, incommensurate structures, phase transitions, dynamic precursor effects etc. These questions are not treated here. For general reading of diffuse scattering in connection with these topics see, e.g., Comes & Shirane (1979, and references therein). Also excluded are specific problems related to polymers or liquid crystals (mesophases) (see Chapter 4.4
) and magnetic structures with chain-like spin arrangements.
Trivial diffuse scattering occurs as 1D Bragg scattering (diffuse layers) by internally ordered chains. Diffuse phenomena in reciprocal space are due to `longitudinal' disordering within the chains (along the unique direction) as well as to `transverse' correlations between different chains over a restricted volume. Only static aspects are considered; diffuse scattering resulting from collective excitations or diffusion-like phenomena which are of inelastic or quasielastic origin are not treated here.
As found in any elementary textbook of diffraction the simplest result of scattering by a chain with period c is described by one of the Laue equations:
which gives broadened profiles for small N. In the context of phase transitions the Ornstein–Zernike correlation function is frequently used, i.e. (4.2.4.13
) is replaced by a Lorentzian:
where ξ denotes the correlation length.
In the limiting case , (4.2.4.13
) becomes
The scattering by a real chain a(r) consisting of molecules with structure factor
is therefore determined by
The Patterson function is:
where the index l denotes the only relevant position
(the subscript M is omitted).
The intensity is concentrated in diffuse layers perpendicular to from which the structural information may be extracted. Projections are:
Obviously the z parameters can be determined by scanning along a meridian (00L) through the diffuse sheets (diffractometer recording). Owing to intersection of the Ewald sphere with the set of planes the meridian cannot be recorded on one photograph; successive equi-inclination photographs are necessary. Only in the case of large c spacings is the meridian well approximated in one photograph.
There are many examples where a tendency to cylindrical symmetry exists: chains with p-fold rotational or screw symmetry around the preferred direction or assemblies of chains (or domains) with statistical orientational distribution around the texture axis. In this context it should be mentioned that symmetry operations with rotational parts belonging to the 1D rod groups actually occur, i.e. not only p = 2, 3, 4, 6.
In all these cases a treatment in the frame of cylindrical coordinates is advantageous (see, e.g., Vainshtein, 1966):
The integrals may be evaluated by the use of Bessel functions:
.
The 2D problem is treated first; an extension to the general case
is easily made afterwards.
Along the theory of Fourier series one has: or with:
If contributions to anomalous scattering are neglected a(r, ψ) is a real function:
Analogously, one has
is a complex function;
are the Fourier coefficients which are to be evaluated from the
:
The formulae may be used for calculation of diffuse intensity distribution within a diffuse sheet, in particular when the chain molecule is projected along the unique axis [cf. equation (4.2.4.18)
].
Special cases are:
The general 3D expressions valid for extended chains with period c [equation (4.2.4.12)] are found in an analogous way:
using a series expansion analogous to (4.2.4.23
) and (4.2.4.24
):
one has:
In practice the integrals are often replaced by discrete summation of j atoms at positions:
,
,
:
or
Intensity in the lth diffuse layer is given by
|
Formulae concerning the reverse method (Fourier synthesis) are not given here (see, e.g., Vainshtein, 1966). Usually there is no practical use in diffuse-scattering work because it is very difficult to separate out a single component
. Every diffuse layer is affected by all components
. There is a chance if one diffuse layer corresponds predominantly to one Bessel function.
Deviations from strict periodicities in the z direction within one chain may be due to loss of translational symmetry of the centres of the molecules along z and/or due to varying orientations of the molecules with respect to different axes, such as azimuthal misorientation, tilting with respect to the z axis or combinations of both types. As in 3D crystals, there may or may not exist 1D structures in an averaged sense.
All formulae given in this section are only special cases of a 3D treatment (see, e.g., Guinier, 1963). The 1D lattice (4.2.4.12)
is replaced by a distribution:
The Patterson function is given by
Because the autocorrelation function
is centrosymmetric
the interference function
is given by
Sometimes, e.g. in the following example of orientational disorder, there is an order only within domains. As shown in Section 4.2.3
, this may be treated by a box or shape function
for
and 0 elsewhere.
with
If the order is perfect within one domain one has
;
; i.e. each reflection is affected by the shape function.
A misorientation of the chain molecules with respect to one another is taken into account by different structure factors .
A further discussion follows the same arguments outlined in Section 4.2.3
. For example, a very simple result is found in the case of uncorrelated orientations. Averaging over all pairs
yields
where
Besides the diffuse layer system there is a diffuse background modulated by the H dependence of
.
In this context the structure factor of a chain molecule is neglected. Irregular distances between the molecules within a chain occur owing to the shape of the molecules, intrachain interactions and/or interaction forces via a surrounding matrix. A general discussion is given by Guinier (1963). It is convenient to reformulate the discrete Patterson function, i.e. the correlation function (4.2.4.39
).
in terms of continuous functions
which describe the probability of finding the µth neighbour within an arbitrary distance
There are two principal ways to define . The first is the case of a well defined one-dimensional lattice with positional fluctuations of the molecules around the lattice points, i.e. long-range order is retained:
, where
denotes the displacement of the µth molecule in the chain. Frequently used are Gaussian distributions:
(
= normalizing constant;
= standard deviation). Fourier transformation [equation (4.2.4.45)
] gives the well known result
i.e. a monotonically increasing intensity with L (modulation due to a molecular structure factor neglected). This result is quite analogous to the treatment of the scattering of independently vibrating atoms. If (short-range) correlations exist between the molecules the Gaussian distribution is replaced by a multivariate normal distribution where correlation coefficients
between a molecule and its µth neighbour are incorporated.
is defined by the second moment:
.
Obviously the variance increases if the correlation diminishes and reaches an upper bound of twice the single site variance. Fourier transformation gives an expression for diffuse intensity (Welberry, 1985)
:
For small Δ, terms with
are mostly neglected. The terms become increasingly important with higher values of L. On the other hand,
becomes smaller with increasing j, each additional term in equation (4.2.4.46)
becomes broader and, as a consequence, the diffuse planes in reciprocal space become broader with higher L.
In a different way – in the paracrystal method – the position of the second and subsequent molecules with respect to some reference zero point depends on the actual position of the predecessor. The variance of the position of the µth molecule relative to the first becomes unlimited. There is a continuous transition to a fluid-like behaviour of the chain molecules. This 1D paracrystal (sometimes called distortions of second kind) is only a special case of the 3D paracrystal concept (see Hosemann & Bagchi, 1962; Wilke, 1983
). Despite some difficulties with this concept (Brämer, 1975
; Brämer & Ruland, 1976
) it is widely used as a theoretical model for describing diffraction of highly distorted lattices. One essential development is to limit the size of a paracrystalline grain so that fluctuations never become too large (Hosemann, 1975
).
If this concept is used for the 1D case, is defined by convolution products of
. For example, the probability of finding the next-nearest molecule is given by
and, generally:
(µ-fold convolution).
The mean distance between next-nearest neighbours is and between neighbours of the µth order:
. The average value of
, which is also the value of w(z) for
, where the distribution function is completely smeared out. The general expression for the interference function G(L) is
with
.
With , equation (4.2.4.47)
is written:
[Note the close similarity to the diffuse part of equation (4.2.4.5),
which is valid for 1D disorder problems.]
This function has maxima of height and minima of height
at positions
and
, respectively. With decreasing
the oscillations vanish; a critical L value (corresponding to
) may be defined by Gmax/Gmin ≲ 1.2. Actual values depend strongly on
.
The paracrystal method is substantiated by the choice , i.e. the disorder model. Again, frequently used is a Gaussian distribution:
with the two parameters
.
There are peaks of height which obviously decrease with
and
. The oscillations vanish for
, i.e.
. The width of the mth peak is
. The integral reflectivity is approximately
and the integral width (defined by integral reflectivity divided by peak reflectivity) (background subtracted!)
which, therefore, increases with
. In principle the same results are given by Zernike & Prins (1927)
. In practice a single Gaussian distribution is not fully adequate and modified functions must be used (Rosshirt et al., 1985
).
A final remark concerns the normalization [equation (4.2.4.39)]. Going from (4.2.4.39)
to (4.2.4.45)
it is assumed that N is a large number so that the correct normalization factors
for each
may be approximated by a uniform N. If this is not true then
The correction term may be important in the case of relatively small (1D) domains. As mentioned above, the structure factor of a chain molecule was neglected. The H dependence of
, of course, obscures the intensity variation of the diffuse layers as described by (4.2.4.47a)
.
The matrix method developed for the case of planar disorder was adapted to 2D disorder by Scaringe & Ibers (1979). Other models and corresponding expressions for diffuse scattering are developed from specific microscopic models (potentials), e.g. in the case of Hg3−δAsF6 (Emery & Axe, 1978
; Radons et al., 1983
), hollandites (Beyeler et al., 1980
; Ishii, 1983
), iodine chain compounds (Endres et al., 1982
) or urea inclusion compounds (Forst et al., 1987
).
In real cases there are more or less strong correlations between different chains at least within small domains. Deviations from a strict (3D) order of chain-like structural elements are due to several reasons: shape and structure of the chains, varying binding forces, thermodynamical or kinetic considerations.
Many types of disorder occur. (1) Relative shifts parallel to the common axis while projections along this axis give a perfect 2D ordered net (`axial disorder'). (2) Relative fluctuations of the distances between the chains (perpendicular to the unique axis) with short-range order along the transverse a and/or b directions. The net of projected chains down to the ab plane is distorted (`net distortions'). Disorder of types (1) and (2) is sometimes correlated owing to non-uniform cross sections of the chains. (3) Turns, twists and torsions of chains or parts of chains. This azimuthal type of disorder may be treated similarly to the case of azimuthal disorder of single-chain molecules. Correlations between axial shifts and torsions produce `screw shifts' (helical structures). Torsion of chain parts may be of dynamic origin (rotational vibrations). (4) Tilting or bending of the chains in a uniform or non-uniform way (`conforming/non-conforming'). Many of these types and a variety of combinations between them are found in polymer and liquid crystals and are treated therefore separately. Only some simple basic ideas are discussed here in brief.
For the sake of simplicity the paracrystal concept in combination with Gaussians is used again. Distribution functions are given by convolution products of next-nearest-neighbour distribution functions. As long as averaged lattice directions and lattice constants in a plane perpendicular to the chain axis exist, only two functions and
are needed to describe the arrangement of next-nearest chains. Longitudinal disorder is treated as before by a third distribution function
. The phenomena of chain bending or tilting may be incorporated by an x and y dependence of
. Any general fluctuation in the spatial arrangement of chains is given by
(m-fold, p-fold, q-fold self-convolution of
, respectively.)
are called fundamental functions. If an averaged lattice cannot be defined, more fundamental functions
are needed to account for correlations between them.
By Fourier transformations the interference function is given by If Gaussian functions are assumed, simple pictures are derived. For example:
describes the distribution of neighbours in the x direction (mean distance
). Parameter
concerns axial,
and
radial and tangential fluctuations, respectively. Pure axial distribution along c is given by projection of
on the z axis, pure net distortions by projection on the
plane. If the chain-like structure is neglected the interference function
describes a set of diffuse planes perpendicular to
with mean distance
. These diffuse layers broaden along H with
and decrease in intensity along K and L monotonically. There is an ellipsoidal-shaped region in reciprocal space defined by main axes of length
with a limiting surface given by
, beyond which the diffuse intensity is completely smeared out. The influence of
may be discussed in an analogous way.
If the chain-like arrangement parallel to c [equation (4.2.4.12)] is taken into consideration,
the set of planes perpendicular to
(and/or
) is subdivided in the L direction by a set of planes located at
[equation (4.2.4.15)].
Longitudinal disorder is given by [equation (4.2.4.48),
] and leads to two intersecting sets of broadened diffuse layer systems.
Particular cases like pure axial distributions , pure tangential distributions (net distortions:
), uniform bending of chains or combinations of these effects are discussed in the monograph of Vainshtein (1966)
.
In this section general formulae for diffuse scattering will be derived which may best be applied to crystals with a well ordered average structure, characterized by (almost) sharp Bragg peaks. Textbooks and review articles concerning defects and local ordering are by Krivoglaz (1969), Dederichs (1973)
, Peisl (1975)
, Schwartz & Cohen (1977)
, Schmatz (1973
, 1983
), Bauer (1979)
, and Kitaigorodsky (1984)
. A series of interesting papers on local order is given by Young (1975)
and also by Cowley et al. (1979)
. Expressions for polycrystalline sample material are given by Warren (1969)
and Fender (1973)
.
Two general methods may be applied:
Both methods are equivalent in principle. The cluster method is generally more convenient in cases where a single average cluster is a good approximation. This holds for small concentrations of clusters with sufficient space in between. The method of short-range-order parameters is optimal in cases where isolated clusters are not realized and the correlations do not extend to long distances. Otherwise periodic solutions are more convenient in most cases.
In any case, the first step towards the solution of the diffraction problem is the accurate determination of the average structure. As described in Section 4.2.3.2 important information on fractional occupations, interstitials and displacements (unusual thermal parameters) of atoms may be derived. Unfortunately all defects contribute to diffuse scattering; hence one has to start with the assumption that the disorder to be interpreted is predominant. Fractional occupancy of certain lattice sites by two or more kinds of atoms plays an important role in the literature, especially in metallic or ionic structures. Since vacancies may be treated as atoms with zero scattering amplitude, structures containing vacancies may be formally treated as multi-component systems.
Since the solution of the diffraction problem should not be restricted to metallic systems with a simple (primitive) structure, we have to consider the structure of the unit cell – as given by the average structure – and the propagation of order according to the translation group separately. In simple metallic systems this difference is immaterial. It is well known that the thermodynamic problem of propagation of order in a three-dimensional crystal can hardly be solved analytically in a general way. Some solutions have been published with the aid of the so-called Ising model using next-nearest-neighbour interactions. They are excellent for an understanding of the principles of order–disorder phenomena, but they can scarcely be applied quantitatively in practical problems. Hence, methods have been developed to derive the propagation of order from the diffraction pattern by means of Fourier transformation. This method has been described qualitatively in Section 4.2.3.1, and will be used here for a quantitative application. In a first approximation the assumption of a small number of different configurations of the unit cell is made, represented by the corresponding number of structure factors. Displacements of atoms caused by the configurations of the neighbouring cells are excluded. This problem will be treated subsequently.
The finite number of structures of the unit cell in the disordered crystal is given by Note that
is defined in real space, and
gives the position vector of site j;
if in the νth structure factor the site j is occupied by an atom of kind µ, and 0 elsewhere.
In order to apply the laws of Fourier transformation adequately, it is useful to introduce the distribution function of
with
, if the cell
has the
structure, and
elsewhere.
In the definitions given above are numbers (scalars) assigned to the cell. Since all these are occupied we have
with
= lattice in real space.
The structure of the disordered crystal is given by
consists of
points, where
is the total (large) number of unit cells and
denotes the a priori probability (concentration) of the νth cell occupation.
It is now useful to introduce with
Introducing (4.2.4.58)
into (4.2.4.57)
:
Similarly:
Using (4.2.4.60)
it follows from (4.2.4.58)
that
Comparison with (4.2.4.59)
yields
Fourier transformation of (4.2.4.61)
gives
with
The expression for the scattered intensity is therefore
Because of the multiplication by L(H) the third term in (4.2.4.62)
contributes to sharp reflections only. Since they are correctly given by the second term in (4.2.4.62)
, the third term vanishes, Hence, the diffuse part is given by
For a better understanding of the behaviour of diffuse scattering it is useful to return to real space:
and with (4.2.4.58)
:
Evaluation of this equation for a single term yields
Since l(r) is a periodic function of points, all convolution products with l(r) are also periodic. For the final evaluation the decrease of a number of overlapping points (maximum N) in the convolution products with increasing displacements of the functions is neglected (no particle-size effect). Then (4.2.4.66)
becomes
If the first term in (4.2.4.67)
is considered, the convolution of the two functions for a given distance n counts the number of coincidences of the function
with
. This quantity is given by
, where
is the probability of a pair occupation in the r direction.
Equation (4.2.4.67) then reads:
with
. The function
is usually called the pair-correlation function
in the physical literature.
The following relations hold: Also, functions normalized to unity are in use. Obviously the following relation is valid:
.
Hence: is unity for
. This property is especially convenient in binary systems.
With (4.2.4.68), equation (4.2.4.64)
becomes
and Fourier transformation yields
It may be concluded from equations (4.2.4.69)
that all functions
may be expressed by
in the case of two structure factors
,
. Then all
are symmetric in r; the same is true for the
. Consequently, the diffuse reflections described by (4.2.4.71)
are all symmetric. The position of the diffuse peak depends strongly on the behaviour of
; in the case of cluster formation Bragg peaks and diffuse peaks coincide. Diffuse superstructure reflections are observed if the
show some damped periodicities.
It should be emphasized that the condition may be violated for
if more than two cell occupations are involved. As shown below, the possibly asymmetric functions may be split into symmetric and antisymmetric parts. From equation (4.2.3.8)
it follows that the Fourier transform of the antisymmetric part of
is also antisymmetric. Hence, the convolution in the two terms in square brackets in (4.2.4.71)
yields an antisymmetric contribution to each diffuse peak, generated by the convolution with the reciprocal lattice L(h).
Obviously, equation (4.2.4.71) may also be applied to primitive lattices, occupied by two or more kinds of atoms. Then the structure factors
are merely replaced by the atomic scattering factors
and the
are equivalent to the concentrations of atoms
. In terms of the
(Warren short-range-order parameters) equation (4.2.4.71)
reads
In the simplest case of a binary system A, B
[The exponential in (4.2.4.71b)
may even be replaced by a cosine term owing to the centrosymmetry of this particular case.]
It should be mentioned that the formulations of the problem in terms of pair probabilities, pair correlation functions, short-range-order parameters or concentration waves (Krivoglaz, 1969) are equivalent. Using continuous electron (or nuclear) density functions where site occupancies are implied, the Patterson function may be used, too (Cowley, 1981
).
As shown above in the case of random distributions all are zero, except for
. Consequently,
may be replaced by
According to (4.2.4.59)
and (4.2.4.61)
the diffuse scattering can be given by the Fourier transformation of
or with (4.2.4.72)
:
Fourier transformation gives
This is the most general form of any diffuse scattering of systems ordered randomly (`Laue scattering'). Occasionally it is called `incoherent scattering' (see Section 4.2.2
).
The diffuse scattering of a disordered binary system without displacements of the atoms has already been discussed in Section 4.2.4.4.1. It could be shown that all distribution functions
are mutually dependent and may be replaced by a single function [cf. (4.2.4.69)
]. In that case
was valid for all. This condition, however, may be violated in multi-component systems. If a tendency towards an
order in a ternary system is assumed, for example,
is apparently different from
. In this particular case it is useful to introduce
and their Fourier transforms
, respectively.
The asymmetric correlation functions are therefore expressed by Consequently,
(4.2.4.70)
and
(4.2.4.71)
may be separated according to the symmetric and antisymmetric contributions. The final result is:
Obviously, antisymmetric contributions to line profiles will only occur if structure factors of acentric cell occupations are involved. This important property may be used to draw conclusions with respect to structure factors involved in the statistics. It should be mentioned here that the Fourier transform of the antisymmetric function
is imaginary and antisymmetric. Since the last term in (4.2.4.74)
is also imaginary, the product of the two factors in brackets is real, as it should be.
Even small displacements may have an important influence on the problem of propagation of order. Therefore, no structural treatments other than the introduction of formal parameters (e.g. Landau's theory) have been published in the literature. Most of the examples with really reliable results refer to binary systems, and even these represent very crude approximations, as will be shown below. For this reason we shall restrict ourselves here to binary systems, although general formulae where displacements are included may be developed in a formal way.
Two kinds of atoms, and
, are considered. Obviously, the position of any given atom is determined by its surroundings. Their extension depends on the forces acting on the atom under consideration. These may be very weak in the case of metals (repulsive forces, so-called `size effect'), but long-range effects have to be expected in ionic crystals. For the development of formulae authors have assumed that small displacements
may be assigned to the pair correlation functions
by adding a phase factor
which is then expanded in the usual way:
The displacements and correlation probabilities are separable if the change of atomic scattering factors in the angular range considered may be neglected. The formulae in use are given in the next section. As shown below, this method represents nothing other than a kind of average over certain sets of displacements. For this purpose the correct solution of the problem has to be discussed. In the simplest model the displacements are due to next-nearest neighbours only. It is assumed further that the configurations rather than the displacements determine the position of the central atom and a general displacement of the centre of the first shell does not occur (no influence of a strain field). Obviously, the formal correlation function of pairs is not independent of displacements. This difficulty may be avoided either by assuming that the pair correlation function has already been separated from the diffraction data, or by theoretical calculations of the correlation function (mean-field method) (Moss, 1966
; de Fontaine, 1972
, 1973
). The validity of this procedure is subject to the condition that the displacements have no influence on the correlation functions themselves.
The observation of a periodic average structure justifies the definition of a periodic array of origins which normally depends on the degree of order. Local deviations of origins may be due to fluctuations in the degree of order and due to the surrounding atoms of a given site occupation. For example, a b.c.c. lattice with eight nearest neighbours is considered. It is assumed that only these have an influence on the position of the central atom owing to different forces of the various configurations. With two kinds of atoms, there are possible configurations of the cluster (central atom plus 8 neighbours). Symmetry considerations reduce this number to 28. Each is characterized by a displacement vector. Hence, their a priori probabilities and the propagation of 28 different configurations have to be determined. Since each atom has to be considered as the centre once, this problem may be treated by introducing 28 different atomic scattering factors as determined from the displacements:
. The diffraction problem has to be solved with the aid of the propagation of order of overlapping clusters. This is demonstrated by a two-dimensional model with four nearest neighbours (Fig. 4.2.4.1)
. Here the central and the neighbouring cluster (full and broken lines) overlap with two sites in the, e.g., x direction. Hence, only neighbouring clusters with the same overlapping pairs are admitted. These restrictions introduce severe difficulties into the problem of propagation of cluster ordering which determines the displacement field. Since it was assumed that the problem of pair correlation had been solved, the cluster probabilities may be derived by calculating
In the product only next-nearest neighbours have to be included. This must be performed for the central cluster
and for the reference cluster at
, because all are characterized by different displacements. So far, possible displacement of the centre has not been considered; this may also be influenced by the problem of propagation of cluster ordering. These displacement factors should best be attached to the function describing the propagation of order which determines, in principle, the local fluctuations of the lattice constants (strain field etc.). This may be understood by considering a binary system with a high degree of order but with atoms of different size. Large fluctuations of lattice constants are involved in the case of exsolution of the two components because of their different lattice parameters, but they become small in the case of superstructure formation where a description in terms of antiphase domains is reasonable (equal lattice constants). This example demonstrates the mutual dependence of ordering and displacements which is mostly neglected in the literature.
The method of assigning phase factors to the pair correlation function is now discussed. Pair correlation functions average over all pairs of clusters having the same central atom. An analogous argument holds for displacements: using pair correlations for the determination of displacements means nothing other than averaging over all displacements caused by various clusters around the same central atom. There remains the general strain field due to the propagation of order, whereas actual displacements of atoms are realized by fluctuations of configurations. Since large fluctuations of this type occur in highly disordered crystals, the displacements become increasingly irrelevant. Hence, the formal addition of displacement factors to the pair correlation function does not yield too much information about the structural basis of the displacements. This situation corresponds exactly to the relationship between a Patterson function and a real structure: the structure has to be found which explains the more or less complicated function completely, and its unique solution is rather difficult. These statements seem to be necessary because in most publications related to this subject these considerations are not taken into account adequately. Displacements usually give rise to antisymmetric contributions to diffuse reflections. As pointed out above, the influence of displacements has to be considered as phase factors which may be attached either to the structure factors or to the Fourier transforms of the correlation functions in equation (4.2.4.71)
. As has been mentioned in the context of equation (4.2.4.74)
antisymmetric contributions will occur if acentric structure factors are involved. Apparently, this condition is met by the phase factors of displacements. In consequence, antisymmetric contributions to diffuse reflections may also originate from the displacements. This fact can also be demonstrated if the assignment of phase factors to the Fourier transforms of the correlation functions is advantageous. In this case equations (4.2.4.69a,b)
are no longer valid because the functions
become complex. The most important change is the relation corresponding to (4.2.4.69)
:
Strictly speaking we have to replace the a priori probabilities
by complex numbers
which are determined by the position of the central atom. In this way all correlations between displacements may be included with the aid of the clusters mentioned above. To a rough approximation it may be assumed that no correlations of this kind exist. In this case the complex factors may be assigned to the structure factors involved. Averaging over all displacements results in diffraction effects which are very similar to a static Debye–Waller factor for all structure factors. On the other hand, the thermal motion of atoms is treated similarly. Obviously both factors affect the sharp Bragg peaks. Hence, this factor can easily be determined by the average structure which contains a Debye–Waller factor including static and thermal displacements. It should be pointed out, however, that these static displacements cause elastic diffuse scattering which cannot be separated by inelastic neutron scattering techniques.
A careful study of the real and imaginary parts of and
and their Fourier transforms results, after some calculations, in the following relation for diffuse scattering:
It should be noted that all contributions are real. This follows from the properties of Fourier transforms of symmetric and antisymmetric functions. All
are antisymmetric; hence they generate antisymmetric contributions to the line profiles. In contrast to equation (4.2.4.75)
, the real and the imaginary parts of the structure factors contribute to the asymmetry of the line profiles.
In substitutional binary systems (primitive cell with only one sublattice) the Borie–Sparks method is widely used (Sparks & Borie, 1966; Borie & Sparks, 1971
). The method is formulated in the short-range-order-parameter formalism. The diffuse scattering may be separated into two parts (a) owing to short-range order and (b) owing to static displacements.
Corresponding to the expansion (4.2.4.75),
, where
is given by equation (4.2.4.71b)
and the correction terms
and
relate to the linear and the quadratic term in (4.2.4.75)
. The intensity expression will be split into terms of A–A, A–B, … pairs. More explicitly
and with the following abbreviations:
one finds (where the short-hand notation is self-explanatory):
With further abbreviations
If the
are independent of
in the range of measurement which is better fulfilled with neutrons than with X-rays (see below), γ, δ, ɛ are the coefficients of the Fourier series:
The functions Q, R, S are then periodic in reciprocal space.
The double sums over n, n′ may be replaced by where m, n, p are the coordinates of the interatomic vectors
and
becomes
The intensity is therefore modulated sinusoidally and increases with scattering angle. The modulation gives rise to an asymmetry in the intensity around a Bragg peak. Similar considerations for
reveal an intensity contribution
times a sum over cosine terms which is symmetric around the Bragg peaks. This term shows quite an analogous influence of local static displacements and thermal movements: an increase of diffuse intensity around the Bragg peaks and a reduction of Bragg intensities, which is not discussed here. The second contribution
has no analogue owing to the non-vanishing average displacement. The various diffuse intensity contributions may be separated by symmetry considerations. Once they are separated, the single coefficients may be determined by Fourier inversion. Owing to the symmetry constraints there are relations between the displacements
and, in turn, between the γ and Q components. The same is true for the δ, ɛ, and R, S components. Consequently, there are symmetry conditions for the individual contributions of the diffuse intensity which may be used to distinguish them. Generally the total diffuse intensity may be split into only a few independent terms. The single components of Q, R, S may be expressed separately by combinations of diffuse intensities which are measured in definite selected volumes in reciprocal space. Only a minimum volume must be explored in order to reveal the behaviour over the whole reciprocal space. This minimum repeat volume is different for the single components:
, Q, R, S or combinations of them.
The Borie–Sparks method has been applied very frequently to binary and even ternary systems; some improvements have been communicated by Bardhan & Cohen (1976). The diffuse scattering of the historically important metallic compound Cu3Au has been studied by Cowley (1950a
,b
), and the pair correlation parameters could be determined. The typical fourfold splitting was found by Moss (1966)
and explained in terms of atomic displacements. The same splitting has been found for many similar compounds such as Cu3Pd (Ohshima et al., 1976
), Au3Cu (Bessière et al., 1983
), and Ag1−xMgx (x = 0.15–0.20) (Ohshima & Harada, 1986
). Similar pair correlation functions have been determined. In order to demonstrate the disorder parameters in terms of structural models, computer programs were used (e.g. Gehlen & Cohen, 1965
). A similar microdomain model was proposed by Hashimoto (1974
, 1981
, 1983
, 1987
). According to approximations made in the theoretical derivation the evaluation of diffuse scattering is generally restricted to an area in reciprocal space where the influence of displacements is of the same order of magnitude as that of the pair correlation function. The agreement between calculation and measurement is fairly good but it should be remembered that the amount and quality of the experimental information used is low. No residual factors are so far available; these would give an idea of the reliability of the results.
The more general case of a multi-component system with several atoms per lattice point was treated similarly by Hayakawa & Cohen (1975). Sources of error in the determination of the short-range-order coefficients are discussed by Gragg et al. (1973)
. In general the assumption of constant
produces an incomplete separation of the order- and displacement-dependent components of diffuse scattering. By an alternative method, by separation of the form factors from the Q, R, S functions and solving a large array of linear relationships by least-squares methods, the accuracy of the separation of the various contributions is improved (Tibbals, 1975
; Georgopoulos & Cohen, 1977
; Wu et al., 1983
). The method does not work for neutron diffraction. Also, the case of planar short-range order with corresponding diffuse intensity along rods in reciprocal space may be treated along the Borie & Sparks method (Ohshima & Moss, 1983
).
Multi-wavelength methods taking advantage of the variation of the structure factor near an absorption edge (anomalous dispersion) are discussed by Cenedese et al. (1984). The same authors show that in some cases the neutron method allows for a contrast variation by using samples with different isotope substitution.
Evaluation of diffuse-scattering data from powder diffraction follows the same theoretical formulae developed for the determination of the radial distribution function for glasses and liquids (Debye & Menke, 1931; Warren & Gingrich, 1934
). The final formula for random distributions may be given as (Fender, 1973
)
represents the number of atoms at distance
from the origin. An equivalent expression for a substitutional binary alloy is
In the literature small concentrations are treated in terms of fluctuations of the functions as defined in equation (4.2.4.56)
. Generally we prefer the introduction of the distribution function of the defects or clusters. Since this problem has already been treated in Section 4.2.4.4.3
only some very brief remarks are given here. The most convenient way to derive the distribution function correctly from experimental data is the use of low-angle scattering which generally shows one or more clear maxima caused by partly periodic properties of the distribution function. For the deconvolution of the distribution function, received by Fourier transformation of the corrected diffused low-angle scattering, the reader is referred to the relevant literature. Since deconvolutions are not unique some reasonable assumptions are necessary for the final solution. Anomalous scattering may be very helpful if applicable.
As mentioned above, the cluster method may be useful for the interpretation of disorder problems. In the general formula of diffuse scattering of random distributions equation (4.2.2.13) may be used. Here
describes the sharp Bragg maxima, while
represents the contribution to diffuse scattering. Correlation effects can also be taken into account by using clusters of sufficient size if their distribution may be considered as random in good approximation. The diffuse intensity is then given by
where
represents the difference structure factor of the νth cluster and
is its a priori probability. Obviously equation (4.2.4.84)
is of some use in two cases only. (1) The number of clusters is sufficiently small and meets the condition of nearly random distribution. In principle, its structure may then be determined with the aid of refinement methods according to equation (4.2.4.84)
. Since the second term is assumed to be known from the average structure, the first term may be evaluated by introducing as many parameters as there are clusters involved. A special computer program for incoherent refinement has to be used if more than one representative cluster has to be introduced. In the case of more clusters, constraints are necessary. (2) The number of clusters with similar structures is not limited. It may be assumed that their size distribution may be expressed by well known analytical expressions, e.g. Gaussians or Lorentzians. The distribution is still assumed to be random.
An early application of the cluster method was the calculation of the diffuse intensity of Guinier–Preston zones, where a single cluster is sufficient (see, e.g., Gerold, 1954; Bubeck & Gerold, 1984
). Unfortunately no refinements of cluster structures have so far been published. The full theory of the cluster method was outlined by Jagodzinski & Haefner (1967)
.
Some remarks on the use of residual factors should be added here. Obviously the diffuse scattering may be used for refinements in a similar way as in conventional structure determination. For this purpose a sufficiently small reciprocal lattice has to be defined. The size of the reciprocal cell has to be chosen with respect to the maximum gradient of diffuse scattering. Then the diffuse intensity may be described by a product of the real intensity distribution and the small reciprocal lattice. Fourier transformation yields the convolution of the real disordered structure and a large unit cell. In other words, the disordered structure is subdivided into large units and subsequently superimposed (`projected') in a single cell. In cases where a clear model of the disorder could be determined, a refinement procedure for atomic and other relevant parameters can be started. In this way a residual factor may be determined. A first approach has been elaborated by Epstein & Welberry (1983) in the case of substitutional disorder of two molecules.
The outstanding limiting factor is the collection of weak intensity data. The amount increases rapidly with the complexity of the structure and could even exceed by far the amount which is needed in the case of protein structure refinement. Hence, it seems to be reasonable to restrict the measurement to distinct areas in reciprocal space. Most of these publications, however, use too little information when compared with the minimum of data which would be necessary for the confirmation of the proposed model. Hence, physical and chemical considerations should be used as an additional source of information.
Apart from experimental arguments in favour of either method, there are some specific points which should be mentioned in this context. The diffuse scattering in question must be separated from Bragg scattering and from other diffuse-scattering contributions. Generally both methods are complementary: neutrons are preferable in cases where X-rays show only a small scattering contrast: (heavy) metal hydrides, oxides, carbides, Al–Mg distribution etc. In favourable cases it is possible to suppress (nuclear) Bragg scattering of neutrons when isotopes are used so that for all equivalent positions. Another way to separate Bragg peaks is to record the diffuse intensity, if possible, at low
values. This can be achieved either by measurement at low θ angles or by using long wavelengths. For reasons of absorption the latter point is the domain of neutron scattering. Exceeding the Bragg cut-off, Bragg scattering is ruled out. In this way `diffuse' background owing to multiple Bragg scattering is avoided. Other diffuse-scattering contributions which increase with the
value are thus also minimized: thermal diffuse scattering (TDS) and scattering due to long-range static displacements. On the other hand, lattice distortions, Huang scattering, … should be measured at large values of
. TDS can be separated by purely elastic neutron methods within the limits given by the energy resolution of an instrument. This technique is of particular importance at higher temperatures where TDS becomes remarkably strong. Neutron scattering is a good tool only in cases where (isotope/spin-)incoherent scattering is not too strong. In the case of magnetic materials confusion with paramagnetic diffuse scattering could occur. This is also important when electrons are trapped by defects which themselves act as paramagnetic centres.
As mentioned in Section 4.2.4.4.4 the evaluations of the γ, δ, ɛ depend on the assumption that the f's do not depend on
strongly within the range of measurement. Owing to the atomic form factor, this is not always well approximated in the X-ray case and is one of the main sources of error in the determination of the short-range-order parameters.
Some brief remarks concerning the dynamic properties of defects as discussed in the previous sections now follow. Mass defects (impurity atoms), force-constant defects etc. influence the dynamic properties of the undistorted lattice and one could think of a modified TDS as discussed in Chapter 4.1
. In the case of low defect concentrations special vibrational modes characterized by large amplitudes at the defect with frequency shifts and reduced lifetimes (resonant modes) or vibrational modes localized in space may occur. Other modes with frequencies near these particular modes may also be affected. Owing to the very low intensity of these phenomena their influence on the normal TDS is negligible and may be neglected in diffuse-scattering work. Theoretical treatments of crystals with higher defect concentrations are extremely difficult and not developed so far. For further reading see Böttger (1983)
.
Molecular crystals show in principle disorder phenomena similar to those discussed in previous sections (substitutional or displacement disorder). Here we have to replace the structure factors , used in the previous sections, by the molecular structure factors in their various orientations. Usually these are rapidly varying functions in reciprocal space which may obscure the disorder diffuse scattering. Disorder in molecular crystals is treated by Guinier (1963)
, Amorós & Amorós (1968)
, Flack (1970)
, Epstein et al. (1982)
, Welberry & Siripitayananon (1986
, 1987
), and others.
A particular type of disorder is very common in molecular and also in ionic crystals: the centres of masses of molecules or ionic complexes form a perfect 3D lattice but their orientations are disordered. Sometimes these solids are called plastic crystals. For comparison, the liquid-crystalline state is characterized by an orientational order in the absence of long-range positional order of the centres of the molecules. A clear-cut separation is not possible in cases where translational symmetry occurs in low dimension, e.g. in sheets or parallel to a few directions in crystal space. For discussion of these mesophases see Chapter 4.4
.
An orientationally disordered crystal may be imagined in a static picture by freezing molecules in different sites in one of several orientations. Local correlations between neighbouring molecules and correlations between position and orientation may be responsible for orientational short-range order. Often thermal reorientations of the molecules are related to an orientationally disordered crystal. Thermal vibrations of the centres of masses of the molecules, librational or rotational excitations around one or more axes of the molecules, jumps between different equilibrium positions or diffusion-like phenomena are responsible for diffuse scattering of dynamic origin. As mentioned above the complexity of molecular structures and the associated large number of thermal modes complicate a separation from static disorder effects.
Generally high Debye–Waller factors are typical for scattering of orientationally disordered crystals. Consequently only a few Bragg reflections are observable. A large amount of structural information is stored in the diffuse background. It has to be analysed with respect to an incoherent and coherent part, elastic, quasielastic or inelastic nature, short-range correlations within one and the same molecule and between orientations of different molecules, and cross correlations between positional and orientational disorder scattering. Combined X-ray and neutron methods are therefore highly recommended.
On the assumption of a well ordered 3D lattice, a general expression for the scattering by an orientationally disordered crystal with one molecule per unit cell may be given. This is a very common situation. Moreover, orientational disorder is frequently related to molecules with an overall `globular' shape and consequently to crystals of high (in particular, averaged) spherical symmetry. In the following the relevant equations are given for this situation; these are discussed in some detail in a review article by Fouret (1979). The orientation of a molecule is characterized by a parameter
, e.g. the set of Eulerian angles of three molecular axes with respect to the crystal axes:
(D possible different orientations). The equilibrium position of the centre of mass of a molecule in orientation
is given by
, the equilibrium position of atom k within a molecule l in orientation
by
and a displacement from this equilibrium position by
. Averaging over a long time, i.e. supposing that the lifetime of a discrete configuration is long compared with the period of atomic vibrations, the observed intensity may be deduced from the intensity expression corresponding to a given configuration at time t:
Averaging procedures must be carried out with respect to the thermal vibrations (denoted by an overbar) and over all configurations (symbol
). The centre-of-mass translational vibrations and librations of the molecules are most important in this context. (Internal vibrations of the molecules are assumed to be decoupled and remain unconsidered.)
Thermal averaging gives (cf. Chapter 4.1
)
In the harmonic approximation
is replaced by
. This is, however, a more or less crude approximation because strongly anharmonic vibrations are quite common in an orientationally disordered crystal. In this approximation
becomes
is equal to
(Debye–Waller factor) and depends on the specific configuration
.
includes all the correlations between positions, orientations and vibrations of the molecules.
Averaging over different configurations demands a knowledge of the orientational probabilities. The probability of finding molecule l in orientation is given by
. The double probability
gives the probability of finding two molecules
in different orientations
and
, respectively. In the absence of correlations between the orientations we have:
. If correlations exist:
where
defines the conditional probability that molecule
has the orientation
if molecule l has the orientation
. For long distances between l and
tends to
.
The difference characterizes, therefore, the degree of short-range orientational correlation. Note that this formalism corresponds fully to the
,
used in the context of translational disorder.
The average structure factor, sometimes called averaged form factor, of the molecule is given by
|
In certain cases and with simplifying assumptions, [equation (4.2.4.89)
] and
[equation (4.2.4.92)
] may be calculated. Assuming only one molecule per unit cell and treating the molecule as a rigid body, one derives from the structure factor of an ordered crystal
and
If the molecules have random orientation in space the following expressions hold [see, e.g., Dolling et al. (1979)
]:
is the zeroth order of the spherical Bessel functions and describes an atom k uniformly distributed over a shell of radius
.
In practice the molecules perform more or less finite librations about the main orientation. The structure factor may then be found by the method of symmetry-adapted functions [see, e.g., Press (1973), Press & Hüller (1973)
, Dolling et al. (1979)
, Prandl (1981
, and references therein)].
is the νth order of spherical Bessel functions, the coefficients
characterize the angular distribution of
,
are the spherical harmonics where
denote polar coordinates of H.
The general case of an arbitrary crystal, site and molecular symmetry and the case of several symmetrically equivalent orientationally disordered molecules per unit cell are treated by Prandl (1981); an example is given by Hohlwein et al. (1986)
. As mentioned above, cubic plastic crystals are common and therefore mostly studied up to now. The expression for
may then be formulated as an expansion in cubic harmonics,
:
(
are modified expansion coefficients.)
Taking into account isotropic centre-of-mass translational displacements, which are not correlated with the librations, we obtain: U is the mean-square translational displacement of the molecule. Correlations between translational and vibrational displacements are treated by Press et al. (1979)
.
Equivalent expressions for crystals with symmetry other than cubic may be found from the same concept of symmetry-adapted functions [tables are given by Bradley & Cracknell (1972)].
The final terms in equations (4.2.4.90) and (4.2.4.93)
concern correlations between the orientations of different molecules. Detailed evaluations need a knowledge of a particular model. Examples are compounds with nitrate groups (Wong et al., 1984
; Lefebvre et al., 1984
), CBr4 (More et al., 1980
, 1984
), and many others (see Sherwood, 1979
). The situation is even more complicated when a modulation wave with respect to the occupation of different molecular orientations is superimposed. A limiting case would be a box-like function describing a pattern of domains. Within one domain all molecules have the same orientation. This situation is common in ferroelectrics where molecules exhibit a permanent dipole moment. The modulation may occur in one or more directions in space. The observed intensity in this type of orientationally disordered crystal is characterized by a system of more or less diffuse satellite reflections. The general scattering theory of a crystal with occupational modulation waves follows the same lines as outlined in Section 4.2.3.1.
To conclude this chapter experimental aspects are summarized which are specifically important in diffuse-scattering work. The summary is restricted to film methods commonly used in laboratories and (X-ray or neutron) diffractometer measurements. Sophisticated special techniques and instruments at synchrotron facilities and reactors dedicated to diffuse-scattering work are not described here. The full merit of these machines may be assessed after inspection of corresponding user handbooks which are available upon request. Also excluded from this section are instruments and methods related to diffuse scattering at low angles, i.e. small-angle scattering techniques. Although no fundamental differences exist between an X-ray experiment in a laboratory and at a synchrotron facility, some specific points have to be considered in the latter case. These are discussed by Matsubara & Georgopoulos (1985), Oshima & Harada (1986), and Ohshima et al. (1986)
.
Generally, diffuse scattering is weak in comparison with Bragg scattering, anisotropically and inhomogeneously distributed in reciprocal space, elastic, inelastic, or quasi-elastic in origin. It is frequently related to more than one structural element, which means that different parts may show different behaviour in reciprocal space and/or on an energy scale. Therefore special care has to be taken concerning the following points: (1) type of experiment: X-rays or neutrons, film or diffractometer/spectrometer, single crystal or powder; (2) strong sources; (3) best choice of wavelength (or energy) of incident radiation if no `white' technique is used; (4) monochromatic and focusing techniques; (5) sample environment and background reduction; (6) resolution and scanning procedure in diffractometer or densitometer recording.
On undertaking an investigation of a disorder problem by an analysis of the diffuse scattering an overall picture should first be recorded by X-ray diffraction experiments. Several sections through reciprocal space help to define the problem. For this purpose film methods are preferable. Cameras with relatively short crystal–film distances avoid long exposure times. Unfortunately, there are some disorder problems which cannot be tackled by X-ray methods. X-rays are rather insensitive for the elucidation of disorder problems where light atoms in the presence of heavy atoms play the dominant role, or when elements are involved which scarcely differ in X-ray scattering amplitudes (e.g. Al/Si/Mg). In these cases neutrons have to be used at an early stage. If a significant part of the diffuse scattering is suspected not to be of static origin concomitant purely elastic, quasi-elastic or inelastic neutron experiments have to be planned from the very beginning.
Because diffuse scattering is usually weak, intense radiation sources are needed, whereas the background level should be kept as low as possible. Coming to the background problem later, we should make some brief remarks concerning sources. Even a normal modern X-ray tube is a stronger source, defined by the flux density from an anode (number of photons cm−2 s−1), than a reactor with the highest available flux. For this reason most experimental work which can be performed with X-rays should be. Generally the characteristic spectrum will be used, but special methods have been developed where the white X-ray spectrum is of interest (see below). A most powerful source in this respect is a modern synchrotron storage ring (see, e.g., Kunz, 1979). With respect to rotating anodes one should bear in mind not only the power but also the flux density, because there is little merit for a broad focus in diffuse-scattering work (separation of sharp and diffuse scattering). One can suppose that synchrotron radiation in the X-ray range will also play an important role in the field of monochromatic diffraction methods, owing to the extremely high brilliance of these sources (number of quanta cm−2, sr−1, s−1 and wavelength interval). Diffuse neutron-diffraction work may only be performed on a high- or medium-flux reactor. Highly efficient monochromator systems are necessary. In combination with time-of-flight neutron methods pulsed sources are nowadays equivalent to reactors (Windsor, 1982
).
If film and (X-ray) diffractometer methods are compared, film techniques are highly recommended at an early stage to give a general survey of the disorder problem. Routine X-ray techniques such as rotation photographs, Weissenberg or precession techniques may be used. The Weissenberg method is preferred to the precession method in most cases because of the comparatively larger coverage of reciprocal space (with the same wavelength). The drawback of a distorted image of the reciprocal space may be compensated by digitizing the film blackening via a densitometer recording and subsequent plotting. With this procedure a distorted section through the reciprocal lattice may be transformed into a form suitable for easy interpretation (Welberry, 1983).
Frequently used are standing-crystal techniques in combination with monochromatic radiation, usually called monochromatic Laue techniques (see, e.g., Flack, 1970). The Noromosic technique (Jagodzinski & Korekawa, 1973
) is characterized by a convergent monochromatic beam which simulates an oscillation photograph over a small angular range. Heavily overexposed photographs, with respect to Bragg scattering, allow for sampling of diffuse intensity if a crystal is oriented in such a way that there is a well defined section between the Ewald sphere and the diffuse phenomenon under consideration. By combining single Noromosic photographs, Weissenberg patterns can be simulated. This relatively tedious way of comparison with a true Weissenberg photograph is often unavoidable because the heavily overexposed Bragg peaks obscure weak diffuse phenomena over a considerable area of a photograph. Furthermore, standing pictures are pointwise measurements in comparison with the normal continuous pattern with respect to the crystal setting. Long-exposure Weissenberg photographs are therefore not equivalent to a smaller set of standing photographs. In this context it should be mentioned that a layer-line screen has not only the simple function of a selecting diaphragm, but the gap width determines the resolution volume within which diffuse intensity is collected (Welberry, 1983
). For further discussion of questions of resolution see below. Single-crystal diffractometer measurements, either in the X-ray or in the neutron case, are frequently adopted for quantitative measurements of diffuse intensities. Microdensitometer recording of X-ray films is an equivalent method, incorporating corrections for background and other factors into this procedure. A comparison of Weissenberg and diffractometer methods for the measurement of diffuse scattering is given by Welberry & Glazer (1985)
.
In the case of powder-diffractometer experiments preferred orientations/textures could lead to a complete misidentification of the problem. Single-crystal experiments are preferable in some respect, because diffuse phenomena in a powder diagram may be analysed only after an idea about the disorder has been obtained and only in special cases. Nevertheless, high-resolution powder investigations give quick supporting information, e.g. about superlattice peaks, split reflections, lattice strains, domain size effects, lattice-constant change related to a disorder effect etc.
Before starting an experiment of any kind, one should specify the optimum wavelength. This is important with respect to the problem to be solved: e.g., point defects cause diffuse scattering to fall off with increasing scattering vector; short-range ordering between clusters causes broad peaks corresponding to large d spacings; lattice-relaxation processes induce a broadening of the interferences (Huang scattering); or static modulation waves with long periods give rise to satellite scattering close to Bragg peaks. In all these cases a long wavelength is preferable. On the other hand, a shorter wavelength is needed if diffuse phenomena are structured in a sense that broad peaks are observable up to large reciprocal vectors, or diffuse streaks or planes have to be recorded up to high values of the scattering vector in order to decide between different models. The -dependence of the scattered intensity, in the framework of the kinematical theory, is a crucial point for exposure or data-acquisition times. Moreover, the accuracy with which an experiment can be carried out suffers from a short wavelength: generally, momentum as well as energy resolution are lower. For a quantitative estimate detailed considerations of resolution in reciprocal (and energy) space are needed. Special attention must be paid to absorption phenomena, in particular when (in the X-ray case) an absorption edge of an element of the sample is close to the wavelength used.
must be avoided in combination with film methods owing to the K edge of Br. Strong fluorescence scattering may completely obscure weak diffuse-scattering phenomena. In comparison with X-rays, the generally lower absorption coefficients of neutrons of any wavelength makes absolute measurements easier. This also allows the use of larger sample volumes, which is not true in the X-ray case. An extinction problem does not exist in diffuse-scattering work. In particular, the use of a long wavelength is profitable when the main diffuse contributions can be recorded within an Ewald sphere as small as the Bragg cut-off of the sample:
; a contamination by Bragg scattering can then be avoided.
This is also advantageous from a different point of view: because the contribution of thermal diffuse scattering increases with increasing scattering vector H, the relative amount of this component becomes negligibly small within the first reciprocal cell.
Highly monochromatic radiation should be used in order to eliminate broadening effects due to the wavelength distribution. Focusing monochromators help to overcome the lack of luminosity. A focusing technique, in particular a focusing camera geometry, is very helpful for deciding between geometrical broadening and `true' diffuseness. With good success a method is used where in a monochromatic divergent beam the sample is placed with its selected axis lying in the scattering plane of the monochromator (Jagodzinski, 1968). The specimen is fully embedded in the incident beam which is focused onto the film. By this procedure the influence of the sample size is suppressed in one dimension. In an oscillation photograph a high resolution perpendicular to the diffuse layer lines may thus be achieved.
A serious problem is a careful suppression of background scattering. Incoherent X-ray scattering as an inherent property of a sample occurs as continuous blackening in the case of fluorescence, or as scattering at high 2θ angles owing to Compton scattering or `incoherent' inelastic effects. Protecting the film by a thin Al or Ni foil is of some help against fluorescence, but also attenuates the diffuse intensity. Scratching the film emulsion after the exposure from the `front' side of the film is another possibility for reducing the relative amount of the lower-energy fluorescence radiation. Obviously, energy-dispersive counter methods are highly efficient in this case (see below). Air scattering produces a background at low 2θ angles which may easily be avoided by special slit systems and evacuation of the camera.
In X-ray or neutron diffractometer measurements incoherent and multiple scattering contribute to a background which varies only slowly with 2θ and can be subtracted by linear interpolation or fitting a smooth curve, or can even be calculated quantitatively and then subtracted. In neutron diffraction there are rare cases when monoisotopic and `zero-spin' samples are available and, consequently, the corresponding incoherent scattering part vanishes completely. In some cases a separation of coherent and incoherent neutron scattering is possible by polarization analysis (Gerlach et al., 1982). An `empty' scan can take care of instrumental background contributions. Evacuation or controlled-atmosphere studies need a chamber which may give rise to spurious scattering. This can be avoided if no part of the vacuum chamber is hit by the primary beam. The problem is less serious in neutron work. Mounting a specimen, e.g., on a silica fibre with cement, poorly aligned collimators or beam catchers are further sources. Sometimes a specimen has to be enclosed in a capillary which will always be hit by the incident beam. Careful and tedious experimental work is necessary in the case of low- and high-temperature (or -pressure) investigations which have to be carried out in many disorder problems. Whereas the experimental situation is again less serious in neutron scattering, there are large problems with scattering from walls and containers in X-ray work. Most of the X-ray investigations have therefore been made on quenched samples. Because TDS is dominating at high temperatures, also in the presence of a static disorder problem, the quantitative separation can hardly be carried out in the case of high experimental background. Calculation and subtraction of the TDS is possible in principle, but difficult in practice.
A quantitative analysis of diffuse-scattering data is essential for a definite decision about a disorder model. By comparison of calculated and corrected experimental data the magnitudes of the parameters of the structural disorder model may be derived. A careful analysis of the data requires, therefore, corrections for polarization (X-ray case), absorption and resolution. These may be performed in the usual way for polarization and absorption. Very detailed considerations, however, are necessary for the question of instrumental resolution which depends, in addition to other factors, on the scattering angle and implies intensity corrections analogous to the Lorentz factor used in structure analysis from sharp Bragg reflections.
Resolution is conveniently described by a function, , which is defined as the probability of detecting a photon or neutron with momentum transfer
when the instrument is set to measure
. This function R depends on the instrumental parameters (collimations, mosaic spread of monochromator, scattering angle) and the spectral width of the source. Fig. 4.2.5.1
shows a schematic sketch of a diffractometer setting. Detailed considerations of resolution volume in X-ray diffractometry are given by Sparks & Borie (1966)
. If a triple-axis (neutron) instrument is used, for example in a purely elastic configuration, the set of instrumental parameters is extended by the mosaic of the analyser and the collimations between analyser and detector (see Chapter 4.1
).
If photographic (X-ray) techniques are used, the detector aperture is controlled by the slit width of the microdensitometer. A general formulation of R in neutron diffractometry is given by Cooper & Nathans (1968a,b
):
Gaussians are assumed for the mosaic distributions and for the transmission functions the parameters are involved in the coefficients
and
.
The general assumption of Gaussians is not too serious in the X-ray case (Iizumi, 1973). Restrictions are due to absorption which makes the profiles asymmetric. Box-like functions are considered to be better for the spectral distribution or for large apertures (Boysen & Adlhart, 1987
). These questions are treated in some detail by Klug & Alexander (1954)
. The main features, however, may also be derived by the Gaussian approximation. In practice the function R may be obtained either by calculation from the known instrumental parameters or by measuring Bragg peaks of a perfect unstrained crystal. In the latter case [cf. equation (4.2.5.2)
] the intensity profile is given solely by the resolution function. A normalization with the Bragg intensities is also useful in order to place the diffuse-scattering intensity on an absolute scale.
In single-crystal diffractometry the measured intensity is given by the convolution product of with R,
where
describes the scattering cross section for the disorder problem. In more accurate form the mosaic of the sample has to be included:
.
describes the mosaic block distribution around a most probable vector
:
.
In formulae (4.2.5.1) and (4.2.5.2)
all factors independent of 2θ are neglected. All intensity expressions have to be calculated from equations (4.2.5.2)
or (4.2.5.2a)
. In the case of a dynamical disorder problem, i.e. when the differential cross section also depends on energy transfer
ω, the integration must be extended over energy.
The intensity variation of diffuse peaks with 2θ was studied in detail by Yessik et al. (1973). In principle all special cases are included there. In practice, however, some important simplifications can be made if
is either very broad or very sharp compared with R, i.e. for Bragg peaks, sharp streaks, `thin' diffuse layers or extended 3D diffuse peaks (Boysen & Adlhart, 1987
).
In the latter case the cross section may be treated as nearly constant over the resolution volume so that the corresponding `Lorentz' factor is independent of 2θ:
For a diffuse plane within the scattering plane with very small thickness and slowly varying cross section within the plane, one derives for a point measurement in the plane:
exhibiting an explicit dependence on θ (
determining an effective vertical divergence before the sample, the divergence before the detector and the vertical mosaic spread of the sample, respectively).
In the case of relaxed vertical collimations
i.e. again independent of θ.
Scanning across the diffuse layer in a direction perpendicular to it one obtains an integrated intensity which is also independent of 2θ. This is even true if approximations other than Gaussians are used.
If, on the other hand, an equivalent diffuse plane is positioned perpendicular to the scattering plane, the equivalent expression for of a point measurement is given by
where ψ gives the angle between the line of intersection between the diffuse and the scattering plane and the vector
. The coefficients
,
,
,
, η are either instrumental parameters or functions of them, defining horizontal collimations and mosaic spreads. In the case of a (sharp) X-ray line (produced, for example, by filtering) the last two terms in equation (4.2.5.5)
vanish.
The use of integrated intensities from individual scans perpendicular to the diffuse plane, now carried out within the scattering plane, again gives a Lorentz factor independent of 2θ.
In the third fundamental special case, diffuse streaking along one reciprocal direction within the scattering plane (narrow cross section, slowly varying intensity along the streak), the Lorentz factor for a point measurement may be expressed by the product where ψ now defines the angle between the streak and
. The integrated intensity taken from an H scan perpendicular to the streak has to be corrected by a Lorentz factor which is equal to
[equation (4.2.5.4)
]. In the case of a diffuse streak perpendicular to the scattering plane a relatively complicated equation holds for the corresponding Lorentz factor (Boysen & Adlhart, 1987
). Again more simple expressions hold for integrated intensities from H scans perpendicular to the streaks. Such scans may be performed in the radial direction (corresponding to a θ–2θ scan):
or perpendicular to the radial direction (within the scattering plane) (corresponding to an ω scan):
Note that only the radial scan yields a simple θ dependence
.
From these considerations it is recommended that integrated intensities from scans perpendicular to a diffuse plane or a diffuse streak should be used in order to extract the disorder cross sections. For other scan directions, which make an angle α with the intersection line (diffuse plane) or with a streak, the L factors are simply: and
, respectively.
One point should be emphasized: since in a usual experiment the integration is performed over an angle via a general
scan, an additional correction factor arises:
β is the angle between
and the scan direction
defines the coupling ratio between the rotation of the crystal around a vertical axis and the rotation of the detector shaft. Most frequently used are the so-called 1:2 and ω-scan techniques where
and 90°, respectively.
It should be mentioned that the results in the neutron case are restricted to the elastic diffuse part, since in a diffractometer measurement the inelastic part deserves special attention concerning the integration over energy by the detector (Tucciarone et al., 1971; Grabcev, 1974
). If a triple-axis instrument is used, the collimations
and
have to be replaced by effective values after the sample owing to the analyser system.
In order to optimize a single-crystal experiment, the scan direction and also the instrumental collimations should be carefully chosen according to the anisotropy of the diffuse phenomenon. If the variation of is appreciable along a streak, the resolution should be held narrow in one direction and relaxed in the other to gain intensity and the scans should be performed perpendicular to that direction. If the variation is smooth the sharpest signal is measured by a scan perpendicular to the streak. In any case, a good knowledge of the resolution and its variation with 2θ is helpful.
Even the diffuse background in powder diagrams contains valuable information about disorder. Only in very simple cases can a model be deduced from a powder pattern alone; however, a refinement of a known disorder model can favourably be carried out, e.g. the temperature dependence may be studied. On account of the intensity integration the ratio of diffuse intensity to Bragg intensity is enhanced in a powder pattern. Moreover, a powder pattern contains, in principle, all the information about the sample and might thus reveal more than single-crystal work.
The quantitative calculation of a diffuse background is also helpful in combination with Rietveld's (1969) method for refining an averaged structure by fitting (powder) Bragg reflections. In particular, for highly anisotropic diffuse phenomena characteristic asymmetric line shapes occur.
The calculation of these line shapes is treated in the literature, mostly neglecting the instrumental resolution (see, e.g., Warren, 1941; Wilson, 1949
; Jones, 1949
; and de Courville-Brenasin et al., 1981
). This is not justified if the variation of the diffuse intensity becomes comparable with that of the resolution function as is often the case in neutron diffraction. It may be incorporated by taking advantage of a resolution function of a powder instrument (Caglioti et al., 1958
). A detailed analysis of diffuse peaks is given by Yessik et al. (1973)
, the equivalent considerations for diffuse planes and streaks by Boysen (1985)
. The case of 3D random disorder (incoherent neutron scattering, monotonous Laue scattering, averaged TDS, multiple scattering or short-range-order modulations) is treated by Sabine & Clarke (1977)
.
In polycrystalline samples the cross section has to be averaged over all orientations ( = number of crystallites in the sample):
and this averaged cross section enters the relevant expressions for the convolution product with the resolution function.
A general intensity expression may be written as (Yessik et al., 1973):
P denotes a scaling factor depending on the instrumental luminosity, τ the shortest distance to the origin of the reciprocal lattice,
the corresponding symmetry-induced multiplicity,
contains the structure factor of the structural units and the type of disorder, and
describes the characteristic modulation of the diffuse phenomenon of dimension n in the powder pattern. These expressions are given below with the assumption of Gaussian line shapes of width D for the narrow extension(s). The formulae depend on a factor
, where
describes the dependence of the Bragg peaks on the instrumental parameters U, V, W (see Caglioti et al., 1958
),
|
Techniques for the measurement of diffuse scattering using a white spectrum are common in neutron diffraction. Owing to the relatively low velocity of thermal or cold neutrons, time-of-flight (TOF) methods in combination with time-resolving detector systems, placed at a fixed angle 2θ, allow for a simultaneous recording along a radial direction through the origin of reciprocal space (see, e.g., Turberfield, 1970; Bauer et al., 1975
). The scan range is limited by the Ewald spheres corresponding to
and
, respectively. With several such detector systems placed at different angles, several scans may be carried out simultaneously during one neutron pulse. There is a renaissance of these methods in combination with high-flux pulsed neutron sources.
An analogue of neutron TOF diffractometry in the X-ray case is a combination of a white source of X-rays and an energy-dispersive detector. This technique, which has been known in principle for a long time, suffered from relatively weak white sources. With the development of high-power X-ray generators or the powerful synchrotron source this method has become highly interesting in recent times. Its use in diffuse-scattering work (in particular, resolution effects) is discussed by Harada et al. (1984).
Valuable developments with a view to diffuse-scattering work are multidetectors (see, e.g., Haubold, 1975) and position-sensitive detectors for X-rays (Arndt, 1986a
) and neutrons (Convert & Forsyth, 1983
). A linear position-sensitive detector allows one to record a large amount of data at the same time, which is very favourable in powder work and also in diffuse scattering with single crystals. By combining a linear position-sensitive detector and the TOF method a whole area in reciprocal space is accessible simultaneously (Niimura et al., 1982
; Niimura, 1986
). At present, area detectors are mainly used in combination with low-angle scattering techniques, but are also of growing interest for diffuse-scattering work (Arndt, 1986b
).
References








































































































































































