International
Tables for
Crystallography
Volume I
X-ray absorption spectroscopy and related techniques
Edited by C. T. Chantler, F. Boscherini and B. Bunker

International Tables for Crystallography (2024). Vol. I. ch. 5.15, pp. 702-704
https://doi.org/10.1107/S1574870722005511

Chapter 5.15. Bayesian techniques: an overview

Hans J. Krappe,a* Elizabeta Holub-Krappe,a Takehisa Konishib and Hermann H. Rossnera

aRetired from Helmholtz-Zentrum Berlin für Materialien und Energie GmbH, Hahn-Meitner-Platz 1, 14109 Berlin, Germany, and bGraduate School of Advanced Integration Science, Chiba University, 1-33 Yayoi-cho, Inage, Chiba 263-8522, Japan
Correspondence e-mail:  [email protected]

The standard single-scattering formula gives the experimental extended X-ray absorption fine-structure (EXAFS) cross section as function of the wavenumber in terms of a set of model parameters, including the average distances of the atoms involved in producing the EXAFS signal. To solve the inverse problem of determining the model parameters from the cross section, measured over some range of wavenumbers, a Bayesian approach is used. It allows the subspace of the model-parameter space in which the data essentially determine the model parameters to be determined.

Keywords: EXAFS; model parameters; Bayesian approach.

1. Introduction

The analysis of X-ray absorption fine-structure (XAFS) data is traditionally based on least-squares fitting methods. However, if there are more parameters than the data alone allow to be determined, this leads to an ill-conditioned system of equations (Stern, 1988link to reference; Rehr & Albers, 2000link to reference). We instead propose a Bayesian approach (Krappe & Rossner, 1985alink to reference,blink to reference, 2004link to reference). This allows the subspace Mathematical symbol of the total model-parameter space Mathematical symbol in which the data decide predominantly upon the outcome of the fit to be determined, whereas in complementary space a priori assumptions essentially fix the result.

2. The direct problem and the inverse problem in X-ray data fitting

We start the discussion with the direct problem, in which the cross section for photoelectron absorption μexp(k) on the K edge or an isolated L edge is calculated as a function of the wavenumber k of the absorbed X-ray. The calculation is based on the single-scattering formula for monoatomic, unoriented samples or samples with cubic symmetry, valid for k > kcut, where the energy k22/(2m) is sufficiently larger than the threshold energy for the photoeffect (Stern, 1988link to reference; Zabinsky et al., 1995link to reference; Bunker, 1983link to reference; Tröger et al., 1994link to reference), Mathematical equationwith the length correction δRj = δRj + δRj (Fornasini et al., 2004link to reference; Rossner et al., 2006link to reference), where Mathematical equationand truncating the sum beyond the Jth term, related to the parameter kcut. The model parameters in this approach are the following: the correction factor S0 for many-electron effects, the half-path distances of the J scattering paths, Rj, j = 1, …, J, the projected Debye–Waller (DW) parameters and the anharmonicity parameters, Mathematical symbol and C3;j, respectively. The correction factor S0 does in fact depend slightly on the path j and on k. We define Mathematical symbol as the average of the actual Mathematical symbol averaged over j and over k in the relevant k-range, with the remaining k and j dependence being absorbed in the scattering amplitudes fj(k, Rj). The absorption coefficient for the embedded absorbing atom μ0(k), the amplitudes fj(k, Rj), the scattering phases φj(k) and the damping parameter λ(k) follow from an (approximate) solution of the n-electron scattering problem and are, for example, calculated by the FEFF code (Zabinsky et al., 1995link to reference; Ankudinov, 1996link to reference; Rehr & Albers, 2000link to reference; Kas et al., 2024link to reference).

To solve the inverse problem, that is to determine the model parameters from a given set of measured values μexp(El) at energies El, we first obtain wavenumbers kl = [2sm(El − E0)]1/2/ℏ, where E0 is the effective threshold energy. Since the latter is known only approximately, it is often treated as another of the model parameters to be determined by the fit. As usual, a smooth background contribution μback(k) is subtracted from μexp(k), which is obtained from a polynomial extrapolation of the pre-edge μexp to the post-edge region as described by Victoreen (1948link to reference) and Milledge (1962link to reference). Unfortunately, the precise extrapolation recipe influences the final fit parameters somewhat. The μexp(k) are measured with an energy-dependent efficiency A(k). As in Krappe & Rossner (2004link to reference), we obtain Mathematical symbol from μexp(k) by a polynomial smoothing procedure, described in detail in the appendix to Krappe & Rossner (2004link to reference). Similarly, Mathematical symbol is obtained from μ0(k). The ratio A(k) = Mathematical symbol for k > kedge is then interpreted as the overall efficiency of the experimental setup in the EXAFS energy range k > kcut.

The FEFF result for μ0 needs corrections (Krappe & Rossner, 2004link to reference). We therefore write μ0(k) = Mathematical symbol + Mathematical symbol, where δμ0(k) is represented by a cubic spline on an equally spaced grid of support points, the number of which T is to be chosen to make the spline just sufficiently flexible for the purpose for which it is introduced (Krappe & Rossner, 2004link to reference). The ordinates δμt, t = 1, …, T are also treated as model parameters to be determined in the fit together with all other model parameters.

We have therefore to fit the function Mathematical equation for l = 1, …, L, where χ is given for k > kcut by equation (1link to equation). The set of model parameters is Mathematical equation

3. Bayesian approach to an ill-posed inversion problem

We give the experimental data Mathematical symbol a Gaussian probability distribution Mathematical symbol, characterized by the quadratic form Mathematical equationIt is usually assumed that the matrix F, which is the inverse of the variance matrix, is diagonal: Mathematical symbol. One also has to associate errors with the FEFF code because the electron multiple-scattering problem can only be treated approximately, for instance by including in the sum in equation (1link to equation) only terms which contribute more than 4% of the total, and integrals have to be approximated by finite sums. We again associate with these errors a Gaussian probability that z′ is true for a given x, i.e. Mathematical symbol, with Mathematical equationwhere the matrix B is the inverse of the variance matrix.

The conditional probability Mathematical symbol that the outcome of the observation is Mathematical symbol, once x is given, may be expressed in terms of Mathematical symbol and Mathematical symbol by Mathematical equationThe integral can be evaluated analytically and yields a Gaussian in g(x), Mathematical symbol, with Mathematical equationin terms of the L × L matrix Mathematical equation

We expand g(x) around a first-guess value x(0) for the solution of the inverse problem Mathematical equationwhere the L × N matrix G is defined as Mathematical equationInserting into equation (6link to equation) and calling xx(0) in the following x to simplify the notation, one obtains a second-order polynomial in x, Mathematical equationin terms of the N × N matrix Q = GTCG and the vector Mathematical symbol. The matrix Q is the inverse of the variance matrix of the distribution Pcond in terms of the variable x.

In order to find the probability distribution for the parameter values x, once the Mathematical symbol are given, Mathematical symbol, we use Bayes' theorem Mathematical equationBayes' theorem therefore solves the inversion problem in probability theory, but at the price of introducing the prior probability Pprior(x), which expresses the knowledge that we have about the model parameters before the experiment is made. Let us assume for the moment that we have an average value x(prior) and a variance matrix A−1 so that Mathematical symbol with Mathematical equationand let us further restrict the matrix A to be diagonal, Mathematical symbol and choose x(prior) = x(0). Maximizing Ppost yields the normal equations Mathematical equation

4. Turchin's proposal to determine the matrix A

An optimal choice of the diagonal matrix A must obviously take the quality of the data into account. Turchin and Nozik (Turchin & Nozik, 1969link to reference; Turchin et al., 1970link to reference; Turchin, 1985link to reference) assume that there is a probability distribution of αn which depends on Mathematical symbol. They first define the conditional probability Mathematical equationwhere equations (10)link to equation and (12)link to equation have been used to obtain the last equation. The normalization parameter c of this equation only contains terms that do not depend on A, Q or b. The dependence on the matrices A and Q is shown explicitly, including contributions from the normalization factors of Pcond and Pprior.

However, instead of Mathematical symbol the inverse conditional probability Mathematical symbol is needed. It is obtained by using Bayes' theorem once more: Mathematical equationVery often the function Mathematical symbol defined in equation (14)link to equation is sharply peaked in α-space at a point Mathematical symbol. One can then choose Mathematical symbol very broadly without affecting the α dependence of Mathematical symbol around the peak. Therefore, close to Mathematical symbol one has Mathematical symbol. One may use this peak value Mathematical symbol as the regularization vector in equation (13link to equation). With the condition Mathematical symbol, equation (14)link to equation yields N nonlinear equations for the vector of eigenvalues Mathematical symbol: Mathematical equationfor n = 1, …, N.

Note that the regularization method sketched above does not require an a priori restriction of the number of model parameters. Instead, it automatically determines that subspace Mathematical symbol of the whole model-parameter space Mathematical symbol in which the data determine the outcome of the fit. In the complementary space the a priori values Mathematical symbol determine the fit. Strong error correlations between two model parameters indicate that the data do not determine them independently.

More extended versions of this article, which includes applications to some typical EXAFS and magnetic EXAFS examples, can be found in Krappe & Rossner (2004link to reference) and Krappe et al. (2014link to reference).

References

First citationAnkudinov, A. L. (1996). PhD thesis. University of Washington, USA.Google Scholar
First citationBunker, G. (1983). Nucl. Instrum. Methods Phys. Res. 207, 437–444.Google Scholar
First citationFornasini, P., a Beccara, S., Dalba, G., Grisenti, R., Sanson, A., Vaccari, M. & Rocca, F. (2004). Phys. Rev. B, 70, 174301.Google Scholar
First citationKas, J. J., Vila, F. D. & Rehr, J. J. (2024). Int. Tables Crystallogr. I, ch. 6.8, 764–769 .Google Scholar
First citationKrappe, H. J., Holub-Krappe, E., Konishi, T. & Rossner, H. H. (2014). XAS Research Review, Vol. 13. The International X-ray Absorption Society.Google Scholar
First citationKrappe, H. J. & Rossner, H. H. (1985a). Advanced Methods in the Evaluation of Nuclear Scattering Data, edited by H. J. Krappe & R. Lipperheide, pp. 215–222. Berlin, Heidelberg: Springer-Verlag.Google Scholar
First citationKrappe, H. J. & Rossner, H. H. (1985b). Advanced Methods in the Evaluation of Nuclear Scattering Data, edited by H. J. Krappe & R. Lipperheide, pp. 242–248. Berlin, Heidelberg: Springer-Verlag.Google Scholar
First citationKrappe, H. J. & Rossner, H. H. (2004). Phys. Rev. B, 70, 104102.Google Scholar
First citationMilledge, H. J. (1962). International Tables for X-ray Crystallography, Volume III, edited by C. H. MacGillavry & G. D. Rieck, pp. 171–173. Birmingham: The Kynoch Press.Google Scholar
First citationRehr, J. J. & Albers, R. C. (2000). Rev. Mod. Phys. 72, 621–654.Google Scholar
First citationRossner, H. H., Schmitz, D., Imperia, P., Krappe, H. J. & Rehr, J. J. (2006). Phys. Rev. B, 74, 134107.Google Scholar
First citationStern, E. A. (1988). X-ray Absorption: Principles, Applications, Techniques of EXAFS, SEXAFS and XANES, edited by D. C. Koningsberger & R. Prins, pp. 3–52. New York: John Wiley & Sons.Google Scholar
First citationTröger, L., Yokoyama, T., Arvanitis, D., Lederer, T., Tischer, M. & Baberschke, K. (1994). Phys. Rev. B, 49, 888–903.Google Scholar
First citationTurchin, V. F. (1985). Advanced Methods in the Evaluation of Nuclear Scattering Data, edited by H. J. Krappe & R. Lipperheide, pp. 33–49. Berlin, Heidelberg: Springer-Verlag.Google Scholar
First citationTurchin, V. F., Kozlov, V. P. & Malkevich, M. S. (1970). Usp. Fiz. Nauk, 102, 345–386.Google Scholar
First citationTurchin, V. F. & Nozik, V. Z. (1969). Izv. Akad. Nauk. SSSR Ser. Fiz. Atm. Okeana, 5, 29.Google Scholar
First citationVictoreen, J. A. (1948). J. Appl. Phys. 19, 855–860.Google Scholar
First citationZabinsky, S. I., Rehr, J. J., Ankudinov, A., Albers, R. C. & Eller, M. J. (1995). Phys. Rev. B, 52, 2995–3009.Google Scholar








































to end of page
to top of page