International
Tables for Crystallography Volume G Definition and exchange of crystallographic data Edited by S. R. Hall and B. McMahon © International Union of Crystallography 2006 
International Tables for Crystallography (2006). Vol. G. ch. 3.4, pp. 132137
Section 3.4.3. Arrangement of the dictionary
^{
a
}
Departamento de Física de la Materia Condensada, Facultad de Ciencia y Tecnología, Universidad del País Vasco, Apartado 644, 48080 Bilbao, Spain 
The msCIF dictionary detailed in Chapter 4.3 includes 19 new categories. Another 18 already exist in the core CIF dictionary, but include new items (16) or revised definitions (2). The category structure of the msCIF dictionary is summarized in Table 3.4.2.1 and is listed in full in Appendix 3.4.1. The appendix also lists for each category the section of this chapter in which the category is described.

Many of the modifications to categories that already exist in the core CIF dictionary result from the need to use more than three integer indices to label the diffracted intensities (in the cases of CELL, DIFFRN_REFLN, DIFFRN_REFLNS, DIFFRN_STANDARD_REFLN and EXPTL_CRYSTAL_FACE) or the need to use superspace symmetry (in the cases of GEOM_ANGLE, GEOM_BOND, GEOM_CONTACT, GEOM_TORSION, SPACE_GROUP and SPACE_GROUP_SYMOP). Apart from the categories that describe the atomic modulation functions, there are two that are specific to composite structures (CELL_SUBSYSTEM and CELL_SUBSYSTEMS).
The rest of this section summarizes the contents of the dictionary, organized by categories within the functional groups outlined in Table 3.1.10.1 . As in the other chapters in this part of the volume, the classification is under the headings Experimental measurements (Section 3.4.3.1), Analysis (Section 3.4.3.2), Atomicity, chemistry and structure (Section 3.4.3.3) and File metadata (Section 3.4.3.4). The msCIF dictionary adds no new data items concerned with the publication or reporting of structures to those already present in the core CIF dictionary.
The data items within each category are listed in the detailed commentary below. Where relevant, data items that represent a unique identifier for a looped list (`category keys') are listed first and are marked by a bullet (). The remaining data items in each category are listed alphabetically.
The categories relevant to a structure determination experiment are in the CELL, DIFFRN and EXPTL groups. Most of the items extend the existing core CIF categories, but the categories describing cell subsystems are new.
The categories describing the unit cell (or cells for composite structures), the wave vectors of the modulations and, for composites, the cell subsystems are as follows:
Categories marked with ¶ are already defined in the core CIF dictionary.
The data items in these categories are as follows:
(and _cell_subsystem_matrix_W_m_n for all combinations )
As explained in Section 3.4.2, the msCIF dictionary arbitrarily allows an upper limit of 11 for the dimension of superspace for which data names are defined. _cell_modulation_dimension specifies the number of additional reciprocal vectors needed to index the whole diffraction pattern and has values d ranging from 1 to 8 to express the dimensionality (3 + d) of the superspace. _cell_reciprocal_basis_description is a text field allowing a free description of the higherdimensional basis chosen.
For a composite structure, different cell subsystems may be specified. Each such subsystem is identified and characterized by the data items in the CELL_SUBSYSTEM category (see Section 3.4.4.1). _cell_subsystems_number gives the number of such subsystems as an independent check of the completeness of the description.
Data items in the CELL_WAVE_VECTOR category specify the wave vectors of the modulation (see Section 3.4.4.1). In accordance with the limits on dimensionality of the current version of the msCIF dictionary, no more than eight additional modulation wave vectors may be specified. The number used must agree with the value of _cell_modulation_dimension.
The data items in the CELL_WAVE_VECTORS category describe the experimental conditions during the determination of the independent modulation wave vectors.
The categories describing data collection are as follows:
Categories marked with ¶ are already defined in the core CIF dictionary.
New data items in these categories are as follows:
The data items in these categories are straightforward extensions of the core CIF dictionary definitions to the indexing of diffraction intensities by higherdimensional components. The _diffrn_refln_index_m_* items are the additional Miller indices indexing the modulation wave vectors when the diffraction wave vector is written as .
The upper limit of 8 was chosen arbitrarily to limit the number of data items defined in the msCIF dictionary.
_diffrn_reflns_limit_index_m_* items provide independent checks on the range of values recorded for each of the additional Miller indices . _diffrn_standard_refln_index_m_* items allow the higherdimensional Miller indices of standard reflections to be recorded.
The categories describing measurements on the crystal or crystals used in the experiment are as follows:
Categories marked with ¶ are already defined in the core CIF dictionary.
New data items in these categories are as follows:
_exptl_crystal_type_of_structure specifies the structure type as cryst (crystalline), mod (modulated) or comp (composite). These are the only three types of structure handled at present by the msCIF dictionary.
The extensions to the EXPTL_CRYSTAL_FACE category permit the indexing of crystal faces using the higherdimensional Miller indices introduced for aperiodic structures.
The categories related to refinement that have been extended in this dictionary are as follows:
Categories marked with ¶ are already defined in the core CIF dictionary.
New data items in this category are as follows:
During the early stages of the development of the msCIF dictionary, several sets of data items were defined to accommodate the need to specify residual R factors for the different sets of main reflections and satellite reflections. It was then recognized that the binning of reflection classes had more general application, and these new data items were transferred to the core CIF dictionary, where, of course, they are still available for use in an msCIF.
The new items in the REFINE category in the msCIF dictionary are specific to the refinement of modulated structures. _refine_ls_mod_func_description allows a freetext description of the types of modulation present in the structural model and how they are handled. The treatment of hydrogenatom modulation parameters is specified by _refine_ls_mod_hydrogen_treatment. Information on an overall phason correction (the use of which should in general be discouraged) may be given using the _refine_ls_mod_overall_phason_* items.
New data items in these categories are as follows:
As with the _diffrn_refln_* and _diffrn_reflns_* items (Section 3.4.3.1.2), these data names extend the corresponding core data items into the higherdimensional space used in the treatment of modulated structures and composites. They apply to the list of reflections used in the refinement, as distinct from the experimentally collected set of intensities described by the _diffrn_* data items.
The categories relevant to the description of the structural model are as follows:
Categories marked with ¶ are already defined in the core CIF dictionary.
Most of the new categories introduced to the msCIF dictionary appear here, since their function is to describe in great detail the modulation of the atomsite properties. They fall naturally into families describing the modulation of atomic displacement, of site occupation or of thermal parameters.
New data items are added to several categories in the core CIF dictionary that describe molecular or packing geometry. There are also new data items to describe superspacegroup symmetry.
Data items in these categories are as follows:
The bullet () indicates a category key. The arrow () is a reference to a parent data item.
The ATOM_SITEcategory is extended in the msCIF dictionary by the addition of a small number of items that may appear in the main looped list of atomsite information (see Section 3.2.4.1.1 ). The *_flag items indicate whether each individual atom site has been modelled through modulation of atomic displacement, site occupation or thermal parameters. In each case, the default value of the item is no, so that any or all of the flags may be omitted when that particular type of modulation has not been applied to the structural model.
_atom_site_subsystem_code identifies the cell subsystem to which the atom site must be assigned in the description of composite structures. Each value of _atom_site_subsystem_code must match one of the values of _cell_subsystem_code in the overall description of the subsystems defined for a composite.
The ATOM_SITE_PHASON category allow details of an atomdependent phason correction, as implemented in JANA2000, to be given. The use of these phason corrections is discouraged.
Data items in these categories are as follows:
(a) ATOM_SITE_DISPLACE_FOURIER
(b) ATOM_SITE_DISPLACE_FOURIER_PARAM
(c) ATOM_SITE_FOURIER_WAVE_VECTOR
(e) ATOM_SITE_OCC_FOURIER_PARAM
(g) ATOM_SITE_ROT_FOURIER_PARAM
(j) ATOM_SITES_DISPLACE_FOURIER
The bullet () indicates a category key. The arrow () is a reference to a parent data item.
It is common to represent a modulated structure using a reference periodic structure on which are superimposed atomic modulation functions expanded as Fourier series. (A full discussion of this is given in Section 3.4.4.3.) The msCIF dictionary provides separate categories for listing the modulated parameters that apply to atom positions, site occupancies and thermal parameters. The structuring of the data items within each of these categories follows a similar pattern.
For example, consider the modulation of the atomic displacements. The ATOM_SITE_DISPLACE_FOURIER category allows a listing of the axis along which the displacement occurs (*_axis) and the wave vectors contributing to that displacement component (*_wave_vector_seq_id) for each relevant atom site (labelled by _atom_site_displace_Fourier_atom_site_label). *_wave_vector_seq_id is a pointer to the description of the separate modulation wave vectors and must match one of the identifiers _atom_site_Fourier_wave_vector_seq_id listed separately in the ATOM_SITE_FOURIER_WAVE_VECTOR category. Likewise, the *_atom_site_label data item must match a value of _atom_site_label in the main list of atom positions. This is how the modulation is linked to the atom list. The item _atom_site_displace_Fourier_id is the formal key for the ATOM_SITE_DISPLACE_FOURIER category. It is used to locate the matching Fourier coefficients in the ATOM_SITE_DISPLACE_FOURIER_PARAM category. The coefficients may be reported in a sine–cosine ( _atom_site_displace_Fourier_param_sin, *_cos) or modulus–argument (*_mod, *_phase) representation.
Where a group of atoms is treated as a rigid group, the categories above describe only the translational part of the positional distortion. ATOM_SITE_ROT_FOURIER and ATOM_SITE_ROT_FOURIER_PARAM are used to describe the rotational components.
ATOM_SITE_OCC_FOURIER and ATOM_SITE_U_FOURIER, and their associated *_PARAM categories, are the analogous categories for the modulation of site occupation and thermal parameters.
All the categories above describe the properties of individual atom sites. Largerscale descriptions of the displacive modulation or of the rotational component of a rigid group are covered by the categories ATOM_SITES_DISPLACE_FOURIER and ATOM_SITES_ROT_FOURIER, each of which at present contains one descriptive data item.
The ATOM_SITES_MODULATION category contains data items describing the initial phases of the modulation waves, which are essential for determining the space group of the commensurate superstructure. More details are given in the dictionary.
Data items in these categories are as follows:
(a) ATOM_SITE_DISPLACE_SPECIAL_FUNC
(b) ATOM_SITE_OCC_SPECIAL_FUNC
The bullet () indicates a category key. The arrow () is a reference to a parent data item.
Several data items cover modulation functions that are not expressed as Fourier expansions. The examples in the current msCIF dictionary are restricted to the onedimensional modulations (sawtooth displacive and occupational crenel functions) implemented in the program JANA2000 (see Section 3.4.2).
New data items in these categories are as follows:
For each of the geometry categories, there are two groups of extensions. One set covers maximum, minimum and average values of bonds, contact distances, angles and torsion angles. The other extends the symmetryoperation code used in geometry listings in the core CIF dictionary (see Section 3.2.4.3.2 ) to the higherdimensional superspace form.
New data items in these categories are as follows:
At present, the msCIF dictionary extends the core CIF dictionary symmetry categories to describe superspace groups for onedimensional modulated structures in four ways: as the superspacegroup number in Janssen et al. (2004) ( _space_group_ssg_IT_number), as the International Tables superspacegroup symbol (*_ssg_name_IT), as one of the notations from de Wolff et al. (1981) (*_ssg_name_WJJ, *_ssg_WJJ_code), or in some other formalism (*_ssg_name). At present, superspacegroup names for higher dimensions can only be indicated using _space_group_ssg_name.
Symmetry operations in the superspace group are specified in the SPACE_GROUP_SYMOP category by an obvious extension to the method used in the core dictionary. These items must always be present in a CIF corresponding to a modulated or composite structure.
The categories modified in the msCIF dictionary to formalize the construction of a multiblock description of modulated or composite structures are as follows:
Categories marked with ¶ are already defined in the core CIF dictionary.
Data items revised in these categories are as follows:
The core dictionary definitions of these items are revised in order to formalize the relationships between multiple data blocks representing reference and modulated structures. Guidance is provided in the msCIF dictionary on how to label data blocks in a way that makes their mutual relationships clear.
References
Janssen, T., Janner, A., LooijengaVos, A. & de Wolff, P. M. (2004). Incommensurate and commensurate modulated structures. International Tables for Crystallography, Volume C, Mathematical, chemical and physical tables, 3rd ed., edited by E. Prince, ch. 9.8. Dordrecht: Kluwer Academic Publishers.Google ScholarWolff, P. M. de, Janssen, T. & Janner, A. (1981). The superspace groups for incommensurate crystal structures with a onedimensional modulation. Acta Cryst. A37, 625–636.Google Scholar