International
Tables for Crystallography Volume F Crystallography of biological macromolecules Edited by M. G. Rossmann and E. Arnold © International Union of Crystallography 2006 |
International Tables for Crystallography (2006). Vol. F. ch. 17.1, pp. 353-356
https://doi.org/10.1107/97809553602060000691 Chapter 17.1. Around O
aDepartment of Cell and Molecular Biology, Uppsala University, Biomedical Centre, Box 596, SE-751 24 Uppsala, Sweden, and bInstitute of Molecular and Structural Biology, University of Aarhus, Gustav Wieds Vej 10c, DK-8000 Aarhus C, Denmark The O program system and related programs are described. Topics covered include: RAVE, which is a suite of programs for electron-density map improvement and analysis; programs that interface with O for the analysis of protein models; utility programs; and internet-based services related to O and the associated programs. Keywords: O ; RAVE; electron-density averaging; model building; structure analysis. |
The first protein structures to be solved were built as wire models. It was only at the end of the 1970s and the beginning of the 1980s that the necessary hardware and software became available to allow crystallographers to construct their models using computers. The first commercially available computer graphics systems were very expensive, and by today's standards rather primitive in their line-drawing capabilities. They were usually controlled by mini-computers loaded with 64k words of memory and removable disks capable of holding 1 Mbyte of data. The limited amount of addressable memory was a severe limitation in software production. Furthermore, each computer graphics system had its own graphics programming library that was totally incompatible with those of other systems. Despite these limitations, the benefits of using computer-graphics-based systems became apparent and they were fairly rapidly adopted by the laboratories that could afford them. The main benefits were not in the construction of the initial model, but rather as a tool in crystallographic refinement (Jones, 1978). Making small manual changes to a model being refined was difficult and time-consuming work, but rather easy to accomplish even with low-powered graphics systems.
The most widely used program of the early days, Frodo, was available on most of the contemporary computer graphics systems. A major step forward occurred with the development of laboratory-scale 32-bit computers with virtual memory operating systems. In particular, the first Digital Equipment VAX models rapidly became the machines of choice in crystallographic laboratories. This allowed the contemplation of real-time improvements in models under construction (Jones, 1982; Jones & Liljas, 1984). Soon afterwards, colour became available in commercial graphics systems. This was much more than a cosmetic enhancement, since colour could be used to convey information vital to the crystallographer, such as main-chain/side-chain status codes for skeletonized electron density (Jones & Thirup, 1986). Unfortunately, there was still no common graphics programming standard, and moving to a new graphics system was a major effort (Pflugrath et al., 1984).
The next major advance in hardware occurred with the development of the workstation, combining the computer and graphics in one package. Although pioneered by Sun, the major player in the crystallographic community was a small Californian company, Silicon Graphics, which rapidly became large. Running the Unix operating system, workstations flourished, but still lacked a graphics environment that was portable between different hardware platforms. This changed when OpenGL was adopted as an industry standard. At the same time, prices stabilized and began to drop in terms of price/performance. Only in the late 1990s have price/performance indicators plummeted with the arrival of PC/graphics-board combinations capable of meeting the expectations of the current generation of crystallographers. The crystallographic workstation on every desk has finally arrived.
O was designed by Alwyn Jones to overcome some of the drawbacks associated with using Frodo. These problems had arisen because of the history of the program. In particular, O was designed to use a general-purpose memory allocation system to store any kind of model-related data. This would allow the display of any number of molecules and the use of databases for modelling. Although the latter had been introduced in a Frodo variant (Jones & Thirup, 1986), O was designed to take the concept of database use further, some would say to the extreme. Furthermore, O was designed to make it easier for different developers to work on the code without interfering with each other. In the event, only Morten Kjeldgaard and Jin-Yu Zou made any developments with the program.
Much of the data used by O are kept in a memory allocation system, the O database. This database is used to save parameters used by the program (including such things as keywords), macromolecular coordinates and information derived from them (such as graphical objects). As such, the program has no built-in limitations concerning what can be saved and used. A set of coordinates can be downloaded from the Protein Data Bank (PDB) (Bernstein et al., 1977) and stored as a series of vectors that describe the sequence, the residue names, the coordinates, the atom names, the unit cell etc. Some of these vectors contain residue-related data (e.g. the sequence), others contain atomic data (e.g. the atomic temperature factors), while yet others concern the molecule as a whole. The program therefore uses a strict naming convention in handling these data. Each molecule has a name, and the program then forces its own nomenclature for the standard atomic, residue and molecular properties. The user remains free to create new data outside O, bring them into the program by adopting the naming convention, and then make use of them to generate or manipulate graphical images. For example, a series of amino-acid sequences can be aligned with a computer program outside O and information on the degree of sequence conservation can be generated as a series of O data blocks. These can then be read into O and used to colour a Cα trace of a model, for example.
The program also has a strong macro capability that can be used to configure quite complex interactive tasks. It can also be used by a programmer outside O to generate data and a series of instructions for later interactive use.
Similarly, data generated within O can be exported to O-aware programs, significantly reducing the complexity of some crystallographic calculations. For example, real-space averaging of electron-density maps requires, as a minimum, both a series of operators describing the noncrystallographic symmetry (NCS) and a mask. These can be generated from scratch in O and improved and used by O-aware programs without the crystallographer needing to be concerned about the myriad details of axis definitions, rotations and translations.
Plotting is carried out within the O system via an intermediate metafile. When a user creates objects for display within O, calls are made to a set of low-level routines that create the OpenGL instructions on the workstation. Some objects are described in their entirety within the O database, but others are not. Molecular objects fall into the former category, whereas electron-density maps fall into the latter. There are two sets of plot commands, therefore, that are appropriate for each class. To plot an object made from a molecule, the user merely issues a command, and the appropriate metafile is written out, complete with viewing data. To plot other things, the user activates the command and then starts creating objects. Every time a low-level graphics routine is called, something gets written into the metafile. This is terminated with a command. The metafile contains much extraneous data, for example, instructions to the O pulldown menu system. However, it is built up from objects that are arranged in a hierarchy, where the highest-level object is called . Some objects, therefore, call instances of others, while other objects contain graphics instructions that define line start and end points, for example.
This metafile can be processed, and so far three different programs are available. OPLOT (written by Morten Kjeldgaard) generates PostScript output, carrying out a full traversal of the object hierarchy. The other two programs do not carry out such a traversal, but merely process the objects specified by the user. One (written by Mark Harris and Alwyn Jones) generates output suitable for input to the ray-tracing program PovRay (see http://www.povray.org ). The third program (written by Martin Berg) generates VRML output suitable for web-based viewing.
O is in continuous development, and interested readers are encouraged to visit the various internet sites that we maintain. There they will find detailed descriptions of the O command set, as well as various introductory exercises for learning how to use the program. The following publications describe various aspects of O-related features and methods:
RAVE is a suite of programs for electron-density map improvement and analysis, with a strong focus on averaging techniques (Kleywegt & Read, 1997). It is the successor of an older package (`A') (Jones, 1992), and at present it contains tools for single and multiple crystal form, single- and multiple-domain NCS averaging of electron-density maps, and the detection of structural units in such maps. The package works in conjunction with the CCP4 suite of programs (Collaborative Computational Project, Number 4, 1994).
RAVE contains the following programs for density averaging involving one crystal form:
RAVE also contains tools for averaging between different crystal forms, namely:
More recently, RAVE has been expanded to include tools that can be of use in map interpretation:
Finally, RAVE also contains three utility programs that can be used to manipulate three essential data structures encountered in averaging, map interpretation and refinement:
A number of programs are available for the analysis of protein models. Most of these programs are interfaced with O, producing files (such as maps and O macros) that allow for quick and easy visualization and inspection of their results.
Many utility programs are available from Uppsala, most of them aimed at practising crystallographers. Some of these (MAMA, MAPMAN, DATAMAN) have been discussed in Section 17.1.3. A few others are discussed below.
The following is an overview of internet-based services related to O and the associated programs.
|
Acknowledgements
This work has been supported by the the Swedish Natural Sciences Research Council (NFR), Uppsala University, the EU-funded 3-D Validation Network, the Swedish Foundation for Strategic Research (SSF) and its Structural Biology Network (SBNet).
References
Allen, F. H. & Johnson, O. (1991). Automated conformational analysis from crystallographic data. 4. Statistical descriptors for a distribution of torsion angles. Acta Cryst. B47, 62–67.Google ScholarBairoch, A. & Apweiler, R. (1997). The SWISS-PROT protein sequence data bank and its supplement TrEMBL. Nucleic Acids Res. 25, 31–36.Google Scholar
Bairoch, A. & Bucher, P. (1994). PROSITE: recent developments. Nucleic Acids Res. 22, 3583–3589.Google Scholar
Bernstein, F. C., Koetzle, T. F., Williams, G. J. B., Meyer, E. F. Jr, Brice, M. D., Rodgers, J. R., Kennard, O., Shimanouchi, T. & Tasumi, M. (1977). The Protein Data Bank: a computer-based archival file for macromolecular structures. J. Mol. Biol. 112, 535–542.Google Scholar
Brünger, A. T. (1992a). Free R value: a novel statistical quantity for assessing the accuracy of crystal structures. Nature (London), 355, 472–475.Google Scholar
Brünger, A. T. (1992b). X-PLOR: a system for crystallography and NMR. Yale University, New Haven, CT, USA.Google Scholar
Brünger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J.-S., Kuszewski, J., Nilges, M., Pannu, N. S., Read, R. J., Rice, L. M., Simonson, T. & Warren, G. L. (1998). Crystallography & NMR System: a new software suite for macromolecular structure determination. Acta Cryst. D54, 905–921.Google Scholar
Collaborative Computational Project, Number 4 (1994). The CCP4 suite: programs for protein crystallography. Acta Cryst. D50, 760–763.Google Scholar
Fan, C., Moews, P. C., Shi, Y., Walsh, C. T. & Knox, J. R. (1995). A common fold for peptide synthetases cleaving ATP to ADP: glutathione synthetase and D-alanine:D-alanine ligase of Escherichia coli. Proc. Natl Acad. Sci. USA, 92, 1172–1176.Google Scholar
Fan, C., Moews, P. C., Walsh, C. T. & Knox, J. R. (1994). Vancomycin resistance: structure of D-alanine:D-alanine ligase at 2.3 Å resolution. Science, 266, 439–443.Google Scholar
Gribskov, M., McLachlan, A. D. & Eisenberg, D. (1987). Profile analysis: detection of distantly related proteins. Proc. Natl Acad. Sci. USA, 84, 4355–4358.Google Scholar
Gribskov, M. & Veretnik, S. (1996). Identification of sequence patterns with profile analysis. Methods Enzymol. 266, 198–212.Google Scholar
Jones, T. A. (1978). A graphics model building and refinement system for macromolecules. J. Appl. Cryst. 11, 268–272.Google Scholar
Jones, T. A. (1982). FRODO: a graphics fitting program for macromolecules. In Computational crystallography, edited by D. Sayre, pp. 303–317. Oxford: Clarendon Press.Google Scholar
Jones, T. A. (1992). A, yaap, asap, @#*? A set of averaging programs. In Molecular replacement, edited by E. J. Dodson, S. Glover & W. Wolf, pp. 91–105. Warrington: Daresbury Laboratory.Google Scholar
Jones, T. A. & Kjeldgaard, M. (1997). Electron density map interpretation. Methods Enzymol. 277, 173–208.Google Scholar
Jones, T. A. & Kleywegt, G. J. (2001). New tools for the interpretation of macromolecular electron-density maps. In preparation.Google Scholar
Jones, T. A. & Liljas, L. (1984). Crystallographic refinement of macromolecules having noncrystallographic symmetry. Acta Cryst. A40, 50–57.Google Scholar
Jones, T. A. & Thirup, S. (1986). Using known substructures in protein model building and crystallography. EMBO J. 5, 819–822.Google Scholar
Jones, T. A., Zou, J.-Y., Cowan, S. W. & Kjeldgaard, M. (1991). Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Cryst. A47, 110–119.Google Scholar
Kleywegt, G. J. (1996). Use of non-crystallographic symmetry in protein structure refinement. Acta Cryst. D52, 842–857.Google Scholar
Kleywegt, G. J. (1997). Validation of protein models from Cα coordinates alone. J. Mol. Biol. 273, 371–376.Google Scholar
Kleywegt, G. J. (1999a). Recognition of spatial motifs in protein structures. J. Mol. Biol. 285, 1887–1897.Google Scholar
Kleywegt, G. J. (1999b). Experimental assessment of differences between related protein crystal structures. Acta Cryst. D55, 1878–1884.Google Scholar
Kleywegt, G. J. & Brünger, A. T. (1996). Checking your imagination: applications of the free R value. Structure, 4, 897–904.Google Scholar
Kleywegt, G. J. & Jones, T. A. (1994a). Detection, delineation, measurement and display of cavities in macromolecular structures. Acta Cryst. D50, 178–185.Google Scholar
Kleywegt, G. J. & Jones, T. A. (1994b). Halloween … masks and bones. In From first map to final model, edited by S. Bailey, R. Hubbard & D. A. Waller, pp. 59–66. Warrington: Daresbury Laboratory.Google Scholar
Kleywegt, G. J. & Jones, T. A. (1996a). xdlMAPMAN and xdlDATAMAN – programs for reformatting, analysis and manipulation of biomacromolecular electron-density maps and reflection data sets. Acta Cryst. D52, 826–828.Google Scholar
Kleywegt, G. J. & Jones, T. A. (1996b). Efficient rebuilding of protein structures. Acta Cryst. D52, 829–832.Google Scholar
Kleywegt, G. J. & Jones, T. A. (1996c). Phi/Psi-chology: Ramachandran revisited. Structure, 4, 1395–1400.Google Scholar
Kleywegt, G. J. & Jones, T. A. (1997a). Template convolution to enhance or detect structural features in macromolecular electron-density maps. Acta Cryst. D53, 179–185.Google Scholar
Kleywegt, G. J. & Jones, T. A. (1997b). Detecting folding motifs and similarities in protein structures. Methods Enzymol. 277, 525–545.Google Scholar
Kleywegt, G. J. & Jones, T. A. (1998). Databases in protein crystallography. Acta Cryst. D54, 1119–1131.Google Scholar
Kleywegt, G. J. & Jones, T. A. (1999). Software for handling macromolecular envelopes. Acta Cryst. D55, 941–944.Google Scholar
Kleywegt, G. J. & Read, R. J. (1997). Not your average density. Structure, 5, 1557–1569.Google Scholar
Korn, A. P. & Rose, D. R. (1994). Torsion angle differences as a means of pinpointing local polypeptide chain trajectory changes for identical proteins in different conformational states. Protein Eng. 7, 961–967.Google Scholar
Matte, A., Goldie, H., Sweet, R. M. & Delbaere, L. T. J. (1996). Crystal structure of Escherichia coli phosphoenolpyruvate carboxykinase: a new structural family with the P-loop nucleoside triphosphate hydrolase fold. J. Mol. Biol. 256, 126–143.Google Scholar
Mowbray, S. L., Helgstrand, C., Sigrell, J. A., Cameron, A. D. & Jones, T. A. (1999). Errors and reproducibility in electron-density map interpretation. Acta Cryst. D55, 1309–1319.Google Scholar
Pflugrath, J. W., Saper, M. A. & Quiocho, F. A. (1984). New generation graphics system for molecular modeling. In Methods and applications in crystallographic computing, edited by S. R. Hall & T. Ashida, pp. 404–407. Oxford: Clarendon Press.Google Scholar
Qiu, X., Verlinde, C. L. M. J., Zhang, S., Schmitt, M. P., Holmes, R. K. & Hol, W. G. J. (1995). Three-dimensional structure of the diphtheria toxin repressor in complex with divalent cation co-repressors. Structure, 3, 87–100.Google Scholar
Schirmer, T., Keller, T. A., Wang, Y. F. & Rosenbusch, J. P. (1995). Structural basis for sugar translocation through maltoporin channels at 3.1 Å resolution. Science, 267, 512–514.Google Scholar
Tronrud, D. E., Ten Eyck, L. F. & Matthews, B. W. (1987). An efficient general-purpose least-squares refinement program for macromolecular structures. Acta Cryst. A43, 489–501.Google Scholar
Vellieux, F. M. D. A. P., Hunt, J. F., Roy, S. & Read, R. J. (1995). DEMON/ANGEL: a suite of programs to carry out density modification. J. Appl. Cryst. 28, 347–351.Google Scholar
Zou, J. Y. & Jones, T. A. (1996). Towards the automatic interpretation of macromolecular electron-density maps: qualitative and quantitative matching of protein sequence to map. Acta Cryst. D52, 833–841.Google Scholar
Zou, J.-Y. & Mowbray, S. L. (1994). An evaluation of the use of databases in protein structure refinement. Acta Cryst. D50, 237–249.Google Scholar