International
Tables for
Crystallography
Volume F
Crystallography of biological macromolecules
Edited by M. G. Rossmann and E. Arnold

International Tables for Crystallography (2006). Vol. F. ch. 23.3, pp. 603-607   | 1 | 2 |

Section 23.3.4.1. Sequence-dependent deformability

R. E. Dickersona*

a Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA 90095–1570, USA
Correspondence e-mail: red@mbi.ucla.edu

23.3.4.1. Sequence-dependent deformability

| top | pdf |

23.3.4.1.1. Minor groove width

| top | pdf |

The simplest and first-noticed sequence-dependent deformability of the B-DNA duplex was variation in minor groove width. The first B-DNA oligomer to be solved, C-G-C-G-A-A-T-T-C-G-C-G (B1–B6), had a narrow minor groove in the central A-A-T-T region, with only ca 3.5 Å of free space between opposing phosphates and sugar rings. (It has become conventional to define the free space between phosphates as the measured minimal P–P separation across the groove, less 5.8 Å to represent two phosphate-group radii. Similarly, the measured distance between sugar oxygens is decreased by 2.8 Å, representing two oxygen van der Waals radii.) The C-G-C-G ends of the helix had the 6–7 Å opening expected for ideal B-DNA, but the situation was clouded, because the outermost two base pairs at each end of the helix interlocked minor grooves with neighbours in the crystal. Hence, the wider ends could possibly be only an artifact of crystal packing.

After 1991, the situation was clarified by the structures of several decamers [Table A23.3.1.2[link], Part I(c)], which stack on top of one another without the interlocking of grooves. The normal minor groove opening is ca 7 Å. Regions of four or more AT base pairs can exhibit a significantly narrowed minor groove, although such narrowing is not mandatory. This behaviour is seen with the B-DNA decamer, C-A-A-A-G-A-A-A-A-G, in Fig. 23.3.4.1[link]. The narrowing arises mainly from the larger allowable propeller twist in AT base pairs, which displaces C1′ atoms at opposite ends of the pair in different directions, and moves the backbone chains in such a way as to partially close the groove (Fig. 23.3.4.2)[link].

[Figure 23.3.4.1]

Figure 23.3.4.1| top | pdf |

Structure of C-A-A-A-G-A-A-A-A-G (B107). The lower half of the helix, with -A-A-A-A-G, exhibits the narrow minor groove commonly associated with the AT region of the helix and a single zigzag spine of hydration, as was first seen in C-G-C-G-A-A-T-T-C-G-C-G (B1–B6). The upper half, with C-A-A-A-G-, has the wider minor groove of general-sequence B-DNA and two separate rows of hydrating water molecules along the two walls of the wider groove.

[Figure 23.3.4.2]

Figure 23.3.4.2| top | pdf |

Relationship between minor groove width and propeller twist. (a) View into the minor groove of B-DNA, with base pairs seen on edge and with the sugar–phosphate backbones shown schematically as inclined ladder uprights. (b) Consequences of propeller twisting the base pairs. Glycosyl bonds connected to sugar C1′ atoms are all displaced upward in the right strand and downward in the left strand. This shifts the backbone chains as indicated by the arrows. Hence, the gap between the chains is decreased, and the minor groove is narrowed.

This is an excellent example of the concept of sequence-dependent helix deformability, rather than simple deformation. The two hydrogen bonds of an AT base pair allow a larger propeller twist but do not require it. Hence, AT regions of helix permit a narrowing of the minor groove but do not demand it. Indeed, this lesson was brought home in the most dramatic way when Pelton & Wemmer (1989[link], 1990[link]) showed via NMR that a 2:1 complex of distamycin with C-G-C-A-A-A-T-T-G-G-C or C-G-C-A-A-A-T-T-T-G-C-G could exist, in which two drug molecules sat side-by-side within an enlarged central minor groove. Fig. 23.3.4.3[link] shows a narrow minor groove with a single netropsin molecule, and Fig. 23.3.4.4[link] shows a wide minor groove enclosing two di-imidazole lexitropsins side-by-side. In summary, an AT-rich region of minor groove is capable of narrowing but is not inevitably narrow, in contrast to GC-rich regions where the third hydrogen bond tends to keep the base pairs flat and the minor groove wide. The AT minor groove is potentially deformable without being inevitably deformed.

[Figure 23.3.4.3]

Figure 23.3.4.3| top | pdf |

Structure of the 1:1 complex of netropsin with C-G-C-G-A-A-T-T-C-G-C-G (B11, B12, B87). The drug binds to the central -A-A-T-T- region of the minor groove, which is barely wide enough to enclose the nearly planar polyamide molecule. The netropsin structure can be represented by [ ^{+}(\hbox{NH}_{2})_{2}\hbox{C---NH---CH}_{2}\hbox{---CONH---Py---CONH---Py---CONH---CH}_{2}\hbox{---CH}_{2}\hbox{---C}(\hbox{NH}_{2})_{2}^{+}] where Py is a five-membered methylpyrrole ring. An even more compact representation, useful when comparing other polyamide netropsin analogues or lexitropsins, is +=Py=Py=+, where the common cationic tails are indicated only by a plus sign, and = represents a —CONH— amide.

[Figure 23.3.4.4]

Figure 23.3.4.4| top | pdf |

Structure of the 2:1 complex of a di-imidazole lexitropsin with C-A-T-G-G-C-C-A-T-G (B108). The drug now is represented by [\hbox{H---CONH---Im---CONH---Im---CONH---CH}_{2}\hbox{---CH}_{2}\hbox{---C}(\hbox{NH}_{2})_{2}^{+}] where Im is a five-membered imidazole ring, or again more compactly by 0=Im=Im=+. The uncharged leading amide group, characteristic of distamycins, is identified by 0. Distamycin itself would be represented in this shorthand notation by 0=Py=Py=Py=+. Reprinted from B108, copyright (1977), with permission from Excerpta Medica Inc.

23.3.4.1.2. Helix bending

| top | pdf |

Sequence-dependent bendability has been reviewed recently by Dickerson (1988a[link],b[link],c[link]) and Dickerson & Chiu (1997)[link]. The relative bendability of different regions of B-DNA sequence is an important aspect of recognition, one that is used by countless control proteins that must bind to a particular region of double helix. Catabolite activator protein or CAP (Schultz et al., 1991[link]; Parkinson et al., 1996[link]), lacI (Lewis et al., 1996[link]) and purR (Schumacher et al., 1994[link]) repressors, γδ-resolvase (Yang & Steitz, 1995[link]), EcoRV restriction enzyme (Winkler et al., 1993[link]; Kostrewa & Winkler, 1995[link]), integration host factor or IHF (Rice et al., 1996[link]), and TBP or TATA-binding protein (Kim, Gerger et al., 1993[link]; Kim, Nikolov & Burley, 1993[link]; Nikolov et al., 1996[link]; Juo et al., 1996[link]) are all sequence-specific DNA-binding proteins that bend or deform the nucleic acid duplex severely during the recognition process. IHF in Fig. 23.3.4.5[link] may be taken as representative of this class of DNA-binding proteins. The bend is produced by two localized rolls of ca 60° in a direction compressing the major groove and are additive, because they are spaced nine base pairs, or roughly one turn of helix, apart. In IHF, the two helix segments flanking the bend should be straight and unbent, and this is accomplished in one segment via a six-adenine A-tract: -C-A-A-A-A-A-A-G-.

[Figure 23.3.4.5]

Figure 23.3.4.5| top | pdf |

DNA duplex (red and blue strands) looped around IHF or integration host factor. The two subunits of the IHF duplex are green and turquoise. Two antiparallel loops of protein chain, one from each subunit, insert into the minor groove of B-DNA at the sequence C-A-A-T/A-T-T-G and produce abrupt bends via local roll angles of 60°. The two localized bends are additive because they occur one helical turn apart. All other steps have roll angles of 5° or less. The two flanking helix segments pack against the IHF dimer and must be kept straight and unbent. This is accomplished in one of the two segments by an A-tract of sequence C-A-A-A-A-A-A-G. From Dickerson & Chiu (1997)[link]. Coordinates courtesy of P. Rice.

The bending locus in IHF is C-A-A-T/A-T-T-G . It is C-G in lacI and purR repressors (Fig. 23.3.4.6)[link], C-A = T-G in CAP (Fig. 10 of Dickerson, 1998b[link]), and T-A in EcoRV, γδ-resolvase and TBP (Fig. 23.3.4.7)[link]. Pyrimidine-purine or Y-R steps appear to be especially suitable loci for roll bending. The dashed lines in Figs. 23.3.4.6[link] and 23.3.4.7[link] plot tilt, and demonstrate its insignificance in bending, compared with roll. (This is intuitively obvious. Imagine yourself standing near a tall stack of wooden planks in a lumberyard during an earthquake. Where would you prefer to stand: alongside the stack, or at one end?)

[Figure 23.3.4.6]

Figure 23.3.4.6| top | pdf |

Roll-angle plots for sequence-specific DNA–protein complexes with lacI (top) and purR (bottom). In each case, bending occurs via localized roll at a C-G step. Other steps of the sequence have random rolls of ca 10° or less. Note that, as with IHF, A-tracts are especially straight and unbent. Dashed lines in the lacI plot demonstrate the unimportance of tilt in production of helix bending.

[Figure 23.3.4.7]

Figure 23.3.4.7| top | pdf |

Bending via roll at T-A steps in TBP or the TATA-binding protein (top) and in γδ-resolvase (bottom). Note that not every T-A step in TBP or γδ-resolvase is necessarily bent. Note also in γδ-resolvase that C-A = T-G steps, which in proteins such as CAP are used to generate sharp roll bends, here, frequently, are local roll maxima, even though they contribute little to the overall bending. They have a bending potential that is not used in this particular setting.

In summary, bending of the B-DNA helix nearly always involves roll, not tilt. The easier direction of bending is that which compresses the broad major groove, although examples of roll compression of the minor groove are known. Y-R steps are especially prone to roll bending. Again, the phenomenon is one of sequence-induced bendability, not mandatory bending. No one imagines that the IHF binding sequence of Fig. 23.3.4.5[link] is permanently kinked at its two C-A-A-T/A-T-T-G steps, wandering deformed through the nucleus, looking for an IHF molecule to bind to. Instead, this sequence has a potential bendability that other sequences, such as A-A-A-A-A-A, lack.

Table 23.3.4.1[link] summarizes the observed behaviour of Y-R, R-R and R-Y steps from a great many X-ray crystal structure analyses, with and without bound DNA. In the present context, these rules are termed the `Major Canon', since they are well established and generally well understood. Some understanding of the proneness of Y-R steps to bend can be obtained by looking at stereo pairs of two successive base pairs viewed down the helix axis. Fig. 23.3.4.8[link] gives a few representative examples; many more can be found in Figs. 4–6 of Dickerson (1988b)[link] and in the original literature. In brief, Y-R steps, especially C-A and T-A, tend to orient so that polar exocyclic N and O atoms stack against polarizable rings of the other base pair. This is the same type of polar-on-polarizable stacking stabilization mentioned earlier in connection with O4′ and guanine in Z-DNA (Bugg et al., 1971[link]; Thomas et al., 1982[link]; Hunter & Sanders, 1990[link]; B32). Base pairs in T-A steps tend not to slide over one another along their long axes, keeping pyrimidine O2 stacked over the purine five-membered ring (Fig. 23.3.4.8b)[link]. C-A steps can adopt this same stacking, or the base pairs can slide until the pyrimidine O2 sits over the purine six-membered ring instead (Fig. 23.3.4.8a)[link].

Table 23.3.4.1| top | pdf |
Sequence-dependent differential deformability in B-DNA. I. The Major Canon

See Dickerson (1998a[link],b[link],c[link]) and Dickerson & Chiu (1997)[link].

(1) Structural basis for helix bending in B-DNA
Bending is nearly always the result of roll between successive base pairs, seldom tilt.
Positive roll, compressing the wide major groove, is more common than negative roll, in which the narrower minor groove is compressed.
Observed bends in B-DNA are of three main types: (a) localized kinks (large positive roll at one or two discrete base steps), (b) three-dimensional writhe (positive roll at a series of successive steps), or (c) smooth curvature (alternation of positive and negative roll every half turn, with side-to-side zigzagging at intermediate positions). (a) and (b) are easier to accomplish than (c), and hence are more common.
Local writhe in a DNA helix produces macroscopic curvature only when the extent of writhe does not match the natural rotational periodicity of the helix. Endless writhe results in a straight helix, and indeed A-DNA can be regarded as a continuously writhed variant of the B form. Conversely, the bending effect of writhe can be amplified if it is repeated with the periodicity of the helix itself – that is, repeated alternation of writhed and unwrithed segments every ten base pairs, as with A-tract B-DNA.
 
(2) Pyrimidine-purine (Y-R) steps: C-A = T-G, T-A and C-G
Little ring–ring stacking overlap.
Polar N or O stacked over polarizable aromatic rings.
Y-R steps are natural fracture points for the helix. They can show (but are not required in every case to show) large twist and slide deformations, and bending mainly via positive roll, compressing the major groove.
 
(3) Purine-purine (R-R) steps: A-A = T-T, A-G = C-T, G-A = T-C and G-G = C-C
Extensive ring–ring overlap.
Base pairs tend to pivot about stacked purines as a hinge, with greater ring–ring separation at pyrimidine ends.
Tight stacking, with only minor roll, slide and twist deformations.
 
(4) Purine-pyrimidine (R-Y) steps: A-C = G-T, A-T and C-G
Behaviour in general like R-R steps, with extensive ring–ring overlap and tight stacking, with again only minor roll, slide and twist deformations.
 
(5) A-A and A-T steps, as contrasted with T-A
Especially resistant to roll bending, probably because of sawhorse interlocking of highly propellered base pairs, supplemented by inter-base-pair hydrogen bonds within grooves. In contrast, T-A is particularly weak and subject to roll bending.
A-tracts, defined as four or more consecutive AT base pairs without the disruptive T-A step, are especially straight and resistant to bending. Natural selection has apparently chosen short A-tracts for regions of protein–DNA contacts where bending is not wanted.
[Figure 23.3.4.8]

Figure 23.3.4.8| top | pdf |

Representative base-pair steps from B-DNA single-crystal X-ray analyses. (a) Pyrimidine-purine C-A step from C-C-A-A-G-A-T-T-G-G (B22, B46) (roll/slide/twist = −7.4°/2.6 Å/49.9°). Note the lack of ring-on-ring stacking, replaced by the stacking of pyrimidine O2 and purine N6 or O6, on aromatic rings of the adjacent base pair. This stacking opens up the twist angle to an unusual 50°. Note also the large +2.6 Å slide, which positions pyrimidine O2 over the six-membered rings of the neighbouring purines. (b) Pyrimidine-purine T-A step from C-G-A-T-A-T-A-T-C-G (B62) (roll/side/twist = 3.8°/−0.2 Å /39.5°). The stacking is similar to C-A, except that a near-zero slide positions pyrimidine O2 over the five-membered rings of purines. (c) Purine-purine A-A step from C-C-A-A-C-G-T-T-G-G (B46, B50) (roll/slide/twist = 8.8°/0.5 Å /28.7°). Ring-on-ring overlap now predominates, with consequently lowered twist angle and essentially zero slide. Note that purines are more extensively stacked than pyrimidines, which appear to be approaching the O2-on-ring stacking of Y-R steps. (d) Purine-pyrimidine A-T step from C-G-A-T-A-T-A-T-C-G (B62) (roll/slide/twist = 5.2°/0.0 Å/25.2°). Ring-on-ring stacking again lowers the twist angle and keeps slide around zero. Now there is no stacking of exocyclic N or O on neighbouring rings.

Purine-purine or R-R steps behave quite differently (Fig. 23.3.4.8c)[link]. They stack ring-on-ring, usually with greater overlap on the purine end than the pyrimidine. The net effect is that the pivot appears to pass through or near the purines, while pyrimidines at the other end of the pairs stack O2-on-ring as with Y-R steps. R-Y steps tend to stack ring-on-ring, with little contribution from exocyclic atoms.

El Hassan & Calladine (1997)[link] have recently examined roll, slide and twist behaviour at 400 different steps observed in crystal structures of 24 A- and 36 B-DNA oligomers. The author has carried out a similar analysis of 1137 steps from 86 sequence-specific protein–DNA complexes (Dickerson, 1998a[link],c[link]; Dickerson & Chiu, 1997[link]). A striking feature is that trends in local parameters are just the same in DNA crystals and in protein–DNA complexes. The frequently invoked nightmare of `crystal packing deformations' appears to be of only minor significance. In both studies (El Hassan & Calladine, 1997[link]; Dickerson, 1998b[link]), roll versus slide, slide versus twist and twist versus roll plots are presented for all ten possible base-pair steps. Fig. 23.3.4.9[link] illustrates roll versus slide plots for two Y-R, two R-R and two R-Y steps.

[Figure 23.3.4.9]

Figure 23.3.4.9| top | pdf |

Slide versus roll plots for six of the ten possible base-pair steps. Data points are from 971 steps in crystal structure analyses of 63 sequence-specific protein–DNA complexes. A complete set of 30 plots for slide/roll, twist/roll and slide/twist at all ten steps is to be found in Dickerson (1998b)[link], and equivalent plots for DNA alone are given by El Hassan & Calladine (1997)[link]. Y-R steps exhibit a broad range of roll, slide and twist values, with roughly linear correlations between pairs of variables. Points for A-A and other R-R steps cluster tightly around the origin, showing little tendency toward roll bending. Curiously enough, points for R-Y steps tend to favour negative values of slide and twist, and, hence, to concentrate in the lower left quadrant of a slide/tilt plot.

Table 23.3.4.2[link] summarizes observations from these roll/slide/twist plots. These are labelled the `Minor Canon' since they are recent, approximate and not well understood. However, they provide goals for future investigations of helix behaviour.

Table 23.3.4.2| top | pdf |
Sequence-dependent differential deformability in B-DNA. II. The Minor Canon

These generalizations are illustrated by Fig. 23.3.4.9[link], and are justified at greater length by El Hassan & Calladine (1997)[link] and Dickerson (1998a[link],b[link],c[link]).

(6) Heterogeneous steps ending in A: C-A, T-A and G-A
Steps ending in adenine, aside from A-A, tend to display (a) negative correlation between slide and roll, and between twist and roll, and (b) positive correlation between slide and twist.
 
(7) Purine-pyrimidine steps
R-Y steps display, on average, a systematic preference for negative slide and for twist below 36°.
 
(8) Relative step frequencies in sequence-specific protein–DNA complexes
Step A-A is the most common of all, and in 55% of the cases it occurs within A-tracts.
Steps containing only GC base pairs are least common, and seemingly are less compatible with formation of sequence-specific protein complexes.
 
(9) Local environment and DNA behaviour
Sequence-dependent local helix deformations are quite similar in DNA crystals and in protein–DNA complexes. DNA molecules packed against proteins in their normal biological environment appear to have more in common with DNA packed against other DNA helices in the crystal than with free DNA in solution.

References

First citation Bugg, C. E., Thomas, J. M., Sundaralingam, M. & Rao, S. T. (1971). Stereochemistry of nucleic acids and their constituents. X. Solid-state base-stacking patterns in nucleic acid consituents and polynucleotides. Biopolymers, 10, 175–219.Google Scholar
First citation Dickerson, R. E. (1998a). Sequence-dependent B-DNA conformation in crystals and in protein complexes. In Structure, motion, interaction and expression of biological macromolecules, edited by R. H. Sarma & M. H. Sarma, pp. 17–36. New York: Adenine Press.Google Scholar
First citation Dickerson, R. E. (1998b). Helix structure and molecular recognition by B-DNA. In Oxford handbook of nucleic acid structure, edited by S. Neidle, ch. 7, pp. 145–197. Oxford University Press.Google Scholar
First citation Dickerson, R. E. (1998c). DNA bending: the prevalence of kinkiness and the virtues of normality. Nucleic Acids Res. 26, 1906–1926.Google Scholar
First citation Dickerson, R. E. & Chiu, T. K. (1997). Helix bending as a factor in protein/DNA recognition. Biopolymers Nucleic Acid Sci. 44, 361–403.Google Scholar
First citation El Hassan, M. A. & Calladine, C. R. (1997). Conformational characteristics of DNA: empirical classifications and a hypothesis for the conformational behaviour of dinucleotide steps. Philos. Trans. R. Soc. London A, 355, 43–100.Google Scholar
First citation Hunter, C. A. & Sanders, J. K. M. (1990). The nature of π–π interactions. J. Am. Chem. Soc. 112, 5525–5534.Google Scholar
First citation Juo, Z. S., Chiu, T. K., Leiberman, P. M., Baikalov, I., Berk, A. J. & Dickerson, R. E. (1996). How proteins recognize the TATA box. J. Mol. Biol. 261, 239–254.Google Scholar
First citation Kim, J. L., Nikolov, D. B. & Burley, S. K. (1993). Co-crystal structure of TBP recognizing the minor groove of a TATA element. Nature (London), 365, 520–527.Google Scholar
First citation Kim, Y., Geiger, J. H., Hahn, S. & Sigler, P. B. (1993). Crystal structure of a yeast TBP/TATA-box complex. Nature (London), 365, 512–520.Google Scholar
First citation Kostrewa, D. & Winkler, F. K. (1995). Mg2+ binding to the active site of EcoRV endonuclease: a crystallographic study of complexes with substrate and product DNA at 2 Å resolution. Biochemistry, 34, 683–696.Google Scholar
First citation Lewis, M., Chang, G., Horton, N. C., Kercher, M. A., Pace, H. C., Schumacher, M. A., Brennan, R. G. & Lu, P. (1996). Crystal structure of the lactose operon repressor and its complexes with DNA and inducer. Science, 271, 1247–1254.Google Scholar
First citation Nikolov, D. B., Chen, H., Halay, E. D., Hoffman, A., Roeder, R. G. & Burley, S. K. (1996). Crystal structure of a human TATA box-binding protein/TATA element complex. Proc. Natl Acad. Sci. USA, 93, 4862–4867.Google Scholar
First citation Parkinson, G., Wilson, C., Gunasekera, A., Ebright, Y. W., Ebright, R. H. & Berman, H. M. (1996). Structure of the CAP–DNA complex at 2.5 angstroms resolution: a complete picture of the protein–DNA interface. J. Mol. Biol. 260, 395–408.Google Scholar
First citation Pelton, J. G. & Wemmer, D. E. (1989). Structural characterization of a 2:1 distamycin A/d(CGCAAATTGGC) complex by two-dimensional NMR. Proc. Natl Acad. Sci. USA, 86, 5723–5727. Google Scholar
First citation Pelton, J. G. & Wemmer, D. E. (1990). Binding modes of distamycin-A with d(CGCAAATTTGCG)2 determined by two-dimensional NMR. J. Am. Chem. Soc. 112, 1393–1399.Google Scholar
First citation Rice, P. A., Yang, S.-W., Mizuuchi, K. & Nash, H. A. (1996). Crystal structure of an IHF–DNA complex: a protein-induced DNA U-turn. Cell, 87, 1295–1306.Google Scholar
First citation Schultz, S. C., Shields, G. C. & Steitz, T. A. (1991). Crystal structure of a CAP–DNA complex: the DNA is bent by 90 degrees. Science, 253, 1001–1007.Google Scholar
First citation Schumacher, M. A., Choi, K. Y., Zalkin, H. & Brennan, R. G. (1994). Crystal structure of LacI member, PurR, bound to DNA: minor groove binding by alpha helices. Science, 266, 763–770.Google Scholar
First citation Thomas, K. A., Smith, G. M., Thomas, T. B. & Feldmann, R. J. (1982). Electronic distributions within protein phenylalanine aromatic rings are reflected by the three-dimensional oxygen atom environments. Proc. Natl Acad. Sci. USA, 79, 4843–4847.Google Scholar
First citation Winkler, F. K., Banner, D. W., Oefner, C., Tsernoglou, D., Brown, R. S., Heathman, S. P., Bryan, R. K., Martin, P. D., Petratos, K. & Wilson, K. S. (1993). The crystal structure of EcoRV endonuclease and of its complexes with cognate and non-cognate DNA fragments. EMBO J. 12, 1781–1795.Google Scholar
First citation Yang, W. & Steitz, T. A. (1995). Crystal structure of the site-specific recombinase gamma delta resolvase complexed with a 34 bp cleavage site. Cell, 82, 193–207.Google Scholar








































to end of page
to top of page