Beruflich Dokumente
Kultur Dokumente
The Helix-Turn-HelixDNA
Binding Motif
Richard G . Brennan and Brian W.Matthews
From the Institute of Molecular BWloBy and Department of
Physics, University of Oregon, Eugene, Oregon97403
1903
1904 Minireview: Helix-Turn-Helix DNA Binding Motif
k- H e l i x --+Turn +- H e l i=x= - . Score
- 2 -1 1 2 3 4 5 6 7 8 9 10 11 1 1
23 14 11
5167 18 1 9 20
M a s t e rs e t of s e q u e n c e s
X Cro 1 4 - F G Q T K T 4 K D L C V Y Q S A J N K A I H 0.70
434 C r o 1 6 - M T Q T E L A T K A G V K Q Q S l Q L I E A 0.66
P 2 2 R e p . ( C129)- 1 R Q A A L G K M V 6 V S N V A I S Q W E R 0.67
X C I I 2 4 - L G T E K T _ A E A V _ G V D K S Q J S R W K R 0.75
lacR 4 - V T L Y D V A E Y A G V S Y Q T P S R V V N 0.69
CAP 1 6 7 - 1 T R Q E I G Q I V G C S R E T ~ G R I L K 0.70
trp R 6 6 - M S Q R E L K N E L G A G I A T L T R G S N 0.71
GalR 2 -A T I K D V A R L A G V S V A T P S R V I N 0.73
0 32 2 5 1 - S T L Q L E _ A D R Y _ G V S A E R Y R Q L E K 0.79
Leu0 1 8 - Q N I T R A A H V L G M S Q P A Y S N A V A 0.79
LysR 1 9 - G S L T E A A H L L H T S Q P T Y S R E L A 0.79
AmpR 2 1 - L S F T H A A I E L N V T H S A I S Q H V K 0.80
AntP 9 1 4 3 P Q A Q T N G Q L G V P Q Q Q Q Q Q Q Q Q 0.80
Nul X 3 ” J N K K Q L _ A D I F C A S I R T J Q N w Q E 0.80
MuB 1 9 - T T F K Q 1 4 L E S p L S T G T L S S F I N 0.80
d n a B ( E . c o l i ) 366-R S L K A L A K E L N V P V V A L S Q L N K 0.80
BirA 2 0 - H S G E Q L G E T L G M S R A A L N K H I Q 0.80
T7 g e 1n 8e . 5 61-K Y Q E D L A A L E G T S D R I 1 S D I. R S 0.80
CytR l o - A M I K D V A L K A K V S T A T E S R A L M 0.80
MaLal 1 1 4 - K E K E E V A K K C G I T P L Q 1 R V W V C 0.80
P2R 1 8 - L S R Q Q L A D L T G V P Y G T L S Y Y E S 0.80
in Cro). (iii) Residues 3-8 and 15-20 should not be proline (being strongly predicted sequences in Fig. 2 (AAC < 0.79) violates these
within a-helices). (iv) Residue 5 should not be a @-branchedresidue expected characteristics.
(being wedged between the two helices). There are anumber of proteins that have been proposed to include
These characteristics are not absolute, and itis quitepossible that potential helix-turn-helix motifs but agree poorly with the master set
they could be violated by a bona fide helix-turn-helix motif (e.g. a of reference sequences used here. Examples include homeo boxes,
functional mutant of A-repressor lacks the “invariant” glycine at histones, some transcriptional activator proteins, resolvases and in-
position 9 (34)). However, it is worth noting that none of the most version proteins, and the AIDS protein “tat.”’ The poor scores asso-
Minireview: Helix-Turn-Helix DNA Binding Motif 1905
ciated with these proteins do not, of course, prove that they do not TABLE I
have helix-turn-helix motifs; it only suggests that the likelihood is Structural correspondence between helix-turn-helix motifs
not high. The table gives the root mean square discrepancy between 21 a-
Dodd and Egan (32) have suggested a method of predicting putative carbons in the helix-turn-helix of the subject protein and thatof Cro
helix-turn-helix motifs that is similar in principle to theone described repressor. The rank corresponds to the numbering used in Fig. 3.
here. It uses a masterset of 37 sequences and a somewhat complicated
method of scoring agreements. On the whole it gives results similar Protein Residues Discrepancy Rank
to those shown in Fig. 2, although it does not successfully predict the A
known helix-turn-helix motif in trp repressor. Another method of X Cro 16-36
predicting helix-turn-helix motifs has been proposed by White (33). Phage 434 repressor 17-37
1 0.43
This method also does not predict the known helix-turn-helix in trp Phage 434 Cro 19-39 0.56 2
repressor and, inaddition, incorrectly suggests helix-turn-helix motifs X repressor 33-53
3 0.68
in myoglobin and superoxide dismutase. Trp repressor 64-84 0.89 4
CAP 169-189 0.90 5
Structural Comparisons Cytochrome c peroxidase 132-152 1.39 6
The initial recognition of the significance of the helix-turn-helix DNA binding protein I1 7-27 1.43 7
motif derived as much from structural comparisons as from consid- L7/L12 ribosomal protein 16-36 1.61 8
erations of amino acid sequences. In particular, CAP and Cro were
seen to contain a similar helix-turn-helix substructure in their pre-
substructures similar to thehelix-turn-helix motif. The first of these
sumed DNA binding regions even though the rest of these protein
is "DNA-binding protein 11,” a nonspecific DNA-bindingprotein from
structures had little in common (4). 24 a-carbon atoms (residues 13-
36) including the helix-turn-helix of Cro could be superimposed with
Bacillus stearothermophilus (36), the second is cytochrome c peroxi-
A
a root mean square discrepancy of 1.1 on the corresponding 24 a- dase, and the third is the ribosomal protein L7/L12 (Table I). The
carbons (166-189) of CAP (4). The significance of this result was latter two helix-turn-helix structures were found in a recent search
through the Brookhaven Data Bank by Richardson and Richardson
enhanced by the finding that the structure of X-repressor also con-
(37). Although the helix-turn-helix units inthesethreeproteins
tained a helix-turn-helix substructure (5). In this case 23 a-carbons
correspond reasonably well with that of Cro, none agrees as well as
(residues 31-53) could be superimposed on residues 14-36 of Cro with
a root mean square discrepancy of only 0.7 A.
A systematic search the helix-turn-helix units in the proteins that are known to bind
through all protein structures in the Brookhaven Data Bank failed DNA sequence specifically (Fig. 3). Nevertheless, it is intriguing that
two of the three examples are proteins that interact with nucleic
to find any 22 a-carbon segment that corresponded to thehelix-turn-
acids. The amino acid sequences within these three helix-turn-helix
helix as seen in Cro, X-repressor, and CAP (4,35) (Fig. 3).
units do not obviously correspond with the master set of 10 sequences.
Now that the helix-turn-helix has been observed in a number of
other DNA-binding proteins, it is possible to extend the initial However, the amino acid sequences of several homologs of the non-
structural comparisons. The correspondence between the helix-turn- specific DNA-binding protein have been determined and one of these,
helix unit of Cro (1) with those seen in CAP (2), X-repressor (3), trp from Rhizobium melitoti (DBPZZ ( R m ) )yields a score of 0.80(Fig. 2).
repressor (24), phage 434 Cro (25), and phage 434 headpiece (26) is Furthermore, this score is for the set of 22 amino acids that corre-
given in Table I. In making these comparisons Cro was chosen as the spond to the observed helix-turn-helix in the B. stearothermophilus
“reference” structure, in part because it was used for the initial enzyme. The significance, if any, of this correspondence is a matter
comparisons, but also because it agrees better with most of the other for debate.
helix-turn-helix motifs than they do among themselves. The back-
bone segment that is most precisely conserved in all six of the above Role of the Helix-Turn-Helix in Recognition
proteins consistsof 21 a-carbon atoms (Table I). Outside this region, For Cro protein the “recognition helix” (i.e.the second helix of the
the individual structures may differ substantially. helix-turn-helix motif) protrudes from the surface of the protein and
There are three other proteins that have been reported to contain matches in shapethe major groove of the DNA (Fig. 1). This comple-
mentarityinshapecan facilitate binding of the protein to both
sequence-specific and nonspecific sites on the DNA.As shown in
Table I, the backbone conformations of the helix-turn-helix motifs
in other sequence-specific DNA-binding proteins are very similar to
that seen in Cro. Notwithstanding this overall similarity, the helix-
turn-helix motifs do not interact with the DNA in exactly the same
way in each repressor-operator complex (19-23). This had been
anticipated from early structural comparisons of Cro and X-repressor
(5, 13), although “helix-swap” experiments (38,39) suggested similar
rather than dissimilar geometries of binding. It can nowbe seen
directly in the different repressor-operator complexes that the re-
spective helix-turn-helix motifs adopt approximately similar but dis-
tinctly different binding geometries with respect to theDNA operator
(19-23).
I
15
The helix-turn-helix does not function as a fixed “reading head”
R.rn.s. dlfferencs (dl that is always aligned in the same way relative to the DNA. Rather,
recognition is achieved by a combination of factors. Specific hydrogen
FIG. 3. Histogram showing the result of a search through bonds and van der Waals contacts between the side chains of the
the Brookhaven Data Bank for helix-turn-helix motifs. The protein and the partsof the base pairs exposed within the grooves of
inset shows the superposition of the helix-turn-helix motifs as initially the DNA clearly play a major role. Appropriately placed water mole-
seen in Cro and CAP (4). The horizontal axis is the root-mean-square cules that bridge between the protein and the DNA may also partic-
discrepancy between the 24 a-carbon atoms in the helix-turn-helix ipate in the recognition of a specific sequence. The DNA may also
of Cro and all possible 24 a-carbon segments for the known protein adapt to enhance contacts within the grooves as well as interactions
structures. Also shown are the root mean square discrepancies be- with the phosphate backbone.
tween the helix-turn-helix of Cro and the eight other proteins listed The role of the helix-turn-helix appears to be to provide a rigid
in Table I. Note that thehistogram is taken from the original search underlying framework that supports the recognition surface of the
(4,34) and was carried out using the 24 a-carbon atoms 13-36 of Cro protein but not to define how that recognition surface is aligned with
repressor. In contrast, the individual values 1-8 shown in the figure the DNA.
were calculated for the shorter21 a-carbon segment 16-36. As is clear The complexes involving X-repressor headpiece, 434 Cro, and 434
from the figure and from Table I, this 21 a-carbon segment is well repressor headpiece all support the expectation that these proteins
conserved in all known repressor and activator crystal structures. recognize their specific binding sites on the DNA by direct contacts
Beyond the 21 a-carbons, however, the protein backbones may differ with the edges of the base pairs exposed within the grooves of the
substantially in the different structures. DNA (19-22). In the complex of trp repressor, however, there are no
1906 Minireview: Helix-Turn-Helix DNA Binding Motif
direct contacts between the protein and the base pairs that have been Acknowledgments-We thank Dr. Peter von Hippel for helpful
shown in uiuo to be important in recognition. Sigler and colleagues discussions on sequence-specificand sequence-nonspecificDNA-pro-
(23) argue that trp repressor provides an example of indirect readout. teininteraction. We are also grateful to Drs. C. 0. Pabo, S. C.
In other words, they suggest that trp operator is recognized indirectly Harrison, P. B. Sigler, and T. A. Steitz for providing coordinates, and
through sequence-specific changes induced in the geometry of the Drs. J. S. Richardson and D.C. Richardson for a copy of their
phosphate backbone that, in turn, permit the formation of a tight manuscript in advance of publication.
complex with the protein. This rationalization for the specific binding
of trp repressor poses several questions. In particular, the observed REFERENCES
distortions in the structure of the trp operator are notlarge, and it is
not obvious why different DNA sequences could not assume confor- 1. Anderson, W. F., Ohlendorf, D. H., Takeda, Y., and Matthews,B. W. (1981)
Nature 2 9 0 , 754-758
mations similar to that adopted by the trp operator. The crystals of 2. McKay, D. B., and Steitz, T. A. (1981) Nature 2 9 0 , 744-749
the trp repressor-operator complex were grown from 50 mM NaC1, 11 3. Pabo, C. O., and Lewis, M. (1982) Nature 298,443-447
mM CaCL, 10 mM cadodylate and 35% methylpentanediol(23). Such 4. Steitz, T. A., Ohlendorf D. H., McKay D.B., Anderson, W. F., and
Matthews, B. W. (198;) Proc. Natl. Acah. Sci. U. S. A. 7 9 , 3097-3100
low salt, high alcohol conditions would be expected to increase the 5. Ohlendorf, D. H., Anderson, W. F., Lewis, M., Paho, C. O., and Matthews,
affinity of nonspecific DNA binding. Indeed, von Hippel and co- B. W. (1983) J . Mol. Biol. 169,757-769
workers (40) have shown that in low salt the affinity of lac repressor 6. Anderson, W. F., Takeda, Y., Ohlendorf, D. H., and Matthews,B. W. (1982)
for nonspecific sites on DNA can approach, if not exceed, its affinity -1. _-
. ."
1KB. 745-751
- . -Mol R h / "_,
7. Sauer, R. T., Yocum, R. R., Doolittle, R. F., Lewis, M., and Paho, C. 0.
for the lac operator. Furthermore, they have also shown that the (1982) Nature 298,447-451
nonspecific binding of lac repressor is enhanced by the presence of 8. Matthews, B. W., Ohlendorf, D. H., Anderson, W. F., and Takeda,Y. (1982)
glycerol (41). If trp repressor behaves similarly to lac repressor, it Proc. Natl. Acad. Sci. U. S. A. 79.1428-1432
9. Ohlendorf, D. H., Anderson, W. F.; and Matthews, B. W. (1983) J. Mol.
raises the possibility, first suggested to us by von Hippel,' that the Euol. 1 9 . 109-114
conditions necessary to crystallize the trp repressor-operator complex 10. Giequel-Sanzey, B., and Cossart, P. (1982) EMBO J. 6,591-595
may have resulted in a sequence nonspecific complex. This question 11. Weber, I. T., McKay, D. B., and Steitz, T.A. (1982) Nucleic Acids Res. 1 0 ,
5nRx-m nv
will need to be addressed in future studies. 1
""
12. Ohlendorf, D. H.,Anderson, W. F., Fisher, R. G., Takeda, Y., and Matthews,
The reason for the precise conservation of the backbone configu- B. W. (1982) Nature 298.718-723
ration of the helix-turn-helix motif in the different repressors is not 13. Lewis, M'., Jefkrey, A., Wang, J., Ladner, R., Ptashne, M., and Pabo, C. 0.
obvious. If different helix-turn-helix motifs were required to align on (1983) Cold Spring Harbor Symp. Qunnt. Biol. 47,435-440
14. Weber, I. T., and Steitz, T. A. (1984) Proc. Natl. Acad. Sei. U. S. A. 8 1 ,
the DNA in exactly the same manner, then it would be easy to argue 3973-3877
~~ ~ ""
that this required strict conservation of the shape of the helix-turn- 15. Ohlendorf, D. H., and Matthews,B. W. (1983) Annu. Reu. Biophys. Bioeng.
helix motif. As was noted, this is not the case. Different helix-turn- ", 359-344-
12. _i_ "
helix motifs are seen to bind to DNA with different alignments. 16. Takeda, Y . ,Ohlendorf, D. H., Anderson, W. F., and Matthews,B. W. (1983)
Science 221,1020-1026
Recent evidence (42-44) suggests that thehelix-turn-helix unit of lac 17. Pabo, C. O., and Sauer, R. T. (1984) Annu. Reu. Biochem. 63,293-321
repressor could be aligned on the DNA in adirection opposite to that 18. Schleif R. (1988) Science 241,1182-1187
of X-repressor, cro 434, and trp repressor. 19. Andersbn, J. E., Ptashne, M., and Harrison,S. C. (1987) Nature 326,846-
Q50
There is also no obvious structural reason for the conservation of 20. Aggarwal, A. K., Rodgers, D., Drottar, M., Ptashne, M., and Harrison, S.
the helix-turn-helix motif. Because structures corresponding to the C. (1988) Science 242,899-907
helix-turn-helix motif are not found in protein structures in general 21 Jordan, S. R., and Pabo, C. 0. (1988) Science 242,893-899
(Fig. 3), itseems unlikely that thehelix-turn-helix unit has a confor- 22, Wolberger, C., Dong, Y., Ptashne, M., and Harrison, S. C. (1988) Nature
336,789-795
mation that is energetically favored during the folding process. In- 23. Otwinowski, R. W., Schevitz, R. W., Zhang, R.-G., Lawson, C. L., Jowh-
spection of observed helix-turn-helix motifs shows that there can be imiak, A., Marmorstein, R. Q., Luisi, B. F., and Sigler, P. B. (1988)
direct van der Waals contactsbetween residues in the "elbow" region, Nature 336,321-329
24. Schevitz, R. W., Otwinowski, Z., Joachimiak, A., Lawson, C. L., and Sigler,
but the area in contact isrelatively small. Pabo and Sauer (17) have P. B. (1985) Nature 317,782-786
noted that the contact residues include the highly conserved alanine 25. Mondragon, A,, Wolberger, C., and Harrison, S. C. (1988) J. Mol. Biol., in
at position 5 and the conserved isoleucine or valine at position 15 press
(Fig. 2) and have suggested that this"invariant" contact helps main- 26. Monhagon, A., Subbiah, S., Almo, S. C., Drottar, M., and Harrison, S. C.
(1988) J. Mol. Biol., in press
tain the angle between the two a-helices. On the other hand, the 27. Zuiderweg, E. R. P., Kaptein, R., and Wuthrich, K. (1983) Proc. Natl. Acad.
observations that CAP has a glycine and trp repressor has a lysine in Sci. U. S. A. 80,5837-5841
place of the critical alanine suggest that the interhelix angle is not 28. Howard, P. K., Shaw, J., and Otsuka, A. J. (1985) Gene (Amst.) 3 5 , 321-
331
determined by these contacts alone. 29. Buoncristiani, M. R., Howard, P. K., and Otsuka,A. J. (1986) Gene ( A m t . )
Perhaps the most reasonable explanation is that the conservation 44,255-261
of the backbone of the helix-turn-helix motif is anatural consequence 30. Doolittle, R. F. (1981) Science 214,149-159
31. Brennan, R. G., Weaver, L. H., and Matthews, B. W. (1986) Chem. Scr.
of evolution. Presumably all helix-turn-helix motifs that have amino 2 6 B , 251-255
acid sequence correspondence evolved from a common precursor. It 32. Dodd, I. B., and Egan, J. B. (1987) J. Mol. Bid. 1 9 4 , 557-564
is well known that the amino acid sequence of a protein changes 33. White, S. W. (1987) Protein Eng. 1 , 373-376
much more rapidly during evolution than does its three-dimensional 34. Hochschild, A., Irwin, N., and Ptasbne, M. (1983) Cell 32,319-325
35. Matthews, B. W., Ohlendorf, D. H., Anderson, W. F., Fisher, R. G., and
structure. For this reason, the backbone of the helix-turn-helix motif Takeda, Y. (1983) Cold Spring Harbor Symp. Qunnt. Biol. 47,427-433
would be expected to be conserved during evolution. If, in addition, 36. Tanaka, I., Appelt, K., Dijk, J., White, S. W., and Wilson, K. S. (1984)
the helix-turn-helix were to be intimately associated with the DNA Nature 310,376-381
37. Richardson, J. S., and Richardson,D. C. (1988) Proteins Structure, Func-
in the DNA-protein complex, then it would be heavily constrained tion and Genetics,in press
not to change its shape. In this scenario, the evolution of DNA- 38. Hochschild, A., Douhan, J., 111, and Ptashne, M. (1986) Cell 47,807-816
binding proteins specific for different operators would be achieved by 39. Wharton, R. P., and Ptashne, M. (1986) Trends Biochem. Sci 11, 71-73
a sequential process in which changes of individual amino acids within 40. Wang, A. C., Revzin, A., Butler, A. P., and von Hippel, P. H. (1977) Nucleic
Acids Res. 4,1579-1594
the helix-turn-helix motif, as well as larger changes elsewhere in the 41. Revzin, A,, and von Hippel, P. H. (1977) Biochemistry 16,4769-4776
DNA-binding protein, could result in new sequence preferences, but 42. Boelens. R.. Scheek. R. M.. van Boom. J. H.. and Kaotein. . . R. (1987) J .
the backbone of the helix-turn-helix motif would remain essentially Mol. biol.' 1 9 3 , 213-216 '
43. Lebming, N., Sartorius, J., Niemoller, M., Genenger, G., Wilcken-Berg-
invariant. mann, B. v., and Mdler-Hill, B. (1987) EMBO J. 6,3145-3153
44. Lebming, N., Sartorius, J., Oehler, S., Wilcken-Bergmann, B. v., and
' P. von Hippel, personal communication. Muller-Hill, B. (1988) Proc. Natl. Acad. Sci. U. S. A. 8 5 , 7947-7951