Beruflich Dokumente
Kultur Dokumente
1
Department of Chemistry and
Chemical Biology, Rutgers,
The State University of New
Jersey, 610 Taylor Road,
Piscataway, NJ 08854-8087,
USA
2
Department of Biochemistry,
University of Medicine and
Dentistry of New Jersey,
Robert Wood Johnson Medical
School, Piscataway,
NJ 08854, USA
The collagen model peptide with sequence (ProHypGly)
4
ProGly(Pro
HypGly)
5
contains a central GlyProGly interruption in the consensus
collagen sequence. Its high-resolution crystal structure defines the
molecular consequences of such an interruption for the collagen triple-
helical conformation, and provides insight into possible structural and
biological roles of similar interruptions in the GlyXY repeating pattern
found in non-fibrillar collagens. The peptide (denoted as the Hyp minus
peptide or Hyp
peptide,
which provides for the first time a high-resolution
picture of the changes of local conformation of the
collagen triple helix in the vicinity of a GlyXGly
interruption.
Figure 1. Asymmetric unit of the Hyp
crystal struc-
ture. (a) Ribbon diagram of the two triple helices in the
asymmetric unit (two perpendicular views), shown in red
and blue respectively. The region coloured in yellow
corresponds to the central GlyProGlyPro sequence for
each peptide (central zone). Chains are labelled ABCfor the
first triple helix (red), and DEF for the second (blue). Labels
are placed at the N-terminal end of each chain. (b) Stick
representation of the two molecules (two perpendicular
views). Pro residues are coloured in gray, Hyp in blue and
Gly in red, except those from the central zone, which are
shown in yellowfor all three chains. The ABCtriple helix is
shown on the left and the DEF triple helix is shown on the
right. Residue numbering is as follows: chain A129, chain
B 3159, chain C 6186, chain D 91119, chain E 121149,
and chain F 151189 (labels for N-terminal proline residues
are shown). The central zones (in yellow) correspond to
residues 1215 (A), 4245 (B), 7275 (C), 102105 (D), 132
135 (E), and 162165 (F). The C-terminal triplet for chain C
(residues 8789) is not shown due to conformational
disorder. Figures were generated with SETOR.
52
299 Crystal Structure of an Interrupted Collagen
Results
Overall structure and main chain conformation
The Hyp
peptide
shows most of the torsion angles clustered in the
region corresponding to the polyproline II helix.
However, six residues in the central zone and one
adjacent to it are outside this region (Figure 4). Two
Hyp residues with torsion angles in the -
helical region (Hyp58 and Hyp148; Figure 4)
illustrate the partial unraveling of the triple helix
at the C-terminal ends.
Side-chain conformation: imino acid ring
puckering
The ring puckering of Pro residues in the Hyp
F
calc
coefficients, and has been con-
toured at 2. The Figure was gener-
ated with SETOR.
52
300 Crystal Structure of an Interrupted Collagen
with 7
5
symmetry and rise per residue of 2.87 .
Each individual chain adopts a right-handed 7
1
helix
with constant twist angle 51.4 (360/7) and height
8.61 (32.87 ). The =51.4 rotation and
Z =8.61 translation thus define a single helical
step. The path of the left-handed superhelix (blue
broken line in Figure 5(a)) shows constant super-
helical twist =102.8 (exactly 2, but left-
handed), and superhelical height Z=2.87 (Z/
3). These values define a single superhelical step in
a 7
5
superhelix. The vertical broken lines in light
blue in Figure 5(a) indicate that there is no helical
drift: any given residue in the helix has the same
cylindrical coordinate as the seventh residue ahead
or behind, both on the superhelix and on its own
helix. Thus, an ideal superhelix twists monotoni-
cally and all helical and superhelical steps are the
same.
A similar Z helical net is shown in Figure 5(b)
for the Pro residues of the ABC triple helix (Pro1
Pro86) in the structure of the Hyp
. The main divergence occurs at the GlyProGlyPro central zone. (b) Variation with sequence of C
distances between the six chains after structural superposition, using chain B as reference. The GlyProGlyPro zone is
highlighted in yellow. (c) Average temperature factor plotted against sequence. The three sequences are shown staggered
to illustrate the correct alignment after the normal one-residue staggering in the collagen triple helix. The GlyProGly
Pro zone is highlighted in yellow. Amino acid one-letter codes used in this Figure: Pro, P; Hyp, O; Gly, G.
301 Crystal Structure of an Interrupted Collagen
All these Pro residues correspond to the central Gly
ProGlyPro region of the peptide, containing the
interruption. Values for between Pro residues in
the non-interrupted zones average 58(9) at the N-
terminal side and 54 (8) at the C-terminal side.
These values are consistent with an overall 7-fold
symmetry for the collagen triple helix (Table 1).
The effects on the superhelix (blue broken line)
appear to be more subtle. Only the superhelical step
between Pro13 and Pro43 results in a large rotation
angle =150. Short values occur at the
superhelical steps Pro40Pro70 (85) and Pro73
Pro15 (82). The rest of the superhelical steps
average =102, very close to the value for an
ideal 7
5
superhelix.
The Z plot allows visualization of the angular
relation between the two ProHypGly regions
flanking the central zones. Light blue lines in Figure
5 monitor the amount of helical drift. Such lines are
completely vertical for an ideal (ProHypGly)
10
triple helix (Figure 5(a)). The N-terminal side of the
Hyp
diagram
show some zigzag, which is more exaggerated
around the GlyProGlyPro zone. Remarkably,
however, the overall orientation of those lines is
mostly straight up, which means that the top part of
the helix has mostly regained its angular coherence
with a standard 7
5
helix. This is true even after the
conformational disruptions seen in the central,
interrupted region, and is mostly achieved through
compensating variations. For example in chain B the
long helical step Pro43Pro45 ( =78) follows
the short one Pro40Pro43 ( =19). The two
steps together add to 97, close to the 102 helical
rotation angle for a two-residue step in a standard 7
1
individual helix. As a result, Pro40 and Pro45 end up
pretty much aligned in the helical net with residues
Pro4 and Pro10 below (chain A), and Pro75 and
Pro81 above (chain C). Similar local variations
throughout the whole helix mostly compensate for
the local angular effect of the individual disruptions
caused by the deletion of a Hyp residue in each
chain. Analysis of the Z helical net for the DEF
triple helix (not shown) yields the same conclusions.
Hydrogen bonding
Hydrogen bonding topology and metrics in the
ProHypGly zones are consistent with the stan-
dard Rich and Crick II (RCII) and C
HO=C
hydrogen bonding patterns seen in other colla-
gen peptides (Table 2).
35
The standard hydrogen
bonding topology is interrupted in the central zone
(Figure 6). Substitution of a ProHypGly tripeptide
by a ProGly dipeptide affects the hydrogen bonding
environment of the Gly residue, but it does so in a
different way for each of the three chains. In the
leading chains, A and D, the NH amide groups
Table 1. Average conformational angles and helical
parameters for the Hyp
structure.
Pro residues are represented by black triangles, Hyp
residues by blue triangles, and Gly residues by red
triangles. Typical secondary structures are indicated
(, -helix; , -sheet; ppII, polyproline II and collagen).
Residues highlighted in yelloware fromthe GlyProGly
Pro zone and depart from the typical conformational
angles of the collagen triple helix. Two Hyp residues near
the C-terminal ends have helical conformational angles
(see the text).
302 Crystal Structure of an Interrupted Collagen
from Gly14 and Gly104 point towards the solvent
and thus lose the capability of inter-chain hydrogen
bonding. In the middle chains B and E, the NH
amide groups from Gly44 and Gly134 remain
pointing inwards, where they form non-standard
hydrogen bonds with carbonyl groups fromtrailing-
chain residues Gly72 and Gly162 respectively (Table
2). This new hydrogen bonding topology results
from the transition between the two sets of RCII
hydrogen bonds at either side of the interruption
(Figure 6). Finally, in the trailing chains C and F, the
NH amide groups from Gly74 and Gly164 revert to
Figure 5. Helical net plots for collagen triple helical molecules with their axes aligned along the Z polar coordinate
axis. (a) An ideal 7
5
triple helix. (b) The ABC triple helix in the Hyp
atoms from Pro residues. The origin of the angular coordinate is arbitrary. Red, yellow and green
lines represent the leading (A), intermediate (B) and trailing (C) chains, respectively. Broken blue lines show the path of
the superhelix, and cyan lines join residues that would be equivalent in an ideal 7
5
triple helix. Pro residues with
significant disruption in their helical twist are indicated (see the text).
303 Crystal Structure of an Interrupted Collagen
standard RCII hydrogen bonding topology. The
amide group of Gly72 points inside the triple helix
but does not have a suitable hydrogen-bonding
partner. Normal C
structure,
compared with 4.1 per tripeptide in the (ProHyp
Gly)
10
and (ProHypGly)
11
structures,
18
or 3.7 per
tripeptide in the GlyAla structure.
9
The reduc-
tion in the number of ordered water molecules
may be due to several reasons, including the
disruption of the water network around the Gly
ProGlyPro central zone and the somewhat lower
resolution of this particular crystal structure
determination. Furthermore, the angular distribu-
tion of water molecules surrounding any given
triple helix in the crystal is unequal, closely
matching the variation of interhelical distance to
its closest neighbours (Figure 8). In particular, the
number of water molecules in the direction of the
shortest separation between triple helices (12.2 ,
angle =0 in Figure 8), is significantly lower than
the number of water molecules in other directions,
which suggests that close proximity between
collagen triple helices results in a smaller number
of intervening ordered water molecules.
Despite this decrease in the observed number of
first shell water molecules, several water bridges
incorporating second and third-shell water mole-
cules are still observed, with very similar geometries
to those reported.
9
Notably, the same intermolecular
water bridges are repeated at either side of the Gly
ProGlyPro central zones, as illustrated by two
examples in Figure 9. The two water motifs shown
connect the two antiparallel triple helices in the
asymmetric unit (those separated by 12.2 ). One
motif is the fusion of two 2 bridges (see Bella et al.
9
Table 2. Standard and unusual hydrogen bonding parameters in the Hyp
HO=C H
O C
O C
O H
O=C C
O=C
HypPro standard 2.54 (0.19) 3.43 (0.20) 139 (5) 127 (6) 137 (6)
GlyPro standard 2.41 (0.21) 3.45 (0.19) 164 (7) 114 (8) 117 (7)
GlyGly standard 2.62 (0.21) 3.15 (0.16) 110 (9) 92 (8) 100 (7)
2.80 (0.20) 99 (8) 112 (7)
Gly42Gly12 2.07 3.15 176 114 114
Gly14Gly42 2.05 3.05 153 172 165
Gly134Gly162 2.43 3.08 118 143 156
Gly162Gly132 1.98 3.02 160 114 113
304 Crystal Structure of an Interrupted Collagen
for nomenclature), one from each triple helix (Figure
9(a) and (b)); the other is a cluster of two 2 bridges,
one from each triple helix, connected by several
additional waters (Figure 9(c) and (d)). Each of these
motifs has local 2-fold symmetry, which is perfectly
consistent with the antiparallel arrangement of the
triple helices that they connect. Each motif is
repeated twice, one at each side of the GlyPro
GlyPro central zone, and in almost exactly the same
angular orientation with respect to the respective
triple-helical axes. The identical positioning of these
water bridges is consistent with the observation
discussed above that the helical segments at either
side of the central zone are in almost perfect angular
register, but with a 2.8 shorter separation in the
vertical axis when compared with a triple helix with
ideal 7
5
symmetry.
Discussion
Interruptions of the characteristic GlyXY
repeating collagen sequence can be perfectly toler-
able in some collagens and have severe pathological
effects in others. Their respective structural signa-
tures are not well defined and may depend on the
nature, location and extent of the breaks. Here we
present the crystal structure of the Hyp
peptide at
2.0 resolution. This structure provides for the first
time a molecular picture of a collagen triple helix
accommodating a GlyXGly interruption in the
collagen repeating sequence GlyXY.
Conformation and hydrogen bonding are
perturbed in the GlyXGly interruption site
The central GlyProGlyPro zone shows a very
localized and subtle perturbation of the triple-helical
conformation, which rapidly reverts to normal in the
ProHypGly zones. Several residues in the central
zone adopt conformational angles outside the
polyproline II region (Figure 4), in contrast with
the situation seen in the structure of the GlyAla
peptide, in which conformational angles in the
interruption zone are similar to those of the rest of
the molecule.
8
High temperature factors and sig-
nificant variability between the individual chains
(Figure 3) suggest an increased conformational
flexibility in the central GlyProGlyPro zone.
The typical hydrogen bonding topology of col-
lagen is altered at the interruption site, which could
account in part for the large decrease in thermal
stability of the Hyp
structure around
the central GlyProGlyPro zone
(highlighted in yellow; residue col-
ouring as in the other Figures).
Green lines represent RCII hy-
drogen bonds and the single pur-
ple line represents the unusual
GlyGly hydrogen bonding pair-
ing (see the text). Only the DEF
triple helix is shown. Hydrogen
bonding in the ABC triple helix is
identical except for a missing
hydrogen bond between Gly72
and Pro13 (equivalent to Gly162
and Pro103 in this Figure). Chain D
is repeated at the right side to
provide a clearer description of
hydrogen bonds with chain F.
305 Crystal Structure of an Interrupted Collagen
chain with near-clockwork precision. With the
accompanying set of C
peptide is substantially
lower than that of the GlyAla peptide, where a
network of interstitial waters interconnects the
peptide groups of the three chains around the three
GlyAla substitutions.
8,9
It would seem that main-
taining a consistent pattern of inter-chain hydrogen
bonds, even if mediated through water, is less
detrimental for the thermal stability of the collagen
triple helix than locally disorganizing that pattern.
The zones flanking the interruption site are
coaxial and show 7-fold symmetry
Despite the local disruption introduced by the
interruption, supercoiling of the Hyp
molecules
occurs in a way as to maintain the two ProHypGly
zones flanking the central zone in angular register
(Figure 5). Local changes in helical and superhelical
twist in the GlyProGlyPro zone quickly com-
pensate each other. An indication of the precision of
this alignment is the repetition of identical inter-
molecular water bridges at either side of the central
zone, at almost exactly the same angular orientation
(Figure 9). Optimization of lateral interactions as
molecules assemble into the crystal lattice may
favour this intramolecular alignment, compensating
for the local perturbations of helical twist at the
central zone.
Fibre diffraction patterns of collagen in tendons
have been classically interpreted with collagen tri-
ple helices with 10
7
or left-handed 10/3 sym-
metry.
36,37
The Hyp
and
GlyAla structures showa more mixed distribution,
with a slight preference for the down conformation:
35 conformers down against 24 up in the Hyp