Collagen: Difference between revisions

No edit summary
Michal Harel (talk | contribs)
No edit summary
 
(146 intermediate revisions by 6 users not shown)
Line 1: Line 1:
<StructureSection load='4CLG' size='500' side='right' caption='Structure of Collagen (PDB entry [[4CLG]])'>Collagen, the most abundant protein in vertebrates, is an extracellular, inextensible fibrous protein that comprises the major protein component of such stress-bearing structures as bones, tendons, and ligaments. The objective of this exercise is to develop an understanding of the fibrous portion of collagen and to show how the different levels of protein structure come together and form a highly ordered and stable fiber.  Collagen's properties of rigidity and inextensibility are due to this highly ordered structure. The non-structurally order part of collagen is not illustrated in this model. This part of the protein complex having a different amino acid composition, lysine and hydroxylysine are particularly important residues, is globular in nature and not as structurally organized. Lysine and hydroxylysine form covalent crosslinks in the protein complex, thereby adding strength and some flexibility to the fiber. This covalent crosslinking continues throughout life and produces rigid collagen and brittle bones in older adults. [[Collagen Structure & Function]] for additional Information, and a link to movies of assembly of triple helix of type I and IV collagen is available in the External Links section.
<StructureSection load='4clg' size='350' side='right' caption='Structure of Collagen (PDB entry [[4clg]] or [[1cag]])' scene='Collagen/Opening/4' >


== Structure of a Segment ==
==Overview==


A fiber section is made up of 5 tropocollagens, each is shown in a <scene name='Collagen/Fiber_section/2'>different color</scene>. One limitation of this model of collagen segment is that instead of having flush cut ends as shown here, the ends of the tropocollagen in an actual fiber section would be <scene name='Collagen/Staggered_cut/2'>staggered</scene>. When the tropocollagens come together to form the fiber segment, they actually overlap one another in a staggered pattern. The presents of these staggered ends permit an association of the tropocollagens from different segments, and this association results in the formation of a strong fiber. <scene name='Collagen/Fiber_section_animation/8'>Add tropocollagens</scene> one at a time to form the fiber section.
'''Collagen''', the most abundant protein in vertebrates, is an extracellular, inextensible fibrous protein that comprises the major protein component of such stress-bearing structures as bones, tendons, and ligaments.  As with all fibrous proteins collagen is, for the most part, characterized by highly repetitive simple sequence. Here we study two model compounds (The structure of [[4clg]]<ref>J.M. Chen, C.E. Kung, S.H. Feairheller, E.M. Brown, AN ENERGETIC EVALUATION OF A "SMITH" COLLAGEN MICROFIBRIL MODEL, <I>J. Protein Chem., </I>'''10''', 535, 1991</ref> is shown in the applet to the right.) for naturally occurring collagen, in order to develop an understanding of the fibrous portion of collagen and to show how the different levels of protein structure come together and form a highly ordered and stable fiber. Collagen's properties of rigidity and inextensibility are due to this highly ordered structure. The part of collagen without structural order is not illustrated in this model. This part of the protein complex having a different amino acid composition, lysine and hydroxylysine are particularly important residues, is globular in nature and not as structurally organized. Lysine and hydroxylysine form covalent crosslinks in the protein complex, thereby adding strength and some flexibility to the fiber. This covalent crosslinking continues throughout life and produces a more rigid collagen and brittle bones in older adults. Go to [[Collagen Structure & Function]] for information on the functions and disorders of collagen and a link in the External Links section of this page for assembly movies of the triple helix of types I and IV.  See also [[Fibrous Proteins]].
select *:a,*:b,*:c; display selected; delay 2; select *:a,*:b,*:c, *:d,*:e,*:f; display selected; delay 2; select *:a,*:b,*:c, *:d,*:e,*:f, *:g,*:h,*:i; display selected; delay 2; select *:a,*:b,*:c, *:d,*:e,*:f, *:g,*:h,*:i, *:j,*:k,*:l; display selected; delay 2; select *:a,*:b,*:c, *:d,*:e,*:f, *:g,*:h,*:i, *:j,*:k,*:l, *:m,*:n,*:p; display selected;
</StructureSection>


<applet load='4CLG' scene='Collagen/Collagen_initial/1' size='300' frame='true' align='right' caption='Collagen' />
== Structure of a Segment ==


The collagen sequence is typically (Gly - Pro - hydroxy-Pro)<sub>n</sub>.  
A fiber segment is made up of 5 tropocollagens, each is shown in a <scene name='Collagen/Fiber_segment/2'>different color</scene>. One limitation of this model of collagen segment is that instead of having flush cut ends as shown here, the ends of the tropocollagen in an actual fiber section would be <scene name='Collagen/Staggered_cut/4'>staggered</scene>. This staggered pattern is produced when the tropocollagens associate to form the fiber segment. The collagen fiber is constructed by connecting the segments together, and the presence of these staggered ends permits the tropocollagens from different segments to form strong attractions adding to the strength of the fiber. Add tropocollagens <scene name='Collagen/Fiber_section_one/2'>one</scene> at a time to form the fiber section, <scene name='Collagen/Fiber_section_two/2'>two</scene>, <scene name='Collagen/Fiber_section_three/3'>three</scene>, <scene name='Collagen/Fiber_section_four/2'>four</scene>, <scene name='Collagen/Fiber_section_five/2'>five</scene>.  View fiber segment as <scene name='Collagen/Fiber_section_backbone/4'>backbone only</scene>.  Viewing the segment from the end one can see that without the side chains being displayed the center of the fiber is empty. Each <scene name='Collagen/One_tropocollagen/1'>tropocollagen molecule</scene> contains 3 parallel peptide chains wrapped around one another to make a right-handed triple helix that is 87 Å long and ~10 Å in diameter.  Tropocollagen displayed as <scene name='Collagen/One_tropocollagen_backbone/1'>backbone</scene> only.
== Lower Levels of Structure ==


Each  <scene name='Collagen/Collagen_chain/1'>chain</scene> forms an elongated left-handed helix. Three of these chains are associated to a right-handed <scene name='Collagen/1cag/5'>triple helix</scene>.
== Primary Structure of Peptide ==


Every third amino acid is <scene name='Collagen/1cag/1'>a glycine</scene>
<scene name='Collagen/One_peptide_wireframe/4'>Show side chains</scene> of the peptide in wireframe display.  Identify the amino acids making up the peptide by resting the cursor on a residue and observing the name in the label (Toggling spin off will make this easier.). Which three amino acids are present in the peptide in a reocurring pattern?  Collagen is characterized by a distinctive repeating sequence: (Gly-X-Y)n where X is often Pro, Y is usually 5-hydroxyproline (Hyp), and n may be >300. The model ([[4clg]]) being studied here contains a <scene name='Collagen/One_peptide_tricolored/3'>repeating sequence</scene> of residues - <font color="#ff0000">Gly</font>-<span style="color:limegreen;background-color:black;font-weight:bold;">Pro</span>-<span style="color:yellow;background-color:black;font-weight:bold;">Hyp</span>.  This sequence produces a conformation which is a <scene name='Collagen/One_peptide_backbone/1'>left-handed helix</scene> with a rise 10.0 Å/turn or <scene name='Collagen/Peptide_3_residue_segments/1'>3.3 residues per turn</scene>, the peptide is colored in three residue segments.  <scene name='Collagen/Peptide_helix_z_axis/1'>Looking down</scene> the center axis of a segment of the helix.  Since a helix with a larger rise is superimposed on the helix described above, the entire center axis does not align for viewing.  The <scene name='Collagen/Ramachandran/2'>Ramachandran plot</scene> shows that the psi and phi angles of the collagen helix are different from the α-helix, which has a rise of 3.6. The two clusters shown here are outside of the area expected for an α-helix. Review where you would expect a cluster of [[Ramachandran_Plots|α-helix]] residues to be located.


<scene name='Collagen/1cag/3'>proline</scene>
== Other Levels of Structure  ==


<scene name='Collagen/1cag/4'>hydroxyproline</scene>
As shown above tropocollagen is formed by <scene name='Collagen/One_tropocollagen/1'>three peptides</scene> twisting around each other, and in doing so the peptides make <scene name='Collagen/Peptide_3_residue_segments2/2'>one turn every ~7 three-residue repeats</scene> (Cyan colored residues mark the approximate length of one turn.).  <scene name='Collagen/One_tropocollagen2/1'>Three cyan colored residues</scene> mark the approximate distance of one turn of the peptides in a tropocollagen.  Tropocollagen displayed as <scene name='Collagen/One_tropocollagen_backbone2/1'>backbone only</scene> clearly shows both types of helical turns - the 3.3 residue/turn and ~21 residue/turn. 


<scene name='Collagen/1cag/2'>alanine</scene>
Looking down the axis of a tropocollagen displayed as wireframe, <font color="#ff0000">glycine</font> can be seen <scene name='Collagen/Gly_position_tropo/2'>positioned in the center</scene> of the triple helix.  The two types of helical turns consistently positions the Gly in the center of the triple helix. <span style="color:limegreen;background-color:black;font-weight:bold;">Proline</span> and the <span style="color:yellow;background-color:black;font-weight:bold;">hydroxyproline</span> are on the <scene name='Collagen/Pros_position_tropo/1'>outside</scene> of the triple helix.  With the hydroxyl group of Hyp extending to the surface of the triple helix, it can be involved in hydrogen bond formation, as will be seen in the next section. The cyclical side chains of Pro and Hyp are some what rigid, and this rigidity adds to the stability  of the collagen fiber. The primary structure of repeating Gly-Pro-Hyp along with the two types of helical turns determine the 3D positions of Gly, Pro and Hyp in the tropocollagen.


==Ribbon and Spacefilling Diagrams of the Collagen Triple Helix==
In order to make a compact strong fiber the interior residues of the triple helix need to be close packed.  The <scene name='Collagen/Gly_no_hindrance/1'>Gly side chain</scene> is the only one small enough to accommodate this close packing in the interior of the triple helix (realize that in this model the hydrogen on the <span style="color:limegreen;background-color:black;font-weight:bold;">α carbon</span> is not displayed).  <scene name='Collagen/Glys_close_pack/1'>Three Gly</scene>, one on each of three different chains, are close packed together.  The gray atoms of the yellow and lime Gly are the α-carbons, and only a hydrogen could fit between these carbons and the atoms of the adjacent Gly.  <scene name='Collagen/Glys_pro_close/2'>A Pro</scene> on each of the 3 chains are shown close packed to the three Gly (lime, cyan, yellow). Adding the <scene name='Collagen/Glys_pro_hyp/1'>Hyp</scene> shows that Pro and Hyp are tightly positioned around the small interior Gly leaving no space for side chains longer than the single hydrogen of Gly.
<kinemage align="right" width="450" height="400" file="collagen1.kin" />


Fibrous proteins are, for the most part, characterized by highly repetitive simple sequences. We shall examine here a trimer that forms a collagen-like triple helix.
Collagen, the most abundant protein in vertebrates, is an extracellular protein that comprises the major protein component of such stress-bearing structures as bones, tendons, and ligaments. Collagen is characterized by a distinctive repeating sequence: (Gly-X-Y)n where X is often Pro, Y is often 5-hydroxyproline (Hyp), and n may be >300. This, as we shall see, causes each collagen chain to assume a left-handed helical conformation with 3.3 residues per turn and a pitch (rise per turn) of 10.0 Å. Three such chains associate in parallel to form a right-handed triple helix.


Here we study a model compound for naturally occurring collagen, a 30-residue synthetic polypeptide of sequence (Pro-Hyp-Gly)4-Pro-Hyp-Ala-(Pro-Hyp-Gly)5, three chains of which associate to form a collagen-like triple helix of parallel strands that is 87 Å long and ~10 Å in diameter.
== Maintainance Forces ==


View1 shows the triple helical molecule in ribbon form seen perpendicular to its triple helical axis and with its three parallel and identical chains, "Chain 1", "Chain 2", and "Chain 3", colored purple, gold, and white, respectively. View2 is down the triple helical axis, a perspective in which this ribbon diagram appears to have a hollow center. However, click the "ANIMATE" button to show the spacefilling form and prove to yourself that the center is not hollow.  Return to the ribbon diagram by clicking the "ANIMATE" button again before continuing.
== Intra-tropocollagen Attractions ==


Go back to View1 and repeatedly click the "2ANIMATE" button. This "grows" Strand 1 from its N- to its C-terminus in differently colored 3-residue increments. Note how the molecule's three strands twist around each other and that the triple helix makes one turn every ~7 three-residue repeats.  
Intra-tropocollagen attractions are primarily hydrogen bonds formed between the peptides in the triple helix.  The three polypeptide chains are <scene name='Collagen/Intra-hbonds/4'>staggered</scene> in position by one residue, that is, a <span style="color:limegreen;background-color:black;font-weight:bold;">Pro</span> on Chain A is at the same level along the triple helix axis as a <font color="#ff0000">Gly</font> on Chain B and a <span style="color:gold;background-color:black;font-weight:bold;">Hyp</span> on Chain C. This staggered arrangement not only <scene name='Collagen/Intra-hbonds2/6'>aligns</scene> a <font color="#ff0000">Gly</font> backbone NH (imino group) with a <span style="color:limegreen;background-color:black;font-weight:bold;">Pro</span> backbone O (carbonyl oxygen) on one of the other peptides but also brings the two groups close enough so that a <scene name='Collagen/Intra-hbonds6/2'>hydrogen bond</scene> can form between the imino hydrogen and the carbonyl oxygen. This alignment occurs with Gly in each of the three peptides so that the Gly imino hydrogens of Chain A form <scene name='Collagen/Hbonds_a_to_b/6'>hydrogen bonds</scene> (<span style="color:orange;background-color:black;font-weight:bold;">orange</span>) with the Pro carbonyl oxygens on Chain B, and likewise Gly of <scene name='Collagen/Hbonds_a_to_b/5'>Chain B to Pro of Chain C</scene> (<span style="color:yellow;background-color:black;font-weight:bold;">yellow</span>) and Gly of <scene name='Collagen/Hbonds_c_to_a3/8'>Chain C to Pro of Chain A</scene> (<span style="color:limegreen;background-color:black;font-weight:bold;">green</span>).  The force of all these hydrogen bonds extending the length of the tropocollagen add up to a strong attractive force which mantain the integrity of the tropocollagen.  Since the main chain N atoms of both Pro and Hyp residues lack H atoms, only Gly can provide hydrogen to form these hydrogen bonds.
 


Repeatedly click the "ANIMATE" button to alternately display the original ribbon diagram and a spacefilling diagram of the polypeptide chains together with their side chains. The chains of the spacefilling diagram, which are colored identically to those of the ribbon diagram, can be individually turned on and off. Displaying one or two chains as ribbons and the remainder in spacefilling form may better reveal the helical character of the triple helix.
== Inter-tropocollagen Attractions ==


==Collagen Backbone and the Effect of a Mutation==
Hydrogen bonds are also an important inter-tropocollagen force which holds the tropocollagens together in the fiber segment. As shown above, <span style="color:gold;background-color:black;font-weight:bold;">Hyp</span> is the outer most residue on the <scene name='Collagen/Pros_position_tropo/1'>surface</scene> of the triple helix, and the hydroxyl groups are the atoms that extend out the most from the surface.  The hydrogen bonds are formed between the hydroxyl hydrogen of a Hyp and a backbone carbonyl oxygen.  As the peptides in a tropocollagen twist about each other they come into <scene name='Collagen/Hlite_c_k_peptides/1'>close contact</scene> with particular peptides in adjacent tropocollagens and then move away from them. The two peptide highlighted in spacefill are located in two different tropocollagens.  Notice that in this case, they make contact with each other in the middle of the strands, and a hydrogen bond is located at this point of contact.  The <scene name='Collagen/Inter-hbonds1/2'>hydrogen bond</scene> consist of the oxygen of a carbonyl of a Hyp in a <font bold="" color="blue"><strong>peptide</strong></font> of one tropocollagen and the hydroxyl hydrogen of a Hyp in a <span style="color:gold;background-color:black;font-weight:bold;">peptide</span> of another tropocollagen.  Another example shows <scene name='Collagen/Hlite_k_o/1'>two peptides</scene> from two different tropocollagens making contact at the ends of the fiber segment, and of course it is within these regions where the inter-tropocollagen attractions occur. At one end a <scene name='Collagen/Inter-hbond2/4'>hydrogen bond</scene> is formed between a hydrogen of Hyp in one <span style="color:gold;background-color:black;font-weight:bold;">peptide</span> and an oxygen of a Gly carbonyl in the second <font color="red"><strong>peptide</strong></font>. At the other end of the two peptides a <scene name='Collagen/Inter-hbond3/2'>Hyp carbonyl oxygen</scene> donates its electrons to a Hyp hydroxyl hydrogen. Show the <scene name='Collagen/2nd_view_hbond3/2'>hydrogen bond</scene> in the context of the six peptides of the two tropocollagens. The above examples of hydrogen bonding illustrate that Hyp plays a central role in maintaining the structures of both the tropocollagen and the collagen fiber.  Without the proper amount of vitamin C in their diets humans can not make Hyp, and therefore can not make stable collagen and strong bones.
<kinemage align="right" width="400" height="400" file="collagen2.kin" />
This kinemage displays all of the atoms of the collagen model compound (Pro-Hyp-Gly)4-Pro-Hyp-Ala-(Pro-Hyp-Gly)5 in stick form (note that the "essential" Gly residue in this model compound's central
triplet is replaced by Ala). View1 shows the triple helix in side view with "Chain 1" in pinktint, "Chain 2" in yellowtint, and "Chain 3" in white. The Pro, Hyp, and Ala side chains, which are independently controlled by the corresponding buttons, are green, cyan, and magenta, respectively. Use View1 and View2, which is down the triple helix axis, to prove to yourself that all Pro and Hyp side chains are on the periphery of the triple helix. These rigid groups are thought to help stabilize the collagen conformation.
== Effect of a Mutation ==
The mutation being considered is an Ala replacing a Gly.  Synthetic model PDB ID: [[1cag]]<ref>J.BELLA,M.EATON,B.BRODSKY,H.M.BERMAN, CRYSTAL AND MOLECULAR STRUCTURE OF A COLLAGEN-LIKE PEPTIDE AT 1.9 A RESOLUTION. ''SCIENCE'', '''266''', 75, 1994</ref> is <scene name='Collagen/1cag/7'>tropocollagen</scene> whose peptides contain thirty residues and have a <scene name='Collagen/Collagen_chain_1cag/4'>sequence</scene> of (Pro-Hyp-Gly)4-Pro-Hyp-Ala-(Pro-Hyp-Gly)5 (Ala displayed as large wireframe and colored as {{Template:ColorKey_Element_C}} {{Template:ColorKey_Element_O}} {{Template:ColorKey_Element_N}}).   Viewing [[1cag]] from the side of the fiber shows: the <scene name='Collagen/1cag1/1'>Gly</scene> is only partially visible because it is buried in the interior, <scene name='Collagen/1cag2/1'>Pro</scene> being much more visible is positioned closer to the surface, <scene name='Collagen/1cag3/1'>Hyp</scene> being entirely on the surface is clearly visible, and <scene name='Collagen/1cag4/1'>Ala</scene> being a substitute for Gly is only partially visible.  


View3 and View4 are side and top views of a segment of the collagen helix in which its three polypeptides all consist of repeating triplets of ideal sequence, (Gly-Pro-Hyp)n. Go to View3 to see that the three polypeptide chains are staggered in sequence by one residue, that is, a Gly on Chain 1 is at the same level along the triple helix axis as a Hyp on Chain 2 and a Pro on Chain 3. Turn on the "H bonds" button (H bonds are represented by dashed orange lines), to see that this staggered arrangement permits the formation of a hydrogen bond from the Gly main chain NH of Chain 1 to the Pro main chain O on Chain 2 (and likewise from Chain 2 to Chain 3 and from Chain 3 to Chain 1). Since the main chain N atoms of both Pro and Hyp residues lack H atoms, this exhausts the ability of the main chain to donate hydrogen bonds. Although the center of the triple helix appears to be hollow in View4, taking into account the van der Waals radii of its various atoms reveals that the center of the triple helix is, in fact, quite tightly packed. Indeed, the above hydrogen bonds pass very close to the center of the triple helix. This close packing accounts for the absolute requirement for a Gly at every third residue in a functional collagen molecule. Since, as you can see, the Gly Ca atoms are near the center of the triple helix, the side chain of any other residue at this position would, as we shall see below, significantly distort and hence destabilize the collagen triple helix.
The <scene name='Collagen/1cag_surface/4'>surface</scene> of the tropocollagen is shown with the Ala appearing as olive and the Pro and Hyp adjacent to the Ala appearing as dark brown. Notice that the surface at these Pro and Hyp bulges slightly.  This protrusion is due to the fact that the packing about the Ala side chains is not as close as it is about the Gly.  In the two side-by-side scenes shown below compare the amount of open space between the chains in the area of the scene center. In the [[1cag]] scene in the area of the Ala the distance between the chains is slightly greater than that of [[4clg]] scene.  


View5 and View6 show the side and top views of the triple helix segment containing an Ala on each chain instead of a Gly. The effect of replacing the Gly H atom side chain with a methyl group to form Ala, the smallest residue substitution possible, is quite striking. The interior of the collagen triple helix is too crowded to accommodate an Ala side chain without significant distortion. The triple helix in this region therefore unwinds and expands so that no H-bonds form in this region. The unwinding of the triple helix in the region about the Ala residues is, perhaps, best seen by returning to KINEMAGE above this one. You can see that the triple helix is bulged out in the center of View1. These conformational changes, which disrupt collagen's rope-like structure, are responsible for the symptoms of such human diseases as osteogenesis imperfecta and certain Ehlers-Danlos syndromes.
==3D structures of collagen==
[[Collagen 3D structures]]


</StructureSection>
__NOTOC__
<table width='100%' align='left' cellpadding='5'><tr><td rowspan='2'>&nbsp;</td><td bgcolor='#eeeeee'><Structure load='4clg' size='400' frame='true' align='left' name='id1' scene='Collagen/Glys_close_wf/1' /></td><td bgcolor='#eeeeee'><Structure load='1cag' size='400' frame='true' align='right' name='id2' scene='Collagen/1cag_ala_pack_wf/1' /></td></tr><tr><td bgcolor='#eeeeee'><center>'''Gly Packing in [[4clg]]'''&nbsp;(<scene name='Collagen/Glys_close_wf/1'> Initial scene</scene>)</center></td><td bgcolor='#eeeeee'><center>'''Ala Packing in [[1cag]] (Mutated Collagen)'''&nbsp;(<scene name='Collagen/1cag_ala_pack_wf/1'> Initial scene</scene>)</center></td></tr></table>




Exercise in large part by John H. Connor (present address: Department of Microbiology, Boston University School of Medicine, 850 Harrison Ave, Boston, MA, 02118, USA)
In order to convince yourself that there is a difference in the interchain distances in the area of the Ala, <scene name='Collagen/1cag_measurements/2'>show distances</scene> between Gly (Ala) and Pro which form intratropocollagen hydrogen bonds.  Hydrogen bonds are not formed between Ala and Pro because the distances between the atoms forming the bonds are too great.  The absence of the intratropocollagen hydrogen bonds, which is due to replacing Gly with a residue having a longer side chain, disrupts collagen's rope-like structure and is responsible for the symptoms of such human diseases as osteogenesis imperfecta and certain Ehlers-Danlos syndromes.


==Coordinates==
==References==
The coordinates for the collagen-like polypeptide were obtained from 1CAG.


{{Reflist}}


==External Links==
[http://www.mc.vanderbilt.edu/cmb/collagen/ Movies] of assembly of triple helix of type I and IV collagen.


{{Clear}}
== Contributor ==
==External Links==
Much of the content of this page was taken from an earlier non-Proteopedia version of Collagen which was in large part developed by '''Gretchen Heide Bisbort''', a 1999 graduate of Messiah College.
[http://www.mc.vanderbilt.edu/cmb/collagen/ Movies] of assembly of triple helix of type I and IV collagen.  


==Another Jmol tutorial==
[[Category:Topic Page]]
[http://www.messiah.edu/molscilab/Jmol/collagen/collagen_index.htm Tutorial] which illustrates and describes the 3D structure of collagen
[[pt:Collagen_(Portuguese)]]

Proteopedia Page Contributors and Editors (what is this?)Proteopedia Page Contributors and Editors (what is this?)

Ala Jelani, Karl Oberholser, Eran Hodis, Tilman Schirmer, Judy Voet, David Canner, Jaime Prilusky, Michal Harel, Alexander Berchansky, Eric Martz