Basics of Protein Structure: Difference between revisions

Ann Taylor (talk | contribs)
mNo edit summary
Ann Taylor (talk | contribs)
No edit summary
Line 7: Line 7:
Proteins perform many important functions in living organisms, including movement, immune responses, sensing the environment, energy acquisition, and catalyzing reactions. The protein shown to the right is insulin; when insulin isn't properly synthesized or recognized, diabetes occurs.
Proteins perform many important functions in living organisms, including movement, immune responses, sensing the environment, energy acquisition, and catalyzing reactions. The protein shown to the right is insulin; when insulin isn't properly synthesized or recognized, diabetes occurs.


Proteins are long [[chains]] of [[Amino Acids]], and are synthesized by the [[ribosome]], using messenger [[RNA]] as a template. There are 20 amino acids commonly found in proteins. <scene name='60/604417/Ala/2'>Amino acids</scene> contain an <scene name='60/604417/Ala_amino/1'>amino group</scene>, a central carbon atom called the <scene name='60/604417/Ala_alpha/1'>alpha carbon</scene>, and a <scene name='60/604417/Ala_cooh/1'>carboxylic acid</scene>. The 20 amino acids differ by what is attached to the central atom; is variable portion is referred to as the <scene name='60/604417/Ala_side_chain/1'>side chain</scene>. The amino acid shown is alanine; its side chain is a methyl (-CH3) group. The atoms are displayed using the [[CPK|coloring convention]] '''<font color="#808080">Carbon</font>, <span style="background-color:black;color:white;">&nbsp;Hydrogen&nbsp;</span>, <font color="red">Oxygen</font>, <font color="#3050f8">Nitrogen</font>''': {{Template:ColorKey_Element_C}}, {{Template:ColorKey_Element_H}}, {{Template:ColorKey_Element_O}}, {{Template:ColorKey_Element_N}}.
Proteins are long [[chains]] of [[Amino Acids|amino acids]], and are synthesized by the [[ribosome]], using messenger [[RNA]] as a template. There are 20 amino acids commonly found in proteins. <scene name='60/604417/Ala/2'>Amino acids</scene> contain an <scene name='60/604417/Ala_amino/1'>amino group</scene>, a central carbon atom called the <scene name='60/604417/Ala_alpha/1'>alpha carbon</scene>, and a <scene name='60/604417/Ala_cooh/1'>carboxylic acid</scene>. The 20 amino acids differ by what is attached to the central atom; is variable portion is referred to as the <scene name='60/604417/Ala_side_chain/1'>side chain</scene>. The amino acid shown is alanine; its side chain is a methyl (-CH3) group. The atoms are displayed using the [[CPK|coloring convention]] '''<font color="#808080">Carbon</font>, <span style="background-color:black;color:white;">&nbsp;Hydrogen&nbsp;</span>, <font color="red">Oxygen</font>, <font color="#3050f8">Nitrogen</font>''': {{Template:ColorKey_Element_C}}, {{Template:ColorKey_Element_H}}, {{Template:ColorKey_Element_O}}, {{Template:ColorKey_Element_N}}.


Proteins are sometimes compared to <scene name='60/604417/Ins_bead_backbone/3'>beads on a string</scene>, where each amino acid residue is a bead.  These long chains form complicated structures that allow them to perform their function. Even small alterations in any level of the structure can change how the protein does its job, and can lead to diseases.
Proteins are sometimes compared to <scene name='60/604417/Ins_bead_backbone/3'>beads on a string</scene>, where each amino acid residue is a bead.  These long chains form complicated structures that allow them to perform their function. Even small alterations in any level of the structure can change how the protein does its job, and can lead to diseases.
Line 17: Line 17:
There are [[Four levels of protein structure|four different levels of protein structure]].  The <scene name='60/604417/Ins_bead_backbone_labels/1'>primary structure</scene> is the amino acid sequence. The amino acids are connected by an amide bond, made from the amino group (NH2) of one amino acid, and the carboxylic acid (C=O) from another amino acid. In the process of making the bond, a water molecule is removed. The amino acids are linked in a repeating pattern. The [[Backbone representations|backbone]] of the protein is the repeating <scene name='60/604417/N_calpha_co/2'>N-C-C=O</scene> pattern, with the <scene name='60/604417/Side_chains/1'>side chains</scene> projecting out from the backbone. The end with the free -NH2 group is called the Amino or <scene name='60/604417/N_terminus/1'>N terminus</scene>, while the end with a free carboxylic acid is called the <scene name='60/604417/C_terminus/1'>C terminus</scene>. Notice that most protein structure representations do not show the hydrogens. The sequence of amino acids is written and numbered from the N terminus (where protein synthesis begins) to the C terminus (where amino acids are added during protein synthesis), so for <scene name='60/604417/N_to_c/1'>the segment shown</scene>, the sequence would be Val-Asn-Gln, or VNQ, if one letter abbreviations are used for the amino acids. For more practice identifying peptide bonds between amino acids, please try [[User:Stephen Mills/Peptide tutorial 1|Peptide tutorial 1 part 1]] and [[User:Stephen Mills/Peptide tutorial 2|Peptide tutorial 1 part 2]].   
There are [[Four levels of protein structure|four different levels of protein structure]].  The <scene name='60/604417/Ins_bead_backbone_labels/1'>primary structure</scene> is the amino acid sequence. The amino acids are connected by an amide bond, made from the amino group (NH2) of one amino acid, and the carboxylic acid (C=O) from another amino acid. In the process of making the bond, a water molecule is removed. The amino acids are linked in a repeating pattern. The [[Backbone representations|backbone]] of the protein is the repeating <scene name='60/604417/N_calpha_co/2'>N-C-C=O</scene> pattern, with the <scene name='60/604417/Side_chains/1'>side chains</scene> projecting out from the backbone. The end with the free -NH2 group is called the Amino or <scene name='60/604417/N_terminus/1'>N terminus</scene>, while the end with a free carboxylic acid is called the <scene name='60/604417/C_terminus/1'>C terminus</scene>. Notice that most protein structure representations do not show the hydrogens. The sequence of amino acids is written and numbered from the N terminus (where protein synthesis begins) to the C terminus (where amino acids are added during protein synthesis), so for <scene name='60/604417/N_to_c/1'>the segment shown</scene>, the sequence would be Val-Asn-Gln, or VNQ, if one letter abbreviations are used for the amino acids. For more practice identifying peptide bonds between amino acids, please try [[User:Stephen Mills/Peptide tutorial 1|Peptide tutorial 1 part 1]] and [[User:Stephen Mills/Peptide tutorial 2|Peptide tutorial 1 part 2]].   


The second level of structure is called secondary structure, and is the shapes (conformations) formed by short sequences of amino acids. This level of structure is stabilized by <scene name='60/604417/H_bonds/1'>hydrogen bonds</scene> along the backbone. Hydrogen bonds are attractions between an N, O or F and a hydrogen attached to an N, O or F (More about [[hydrogen bonds]].) The two most common shapes are [[Helices in Proteins|alpha helices]] and [[Sheets in Proteins|beta strands]].  These are favored simply because [[Tutorial:Ramachandran principle and phi psi angles|two atoms cannot occupy the same space]] (steric collisions). Insulin only contains <scene name='60/604417/Secondary_structure/1'>alpha helices</scene>; they are show in pink.
The second level of structure is called secondary structure, and is the shapes (conformations) formed by short sequences of amino acids. This level of structure is stabilized by <scene name='60/604417/H_bonds/2'>hydrogen bonds</scene> along the backbone. Hydrogen bonds are attractions between an N, O or F and a hydrogen attached to an N, O or F (More about [[hydrogen bonds]].) The two most common shapes are [[Helices in Proteins|alpha helices]] and [[Sheets in Proteins|beta strands]].  These are favored simply because [[Tutorial:Ramachandran principle and phi psi angles|two atoms cannot occupy the same space]] (steric collisions). Insulin only contains <scene name='60/604417/Secondary_structure/1'>alpha helices</scene>; they are show in pink.


The third level of structure, or tertiary structure, is how the secondary structures pack together to form the overall form of the entire peptide chain. Side chains play an important role in tertiary structure formation, especially the burying of hydrophobic ("water fearing") amino acids in the middle of the structure. In <scene name='60/604417/Hydrophilic/1'>this view</scene>, {{Template:ColorKey_Hydrophobic}} residues are grey and {{Template:ColorKey_Polar}} are shown in light purple. Water molecules are shown with red balls; notice that they tend to be close to the hydrophilic (water loving) groups. Some proteins, like insulin, are also stabilized by<scene name='60/604417/Disulfide_bonds/1'> covalent bonds between the sulfur atoms</scene> (shown in yellow) called disulfide bonds.   
The third level of structure, or tertiary structure, is how the secondary structures pack together to form the overall form of the entire peptide chain. Side chains play an important role in tertiary structure formation, especially the burying of hydrophobic ("water fearing") amino acids in the middle of the structure. In <scene name='60/604417/Hydrophilic/1'>this view</scene>, {{Template:ColorKey_Hydrophobic}} residues are grey and {{Template:ColorKey_Polar}} are shown in light purple. Water molecules are shown with red balls; notice that they tend to be close to the hydrophilic (water loving) groups. Some proteins, like insulin, are also stabilized by<scene name='60/604417/Disulfide_bonds/1'> covalent bonds between the sulfur atoms</scene> (shown in yellow) called disulfide bonds.   

Proteopedia Page Contributors and Editors (what is this?)Proteopedia Page Contributors and Editors (what is this?)

Joel L. Sussman, Ann Taylor, Eric Martz