Amino Acids: Difference between revisions
Eric Martz (talk | contribs) |
Eric Martz (talk | contribs) |
||
(38 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
For a general introduction to ''amino acids'', please see [http://en.wikipedia.org/wiki/Amino_acids Amino Acids in Wikipedia]. | For a general introduction to ''amino acids'', please see [http://en.wikipedia.org/wiki/Amino_acids Amino Acids in Wikipedia]. | ||
== | ==22 Standard Amino Acids and Mnemonics== | ||
Here are the names of the twenty standard amino acids, with their three and one-letter<ref>A one-letter notation for amino acid sequences (definitive rules). IUPAC—IUB Commission on Biochemical Nomenclature, pages 639-645, International Union of Pure and Applied Chemistry and International Union of Biochemistry, Butterworths, London, 1971. [ | Here are the names of the twenty-two standard amino acids, with their three and one-letter<ref>A one-letter notation for amino acid sequences (definitive rules). IUPAC—IUB Commission on Biochemical Nomenclature, pages 639-645, International Union of Pure and Applied Chemistry and International Union of Biochemistry, Butterworths, London, 1971. [https://old.iupac.org/publications/pac/1972/pdf/3104x0639.pdf Full Text].</ref> abbreviations. Mnemonic names are intended to help you to remember the one-letter codes, but are not the correct names. | ||
<center> | <center> | ||
Line 20: | Line 20: | ||
</big></tt></td><td><tt><big> | </big></tt></td><td><tt><big> | ||
Lys K Lysine<br> (mnemonic: li<b>K</b>esine) | Lys K Lysine<br> (mnemonic: li<b>K</b>esine) | ||
<br> | <br> | ||
Line 29: | Line 27: | ||
<br> | <br> | ||
Pro P <b>P</b>roline | Pro P <b>P</b>roline | ||
<br> | |||
Pyl O Pyrr<b>o</b>lysine* | |||
</big></tt></td><td rowspan="2"><tt><big> | </big></tt></td><td rowspan="2"><tt><big> | ||
Line 79: | Line 79: | ||
<br> | <br> | ||
Ile I <b>I</b>soleucine | Ile I <b>I</b>soleucine | ||
<br> | |||
Leu L <b>L</b>eucine | |||
</big></tt></td><td><tt><big> | </big></tt></td><td><tt><big> | ||
Sec U Selenocysteine*<br> (mnemonic: seleni<b>U</b>m) | |||
<br> | |||
Ser S <b>S</b>erine | Ser S <b>S</b>erine | ||
<br> | <br> | ||
Line 93: | Line 97: | ||
</tr> | </tr> | ||
<tr><td colspan="4"><center> | <tr><td colspan="4"><center> | ||
<nowiki>*</nowiki> | <nowiki>*</nowiki>Since the mid-20th century, there were 20 standard amino acids. Early in the 21st Century, | ||
[[#Unusual_Amino_Acids| | genetically encoded | ||
[[#Unusual_Amino_Acids|Selenocysteine and Pyrrolysine]] have been included as standard by the World Wide Protein Data Bank<ref name="pdb22" />. | |||
<br> | <br> | ||
† Asx: Asn or Asp; Glx: Gln or Glu; Unk: unknown. | † Asx: Asn or Asp; Glx: Gln or Glu; Unk: unknown. | ||
<br> | <br> | ||
[http:// | [http://old.iupac.org/publications/pac/1972/pdf/3104x0639.pdf IUPAC/IUB 1971 publication] defining one-letter codes. | ||
</center></td></tr> | </center></td></tr> | ||
</table> | </table> | ||
Click on the image below to see it full size | Click on the image below to see it full size with details on 21 amino acids. | ||
{| align="center" | {| align="center" | ||
Line 114: | Line 118: | ||
|} | |} | ||
</center> | </center> | ||
There is also a [http:// | There is also a [http://pdb101.rcsb.org/learn/guide-to-understanding-pdb-data/primary-sequences-and-the-pdb-format list of standard amino acids] at the Protein Data Bank. | ||
{{Clear}} | {{Clear}} | ||
==L- and D-Amino Acids== | ==L- and D-Amino Acids== | ||
<table border='1' cellpadding='3' align='right' hspace="8"><tr><td colspan='2'> | <table border='1' cellpadding='3' align='right' hspace="8" class="wikitable" style="margin:0px 0px 0px 8px;"><tr><td colspan='2'> | ||
PDB Nomenclature | PDB Nomenclature | ||
</td></tr><tr><td> | </td></tr><tr><td> | ||
Line 127: | Line 131: | ||
</td></tr><tr><td> | </td></tr><tr><td> | ||
ALA<br>ARG<br>ASN<br>ASP<br>CYS<br>GLN<br>GLU<br>HIS<br>ILE<br>LEU<br>LYS<br>MET<br> | ALA<br>ARG<br>ASN<br>ASP<br>CYS<br>GLN<br>GLU<br>HIS<br>ILE<br>LEU<br>LYS<br>MET<br> | ||
PHE<br>PRO<br>SER<br>THR<br>TRP<br>TYR<br>VAL | PHE<br>PRO<br>PYL<br>SEC<br>SER<br>THR<br>TRP<br>TYR<br>VAL | ||
</td><td> | </td><td> | ||
DAL<br>DAR<br>DSG<br>DAS<br>DCY<br>DGN<br>DGL<br>DHI<br>DIL<br>DLE<br>DLY<br>MED<br> | DAL<br>DAR<br>DSG<br>DAS<br>DCY<br>DGN<br>DGL<br>DHI<br>DIL<br>DLE<br>DLY<br>MED<br> | ||
DPN<br>DPR<br>DSN<br>DTH<br>DTR<br>DTY<br>DVA | DPN<br>DPR<br>?<ref name="ddd">In January, 2022, no D-selenocysteine nor D-pyrrolysine were present in the [[wwPDB]], according to David Armstrong of PDBe.</ref><br>?<ref name="ddd" /><br>DSN<br>DTH<br>DTR<br>DTY<br>DVA | ||
</td></tr></table> | </td></tr></table> | ||
The asymmetric α-carbon in | The asymmetric α-carbon in 21 of the standard amino acids (all except glycine) is a chiral (stereogenic) center<ref name='chirality' />. Thus, each amino acid can exist in either of two optical isomers or enantiomers (mirror images), designated the L-form and D-form. Nearly all naturally-occuring amino acids are L-amino acids, so named because they are analogous to the L-form of glyceraldehyde<ref name='chirality'>See [http://en.wikipedia.org/wiki/Chirality_(chemistry) Chirality in Wikipedia]].</ref>. (Not all L-amino acids are levorotatory; some rotate polarized light clockwise<ref name='chirality' />). | ||
Although D-amino acids are rare in nature, they do occur (see [http://en.wikipedia.org/wiki/Chirality_(chemistry)#D-Amino_Acid_Natural_Abundance natural abundance in Wikipedia]). D-alanine (DAL) occurs in over 100 entries in the [[PDB]], D-leucine (DLE), D-phenylalanine (DPN), D-serine (DSN), and D-valine (DVA) each occur in over 50. Some of these entries are chemically synthesized, not naturally occuring. Gramicidin is a naturally occuring antibiotic containing several D-amino acids, e.g. [[1nrm]]. [[4glu]] is the 102 amino acid chain of VEGF-A synthesized entirely from D-amino acids. | More than 700 entries in the [[PDB]] contain one or more D amino acids. Although D-amino acids are rare in nature, they do occur (see [http://en.wikipedia.org/wiki/Chirality_(chemistry)#D-Amino_Acid_Natural_Abundance natural abundance in Wikipedia]). D-alanine (DAL) occurs in over 100 entries in the [[PDB]], D-leucine (DLE), D-phenylalanine (DPN), D-serine (DSN), and D-valine (DVA) each occur in over 50. Some of these entries are chemically synthesized, not naturally occuring. Gramicidin is a naturally occuring antibiotic containing several D-amino acids, e.g. [[1nrm]]. [[4glu]] is the 102 amino acid chain of VEGF-A synthesized entirely from D-amino acids. | ||
==Unusual Amino Acids== | ==Unusual Amino Acids== | ||
There are at least two non-standard amino acids | There are at least two historically "non-standard" amino acids now recognized to be [http://en.wikipedia.org/wiki/Genetic_code genetically encoded], discussed below. In 2014, the [[PDB]] began including these two to make 22 standard amino acids<ref name="pdb22">[https://www.wwpdb.org/news/news?year=2014#5764490799cccf749a90cddf Announcement: Standardization of Amino Acid Nomenclature], World Wide Protein Data Bank News, January 8, 2014.</ref>. | ||
===Selenocysteine: the 21st amino acid=== | ===Selenocysteine: the 21st amino acid=== | ||
Rare proteins in all domains of life include '''[[Selenocysteine|selenocysteine (Sec, U)]]''', designated the 21st amino acid<ref name='21st'>PMID: 11028985</ref>. | Rare proteins in all domains of life include '''[[Selenocysteine|selenocysteine (Sec, U)]]''', designated the 21st amino acid<ref name='21st'>PMID: 11028985</ref>. In 2024, over 90 entries in the [[PDB]] include selenocysteine, identified as SEC. | ||
===Pyrrolysine: the 22nd amino acid=== | ===Pyrrolysine: the 22nd amino acid=== | ||
'''[[Pyrrolysine|pyrrolysine (Pyl, O)]]'''<ref>See [http://en.wikipedia.org/wiki/Pyrrolysine Pyrrolysine in Wikipedia].</ref> is a genetically encoded amino acid that occurs naturally in methanogenic archaea, | '''[[Pyrrolysine|pyrrolysine (Pyl, O)]]'''<ref>See [http://en.wikipedia.org/wiki/Pyrrolysine Pyrrolysine in Wikipedia].</ref> is a genetically encoded amino acid that occurs naturally in methanogenic archaea, now designated the 22nd amino acid<ref>PMID: 15380192</ref>. It occurs in [http://www.rcsb.org/pdb/ligand/ligandsummary.do?hetId=PYL several entries] in the PDB. | ||
See [[Pyrrolysine|more details]]. | See [[Pyrrolysine|more details]]. | ||
<!-- | <!-- | ||
Line 162: | Line 166: | ||
===Postranslational modifications=== | ===Postranslational modifications=== | ||
Many unusual amino acids are formed by enzyme-catalyzed reactions that modify a standard amino acid after it has been included in a polypeptide; these are called post-translational modifications. Among them: | Many unusual amino acids are formed by enzyme-catalyzed reactions that modify a genetically encoded standard amino acid after it has been included in a polypeptide; these are called post-translational modifications. Among them: | ||
* phosphorylation of serine, threonine, or tyrosine | * phosphorylation of serine, threonine, or tyrosine | ||
** Example: C kinase [[1bkx]] | |||
* hydroxylation of proline (to yield hydroxyproline, e.g. in [[collagen]]) | * hydroxylation of proline (to yield hydroxyproline, e.g. in [[collagen]]) | ||
* acetylation of lysine or the N terminus | * acetylation of lysine or the N terminus | ||
** Example of N-terminal acetylation: [[4mdh]] | |||
* methylation of lysine | * methylation of lysine | ||
* carboxylation of aspartate or glutamate | * carboxylation of aspartate or glutamate | ||
* nitrosylation, e.g. to produce [[nitrotyrosine]] | * nitrosylation, e.g. to produce [[nitrotyrosine]] | ||
* acylation with a fatty acid | * acylation with a fatty acid (see N-terminal myristoylation of [[Recoverin]]) | ||
* prenylation | * prenylation | ||
* glycosylation of serine, threonine or asparagine (yielding glycoproteins) | * glycosylation of serine, threonine or asparagine (yielding glycoproteins) | ||
** Example of N-linked glycolylation: [[1igy]] | |||
==Structure, Properties, Behaviors in Proteins== | ==Structure, Properties, Behaviors in Proteins== | ||
Angel Herráez has provided a [http://biomodel.uah.es/en/model1/prot/aa-intro.htm tutorial introduction to amino acid structure] in which | Angel Herráez has provided a [http://biomodel.uah.es/en/model1/prot/aa-intro.htm tutorial introduction to amino acid structure] in which 20 amino acids may be visualized in 3D using JSmol. | ||
Please help to expand the list below to include all | Please help to expand the list below to include all 22 amino acids. | ||
===Glycine=== | ===Glycine=== | ||
Gly G: Small. | Gly G: Small. | ||
See [http://en.wikipedia.org/wiki/Glycine Glycine] in Wikipedia, where the structure is shown. Glycine is the smallest amino acid, since its side-chain is nothing but a single hydrogen. Glycine is frequently found in | See also [[Glycine|Glycine in Proteopedia]] and [http://en.wikipedia.org/wiki/Glycine Glycine] in Wikipedia, where the structure is shown. Glycine is the smallest amino acid, since its side-chain is nothing but a single hydrogen. Glycine is frequently found in [[Turns in Proteins|turns]], since the steric clashes of larger amino acids would be problematic. Turns typically occur on the surfaces of proteins, between [[Secondary structure|beta-strands or alpha-helices]]. It is common to find '''[[Conservation, Evolutionary|highly conserved]] glycines in turns on protein surfaces''', since mutation to a bulkier residue would interfere with the fold. Glycine also occurs in [[Secondary structure|beta-strands and alpha-helices]], although its frequency in alpha helices is low (second lowest, after proline, p. 125 in <ref name="kesselbental">Kessel, Amit, and Nir Ben-Tal. ''Introduction to Proteins: Structure, Function, and Motion''. CRC Press, 2011. 653 pages.</ref>). Glycine is commonly found at the C-terminus of alpha helices, and is considered a ''helix terminator'' (p. 125 in <ref name="kesselbental" />). | ||
===Histidine=== | ===Histidine=== | ||
Line 215: | Line 222: | ||
*[[Standard Residues]] | *[[Standard Residues]] | ||
*[http://en.wikipedia.org/wiki/Amino_acids Amino Acids in Wikipedia] | *[http://en.wikipedia.org/wiki/Amino_acids Amino Acids in Wikipedia] | ||
*[ | *[[Basics of Protein Structure]] | ||
==References== | ==References== | ||
<references /> | <references /> |