Amino Acids: Difference between revisions

Eric Martz (talk | contribs)
Eric Martz (talk | contribs)
 
(31 intermediate revisions by 2 users not shown)
Line 1: Line 1:
For a general introduction to ''amino acids'', please see [http://en.wikipedia.org/wiki/Amino_acids Amino Acids in Wikipedia].
For a general introduction to ''amino acids'', please see [http://en.wikipedia.org/wiki/Amino_acids Amino Acids in Wikipedia].


==20 Standard Amino Acids and Mnemonics==
==22 Standard Amino Acids and Mnemonics==


Here are the names of the twenty standard amino acids, with their three and one-letter<ref>A one-letter notation for amino acid sequences (definitive rules). IUPAC—IUB Commission on Biochemical Nomenclature, pages 639-645, International Union of Pure and Applied Chemistry and International Union of Biochemistry, Butterworths, London, 1971. [http://pac.iupac.org/publications/pac/pdf/1972/pdf/3104x0639.pdf Full Text].</ref> abbreviations. Mnemonic names are intended to help you to remember the one-letter codes, but are not the correct names.
Here are the names of the twenty-two standard amino acids, with their three and one-letter<ref>A one-letter notation for amino acid sequences (definitive rules). IUPAC—IUB Commission on Biochemical Nomenclature, pages 639-645, International Union of Pure and Applied Chemistry and International Union of Biochemistry, Butterworths, London, 1971. [https://old.iupac.org/publications/pac/1972/pdf/3104x0639.pdf Full Text].</ref> abbreviations. Mnemonic names are intended to help you to remember the one-letter codes, but are not the correct names.


<center>
<center>
Line 20: Line 20:


</big></tt></td><td><tt><big>
</big></tt></td><td><tt><big>
Leu L <b>L</b>eucine
<br>
Lys K Lysine<br>&nbsp;&nbsp;(mnemonic:&nbsp;li<b>K</b>esine)
Lys K Lysine<br>&nbsp;&nbsp;(mnemonic:&nbsp;li<b>K</b>esine)
<br>
<br>
Line 29: Line 27:
<br>
<br>
Pro P <b>P</b>roline
Pro P <b>P</b>roline
<br>
Pyl O Pyrr<b>o</b>lysine*


</big></tt></td><td rowspan="2"><tt><big>
</big></tt></td><td rowspan="2"><tt><big>
Line 79: Line 79:
<br>
<br>
Ile I <b>I</b>soleucine
Ile I <b>I</b>soleucine
<br>
Leu L <b>L</b>eucine


</big></tt></td><td><tt><big>
</big></tt></td><td><tt><big>
Sec U Selenocysteine*<br>&nbsp;&nbsp;(mnemonic:&nbsp;seleni<b>U</b>m)
<br>
Ser S <b>S</b>erine
Ser S <b>S</b>erine
<br>
<br>
Line 93: Line 97:
</tr>
</tr>
<tr><td colspan="4"><center>
<tr><td colspan="4"><center>
<nowiki>*</nowiki>Non-standard:
<nowiki>*</nowiki>Since the mid-20th century, there were 20 standard amino acids. Early in the 21st Century,
[[#Unusual_Amino_Acids|Pyl O Pyrr<b>o</b>lysine; Sec U Selenocysteine]]
genetically encoded
(seleni<b>U</b>m).
[[#Unusual_Amino_Acids|Selenocysteine and Pyrrolysine]] have been included as standard by the World Wide Protein Data Bank<ref name="pdb22" />.
<br>
<br>
&dagger; Asx: Asn or Asp; Glx: Gln or Glu; Unk: unknown.
&dagger; Asx: Asn or Asp; Glx: Gln or Glu; Unk: unknown.
<br>
<br>
[http://pac.iupac.org/publications/pac/pdf/1972/pdf/3104x0639.pdf IUPAC/IUB 1971 publication] defining one-letter codes.
[http://old.iupac.org/publications/pac/1972/pdf/3104x0639.pdf IUPAC/IUB 1971 publication] defining one-letter codes.
</center></td></tr>
</center></td></tr>
</table>
</table>


Click on the image below to see it full size. It shows the 20 amino acids plus one unusual amino acid, [[selenocysteine]].
Click on the image below to see it full size with details on 21 amino acids.


{| align="center"
{| align="center"
Line 119: Line 123:
==L- and D-Amino Acids==
==L- and D-Amino Acids==


<table border='1' cellpadding='3' align='right' hspace="8"><tr><td colspan='2'>
<table border='1' cellpadding='3' align='right' hspace="8" class="wikitable" style="margin:0px 0px 0px 8px;"><tr><td colspan='2'>
PDB Nomenclature
PDB Nomenclature
</td></tr><tr><td>
</td></tr><tr><td>
Line 127: Line 131:
</td></tr><tr><td>
</td></tr><tr><td>
ALA<br>ARG<br>ASN<br>ASP<br>CYS<br>GLN<br>GLU<br>HIS<br>ILE<br>LEU<br>LYS<br>MET<br>
ALA<br>ARG<br>ASN<br>ASP<br>CYS<br>GLN<br>GLU<br>HIS<br>ILE<br>LEU<br>LYS<br>MET<br>
PHE<br>PRO<br>SER<br>THR<br>TRP<br>TYR<br>VAL
PHE<br>PRO<br>PYL<br>SEC<br>SER<br>THR<br>TRP<br>TYR<br>VAL
</td><td>
</td><td>
DAL<br>DAR<br>DSG<br>DAS<br>DCY<br>DGN<br>DGL<br>DHI<br>DIL<br>DLE<br>DLY<br>MED<br>
DAL<br>DAR<br>DSG<br>DAS<br>DCY<br>DGN<br>DGL<br>DHI<br>DIL<br>DLE<br>DLY<br>MED<br>
DPN<br>DPR<br>DSN<br>DTH<br>DTR<br>DTY<br>DVA
DPN<br>DPR<br>?<ref name="ddd">In January, 2022, no D-selenocysteine nor D-pyrrolysine were present in the [[wwPDB]], according to David Armstrong of PDBe.</ref><br>?<ref name="ddd" /><br>DSN<br>DTH<br>DTR<br>DTY<br>DVA
</td></tr></table>
</td></tr></table>


The asymmetric &alpha;-carbon in 19 of the standard amino acids (all except glycine) is a chiral center<ref name='chirality' />. Thus, each amino acid can exist in either of two optical isomers or enantiomers (mirror images), designated the L-form and D-form. Nearly all naturally-occuring amino acids are L-amino acids, so named because they are analogous to the L-form of glyceraldehyde<ref name='chirality'>See [http://en.wikipedia.org/wiki/Chirality_(chemistry) Chirality in Wikipedia]].</ref>. (Not all L-amino acids are levorotatory; some rotate polarized light clockwise<ref name='chirality' />).
The asymmetric &alpha;-carbon in 21 of the standard amino acids (all except glycine) is a chiral (stereogenic) center<ref name='chirality' />. Thus, each amino acid can exist in either of two optical isomers or enantiomers (mirror images), designated the L-form and D-form. Nearly all naturally-occuring amino acids are L-amino acids, so named because they are analogous to the L-form of glyceraldehyde<ref name='chirality'>See [http://en.wikipedia.org/wiki/Chirality_(chemistry) Chirality in Wikipedia]].</ref>. (Not all L-amino acids are levorotatory; some rotate polarized light clockwise<ref name='chirality' />).


Although D-amino acids are rare in nature, they do occur (see [http://en.wikipedia.org/wiki/Chirality_(chemistry)#D-Amino_Acid_Natural_Abundance natural abundance in Wikipedia]). D-alanine (DAL) occurs in over 100 entries in the [[PDB]], D-leucine (DLE), D-phenylalanine (DPN), D-serine (DSN), and D-valine (DVA) each occur in over 50. Some of these entries are chemically synthesized, not naturally occuring. Gramicidin is a naturally occuring antibiotic containing several D-amino acids, e.g. [[1nrm]]. [[4glu]] is the 102 amino acid chain of VEGF-A synthesized entirely from D-amino acids.
More than 700 entries in the [[PDB]] contain one or more D amino acids. Although D-amino acids are rare in nature, they do occur (see [http://en.wikipedia.org/wiki/Chirality_(chemistry)#D-Amino_Acid_Natural_Abundance natural abundance in Wikipedia]). D-alanine (DAL) occurs in over 100 entries in the [[PDB]], D-leucine (DLE), D-phenylalanine (DPN), D-serine (DSN), and D-valine (DVA) each occur in over 50. Some of these entries are chemically synthesized, not naturally occuring. Gramicidin is a naturally occuring antibiotic containing several D-amino acids, e.g. [[1nrm]]. [[4glu]] is the 102 amino acid chain of VEGF-A synthesized entirely from D-amino acids.


==Unusual Amino Acids==
==Unusual Amino Acids==


There are at least two non-standard amino acids that are [http://en.wikipedia.org/wiki/Genetic_code genetically encoded], discussed below. In 2014, the [[PDB]] is in the process of adding these two to make 22 standard amino acids.
There are at least two historically &quot;non-standard&quot; amino acids now recognized to be [http://en.wikipedia.org/wiki/Genetic_code genetically encoded], discussed below. In 2014, the [[PDB]] began including these two to make 22 standard amino acids<ref name="pdb22">[https://www.wwpdb.org/news/news?year=2014#5764490799cccf749a90cddf Announcement: Standardization of Amino Acid Nomenclature], World Wide Protein Data Bank News, January 8, 2014.</ref>.


===Selenocysteine: the 21st amino acid===
===Selenocysteine: the 21st amino acid===


Rare proteins in all domains of life include '''[[Selenocysteine|selenocysteine (Sec, U)]]''', designated the 21st amino acid<ref name='21st'>PMID: 11028985</ref>. Over [http://www.rcsb.org/pdb/ligand/ligandsummary.do?hetId=SEC 38 entries] in the [[PDB]] include selenocysteine, identified as SEC.
Rare proteins in all domains of life include '''[[Selenocysteine|selenocysteine (Sec, U)]]''', designated the 21st amino acid<ref name='21st'>PMID: 11028985</ref>. In 2024, over 90 entries in the [[PDB]] include selenocysteine, identified as SEC.


===Pyrrolysine: the 22nd amino acid===
===Pyrrolysine: the 22nd amino acid===


'''[[Pyrrolysine|pyrrolysine (Pyl, O)]]'''<ref>See [http://en.wikipedia.org/wiki/Pyrrolysine Pyrrolysine in Wikipedia].</ref> is a genetically encoded amino acid that occurs naturally in methanogenic archaea, sometimes called the 22nd amino acid<ref>PMID: 15380192</ref>. It occurs in [http://www.rcsb.org/pdb/ligand/ligandsummary.do?hetId=PYL several entries] in the PDB.
'''[[Pyrrolysine|pyrrolysine (Pyl, O)]]'''<ref>See [http://en.wikipedia.org/wiki/Pyrrolysine Pyrrolysine in Wikipedia].</ref> is a genetically encoded amino acid that occurs naturally in methanogenic archaea, now designated the 22nd amino acid<ref>PMID: 15380192</ref>. It occurs in [http://www.rcsb.org/pdb/ligand/ligandsummary.do?hetId=PYL several entries] in the PDB.
See [[Pyrrolysine|more details]].
See [[Pyrrolysine|more details]].
<!--
<!--
Line 162: Line 166:
===Postranslational modifications===
===Postranslational modifications===


Many unusual amino acids are formed by enzyme-catalyzed reactions that modify a standard amino acid after it has been included in a polypeptide; these are called post-translational modifications. Among them:
Many unusual amino acids are formed by enzyme-catalyzed reactions that modify a genetically encoded standard amino acid after it has been included in a polypeptide; these are called post-translational modifications. Among them:
* phosphorylation of serine, threonine, or tyrosine
* phosphorylation of serine, threonine, or tyrosine
** Example: C kinase [[1bkx]]
** Example: C kinase [[1bkx]]
Line 178: Line 182:
==Structure, Properties, Behaviors in Proteins==
==Structure, Properties, Behaviors in Proteins==


Angel Herráez has provided a [http://biomodel.uah.es/en/model1/prot/aa-intro.htm tutorial introduction to amino acid structure] in which each of the 20 amino acids may be visualized in 3D using J(S)mol.
Angel Herráez has provided a [http://biomodel.uah.es/en/model1/prot/aa-intro.htm tutorial introduction to amino acid structure] in which 20 amino acids may be visualized in 3D using JSmol.


Please help to expand the list below to include all 20 amino acids.
Please help to expand the list below to include all 22 amino acids.


===Glycine===
===Glycine===
Line 218: Line 222:
*[[Standard Residues]]
*[[Standard Residues]]
*[http://en.wikipedia.org/wiki/Amino_acids Amino Acids in Wikipedia]
*[http://en.wikipedia.org/wiki/Amino_acids Amino Acids in Wikipedia]
*[http://biochem.ncsu.edu/faculty/mattos/CrystallographyTutorial/AminoAcids.htm The Amino Acids by Paul Swartz]
*[[Basics of Protein Structure]]


==References==
==References==


<references />
<references />

Proteopedia Page Contributors and Editors (what is this?)Proteopedia Page Contributors and Editors (what is this?)

Eric Martz, Andrea Gorrell, Angel Herraez, Joel L. Sussman