User:Eric Martz/Introduction to Structural Bioinformatics 2016: Difference between revisions

From Proteopedia
Jump to navigation Jump to search
Eric Martz (talk | contribs)
Eric Martz (talk | contribs)
No edit summary
 
(80 intermediate revisions by the same user not shown)
Line 1: Line 1:
<!-- NOTES FOR 2016:
<!-- NOTES FROM 2016:
For 2017:
Some people had no catpi so give them an alternative.
2: chains in the asymmetric unit to be clear. And suggest cartoon so each chain has a different color.
 
10: Be sure to say what kind of bond you are describing, for example "hydrogen bond".
 
Show them where the link to the abstract is. Ask them to read it an use information learned there in their reports.
 
Insist that they do their own consurf jobs. The patterns in Ppda consurfDB are not very believable. Probably too many protein functions included.
 
------------
Section 2: "1-3 character"
Section 2: "1-3 character"


Line 17: Line 28:
<span style="font-size:160%">How to visualize, understand and share/present 3D protein molecular structures</span><br>
<span style="font-size:160%">How to visualize, understand and share/present 3D protein molecular structures</span><br>
<span style="font-size:130%">by Eric Martz, 2016
<span style="font-size:130%">by Eric Martz, 2016
<font color="magenta"> DRAFT UNDER REVISION</font></span>
<!--<font color="magenta"> DRAFT UNDER REVISION</font>-->
<br>
</span><br>
<span style="font-size:115%">for
<span style="font-size:115%">for
[https://www.micro.umass.edu/courses/catalog/microbio-497l-advanced-microbiology-lab-techniques Microbiology 497L: Advanced Microbiology Lab Techniques]</span>
[https://www.micro.umass.edu/courses/catalog/microbio-497l-advanced-microbiology-lab-techniques Microbiology 497L: Advanced Microbiology Lab Techniques]</span>
Line 41: Line 52:
==I. Getting Started in the [https://www.it.umass.edu/computer-classrooms/it-computer-classroom-fine-arts-center-444 FAC444]==
==I. Getting Started in the [https://www.it.umass.edu/computer-classrooms/it-computer-classroom-fine-arts-center-444 FAC444]==
<!--#Log in using your Biology Dept. account. If you brought your own '''laptop''', you are welcome to use it. (iPads will be too slow.)-->
<!--#Log in using your Biology Dept. account. If you brought your own '''laptop''', you are welcome to use it. (iPads will be too slow.)-->
#Use FIREFOX or Safari: click Spotlight ([[Image:Spotlight-icon.png]] upper right corner of screen), enter "firefox" and click on the top hit. (DO NOT USE Chrome because molecule rotation will be slow/jerky. Internet Explorer and Edge are even worse with this software.)
#Use <font color="red">'''FIREFOX''' or Safari</font>.
#In the Chrome browser, go to our syllabus: [http://497L.molviz.org 497L.MolviZ.Org].
<!--: click Spotlight ([[Image:Spotlight-icon.png]] upper right corner of screen), enter "firefox" and click on the top hit. -->
(DO NOT USE Chrome because molecule rotation will be slow/jerky. Internet Explorer and Edge are even worse with this software.)
#Go to our syllabus: [http://497L.molviz.org 497L.MolviZ.Org].
#Now you can see this document in your browser. Go to '''[http://atlas.molviz.org Atlas.MolviZ.Org]'''.
#Now you can see this document in your browser. Go to '''[http://atlas.molviz.org Atlas.MolviZ.Org]'''.
#In the Atlas, choose any molecule deemed ''Straightforward'' and click on the link to '''FirstGlance'''. After a minute or so to load, you should see a rotating molecule.
#In the Atlas, choose any molecule deemed ''Straightforward'' and click on the link to '''FirstGlance'''. After a minute or so to load, you should see a rotating molecule.
Line 55: Line 68:
3. Learn how ''FirstGlance in Jmol'' makes it easy to see structure-function relationships in the protein you chose.
3. Learn how ''FirstGlance in Jmol'' makes it easy to see structure-function relationships in the protein you chose.


4. Write a report including snapshots of your protein that illustrate your answers to the questions below. (Your report will be a Powerpoint file emailed to emartz@microbio.umass.edu. You will not present your report in class.)
4. Write a report including snapshots of your protein that illustrate your answers to the questions below. (Your report will be in the form of [http://slides.google.com Google Slides] emailed to emartz@microbio.umass.edu. You will not present your report in class.)


==III. Protein Structure and Structural Bioinformatics==
==III. Protein Structure and Structural Bioinformatics==
Line 68: Line 81:


|}
|}
{| align="right"
|-
|
[https://docs.google.com/presentation/d/1lS59hde5F6ur9VCBO1_eHwXbhQcUHByPPVi12w6PSRM/edit?usp=sharing '''SLIDESHOW''']
<imagemap>
Image:Protein-structure-4-levels-I-flat.png|300 px|
default [http://proteopedia.org/wiki/images/f/f3/Protein-structure-4-levels-I-flat.png]
</imagemap>
<hr>
|-
|
<imagemap>
Image:Protein-structure-4-levels-III-flat.png|300 px|
default [http://proteopedia.org/wiki/images/6/6e/Protein-structure-4-levels-III-flat.png]
</imagemap>
|}
:<span style="font-size:130%">1. [[Amino acid]] '''sequence''' + protein chain '''conformation''' = protein '''function'''.</span>
:<span style="font-size:130%">1. [[Amino acid]] '''sequence''' + protein chain '''conformation''' = protein '''function'''.</span>
::A. [http://www.umass.edu/molvis/workshop/allstruc/whycare.htm Why do we care] about protein 3D structure?
::A. [http://www.umass.edu/molvis/workshop/allstruc/whycare.htm Why do we care] about protein 3D structure?
Line 79: Line 109:


::A. [http://proteopedia.org/wiki/index.php/Backbone_representations Backbone Representations]  
::A. [http://proteopedia.org/wiki/index.php/Backbone_representations Backbone Representations]  
::B. [http://firstglance.jmol.org/fg.htm?mol=1pgb Small Protein in FirstGlance] (use the Views Tab: Vines, Cartoon)
::B. [http://firstglance.jmol.org/fg.htm?mol=4d7b Small Protein in FirstGlance] (use the Views Tab: Vines, Cartoon)




Line 89: Line 119:
::::*[[Resolution]] reflects the degree of order or disorder in the crystal.
::::*[[Resolution]] reflects the degree of order or disorder in the crystal.
::::*X-ray crystallography gives no models for [[Intrinsically Disordered Protein|intrinsically unstructured]] loops or molecules.
::::*X-ray crystallography gives no models for [[Intrinsically Disordered Protein|intrinsically unstructured]] loops or molecules.
:::*[[Nuclear magnetic resonance]] (NMR) in aqueous solution, 11%.
:::*[[Nuclear magnetic resonance]] (NMR) in aqueous solution, 9%.
::::*NMR is limited to small proteins (30 kD or smaller; median NMR in PDB is 10K; median X-ray is 50K).
::::*NMR is limited to small proteins (30 kD or smaller; median NMR in PDB is 10K; median X-ray is 50K).
::::*Result is an ensemble of models consistent with the data. Examples: [[2bbn]]
::::*Result is an ensemble of models consistent with the data. Examples: [[2bbn]]
:::*High resolution cryo-electron microscopy, 0.5%.
:::*High resolution cryo-electron microscopy, 0.8%.


::B. These methods are difficult and expensive. Less than 10% of proteins have known structure.
::B. These methods are difficult and expensive. Less than 10% of proteins have known structure.
Line 104: Line 134:


::E. Crystallographers publish the [[asymmetric unit]] of the crystal. It may be identical with the [[biological unit]] (the functional form of the molecule), or it may be only part of the biological unit, or it may contain multiple copies of the biological unit. See [[Biological unit#Examples|examples]].
::E. Crystallographers publish the [[asymmetric unit]] of the crystal. It may be identical with the [[biological unit]] (the functional form of the molecule), or it may be only part of the biological unit, or it may contain multiple copies of the biological unit. See [[Biological unit#Examples|examples]].
:::Interchain contacts that occur in the asymmetric unit that are absent in the biological unit are an artifact of crystallization, termed [[crystal contacts]].
:::Interchain contacts that occur in the asymmetric unit, which are absent in the biological unit, are an artifact of crystallization, termed [[crystal contacts]].


==IV. Choose a Molecule to Explore==
==IV. Choose a Molecule to Explore==
*Choose a molecule to use for your report.
*Choose a molecule to use for your report.
**Each student should choose a ''different'' molecule.
**Each student should choose a ''different'' molecule.
**Be sure to note the '''4-character PDB code''' of the molecule you choose. The PDB code makes it easy to retrieve the molecule and information about it.
**<u>Report the PDB code you chose to the instructor</u> to make sure it is not already taken.
**It must have protein.
**It must have protein.
**It will be more interesting if it contains some non-protein: DNA, RNA or ligand.
**It will be more interesting if it contains some non-protein: ligand, or DNA or RNA.
**X-ray results should have resolution of 3 &Aring; or better.
**X-ray results should have resolution of 3 &Aring; or better.
*Be sure to note the '''4-character PDB code''' of the molecule you choose. The PDB code makes it easy to retrieve the molecule and information about it. Here are some ways to find a protein with known structure:
*Here are some ways to find a protein with known structure:
# Atlas of Macromolecules ([http://atlas.molviz.org Atlas.MolviZ.Org]). Choose a "Straightforward" or "Challenging" (not "Enormous") molecule that has protein and ligand.
# '''Recommended:''' <span style="font-size:150%;">Atlas of Macromolecules ([http://atlas.molviz.org Atlas.MolviZ.Org]).</span> Choose a "Straightforward" or "Challenging" (not "Enormous") molecule that has protein and ligand.
# [http://www.pdb.org/pdb/motm.do Structural View of Biology] at the PDB. To get a PDB code, scroll to the bottom of an article and look for the boxes ''Discussed Structures''.
# [http://pdb101.rcsb.org/motm/motm-by-category Molecule of the Month] at the PDB. Look for PDB codes in the article, and use [http://firstglance.jmol.org FirstGlance] to view them.
# [http://www.pdb.org/pdb/101/motm_archive.do?sort=category#motm Molecule of the Month] at the PDB. To get a PDB code, scroll to the bottom of an article and look for the boxes ''Discussed Structures''.
# [http://proteopedia.org/wiki/index.php/Topic_pages Topic Pages] in Proteopedia, or its [http://proteopedia.org/wiki/index.php/Table_of_Contents Table of Contents].  
# [http://proteopedia.org/wiki/index.php/Topic_pages Topic Pages] in Proteopedia, or its [http://proteopedia.org/wiki/index.php/Table_of_Contents Table of Contents].  
# Random PDB Entry in Proteopedia (see ''Random'' at top left of this page in the ''navigation'' box).
# Random PDB Entry in Proteopedia (see ''Random'' at top left of this page in the ''navigation'' box).
# Search by molecule name or amino acid sequence at [http://www.pdb.org www.pdb.org], but remember that less than 10% of proteins have known structure.
# Search by molecule name or amino acid sequence at [http://www.pdb.org www.pdb.org], but remember that less than 10% of proteins have known structure. See also [[Practical Guide to Homology Modeling]] which includes instructions for finding empirical 3D models for a protein sequence.


==V. Explore Your Molecule==
==V. Explore Your Molecule==
Line 125: Line 156:


The main tool we will use is '''FirstGlance in Jmol''': [http://firstglance.jmol.org FirstGlance.Jmol.Org]. (To google it later, use the single word (no space) '''firstglance'''.)
The main tool we will use is '''FirstGlance in Jmol''': [http://firstglance.jmol.org FirstGlance.Jmol.Org]. (To google it later, use the single word (no space) '''firstglance'''.)
* Use '''Firefox''' (or Safari). The molecule will rotate slowly with jerky jumps in other browsers. Internet Explorer and Edge are especially bad for Jmol.


*Enter your 4-character PDB code at FirstGlance, and you should see the molecule you have chosen.
*Enter your 4-character PDB code at FirstGlance, and you should see the molecule you have chosen.
Line 131: Line 164:


*In the ''Views'' tab, there are 10 links at the top that show you different aspects of the molecule. Try them all, as well as any of the other tools in FirstGlance that interest you.
*In the ''Views'' tab, there are 10 links at the top that show you different aspects of the molecule. Try them all, as well as any of the other tools in FirstGlance that interest you.
 
<ul><ul>
 
FirstGlance does NOT use '''[[Java]]''' unless you tell it to. Using Java will make larger proteins load faster, rotate more smoothly, and change views quicker. Use '''Firefox or Internet Explorer''' (or Safari) for Java. Chrome and Edge do not support Java. In your Java-compatible browser, display a molecule in FirstGlance, and then click on the ''Preferences'' tab in FirstGlance. See [[Installing and enabling Java]].
:FirstGlance does NOT use '''Java''' unless you tell it to. Larger proteins will be slow to load and rotate, and to change the view, unless you use Java. Use ''Safari (or Firefox) for Java''  on Macs because Chrome does not support Java on Macs. On Windows, Internet Explorer is good for Java. In your Java-compatible browser, display a molecule in FirstGlance, and then click on the ''Preferences'' tab in FirstGlance. Java is installed and working in the BCRC. Simply give it permission when asked (several times).
</ul></ul>
 
Here are two Views in  FirstGlance that will be used in your report:
Here are two Views in  FirstGlance that will be used in your Powerpoint report:


====A. Hydrophobic/Polar====
====A. Hydrophobic/Polar====
Line 141: Line 173:
*Hydrophobic surface patches may be buried in chain-to-chain contacts -- check the [[#Section_6:_Biological_Unit|biological unit]] (example: [[Lac_repressor#Structure_of_the_lac_repressor|lac repressor homodimer]]).
*Hydrophobic surface patches may be buried in chain-to-chain contacts -- check the [[#Section_6:_Biological_Unit|biological unit]] (example: [[Lac_repressor#Structure_of_the_lac_repressor|lac repressor homodimer]]).
*Large, protruding hydrophobic surface areas (>25 &Aring; in their smaller diameter) may indicate transmembrane proteins (insoluble). Examples:
*Large, protruding hydrophobic surface areas (>25 &Aring; in their smaller diameter) may indicate transmembrane proteins (insoluble). Examples:
**[http://firstglance.jmol.org/fg.htm?mol=7ahl 7ahl]
**showing [http://opm.phar.umich.edu/protein.php?pdbid=7ahl bilayer boundaries] (click on "Jmol"; ligand toggles boundaries).
**[http://firstglance.jmol.org/fg.htm?mol=1bl8 1bl8]
**[http://firstglance.jmol.org/fg.htm?mol=1bl8 1bl8]
**showing [http://opm.phar.umich.edu/protein.php?pdbid=1r3j bilayer boundaries] (click on "Jmol"; ligand toggles boundaries).
**[[Gramicidin Channel in Lipid Bilayer]].
**[[Gramicidin Channel in Lipid Bilayer]].


Line 148: Line 181:
Most proteins have roughly equal numbers of positive and negative charges intermixed on their surfaces. Surface patches of exclusively positive charge often bind nucleic acids (negatively charged because of their phosphates). For example, examine the protein surface charges where the gal4 transcriptional regulator binds DNA ([http://firstglance.jmol.org/fg.htm?mol=1d66 1d66]).
Most proteins have roughly equal numbers of positive and negative charges intermixed on their surfaces. Surface patches of exclusively positive charge often bind nucleic acids (negatively charged because of their phosphates). For example, examine the protein surface charges where the gal4 transcriptional regulator binds DNA ([http://firstglance.jmol.org/fg.htm?mol=1d66 1d66]).


==VI. Powerpoint Report==
==VI. Report Slides==
Save your report with the filename '''yourLastName-565.pptx''', for example <font color='red'><b>sandler-565.pptx</b></font>. When completed, your Powerpoint report is to be emailed to '''emartz@microbio.umass.edu''' for grading. You will ''not'' be asked to present your report in class.
Answer the questions below in slides, using [http://slides.google.com '''Google Slides'''].
 
*Name your report '''YourLastName-497L''', for example <font color='red'><b>Sandler-497L</b></font>. ''If the name of your report does not begin with your family name, you will lose 2 points.'' When completed, email a link to your report to '''emartz@microbio.umass.edu''' for grading.
::(While viewing your report slides, click the blue <i>Share</i> button at the upper right, then <i>Get shareable link</i>, and paste the link into the email.)
:You will ''not'' be asked to present your report in class.
 
*Each slide MUST be labeled at the top with its section number, e.g. ''Section 1''.
 
*Each Section below may be answered in a single slide, or multiple slides. For example, suppose you want to show two snapshots for ''Section 3'', and make separate comments. You may choose to use two slides, labeled ''Section 3A'' and ''Section 3B''.


Each slide MUST be labeled at the top with its section number, e.g. ''Section 1''.
*Additional work beyond the minimum required may earn extra credit.


Each Section below may be answered in a single slide, or multiple slides. For example, suppose you want to show two snapshots for ''Section 3'', and make separate comments. You may choose to use two slides, labeled ''Section 3A'' and ''Section 3B''.
*<font color="red">Due date: midnight Thursday March 3.</font>


<font color='e000e0'>This is not a test. It is to help you learn by doing. Ask for help!</font>
<font color='e000e0'>This is not a test. It is to help you learn by doing. Ask for help!</font>
<br>
<br><!--http://bioinformatics.org/molvis/ppt/martz-565-2014rev2.ppt-->
<span style="font-size:140%;background:#c0ffc0;">[http://bioinformatics.org/molvis/ppt/martz-565-2014rev2.ppt Sample Completed Powerpoint Assignment]</span> (You may download it, rename the file, and use it as a template.)
<span style="font-size:140%;background:#c0ffc0;">[https://docs.google.com/presentation/d/1wW1fJfRBfQGXjoS989O5Wz2JuBXVU8LPMlhCq-dadnA/edit?usp=sharing Example of a Completed Report]</span> (You may import these slides into a new presentation of your own, and use them as a templates, putting in your own content.)


===Section 1: Identity===
===Section 1: Identity===
Line 165: Line 206:
*Your [[PDB identification code]].
*Your [[PDB identification code]].
*The name of your molecule.
*The name of your molecule.
*The function of your molecule.
*The function of your molecule (briefly, one sentence).
*The experimental method and [[resolution]] (or number of models for NMR). <font color='gray'>Available in the '''molecule information tab''' in FirstGlance.</font>
*The experimental method and [[resolution]] (or number of models for NMR). <font color='gray'>Available in the '''molecule information tab''' in FirstGlance.</font>


Line 177: Line 218:
===Section 2: Composition===
===Section 2: Composition===
*The number of
*The number of
**Protein chains
**Protein chains ([[Chain|What does "chain" mean?]])
**DNA chains
**DNA chains
**RNA chains
**RNA chains
***<font color='gray'>Available in the '''molecule information tab''' in FirstGlance: Chain Details. You can identify a residue in any chain by touching it with the mouse (spinning off!). DNA residues are DA, DG, DC, DT while RNA residues are A, G, C and U.</font>
:<font color='gray'>Available in the '''molecule information tab''' in FirstGlance: Chain Details. You can identify a residue in any chain by touching it with the mouse (spinning off!). DNA residues are DA, DG, DC, DT while RNA residues are A, G, C and U.</font>
*Ligands and Non-Standard Residues: Give the 3-letter abbreviations and '''full names''' for all ligands and non-standard residues. If none, so state. ([[Standard residues]])
*Ligands and Non-Standard Residues: Give the one to three-letter abbreviations and '''full names''' for all ligands and non-standard residues. If none, so state. ([[Standard residues]])
:<font color='gray'>The ''molecule information tab'' in FirstGlance lists the 1 to 3-letter abbreviations for each ligand and non-standard residue, and their full names. Click on an abbreviation to locate that entity in the model. See also ''Composition'' in FirstGlance's Views tab. </font>
:<font color='gray'>The ''molecule information tab'' in FirstGlance lists the 1 to 3-letter abbreviations for each ligand and non-standard residue, and their full names. Click on an abbreviation to locate that entity in the model. See also ''Composition'' in FirstGlance's Views tab. </font>


===Section 3: Evolutionary Conservation===
===Section 3: Evolutionary Conservation===
See [[Introduction to Evolutionary Conservation]].
See [[Introduction to Evolutionary Conservation]]. &nbsp; (Example 4d7b: [http://consurf.tau.ac.il/results/1454623073/output_with_form.php ConSurf], [http://firstglance.jmol.org/fg.htm?mol=http%3A//bioinformatics.org/molvis/atlas/pdb/4d7b_consurf1454623073_pipe.pdb Result])


Does your molecule have a highly conserved region? If so, what is its function? If there is no highly conserved region, is there a highly variable region? Show a <font color='e000e0'><b>snapshot</b></font> illustrating a highly conserved (or variable) region.
Does your molecule have a highly conserved region? If so, what is its function? If there is no highly conserved region, is there a highly variable region? Show <font color='e000e0'><b>two snapshots</b></font> illustrating a highly conserved region, and a contrasting region.


See [[How to see conserved regions]].
See [[How to see conserved regions]].


:If Proteopedia lacks a pre-calculated ''Evolutionary Conservation'' for your molecule, and you do your own calculation at the ConSurf Server, be sure to <font color='#d00000'>include the address of the ConSurf result in your Powerpoint slide!</font>.
:If Proteopedia lacks a pre-calculated ''Evolutionary Conservation'' for your molecule, and you do your own calculation at the ConSurf Server, be sure to <font color='#d00000'>include the address of the ConSurf result in your report slide!</font>.


===Section 4: Hydrophobic/Polar===
===Section 4: Hydrophobic/Polar===
Do you think your molecule is water soluble? Support your conclusion with a <font color='e000e0'><b>snapshot</b></font>. <font color='gray'>Be sure to use the ''Hydrophobic/Polar'' view from FirstGlance in a snapshot. Optionally, you may show other views in other snapshots.</font>
Do you think your molecule is water soluble? Support your conclusion with a <font color='e000e0'><b>snapshot</b></font>. <font color='gray'>Be sure to use the ''Hydrophobic/Polar'' view from FirstGlance in a snapshot. Optionally, you may show other views in other snapshots.</font>


===Section 4A: Hydrophobic Core===
===Section 5: Hydrophobic Core===
Is there a hydrophobic core in your molecule? Support your conclusion with a <font color='e000e0'><b>snapshot</b></font>. <font color='gray'>Be sure to use the ''Hydrophobic/Polar'' view from FirstGlance and turn on the ''Slab'' button.</font>
Are there hydrophobic cores in your molecule? For soluble proteins, expect a hydrophobic core in ''[http://proteopedia.org/wiki/index.php/Domain each domain]''. Support your conclusion with a <font color='e000e0'><b>snapshot</b></font>. <font color='gray'>Be sure to use the ''Hydrophobic/Polar'' view from FirstGlance and turn on the ''Slab'' button.</font>


===Section 5: Charge===
===Section 6: Charge===


Are there any areas on the surface of your molecule with only positive (or negative) charges? Show <font color='e000e0'><b>snapshots</b></font> illustrating your conclusions.  <font color='gray'>Be sure to use the ''Charge'' view from FirstGlance in a snapshot.</font>
Are there any areas on the surface of your molecule with only positive (or negative) charges? Show <font color='e000e0'><b>snapshots</b></font> illustrating your conclusions.  <font color='gray'>Be sure to use the ''Charge'' view from FirstGlance in your snapshots.</font>


===Section 5A: Cation-Pi Interactions===
===Section 7: Cation-Pi Interactions===


Show a <font color='e000e0'><b>snapshot</b></font> of an energetically significant [[Cation-pi_interactions|cation-pi interaction]]. Include a distance monitor in your snapshot. Also paste in the report from CaPTURE confirming its energetic significance. <font color='gray'>The cation-pi interaction tool, and instructions for measuring distances, are in the ''Tools Tab''.</font>
Show a <font color='e000e0'><b>snapshot</b></font> of an energetically significant [[Cation-pi_interactions|cation-pi interaction]]. Include a distance monitor in your snapshot. Also paste in the report from CaPTURE confirming its energetic significance. <font color='gray'>The cation-pi interaction tool, and instructions for measuring distances, are in the ''Tools Tab''.</font>


===Section 6: Biological Unit===
===Section 8: Biological Unit===
In FirstGlance, in the ''molecule information tab'' click ''Biological Unit''. (It is also in the ''Resources Tab''.)
In FirstGlance, in the ''molecule information tab'' click ''Biological Unit''. (It is also in the ''Resources Tab''.)


Line 214: Line 255:
Show side-by-side <font color='e000e0'><b>two snapshots</b></font> comparing the asymmetric unit with the biological unit. The ''Cartoon'' representation in FirstGlance is best for these snapshots. Make sure to label which is which.
Show side-by-side <font color='e000e0'><b>two snapshots</b></font> comparing the asymmetric unit with the biological unit. The ''Cartoon'' representation in FirstGlance is best for these snapshots. Make sure to label which is which.


===Section 7: Animation from Polyview-3D===
===Section 9: Animation from Polyview-3D===
Minimal steps to make an animation:
Minimal steps to make an animation:
<!--
<!--
Line 232: Line 273:
The above steps are the minimum for an animation that avoids putting a heavy load on the server. Feel free to try other options, but while the class is in session, please don't make a large (>300 pixel) animation, or increase the angle range, or decrease the angle step size. Otherwise, the server may get overloaded and take a very long time to produce results. <font color="gray">Optional: After class is over, feel free to submit more demanding jobs. If you highlight specific residues, please explain why. </font>
The above steps are the minimum for an animation that avoids putting a heavy load on the server. Feel free to try other options, but while the class is in session, please don't make a large (>300 pixel) animation, or increase the angle range, or decrease the angle step size. Otherwise, the server may get overloaded and take a very long time to produce results. <font color="gray">Optional: After class is over, feel free to submit more demanding jobs. If you highlight specific residues, please explain why. </font>


In Powerpoint, animations move only when the slides are projected (full-screen).  
<!--In Powerpoint, animations move only when the slides are projected (full-screen).
 
'''Method 1:''' When your Polyview-3D job is done, you will see the animation in the web browser. Simply drag the animation directly from the Polyview-3D web page and drop it into a  slide. If the result does not animate when you project the slide, try the following method.


'''Windows Powerpoint''': Simply drag the animation directly from the Polyview-3D web page and drop it into a Powerpoint slide. If the result does not animate when you project the slide, try the instructions for Mac below.
'''Method 2:''': -->


'''Mac Powerpoint''': The method below produces a slide that will animate continuously. Other methods we have tried do not.
After your animation is completed and appears in the PolyView-3D web page:
#Control-Click on the animation in the Polyview-3D web page, and select Save Image As ...
#Right-click (Mac: control-click, or trackpad 2-finger click) on the animation in the Polyview-3D web page, and select Save Image As ...
#Save the image to the Desktop.
#Save the image to the Desktop.
#Drag the image file (filename ending in .gif) from the Desktop and drop it into a Powerpoint slide.
#Drag the image file (filename ending in .gif) from the Desktop and drop it into a slide.
#As stated above, the animation will run only when the saved .ppt file is projected (full screen).
#In Google Slides, the animation should move immediately.


<span style="font-size:140%;background:#c0ffc0;">[http://bioinformatics.org/firstglance/fgij/ppt/polyview-3d-examples.ppt Powerpoint Slides with Polyview-3D Animations]</span> (These slides are only to show you what is possible. These are not in your assignment.)
<!--http://bioinformatics.org/firstglance/fgij/ppt/polyview-3d-examples.ppt-->
<span style="font-size:140%;background:#c0ffc0;">[https://docs.google.com/presentation/d/1PAzssvqDRBKFIl9DWvWpXXxM--gtIIdT0p8fMCXfE10/edit?usp=sharing Examples of Slides with Polyview-3D Animations]</span> (These slides are only to show you what is possible. These are not in your assignment.)




----
----


===Section 8 - Contacts/Non-covalent Bonds===
===Section 10 - Contacts/Non-covalent Bonds===
Example: [http://firstglance.jmol.org/fg.htm?mol=4d7b 4d7b].
#Click ''Contacts'' in the ''Tools Tab'' in FirstGlance.
#Click ''Contacts'' in the ''Tools Tab'' in FirstGlance.
#Change target selection to ''Residues/Groups''.
#Change target selection to ''Residues/Groups''.
Line 260: Line 305:
#Double click the hydrogen bond donor and acceptor atoms to insert a distance monitor.
#Double click the hydrogen bond donor and acceptor atoms to insert a distance monitor.


Describe the moiety you selected as a target. Include a <font color='e000e0'><b>snapshot</b></font> showing a hydrogen bond.
Describe the moiety you selected as a target. Include a <font color='e000e0'><b>snapshot</b></font> showing '''exactly one hydrogen bond'''. Be sure to identify the two entities (amino acids, nucleotides, ligand) by name, chain, and sequence number.  I need enough detail to be able to reproduce what you are reporting.
 
===Section 11 - How Structure Supports Function===
Write a brief description ''in your own words'' (avoid plagiarism!) of how the structure of this protein supports its function. Doing some online research will strengthen your description.
 
Include links to supporting references. Wikipedia can be cited, but authoritative sources, such as peer-reviewed scientific journal articles or government websites, will have more weight.
 
Your description should be at least 75 words. More work, if well done, will earn more credit.
 
Section 11 in the [https://docs.google.com/presentation/d/1wW1fJfRBfQGXjoS989O5Wz2JuBXVU8LPMlhCq-dadnA/edit?usp=sharing Sample Report] is longer than the minimum required, but illustrates the sort of thing that could be done if you can spend the time.


==VII. See Also==
==VII. See Also==

Latest revision as of 03:15, 28 March 2016

How to visualize, understand and share/present 3D protein molecular structures
by Eric Martz, 2016
for Microbiology 497L: Advanced Microbiology Lab Techniques
University of Massachusetts, Amherst MA USA
Get here via Moodle or with 497L.MolviZ.Org


I. Getting Started in the FAC444I. Getting Started in the FAC444

  1. Use FIREFOX or Safari.

(DO NOT USE Chrome because molecule rotation will be slow/jerky. Internet Explorer and Edge are even worse with this software.)

  1. Go to our syllabus: 497L.MolviZ.Org.
  2. Now you can see this document in your browser. Go to Atlas.MolviZ.Org.
  3. In the Atlas, choose any molecule deemed Straightforward and click on the link to FirstGlance. After a minute or so to load, you should see a rotating molecule.
  4. If you have any difficulty or the molecule does not appear, ask for help!

II. GoalsII. Goals

1. Review principles of protein 3D structure.

2. Choose an existing experimentally-determined 3D protein structure model to investigate.

3. Learn how FirstGlance in Jmol makes it easy to see structure-function relationships in the protein you chose.

4. Write a report including snapshots of your protein that illustrate your answers to the questions below. (Your report will be in the form of Google Slides emailed to emartz@microbio.umass.edu. You will not present your report in class.)

III. Protein Structure and Structural BioinformaticsIII. Protein Structure and Structural Bioinformatics

SLIDESHOW


1. Amino acid sequence + protein chain conformation = protein function.
A. Why do we care about protein 3D structure?
B. Conformation can be a stable fold or intrinsically unstructured. Both commonly exist in the same protein molecule.
C. Conformation is specified by sequence.
  • Folded domains fold spontaneously (Anfinson, 1960's[1]), or with the help of chaperonins.
  • The denaturation (unfolding) of a folded protein domain destroys its function.


2. Backbone Representation.
A. Backbone Representations
B. Small Protein in FirstGlance (use the Views Tab: Vines, Cartoon)


3. Structure Knowledge.
A. Although sequence specifies fold, scientists cannot yet predict the fold from the sequence. Therefore, fold must be determined by empirical (experimental) methods. The most common methods for determining the 3D structure of a protein molecule are:
  • Result is a single model representing the average of the molecules in the crystal.
  • Resolution reflects the degree of order or disorder in the crystal.
  • X-ray crystallography gives no models for intrinsically unstructured loops or molecules.
  • NMR is limited to small proteins (30 kD or smaller; median NMR in PDB is 10K; median X-ray is 50K).
  • Result is an ensemble of models consistent with the data. Examples: 2bbn
  • High resolution cryo-electron microscopy, 0.8%.
B. These methods are difficult and expensive. Less than 10% of proteins have known structure.
C. All published, empirically determined 3D macromolecular structure models are available from the Protein Data Bank (PDB; pdb.org; About the PDB).
D. Each model has a unique, 4-character accession code called a PDB identification code, for example
E. Crystallographers publish the asymmetric unit of the crystal. It may be identical with the biological unit (the functional form of the molecule), or it may be only part of the biological unit, or it may contain multiple copies of the biological unit. See examples.
Interchain contacts that occur in the asymmetric unit, which are absent in the biological unit, are an artifact of crystallization, termed crystal contacts.

IV. Choose a Molecule to ExploreIV. Choose a Molecule to Explore

  • Choose a molecule to use for your report.
    • Each student should choose a different molecule.
    • Be sure to note the 4-character PDB code of the molecule you choose. The PDB code makes it easy to retrieve the molecule and information about it.
    • Report the PDB code you chose to the instructor to make sure it is not already taken.
    • It must have protein.
    • It will be more interesting if it contains some non-protein: ligand, or DNA or RNA.
    • X-ray results should have resolution of 3 Å or better.
  • Here are some ways to find a protein with known structure:
  1. Recommended: Atlas of Macromolecules (Atlas.MolviZ.Org). Choose a "Straightforward" or "Challenging" (not "Enormous") molecule that has protein and ligand.
  2. Molecule of the Month at the PDB. Look for PDB codes in the article, and use FirstGlance to view them.
  3. Topic Pages in Proteopedia, or its Table of Contents.
  4. Random PDB Entry in Proteopedia (see Random at top left of this page in the navigation box).
  5. Search by molecule name or amino acid sequence at www.pdb.org, but remember that less than 10% of proteins have known structure. See also Practical Guide to Homology Modeling which includes instructions for finding empirical 3D models for a protein sequence.

V. Explore Your MoleculeV. Explore Your Molecule

FirstGlance in JmolFirstGlance in Jmol

The main tool we will use is FirstGlance in Jmol: FirstGlance.Jmol.Org. (To google it later, use the single word (no space) firstglance.)

  • Use Firefox (or Safari). The molecule will rotate slowly with jerky jumps in other browsers. Internet Explorer and Edge are especially bad for Jmol.
  • Enter your 4-character PDB code at FirstGlance, and you should see the molecule you have chosen.
  • Get familiar with what the molecule information tab tells you. Ask about anything you don't understand.
  • In the Views tab, there are 10 links at the top that show you different aspects of the molecule. Try them all, as well as any of the other tools in FirstGlance that interest you.
      FirstGlance does NOT use Java unless you tell it to. Using Java will make larger proteins load faster, rotate more smoothly, and change views quicker. Use Firefox or Internet Explorer (or Safari) for Java. Chrome and Edge do not support Java. In your Java-compatible browser, display a molecule in FirstGlance, and then click on the Preferences tab in FirstGlance. See Installing and enabling Java.

Here are two Views in FirstGlance that will be used in your report:

A. Hydrophobic/PolarA. Hydrophobic/Polar

  • Water-soluble proteins have polar/charged amino acids nearly everywhere on their surfaces (Examples: small 2hhd, large 1igy). Patches of hydrophobic amino acids on the surfaces of soluble proteins are usually less than ~10 å in their smaller diameter, and usually recessed.
  • Hydrophobic surface patches may be buried in chain-to-chain contacts -- check the biological unit (example: lac repressor homodimer).
  • Large, protruding hydrophobic surface areas (>25 Å in their smaller diameter) may indicate transmembrane proteins (insoluble). Examples:

B. ChargeB. Charge

Most proteins have roughly equal numbers of positive and negative charges intermixed on their surfaces. Surface patches of exclusively positive charge often bind nucleic acids (negatively charged because of their phosphates). For example, examine the protein surface charges where the gal4 transcriptional regulator binds DNA (1d66).

VI. Report SlidesVI. Report Slides

Answer the questions below in slides, using Google Slides.

  • Name your report YourLastName-497L, for example Sandler-497L. If the name of your report does not begin with your family name, you will lose 2 points. When completed, email a link to your report to emartz@microbio.umass.edu for grading.
(While viewing your report slides, click the blue Share button at the upper right, then Get shareable link, and paste the link into the email.)
You will not be asked to present your report in class.
  • Each slide MUST be labeled at the top with its section number, e.g. Section 1.
  • Each Section below may be answered in a single slide, or multiple slides. For example, suppose you want to show two snapshots for Section 3, and make separate comments. You may choose to use two slides, labeled Section 3A and Section 3B.
  • Additional work beyond the minimum required may earn extra credit.
  • Due date: midnight Thursday March 3.

This is not a test. It is to help you learn by doing. Ask for help!
Example of a Completed Report (You may import these slides into a new presentation of your own, and use them as a templates, putting in your own content.)

Section 1: IdentitySection 1: Identity

  • The label Section 1 at the top (and so forth for every slide).
  • Your name.
  • Your major; grad students, give the name of your grad program (Micro, MCB, etc.) and whose lab you work in.
  • Your PDB identification code.
  • The name of your molecule.
  • The function of your molecule (briefly, one sentence).
  • The experimental method and resolution (or number of models for NMR). Available in the molecule information tab in FirstGlance.
  • A snapshot of your molecule.

How to make a snapshot,
is also linked at the bottom left in FirstGlance.

Section 2: CompositionSection 2: Composition

Available in the molecule information tab in FirstGlance: Chain Details. You can identify a residue in any chain by touching it with the mouse (spinning off!). DNA residues are DA, DG, DC, DT while RNA residues are A, G, C and U.
  • Ligands and Non-Standard Residues: Give the one to three-letter abbreviations and full names for all ligands and non-standard residues. If none, so state. (Standard residues)
The molecule information tab in FirstGlance lists the 1 to 3-letter abbreviations for each ligand and non-standard residue, and their full names. Click on an abbreviation to locate that entity in the model. See also Composition in FirstGlance's Views tab.

Section 3: Evolutionary ConservationSection 3: Evolutionary Conservation

See Introduction to Evolutionary Conservation.   (Example 4d7b: ConSurf, Result)

Does your molecule have a highly conserved region? If so, what is its function? If there is no highly conserved region, is there a highly variable region? Show two snapshots illustrating a highly conserved region, and a contrasting region.

See How to see conserved regions.

If Proteopedia lacks a pre-calculated Evolutionary Conservation for your molecule, and you do your own calculation at the ConSurf Server, be sure to include the address of the ConSurf result in your report slide!.

Section 4: Hydrophobic/PolarSection 4: Hydrophobic/Polar

Do you think your molecule is water soluble? Support your conclusion with a snapshot. Be sure to use the Hydrophobic/Polar view from FirstGlance in a snapshot. Optionally, you may show other views in other snapshots.

Section 5: Hydrophobic CoreSection 5: Hydrophobic Core

Are there hydrophobic cores in your molecule? For soluble proteins, expect a hydrophobic core in each domain. Support your conclusion with a snapshot. Be sure to use the Hydrophobic/Polar view from FirstGlance and turn on the Slab button.

Section 6: ChargeSection 6: Charge

Are there any areas on the surface of your molecule with only positive (or negative) charges? Show snapshots illustrating your conclusions. Be sure to use the Charge view from FirstGlance in your snapshots.

Section 7: Cation-Pi InteractionsSection 7: Cation-Pi Interactions

Show a snapshot of an energetically significant cation-pi interaction. Include a distance monitor in your snapshot. Also paste in the report from CaPTURE confirming its energetic significance. The cation-pi interaction tool, and instructions for measuring distances, are in the Tools Tab.

Section 8: Biological UnitSection 8: Biological Unit

In FirstGlance, in the molecule information tab click Biological Unit. (It is also in the Resources Tab.)

How many total polymer chains (protein + DNA + RNA) are in the asymmetric unit? The Biological unit?

Show side-by-side two snapshots comparing the asymmetric unit with the biological unit. The Cartoon representation in FirstGlance is best for these snapshots. Make sure to label which is which.

Section 9: Animation from Polyview-3DSection 9: Animation from Polyview-3D

Minimal steps to make an animation:

  1. Go to Polyview-3D.
  2. Enter your PDB code in the PDB ID slot near the top. (If the slot is not visible, open the section Source of Structural Data.)
  3. Change "Type of request" from "Single slide" to "Animation". It is under the Image Settings section near the bottom.
  4. Click any "Preview" link.
  5. Optional: If you want to modify the orientation or zoom of the molecule, click on View by Jmol / Set orientation under Quick links at the upper left of the page. Use the mouse to rotate and zoom in Jmol. Then click the Set and close button.
  6. Optional: If you want to change the colors, hide portions of the molecule, emphasize certain residues, etc. feel free to try out these options in the form, using Preview to check your results.
  7. In the "Animation Settings" section at the bottom of the page, set Delay to 10/100.
  8. Change "Angle step" to 5 degrees.
  9. Check "Rocking".
  10. Change "angle range" for rocking to 30 degrees.
  11. Click "Submit".

The above steps are the minimum for an animation that avoids putting a heavy load on the server. Feel free to try other options, but while the class is in session, please don't make a large (>300 pixel) animation, or increase the angle range, or decrease the angle step size. Otherwise, the server may get overloaded and take a very long time to produce results. Optional: After class is over, feel free to submit more demanding jobs. If you highlight specific residues, please explain why.


After your animation is completed and appears in the PolyView-3D web page:

  1. Right-click (Mac: control-click, or trackpad 2-finger click) on the animation in the Polyview-3D web page, and select Save Image As ...
  2. Save the image to the Desktop.
  3. Drag the image file (filename ending in .gif) from the Desktop and drop it into a slide.
  4. In Google Slides, the animation should move immediately.

Examples of Slides with Polyview-3D Animations (These slides are only to show you what is possible. These are not in your assignment.)



Section 10 - Contacts/Non-covalent BondsSection 10 - Contacts/Non-covalent Bonds

Example: 4d7b.

  1. Click Contacts in the Tools Tab in FirstGlance.
  2. Change target selection to Residues/Groups.
  3. Click on something small to select it as a "target", such as a ligand, or a single amino acid. Choose an amino acid with an uncharged polar side chain, such as Ser, Thr, Asn, Gln, Tyr, His.
  4. Click the link to Show atoms contacting target.
  5. Click Center contacts.
  6. Uncheck Backbones.
  7. Click the 4th thumbnail image to display the contacts as balls and sticks colored by element. The element color key is at the bottom of the Contacts help panel in FirstGlance.
  8. Zoom in (and click Return to Contacts if necessary).
  9. Uncheck all categories of non-covalent bonds.
  10. Check hydrogen-bonded non-water. (Review hydrogen bonds.)
  11. Double click the hydrogen bond donor and acceptor atoms to insert a distance monitor.

Describe the moiety you selected as a target. Include a snapshot showing exactly one hydrogen bond. Be sure to identify the two entities (amino acids, nucleotides, ligand) by name, chain, and sequence number. I need enough detail to be able to reproduce what you are reporting.

Section 11 - How Structure Supports FunctionSection 11 - How Structure Supports Function

Write a brief description in your own words (avoid plagiarism!) of how the structure of this protein supports its function. Doing some online research will strengthen your description.

Include links to supporting references. Wikipedia can be cited, but authoritative sources, such as peer-reviewed scientific journal articles or government websites, will have more weight.

Your description should be at least 75 words. More work, if well done, will earn more credit.

Section 11 in the Sample Report is longer than the minimum required, but illustrates the sort of thing that could be done if you can spend the time.

VII. See AlsoVII. See Also

VIII. Notes and ReferencesVIII. Notes and References

  1. For a brief overview of Anfinson's protein folding experiments in the 1960's, see the first paragraph at Intrinsically Disordered Protein.