User:Karsten Theis/Sandbox 2: Difference between revisions

Karsten Theis (talk | contribs)
__NOTOC__
==Identification of Unknown Protein 2QRU==
Proteopedia is the 3D encyclopedia of proteins and other biological molecules. This is the directory of help pages for visitors and for registered users.


{|
<StructureSection load='2qru' size='340' side='left' caption='The protein 2QRU is believed to be a hydrolase, functioning best in a near neutral pH environment which could potentially be in muscle cells or in neuronal cells.' scene=''>
|-
|style="width: 50%;" |


===<i class="fas fa-directions"></i> Navigating Proteopedia===
</StructureSection>
Proteopedia is organized like an encyclopedia, with entries (also called pages or articles) on different topics. If you already know the topic you are interested in, you can search for relevant entries using the '''search box on the left''' side of the page. For help, go to [[Help:Searching]]. You can also browse the entries, starting with the suggestions on the [[Main Page]] or the table of contents or the structure index available in the navigation box on the left. Entries have links to other entries (if you right-click them, the current entry will stay available and the linked entry will open in a new tab). This is a way to browse entries on related topics.


|valign="top" |
== '''Introduction''' ==


===<i class="fas fa-glasses"></i> Reading and viewing entries===
=== Research Question ===
The special feature of Proteopedia entries is the interactive 3D representations of molecules viewable on most pages. They appear in a window integrated into the page (often called "Jmol window"). As you read through the text of an entry, you will find "green links". When you click on them, you stay on the same page but the interactive 3D scene changes. To learn how to get the most out of the interactive 3D scenes, reference the [[Help:Viewing pages|Viewing guide]] and the cheat sheet that comes with it. If you encounter technical terms you are unfamiliar with, try searching for them to learn more about them (the [[About Macromolecular Structure]] entry is a good start).


|-
What are the function and qualities (location and optimal conditions) of our assigned unknown protein, 2QRU?
|valign="top" |


===<i class="fas fa-user-clock"></i> Who wrote the entry when?===
=== Problem Relevance ===
The section headed "Proteopedia Page Contributors and Editors" (on the bottom of the entry) lists all authors of the entry. If you are interested to learn who wrote what and when the entry was last updated, go to the history tab. Occasionally, the discussion tab contains information about how the entry was created or suggestions for improvement.
|valign="top"|


===<i class="fas fa-id-card"></i> Becoming a registered user on Proteopedia===
The function of this protein is not known beyond its membership in the alpha/beta hydrolase superfamily. Determining its function would be helpful because it would allow the protein to be classified more precisely.
To edit pages or create new ones, you need an account. At the top right of each page, there is a link to log in or request an account. With an account, you can edit pages and create new ones. You will also have a user page to introduce yourself, and access to sandboxes where you can try out creating content.
|-
|valign="top"|
===<i class="fas fa-user-edit"></i> For authors: contributing content===
To get started, read [[Help:Getting Started in Proteopedia]], or watch the videos linked there. You can '''start editing in a sandbox''' (see [[Help:Sandboxes]]). Before reusing content from others, consult [[Proteopedia:Guidelines for Ethical Writing]] for ethical aspects, and [[Help:Editing#Citing_Literature_References]] for formatting references. The text on Proteopedia has an open license that encourages remixing with attribution, see [[Remixing]]. For some suggestions how to improve the quality of a page, see [[Proteopedia:How to Make a Page]].


|valign="top"|
=== Research Significance ===


===<i class="fas fa-cubes"></i> For authors: creating 3D scenes===
Characterizing the structure of the unknown protein could reveal its function, which could be used in many different ways. By connecting the protein’s function to symptoms and mechanisms, it could be used as a target for the treatment of various diseases.
To show a 3D scene, you need structural data (typically from the protein database PDB) and need to decide how to show it. This is described in [[Proteopedia:DIY:Scenes]]. The 3D scenes are created and previewed using the [[Scene authoring tools]]. If you want to show a structure not in the PDB, you have to upload the structure first ([[Help:Uploading molecules]]). Tips on how to make 3D scenes that have a lot of information yet are easy to understand are available at [[Proteopedia:How to Make a Scene]].


|-
=== Hypothesis ===


|valign="top"|
Based on the superfamily classification, protein 2QRU is a type of hydrolase. Testing the protein under three different pH conditions, chosen from the three most common locations of hydrolases (digestive system, neurons, and muscle cells), will reveal its activity at each pH and could therefore suggest where the protein is found and, consequently, its function.


=== <i class="fas fa-film"></i> For authors: multimedia===
=== How Experimental Data Will Answer the Hypothesis ===
You can insert static images or animations directly in the text, embed videos, and integrate different data in 3D scenes, see [[Help:Multimedia]]. Proteopedia hosts images as long as they come with the appropriate license (see [[Proteopedia:Terms of Service]]). For uploading images, see [[Proteopedia:Video_Guide#Video_5:_Uploading_an_image_or_file_and_adding_an_image_to_a_page|Video Guide 5]]. For help on formatting the image within the page, see [http://en.wikipedia.org/wiki/Help:Wiki_markup#Images Wiki markup: Images] or for even more detail, [http://en.wikipedia.org/wiki/Wikipedia:Extended_image_syntax Wikipedia:Extended image syntax].
|valign="top"|


===<i class="fas fa-atom"></i> For authors: advanced topics===
Advanced topics are discussed in [[Proteopedia:Cookbook]], e.g. integrated quizzes (for details, see [[Help:Quiz]]), how to construct a URL showing a specific scene, and inserting interactive buttons (for details, see [[Jmol/Interactivity]]). Consult [[Help:Jmol]] for advanced 3D scenes. To look under the hood of other pages, use the "edit this page" tab. [[Proteopedia:DIY:Templates|Templates]] and [[Proteopedia:Macros|macros]] can help to achieve effects with less typing.
* The computer modeling tools (Chimera, SPRITE, Dali, BLAST, InterPro, and SwissDock) allowed visualization of the protein’s structure as a whole and at the amino acid level, showed different alignments and active sites, compared it to other proteins for function analysis, and showed different substrate interactions, among other results covered in the experimental section.


|-
* The PAGE gel showed whether the purification process worked and, based on the strongest band, which sample to use when testing for enzyme activity.


|valign="top"|
* Bradford assay provided a concentration of our samples and determined which sample would be used for further testing.


=== <i class="fas fa-chalkboard-teacher"></i> For teachers===
For teaching with Proteopedia, a good starting point is [[Help: Teaching with Proteopedia]]. Also consult [[Teaching_Scenes, Tutorials, and Educators' Pages]] and if applicable, [[High school teachers' resources]]. Students might benefit from studying [[Proteopedia:Primer]] and [[Media:Proteopedia tutorial step by step.pdf]]. If you need a space for students to draft pages, reserve some sandboxes for your students [https://proteopedia.org/cgi-bin/sandboxReservation here].
* UV-Vis measured protein activity at different pH values: more activity produced more absorbance from the color change, suggesting that the enzyme may function in a part of the body at that pH.
|valign="top"|


===<i class="fas fa-question"></i> More help? Contact us===
 
Feel free to [[Help:Contact Us|contact us]], or subscribe to and post questions on the [[Proteopedia:Email list]]. As a registered user, you can also contact other users (e.g. if you have a question about a page they contributed to) by going to their user page and clicking on "contact this user" on the left side panel. If you encounter error messages, consult [[Help:Errors]].  
 
|}
== '''Materials & Methods''' ==
 
=== Computer Modeling Strategies ===
 
* Chimera: allowed visualization of the protein; was used to compare conformation, identity, and conservation of amino acid side chains; was used instead of Dali (Dali didn’t analyze side chains); visualized results from SPRITE in a different way.
 
* SPRITE: evaluated local alignments (only a small part of the protein); identified active sites; was used to search for configurations of amino acid side chains that have a similar structure to those of known enzyme active sites.
 
* Dali: aligned the entire 3D protein (global alignment); matched structurally similar proteins rather than similar sequences, as BLAST does. Dali is more limited than BLAST since it can only work with protein structures as a whole (there are fewer known structures than known sequences); matches were based only on the backbone of the protein and did not involve the amino acid side chains that would show functionality of the enzyme.
* BLAST: sequence searching that only matched similar amino acid sequences rather than the whole protein; looked at the sequence in sections, found matching overlaps with sequences in the database, scored the matches based on similarity, and provided a list of matches.
 
* InterPro: matched similar sequences rather than the whole protein; matched the unknown protein to a family which gave clues about its function, where it is, and general information about qualities that it might share with other proteins in that family.
 
* SwissDock: computationally predicted how substrates would interact and bind with the active site or allosteric sites; tested various ligands and provided binding energies.
 
 
=== Key Information about Chemicals, Equipment, & Instrumentation ===
 
* Escherichia coli (E. coli) strain BL21(DE3)
 
* Buffers (Sodium Phosphate buffer, Tris-HCl, Re-Suspension Buffer, Cell Lysis Buffer, 1X Wash Buffer, 1X Elution Buffer, 10X SDS-PAGE Buffer, Coomassie Blue Stain, and Destain) were prepared by classmates with appropriate techniques and all materials were stored at necessary temperatures.
 
* Equipment used: computer modeling systems, Bradford Assay, PAGE, sonicator (Qsonica Sonicators), centrifuge (SORVALL RC 5C Plus), UV-Vis (Olis 8453), freezer (Glacier, -86°C, ULTRALOW TEMPERATURE FREEZER)
 
* Protein samples were kept on ice to avoid denaturing due to temperature changes.
 
* Glassware was cleaned, all equipment was properly prepared prior to the start of the lab, and contamination was reduced through the use of disposable cuvettes and pipette tips. 
 
 
     
== '''Results''' ==
 
[[Image:SPRITE_pic.png]]
 
Figure 1: List of hits of 2QRU from SPRITE (10 of 200 entries shown here). The first column tells what hit each row shows and the second column has the source PDB ID that the program uses to identify each protein in the database. The third column gives a description of the protein and the final column shows the RMSD value of that protein when compared to the search protein of 2QRU.
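
The RMSD values reported in the SPRITE hit list summarize how far matched atoms lie from each other after superposition. A minimal Python sketch of that calculation is given here; the function name <code>rmsd</code> and the coordinates are illustrative assumptions (not SPRITE output), and the sketch assumes the two atom sets are already superposed and paired atom for atom.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two paired lists of (x, y, z) points.

    Assumes both sets are already superposed and listed in the same atom order.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical coordinates in angstroms: every atom of `hit` is shifted 1 A in z.
motif = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
hit = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0), (0.0, 1.0, 1.0)]

print(f"RMSD = {rmsd(motif, hit):.3f} A")
```

A lower RMSD means the matched side chains sit closer to the reference motif, which is why the hits are ranked by this value.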
 
[[Image:SPRITE_pic_2.png]]
 
Figure 2: Alignment of 2QRU with 6 different motifs (A 33 GLY, A 102 SER, A 219 ASP, A 247 HIS; A 102 SER, A 130 GLY, A 219 ASP, A 247 HIS; A 102 SER, A 104 GLY, A 219 ASP, A 247 HIS; A 34 GLY, A 102 SER, A 219 ASP, A 247 HIS; A 102 SER, A 219 ASP, A 247 HIS; A 104 GLY, A 219 ASP, A 247 HIS) from 3 different views in SPRITE. Alignment was generated from the “Full Details” and the “Superposed motifs” function was utilized such that all proteins are visible.
 
 
[[Image:Chimera_pic_2.png]]
 
Figure 3: Protein #0 with colors by element. This image was generated in Chimera to observe the shape of the protein 2QRU, which is later compared to similar proteins given by SPRITE to determine their relative similarities.
 
 
[[Image:Chimera_pic_3.png]]
 
 
Figure 4: In Chimera, protein 2QRU matched most closely with enzymes such as trypsins, proteases, and chymotrypsins. However, the best RMSD value, 2.328 Å, was found with the plasminogen shown here.
 
 
[[Image:Dali_4.png]]
 
Figure 5: Results from the 2QRU search in Dali and comparison to the full PDB. The first column shows the chain name, the second shows the Z-score, and the third shows the RMSD value. Next are lali (the number of structurally equivalent residues) and nres (the total number of residues in the matched structure), then %id (the percent sequence identity over the aligned residues), and finally a description of the matched protein.
 
 
>2QRU_1|Chain A|Uncharacterized protein|Enterococcus faecalis (226185) :
SNAHLKNNQTLANGATVTIYPTTTEPTNYVVYLHGGGMIYGTKSDLPEELKELFTSNGYTVLALDYLLAPNTKIDHILRTLTETFQLLNEEIIQNQSFGLCGRSAGGYLMLQLTKQLQTLNLTPQFLVNFYGYTDLEFIKEPRKLLKQAISAKEIAAIDQTKPVWDDPFLSRYLLYHYSIQQALLPHFYGLPENGDWSAYALSDETLKTFPPCFSTASSSDEEVPFRYSKKIGRTIPESTFKAVYYLEHDFLKQTKDPSVITLFEQLDSWLKER
 
Figure 6: RCSB Website FASTA sequence of 2QRU. Decreasing word size from 5→2 did not change number of results in BLAST.
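
As a quick sanity check on the sequence above, the FASTA record can be parsed and its length converted to a rough molecular mass. The Python sketch below is illustrative only: the helper name <code>parse_fasta</code> is hypothetical, the sequence is re-wrapped at 60 residues per line, and the 110 Da average residue mass is a common rule of thumb (an assumption, not a measured value).

```python
# FASTA record for 2QRU chain A, re-wrapped at 60 residues per line.
FASTA = """>2QRU_1|Chain A|Uncharacterized protein|Enterococcus faecalis (226185)
SNAHLKNNQTLANGATVTIYPTTTEPTNYVVYLHGGGMIYGTKSDLPEELKELFTSNGYT
VLALDYLLAPNTKIDHILRTLTETFQLLNEEIIQNQSFGLCGRSAGGYLMLQLTKQLQTL
NLTPQFLVNFYGYTDLEFIKEPRKLLKQAISAKEIAAIDQTKPVWDDPFLSRYLLYHYSI
QQALLPHFYGLPENGDWSAYALSDETLKTFPPCFSTASSSDEEVPFRYSKKIGRTIPEST
FKAVYYLEHDFLKQTKDPSVITLFEQLDSWLKER
"""

AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb assumption


def parse_fasta(text):
    """Return (header, sequence) for a single-record FASTA string."""
    header, chunks = None, []
    for line in text.strip().splitlines():
        line = line.strip()
        if line.startswith(">"):
            header = line[1:]
        elif line:
            chunks.append(line)
    return header, "".join(chunks)


header, sequence = parse_fasta(FASTA)
approx_kda = len(sequence) * AVERAGE_RESIDUE_MASS_DA / 1000.0

print(header)
print(f"{len(sequence)} residues, roughly {approx_kda:.0f} kDa")
```

An estimate of this kind gives an approximate size to look for when reading the gel lanes.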
 
 
 
[[Image:BLAST.png]]
 
Figure 7: RCSB PDB Result Page for 2QRU. This page shows an overview of information about protein 2QRU including several classifications.
 
 
[[Image:InterPro_2.png]]
 
Figure 8: InterPro results for 2QRU. This shows the main page of InterPro when searching 2QRU, and it includes information such as the possible protein family membership (although there was no match for this particular protein). It also shows several domains that are similar matches to 2QRU, all of which are types of hydrolases, suggesting that 2QRU is a type of hydrolase as well. It gives the results for the biological process (none), the molecular function (hydrolase), and the cellular component (none). Some information is missing from this page, which is what our research aimed to help fill in.
 
 
[[Image:InterPro_3.png]]
 
Figure 9: Abhydrolase_3 - PF07859: alpha/beta hydrolase fold profile. This result was found by selecting the domain from the InterPro overview page shown in figure 8. This gives more information about that domain specifically. The information includes several classifications as well as options on the side bar to investigate the result further.
 
 
[[Image:InterPro_4.png]]
 
Figure 10: AB_hydrolase_3 - IPR013094: alpha/beta hydrolase fold-3 profile. This result was found by selecting another one of the domains from the InterPro overview page shown in figure 8 and this gives more information about that domain specifically. The information includes several classifications as well as options on the side bar to investigate the result further.
 
 
[[Image:InterPro_5.png]]
 
Figure 11: AB_hydrolase - IPR029058: alpha/beta hydrolase fold profile. This result was found by selecting another one of the domains from the InterPro overview page shown in Figure 8, and it gives more information about that domain specifically. The information includes several classifications as well as options on the side bar to investigate the result further.
 
 
[[Image:InterPro_7.png]]
 
Figure 12: Structure results of IPR029058: alpha/beta hydrolase fold. This was found by selecting the side bar option of structure in InterPro. This provided information about the structural qualities of proteins in that domain.
 
 
[[Image:InterPro_9.png]]
 
Figure 13: Taxonomy results of IPR029058: alpha/beta hydrolase fold. This was found by selecting the side bar option of taxonomy in InterPro. This provided information about the taxonomic background of proteins in that domain. (KEY: purple: bacteria, pink: viruses, yellow: archaea, green: eukaryote, blue: other).
 
 
[[Image:SwissDock.png]]
 
Figure 14: SwissDock results showing Alanine-p-nitroanilide interacting with the active site of protein 2QRU.
 
 
[[Image:SwissDock_2.png]]
 
Figure 15: SwissDock results showing 4-nitroacetanilide interacting with the active site of protein 2QRU.
 
 
[[Image:BA_calcss.png]]
 
Figure 16: Bradford Assay Standards Preparation.
 
 
[[Image:BA_2.jpg]]  [[Image:BA_3.jpg]]  [[Image:BA_4.jpg]]
 
Figure 17: Bradford Assay Calculations.
 
 
 
[[Image:SC1.png]]  [[Image:SC2.png]]
 
Figure 18: Bradford Assay Standard Curve.
 
 
[[Image:BA_5.png ]]
 
Figure 19: Absorbance of protein at 595 nm.
 
 
[[Image:BA_6.png ]]
 
Figure 20: Absorbances of elutions 2-4 after serial dilutions.
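
The Bradford workflow behind Figures 16-20 (standards, standard curve, then unknown concentrations corrected for dilution) can be sketched in Python as a simple linear fit. All numbers below are placeholders chosen for illustration, not the measured values from this lab, and the function names <code>linear_fit</code> and <code>concentration</code> are hypothetical.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Placeholder standards (mg/mL) and blank-corrected absorbances at 595 nm.
standards_mg_ml = [0.0, 0.25, 0.5, 0.75, 1.0]
absorbance_595 = [0.00, 0.15, 0.30, 0.45, 0.60]

slope, intercept = linear_fit(standards_mg_ml, absorbance_595)


def concentration(a595, dilution_factor=1.0):
    """Read a concentration off the standard curve and undo the dilution."""
    return (a595 - intercept) / slope * dilution_factor


# Placeholder unknown: A595 = 0.30 measured after a 10-fold dilution.
print(f"slope = {slope:.3f} A per mg/mL, intercept = {intercept:.3f}")
print(f"undiluted sample: {concentration(0.30, dilution_factor=10):.2f} mg/mL")
```

Readings are only reliable when the diluted absorbance falls inside the range covered by the standards, which is the reason for the serial dilutions.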
 
 
[[Image:Gel_2.jpg]]
 
Figure 21: SDS-PAGE results showing bands at ~30 kDa, consistent with the expected size of the 2QRU protein. Wells 1 to 10 (left to right): elutions 1-5, before sonication, before column, flowthrough, wash, ladder (molecular-weight size markers).
 
 
[[Image:PH_6.4.png]]
 
Figure 22: UV-vis results at pH 6.4. There was a very strong and steady increase in enzyme activity. The best enzymatic function was observed at this pH.
 
 
[[Image:PH_7.2.png]]
 
Figure 23: UV-vis results at pH 7.2. There was no steady increase in enzyme activity. Instead, it remained within a range despite a few random spikes which can clearly be seen as outliers.
 
 
[[Image:PH_8.png]]
 
Figure 24: UV-vis results at pH 8. There was a steady increase in enzyme activity and then it gradually plateaued.
 
 
[[Image:PA_6.4.png]]
 
Figure 25: Protein Activity Assay at pH 6.4. Vmax = 2 × 10<sup>-8</sup> M/min.
 
 
[[Image:PA_8.png]]
 
Figure 26: Protein Activity Assay at pH 8. Vmax = 7 × 10<sup>-7</sup> M/min.
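
To put the two reported Vmax values side by side, the short Python sketch below computes their ratio and also shows how an absorbance slope could be converted to a rate with the Beer-Lambert law (A = ε·l·c). The two Vmax values are taken from Figures 25 and 26; the extinction coefficient, path length, and absorbance slope are placeholder assumptions, and the effective extinction coefficient of the colored product may itself depend on pH.

```python
def rate_molar_per_min(slope_abs_per_min, epsilon_per_molar_cm, path_cm=1.0):
    """Convert an absorbance slope (A/min) to a rate (M/min) using A = epsilon * l * c."""
    return slope_abs_per_min / (epsilon_per_molar_cm * path_cm)


# Vmax values as reported in Figures 25 and 26 (M/min).
vmax_ph_6_4 = 2e-8
vmax_ph_8 = 7e-7

fold_change = vmax_ph_8 / vmax_ph_6_4
print(f"Vmax(pH 8) / Vmax(pH 6.4) = {fold_change:.0f}")

# Placeholder inputs for illustration only: slope in A/min, epsilon in 1/(M*cm).
example_rate = rate_molar_per_min(0.018, 18000.0, path_cm=1.0)
print(f"example rate = {example_rate:.1e} M/min")
```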
 
 
 
== '''Discussion''' ==
 
Since the unknown enzyme showed measurable activity with the common hydrolase substrate p-nitrophenyl acetate, it is likely that 2QRU is indeed a hydrolase. When 2QRU was tested at a pH of ~2, the enzyme came completely out of solution, indicating that it cannot function in so acidic an environment in the body. Since the protein worked best at near-neutral pH, it is likely that the enzyme functions as a hydrolase in either neuronal or muscle cells. The Vmax at pH 8 (7 × 10<sup>-7</sup> M/min) was about 35-fold higher than the Vmax at pH 6.4 (2 × 10<sup>-8</sup> M/min), suggesting that although this hydrolase can function in both neuronal and muscle cells, it may favor the environment of muscle cells or be more abundant in muscle cells than in neuronal cells.
 
=== Accuracy & Precision of Results ===
 
Our results are partially accurate, as we tested our protein with a general substrate for hydrolases. Although not directly matched to our enzyme's exact active site, the substrate we used, p-nitrophenyl acetate (PNPA), does generally work with hydrolases as a whole. The results from the computer models and from the equipment are expected to be accurate, as the databases are reliable and the equipment was calibrated and used correctly.
 
Our results may be precise, but further testing and repeated measurements are needed to determine whether they are reproducible. We ran each experiment, at each pH level, only once, so we cannot confidently confirm precision.
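
Once replicate runs are available, precision could be summarized with a mean, a sample standard deviation, and a coefficient of variation. The Python sketch below shows one way to do this; the replicate rates are hypothetical placeholders (no replicates were run in this study), and the helper name <code>summarize</code> is an assumption.

```python
import statistics


def summarize(rates):
    """Return (mean, sample standard deviation, coefficient of variation)."""
    mean = statistics.mean(rates)
    sd = statistics.stdev(rates)  # needs at least two replicates
    return mean, sd, sd / mean


# Hypothetical replicate rates at a single pH, in M/min.
hypothetical_rates = [6.0e-7, 7.0e-7, 8.0e-7]

mean, sd, cv = summarize(hypothetical_rates)
print(f"mean = {mean:.2e} M/min, sd = {sd:.2e} M/min, CV = {cv:.1%}")
```

With at least three replicates per pH, differences between conditions could then be compared against this run-to-run spread.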
 
=== Future Experiments ===
 
Using a substrate that the computer modeling showed to interact more directly with the expected active site of our protein could give more specific insight into the type of hydrolase that 2QRU may be.

When testing with UV-Vis, taking readings at more closely spaced time points would allow for more precise results.

Repeating the experiment multiple times would increase the reliability of the results.
 
 
 
== '''Conclusion''' ==
 
Overall, the hypothesis that protein 2QRU is a type of hydrolase was supported by testing with the general hydrolase substrate p-nitrophenyl acetate. After testing in three different pH environments, the best pH for this enzyme appeared to be near neutral, between 6.4 and 8, with the highest activity at pH 8, which corresponds to muscle cells. Based on computational docking, the best-matched substrates are suspected to be 4-nitroacetanilide and alanine-p-nitroanilide. Enzyme activity testing has not yet been completed with these substrates, but that work could indicate which specific substrates best match the enzyme, providing more detail about the structure and function of 2QRU.
 
 
 
== References ==
 
<references/>
The BASIL Biochemistry Curriculum. basilbiochem.org (n.d.). Ashley Ringer McDonald, Herbert J. Bernstein, S. Colette Daubner, Jonathan D. Dattelbaum, Anya Goodman, Bonnie L. Hall, Stefan M. Irby, Julia R. Koeppe, Jeffrey L. Mills, Stephen A. Mills, Suzanne F. O’Handley, Michael Pikaart, Rebecca Roberts, Arthur Sikora, Paul A. Craig