Theoretical models: Difference between revisions

From Proteopedia
Jump to navigation Jump to search
Eric Martz (talk | contribs)
polishing
Eric Martz (talk | contribs)
→‎Ab Initio Models: added intrinsic disorder
Line 30: Line 30:
When there is no template with sufficient sequence identity to use for homology modeling, one can use ''ab initio'' or ''de novo'' folding theory to predict the structure of a target protein sequence. Such theory is about 70% successful at predicting [[secondary structure]]<ref>[http://cmgm.stanford.edu/WWW/www_predict.html Accuracy of Protein Structure Prediction] at Stanford University.</ref>. Tertiary structure prediction has modest success for small protein chains (80 amino acids or less), but is generally unable to predict the fold for longer chains. In about one out of four cases of small domains of less than 85 amino acids, the best predictions are within about 1.5 &Aring; (RMS for carbon alphas) of the true structure<ref>PMID: 16166519</ref>. (Independent determinations of the same protein by empirical methods generally agree within about 0.5 &Aring; RMS for carbon alphas.)
When there is no template with sufficient sequence identity to use for homology modeling, one can use ''ab initio'' or ''de novo'' folding theory to predict the structure of a target protein sequence. Such theory is about 70% successful at predicting [[secondary structure]]<ref>[http://cmgm.stanford.edu/WWW/www_predict.html Accuracy of Protein Structure Prediction] at Stanford University.</ref>. Tertiary structure prediction has modest success for small protein chains (80 amino acids or less), but is generally unable to predict the fold for longer chains. In about one out of four cases of small domains of less than 85 amino acids, the best predictions are within about 1.5 &Aring; (RMS for carbon alphas) of the true structure<ref>PMID: 16166519</ref>. (Independent determinations of the same protein by empirical methods generally agree within about 0.5 &Aring; RMS for carbon alphas.)


The success of fold prediction methods is assessed biannually in the ''Critical Assessment of Techniques for Protein Structure Prediction'' (CASP) competitions<ref>[http://predictioncenter.gc.ucdavis.edu/ Critical Assessment of Techniques for Protein Structure Prediction (CASP)].</ref>. Crystallographers submit sequences which they have solved, but for which the structures have not yet been published. Modelers predict the folds which are then compared with subsequently published structures. There are also competitions to predict protein-protein docking interactions<ref>[http://www.ebi.ac.uk/msd-srv/capri/ CAPRI: Critical Assessment of PRediction of Interactions].</ref>
The success of fold prediction methods is assessed biannually in the ''Critical Assessment of Techniques for Protein Structure Prediction'' (CASP) competitions<ref>[http://predictioncenter.gc.ucdavis.edu/ Critical Assessment of Techniques for Protein Structure Prediction (CASP)].</ref>. Crystallographers submit sequences which they have solved, but for which the structures have not yet been published. Modelers predict the folds which are then compared with subsequently published structures. In CASP5 (2002), the ability to predict [[Intrinsically Disordered Protein|intrinsic disorder]] was included<ref>PMID: 19774619</ref>. There are also competitions to predict protein-protein docking interactions<ref>[http://www.ebi.ac.uk/msd-srv/capri/ CAPRI: Critical Assessment of PRediction of Interactions].</ref>


==See Also==
==See Also==

Revision as of 18:35, 8 January 2010

This article needs to be improved, expanded, and more references need to be cited.


The term theoretical model refers to a molecular model obtained, wholly or in part, by the use of theory, such as homology modeling, energy minimization, molecular mechanics or molecular dynamics. Such theoretical models are distinguished from empirical models, which are usually obtained by X-ray crystallography or nuclear magnetic resonance (NMR).

The distinction between theoretical and empirical models is important because when theoretical models are compared with empirical models, the theoretical models often contain significant errors. In contrast, when the structure of a particular macromolecule is determined using empirical methods by different laboratories, or both by crystallography and NMR, the agreement is usually quite good.

1,390 theoretical models were historically deposited in the Protein Data Bank but removed from the main database in 2002. Pages automatically generated in Proteopedia for these theoretical models do not display these structures due to their unreliability (see Category:Theoretical Model).

Empirical ModelsEmpirical Models

Methods for judging the reliability and quality of empirical models are discussed at Quality assessment for molecular models. Independent determinations of the same protein by empirical methods generally agree within about 0.5 Å RMS for carbon alphas (reference needed).

Homology ModelsHomology Models

Homology models, also called comparative models, are obtained by folding a target protein sequence to fit an empirically-determined template model. The registration between residues in the target and template is determined by an amino acid sequence alignment between the target and template sequences. Errors or uncertainties in the sequence alignment result in errors or uncertainties in the homology model. Provided there is sufficient sequence identity between the target and template, the main chain in homology models is usually mostly correct. However, the positions of sidechains in homology models are usually incorrect.

Empirically-determined templates with adequate sequence identity are available for less than half of all protein sequences. One of the major goals of structural genomics is to increase the sequence diversity of the available empirically-determined structures that can be used as templates for homology modeling.

SWISS-MODEL provides a free, fully-automated homology modeling service. Using the Automated Mode, you submit a protein sequence. When the PDB contains an empirically-determined structure with sufficient sequence identity with your target sequence, it will be used as a template. The resulting homology model will be constructed automatically.

When no suitable template exists, the Structural Genomics Target Database should be searched with your sequence. In some cases, a sequence-similar protein has already been crystallized and diffracted, but the model may not have been completed, or the completed model may not yet have been deposited in the PDB. In such cases, it may be worthwhile to contact the team that has made the most progress on a closely related sequence.

See also the list of resources at User:Wayne Decatur/Homology Modeling.

ExamplesExamples

Ab Initio ModelsAb Initio Models

When there is no template with sufficient sequence identity to use for homology modeling, one can use ab initio or de novo folding theory to predict the structure of a target protein sequence. Such theory is about 70% successful at predicting secondary structure[1]. Tertiary structure prediction has modest success for small protein chains (80 amino acids or less), but is generally unable to predict the fold for longer chains. In about one out of four cases of small domains of less than 85 amino acids, the best predictions are within about 1.5 Å (RMS for carbon alphas) of the true structure[2]. (Independent determinations of the same protein by empirical methods generally agree within about 0.5 Å RMS for carbon alphas.)

The success of fold prediction methods is assessed biannually in the Critical Assessment of Techniques for Protein Structure Prediction (CASP) competitions[3]. Crystallographers submit sequences which they have solved, but for which the structures have not yet been published. Modelers predict the folds which are then compared with subsequently published structures. In CASP5 (2002), the ability to predict intrinsic disorder was included[4]. There are also competitions to predict protein-protein docking interactions[5]

See AlsoSee Also

References & LinksReferences & Links

  1. Accuracy of Protein Structure Prediction at Stanford University.
  2. Bradley P, Misura KM, Baker D. Toward high-resolution de novo structure prediction for small proteins. Science. 2005 Sep 16;309(5742):1868-71. PMID:16166519 doi:309/5742/1868
  3. Critical Assessment of Techniques for Protein Structure Prediction (CASP).
  4. Noivirt-Brik O, Prilusky J, Sussman JL. Assessment of disorder predictions in CASP8. Proteins. 2009 Aug 21. PMID:19774619 doi:10.1002/prot.22586
  5. CAPRI: Critical Assessment of PRediction of Interactions.

Proteopedia Page Contributors and Editors (what is this?)Proteopedia Page Contributors and Editors (what is this?)

Eric Martz, Jaime Prilusky, Wayne Decatur