Theoretical models
This article needs to be improved, expanded, and more references need to be cited.
The term theoretical model refers to a molecular model obtained, wholly or in part, by the use of theory, such as homology modeling, energy minimization, molecular mechanics or molecular dynamics. Such theoretical models are distinguished from empirical models, which are usually obtained by X-ray crystallography or nuclear magnetic resonance (NMR).
The distinction between theoretical and empirical models is important because when theoretical models are compared with empirical models, the theoretical models often contain significant errors. In contrast, when the structure of a particular macromolecule is determined using empirical methods by different laboratories, or both by crystallography and NMR, the agreement is usually quite good.
Empirical ModelsEmpirical Models
Methods for judging the reliability and quality of empirical models are discussed at Quality assessment for molecular models. Independent determinations of the same protein by empirical methods generally agree within about 0.5 Å RMS for carbon alphas (reference needed).
Homology ModelsHomology Models
Homology models, also called comparative models, are obtained by folding a target protein sequence to fit an empirically-determined template model. The registration between residues in the target and template is determined by an amino acid sequence alignment between the target and template sequences. Errors or uncertainties in the sequence alignment result in errors or uncertainties in the homology model. Provided there is sufficient sequence identity between the target and template, the main chain in homology models is usually mostly correct. However, the positions of sidechains in homology models are usually incorrect.
Empirically-determined templates with adequate sequence identity are available for less than half of all protein sequences. One of the major goals of structural genomics is to increase the sequence diversity of the available empirically-determined structures that can be used as templates for homology modeling.
SWISS-MODEL provides a free, fully-automated homology modeling service. Using the Automated Mode, you submit a protein sequence. When the PDB contains an empirically-determined structure with sufficient sequence identity with your target sequence, it will be used as a template. The resulting homology model will be constructed automatically.
When no suitable template exists, the Structural Genomics Target Database should be searched with your sequence. In some cases, a sequence-similar protein has already been crystallized and diffracted, but the model may not have been completed, or the completed model may not yet have been deposited in the PDB. In such cases, it may be worthwhile to contact the team that has made the most progress on a closely related sequence.
ExamplesExamples
- Structure of E. coli DnaC helicase loader concerns a homology model.
Ab Initio ModelsAb Initio Models
When there is no template with sufficient sequence identity to use for homology modeling, one can use ab initio or de novo folding theory to predict the structure of a target protein sequence. Such theory is about 70% successful at predicting secondary structure[1]. Tertiary structure prediction has modest success for small protein chains (80 amino acids or less), but is generally unable to predict the fold for longer chains. In about one out of four cases of small domains of less than 85 amino acids, the best predictions are within about 1.5 Å (RMS for carbon alphas) of the true structure[2]. (Independent determinations of the same protein by empirical methods generally agree within about 0.5 Å RMS for carbon alphas.)
The success of fold prediction methods is assessed biannually in the Critical Assessment of Techniques for Protein Structure Prediction (CASP) competitions[3]. Crystallographers submit sequences which they have solved, but for which the structures have not yet been published. Modelers predict the folds which are then compared with subsequently published structures. There are also competitions to predict protein-protein docking interactions[4]
See AlsoSee Also
- Theoretical models displayed in Proteopedia must be clearly identified: see Proteopedia:Policy#Theoretical Models using methods explained at Proteopedia:Cookbook#Theoretical Models.
References & LinksReferences & Links
- ↑ Accuracy of Protein Structure Prediction at Stanford University.
- ↑ Bradley P, Misura KM, Baker D. Toward high-resolution de novo structure prediction for small proteins. Science. 2005 Sep 16;309(5742):1868-71. PMID:16166519 doi:309/5742/1868
- ↑ Critical Assessment of Techniques for Protein Structure Prediction (CASP).
- ↑ CAPRI: Critical Assessment of PRediction of Interactions.