Free R: Difference between revisions

Eric Martz (talk | contribs)
Eric Martz (talk | contribs)
 
(15 intermediate revisions by the same user not shown)
Line 1: Line 1:
[[Free R]] (also called R<sub>free</sub>) "is generally considered the most useful global measure of model-to-data agreement"<ref name="validation4pdb">PMID: 22000512</ref>. It is a statistical quantity introduced in 1992 by Axel T. Brünger<ref>PMID: 18481394</ref> to assess the quality of a model from X-ray crystallographic data. It is calculated in the same manner as the [[R value]], but from a subset of the data set aside for the calculation of free R, and not used in the refinement of the model. It is a more reliable tool for assessing the model than the R value because it is not self-referential -- that is, as an estimation of errors, free R is free of any bias that may have been introduced during refinement. Free R is an example of the application of the more general method of [http://en.wikipedia.org/wiki/Cross-validation_(statistics) cross validation].
[[Free R]] (also called R<sub>free</sub>) "is generally considered the most useful global measure of model-to-data agreement"<ref name="validation4pdb">PMID: 22000512</ref>. It is a statistical quantity introduced in 1992 by Axel T. Brünger<ref name="bruenger">PMID: 18481394</ref> to assess the quality of a model from X-ray crystallographic data. It is calculated in the same manner as the [[R value]], but from a subset of the data set aside for the calculation of free R, and not used in the refinement of the model. It is a more reliable tool for assessing the model than the R value because it is not self-referential -- that is, as an estimation of errors, free R is free of any bias that may have been introduced during refinement. Free R is an example of the application of the more general method of [http://en.wikipedia.org/wiki/Cross-validation_(statistics) cross validation].
 
==Help Interpreting Free R==
When a crystallographic protein structure is visualized in [[FirstGlance/Index|FirstGlance in Jmol]], its Free R will be automatically interpreted as ''Reliability'' with the following levels, each relative to its [[Resolution]]:
*<span style="color:#00e000;font-weight:bold;">Much better than average</span>
*<span style="color:green;font-weight:bold;">Better than average</span>
*<span style="color:black;font-weight:bold;">Average</span>
*<span style="color:#a0a000;font-weight:bold;">Worse than average</span>
*<span style="color:red;font-weight:bold;">Unreliable</span>


==Good Values for Free R==
==Good Values for Free R==


As a rule of thumb, for models with [[resolution]] of 2.0 &Aring or better, free R should not exceed (resolution/10) by more than 0.05; that is, if the resolution is 2.0 &Aring;, free R should not significantly exceed 0.25. For resolutions around 3.0 &Aring;, the free R should not exceed (resolution/10. Free R values exceeding the worst 25% for the resolution of the model (see table below) raise doubts about the quality of the model.
As a rule of thumb, for models with [[resolution]] of 2.0 &Aring; or better (&Aring; values <2.0), free R should not exceed (resolution/10) by more than 0.05; that is, if the resolution is 2.0 &Aring;, free R should not significantly exceed 0.25. For resolutions around 3.0 &Aring;, the free R should not exceed (resolution/10). Free R values exceeding the worst 25% for the resolution of the model (see table below) raise doubts about the quality of the model.  
 
In October, 2011, there are 79 X-ray crystallographic entries in the [[PDB]] deposited 1990 or later, with free R values of >= 0.40. The median resolution of these 79 entries is 4.2 &Aring;.


Free R is positively correlated with [[resolution]]<ref name="validation4pdb" />:
Free R is positively correlated with [[resolution]]<ref name="validation4pdb" />:
Line 66: Line 72:
|-
|-
| ~4.0
| ~4.0
| 0.32
| 0.31
| <font color="red">- 0.08</font>
| <font color="red">- 0.09</font>
| <0.30
| <0.28
| >0.37
| >0.35
|}
:*Data in the above table for resolutions <= 3.5 &Aring; are approximated from Figure 3C in <ref name="validation4pdb" />, based on X-ray entries deposited in the [[PDB]] since 1990.
:*Data for resolution ~4.0 &Aring; are based on 199 X-ray entries, for which free R values are given, with resolutions of 3.9-4.1 &Aring; (Feb 2013).
:*The median resolution for X-ray entries in the [[PDB]] is 2.07 &Aring; (Feb 2013).
 
==Bad Values for Free R==
<!--In October, 2011, there are 79 X-ray crystallographic entries in the [[PDB]] deposited 1990 or later, with free R values of >= 0.40. The median resolution of these 79 entries is 4.2 &Aring;.-->
In October, 2011, there are 182 X-ray crystallographic entries in the [[PDB]] with free R values of >= 0.37. 45 of these have a resolution <= 3.0 &Aring;, and 5 have a resolution of <= 2.0 &Aring;. It seems likely that there are serious errors in most of these 45 models.
 
In one amazing case, [[1zff]], the resolution is given as 0.94 &Aring;, the R value as 0.428, and the free R as 0.444.
 
==No Value for Free R==
In October, 2011, there are 5,908 X-ray crystallographic entries in the PDB for which no free R value is given. These represent 8.8% of all 66,847 X-ray entries. Most of these were deposited before  the free R methodology was described by Brünger in 1992<ref name="bruenger" />.
 
:{| cellpadding="4" style="text-align: center; border: 1px solid darkgray;"
|- style="background: #e0e0e0;"
! Deposition Date
! X-ray entries
! No Free R
|-
| Through 1990
| 666
| 666 (100%)
|-
| 1991-1995
| 3,209
| 2,852 (89%)
|-
| 1996-2000
| 8,832
| 1,858 (21%)
|-
| 2001-2005
| 18,522
| 342 (1.8%)
|-
| 2006-<br>Oct 2011
| 35,618
| 190 (0.5%)
|}
|}
**Data in the above table for resolutions <= 3.5 &Aring; are approximated from Figure 3C in <ref name="validation4pdb" />, based on X-ray entries deposited in the [[PDB]] since 1990.
**Data for resolution ~4.0 &Aring; are based on 168 X-ray entries deposited 1990 or later with resolutions of 3.9-4.1 &Aring; (Oct 2011). For comparison, there were 677 entries with resolutions 3.4-3.6 &Aring;.
**The median resolution for X-ray entries in the [[PDB]] is 2.06 &Aring; (Oct 2011).


==See Also==
==See Also==

Proteopedia Page Contributors and Editors (what is this?)Proteopedia Page Contributors and Editors (what is this?)

Eric Martz, Eran Hodis