Interpreting ConSurf Results
This page discusses how to decide whether a ConSurf result is optimal for the questions you wish to ask about a protein. It assumes that you already have one or more completed ConSurf results. For background principles and instructions on how to get a ConSurf result, please see ConSurf/Index.
Breadth of the MSABreadth of the MSA
A ConSurf result depends crucially on the sequences included in the multiple sequence alignment (MSA). The optimal breadth of coverage of those sequences depends on your goal. The breadth of an MSA is represented in the #Average Pairwise Distance (APD).
- If you want to know which residues are important for the specific function of one protein, then the MSA should not include proteins with different functions. See Limiting ConSurf Analysis to Proteins of a Single Function.
- If you want to know which residues remain important throughout an entire protein family (or superfamily), then the MSA should be broad enough to include representives of the entire family. Conserved residues will highlight residues that are conserved to enable proper folding of the domain. Such an MSA may obscure conservation of some residues important for a specific function.
For more about this, see What is the best way to collect homologous sequences in order to construct an MSA? at the ConSurf Server.
Average Pairwise DistanceAverage Pairwise Distance
The average pairwise distance (APD) in a multiple sequence alignment (MSA) is a measure of the evolutionary breadth of the range of sequences included. The APD is "The average number of replacements between any two sequences in the alignment; A distance of 0.01 means that on average, the expected replacement for every 100 positions is 1." (quoted from the ConSurf Server).