Jmol/Visualizing large molecules: Difference between revisions
Eric Martz (talk | contribs) |
Eric Martz (talk | contribs) |
||
Line 4: | Line 4: | ||
==Maximum Size Per Model== | ==Maximum Size Per Model== | ||
Strictly speaking, the format of [[PDB files]] is limited to 99,999 atoms in a single model, because there are only 5 columns allocated to atom serial numbers. (Files in the [[Atomic coordinate files|mmCIF format]] can be read by Jmol, and do not suffer from this limitation.) [[3cc2]] is a model of a large ribosomal subunit containing 99,049 atoms (close to the limit for a single PDB file). Most likely it will display in Jmol when you go to that page. Jmol ignores the atom serial number, columns 7-11 in the PDB file, instead assigning its own ''atomIndex'' number, unique for each atom, and not redundant between models. Jmol can handle PDB files containing >100,000 atoms. | Strictly speaking, the format of [[PDB files]] is limited to 99,999 atoms in a single model, because there are only 5 columns allocated to atom serial numbers. (Files in the [[Atomic coordinate files#mmCIF_Data_Format|mmCIF format]] can be read by Jmol, and do not suffer from this limitation.) [[3cc2]] is a model of a large ribosomal subunit containing 99,049 atoms (close to the limit for a single PDB file). Most likely it will display in Jmol when you go to that page. Jmol ignores the atom serial number, columns 7-11 in the PDB file, instead assigning its own ''atomIndex'' number, unique for each atom, and not redundant between models. Jmol can handle PDB files containing >100,000 atoms. | ||
This limitation requires that models containing >=100,000 atoms be split into two or more PDB files, or else represented as artificially separated models in a single PDB file. These work-arounds are awkward for visualization. An example is the combination of portions of the two files [[1jgo]] and [[1gix]] for visualization of a complete [[Ribosome]]. | This limitation requires that models containing >=100,000 atoms be split into two or more PDB files, or else represented as artificially separated models in a single PDB file. These work-arounds are awkward for visualization. An example is the combination of portions of the two files [[1jgo]] and [[1gix]] for visualization of a complete [[Ribosome]]. |
Revision as of 23:20, 1 January 2011
Inadequate Memory May Preclude DisplayInadequate Memory May Preclude Display
Some molecular models ("molecules") are so large that they will not fit within the default amount of computer memory allocated to Jmol (which is the default amount of memory allocated to java). While it is possible to increase the memory allocated to java, most users will not do this, and hence, will not be able to display, in Proteopedia or Jmol, molecules that exceed a certain size.
Maximum Size Per ModelMaximum Size Per Model
Strictly speaking, the format of PDB files is limited to 99,999 atoms in a single model, because there are only 5 columns allocated to atom serial numbers. (Files in the mmCIF format can be read by Jmol, and do not suffer from this limitation.) 3cc2 is a model of a large ribosomal subunit containing 99,049 atoms (close to the limit for a single PDB file). Most likely it will display in Jmol when you go to that page. Jmol ignores the atom serial number, columns 7-11 in the PDB file, instead assigning its own atomIndex number, unique for each atom, and not redundant between models. Jmol can handle PDB files containing >100,000 atoms.
This limitation requires that models containing >=100,000 atoms be split into two or more PDB files, or else represented as artificially separated models in a single PDB file. These work-arounds are awkward for visualization. An example is the combination of portions of the two files 1jgo and 1gix for visualization of a complete Ribosome.
Multiple Model FilesMultiple Model Files
The largest PDB files in the Protein Data Bank are those containing multiple models of large molecules. Since the atom serial numbers start at 1 in each model, these files can get very large (>1,000,000 atoms is possible). An example is 3ezb, which contains 40 models (determined by solution NMR). Each model contains 5,323 atoms (including 2,694 hydrogen atoms); the 40 model file contains 212,920 atoms, and the PDB file is 16.5 megabytes in size. When you visit the page 3ezb, the ensemble will fail to display, producing an "out of memory" error (unless you have allocated more than the default amount of memory to java on your computer). There are files in the PDB several-fold larger than 3ezb. For example, 2hyn is a 64 megabyte file containing 826,896 atoms in 184 models.
Displaying Only The First ModelDisplaying Only The First Model
Biological AssembliesBiological Assemblies
However, biological unit assemblies generated from PDB files often exceed this limit by using redundant serial numbers for copied chains. Luckily, Jmol ignores the serial numbers in the PDB file, and can handle models with >100,000 atoms.