Download IsoStar User Guide and Tutorials 2010 Release

Transcript
6
Experimental Details and Limitations
6.1 Details of Scatterplot Methodology
6.1.1 Structures Used for Calculating CSD-Based Scatterplots
• The Cambridge Structural Database (CSD; http://www.ccdc.cam.ac.uk/products/csd/) contains
the results of organic and metallo-organic crystal structure determinations. Standard software
(ConQuest) can be used to search the database for intermolecular contacts between any specified
pair of chemical groups.
• If the structure of a compound occurs more than once in the CSD (duplicate structure
determinations of the same crystal form or determinations of different crystal forms), only one
example is included in the searches used to set up the IsoStar scatterplots.
• With this exception, all CSD structures are used for the IsoStar searches, regardless of
crystallographic R-factor, presence of disorder, etc. However, CSD structures with missing
hydrogen atoms are not used in creating IsoStar scatterplots unless the missing hydrogen atoms
are irrelevant to the plot (i.e. not part of either the central or contact group).
6.1.2 Structures Used for Calculating PDB-Based Scatterplots
• The Protein Data Bank (PDB; http://www.rcsb.org/pdb/) contains the results of protein
structures, including many protein-ligand complexes. Only protein-ligand, ligand-water and
protein-water contacts are included in IsoStar scatterplots, i.e. contacts between protein atoms
are ignored.
• A subset of the database is used for IsoStar. Amongst the restrictions applied are:
• Crystallographic determinations only (no NMR structures).
• Resolution less than or equal to 2.5Å.
• Protein-ligand complexes only. A ligand is defined as a peptide of up to ten residues or a nonpeptide or non-haem molecule of at least nine atoms.
• DNA and RNA complexes excluded.
• Most covalently-bonded complexes excluded.
• Most duplicate and near-duplicate structures excluded (e.g. structures with the words mutant,
mutation or mutated in their PDB files).
• No ligands with atoms of partial occupancy.
• For structures containing more than one binding site with the same ligand, only one of the
binding sites is used.
• To generate the IsoStar plots, ligands of interest and their surrounding residues (ie. binding sites)
are identified in all structures. Crystallographic symmetry operators are applied to generate the
full ligand environment where appropriate.
• In addition, a few structures have been eliminated because they caused processing problems.
32
IsoStar User Guide