Kuntz Home / DOCK Home / DOCKumentation Contents / DOCK
prev section prev toc up next next section

chemgrid

Elaine Meng
Overview / Input / Output / Sample Output

Overview

chemgrid creates the grid files necessary for force field scoring (
Meng, Shoichet, and Kuntz, 1992). Force field scores are approximate molecular mechanics interaction energies, consisting of van der Waals and electrostatic components:

            (1)
where each term is a double sum over ligand atoms i and receptor atoms j, Aij and Bij are van der Waals repulsion and attraction parameters, rij is the distance between atoms i and j, qi and qj are the point charges on atoms i and j, D is the dielectric function, and 332.0 is a factor that converts the electrostatic energy into kilocalories per mole.

An efficient grid calculation requires the use of a geometric mean approximation for the van der Waals parameters:

     and        (2)
where the single-atom-type parameters are calculated from van der Waals radius, R, and well depth, epsilon, according to:

     and             (3)
Using this approximation, eq. 1 can be rewritten:

         (4)
Three values are stored for every grid point k, each a sum over receptor atoms that are within a user-defined cutoff distance of the point:

         (5)
These values, with or without interpolation, are multiplied by the appropriate ligand values to give the interaction energy. chemgrid calculates the grid values and stores them in files. The values will be read in during a DOCK run and used for force field scoring. Substituting eq. 5 into eq. 4, the interaction energy is:

                   (6)
Atoms that fall outside the grid, if any, are given interaction energies of zero.

The user determines the location and dimensions of the grid box. It is not necessary for the whole receptor to be enclosed; only the regions where ligand atoms may be placed need to be included. The box merely delimits the space where grid points are located, and does not cause receptor atoms to be excluded from the calculation. Besides a direct specification of coordinates, there is an option to center the grid at a sphere cluster center of mass. Any combination of spacing and x, y, and z extents may be used as long as the total number of points does not exceed the array size, maxgrd. The PDB box file from the program showbox is used for setting up the grid location. The programs showesp and showprobe may be helpful for examining the output grid files.

Input

The input file names and parameters are read from a file called INCHEM, which should not contain any blank lines:

recfil
is the name of the PDB-format file containing the receptor atom coordinates. The entire receptor, including all hydrogens on nitrogen, oxygen, or sulfur atoms should be included.

table
is the name of the receptor parameter file to be used. Three tables are supplied in the parms subdirectory: prot.table.ambcrg.ambH (protein, AMBER united-atom charges, AMBER hydrogen names), prot.table.ambcrg.pdbH (protein, AMBER united-atom charges, PDB hydrogen names), and na.table.ambcrg (nucleic acid, AMBER united-atom charges). The united-atom parameters are from the 1984 AMBER reference (Weiner et al., 1984). Each table consists of comment lines and atom parameter lines:

!  receptor parameter file sample
!  '!' at the beginning of a line indicates a comment
!
!aaaxxxrrrnnnncqqqqqqqqxtt   format (A4, 3X, A3, A4, A1, F8.3, X, I2)
! atom name, residue name, residue number (read as characters), chain
! indicator, charge, van der Waals type
!
N      ALA       -0.520  8         ALANINE
CA     ALA        0.215  4
C      ALA        0.526  1       
O      ALA       -0.500 11
H      ALA        0.248  6
CB     ALA        0.031  2
Receptor atom parameterization is done by hashing and typically takes only a few seconds. In the table, the atom name is mandatory, as well as the charge and van der Waals type. The residue name, residue number, and chain indicator are optional. A receptor atom will be parameterized according to most specific match that can be found. The order of lines in the table is immaterial. The user may add to the table to accommodate nonstandard residues, or may create a table with completely different parameters.

vdwfil
is the van der Waals parameters file. The file vdw.parms.amb is located in the parms subdirectory and contains AMBER all-atom and united-atom parameters (Weiner et al., 1984; Weiner et al., 1986). This table consists of comment lines and lines containing SQRT(A) and SQRT(B) for each van der Waals type. The lines must be in order of type, without any skipped integers. The types are referred to by number in the receptor parameter file, table.

inbox
is the name of the input file, created with showbox, defining the grid location. Due to upwards rounding when the grid is set up, the volume of the grid may be slightly larger than that of the box. The center and dimensions are read from the REMARK lines of this PDB-format file; the remaining lines are ignored. The dimensions are aligned along the x, y, and z axes.

grddiv
is the spacing of the grid points, in Ångstroms. Values of 0.2 to 0.5 Ångstroms are recommended for most purposes.

estype
indicates whether a constant or distance-dependent dielectric function will be used. 0: constant, 1: distance-dependent. Any other values default to 1.

esfact
is the dielectric multiplicative factor. To get D = 2, for example, estype should be 0 and esfact should be 2.0; to get D = 4.5r, estype should be 1 and esfact should be 4.5.

cutoff
is the cutoff in Ångstroms for nonbonded interactions. Values of 8.0 or greater are generally appropriate.

pcon
is the close contact limit, in Ångstroms, for receptor polar atoms. Values of 2.0 to 2.5 Ångstroms are reasonable for most purposes. This variable affects docking when force field scoring only is being done. If contact scoring alone or in combination with any other kind of score is being performed, the limits set in distmap define the number of bad contacts an orientation is making.

ccon
is the close contact limit, in Ångstroms, for receptor carbon atoms. Values of 2.5 to 3.0 Ångstroms are reasonable for most purposes. This variable affects docking when force field scoring only is being done. If contact scoring alone or in combination with any other kind of score is being performed, the limits set in distmap define the number of bad contacts an orientation is making.

grdfil
is the prefix name for the output files containing the grid values. Three files are created. For example, if grdfil is foo, the files will be named foo.bmp, foo.esp, and foo.vdw. The file with the .bmp extension is an ASCII file; it contains the grid parameters and a single character for each grid point indicating whether or not a ligand atom placed at that location would be violating any close contact limits, according to the values of ccon and pcon. The file with the .esp extension is binary and contains the electrostatic values. The file with the .vdw extension is binary and contains the van der Waals attraction and repulsion values.

Output

Three output files are generated in addition to the grid files and the PDB box file: OUTCHEM, OUTPARM, and PDBPARM. OUTCHEM lists the parameters specified in INCHEM, so that one can verify whether the intended values have been read in. PDBPARM explicitly lists the parameters along with the coordinates of each receptor atom. Whenever parameters are not found for a given receptor atom, they are set to zero (no volume and no charge), and a message is written to OUTPARM. OUTPARM ends with the calculated net charge on the receptor molecule. It is a good idea to check the net charge and make sure that it is correct (it should be an integer!) and agrees with that value from the accessory program
charge. For convenience in error detection, a line is written for each residue calculated to have a net charge; while +1 and -1 are normal, a noninteger charge probably indicates a parameterization error for that residue. The lines in OUTPARM reporting charged residues each contain the string "CHARGED RESIDUE".

What's all this stuff in my OUTPARM file?

Sample Output

 receptor pdb file:
 4dfr_model.pdb
 receptor parameters will be read from:
 /usr/einstein/dock/parms/prot.table.ambcrg.ambH
  van der Waals parameter file:
 /usr/einstein/dock/parms/vdw.parms.amb
 input box file defining grid location:
4dfr30.box
 box center coordinates [x y z]:
    25.00000       68.00000       46.00000    
 box x-dimension =    25.00000    
 box y-dimension =    15.00000    
 box z-dimension =    25.00000    
 grid spacing in Ångstroms
   0.3000000    
 grid points per side [x y z]:
           85          51          85
 total number of grid points =       368475
 a distance-dependent dielectric will be used
the dielectric function will be multiplied by   4.00
 cutoff distance for energy calculations:
    10.00000    
 distances defining bumps with receptor atoms:
receptor polar atoms  2.30
receptor carbon atoms  2.80
 output grid prefix name:
test

prev section prev toc up next next section

Curator: Daniel Gschwend, gschwend@cgl.ucsf.edu (rev. 1 September 1995)