next up previous contents index
Next: MALIGN3D align Up: Comparison and searching of Previous: COMPARE compare   Contents   Index

ALIGN3D -- align two structures

GAP_PENALTIES_3D = $\langle{\tt real:2}\rangle$ 0.0 1.75 gap creation and extension penalties for structure/structure superposition
FIT_ATOMS = $\langle{\tt string:1}\rangle$ 'CA' one atom type used for superposition
FIT = $\langle{\tt logical:1}\rangle$ on whether to align
STOP_ON_ERROR = $\langle{\tt integer:1}\rangle$ 1 whether to stop on error
OUTPUT = $\langle{\tt string:1}\rangle$ 'LONG' 'SHORT' | 'LONG' | 'VERY_LONG'
ALIGN3D_TRF = $\langle{\tt logical:1}\rangle$ off whether to transform the distances before dynamic programming
ALIGN3D_REPEAT = $\langle{\tt logical:1}\rangle$ off do several starts to maximize number of equivalent positions
OFF_DIAGONAL = $\langle{\tt integer:1}\rangle$ 100 to speed up the alignment
MATRIX_OFFSET = $\langle{\tt real:1}\rangle$ 0.00 substitution matrix offset for local alignment
OVERHANG = $\langle{\tt integer:1}\rangle$ 0 un-penalized overhangs in protein comparisons
LOCAL_ALIGNMENT = $\langle{\tt logical:1}\rangle$ off whether to do local as opposed to global alignment

Output:
MODELLER_STATUS = $\langle{\tt integer:1}\rangle$

Description:
This command uses the current alignment as the starting point for an iterative least-squares superposition of two 3D structures. This results in a new pairwise structural alignment. If no alignment is in memory, the initial alignment is the 1:1 alignment. A good initial alignment may be obtained by sequence alignment (ALIGN). For superpositions, only one atom per residue is used, as specified by FIT_ATOMS[1].

The alignment algorithm is as follows. First, structure 2 is least-squares fit on structure 1 using all the equivalent residue positions in the initial alignment that have the specified atom type. Next, the residue-residue distance matrix is obtained by calculating Euclidean distances between all pairs of selected atoms from the two structures. The alignment of the two structures is then obtained by the standard dynamic programming optimization based on the residue-residue distance matrix.

GAP_PENALTIES_3D[1] is a gap creation penalty (usually 0), and GAP_PENALTIES_3D[2] is a gap extension penalty, say 1.75. This procedure identifies pairs of residues as equivalent when they have their selected atoms at most 2 times GAP_PENALTIES_3D[2] angstroms apart in the current orientation (this is so when the gap initiation penalty is 0). The reason is that an equivalence costs the distance between the two residues while an alternative, the gap-residue and residue-gap matches, costs twice the gap extension penalty.

From the dynamic programming run, a new alignment is obtained. Thus, structure 2 can be fitted onto structure 1 again, using this new alignment, and the whole cycle is repeated until there is no change in the number of equivalent positions and until the difference in the rotation matrices for the last two superpositions is very small. At the end, the framework, that is the alignment positions without gaps, is written to the log file.

If FIT is off, no alignment is done.

If OUTPUT contains 'SHORT', only the best alignment and its summary are displayed. If OUTPUT contains 'LONG', summaries are displayed for all initial alignments in each framework cycle. If OUTPUT contains 'VERY_LONG', all alignments are displayed.

If ALIGN3D_TRF is on, the weights in the weight matrix are modified distances [Subbiah et al., 1993].

If ALIGN3D_REPEAT is on, three additional initial alignments are tried and the one resulting in the largest number of equivalent positions is selected.

Example:


# Example for: ALIGN3D, SUPERPOSE

# This will align 3D structures of two proteins:

SET OUTPUT_CONTROL = 1 1 1 1 1

# First example: read sequences from a sequence file:
READ_ALIGNMENT FILE = 'toxin.ali', ALIGN_CODES = '1fas' '2ctx'
ALIGN GAP_PENALTIES_1D= -600 -400
ALIGN3D GAP_PENALTIES_3D= 0 4.0
WRITE_ALIGNMENT FILE = 'toxin-str.ali'

# Second example: read sequences from PDB files to eliminate the 
# need for the toxin.ali sequence file:
READ_MODEL FILE = '1fas'
SEQUENCE_TO_ALI ATOM_FILES = '1fas', ALIGN_CODES = '1fas'
READ_MODEL FILE = '2ctx'
SEQUENCE_TO_ALI ADD_SEQUENCE = on, ATOM_FILES = ATOM_FILES '2ctx', ;
                ALIGN_CODES = ALIGN_CODES '2ctx'
ALIGN GAP_PENALTIES_1D= -600 -400
ALIGN3D GAP_PENALTIES_3D=  0 2.0
WRITE_ALIGNMENT FILE = 'toxin-str.ali'

# And now superpose the two structures using current alignment to get 
# various RMS's:
READ_MODEL  FILE = '1fas'
PICK_ATOMS ATOM_TYPES = 'CA'
READ_MODEL2 FILE = '2ctx'
SUPERPOSE FIT_ATOMS = 'CA'


next up previous contents index
Next: MALIGN3D align Up: Comparison and searching of Previous: COMPARE compare   Contents   Index
Ben Webb 2004-04-20