alignment.malign() -- align two or more sequences



rr_file = <str:1> '$(LIB)/as1.sim.mat' input residue-residue scoring file

gap_penalties_1d = <float:2> 900 50 gap creation and extension penalties for sequence/sequence alignment

off_diagonal = <int:1> 100 to speed up the alignment

align_block = <int:1> 0 the last sequence in the first block of sequences

matrix_offset = <float:1> 0.00 substitution matrix offset for local alignment

overhang = <int:1> 0 un-penalized overhangs in protein comparisons

local_alignment = <bool:1> False whether to do local as opposed to global alignment

This command performs a multiple sequence alignment of the sequences in the current alignment arrays. It uses dynamic programming to find the best sequence alignment, given the gap initiation and extension penalties specified by gap_penalties_1d and the residue type weights read from the file rr_file. See command alignment.align() for more information.
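To illustrate what dynamic programming with separate gap initiation and extension penalties means, here is a minimal sketch of a global alignment score with affine gaps. It is not MODELLER's implementation: the function name affine_global_score, the toy scoring function, and the convention that higher scores are better (with penalties subtracted) are all assumptions made for this illustration.

```python
NEG_INF = float("-inf")


def affine_global_score(a, b, score, gap_open, gap_extend):
    """Best global alignment score of a and b (hypothetical helper).

    A gap of length k costs gap_open + (k - 1) * gap_extend.
    Higher scores are assumed to be better.
    """
    n, m = len(a), len(b)
    # M: a[i-1] aligned with b[j-1]
    # X: a[i-1] aligned with a gap in b
    # Y: b[j-1] aligned with a gap in a
    M = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    X = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0.0
    for i in range(1, n + 1):
        X[i][0] = -(gap_open + (i - 1) * gap_extend)
    for j in range(1, m + 1):
        Y[0][j] = -(gap_open + (j - 1) * gap_extend)

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = score(a[i - 1], b[j - 1])
            M[i][j] = s + max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1])
            # Opening a gap pays gap_open; continuing one pays gap_extend.
            X[i][j] = max(M[i - 1][j] - gap_open,
                          X[i - 1][j] - gap_extend,
                          Y[i - 1][j] - gap_open)
            Y[i][j] = max(M[i][j - 1] - gap_open,
                          Y[i][j - 1] - gap_extend,
                          X[i][j - 1] - gap_open)
    return max(M[n][m], X[n][m], Y[n][m])


def toy_score(x, y):
    """Toy residue-residue weight: +1 for identity, -1 otherwise (assumption)."""
    return 1 if x == y else -1
```

With the toy weights and gap_open=2, gap_extend=1, aligning "ACGT" with "AT" prefers one two-residue gap over two separate one-residue gaps, which is the behavior the two-penalty scheme is meant to produce.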

The multiple alignment is built progressively. First, sequence 2 is aligned with sequence 1 (or, more generally, with the first block of sequences, 1 to align_block). Next, sequence 3 is aligned with an average of the aligned sequences 1 and 2; i.e., the weight matrix is an average of the weights 1-3 and 2-3. For this averaging, the gap-residue and gap-gap weights are obtained from the residue-residue weight matrix file, not from the gap penalties. If the corresponding weights are not in the file, the gap-residue weight is set to the worst residue-residue score and the gap-gap weight to the best residue-residue score.
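The averaging step can be sketched in a few lines of Python. This is an illustration under stated assumptions, not MODELLER's code: the names make_scorer and block_weight_matrix, the dictionary representation of the weight matrix, the "-" gap symbol, and the convention that a higher weight is better are all hypothetical.

```python
def make_scorer(matrix, gap="-"):
    """Return a weight lookup with the fallbacks described above.

    matrix maps (residue, residue) pairs to weights (hypothetical layout).
    Missing gap-residue weights fall back to the worst residue-residue
    weight, missing gap-gap weights to the best one. Higher is assumed
    to be better.
    """
    residue_weights = [w for (x, y), w in matrix.items() if gap not in (x, y)]
    worst, best = min(residue_weights), max(residue_weights)

    def score(x, y):
        if (x, y) in matrix:
            return matrix[(x, y)]
        if (y, x) in matrix:
            return matrix[(y, x)]
        if x == gap and y == gap:
            return best
        if gap in (x, y):
            return worst
        raise KeyError((x, y))

    return score


def block_weight_matrix(block, new_seq, score):
    """Average the pairwise weights between new_seq and an aligned block.

    block is a list of equal-length aligned sequences (gaps included).
    Entry [i][j] is the mean weight of new_seq[i] against column j.
    """
    n_cols = len(block[0])
    return [
        [sum(score(res, row[j]) for row in block) / len(block)
         for j in range(n_cols)]
        for res in new_seq
    ]


# Toy weights; no gap entries, so the fallbacks apply.
toy_matrix = {("A", "A"): 4, ("A", "G"): 0, ("G", "G"): 6}
toy = make_scorer(toy_matrix)
weights = block_weight_matrix(["AG", "A-"], "AG", toy)
```

Here the second column of the block holds a G and a gap, so the weight of the new G against it is the average of the G-G weight and the gap-residue fallback.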


Example: examples/commands/malign.py

# Example for: alignment.malign()

# This will read all sequences from a file, align them, and write 
# the alignment to a new file:

from modeller import *

env = environ()

aln = alignment(env, file='toxin.ali', align_codes='all')
aln.malign(gap_penalties_1d=(-600, -400))
aln.write(file='toxin-seq.pap', alignment_format='PAP')


Ben Webb 2007-01-19
