[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[modeller_usage] (no subject)



I'm trying run align2d on 3dxn chain a's pdb and it's sequence provided by the pdb.

My code is as below:

from modeller import *
env = environ()
aln = alignment(env)
mdl = model(env, file="3dxn_nmin")
aln.append_model(mdl, align_codes='structure', atom_files="3dxn_nmin.pdb")
aln_block = len(aln)
aln.append('3dxn.a.fasta', alignment_format='FASTA')
aln.align2d(align_block=aln_block, output_weights_file='2dalign.mtx')
aln.check_sequence_structure()
aln.write(file='tmp.ali', alignment_format='PIR')

In the alignment, I get:
>P1;structure
structureX:3dxn_nmin.pdb:  19 :A:+258 :A:::-1.00:-1.00
------------------LSDRYQRVKKLG--SYGEVLLCKDK-TGAERAIKIIKKSSVTTTSNSGALLDEVAVL
KQLDHPNIMKLYEFFEDKRNYYLVMEVYRGGELFDEIILRQKFSEVDAAVIMKQVLSGTTYLHKHNIVHRDLKPE
NLLLESKS-DALIKIVDFGLSAHFE-------RLGTAYYIAPEVLRKKYDEKCDVWSCGVILYILLCGYPPFGGQ
TDQEILKRVEKGKFSFDPPDWTQVSDEAKQLVKLMLTYEPSKRISAEEALNHPWIVKFCSQK*

>P1;3DXN:A:sequence
sequence::     : :     : :::-1.00:-1.00
MHHHHHHSSGRENLYFQGLSDRYQRVKKLGSGAYGEVLLCKDKLTGAERAIKIIKKSSVTTTSNSGALLDEVAVL
KQLDHPNIMKLYEFFEDKRNYYLVMEVYRGGELFDEIILRQKFSEVDAAVIMKQVLSGTTYLHKHNIVHRDLKPE
NLLLESKSRDALIKIVDFGLSAHFEVGGKMKERLGTAYYIAPEVLRKKYDEKCDVWSCGVILYILLCGYPPFGGQ
TDQEILKRVEKGKFSFDPPDWTQVSDEAKQLVKLMLTYEPSKRISAEEALNHPWIVKFCSQK*

If
you look at the biggest gap, preceded by "HFE" in the alignment, that E
should be at the other end of the gap.  Sequence-wise, having it at
either end is equivalent (there is an E in both places), but the actual pdb file has "ERL" at the other end and "HF" preceding.

When I run the check_sequence_structure(), it agrees, saying:
Implied target CA(i)-CA(i+1) distances longer than  8.0 angstroms:

ALN_POS  TMPL  RID1  RID2  NAM1  NAM2     DIST
----------------------------------------------
    174     1  174   182      F     E   12.004
END OF TABLE


I've placed the files at github if they are of any help
They can be cloned at:
git clone git://gist.github.com/114942.git

Thanks,
David