I have aligned three homologous protein sequences by
salign_profile_profile.py , but the alignment file obtained is not accurate.
Can anyone tell me which script should I use for multiple sequence
On 07/01/2009 05:33 AM, shilpi wrote:
> I have aligned three homologous protein sequences by
> salign_profile_profile.py , but the alignment file obtained is not
> accurate. Can anyone tell me which script should I use for multiple
> sequence alignment.
That is not surprising - profile-profile alignment aligns two blocks of
previously-aligned sequences (two profiles). For example,
salign_profile_profile.py in the Modeller examples/salign/ directory
aligns two profiles read from mega_prune.faa; the first contains 15
sequences (align_block=15) and the second contains the remaining
sequences from mega_prune.faa.
For multiple alignment, if you have structures, use salign_multiple.py
or salign_iterative.py. If you only have sequences, either use malign or
modify the salign parameters in your existing script - at a minimum you
need to change align_what='BLOCK', alignment_type='TREE' and
similarity_flag=False, and remove align_block.
S?ren Skovstrup Hansen wrote:
> A very simple question: Is it possible to resume a crashed Modeller job?
> I was running a more or less standard automodel job, but my computer
> crashed after building 200 out of 250 models. Hence, is it possible to
> resume this job and thereby evaluating and ranking all 250 build models
> via e.g. the DOPE score?
There is no automatic method. To get the remaining 50 models, you could
simply modify your Python script and rerun it (set
a.starting_model=201). You can extract the DOPE scores from the logfile
of the original run - but if you don't have that, you can recalculate
the scores using a script similar to that at http://salilab.org/modeller/9v7/manual/node242.html
Date: Mon, 6 Jul 2009 00:27:59 -0700 (PDT)
From: Sehrish Batool <">>
Subject: [modeller_usage] Modeling large protein sequence having
homologues with less similarity.
Content-Type: text/plain; charset="iso-8859-1"
???????? My target sequence of protein is too large containing more than 1000 amino acids and I found 4 homologues in PDB showing very less similarity?i-e. two showing 28% similarity, one?showing 24% and last?one showing 19% similarity with my target sequence. Also the starting residues of templates have no homology with my target sequence and Modeller?has to?read from 1st residue of template, that bocomes impossible in this case. So?now how?can I?model my target sequence???