[modeller_usage] helix model

20 May 2009


      Hi all.
I have some sequences from real bacteria that correspond to one protein in the pdb, 1SIG.
Of course, they have lots of mutations. I would like  to know which changes 
the structure would have, due to these mutations.
I used Modeller to answer that question, for the first time. 
The predicted structures are very very similar to the given structure.
I am surprised that some helices are not broken or bended just a bit.
So I did the following experiment:  I replaced in the sequence, by hand, twenty amino 
acids that correspond to a helix in 1SIG, with 20 amino acids that are already in a loop in the sequence. 
Then I asked Modeller to calculate the
structure and I get almost the same structure again, with helix, even in the part in which 
I replaced the helix sites.
As far as I know, to get a helix requires several energy states to coincide, so it is 
difficult that by change I get the helix.
So it seems to me the program just copies the structure.
The details are the following:
1SIG is from the pdb. 
I attach the sequence I would like to model, Ppertucin, and the alignment between them.
I also attach the hand modified sequence, Ppertucintest  in which the 23  amino acids 
DPDDDGLSGTDNAVVPAAAAKPKD (that are present in Ppertucin (46-69) already but have no alignment with 1SIG) replace 23
amino acids that were between positions 97 and 120, matched to a helix in 1SIG, and I keep the same alignment.
The model for Ppertucin does not predicts a helix for the 23 amino acids that correspond to a 
gap in 1SIG. But when the same 23 amino acids appear again in Ppertucintest aligned to a helix,
then the model for Ppertucintest is a helix in that fragment.
I can conclude that if I assume the helix is conserved, then it will be, otherwise, not. 
How can make the results useful?
Modeller was not intended for these kind of situations?
Thanks for any hint,
Jairo Rocha
University of the Balearic Islands
Spain
>Ppertucin
RIEEGIREVMAAISMFPGTVDGILTEYQRIAEEGGRLTDIFNGYIDPDDDGLSGTDNAVV
PAAAAKPKDEKKASDDDEEEEEDTEDDTEEETDGGPDPEIARQRFGAVQEQLEKVRKVLK
AKKGDRSHPDVVAEMETLAQLFMPIKLVPKQYDALVARVRGVQDDIRARERAIMQLCVRD
ARMPRADFLRSFPGNETNEKWIDEVLAKKPAYADALAALQPDILRQQQQLIALEQDAQLT
IAQV
>P1;1SIG.pdb
structureX:1SIG.pdb:113:A:446:A:ferredoxin:Peptococcus aerogenes: 2.00:-1.00
MEGEIDIAKRIEDGINQVQCSVAEYPEAITYLLEQYNRVEAEEARLSDLITGFVD-----
------------DLAPTATHVGSELSQE-------------DLDIDPELAREKFAELRAQ
YVVTRDTIKA------HATAQEEILKLSEVFKQFRLVPKQFDYLVNSMRVMMDRVRTQER
LIMKLCVEQCKMPKKNFITLFTGNETSDTWFNAAIAMNKPWSEKLHDVSEEVHRALQKLQ
QIEEETGLTIEQVKDINRRMSIGEAKARRAKKEMVEANLRLVISIAKKYTNRGLQFLDLI
QEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQ*
>P1;Ppertucin
sequence:Ppertucin:1 :A:244   :A:ferredoxin:Peptococcus aerogenes: 2.00:-1.00
---------RIEEGIREVMAAISMFPGTVDGILTEYQRIAEEGGRLTDIFNGYIDPDDDG
LSGTDNAVVPAAAAKPKDEKKASDDDEEEEEDTEDDTEEETDGGPDPEIARQRFGAVQEQ
LEKVRKVLKAKKGDRSHPDVVAEMETLAQLFMPIKLVPKQYDALVARVRGVQDDIRARER
AIMQLCVRDARMPRADFLRSFPGNETNEKWIDEVLAKKPAYADALAALQPDILRQQQQLI
ALEQDAQLTIAQV-----------------------------------------------
-----------------------------------------*
>Ppertucintest
RIEEGIREVMAAISMFPGTVDGILTEYQRIAEEGGRLTDIFNGYIDPDDDGLSGTDNAVV
PAAAAKPKDEKKASDDDEEEEEDTEDDTEEETDGGPDPDDDGLSGTDNAVVPAAAAKPKD
AKKGDRSHPDVVAEMETLAQLFMPIKLVPKQYDALVARVRGVQDDIRARERAIMQLCVRD
ARMPRADFLRSFPGNETNEKWIDEVLAKKPAYADALAALQPDILRQQQQLIALEQDAQLT
IAQV
>P1;1SIG.pdb
structureX:1SIG.pdb:113:A:446:A:xxx:xxx: 2.00:-1.00
MEGEIDIAKRIEDGINQVQCSVAEYPEAITYLLEQYNRVEAEEARLSDLITGFVD-----
------------DLAPTATHVGSELSQE-------------DLDIDPELAREKFAELRAQ
YVVTRDTIKA------HATAQEEILKLSEVFKQFRLVPKQFDYLVNSMRVMMDRVRTQER
LIMKLCVEQCKMPKKNFITLFTGNETSDTWFNAAIAMNKPWSEKLHDVSEEVHRALQKLQ
QIEEETGLTIEQVKDINRRMSIGEAKARRAKKEMVEANLRLVISIAKKYTNRGLQFLDLI
QEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQ*
>P1;Ppertucintest
sequence:Ppertucintest:1 :A:244   :A:xxxx:xxx: 2.00:-1.00
---------RIEEGIREVMAAISMFPGTVDGILTEYQRIAEEGGRLTDIFNGYIDPDDDG
LSGTDNAVVPAAAAKPKDEKKASDDDEEEEEDTEDDTEEETDGGPDPDDDGLSGTDNAVV
PAAAAKPKDAKKGDRSHPDVVAEMETLAQLFMPIKLVPKQYDALVARVRGVQDDIRARER
AIMQLCVRDARMPRADFLRSFPGNETNEKWIDEVLAKKPAYADALAALQPDILRQQQQLI
ALEQDAQLTIAQV-----------------------------------------------
-----------------------------------------*