Hi,
I have been trying to generate a model of the AhR using three different templates with different ligands in each of them. When I submit the script for the model building, I get the error below:
"_modeller.ModellerError: read_te_290E> Number of residues in the alignment and pdb files are different: 110 109 For alignment entry: 1 3f1o_pasA"
I tried changing the number highlighted in the alignment file to 110, but the same error appeared. Was I supposed to change anything in the pdb file? Also, since I would like to consider the three ligands, is that correct to add three dots at the end of the alignment and pap file? The alignment, .pap, and script for the model building are below.
Model building script:
# Comparative modeling with ligand transfer from the template
from modeller import * # Load standard Modeller classes
from modeller.automodel import * # Load the AutoModel class
import sys
log.verbose() # request verbose output
env = Environ() # create a new MODELLER environment to build this model in
# directories for input atom files
env.io.atom_files_directory = ['.', '../atom_files']
# Read in HETATM records from template PDBs
env.io.hetatm = True
a = AutoModel(env, alnfile='ahr-mult.ali',
knowns=('3f1o_pasA','3h7w_pasA','3h82_pasA'), sequence='AhR', assess_methods=(assess.DOPE))
a.starting_model= 1 # index of the first model
a.ending_model = 100 # index of the last model
# (determines how many models to calculate)
a.make() # do the actual comparative modeling
Alignment file:
>P1;3f1o_pasA
structureX:3f1o_pas_fit.pdb:236:A:+109:A:MOL_ID 1; MOLECULE ENDOTHELIAL PAS DOMAIN-CONTAINING PROTEIN 1; CHAIN A; FRAGMENT HIF2 ALPHA C-TERMINAL PAS DOMAIN; SYNONYM EPAS-1, MEMBER OF PAS PROTEIN 2, BASIC-HELIX-LOOP- PROTEIN MOP2, HYPOXIA-INDUCIBLE FACTOR 2 ALPHA, HIF-2 ALPHA ALPHA, HIF-1 ALPHA-LIKE FACTOR, HLF; ENGINEERED YES; MUTATION YES; MOL_ID 2; MOLECULE ARYL HYDROCARBON RECEPTOR NUCLEAR TRANSLOCATOR; CHAIN B; FRAGMENT ARNT C-TERMINAL PAS DOMAIN; SYNONYM ARNT PROTEIN, DIOXIN RECEPTOR, NUCLEAR TRANSLOCATO HYPOXIA-INDUCIBLE FACTOR 1 BETA, HIF-1 BETA; ENGINEERED YES; MUTATION YES:MOL_ID 1; ORGANISM_SCIENTIFIC HOMO SAPIENS; ORGANISM_COMMON HUMAN; ORGANISM_TAXID 9606; GENE EPAS1, HIF2A, HYPOXIA INDUCIBLE FACTOR 2 ALPHA, MOP2; EXPRESSION_SYSTEM ESCHERICHIA COLI; EXPRESSION_SYSTEM_TAXID 562; EXPRESSION_SYSTEM_STRAIN BL21; EXPRESSION_SYSTEM_VECTOR_TYPE PHIS-GB1-PARALLEL; MOL_ID 2; ORGANISM_SCIENTIFIC HOMO SAPIENS; ORGANISM_COMMON HUMAN; ORGANISM_TAXID 9606; GENE ARNT, ARYL HYDROCARBON RECEPTOR NUCLEAR TRANSLOCATOR; EXPRESSION_SYSTEM ESCHERICHIA COLI; EXPRESSION_SYSTEM_TAXID 562; EXPRESSION_SYSTEM_STRAIN BL21; EXPRESSION_SYSTEM_VECTOR_TYPE PHIS-PARALLEL: 1.60: 0.17
-FKGLDSKTFLSEHSMDMKFTYCDDRITELIGYHPEELLGR-SAYEFYHALDSENMTKSHQNLCTKGQVVSGQYR
MLAKHGGYVWLETQGTVIYN-----PQCIMCVNYVLSEIEK.*
>P1;3h7w_pasA
structureX:3h7w_pas_fit.pdb:236:A:+108:A:MOL_ID 1; MOLECULE ENDOTHELIAL PAS DOMAIN-CONTAINING PROTEIN 1; CHAIN A; FRAGMENT HIF2ALPHA C-TERMINAL PAS DOMAIN (UNP RESIDUES 239 SYNONYM EPAS-1, MEMBER OF PAS PROTEIN 2, BASIC-HELIX-LOOP- PROTEIN MOP2, HYPOXIA-INDUCIBLE FACTOR 2 ALPHA, HIF-2 ALPHA ALPHA, HIF-1 ALPHA-LIKE FACTOR, HLF; ENGINEERED YES; MUTATION YES; MOL_ID 2; MOLECULE ARYL HYDROCARBON RECEPTOR NUCLEAR TRANSLOCATOR; CHAIN B; FRAGMENT ARNT C-TERMINAL PAS DOMAIN (UNP RESIDUES 356 TO 4 SYNONYM ARNT PROTEIN, CLASS E BASIC HELIX-LOOP-HELIX PROTE BHLHE2, DIOXIN RECEPTOR, NUCLEAR TRANSLOCATOR, HYPOXIA-INDU FACTOR 1 BETA, HIF-1 BETA; ENGINEERED YES; MUTATION YES:MOL_ID 1; ORGANISM_SCIENTIFIC HOMO SAPIENS; ORGANISM_COMMON HUMAN; ORGANISM_TAXID 9606; GENE EPAS1, HIF2A, HYPOXIA INDUCIBLE FACTOR 2 ALPHA, MOP2; EXPRESSION_SYSTEM ESCHERICHIA COLI; EXPRESSION_SYSTEM_TAXID 562; EXPRESSION_SYSTEM_STRAIN BL21(DE3); EXPRESSION_SYSTEM_VECTOR_TYPE PLASMID; EXPRESSION_SYSTEM_PLASMID PHIS-GB1-HIF2APAS-B; MOL_ID 2; ORGANISM_SCIENTIFIC HOMO SAPIENS; ORGANISM_COMMON HUMAN; ORGANISM_TAXID 9606; GENE ARNT, ARYL HYDROCARBON RECEPTOR NUCLEAR TRANSLOCATOR, EXPRESSION_SYSTEM ESCHERICHIA COLI; EXPRESSION_SYSTEM_TAXID 562; EXPRESSION_SYSTEM_STRAIN BL21(DE3); EXPRESSION_SYSTEM_VECTOR_TYPE PLASMID; EXPRESSION_SYSTEM_PLASMID PHIS-GB1-ARNT-PAS-B: 1.65: 0.20
-FKGLDSKTFLSEHSMDMKFTYCDDRITELIGYHPEELLGR-SAYEFYHALDSENMTKSHQNLCTKGQVVSGQYR
MLAKHGGYVWLETQGTVIY------PQCIMCVNYVLSEIEK.*
>P1;3h82_pasA
structureX:3h82_pas_fit.pdb:-1:A:+115:A:MOL_ID 1; MOLECULE ARYL HYDROCARBON RECEPTOR NUCLEAR TRANSLOCATOR; CHAIN B; FRAGMENT ARNT C-TERMINAL PAS DOMAIN (UNP RESIDUES 356 TO 4 SYNONYM ARNT PROTEIN, CLASS E BASIC HELIX-LOOP-HELIX PROTE BHLHE2, DIOXIN RECEPTOR, NUCLEAR TRANSLOCATOR, HYPOXIA-INDU FACTOR 1 BETA, HIF-1 BETA; ENGINEERED YES; MUTATION YES; MOL_ID 2; MOLECULE ENDOTHELIAL PAS DOMAIN-CONTAINING PROTEIN 1; CHAIN A; FRAGMENT HIF2ALPHA C-TERMINAL PAS DOMAIN (UNP RESIDUES 239 SYNONYM EPAS-1, MEMBER OF PAS PROTEIN 2, BASIC-HELIX-LOOP- PROTEIN MOP2, HYPOXIA-INDUCIBLE FACTOR 2 ALPHA, HIF-2 ALPHA ALPHA, HIF-1 ALPHA-LIKE FACTOR, HLF; ENGINEERED YES; MUTATION YES:MOL_ID 1; ORGANISM_SCIENTIFIC HOMO SAPIENS; ORGANISM_COMMON HUMAN; ORGANISM_TAXID 9606; GENE ARNT, ARYL HYDROCARBON RECEPTOR NUCLEAR TRANSLOCATOR, EXPRESSION_SYSTEM ESCHERICHIA COLI; EXPRESSION_SYSTEM_TAXID 562; EXPRESSION_SYSTEM_STRAIN BL21(DE3); EXPRESSION_SYSTEM_VECTOR_TYPE PLASMID; EXPRESSION_SYSTEM_PLASMID PHIS-GB1-ARNT-PAS-B; MOL_ID 2; ORGANISM_SCIENTIFIC HOMO SAPIENS; ORGANISM_COMMON HUMAN; ORGANISM_TAXID 9606; GENE EPAS1, HIF2A, HYPOXIA INDUCIBLE FACTOR 2 ALPHA, MOP2; EXPRESSION_SYSTEM ESCHERICHIA COLI; EXPRESSION_SYSTEM_TAXID 562; EXPRESSION_SYSTEM_STRAIN BL21(DE3); EXPRESSION_SYSTEM_VECTOR_TYPE PLASMID; EXPRESSION_SYSTEM_PLASMID PHIS-GB1-HIF2APAS-B: 1.50: 0.20
EFKGLDSKTFLSEHSMDMKFTYCDDRITELIGYHPEELLGR-SAYEFYHALDSENMTKSHQNLCTKGQVVSGQYR
MLAKHGGYVWLETQGTVIYNPRNLQPQCIMCVNYVLSEIEK.*
>P1;AhR
sequence:AhR:: :: :::-1.00:-1.00
EIR-TKNFIFRTKHKLDFTPIGCDAKGRIVLGYTEAELCTRGSGYQFIHAADMLYCAESHIRMIKTGESGMIVFR
LLTKNNRWTWVQSNARLLYK--NGRPDYIIVTQRPLTDEEG...*
pap file:
_aln.pos 10 20 30 40 50 60
3f1o_pasA -FKGLDSKTFLSEHSMDMKFTYCDDRITELIGYHPEELLGR-SAYEFYHALDSENMTKSHQNLCTKG
3h7w_pasA -FKGLDSKTFLSEHSMDMKFTYCDDRITELIGYHPEELLGR-SAYEFYHALDSENMTKSHQNLCTKG
3h82_pasA EFKGLDSKTFLSEHSMDMKFTYCDDRITELIGYHPEELLGR-SAYEFYHALDSENMTKSHQNLCTKG
AhR EIR-TKNFIFRTKHKLDFTPIGCDAKGRIVLGYTEAELCTRGSGYQFIHAADMLYCAESHIRMIKTG
_consrvd * * * ** ** ** * * * * ** * ** *
_aln.pos 70 80 90 100 110
3f1o_pasA QVVSGQYRMLAKHGGYVWLETQGTVIYN-----PQCIMCVNYVLSEIEK/.--
3h7w_pasA QVVSGQYRMLAKHGGYVWLETQGTVIY------PQCIMCVNYVLSEIEK/.--
3h82_pasA QVVSGQYRMLAKHGGYVWLETQGTVIYNPRNLQPQCIMCVNYVLSEIEK/.--
AhR ESGMIVFRLLTKNNRWTWVQSNARLLYK--NGRPDYIIVTQRPLTDEEG/...
_consrvd * * * * * * * * *
Best Regards,
Amanda F. Ghilardi