modeller_usage

Download

modeller_usage@salilab.org

November 2024

3 participants
2 discussions

Reconstruction of CA position only files with all-atom template
by jingkaizeng 13 Nov '24

13 Nov '24

Hi all, Here is my task: I have some CA position only files of a protein of interested that underlying some important biological pathways. We also have the protein full Cryo-EM structures (pdb:8io4, the exact same protein as well as sequence as the CA only files). Then I would like to fill the keep the CA position only files' CA position (don't move at all) then reconstruct the detail like side chain/secondary structure information from template. So this task is not homology modeling but … [View More]reconstruction. After getting the reconstructed files, I would like to use these into MD simulations. I searched from previous mail list, and I found one did this task but not based on template. https://www.salilab.org/archives/modeller_usage/2008/msg00285.html I tried with " env = Environ() # Load the CA-only seed model seed_model = Model(env, file=seed_file) # Load the template structure (with full atomistic detail) template_model = Model(env, file=template_file) # Create an alignment between the seed and itself (1:1 alignment) aln = Alignment(env) aln.append_model(seed_model, align_codes='seed') aln.append_model(seed_model, align_codes='seed') # Write the alignment for debugging purposes aln.write(file='alignment.ali', alignment_format='PIR') # Define a custom model class to fix the Cα positions class MyModel(automodel): def select_atoms(self): # Select all atoms except the Cα atoms to allow refinement of everything but Cα s = selection(self) return s - s.only_atom_types('CA') # Set up environment for refinement env.edat.nonbonded_sel_atoms = 2 # Disable interactions between selected and unselected atoms a = MyModel(env, alnfile = aln, # Alignment file created earlier knowns = 'seed', # Seed model (CA-only) sequence = 'seed') # Target is also the seed (1:1 alignment) a.starting_model = 1 a.ending_model = 1 a.make() " But this code didn't use the side chain atom from the Cryo-EM structure, and just use CA position to reconstruct the detail. It seems the reconstruction by this code is not reliable enough that cause infinite energy error in MD simulation, it may indicate some overlap or incorrect side-chain orientations, or improper hydrogen placements. Therefore, I would need to try to reconstruct with the side chain information from template Cryo-EM structure's pdb. I tried some methods that use the template Cryo-EM structure, but it seems we just do the homology modeling but NOT reconstruction. The production files we get is almost same as the template without make use of the CA position from CA only files. The code I used like : " # Load the CA-only seed model seed_model = Model(env, file=seed_file) # Load the template structure (with full atomistic detail) template_model = Model(env, file=template_file) # Create an alignment between the seed and the template aln = Alignment(env) aln.append_model(template_model, align_codes='template') aln.append_model(seed_model, align_codes='seed') # Perform the alignment aln.align() # Now defining a custom AutoModel class to apply Cα restraints class MyModel(AutoModel): def special_restraints(self, aln): # Restrain all Cα atoms based on the seed model rsr = self.restraints atmsel = selection(self).only_atom_types('CA') # Apply restraints to the selected Cα atoms for atom in atmsel: rsr.add(forms.Gaussian(group=physical.xy_distance, feature=features.Distance(atom, atom), mean=0.0, # No change in CA positions stdev=0.01)) # Instantiate the custom model class a = MyModel(env, alnfile=aln, knowns='template', sequence='seed') a.starting_model = 1 a.ending_model = 1 a.make() " I think this task can be done since I found a paper (https://doi.org/10.7554/eLife.68369) that do the similar task, they claim to use Modeller to perform this: "First, side chain atoms from the template X-ray structure (PDB ID 4HFI) were added to each model, followed by a cycle of refinement with all Ca atoms restrained. Restraints on Ca atoms were then substituted with restraints on backbone hydrogen bonds, taken from helix and sheet annotations in the template PDB file, for another cycle of refinement to ensure proper secondary structure." Can anyone give some suggestions about this reconstruction task? Thanks. Best, Jingkai [View Less]

2 1

Missing residues modeling in Modeller
by bbmpresi＠gmail.com 04 Nov '24

04 Nov '24

Dear all, I am trying to model the missing residues in a beta barrel shaped protein that contains 12 missing residues in the original pdb. As instructed for the 1qg8 tutorial, I also first obtained the full amino acid sequence of the protein using: from modeller import * # Get the sequence of the 1qg8 PDB file, and write to an alignment file code = 'pdbid' e = Environ() m = Model(e, file=code) aln = Alignment(e) aln.append_model(m, align_codes=code) aln.write(file=code+'.seq') and I am … [View More]obtaining an output file named pdbid.seq The actual protein sequence contains 324 amino acid residues and 12 residues are missing in the pdbid.pdb file This is the alignment.ali file I am preparing for missing residue addition: >P1;pdbid structureX:pdbid:1:A:+312:A:MOL_ID 1; ASDQRGYKP------------GGHVGTSVEYEDKVTRGFNNTDKKEKTITNEVFNFFYNNPQWNFMGFYSFKIENREQKEPGYYENEDGIKQLFSLNKGHDLGNGWATGLIYELEYTRSKVYSPDVSGLRKNLAEHSIRPYLTYWNNDYNMGFYSNLEYLLSKEDRNAWGKRQEQGYSALFKPYKRFGNWEVGVEFYYQIKTNDEKQPDGTINEKSDFNERYIEPIVQYSFDDAGTLYTRVRVGKNETKNTDRSGGGNAGINYFKDIRKATVGYEQSIGESWVAKAEYEYANEVEKKSRLSGWEARNKSELTQHTFYAQALYRF* >P1;pdb_fill sequence::::::::: ASDQRGYKPEDVAFDESFFSFGGHVGTSVEYEDKVTRGFNNTDKKEKTITNEVFNFFYNNPQWNFMGFYSFKIENREQKEPGYYENEDGIKQLFSLNKGHDLGNGWATGLIYELEYTRSKVYSPDVSGLRKNLAEHSIRPYLTYWNNDYNMGFYSNLEYLLSKEDRNAWGKRQEQGYSALFKPYKRFGNWEVGVEFYYQIKTNDEKQPDGTINEKSDFNERYIEPIVQYSFDDAGTLYTRVRVGKNETKNTDRSGGGNAGINYFKDIRKATVGYEQSIGESWVAKAEYEYANEVEKKSRLSGWEARNKSELTQHTFYAQALYRF* But in the template I get residue number written as 312 instead of 324, I directly copied the entry from the pdbid.seq file that was provided by Modeller at the first step that I explained at the beginning. Due to 12 missing residues Modeller is reporting 312 residues in the second line of the alignment.ali file: structureX:pdbid:1:A:+312:A:MOL_ID 1; However, I checked other examples on Modeller tutorial and it seems in some examples the total residue numbers (including the missing residue numbers) are written in the template section for the alignment.ali file, so I could have used structureX:pdbid:1:A:+324:A:MOL_ID 1; writing 324 as total number of residues for the template; so my question is that does it matter if write structureX:pdbid:1:A:+312:A:MOL_ID 1; or structureX:pdbid:1:A:+324:A:MOL_ID 1; I generated a new pdb with all missing residues modelled by Modeller using: from modeller import * from modeller.automodel import * # Load the AutoModel class log.verbose() env = Environ() # directories for input atom files env.io.atom_files_directory = ['.', '../atom_files'] class MyModel(AutoModel): def select_atoms(self): return Selection(self.residue_range('10:A', '21:A')) #a = MyModel(env, alnfile = 'alignment.ali', # knowns = 'pdbid', sequence = 'pdbid_fill') a = AutoModel(env, alnfile = 'alignment.ali', knowns = 'pdbid', sequence = 'pdbid_fill') a.starting_model= 1 a.ending_model = 5 a.make() I checked that using structureX:pdbid:1:A:+312:A:MOL_ID 1; or structureX:pdbid:1:A:+324:A:MOL_ID 1; in the second line of alignment.ali for the template, I am obtaining a very similar or more or less identical modeled structure. Can anybody tell me if the number of residues written in the second line of alignment.ali file for template pdb matters or not? Or only the template sequence and the gaps really matter? Any help would be much appreciated, thank you [View Less]

2 1