Hi Ben, On Sep 12, 2012, at 5:28 PM, Modeller Caretaker wrote:
There are a couple of things that I'm still missing: 1. I still want to use B.pdb as my initial model and only change those residues in my selected residue_range. When I specify B.pdb as my inifile, Modeller kicks out an error: _modeller.ModellerError: rdpdb___303E> No atoms were read from the specified input PDB file, since the starting residue number and/or chain id in MODEL_SEGMENT (or the alignment file header) was not found; requested starting position: residue number " ", chain " "; atom file name: B.pdb I'm not sure where I'm suppose to be specifying the starting position. If I remove the inifile argument to the model instantiation, then everything runs fine, but it builds the model for all of 'C', rather than just 'B' plus the four missing residues of interest (see #2). 2. If 'C' is the complete sequence that fills in all of the gaps in 'A' and 'B', how do I stop Modeller from building in residues outside of my selected residue_range, ensuring that my output model is just B.pdb unchanged, except for the 4 residues that are built from A.pdb? Should I be changing the aligned sequence for A to be all dashes, except for the 4 residues that are missing in B? In this case what should the start and ending residues be in the alignment file in the fields after structureX? Should C by identical to B, except the 4 residues marked with dashes should be filled in? So basically if A and B are: >P1;A structureX:A: 1 :A:149 :A:::-1.00:-1.00 ---------FIRIMVFAIYVALPIGV--------------------AGKSLPWYAVGASLIAANISAEQFIGGMS SAYSIGLAIASYRWMSALTLIIVGKYFLPIFIEKGIYTIPEFVRKRSNKKLLTILAVFWISLYIFVNLTSVLYL* >P1;B structureX:B:1 :A:149 :A:::-1.00:-1.00 --------SFIRIMVFAIYVALPIGVGLWV-------------------SLPWYAVGASLIAANISAEQFIGGMS SAYSIGLAIASYRWMSALTLIIVGKYFLPIFIEKGIYTIPEFVRKRS----LTILAVFWISLYIFVNLTSVLYL* should C be: >P1;C structureX:A: 1 :A:149 :A:::-1.00:-1.00 AAAAAAAAAFIRIMVFAIYVALPIGVAAAAAAAAAAAAAAAAAAAAAGKSLPWYAVGASLIAANISAEQFIGGMS SAYSIGLAIASYRWMSALTLIIVGKYFLPIFIEKGIYTIPEFVRKRSNKKLLTILAVFWISLYIFVNLTSVLYL* or >P1;C structureX:B:1 :A:149 :A:::-1.00:-1.00 --------SFIRIMVFAIYVALPIGVGLWV-------------------SLPWYAVGASLIAANISAEQFIGGMS SAYSIGLAIASYRWMSALTLIIVGKYFLPIFIEKGIYTIPEFVRKRSNKKLLTILAVFWISLYIFVNLTSVLYL* Also should A alternatively be: >P1;A structureX:A: 1 :A:149 :A:::-1.00:-1.00 --------------------------------------------------------------------------- -----------------------------------------------NKKL-----------------------* (or maybe with a few extra surrounding residues?). If A is the latter should the start and end residues in the header fields just encompass the `NKKL` (123-126 in the entire sequence)? I've played around with some of these combinations, but when I actually get things to run, it's building residues 123-126 not in terms of the the absolute position in the sequence, but the relative position in terms of how many residues are present in the PDB. Should my sequences include the chain break character (/) somewhere? Thanks in advance for helping to clarify these issues. Josh
|