Re: [modeller_usage] Error running build_profile.py :Not enough bins in histogram

Dear Modeller users,

I am trying to search for the best template for my target molecule, and therefore I created a file in PIR format with my sequences and then

I ran build_profile.py. I got the following error in my log file:

MODELLER 9.10, 2011/09/28, r8346

PROTEIN STRUCTURE MODELLING BY SATISFACTION OF SPATIAL RESTRAINTS

Copyright(c) 1989-2011 Andrej Sali
All Rights Reserved

Written by A. Sali
with help from

B. Webb, M.S. Madhusudhan, M-Y. Shen, M.A. Marti-Renom,
N. Eswar, F. Alber, M. Topf, B. Oliva, A. Fiser,
R. Sanchez, B. Yerkovich, A. Badretdinov,

F. Melo, J.P. Overington, E. Feyfant
University of California, San Francisco, USA
Rockefeller University, New York, USA
Harvard University, Cambridge, USA

Imperial Cancer Research Fund, London, UK
Birkbeck College, University of London, London, UK

Kind, OS, HostName, Kernel, Processor: 4, Linux pan 3.0.0-12-generic x86_64

Date and time of compilation : 2011/09/28 17:58:44
MODELLER executable type : x86_64-intel8
Job starting time (YY/MM/DD HH:MM:SS): 2012/02/29 16:37:57

openf___224_> Open $(LIB)/restyp.lib
openf___224_> Open ${MODINSTALL9v10}/modlib/resgrp.lib
rdresgr_266_> Number of residue groups: 2
openf___224_> Open ${MODINSTALL9v10}/modlib/sstruc.lib

Dynamically allocated memory at amaxlibraries [B,KiB,MiB]: 3234076 3158.277 3.084

Dynamically allocated memory at amaxlibraries [B,KiB,MiB]: 3234604 3158.793 3.085

openf___224_> Open ${MODINSTALL9v10}/modlib/resdih.lib

Dynamically allocated memory at amaxlibraries [B,KiB,MiB]: 3283204 3206.254 3.131
rdrdih__263_> Number of dihedral angle types : 9

Maximal number of dihedral angle optima: 3
Dihedral angle names : Alph Phi Psi Omeg chi1 chi2 chi3 chi4 chi5
openf___224_> Open ${MODINSTALL9v10}/modlib/radii.lib

Dynamically allocated memory at amaxlibraries [B,KiB,MiB]: 3292444 3215.277 3.140
openf___224_> Open ${MODINSTALL9v10}/modlib/radii14.lib
openf___224_> Open ${MODINSTALL9v10}/modlib/af_mnchdef.lib

rdwilmo_274_> Mainchain residue conformation classes: APBLE
openf___224_> Open ${MODINSTALL9v10}/modlib/mnch.lib
rdclass_257_> Number of classes: 5
openf___224_> Open ${MODINSTALL9v10}/modlib/mnch1.lib

openf___224_> Open ${MODINSTALL9v10}/modlib/mnch2.lib
openf___224_> Open ${MODINSTALL9v10}/modlib/mnch3.lib
openf___224_> Open ${MODINSTALL9v10}/modlib/xs4.mat

rdrrwgh_268_> Number of residue types: 21
openf___224_> Open pdb.pir

Dynamically allocated memory at amaxsequence_db [B,KiB,MiB]: 3293057 3215.876 3.141

Dynamically allocated memory at amaxalignment [B,KiB,MiB]: 3294891 3217.667 3.142

Dynamically allocated memory at amaxalignment [B,KiB,MiB]: 3296341 3219.083 3.144

Dynamically allocated memory at amaxalignment [B,KiB,MiB]: 3299241 3221.915 3.146

Dynamically allocated memory at amaxalignment [B,KiB,MiB]: 3305041 3227.579 3.152

Dynamically allocated memory at amaxsequence_db [B,KiB,MiB]: 3310040 3232.461 3.157

SEQ_DATABASE_FILE : pdb.pir
SEQ_DATABASE_FORMAT : PIR

CHAINS_LIST : ALL
CLEAN_SEQUENCES : T
MINMAX_DB_SEQ_LEN : 30 4000
Number of sequences : 4
Number of residues : 1060

Length of longest sequence: 271

SEQ_DATABASE_FILE : pdb_3.bin
SEQ_DATABASE_FORMAT : BINARY
SEARCH_CHAINS_LIST : ALL
Number of sequences : 4
Number of residues : 1060
Length of longest sequence: 271

SEQ_DATABASE_FILE : pdb_3.bin

SEQ_DATABASE_FORMAT : BINARY
CHAINS_LIST : ALL
CLEAN_SEQUENCES : T
MINMAX_DB_SEQ_LEN : 0 999999
Number of sequences : 4

Number of residues : 1060
Length of longest sequence: 271

openf___224_> Open chst11.ali

Dynamically allocated memory at amaxalignment [B,KiB,MiB]: 3300046 3222.701 3.147

Dynamically allocated memory at amaxalignment [B,KiB,MiB]: 3301496 3224.117 3.149

Dynamically allocated memory at amaxalignment [B,KiB,MiB]: 3304396 3226.949 3.151

Dynamically allocated memory at amaxalignment [B,KiB,MiB]: 3310196 3232.613 3.157

Dynamically allocated memory at amaxsequence [B,KiB,MiB]: 3311600 3233.984 3.158

Read the alignment from file : chst11.ali
Total number of alignment positions: 352

# Code #_Res #_Segm PDB_code Name
-------------------------------------------------------------------------------

1 chst11 352 1 chst11

Dynamically allocated memory at amaxprofile [B,KiB,MiB]: 3313325 3235.669 3.160
openf___224_> Open ${LIB}/blosum62.sim.mat

rdrrwgh_268_> Number of residue types: 21
profile_iteration_> processing sequence: 1 352 1 0.0100000 0.0100000 0.0100000 1
regress_657E> Not enough bins in histogram - cannot calculate statistics, nbins: 1

My build_profile.py file is the following:

from modeller import *

log.verbose()
env = environ()

#-- Prepare the input files

#-- Read in the sequence database
sdb = sequence_db(env)
sdb.read(seq_database_file='pdb.pir', seq_database_format='PIR',
chains_list='ALL', minmax_db_seq_len=(30, 4000), clean_sequences=True)

#-- Write the sequence database in binary form
sdb.write(seq_database_file='pdb_3.bin', seq_database_format='BINARY',
chains_list='ALL')

#-- Now, read in the binary database
sdb.read(seq_database_file='pdb_3.bin', seq_database_format='BINARY',
chains_list='ALL')

#-- Read in the target sequence/alignment

aln = alignment(env)
aln.append(file='chst11.ali', alignment_format='PIR', align_codes='ALL')

#-- Convert the input sequence/alignment into
# profile format

prf = aln.to_profile()

#-- Scan sequence database to pick up homologous sequences
prf.build(sdb, matrix_offset=-450, rr_file='${LIB}/blosum62.sim.mat',
gap_penalties_1d=(-500, -50), n_prof_iterations=10,

check_profile=True, max_aln_evalue=0.01)

#-- Write out the profile in text format
prf.write(file='build_profile.prf', profile_format='TEXT')

#-- Convert the profile back to alignment format
aln = prf.to_alignment()

#-- Write out the alignment file
aln.write(file='build_profile.ali', alignment_format='PIR')

If anyone could help me with this I would be grateful.

Many thanks in advance,

Susana