Modeller Users Group,
I
have two protein sequences. They have already been aligned and the
lengths
are identical . I have a few gaps
in the alignment.
How
do I calculate the percent similarity or percent identity between the two
sequences.
I know I need to use something like
the BLOSUM50 or some other substitution matrix (for similarity).
How
exactly do I do the math so that I end up saying "protein A and protein
B
are 70 % similar and 40 % identical".
Please
note - I want to know how to do the math, step-by-step. I am not
interested
in using a computer program at this
point, even if it is free, because I want to write my own
code, once I understand how to do the
math.
I
have looked in several books, looked all over the Internet, BioPERL, etc,
but
I can not find a clear step-by-step
explanation of how to do the math.
I
would be most grateful if someone knowledgeable could explain this.
Thank
you,
Jim
Metz
James T. Metz, Ph.D.
Research Investigator Chemist
GPRD R46Y AP10-2
Abbott Laboratories
100 Abbott Park Road
Abbott Park, IL 60064-6100
U.S.A.
Office (847) 936 - 0441
FAX (847) 935 - 0548
james.metz@abbott.com
This communication may contain information that is legally privileged,
confidential, or exempt from disclosure. If you are not the intended
recipient, please note that any dissemination, distribution, use, or copying
of this communication is strictly prohibited. Anyone who receives
this message in error should notify the sender immediately by telephone
or return email and delete it from his or her computer.