Improvement in Protein Sequence-Structure Alignment Using Insertion/Deletion Frequency Arrays

Kyle Ellrott*, Jun-Tao Guo, Victor Olman, Ying Xu

Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology and Institute of Bioinformatics, The University of Georgia, Athens, Georgia 30602, USA

Proc LSS Comput Syst Bioinform Conf. August, 2007. Vol. 6, p. 335-342. Full-Text PDF

*To whom correspondence should be addressed.


As a protein evolves, not every part of the amino acid sequence has an equal probability of being deleted or for allowing insertions, because not every amino acid plays an equally important role in maintaining the protein structure. However the most prevalent models in fold recognition methods treat every amino acid deletion and insertion as equally probable events. We have analyzed the alignment patterns for homologous and analogous sequences to determine patterns of insertion and deletions, and used that information to determine the statistics of insertions and deletions for different amino acids of a target sequence. We define these patterns as Insertion/Deletion (Indel) Frequency Arrays (IFA). By applying IFA to the protein threading problem, we have been able to improve the alignment accuracy, especially for proteins with low sequence identity.


[CSB2007 Conference Home Page]....[CSB2007 Online Proceedings]....[Life Sciences Society Home Page]