CSB2009 Prediction of Gene Ontology terms using methods for structured output spaces

Prediction of Gene Ontology terms using methods for structured output spaces

Artem Sokolov, Asa Ben-Hur*

Department of Computer Science, Colorado State University Fort Collins, CO, 80523, USA. asa@cs.colostate.edu

Proc LSS Comput Syst Bioinform Conf. August, 2009. Vol. 8, p. 227-229. Full-Text PDF

*To whom correspondence should be addressed.


Protein function prediction is an active area of research in bioinformatics. And yet, transfer of annotation on the basis of sequence or structural similarity remains widely used as an annotation method. Most of today's machine learning approaches reduce the problem to a collection of binary classification problems: whether a protein performs a particular function, sometimes with a post-processing step to combine the binary outputs. We propose a method that directly predicts a full functional annotation of a protein by modeling the structure of the Gene Ontology hierarchy in the framework of kernel methods for structured-output spaces. Our empirical results show improved performance over a BLAST nearest-neighbor method, and over algorithms that employ a collection of binary classifiers as measured on the Mousefunc benchmark dataset.


[ CSB2009 Conference Home Page ] .... [ CSB2009 Online Proceedings ] .... [ Life Sciences Society Home Page ]