Using global sequence similarity to enhance biological sequence labeling

Cornelia Caragea, Drena Dobbs, Jivko Sinapov, Vasant Honavar

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Identifying functionally important sites from biological sequences, formulated as a biological sequence labeling problem, has broad applications ranging from rational drug design to the analysis of metabolic and signal transduction networks. In this paper, we present an approach to biological sequence labeling that takes into account the global similarity between biological sequences. Our approach combines unsupervised and supervised learning techniques. Given a set of sequences and a similarity measure defined on pairs of sequences, we learn a mixture of experts model, using spectral clustering to learn the hierarchical structure of the model and Bayesian approaches to combine the predictions of the experts. We evaluate our approach on two important biological sequence labeling problems: RNA-protein and DNA-protein interface prediction. The results of our experiments show that global sequence similarity can be exploited to improve the performance of classifiers trained to label biological sequence data.
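The pipeline the abstract describes (pairwise similarity, spectral clustering, one expert per cluster, combined predictions) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the k-mer cosine similarity, the single two-way spectral split (the paper learns a hierarchy), the per-residue frequency "expert", and the similarity-weighted average that stands in for the Bayesian combination. All function names and the toy sequences are hypothetical.

```python
import math
from collections import Counter

def kmer_similarity(a, b, k=2):
    """Cosine similarity of k-mer counts (assumed stand-in for a global similarity)."""
    ca = Counter(a[i:i + k] for i in range(len(a) - k + 1))
    cb = Counter(b[i:i + k] for i in range(len(b) - k + 1))
    dot = sum(ca[m] * cb[m] for m in ca)
    na = math.sqrt(sum(v * v for v in ca.values()))
    nb = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (na * nb) if na and nb else 0.0

def spectral_bipartition(W, iters=200):
    """Two-way split by the sign of the second eigenvector of D^-1/2 W D^-1/2."""
    n = len(W)
    d = [sum(row) for row in W]              # degrees, assumed strictly positive
    v1 = [math.sqrt(x) for x in d]           # known top eigenvector (eigenvalue 1)
    s = math.sqrt(sum(x * x for x in v1))
    v1 = [x / s for x in v1]
    x = [float(i + 1) for i in range(n)]     # deterministic start vector
    for _ in range(iters):
        # Deflate: remove the component along the top eigenvector.
        proj = sum(a * b for a, b in zip(x, v1))
        x = [a - proj * b for a, b in zip(x, v1)]
        # Power-iteration step with the shifted matrix (M + I).
        y = [x[i] + sum(W[i][j] / math.sqrt(d[i] * d[j]) * x[j] for j in range(n))
             for i in range(n)]
        norm = math.sqrt(sum(a * a for a in y))
        x = [a / norm for a in y]
    return [0 if a >= 0 else 1 for a in x]

def train_expert(seqs, labels):
    """Per-residue estimate of P(label = 1) with add-one smoothing (a toy expert)."""
    pos, tot = Counter(), Counter()
    for seq, lab in zip(seqs, labels):
        for ch, y in zip(seq, lab):
            tot[ch] += 1
            pos[ch] += (y == "1")
    return lambda ch: (pos[ch] + 1) / (tot[ch] + 2)

def predict(query, seqs, labels, clusters):
    """Average the experts, weighting each by the query's mean similarity to its cluster."""
    experts, weights = [], []
    for c in sorted(set(clusters)):
        idx = [i for i, k in enumerate(clusters) if k == c]
        experts.append(train_expert([seqs[i] for i in idx], [labels[i] for i in idx]))
        weights.append(sum(kmer_similarity(query, seqs[i]) for i in idx) / len(idx))
    z = sum(weights)
    # Fall back to uniform weights if the query resembles no cluster at all.
    weights = [w / z for w in weights] if z else [1.0 / len(weights)] * len(weights)
    return [sum(w * e(ch) for w, e in zip(weights, experts)) for ch in query]

# Hypothetical toy data: per-residue labels, "1" marks an interface residue.
train_seqs = ["ACACACAC", "ACACACCA", "GTGTGTGT", "GTGTGTTG"]
train_labels = ["10101010", "10101001", "01010101", "01010110"]

W = [[kmer_similarity(a, b) for b in train_seqs] for a in train_seqs]
clusters = spectral_bipartition(W)
probs = predict("ACACCA", train_seqs, train_labels, clusters)
print(clusters, [round(p, 2) for p in probs])
```

In this sketch the query is scored mostly by the expert trained on the sequences it globally resembles, which is the intuition behind using global similarity to guide residue-level labeling. A realistic setup would use an alignment-based similarity, stronger base classifiers, and a recursive split to obtain a hierarchy.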

Original language: English (US)
Title of host publication: Proceedings - IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008
Pages: 104-111
Number of pages: 8
DOIs: https://doi.org/10.1109/BIBM.2008.54
State: Published - 2008
Event: 2008 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008 - Philadelphia, PA, United States
Duration: Nov 3 2008 to Nov 5 2008

Publication series

Name: Proceedings - IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008

Other

Other: 2008 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008
Country: United States
City: Philadelphia, PA
Period: 11/3/08 to 11/5/08

All Science Journal Classification (ASJC) codes

  • Molecular Biology
  • Information Systems
  • Biomedical Engineering

Cite this

Caragea, C., Dobbs, D., Sinapov, J., & Honavar, V. (2008). Using global sequence similarity to enhance biological sequence labeling. In Proceedings - IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008 (pp. 104-111). [4684880] (Proceedings - IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2008). https://doi.org/10.1109/BIBM.2008.54