A space-efficient algorithm for three sequence alignment and ancestor inference

Feng Yue, Jijun Tang

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

We propose a novel algorithm to simultaneously align three biological sequences with affine gap model and infer their common ancestral sequence. It applies the divide-and-conquer strategy to reduce the memory usage from O(n3) to O(n2). At the same time, it is based on dynamic programming and thus the optimal alignment is guaranteed. We implemented the algorithm and tested it extensively with both BAliBASE dataset and simulation data generated by Random Model of Sequence Evolution (ROSE). Compared with other popular multiple sequence alignment tools such as ClustalW and T-Coffee, our program produces not only better alignment, but also better ancestral sequence.

Original languageEnglish (US)
Pages (from-to)192-204
Number of pages13
JournalInternational Journal of Data Mining and Bioinformatics
Volume3
Issue number2
DOIs
StatePublished - 2009

Fingerprint

Sequence Alignment
Coffee
programming
Dynamic programming
simulation
Data storage equipment
Datasets

All Science Journal Classification (ASJC) codes

  • Library and Information Sciences
  • Information Systems
  • Biochemistry, Genetics and Molecular Biology(all)

Cite this

@article{f4edd8c82b144e5390ece0334c80bc49,
title = "A space-efficient algorithm for three sequence alignment and ancestor inference",
abstract = "We propose a novel algorithm to simultaneously align three biological sequences with affine gap model and infer their common ancestral sequence. It applies the divide-and-conquer strategy to reduce the memory usage from O(n3) to O(n2). At the same time, it is based on dynamic programming and thus the optimal alignment is guaranteed. We implemented the algorithm and tested it extensively with both BAliBASE dataset and simulation data generated by Random Model of Sequence Evolution (ROSE). Compared with other popular multiple sequence alignment tools such as ClustalW and T-Coffee, our program produces not only better alignment, but also better ancestral sequence.",
author = "Feng Yue and Jijun Tang",
year = "2009",
doi = "10.1504/IJDMB.2009.024851",
language = "English (US)",
volume = "3",
pages = "192--204",
journal = "International Journal of Data Mining and Bioinformatics",
issn = "1748-5673",
publisher = "Inderscience Enterprises Ltd",
number = "2",

}

A space-efficient algorithm for three sequence alignment and ancestor inference. / Yue, Feng; Tang, Jijun.

In: International Journal of Data Mining and Bioinformatics, Vol. 3, No. 2, 2009, p. 192-204.

Research output: Contribution to journalArticle

TY - JOUR

T1 - A space-efficient algorithm for three sequence alignment and ancestor inference

AU - Yue, Feng

AU - Tang, Jijun

PY - 2009

Y1 - 2009

N2 - We propose a novel algorithm to simultaneously align three biological sequences with affine gap model and infer their common ancestral sequence. It applies the divide-and-conquer strategy to reduce the memory usage from O(n3) to O(n2). At the same time, it is based on dynamic programming and thus the optimal alignment is guaranteed. We implemented the algorithm and tested it extensively with both BAliBASE dataset and simulation data generated by Random Model of Sequence Evolution (ROSE). Compared with other popular multiple sequence alignment tools such as ClustalW and T-Coffee, our program produces not only better alignment, but also better ancestral sequence.

AB - We propose a novel algorithm to simultaneously align three biological sequences with affine gap model and infer their common ancestral sequence. It applies the divide-and-conquer strategy to reduce the memory usage from O(n3) to O(n2). At the same time, it is based on dynamic programming and thus the optimal alignment is guaranteed. We implemented the algorithm and tested it extensively with both BAliBASE dataset and simulation data generated by Random Model of Sequence Evolution (ROSE). Compared with other popular multiple sequence alignment tools such as ClustalW and T-Coffee, our program produces not only better alignment, but also better ancestral sequence.

UR - http://www.scopus.com/inward/record.url?scp=65549148077&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=65549148077&partnerID=8YFLogxK

U2 - 10.1504/IJDMB.2009.024851

DO - 10.1504/IJDMB.2009.024851

M3 - Article

C2 - 19517989

AN - SCOPUS:65549148077

VL - 3

SP - 192

EP - 204

JO - International Journal of Data Mining and Bioinformatics

JF - International Journal of Data Mining and Bioinformatics

SN - 1748-5673

IS - 2

ER -