An effective method for detecting gene conversion events in whole genomes

Chih Hao Hsu, Yu Zhang, Ross Cameron Hardison, Eric D. Green, Webb Miller

Research output: Contribution to journalArticle

9 Citations (Scopus)

Abstract

Gene conversion events are often overlooked in analyses of genome evolution. In a conversion event, an interval of DNA sequence (not necessarily containing a gene) overwrites a highly similar sequence. The event creates relationships among genomic intervals that can confound attempts to identify orthologs and to transfer functional annotation between genomes. Here we examine 1,616,329 paralogous pairs of mouse genomic intervals, and detect conversion events in about 7.5% of them. Properties of the putative gene conversions are analyzed, such as the lengths of the paralogous pairs and the spacing between their sources and targets. Our approach is illustrated using conversion events in primate CCL gene clusters. Source code for our program is included in the 3SEQ-2D package, which is freely available at www.bx.psu.edu/miller-lab.

Original languageEnglish (US)
Pages (from-to)1281-1297
Number of pages17
JournalJournal of Computational Biology
Volume17
Issue number9
DOIs
StatePublished - Sep 1 2010

Fingerprint

Gene Conversion
Genome
Genes
Gene
Multigene Family
Primates
Interval
Genomics
DNA Sequence
DNA sequences
Spacing
Annotation
Mouse
Target

All Science Journal Classification (ASJC) codes

  • Modeling and Simulation
  • Molecular Biology
  • Genetics
  • Computational Theory and Mathematics
  • Computational Mathematics

Cite this

Hsu, Chih Hao ; Zhang, Yu ; Hardison, Ross Cameron ; Green, Eric D. ; Miller, Webb. / An effective method for detecting gene conversion events in whole genomes. In: Journal of Computational Biology. 2010 ; Vol. 17, No. 9. pp. 1281-1297.
@article{9be8a7629daf4fef90287c8a50105d15,
title = "An effective method for detecting gene conversion events in whole genomes",
abstract = "Gene conversion events are often overlooked in analyses of genome evolution. In a conversion event, an interval of DNA sequence (not necessarily containing a gene) overwrites a highly similar sequence. The event creates relationships among genomic intervals that can confound attempts to identify orthologs and to transfer functional annotation between genomes. Here we examine 1,616,329 paralogous pairs of mouse genomic intervals, and detect conversion events in about 7.5{\%} of them. Properties of the putative gene conversions are analyzed, such as the lengths of the paralogous pairs and the spacing between their sources and targets. Our approach is illustrated using conversion events in primate CCL gene clusters. Source code for our program is included in the 3SEQ-2D package, which is freely available at www.bx.psu.edu/miller-lab.",
author = "Hsu, {Chih Hao} and Yu Zhang and Hardison, {Ross Cameron} and Green, {Eric D.} and Webb Miller",
year = "2010",
month = "9",
day = "1",
doi = "10.1089/cmb.2010.0103",
language = "English (US)",
volume = "17",
pages = "1281--1297",
journal = "Journal of Computational Biology",
issn = "1066-5277",
publisher = "Mary Ann Liebert Inc.",
number = "9",

}

An effective method for detecting gene conversion events in whole genomes. / Hsu, Chih Hao; Zhang, Yu; Hardison, Ross Cameron; Green, Eric D.; Miller, Webb.

In: Journal of Computational Biology, Vol. 17, No. 9, 01.09.2010, p. 1281-1297.

Research output: Contribution to journalArticle

TY - JOUR

T1 - An effective method for detecting gene conversion events in whole genomes

AU - Hsu, Chih Hao

AU - Zhang, Yu

AU - Hardison, Ross Cameron

AU - Green, Eric D.

AU - Miller, Webb

PY - 2010/9/1

Y1 - 2010/9/1

N2 - Gene conversion events are often overlooked in analyses of genome evolution. In a conversion event, an interval of DNA sequence (not necessarily containing a gene) overwrites a highly similar sequence. The event creates relationships among genomic intervals that can confound attempts to identify orthologs and to transfer functional annotation between genomes. Here we examine 1,616,329 paralogous pairs of mouse genomic intervals, and detect conversion events in about 7.5% of them. Properties of the putative gene conversions are analyzed, such as the lengths of the paralogous pairs and the spacing between their sources and targets. Our approach is illustrated using conversion events in primate CCL gene clusters. Source code for our program is included in the 3SEQ-2D package, which is freely available at www.bx.psu.edu/miller-lab.

AB - Gene conversion events are often overlooked in analyses of genome evolution. In a conversion event, an interval of DNA sequence (not necessarily containing a gene) overwrites a highly similar sequence. The event creates relationships among genomic intervals that can confound attempts to identify orthologs and to transfer functional annotation between genomes. Here we examine 1,616,329 paralogous pairs of mouse genomic intervals, and detect conversion events in about 7.5% of them. Properties of the putative gene conversions are analyzed, such as the lengths of the paralogous pairs and the spacing between their sources and targets. Our approach is illustrated using conversion events in primate CCL gene clusters. Source code for our program is included in the 3SEQ-2D package, which is freely available at www.bx.psu.edu/miller-lab.

UR - http://www.scopus.com/inward/record.url?scp=77958036731&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77958036731&partnerID=8YFLogxK

U2 - 10.1089/cmb.2010.0103

DO - 10.1089/cmb.2010.0103

M3 - Article

C2 - 20874409

AN - SCOPUS:77958036731

VL - 17

SP - 1281

EP - 1297

JO - Journal of Computational Biology

JF - Journal of Computational Biology

SN - 1066-5277

IS - 9

ER -