An exact nonparametric method for inferring mosaic structure in sequence triplets

Maciej F. Boni, David Posada, Marcus W. Feldman

Research output: Contribution to journal, Article, peer-review

480 Scopus citations

Abstract

Statistical tests for detecting mosaic structure or recombination among nucleotide sequences usually rely on identifying a pattern or a signal that would be unlikely to appear under clonal reproduction. Dozens of such tests have been described, but many are hampered by long running times, confounding of selection and recombination, and/or inability to isolate the mosaic-producing event. We introduce a test that is exact, nonparametric, rapidly computable, free of the infinite-sites assumption, able to distinguish between recombination and variation in mutation/fixation rates, and able to identify the breakpoints and sequences involved in the mosaic-producing event. Our test considers three sequences at a time: two parent sequences that may have recombined, with one or two breakpoints, to form the third sequence (the child sequence). Excess similarity of the child sequence to a candidate recombinant of the parents is a sign of recombination; we take the maximum value of this excess similarity as our test statistic Δ_{m,n,b}. We present a method for rapidly calculating the distribution of Δ_{m,n,b} and demonstrate that it has comparable power to and a much improved running time over previous methods, especially in detecting recombination in large data sets.
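
The triplet idea in the abstract can be illustrated with a small sketch. It assumes one particular reading of "excess similarity" that the abstract does not spell out: each site where the two parents differ and the child matches exactly one of them contributes a step of +1 (matches parent P) or -1 (matches parent Q), and the statistic is the largest drop of the resulting cumulative walk from an earlier peak. The function names `informative_steps` and `max_descent` are hypothetical, the sequences are assumed to be pre-aligned and of equal length, and the sketch does not compute the exact null distribution or a p-value.

```python
def informative_steps(parent_p, parent_q, child):
    """Return +1 where the child matches only P, -1 where it matches only Q.

    Assumption: sites where the parents agree, or where the child matches
    neither parent, carry no information and are skipped.
    """
    steps = []
    for p, q, c in zip(parent_p, parent_q, child):
        if p == q:
            continue  # parents agree: uninformative site
        if c == p:
            steps.append(1)
        elif c == q:
            steps.append(-1)
    return steps


def max_descent(steps):
    """Largest drop from a running peak to any later point of the walk."""
    height = peak = best = 0
    for s in steps:
        height += s
        peak = max(peak, height)
        best = max(best, peak - height)
    return best


# Toy data (made up for illustration): the child copies P, then switches to Q.
P = "AAAAAAAA"
Q = "CCCCCCCC"
mosaic_child = "AAAACCCC"
mixed_child = "ACACACAC"

print(max_descent(informative_steps(P, Q, mosaic_child)))  # 4
print(max_descent(informative_steps(P, Q, mixed_child)))   # 1
```

In this toy case the child that switches parents once produces a long uninterrupted descent, while the child whose matches alternate between parents does not. Judging whether an observed value is unusually large would require the null distribution that the paper describes.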

Original language: English (US)
Pages (from-to): 1035-1047
Number of pages: 13
Journal: Genetics
Volume: 176
Issue number: 2
State: Published - Jun 2007

All Science Journal Classification (ASJC) codes

  • Genetics
