Improving the power of structural variation detection by augmenting the reference

Jan Schröder, Santhosh Girirajan, Anthony T. Papenfuss, Paul Medvedev

Research output: Contribution to journalArticlepeer-review

6 Scopus citations


The uses of the Genome Reference Consortium's human reference sequence can be roughly categorized into three related but distinct categories: as a representative species genome, as a coordinate systemfor identifying variants, and as an alignment reference for variation detection algorithms. However, the use of this reference sequence as simultaneously a representative species genome and as an alignment reference leads to unnecessary artifacts for structural variation detection algorithms and limits their accuracy.We show how decoupling these two references and developing a separate alignment reference can significantly improve the accuracy of structural variation detection, lead to improved genotyping of disease related genes, and decrease the cost of studying polymorphismin a population.

Original languageEnglish (US)
Article numbere0136771
JournalPloS one
Issue number8
StatePublished - Aug 31 2015

All Science Journal Classification (ASJC) codes

  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)
  • General


Dive into the research topics of 'Improving the power of structural variation detection by augmenting the reference'. Together they form a unique fingerprint.

Cite this