accurate assembly of transcripts through phase-preserving graph decomposition

Mingfu Shao, Carl Kingsford

Research output: Contribution to journalArticle

16 Citations (Scopus)

Abstract

We introduce Scallop, an accurate reference-based transcript assembler that improves reconstruction of multi-exon and lowly expressed transcripts. Scallop preserves long-range phasing paths extracted from reads, while producing a parsimonious set of transcripts and minimizing coverage deviation. On 10 human RNA-seq samples, Scallop produces 34.5% and 36.3% more correct multi-exon transcripts than StringTie and TransComb, and respectively identifies 67.5% and 52.3% more lowly expressed transcripts. Scallop achieves higher sensitivity and precision than previous approaches over a wide range of coverage thresholds.

Original languageEnglish (US)
Pages (from-to)1167-1169
Number of pages3
JournalNature Biotechnology
Volume35
Issue number12
DOIs
StatePublished - Jan 1 2017

Fingerprint

Pectinidae
RNA
Exons
Decomposition

All Science Journal Classification (ASJC) codes

  • Biotechnology
  • Bioengineering
  • Applied Microbiology and Biotechnology
  • Molecular Medicine
  • Biomedical Engineering

Cite this

@article{b4fbf324b11e444fbf628725d7375527,
title = "accurate assembly of transcripts through phase-preserving graph decomposition",
abstract = "We introduce Scallop, an accurate reference-based transcript assembler that improves reconstruction of multi-exon and lowly expressed transcripts. Scallop preserves long-range phasing paths extracted from reads, while producing a parsimonious set of transcripts and minimizing coverage deviation. On 10 human RNA-seq samples, Scallop produces 34.5{\%} and 36.3{\%} more correct multi-exon transcripts than StringTie and TransComb, and respectively identifies 67.5{\%} and 52.3{\%} more lowly expressed transcripts. Scallop achieves higher sensitivity and precision than previous approaches over a wide range of coverage thresholds.",
author = "Mingfu Shao and Carl Kingsford",
year = "2017",
month = "1",
day = "1",
doi = "10.1038/nbt.4020",
language = "English (US)",
volume = "35",
pages = "1167--1169",
journal = "Nature Biotechnology",
issn = "1087-0156",
publisher = "Nature Publishing Group",
number = "12",

}

accurate assembly of transcripts through phase-preserving graph decomposition. / Shao, Mingfu; Kingsford, Carl.

In: Nature Biotechnology, Vol. 35, No. 12, 01.01.2017, p. 1167-1169.

Research output: Contribution to journalArticle

TY - JOUR

T1 - accurate assembly of transcripts through phase-preserving graph decomposition

AU - Shao, Mingfu

AU - Kingsford, Carl

PY - 2017/1/1

Y1 - 2017/1/1

N2 - We introduce Scallop, an accurate reference-based transcript assembler that improves reconstruction of multi-exon and lowly expressed transcripts. Scallop preserves long-range phasing paths extracted from reads, while producing a parsimonious set of transcripts and minimizing coverage deviation. On 10 human RNA-seq samples, Scallop produces 34.5% and 36.3% more correct multi-exon transcripts than StringTie and TransComb, and respectively identifies 67.5% and 52.3% more lowly expressed transcripts. Scallop achieves higher sensitivity and precision than previous approaches over a wide range of coverage thresholds.

AB - We introduce Scallop, an accurate reference-based transcript assembler that improves reconstruction of multi-exon and lowly expressed transcripts. Scallop preserves long-range phasing paths extracted from reads, while producing a parsimonious set of transcripts and minimizing coverage deviation. On 10 human RNA-seq samples, Scallop produces 34.5% and 36.3% more correct multi-exon transcripts than StringTie and TransComb, and respectively identifies 67.5% and 52.3% more lowly expressed transcripts. Scallop achieves higher sensitivity and precision than previous approaches over a wide range of coverage thresholds.

UR - http://www.scopus.com/inward/record.url?scp=85038395558&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85038395558&partnerID=8YFLogxK

U2 - 10.1038/nbt.4020

DO - 10.1038/nbt.4020

M3 - Article

C2 - 29131147

AN - SCOPUS:85038395558

VL - 35

SP - 1167

EP - 1169

JO - Nature Biotechnology

JF - Nature Biotechnology

SN - 1087-0156

IS - 12

ER -