TY - JOUR
T1 - Assembling single-cell genomes and mini-metagenomes from chimeric MDA products
AU - Nurk, Sergey
AU - Bankevich, Anton
AU - Antipov, Dmitry
AU - Gurevich, Alexey A.
AU - Korobeynikov, Anton
AU - Lapidus, Alla
AU - Prjibelski, Andrey D.
AU - Pyshkin, Alexey
AU - Sirotkin, Alexander
AU - Sirotkin, Yakov
AU - Stepanauskas, Ramunas
AU - Clingenpeel, Scott R.
AU - Woyke, Tanja
AU - McLean, Jeffrey S.
AU - Lasken, Roger
AU - Tesler, Glenn
AU - Alekseyev, Max A.
AU - Pevzner, Pavel A.
PY - 2013/10/1
Y1 - 2013/10/1
N2 - Recent advances in single-cell genomics provide an alternative to largely gene-centric metagenomics studies, enabling whole-genome sequencing of uncultivated bacteria. However, single-cell assembly projects are challenging due to (i) the highly nonuniform read coverage and (ii) a greatly elevated number of chimeric reads and read pairs. While recently developed single-cell assemblers have addressed the former challenge, methods for assembling highly chimeric reads remain poorly explored. We present algorithms for identifying chimeric edges and resolving complex bulges in de Bruijn graphs, which significantly improve single-cell assemblies. We further describe applications of the single-cell assembler SPAdes to a new approach for capturing and sequencing "microbial dark matter" that forms small pools of randomly selected single cells (called a mini-metagenome) and further sequences all genomes from the mini-metagenome at once. On single-cell bacterial datasets, SPAdes improves on the recently developed ESC and IDBA-UD assemblers specifically designed for single-cell sequencing. For standard (cultivated monostrain) datasets, SPAdes also improves on A5, ABySS, CLC, EULER-SR, Ray, SOAPdenovo, and Velvet. Thus, recently developed single-cell assemblers not only enable single-cell sequencing, but also improve on conventional assemblers on their own turf. SPAdes is available for free online download under a GPLv2 license.
AB - Recent advances in single-cell genomics provide an alternative to largely gene-centric metagenomics studies, enabling whole-genome sequencing of uncultivated bacteria. However, single-cell assembly projects are challenging due to (i) the highly nonuniform read coverage and (ii) a greatly elevated number of chimeric reads and read pairs. While recently developed single-cell assemblers have addressed the former challenge, methods for assembling highly chimeric reads remain poorly explored. We present algorithms for identifying chimeric edges and resolving complex bulges in de Bruijn graphs, which significantly improve single-cell assemblies. We further describe applications of the single-cell assembler SPAdes to a new approach for capturing and sequencing "microbial dark matter" that forms small pools of randomly selected single cells (called a mini-metagenome) and further sequences all genomes from the mini-metagenome at once. On single-cell bacterial datasets, SPAdes improves on the recently developed ESC and IDBA-UD assemblers specifically designed for single-cell sequencing. For standard (cultivated monostrain) datasets, SPAdes also improves on A5, ABySS, CLC, EULER-SR, Ray, SOAPdenovo, and Velvet. Thus, recently developed single-cell assemblers not only enable single-cell sequencing, but also improve on conventional assemblers on their own turf. SPAdes is available for free online download under a GPLv2 license.
UR - http://www.scopus.com/inward/record.url?scp=84885626275&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84885626275&partnerID=8YFLogxK
U2 - 10.1089/cmb.2013.0084
DO - 10.1089/cmb.2013.0084
M3 - Article
C2 - 24093227
AN - SCOPUS:84885626275
SN - 1066-5277
VL - 20
SP - 714
EP - 737
JO - Journal of Computational Biology
JF - Journal of Computational Biology
IS - 10
ER -