A single-cell RNA-sequencing training and analysis suite using the Galaxy framework

Mehmet Tekman, Berenice Batut, Alexander Ostrovsky, Christophe Antoniewski, Dave Clements, Fidel Ramirez, Graham J. Etherington, Hans Rudolf Hotz, Jelle Scholtalbers, Jonathan R. Manning, Lea Bellenger, Maria A. Doyle, Mohammad Heydarian, Ni Huang, Nicola Soranzo, Pablo Moreno, Stefan Mautner, Irene Papatheodorou, Anton Nekrutenko, James TaylorDaniel James Blankenberg, Rolf Backofen, Bjorn Gruning

Research output: Contribution to journalArticlepeer-review

Abstract

Background: The vast ecosystem of single-cell RNA-sequencing tools has until recently been plagued by an excess of diverging analysis strategies, inconsistent file formats, and compatibility issues between different software suites. The uptake of 10x Genomics datasets has begun to calm this diversity, and the bioinformatics community leans once more towards the large computing requirements and the statistically driven methods needed to process and understand these ever-growing datasets. Results: Here we outline several Galaxy workflows and learning resources for single-cell RNA-sequencing, with the aim of providing a comprehensive analysis environment paired with a thorough user learning experience that bridges the knowledge gap between the computational methods and the underlying cell biology. The Galaxy reproducible bioinformatics framework provides tools, workflows, and trainings that not only enable users to perform 1-click 10x preprocessing but also empower them to demultiplex raw sequencing from custom tagged and full-length sequencing protocols. The downstream analysis supports a range of high-quality interoperable suites separated into common stages of analysis: Inspection, filtering, normalization, confounder removal, and clustering. The teaching resources cover concepts from computer science to cell biology. Access to all resources is provided at the singlecell.usegalaxy.eu portal. Conclusions: The reproducible and training-oriented Galaxy framework provides a sustainable high-performance computing environment for users to run flexible analyses on both 10x and alternative platforms. The tutorials from the Galaxy Training Network along with the frequent training workshops hosted by the Galaxy community provide a means for users to learn, publish, and teach single-cell RNA-sequencing analysis.

Original languageEnglish (US)
Article numbergiaa102
JournalGigaScience
Volume9
Issue number10
DOIs
StatePublished - 2021

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Health Informatics

Fingerprint Dive into the research topics of 'A single-cell RNA-sequencing training and analysis suite using the Galaxy framework'. Together they form a unique fingerprint.

Cite this