Modeling RNA secondary structure folding ensembles using SHAPE mapping data

Aleksandar Spasic, Sarah Mary Assmann, Philip C. Bevilacqua, David H. Mathews

Research output: Contribution to journal › Article

13 Citations (Scopus)

Abstract

RNA secondary structure prediction is widely used for developing hypotheses about the structures of RNA sequences, and structure can provide insight about RNA function. The accuracy of structure prediction is known to be improved using experimental mapping data that provide information about the pairing status of single nucleotides, and these data can now be acquired for whole transcriptomes using high-throughput sequencing. Prior methods for using these experimental data focused on predicting structures for sequences assuming that they populate a single structure. Most RNAs populate multiple structures, however, where the ensemble of strands populates structures with different sets of canonical base pairs. The focus on modeling single structures has been a bottleneck for accurately modeling RNA structure. In this work, we introduce Rsample, an algorithm for using experimental data to predict more than one RNA structure for sequences that populate multiple structures at equilibrium. We demonstrate, using SHAPE mapping data, that we can accurately model RNA sequences that populate multiple structures, including the relative probabilities of those structures. This program is freely available as part of the RNAstructure software package.
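The abstract refers to sequences that populate multiple structures at equilibrium and to the relative probabilities of those structures. As a minimal illustration of what such relative probabilities mean, the sketch below applies standard Boltzmann weighting to folding free energies. It is not the Rsample algorithm and makes no use of SHAPE data; the function name, the structure labels, and the free energy values are hypothetical and chosen only for illustration.

```python
import math

# Gas constant in kcal/(mol*K); temperature defaults to 37 C (310.15 K).
R_KCAL = 0.0019872


def boltzmann_probabilities(free_energies, temperature_k=310.15):
    """Return equilibrium probabilities for a set of structures.

    free_energies maps a structure label to its folding free energy
    in kcal/mol. This is textbook Boltzmann weighting, not Rsample.
    """
    rt = R_KCAL * temperature_k
    # Shift by the lowest free energy so the exponentials stay well scaled.
    g_min = min(free_energies.values())
    weights = {
        name: math.exp(-(g - g_min) / rt)
        for name, g in free_energies.items()
    }
    partition = sum(weights.values())
    return {name: w / partition for name, w in weights.items()}


# Hypothetical two-structure ensemble (values are illustrative only).
example = {"structure_A": -10.0, "structure_B": -9.5}
probs = boltzmann_probabilities(example)
for name, p in probs.items():
    print(f"{name}: {p:.3f}")
```

Under this weighting, the structure with the lower free energy is the more populated one, and two structures with equal free energies are equally populated.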

Original language: English (US)
Pages (from-to): 314-323
Number of pages: 10
Journal: Nucleic Acids Research
Volume: 46
Issue number: 1
DOI: 10.1093/nar/gkx1057
State: Published - Jan 1 2018

Fingerprint

RNA
Transcriptome
Base Pairing
Software
Nucleotides

All Science Journal Classification (ASJC) codes

  • Genetics

Cite this

@article{7d70d97494bc46a3a301ac3546dcdc43,
title = "Modeling RNA secondary structure folding ensembles using SHAPE mapping data",
abstract = "RNA secondary structure prediction is widely used for developing hypotheses about the structures of RNA sequences, and structure can provide insight about RNA function. The accuracy of structure prediction is known to be improved using experimental mapping data that provide information about the pairing status of single nucleotides, and these data can now be acquired for whole transcriptomes using high-throughput sequencing. Prior methods for using these experimental data focused on predicting structures for sequences assuming that they populate a single structure. Most RNAs populate multiple structures, however, where the ensemble of strands populates structures with different sets of canonical base pairs. The focus on modeling single structures has been a bottleneck for accurately modeling RNA structure. In this work, we introduce Rsample, an algorithm for using experimental data to predict more than one RNA structure for sequences that populate multiple structures at equilibrium. We demonstrate, using SHAPE mapping data, that we can accurately model RNA sequences that populate multiple structures, including the relative probabilities of those structures. This program is freely available as part of the RNAstructure software package.",
author = "Spasic, Aleksandar and Assmann, {Sarah Mary} and Bevilacqua, {Philip C.} and Mathews, {David H.}",
year = "2018",
month = "1",
day = "1",
doi = "10.1093/nar/gkx1057",
language = "English (US)",
volume = "46",
pages = "314--323",
journal = "Nucleic Acids Research",
issn = "0305-1048",
publisher = "Oxford University Press",
number = "1"
}

Modeling RNA secondary structure folding ensembles using SHAPE mapping data. / Spasic, Aleksandar; Assmann, Sarah Mary; Bevilacqua, Philip C.; Mathews, David H.

In: Nucleic Acids Research, Vol. 46, No. 1, 01.01.2018, p. 314-323.

TY - JOUR

T1 - Modeling RNA secondary structure folding ensembles using SHAPE mapping data

AU - Spasic, Aleksandar

AU - Assmann, Sarah Mary

AU - Bevilacqua, Philip C.

AU - Mathews, David H.

PY - 2018/1/1

Y1 - 2018/1/1

AB - RNA secondary structure prediction is widely used for developing hypotheses about the structures of RNA sequences, and structure can provide insight about RNA function. The accuracy of structure prediction is known to be improved using experimental mapping data that provide information about the pairing status of single nucleotides, and these data can now be acquired for whole transcriptomes using high-throughput sequencing. Prior methods for using these experimental data focused on predicting structures for sequences assuming that they populate a single structure. Most RNAs populate multiple structures, however, where the ensemble of strands populates structures with different sets of canonical base pairs. The focus on modeling single structures has been a bottleneck for accurately modeling RNA structure. In this work, we introduce Rsample, an algorithm for using experimental data to predict more than one RNA structure for sequences that populate multiple structures at equilibrium. We demonstrate, using SHAPE mapping data, that we can accurately model RNA sequences that populate multiple structures, including the relative probabilities of those structures. This program is freely available as part of the RNAstructure software package.

UR - http://www.scopus.com/inward/record.url?scp=85045194833&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85045194833&partnerID=8YFLogxK

U2 - 10.1093/nar/gkx1057

DO - 10.1093/nar/gkx1057

M3 - Article

VL - 46

SP - 314

EP - 323

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 0305-1048

IS - 1

ER -