On the significance of an RNA tertiary structure prediction

Christine E. Hajdin, Feng Ding, Nikolay V. Dokholyan, Kevin M. Weeks

Research output: Contribution to journal (Article)

69 Scopus citations

Abstract

Tertiary structure prediction is important for understanding structure-function relationships for RNAs whose structures are unknown and for characterizing RNA states recalcitrant to direct analysis. However, it is unknown what root-mean-square deviation (RMSD) corresponds to a statistically significant RNA tertiary structure prediction. We use discrete molecular dynamics to generate RNA-like folds for structures up to 161 nucleotides (nt) that have complex tertiary interactions and then determine the RMSD distribution between these decoys. These distributions are Gaussian-like. The mean RMSD increases with RNA length and is smaller if secondary structure constraints are imposed while generating decoys. The compactness of RNA molecules with true tertiary folds is intermediate between closely packed spheres and a freely jointed chain. We use this scaling relationship to define an expression relating RMSD with the confidence that a structure prediction is better than that expected by chance. This is the prediction significance, and corresponds to a P-value. For a 100-nt RNA, the RMSD of predicted structures should be within 25 Å of the accepted structure to reach the P ≤ 0.01 level if the secondary structure is predicted de novo and within 14 Å if secondary structure information is used as a constraint. This significance approach should be useful for evaluating diverse RNA structure prediction and molecular modeling algorithms.
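
The link between an RMSD and a P-value described in the abstract can be illustrated with a minimal sketch. It assumes the decoy RMSD distribution is exactly Gaussian, whereas the abstract says only "Gaussian-like". The function name `prediction_p_value` and the parameters `decoy_mean` and `decoy_sd` are hypothetical. The numbers in the usage lines are placeholders, not values from the paper, whose length-dependent scaling expression is not reproduced here.

```python
import math


def prediction_p_value(rmsd, decoy_mean, decoy_sd):
    """One-sided Gaussian tail probability for an observed RMSD.

    Returns the probability that a decoy drawn by chance would have an
    RMSD at or below `rmsd`, assuming decoy RMSDs follow a normal
    distribution with the given mean and standard deviation (both in
    angstroms). This normality is an assumption of the sketch.
    """
    if decoy_sd <= 0:
        raise ValueError("decoy_sd must be positive")
    z = (rmsd - decoy_mean) / decoy_sd
    # Standard normal cumulative distribution function evaluated at z.
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


# Illustrative placeholder values only (not taken from the paper).
example_mean = 30.0  # hypothetical mean decoy RMSD, in angstroms
example_sd = 2.0     # hypothetical standard deviation, in angstroms
p = prediction_p_value(24.0, example_mean, example_sd)
print(f"P-value: {p:.4f}")
```

A smaller RMSD relative to the decoy mean gives a smaller P-value, meaning the prediction is less likely to have been reached by chance.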

Original language: English (US)
Pages (from-to): 1340-1349
Number of pages: 10
Journal: RNA
Volume: 16
Issue number: 7
State: Published - Jul 1 2010


All Science Journal Classification (ASJC) codes

  • Molecular Biology
