TY - JOUR
T1 - Limits in accuracy and a strategy of RNA structure prediction using experimental information
AU - Wang, Jian
AU - Williams, Benfeard
AU - Chirasani, Venkata R.
AU - Krokhotin, Andrey
AU - Das, Rajeshree
AU - Dokholyan, Nikolay V.
N1 - Funding Information:
We gratefully acknowledge the G. Thomas Passananti endowment and U.S. National Institutes of Health [R01 GM123238–01, R01 GM064803–12]. Funding for open access charge: National Institutes of Health [R01 GM123238– 01, R01 GM064803–12]. Conflict of interest statement. None declared.
PY - 2019
Y1 - 2019
N2 - RNA structural complexity and flexibility present a challenge for computational modeling efforts. Experimental information and bioinformatics data can be used as restraints to improve the accuracy of RNA tertiary structure prediction. Regarding utilization of restraints, the fundamental questions are: (i) What is the limit in prediction accuracy that one can achieve with arbitrary number of restraints? (ii) Is there a strategy for selection of the minimal number of restraints that would result in the best structural model? We address the first question by testing the limits in prediction accuracy using native contacts as restraints. To address the second question, we develop an algorithm based on the distance variation allowed by secondary structure (DVASS), which ranks restraints according to their importance to RNA tertiary structure prediction. We find that due to kinetic traps, the greatest improvement in the structure prediction accuracy is achieved when we utilize only 40–60% of the total number of native contacts as restraints. When the restraints are sorted by DVASS algorithm, using only the first 20% ranked restraints can greatly improve the prediction accuracy. Our findings suggest that only a limited number of strategically selected distance restraints can significantly assist in RNA structure modeling.
AB - RNA structural complexity and flexibility present a challenge for computational modeling efforts. Experimental information and bioinformatics data can be used as restraints to improve the accuracy of RNA tertiary structure prediction. Regarding utilization of restraints, the fundamental questions are: (i) What is the limit in prediction accuracy that one can achieve with arbitrary number of restraints? (ii) Is there a strategy for selection of the minimal number of restraints that would result in the best structural model? We address the first question by testing the limits in prediction accuracy using native contacts as restraints. To address the second question, we develop an algorithm based on the distance variation allowed by secondary structure (DVASS), which ranks restraints according to their importance to RNA tertiary structure prediction. We find that due to kinetic traps, the greatest improvement in the structure prediction accuracy is achieved when we utilize only 40–60% of the total number of native contacts as restraints. When the restraints are sorted by DVASS algorithm, using only the first 20% ranked restraints can greatly improve the prediction accuracy. Our findings suggest that only a limited number of strategically selected distance restraints can significantly assist in RNA structure modeling.
UR - http://www.scopus.com/inward/record.url?scp=85068489689&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85068489689&partnerID=8YFLogxK
U2 - 10.1093/nar/gkz427
DO - 10.1093/nar/gkz427
M3 - Article
C2 - 31106330
AN - SCOPUS:85068489689
VL - 47
SP - 5563
EP - 5572
JO - Nucleic Acids Research
JF - Nucleic Acids Research
SN - 0305-1048
IS - 11
ER -