TY - JOUR
T1 - Exploring the landscape of model representations
AU - Foley, Thomas T.
AU - Kidder, Katherine M.
AU - Scott Shell, M.
AU - Noid, W. G.
N1 - Funding Information:
The authors gratefully acknowledge financial support from the National Science Foundation (Grants MCB-1053970 and CHE-1856337 to W.G.N. and CHE-1800344 to M.S.S.). Portions of this research were conducted with Advanced CyberInfrastructure computational resources provided by The Institute for CyberScience at The Pennsylvania State University (http://ics.psu.edu). In addition, parts of this research were conducted with XSEDE resources awarded by Grant TGCHE170062. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by the National Science Foundation (Grant ACI-1548562). Figs. 1 and 3 employed VMD. VMD is developed with NIH support by the Theoretical and Computational Biophysics group at the Beckman Institute, University of Illinois at Urbana-Champaign.
Funding Information:
ACKNOWLEDGMENTS The authors gratefully acknowledge financial support from the National Science Foundation (Grants MCB-1053970 and CHE-1856337 to W.G.N. and CHE-1800344 to M.S.S.). Portions of this research were conducted with Advanced CyberInfrastructure computational resources provided by The Institute for CyberScience at The Pennsylvania State University (http://ics.psu.edu). In addition, parts of this research were conducted with XSEDE resources awarded by Grant TG-CHE170062. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by the National Science Foundation (Grant ACI-1548562). Figs. 1 and 3 employed VMD. VMD is developed with NIH support by the Theoretical and Computational Biophysics group at the Beckman Institute, University of Illinois at Urbana– Champaign.
Publisher Copyright:
© 2020 National Academy of Sciences. All rights reserved.
PY - 2020/9/29
Y1 - 2020/9/29
N2 - The success of any physical model critically depends upon adopting an appropriate representation for the phenomenon of interest. Unfortunately, it remains generally challenging to identify the essential degrees of freedom or, equivalently, the proper order parameters for describing complex phenomena. Here we develop a statistical physics framework for exploring and quantitatively characterizing the space of order parameters for representing physical systems. Specifically, we examine the space of low-resolution representations that correspond to particle-based coarse-grained (CG) models for a simple microscopic model of protein fluctuations. We employ Monte Carlo (MC) methods to sample this space and determine the density of states for CG representations as a function of their ability to preserve the configurational information, I, and large-scale fluctuations, Q, of the microscopic model. These two metrics are uncorrelated in high-resolution representations but become anticorrelated at lower resolutions. Moreover, our MC simulations suggest an emergent length scale for coarse-graining proteins, as well as a qualitative distinction between good and bad representations of proteins. Finally, we relate our work to recent approaches for clustering graphs and detecting communities in networks.
AB - The success of any physical model critically depends upon adopting an appropriate representation for the phenomenon of interest. Unfortunately, it remains generally challenging to identify the essential degrees of freedom or, equivalently, the proper order parameters for describing complex phenomena. Here we develop a statistical physics framework for exploring and quantitatively characterizing the space of order parameters for representing physical systems. Specifically, we examine the space of low-resolution representations that correspond to particle-based coarse-grained (CG) models for a simple microscopic model of protein fluctuations. We employ Monte Carlo (MC) methods to sample this space and determine the density of states for CG representations as a function of their ability to preserve the configurational information, I, and large-scale fluctuations, Q, of the microscopic model. These two metrics are uncorrelated in high-resolution representations but become anticorrelated at lower resolutions. Moreover, our MC simulations suggest an emergent length scale for coarse-graining proteins, as well as a qualitative distinction between good and bad representations of proteins. Finally, we relate our work to recent approaches for clustering graphs and detecting communities in networks.
UR - http://www.scopus.com/inward/record.url?scp=85092325262&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85092325262&partnerID=8YFLogxK
U2 - 10.1073/pnas.2000098117
DO - 10.1073/pnas.2000098117
M3 - Article
C2 - 32929015
AN - SCOPUS:85092325262
SN - 0027-8424
VL - 117
SP - 24061
EP - 24068
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 39
ER -