Navigating protein landscapes with a machine-learned transferable coarse-grained model

dc.contributor.author
Charron, Nicholas E.
dc.contributor.author
Pérez Culubret, Adrià
dc.contributor.author
Majewski, Maciej
dc.contributor.author
De Fabritiis, Gianni
dc.contributor.author
Clementi, Cecilia
dc.date.accessioned
2025-09-17T14:31:54Z
dc.date.available
2025-09-17T14:31:54Z
dc.date.issued
2025-09-16T05:49:11Z
dc.date.issued
2025-09-16T05:49:11Z
dc.date.issued
2025
dc.identifier
Charron NE, Bonneau K, Pasos-Trejo AS, Guljas A, Chen Y, Musil F, et al. Navigating protein landscapes with a machine-learned transferable coarse-grained model. Nat Chem. 2025 Aug;17(8):1284-92. DOI: 10.1038/s41557-025-01874-0
dc.identifier
1755-4330
dc.identifier
http://hdl.handle.net/10230/71199
dc.identifier
http://dx.doi.org/10.1038/s41557-025-01874-0
dc.identifier.uri
https://hdl.handle.net/10230/71199
dc.description.abstract
The most popular and universally predictive protein simulation models employ all-atom molecular dynamics, but they come at extreme computational cost. The development of a universal, computationally efficient coarse-grained (CG) model with similar prediction performance has been a long-standing challenge. By combining recent deep-learning methods with a large and diverse training set of all-atom protein simulations, we here develop a bottom-up CG force field with chemical transferability, which can be used for extrapolative molecular dynamics on new sequences not used during model parameterization. We demonstrate that the model successfully predicts metastable states of folded, unfolded and intermediate structures, the fluctuations of intrinsically disordered proteins and relative folding free energies of protein mutants, while being several orders of magnitude faster than an all-atom model. This showcases the feasibility of a universal and computationally efficient machine-learned CG model for proteins.
dc.description.abstract
We thank all members of the Clementi and Noé groups for their help in different phases of this work. We gratefully acknowledge funding from the European Commission (grant no. ERC CoG 772230 ‘ScaleCell’), the International Max Planck Research School for Biology and Computation (IMPRS–BAC), the Bundesministerium für Bildung und Forschung BMBF (Berlin Institute for Learning and Data, BIFOLD, and project FAIME 01IS24076), the Berlin Mathematics Center MATH+ (AA1-6, EF1-2) and the Deutsche Forschungsgemeinschaft DFG (NO 825/2, NO 825/3, NO 825/4; SFB/TRR 186, Project A12; SFB 1114, Projects B03, B08 and A04; SFB 1078, Project C7; and RTG 2433, Projects Q05 and Q04), the National Science Foundation (CHE-1900374 and PHY-2019745) and the Einstein Foundation Berlin (Project 0420815101). We gratefully acknowledge the computing time provided on the supercomputer Lise at NHR@ZIB as part of the NHR infrastructure, and on the supercomputer JUWELS operated by the Jülich Supercomputing Centre. We thank volunteers at GPUGRID.net for contributing computational resources and Acellera for funding.
dc.format
application/pdf
dc.format
application/pdf
dc.language
eng
dc.publisher
Nature Research
dc.relation
Nat Chem. 2025 Aug;17(8):1284-92
dc.relation
info:eu-repo/grantAgreement/EC/H2020/772230
dc.rights
© The Author(s) 2025. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
dc.rights
http://creativecommons.org/licenses/by/4.0/
dc.rights
info:eu-repo/semantics/openAccess
dc.subject
Computational biophysics
dc.subject
Computational chemistry
dc.title
Navigating protein landscapes with a machine-learned transferable coarse-grained model
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/publishedVersion


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)