Navigating protein landscapes with a machine-learned transferable coarse-grained model

Mostra el registre complet Registre parcial de l'ítem

  • dc.contributor.author Charron, Nicholas E.
  • dc.contributor.author Pérez Culubret, Adrià
  • dc.contributor.author Majewski, Maciej
  • dc.contributor.author De Fabritiis, Gianni
  • dc.contributor.author Clementi, Cecilia
  • dc.date.accessioned 2025-09-16T05:49:11Z
  • dc.date.available 2025-09-16T05:49:11Z
  • dc.date.issued 2025
  • dc.description.abstract The most popular and universally predictive protein simulation models employ all-atom molecular dynamics, but they come at extreme computational cost. The development of a universal, computationally efficient coarse-grained (CG) model with similar prediction performance has been a long-standing challenge. By combining recent deep-learning methods with a large and diverse training set of all-atom protein simulations, we here develop a bottom-up CG force field with chemical transferability, which can be used for extrapolative molecular dynamics on new sequences not used during model parameterization. We demonstrate that the model successfully predicts metastable states of folded, unfolded and intermediate structures, the fluctuations of intrinsically disordered proteins and relative folding free energies of protein mutants, while being several orders of magnitude faster than an all-atom model. This showcases the feasibility of a universal and computationally efficient machine-learned CG model for proteins.
  • dc.description.sponsorship We thank all members of the Clementi and Noé groups for their help in different phases of this work. We gratefully acknowledge funding from the European Commission (grant no. ERC CoG 772230 ‘ScaleCell’), the International Max Planck Research School for Biology and Computation (IMPRS–BAC), the Bundesministerium für Bildung und Forschung BMBF (Berlin Institute for Learning and Data, BIFOLD, and project FAIME 01IS24076), the Berlin Mathematics Center MATH+ (AA1-6, EF1-2) and the Deutsche Forschungsgemeinschaft DFG (NO 825/2, NO 825/3, NO 825/4; SFB/TRR 186, Project A12; SFB 1114, Projects B03, B08 and A04; SFB 1078, Project C7; and RTG 2433, Projects Q05 and Q04), the National Science Foundation (CHE-1900374 and PHY-2019745) and the Einstein Foundation Berlin (Project 0420815101). We gratefully acknowledge the computing time provided on the supercomputer Lise at NHR@ZIB as part of the NHR infrastructure, and on the supercomputer JUWELS operated by the Jülich Supercomputing Centre. We thank volunteers at GPUGRID.net for contributing computational resources and Acellera for funding.
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Charron NE, Bonneau K, Pasos-Trejo AS, Guljas A, Chen Y, Musil F, et al. Navigating protein landscapes with a machine-learned transferable coarse-grained model. Nat Chem. 2025 Aug;17(8):1284-92. DOI: 10.1038/s41557-025-01874-0
  • dc.identifier.doi http://dx.doi.org/10.1038/s41557-025-01874-0
  • dc.identifier.issn 1755-4330
  • dc.identifier.uri http://hdl.handle.net/10230/71199
  • dc.language.iso eng
  • dc.publisher Nature Research
  • dc.relation.ispartof Nat Chem. 2025 Aug;17(8):1284-92
  • dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/772230
  • dc.rights © The Author(s) 2025. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.rights.uri http://creativecommons.org/licenses/by/4.0/
  • dc.subject.keyword Computational biophysics
  • dc.subject.keyword Computational chemistry
  • dc.title Navigating protein landscapes with a machine-learned transferable coarse-grained model
  • dc.type info:eu-repo/semantics/article
  • dc.type.version info:eu-repo/semantics/publishedVersion