Paper Summary

Citation: Bigi, F., Langer, M. F., & Ceriotti, M. (2025). The dark side of the forces: assessing non-conservative force models for atomistic machine learning. Proceedings of the 42nd International Conference on Machine Learning (ICML).

Publication: ICML 2025

What kind of paper is this?

This is a “big idea” and analysis paper. It challenges the growing practice of sacrificing a fundamental physical principle—energy conservation—in Machine Learning Interatomic Potentials (MLIPs) for the sake of computational efficiency. The paper systematically investigates the practical consequences of this choice and proposes a pragmatic path forward.

What is the motivation?

The motivation addresses a concerning trend in Machine Learning Interatomic Potentials (MLIPs): many recent architectures predict interatomic forces directly rather than computing them as the negative gradient of a learned potential energy ($F \neq -\nabla E$). This “non-conservative” approach avoids the computational overhead of automatic differentiation, leading to faster model inference. However, this breaks the fundamental physical principle of energy conservation, and the practical consequences for simulations were not well understood. The authors seek to rigorously assess whether the speed gains justify violating basic physics.

What is the novelty here?

The novelty is the comprehensive demonstration that non-conservative force models are fundamentally unsuitable for simulations, coupled with a practical solution. The key contributions are:

  1. Systematic Identification of Failure Modes: The first rigorous demonstration that non-conservative forces lead to catastrophic failures in common simulations, including unphysical runaway heating in constant-energy (NVE) molecular dynamics and violation of energy equipartition in constant-temperature (NVT) simulations, where different atom types equilibrate to different temperatures.

  2. Quantitative Non-Conservation Metric: They propose using the asymmetry of the force Jacobian ($J_{i\alpha, j\beta} = \partial f_{i\alpha} / \partial r_{j\beta}$) as a direct measure of a model’s deviation from conservative behavior.

  3. Hybrid Solution Framework: Instead of dismissing direct force prediction entirely, the authors propose using it to accelerate, rather than replace, physically correct conservative models:

    • Training Acceleration: Pre-training rapidly with a direct-force head, then fine-tuning to produce conservative forces via backpropagation, significantly reducing total training time.
    • Inference Acceleration: Using the hybrid model in Multiple Time-Stepping (MTS) simulations, where fast non-conservative forces integrate the equations of motion and are corrected periodically by slower, exact conservative forces.

What experiments were performed?

The authors performed systematic computational experiments comparing conservative and non-conservative models:

  1. Model Comparison: Experiments primarily used a bulk liquid water dataset, training custom conservative and non-conservative versions of the PET architecture (PET-C and PET-NC). They also tested several widely-used pre-trained models (ORB, MACE, SevenNet, Equiformer) on water and other systems (graphene, amorphous carbon, aluminum).

  2. Molecular Dynamics Simulations:

    • NVE Ensemble: Simulations without thermostats to check energy conservation, measuring kinetic temperature drift over time.
    • NVT Ensemble: Simulations with local (Langevin) and global (SVR) thermostats at varying coupling strengths, analyzing average temperature, atom-type-resolved temperatures, structural correlations, and dynamical properties.
  3. Geometry Optimization: Comparison of optimization algorithm stability and convergence (FIRE and L-BFGS) when driven by conservative versus non-conservative forces.

  4. Hybrid Model Evaluation: Implementation and benchmarking of the proposed accelerated training (pre-train then fine-tune) and inference (MTS) schemes to demonstrate their effectiveness.

What were the outcomes and conclusions drawn?

  • Non-Conservative Models Are Fundamentally Flawed: Purely non-conservative models, regardless of test-set accuracy, are unreliable for simulations. They produce constant, unphysical energy drift in NVE simulations (runaway heating).

  • Thermostats Cannot Fix the Problem: While aggressive thermostats can control average temperature in NVT simulations, they destroy the system’s natural dynamics and cannot fix subtle artifacts like different chemical species heating at different rates.

  • Geometry Optimization Suffers: Non-conservative forces make geometry optimization less stable and can prevent convergence to well-defined minima.

  • Hybrid Approach Works: The proposed framework combining fast direct-force prediction with correct conservative models provides both speed and physical correctness. This allows researchers to harness the efficiency of non-conservative models for training and inference (via MTS) while maintaining the physical reliability essential for scientific simulations.

  • Key Takeaway: Direct force prediction is powerful but should augment, not replace, conservative models. The speed benefits of non-conservative approaches can be captured without sacrificing physical correctness.


Note: This is a personal learning note and may be incomplete or evolving.