Sim2Science: ML with Imperfect Scientific Models

ICML 2026 Workshop | Seoul, South Korea

Important Dates

Submission Deadline:April 24, 2026
Author Notification:May 15, 2026
Camera-Ready Deadline:TBD
Workshop Date:July 2026, Seoul, South Korea

All deadlines are 23:59 Anywhere on Earth (AOE)

About the Workshop

The core tenet of scientific discovery is the construction of knowledge that aligns with experimental observations. This knowledge often takes the form of mechanistic models or simulators, traditionally hand-crafted by domain experts and fit to experimental data. Such simulators underpin predictive simulation, parameter and state inference, uncertainty quantification, and decision-making across the sciences.

Despite their utility, all simulators include simplifications and approximations of reality, inevitably inducing discrepancies between simulated and observed data. In other words, "all models are wrong, some are useful." Understanding and addressing this mismatch is a fundamental challenge for both machine learning and the natural sciences.

The central question of this workshop is: How can we best leverage imperfect scientific simulators when confronted with real-world data, and how can ML help account for and mitigate limitations in simulator-based workflows?

We pursue two complementary aims: (i) to bridge communities across machine learning and the natural sciences by establishing a common understanding of how simulator discrepancies can be identified and quantified across fields; and (ii) to clarify how machine learning methods can be used to learn from, account for, and reduce simulator discrepancies when integrating simulators with real-world data.

Topics of Interest

We invite contributions on theory and methods as well as applications spanning biology, chemistry, physics, materials science, climate science, and related fields. Topics include:

  • Simulation-based inference and related parameter inference methods
  • Understanding and mitigating model misspecification, including simulator diagnostics and discrepancy modeling
  • Emulator and surrogate modeling, as well as hybrid and physics-informed approaches
  • Analysis of simulator structure, degeneracy, simplifications, and identifiability
  • Simulator pipelines, including data handling, preprocessing, and integration with downstream ML models
  • Active learning and Bayesian optimization for fitting parameters or model components
  • Closed-loop and experiment-in-the-loop scientific workflows
  • Multi-fidelity and multi-resolution modeling
  • Model and equation discovery
  • Differentiable frameworks and LLM-assisted scientific reasoning

Contact

For questions or inquiries about the workshop, please contact us at:
sim2science@gmail.com