Sim2Science: ML with Imperfect Scientific Models

ICML 2026 Workshop | Seoul, South Korea

Important Dates

Submission Deadline:	April 24, 2026
Author Notification:	May 15, 2026
Camera-Ready Deadline:	TBD
Workshop Date:	July 2026, Seoul, South Korea

All deadlines are 23:59 Anywhere on Earth (AOE)

About the Workshop

The core tenet of scientific discovery is the construction of knowledge that aligns with experimental observations. This knowledge often takes the form of mechanistic models or simulators, traditionally hand-crafted by domain experts and fit to experimental data. Such simulators underpin predictive simulation, parameter and state inference, uncertainty quantification, and decision-making across the sciences.

Despite their utility, all simulators include simplifications and approximations of reality, inevitably inducing discrepancies between simulated and observed data. In other words, "all models are wrong, some are useful." Understanding and addressing this mismatch is a fundamental challenge for both machine learning and the natural sciences.

The central question of this workshop is: How can we best leverage imperfect scientific simulators when confronted with real-world data, and how can ML help account for and mitigate limitations in simulator-based workflows?

We pursue two complementary aims: (i) to bridge communities across machine learning and the natural sciences by establishing a common understanding of how simulator discrepancies can be identified and quantified across fields; and (ii) to clarify how machine learning methods can be used to learn from, account for, and reduce simulator discrepancies when integrating simulators with real-world data.

Topics of Interest

We invite contributions on theory and methods as well as applications spanning biology, chemistry, physics, materials science, climate science, and related fields. Topics include:

Simulation-based inference and related parameter inference methods
Understanding and mitigating model misspecification, including simulator diagnostics and discrepancy modeling
Emulator and surrogate modeling, as well as hybrid and physics-informed approaches
Analysis of simulator structure, degeneracy, simplifications, and identifiability
Simulator pipelines, including data handling, preprocessing, and integration with downstream ML models
Active learning and Bayesian optimization for fitting parameters or model components
Closed-loop and experiment-in-the-loop scientific workflows
Multi-fidelity and multi-resolution modeling
Model and equation discovery
Differentiable frameworks and LLM-assisted scientific reasoning

Contact

For questions or inquiries about the workshop, please contact us at:
sim2science@gmail.com