Differentiable Physical Modeling Sound Synthesis: Theory, Musical Application, and Programming

About this tutorial

Recent years have witnessed growing interest in bridging traditional sound synthesis methods with emerging machine learning technologies. This tutorial is motivated by the convergence of two previously distinct trajectories in audio research: physics-based sound synthesis and data-driven neural approaches. This session highlights how differentiable physical modeling opens new avenues for musical sound synthesis by combining the interpretability and realism of physical simulation with the learning capacity of modern neural networks. The tutorial is structured into five segments: an overview of digital synthesis history and physical modeling, a detailed introduction to finite difference time domain (FDTD) methods across various instrument classes, a broad survey of neural architectures relevant to physical modeling, an in-depth look at differentiable modeling for parameter estimation using automatic differentiation, and a concluding session to synthesize key takeaways. Attendees will engage with theoretical material, practical demonstrations, and programming exercises, gaining hands-on experience in combining physics-based simulation with neural networks.

This tutorial is designed for researchers and engineers interested in advanced sound synthesis, particularly those working in musical acoustics, AI-based audio modeling, or digital instrument design. It will benefit individuals seeking to build physically plausible audio models or hybrid machine learning systems for realistic sound generation. All ISMIR members are warmly encouraged to attend—whether newcomers or seasoned researchers. The tutorial is designed to be approachable rather than overly technical, while still offering a deep understanding of how differentiable simulation can enhance synthesis fidelity, support neural network training, and advance hybrid sound modeling.

Schedule

Our tutorial will be held on September 21 (all the times are based on KST = South Korea local time). Slides may be subject to updates.

Time	Section	Presenter
09:00—10:00 (60 min)	Part 1: Physics-based Audio: Models, Computation and Parameter Spaces Slides	Stefan Bilbao
10:00—11:00 (60 min)	Part 2: Neural Networks for Physical Modelling Synthesis Slides Notebook	Rodrigo Diaz
11:00—11:10 (10 min)	Q & A Session I
11:10—11:20 (10 min)	Coffee break
11:20—12:10 (50 min)	Part 3: Differentiable Physical Modeling for Parameter Estimation Slides Notebook	Jin Woo Lee
12:10—12:20 (10 min)	Part 4: Closing Remarks Slides	Jin Woo Lee
12:20—12:30 (10 min)	Q & A Session II

BibTeX

@article{ lee-bilbao-diaz-tutorial,
  author  = { Lee, Jin Woo and Bilbao, Stefan and Diaz, Rodrigo },
  title   = { Differentiable Physical Modeling Sound Synthesis: Theory, Musical Application, and Programming },
  journal = { ISMIR 2025 Tutorial },
  year    = { 2025 },
}

ISMIR 2025 Tutorial Differentiable Physical Modeling Sound Synthesis: Theory, Musical Application, and Programming

About this tutorial

Schedule

BibTeX

ISMIR 2025 Tutorial
Differentiable Physical Modeling Sound Synthesis: Theory, Musical Application, and Programming