Deep SPI
Safe Policy Improvement via World Models
Florent Delgrange, Raphaël Avalos, Willem Röpke
Last updated on Mar 3, 2026
Reinforcement Learning, Safe Policy Improvement
This project implements the techniques presented in our paper, Deep SPI: Safe Policy Improvement via World Models.
The source code is available on GitHub.
Related
- Deep SPI: Safe Policy Improvement via World Models
- Composing RL policies, with formal guarantees
- Composing Reinforcement Learning Policies, with Formal Guarantees
- Activating Formal Verification of Deep Reinforcement Learning Policies by Model Checking Bisimilar Latent Space Models
- The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models
