Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees @ BNAIC/BeNeLearn 2022

Name: Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees @ BNAIC/BeNeLearn 2022
Start: 2022-11-07T10:42:42+01:00
End: 2022-11-09T10:42:42+01:00
Location: Mechelen, Belgium

Florent Delgrange, Ann Nowé, Guillermo A. Pérez

Abstract

While reinforcement learning (RL) has been applied to a wide range of challenging domains, from game playing to real-world applications such as effective canal control, more widespread deployment in the real world is hampered by the lack of guarantees provided with the learned policies. Although there are RL algorithms which have limit-convergence guarantees in the discrete setting (and even in some continuous settings with function approximation), these are lost when applying more advanced techniques which make use of general nonlinear function approximators to deal with continuous Markov decision processes (MDPs) such as deep-RL. In this work, we apply such advanced RL algorithms to unknown continuous MDPs with (safety constrained) reachability or discounted-reward objectives, and we consider the challenge of simplifying and verifying RL policies. Our goal is to enable model checking by learning an accurate, tractable model of the environment. Extended abstract here.

Date

Nov 7, 2022 — Nov 9, 2022

Event

BNAIC/BeNeLearn 2022: Joint International Scientific Conferences on AI and Machine Learning

Location

Mechelen, Belgium

Reinforcement Learning Formal Methods Representation Learning VAE-MDP WAE-MDP

Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees @ BNAIC/BeNeLearn 2022

Abstract

Related