2

A Framework for Flexibly Guiding Learning Agents
Life is Random, Time is Not: Markov Decision Processes with Window Objectives