Markov Decision Processes

Life is Random, Time is Not: Markov Decision Processes with Window Objectives
Safe Reinforcement Learning