Markov Decision Processes

Life is Random, Time is Not: Markov Decision Processes with Window Objectives
Simple Strategies in Multi-Objective MDPs
Safe Reinforcement Learning