Created on November 04, 2025
Our latest work on the optimality of RL under safety filtering has been posted on arXiv!