NON-PARAMETRIC METHODS IN REINFORCEMENT LEARNING:INSTANCE-OPTIMALITY, ADAPTIVITY AND DATA-DEPENDENT BOUNDS
$531,249FY2022Department of the NavyDOD
Massachusetts Institute Of Technology, Cambridge MA
Massachusetts Institute Of Technology, Cambridge MA