GGrantIndex
← Search

BUILDING ROBOTS THAT ACT SAFELY AROUND ASTRONAUTS AND AS INTENDED WHEN DEPLOYED WITH LIMITED OPERATOR SUPERVISION DEPENDS ON DESIGNING GOOD REWARD FUNCTIONS FOR THESE ROBOTS TO OPTIMIZE. UNFORTUNATELY DESIGNING REWARD FUNCTIONS IS DIFFICULT AND HENCE REWARDS ARE OFTEN MISSPECIFIED. THIS CAN ELICIT UNDESIRABLE BEHAVIOR AND EVEN CAUSE HARM TO THE ROBOT OR ASTRONAUTS AROUND IT. WE PROPOSE TO DEVELOP ROBOTS THAT ACTIVELY QUERY HUMANS FOR INFORMATION ABOUT THE DESIRED REWARD FUNCTION WHEN NEEDED SO AS TO COMPLETE THEIR TASK SAFELY AND AS INTENDED WITH MINIMAL SUPERVISION. TO THAT END WE WILL DEVELOP AN ACTIVE REWARD DESIGN FRAMEWORK ENABLING THE ROBOT TO QUERY THE HUMAN FOR INFORMATION ABOUT THE DESIRED REWARD THROUGH VARIOUS MODALITIES SUCH AS DEMONSTRATIONS CORRECTIONS PREFERENCES OR FULL REWARD FUNCTIONS. WE PLAN TO EXPERIMENTALLY VALIDATE THIS FRAMEWORK ON BOTH SIMULATED AND REAL ROBOTIC PLATFORMS IN TWO SETTINGS. THE FIRST IS A ROBOT THAT OPERATES IN CLOSE PROXIMITY TO HUMAN ASTRONAUTS SUCH AS ASTROBEE THAT IS SET TO BE DEPLOYED ON THE INTERNATIONAL SPACE STATION. THE SECOND IS A ROBOT SUCH AS THE K-REX PLANETARY ROVER OPERATING ON A DISTANT PLANET SO THAT OPERATOR SUPERVISION IS NECESSARILY LIMITED. IN THIS CASE THE ROBOT WILL LEARN THROUGH A LIMITED NUMBER OF QUERIES CONSTRAINED FOR EXAMPLE BY THE BANDWIDTH AVAILABLE

$233,369FY2020National Aeronautics and Space AdministrationNASA

Regents Of The University Of California, The

Investigators

View source on USAspending →