Data driven strategies for Active Monocular SLAM using Inverse Reinforcement Learning

Vignesh Prasad*1    Rishab Jangir*2    Balaraman Ravindran3    K. Madhava Krishna1   

1 IIIT Hyderabad, India    2 Department of Physics, Indian Institute of Technology, Guwahati    3 The Department of Computer Science and Engineering, Indian Institute of Technology Madras   


Learning a complex task such as low-level robot manoeuvres while preventing failure of Monocular SLAM is a challenging problem for both robots and humans. The data-driven identification of basic motion strategies in preventing Monocular SLAM failure is a largely unexplored problem. In this paper, we devise a computational model for representing and inferring strategies, formulated as Markov decision processes, where the reward function models the goal of the task as well as information about the strategy. We show how this reward function can be learnt from expert demonstrations using inverse reinforcement learning. The resulting framework allows one to identify the way in which a few chosen parameters affect the quality of Monocular SLAM estimates. The estimated reward function was able to capture expert demonstration information and the inherent expert strategy and it was possible to give an intuitive explanation to the obtained reward structure. A significant improvement in performance as compared to an intuitive hand-crafted reward function is also shown.