Abstract

Learning a complex task such as low-level robot manoeuvres while preventing failure of Monocular SLAM is a challenging problem for both robots and humans. The data-driven identification of basic motion strategies in preventing Monocular SLAM failure is a largely unexplored problem. In this paper, we devise a computational model for representing and inferring strategies, formulated as Markov decision processes, where the reward function models the goal of the task as well as information about the strategy. We show how this reward function can be learnt from expert demonstrations using inverse reinforcement learning. The resulting framework allows one to identify the way in which a few chosen parameters affect the quality of Monocular SLAM estimates. The estimated reward function was able to capture expert demonstration information and the inherent expert strategy and it was possible to give an intuitive explanation to the obtained reward structure. A significant improvement in performance as compared to an intuitive hand-crafted reward function is also shown.



[Paper]
You can also checkout our initial work on using RL to approach the problem here which is accepted at ICVGIP'18.

Contact

vigneshprasad141@gmail.com