Spatial Relation Graph and Graph Convolutional Network for Object Goal Navigation

D. A. Sasi Kiran*¹ Kritika Anand*² Chaitanya Kharyal∗¹ Gulshan Kumar¹ Nandiraju Gireesh¹ Snehasis Banerjee² Ruddra dev Roychoudhury² Mohan Sridharan³ Brojeshwar Bhowmick² Madhava Krishna¹

¹ Robotics Research Center, IIIT Hyderabad, India ² TCS Research, Tata Consultancy Services ³ Intelligent Robotics Lab, University of Birmingham, UK

This paper describes a framework for the objectgoal navigation task, which requires a robot to find and move to the closest instance of a target object class from a random starting position. The framework uses a history of robot trajectories to learn a Spatial Relational Graph (SRG) and Graph Convolutional Network (GCN)-based embeddings for the likelihood of proximity of different semantically-labeled regions and the occurrence of different object classes in these regions. To locate a target object instance during evaluation, the robot uses Bayesian inference and the SRG to estimate the visible regions, and uses the learned GCN embeddings to rank visible regions and select the region to explore next. This approach is tested using the Matterport3D benchmark dataset of indoor scenes in AI Habitat, a visually realistic simulation environment, to report substantial performance improvement in comparison with state of the art baselines.

Download paper Project page