Naman Kumar*1 Antareep Singha*1 Laksh Nanwani*1 Dhruv Potdar1 Tarun R1 Fatemeh Rastgar2 Simon Idoko2 Arun Kumar Singh2 K Madhava Krishna1
1 Robotics Research Center, IIIT Hyderabad, India 2 University of Tartu, Estonia
Navigation amongst densely packed crowds remains a challenge for mobile robots. The complexity increases further if the environment layout changes making the prior computed global plan infeasible. In this paper, we show that it is possible to dramatically enhance crowd navigation by just improving the local planner. Our approach combines generative modelling with inference time optimization to generate sophisticated long-horizon local plans at interactive rates. More specifically, we train a Vector Quantized Variational AutoEncoder to learn a prior over the expert trajectory distribution conditioned on the perception input. At run-time, this is used as an initialization for a sampling-based optimizer for further refinement. Our approach does not require any sophisticated prediction of dynamic obstacles and yet provides state-of-the- art performance. In particular, we compare against the recent DRL-VO approach and show a 40% improvement in success rate and a 6% improvement in travel time.