Adaptive Planning with Generative Models under Uncertainty

Purdue University
IROS 2024

Abstract

Planning with generative models has emerged as an effective decision-making paradigm across a wide range of domains, including robotics, reinforcement learning, and autonomous navigation. While continuous replanning at each timestep might seem intuitive because it allows decisions to be made based on the most recent environmental observations, it results in substantial computational challenges, primarily due to the complexity of the generative model's underlying deep learning architecture. Our work addresses this challenge by introducing a simple adaptive planning policy that leverages the generative model's ability to predict long-horizon state trajectories, enabling the execution of multiple actions consecutively without the need for immediate replanning. We propose to use the predictive uncertainty derived from a Deep Ensemble of inverse dynamics models to dynamically adjust the intervals between planning sessions. In our experiments conducted on locomotion tasks within the OpenAI Gym framework, we demonstrate that our adaptive planning policy allows for a reduction in replanning frequency to just about 10% of the steps without compromising performance. Our results underscore the potential of generative modeling as an efficient and effective tool for decision-making.


Deep Ensemble Adaptive Policy



The generative model generates a trajectory of states and the action model computes the initial action. The policy continuously predicts and executes subsequent actions as long as the uncertainty remains below a predefined threshold.


Results Overview



This table presents the average normalized rewards achieved by Decision Diffuser (DD), Static Plan Execution, and Ensemble Action (EA) for both MSE and NLL training criteria. The table also reports the percentage of actions executed without sampling from the generative model, illustrating the savings in NFEs.



Our adaptive policy completes 1000 steps in under 25 seconds, while the Decision Diffuser takes over 23 minutes.


Bibtex


    @inproceedings{jutrasdube2024adaptive,
      author    = {Jutras-Dubé, P. and Zhang, R. and Bera, A.},
      title     = {Adaptive Planning with Generative Models under Uncertainty},
      booktitle = {Proceedings of the International Conference on Intelligent Robots and Systems (IROS)},
      year      = {2024}
    }