
Share
Researchers introduce an automatic feasibility detection system for diffusion planners, ensuring safer execution in complex tasks where reliability is crucial.
Diffusion-based planning has emerged as a powerful technique for tackling long-horizon, sparse-reward tasks. By training trajectory diffusion models and conditioning the sampled trajectories using auxiliary guidance functions, these planners can generate complex behaviors. However, one significant drawback is that diffusion models are not guaranteed to produce feasible plans, leading to failed executions and making them less suitable for safety-critical applications.
In a recent paper titled "Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans," Kyowoon Lee, Seongun Kim, and Jaesik Choi introduce a novel approach to refine unreliable plans generated by diffusion models. This method leverages a new metric called the restoration gap (RG) to identify and correct infeasible plans.
Model Architecture:
Training Process:

The authors evaluate their approach on three different benchmarks in offline control settings that require long-horizon planning:
One of the key strengths of this approach is its explainability. By using attribution maps, the authors can highlight error-prone transitions in the plan, providing insights into why certain plans are infeasible. This not only helps in refining the plans but also aids in understanding the behavior synthesis process.
The work by Lee, Kim, and Choi addresses a critical issue in diffusion-based planning: the generation of infeasible plans. By introducing the restoration gap metric and leveraging an attribution map regularizer, they provide a robust framework for refining unreliable plans. This approach not only improves the reliability of behavior synthesis but also offers valuable insights into the plan refinement process.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
1 November 2023
88 articles
Related Articles
Related Articles
More Stories