Robotics & Embodied AI

FARE: Fast-Slow Agentic Robotic Exploration

SShuhao LiaoXXuxin LvJJeric LewSShizhe ZhangJJingsong LiangPPeizhuo LiYYuhong CaoWWenjun WuGGuillaume Sartoretti
Published
January 21, 2026
Authors
9
Word Count
5,351
Code
Includes code

Efficient robotic exploration with FARE framework.

Abstract

This work advances autonomous robot exploration by integrating agent-level semantic reasoning with fast local control. We introduce FARE, a hierarchical autonomous exploration framework that integrates a large language model (LLM) for global reasoning with a reinforcement learning (RL) policy for local decision making. FARE follows a fast-slow thinking paradigm. The slow-thinking LLM module interprets a concise textual description of the unknown environment and synthesizes an agent-level exploration strategy, which is then grounded into a sequence of global waypoints through a topological graph. To further improve reasoning efficiency, this module employs a modularity-based pruning mechanism that reduces redundant graph structures. The fast-thinking RL module executes exploration by reacting to local observations while being guided by the LLM-generated global waypoints. The RL policy is additionally shaped by a reward term that encourages adherence to the global waypoints, enabling coherent and robust closed-loop behavior. This architecture decouples semantic reasoning from geometric decision, allowing each module to operate in its appropriate temporal and spatial scale. In challenging simulated environments, our results show that FARE achieves substantial improvements in exploration efficiency over state-of-the-art baselines. We further deploy FARE on hardware and validate it in complex, large scale 200mtimes130m building environment.

Key Takeaways

  • 1

    FARE reduces travel distance and time by 50%.

  • 2

    Integrates global reasoning with local decision-making.

  • 3

    Successfully deployed on real-world robotic platform.

Limitations

  • Relies on accurate large language model.

  • Assumes environment can be described in text.

Keywords

large language modelreinforcement learningglobal reasoninglocal decision makingtopological graphmodularity-based pruningreward shapingclosed-loop behaviortemporal scalespatial scale

More in Robotics & Embodied AI

View all
FARE: Fast-Slow Agentic Robotic Exploration | Paperchime