Latest Robotics & Embodied AI Research Papers

Research on robots that learn, embodied intelligence, manipulation, and physical AI systems.

15 Papers

Showing 15 of 15 papers

RoboPocket: Improve Robot Policies Instantly with Your Phone

Junjie Fang, Wendi Chen, Han Xue +7 more

Scaling imitation learning is fundamentally constrained by the efficiency of data collection. While handheld interfaces have emerged as a scalable solution for in-the-wild data acquisition, they predominantly operate in an open-loop manner: operators blindly collect demonstrations without knowing th...

imitation learningdata collectionopen-loopDAggercovariate shift+7 more

Mar 5, 202630

EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents

Wenjia Wang, Liang Pan, Huaijin Pi +8 more

Human behaviors in the real world naturally encode rich, long-term contextual information that can be leveraged to train embodied agents for perception, understanding, and acting. However, existing capture systems typically rely on costly studio setups and wearable devices, limiting the large-scale ...

embodied agentsRGB-D sequencesmetric world coordinate framemonocular human-scene-reconstructionphysics-based character animation+2 more

Feb 26, 202611

TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics

Shirui Chen, Cole Harrison, Ying-Chun Lee +6 more

While Vision-Language-Action (VLA) models have seen rapid progress in pretraining, their advancement in Reinforcement Learning (RL) remains hampered by low sample efficiency and sparse rewards in real-world settings. Developing generalizable process reward models is essential for providing the fine-...

Vision-Language-Action modelsReinforcement Learningtemporal value functionsVision-Language Modelstoken logits+2 more

Feb 22, 202621

Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation

Runpei Dong, Ziyan Li, Xialin He +1 more

Visual loco-manipulation of arbitrary objects in the wild with humanoid robots requires accurate end-effector (EE) control and a generalizable understanding of the scene via visual inputs (e.g., RGB-D images). Existing approaches are based on real-world imitation learning and exhibit limited general...

end-effector tracking policyinverse kinematicsneural forward modelgoal adjustmentreplanning+3 more

Feb 18, 202626

SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation

Kushal Kedia, Tyler Ga Wei Lum, Jeannette Bohg +1 more

The ability to manipulate tools significantly expands the set of tasks a robot can perform. Yet, tool manipulation represents a challenging class of dexterity, requiring grasping thin objects, in-hand object rotations, and forceful interactions. Since collecting teleoperation data for these behavior...

sim-to-real reinforcement learningtool manipulationdexterous manipulationprocedural generationreinforcement learning policy+1 more

Feb 18, 202614

RynnBrain: Open Embodied Foundation Models

Ronghao Dang, Jiayan Guo, Bohan Hou +23 more

Despite rapid progress in multimodal foundation models, embodied intelligence community still lacks a unified, physically grounded foundation model that integrates perception, reasoning, and planning within real-world spatial-temporal dynamics. We introduce RynnBrain, an open-source spatiotemporal f...

multimodal foundation modelsembodied intelligencespatiotemporal foundation modelegocentric understandingspatiotemporal localization+6 more

Feb 13, 202636

ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning

Yandan Yang, Shuang Zeng, Tong Lin +11 more

Building general-purpose embodied agents across diverse hardware remains a central challenge in robotics, often framed as the ''one-brain, many-forms'' paradigm. Progress is hindered by fragmented data, inconsistent representations, and misaligned training objectives. We present ABot-M0, a framework...

embodied agentsdata curation pipelinemodel architecturetraining strategiesunified representations+11 more

Feb 11, 202610

MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation

Yejin Kim, Wilbert Pumacay, Omar Rayyan +23 more

Deploying robots at scale demands robustness to the long tail of everyday situations. The countless variations in scene layout, object geometry, and task specifications that characterize real environments are vast and underrepresented in existing robot benchmarks. Measuring this level of generalizat...

robot policiesembodied taskssim-to-real correlationzero-shot policiesprompt phrasing+2 more

Feb 11, 20265

RISE: Self-Improving Robot Policy with Compositional World Model

Jiazhi Yang, Kunyang Lin, Jinwei Li +10 more

Despite the sustained scaling on model capacity and data acquisition, Vision-Language-Action (VLA) models remain brittle in contact-rich and dynamic manipulation tasks, where minor execution deviations can compound into failures. While reinforcement learning (RL) offers a principled path to robustne...

Vision-Language-Action modelsreinforcement learningon-policy RLCompositional World Modelcontrollable dynamics model+4 more

Feb 11, 202626

Contact-Anchored Policies: Contact Conditioning Creates Strong Robot Utility Models

Zichen Jeff Cui, Omar Rayyan, Haritheja Etukuru +16 more

The prevalent paradigm in robot learning attempts to generalize across environments, embodiments, and tasks with language prompts at runtime. A fundamental tension limits this approach: language is often too abstract to guide the concrete physical understanding required for robust manipulation. In t...

Contact-Anchored Policieslanguage conditioningphysical contactmodular utility modelsreal-to-sim iteration+4 more

Feb 9, 202611

χ_{0}: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies

Checheng Yu, Chonghao Sima, Gangcheng Jiang +14 more

High-reliability long-horizon robotic manipulation has traditionally relied on large-scale data and compute to understand complex real-world dynamics. However, we identify that the primary bottleneck to real-world robustness is not resource scale alone, but the distributional shift among the human d...

model arithmeticstage-aware advantage estimatortrain-deploy alignmentdistributional shiftpolicy+6 more

Feb 9, 202625

RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI

Hongzhi Zang, Shu'ang Yu, Hao Lin +14 more

Online policy learning directly in the physical world is a promising yet challenging direction for embodied intelligence. Unlike simulation, real-world systems cannot be arbitrarily accelerated, cheaply reset, or massively replicated, which makes scalable data collection, heterogeneous deployment, a...

online policy learningembodied intelligencereal-world systemsheterogeneous robotshardware abstraction layer+12 more

Feb 8, 202646

SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation

Mu Huang, Hui Wang, Kerui Ren +5 more

Simulating deformable objects under rich interactions remains a fundamental challenge for real-to-sim robot manipulation, with dynamics jointly driven by environmental effects and robot actions. Existing simulators rely on predefined physics or data-driven dynamics without robot-conditioned control,...

3D Gaussian Splatdeformable dynamicslatent neural spacereal-to-sim simulationrobot manipulation+1 more

Feb 2, 202628

FARE: Fast-Slow Agentic Robotic Exploration

Shuhao Liao, Xuxin Lv, Jeric Lew +6 more

This work advances autonomous robot exploration by integrating agent-level semantic reasoning with fast local control. We introduce FARE, a hierarchical autonomous exploration framework that integrates a large language model (LLM) for global reasoning with a reinforcement learning (RL) policy for lo...

large language modelreinforcement learningglobal reasoninglocal decision makingtopological graph+5 more

Jan 21, 20265

RoboBrain 2.5: Depth in Sight, Time in Mind

Huajie Tan, Enshen Zhou, Zhiyu Li +32 more

We introduce RoboBrain 2.5, a next-generation embodied AI foundation model that advances general perception, spatial reasoning, and temporal modeling through extensive training on high-quality spatiotemporal supervision. Building upon its predecessor, RoboBrain 2.5 introduces two major capability up...

embodied AIspatiotemporal supervision3D spatial reasoningdepth-aware coordinate predictionmetric constraint comprehension+4 more

Jan 20, 20268

View all categories

Latest Robotics & Embodied AI Research | Robotics & Embodied AI Papers