Our Embodied Interpretability Paper Got Accepted by ICML 2026

May 7, 2026

Our latest work has been accepted at ICML 2026. As vision-language-action (VLA) models become a key route towards general-purpose robot policies, we should ask not only whether a robot completes a task, but also why it chooses an action. A policy can appear to work while relying on the wrong visual cues, such as background texture, lighting, or other shortcuts in the scene. Our work provides a way to test this more directly by asking which parts of the image actually change the robot’s action. This helps researchers diagnose VLA failure modes and better understand why some policies generalise to new environments while others do not.

Embodied Interpretability: Linking Causal Understanding to Generalization in Vision-Language-Action Models

Project page: robot-future.github.io/vla-explain
Preprint: available here
Code: available soon for plug-and-play use

Many congratulations to my PhD students Hanxin Zhang, Mingshuo Xu and Abdulqader Dhafer, and to our co-authors Shigang Yue and Hongbiao Dong. We are also grateful to the reviewers for their constructive and helpful feedback.