\(\small \mathcal{T(R,O)}\) Grasp: Efficient Graph Diffusion of Robot-Object Spatial Transformation for Cross-Embodiment Dexterous Grasping

Xin Fei1,2*, Zhixuan Xu1,2*, Huaicong Fang3, Tianrui Zhang1,2,
Lin Shao1,2
1National University of Singapore, 2RoboScience 3Zhejiang University
* denotes equal contribution
\(\mathcal{T(R,O)}\) \(=\) \(\mathcal{T}\)ransformation \((\)\(\mathcal{R}\)obot, \(\mathcal{O}\)bject \()\)
Teaser Image

Abstract

Dexterous grasping remains a central challenge in robotics due to the complexity of its high-dimensional state and action space. We introduce \(\small \mathcal{T(R,O)}\) Grasp, a diffusion-based framework that efficiently generates accurate and diverse grasps across multiple robotic hands. At its core is the \(\small \mathcal{T(R,O)}\) Graph, a unified representation that models spatial transformations between robotic hands and objects while encoding their geometric properties. A graph diffusion model, coupled with an efficient inverse kinematics solver, supports both unconditioned and conditioned grasp synthesis.

Extensive experiments on a diverse set of dexterous hands show that \(\small \mathcal{T(R,O)}\) Grasp achieves average success rate of 94.83%, inference speed of 0.21s, and throughput of 41 grasps per second on an NVIDIA A100 40GB GPU, substantially outperforming existing baselines. In addition, our approach is robust and generalizable across embodiments while significantly reducing memory consumption. More importantly, the high inference speed enables closed-loop dexterous manipulation, underscoring the potential of \(\small \mathcal{T(R,O)}\) Grasp to scale into a foundation model for dexterous grasping.

Method Overview

Teaser Image

Overview of \(\small \mathcal{T(R,O)}\) Grasp: We first define \(\small \mathcal{T(R,O)}\) Grasp Graph to represent spatial transformations of robotic links and objects with auxiliary geometry information. Next, we introduce a graph diffusion model that enables both unconditioned and conditioned grasp synthesis.

Unconditioned Grasp Synthesis

Barrett

Allegro

Shadowhand

Grasp ID: 0

Conditioned Grasp Synthesis

Barrett

Allegro

Shadowhand

Grasp ID: 0

Real-world Results

XHand: 91%

LEAP Hand: 90%

Closed-loop Grasping

Citation

      
        @misc{fei2025trograspefficientgraph,
          title={T(R,O) Grasp: Efficient Graph Diffusion of Robot-Object Spatial Transformation for Cross-Embodiment Dexterous Grasping}, 
          author={Xin Fei and Zhixuan Xu and Huaicong Fang and Tianrui Zhang and Lin Shao},
          year={2025},
          eprint={2510.12724},
          archivePrefix={arXiv},
          primaryClass={cs.RO},
          url={https://arxiv.org/abs/2510.12724}, 
        }