Official repository for SpaceTools and the Toolshed system.
- β Toolshed system β installation, demos, and docs
- π Training and evaluation
- π Data
- π Pretrained model
SpaceTools empowers VLMs with vision tools and robotic tools to perform spatial reasoning and real-world manipulation.
It introduces Double Interactive Reinforcement Learning (DIRL), a two-phase training pipeline for effective multi-tool coordination, and Toolshed, a distributed toolkit that enables real-time interaction with compute-heavy multimodal tools during both RL training and inference.
The code release include:
A Ray-based distributed framework for hosting and replicating compute-heavy tools (neural networks, VLMs, code executors) during both training and inference:
- Included tools: pointing (RoboRefer, Molmo), depth estimation, SAM2 segmentation, 3D bounding box, grasp generation, code executor, and more
- Load balancing & queue management: multiple tool instances with automatic request routing and queuing
- Environment isolation: separate conda environments per tool for incompatible dependencies
- Schema generation: auto-converts tool docstrings to JSON schemas for LLM frameworks
- Agentic workflow: built-in agent with support for OpenAI, Anthropic, Bedrock, and SGLang providers
- Web UI & dashboard: interactive agent interface and real-time tool state visualization
- Multinode deployment: scale across machines via Ray clusters (SLURM supported)
- Code execution interface: Pythonic access to the full toolkit from generated code
- DIRL training recipe
- Toolshed intergrated RL framework
- SFT framework
- SFT + RL dataset
- Pretrained model checkpoint
- Spatial benchmark evaluation
@misc{chen2025spacetoolstoolaugmentedspatialreasoning,
title={SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL},
author={Siyi Chen and Mikaela Angelina Uy and Chan Hee Song and Faisal Ladhak and Adithyavairavan Murali and Qing Qu and Stan Birchfield and Valts Blukis and Jonathan Tremblay},
year={2025},
eprint={2512.04069},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2512.04069}
}