SpecVLMs: Fast Speculative Decoding in Vision-Language Models
For technical details and full experimental results, please check the paper of SpecVLM.
@article{huang2025specvlm,
title={SpecVLM: Fast Speculative Decoding in Vision-Language Models},
author={Huang, Haiduo and Yang, Fuwei and Liu, Zhenhua and Yin, Xuanwu and Li, Dong and Ren, Pengju and Barsoum, Emad},
journal={arXiv preprint arXiv:2509.11815},
year={2025}
}