Skip to content

zhang-guangyi/QS-Speculative-Decoding

Repository files navigation

QS-Speculative-Decoding

Citation (Preprint Version)

@article{ qs_speculative,
  title={Quantize-Sample-and-Verify: LLM Acceleration via Adaptive Edge-Cloud Speculative Decoding},
  author={Guangyi Zhang and Yunlong Cai and Guanding Yu and Petar Popovski and Osvaldo Simeone},
  journal={arXiv preprint arXiv:2507.00605},
  year={2025},}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors