Official implementation of the paper "LLM-Oriented Token-Adaptive Knowledge Distillation" (AAAI 2026).
AdaKD is a plug-and-play framework for logit-based distillation that dynamically adapts to the student's learning state. It features two synergistic modules:
- Loss-driven Adaptive Token Focusing (LATF): Concentrates distillation on valuable tokens by monitoring learning stability.
- Inverse Difficulty Temperature Scaling (IDTS): Applies token-level temperatures: low for hard tokens to drive error correction, high for easy tokens to improve generalization.
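To make the two modules concrete, here is a minimal, self-contained sketch. The function names, the use of the student's per-token loss as the difficulty signal, and the linear temperature mapping are our illustrative assumptions, not the paper's exact formulation:

```python
def token_temperatures(token_losses, t_min=1.0, t_max=4.0):
    # IDTS-style mapping (illustrative): normalize per-token difficulty
    # (proxied here by the student's loss) into [0, 1], then assign an
    # inversely scaled temperature: hardest token -> t_min, easiest -> t_max.
    lo, hi = min(token_losses), max(token_losses)
    span = (hi - lo) or 1.0
    return [t_max - ((l - lo) / span) * (t_max - t_min) for l in token_losses]

def select_focus_tokens(token_losses, keep_ratio=0.5):
    # LATF-style selection (illustrative): keep the indices of the
    # highest-loss tokens so distillation concentrates on valuable tokens.
    k = max(1, int(len(token_losses) * keep_ratio))
    return sorted(range(len(token_losses)), key=lambda i: -token_losses[i])[:k]
```

In this sketch, distillation would apply the per-token temperatures only on the selected token subset; see the paper for the actual stability-monitoring schedule used by LATF.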
> [!NOTE]
> The requirements.txt and specific environment setup scripts are currently being finalized.
The training data is based on the databricks-dolly-15k dataset. You can download our processed version here:
- Processed Data: Google Drive Link
Please place the downloaded data in the data/ directory.
> [!IMPORTANT]
> Bash scripts for automated training and evaluation are coming soon.
Our code is built upon the following open-source projects:
- distillm: Towards Streamlined Distillation for Large Language Models.
- minillm: Knowledge Distillation of Large Language Models.
We thank the authors for their great work!
If you find our work useful in your research, please consider citing:
```bibtex
@inproceedings{xie2026adakd,
  title={LLM-Oriented Token-Adaptive Knowledge Distillation},
  author={Xie, Xurong and Xue, Zhucun and Wu, Jiafu and Li, Jian and Wang, Yabiao and Hu, Xiaobin and Liu, Yong and Zhang, Jiangning},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence (AAAI)},
  year={2026}
}
```