☕️ About Me
I am a second-year master's student at the Institute of Data and Information, Tsinghua University, under the supervision of Prof. Xiu Li. I earned my B.Eng. in Software Engineering from Southwest University in 2024, graduating with the honors of “Special Scholarship” and “Outstanding Student Representative”. Currently, I am conducting research on Medical AI and Multimodal Large Language Models at the Alibaba DAMO Academy as a research scientist intern. My research interests include Real-World-Oriented Computer Vision and MLLM Post-training.
I am currently seeking PhD positions for Fall 2027 and am open to all forms of collaboration.
🔥 News
- 2026.01: Two papers accepted to ICLR 2026. Congrats to Zheng Jiang and Heng Guo!
- 2025.06: One paper accepted to IEEE TPAMI (IF=20.8). Congrats to Chunming!
- 2025.02: One paper accepted to IEEE TPAMI (IF=20.8). Congrats to Chunming and Yuqi!
- 2025.02: One paper selected as a Spotlight Paper at ICLR 2025.
- 2025.01: Two papers accepted to ICLR 2025. Congrats to Chunming and Chenyang!
- 2024.09: One paper accepted to NeurIPS 2024 as a Spotlight Paper.
📝 Selected Publications
${*}$ Equal contribution, ${\dagger}$ Corresponding author
Recommendation

Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models
Chengyu Fang$^{*}$, Heng Guo$^{*}$, Zheng Jiang, Chunming He, Xiu Li$^{\dagger}$, Minfeng Xu$^{\dagger}$
- Photon is a variable-length 3D medical VQA framework with instruction-conditioned token scheduling and surrogate gradients, achieving adaptive acceleration and state-of-the-art performance.

Real-world Image Dehazing with Coherence-based Pseudo Labeling and Cooperative Unfolding Network
Chengyu Fang$^{*}$, Chunming He$^{*}$$^{\dagger}$, Fengyang Xiao, Yulun Zhang$^{\dagger}$, Longxiang Tang, Yuelin Zhang, Kai Li, and Xiu Li$^{\dagger}$
- The cooperative unfolding network (CORUN) and the first plug-and-play iterative mean-teacher framework (Colabator) for real-world image dehazing.

Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model
Chunming He$^{*}$, Chengyu Fang$^{*}$$^{\dagger}$, Yulun Zhang$^{\dagger}$, Kai Li, Longxiang Tang, Chengyu You, Fengyang Xiao, Zhenhua Guo, Xiu Li and Sina Farsiu$^{\dagger}$
- The first latent diffusion model-based method with strong generalizability in illumination degradation image restoration problems and promising performance in downstream tasks.

Chengyu Fang, Chunming He, Yuelin Zhang, Chubin Chen, Chenyang Zhu, Longxiang Tang, and Xiu Li$^{\dagger}$
- PRISM is a real-world dehazing framework that jointly reconstructs clear scenes and scattering variables, while bridging the sim2real domain gap through selective self-distillation and self-reinforcing prior.

Annotation-Free Medical Visual Reasoning via Agentic Reinforcement Learning
Zheng Jiang$^{*}$, Heng Guo$^{*}$, Chengyu Fang$^{*}$, Changchen Xiao, Xinyang Hu, Lifeng Sun$^{\dagger}$, Minfeng Xu$^{\dagger}$
- MedVR is the first end-to-end reinforcement learning framework that integrates visual and textual reasoning for medical VLMs, eliminating the need for costly intermediate supervision.

Diffusion Models in Low-Level Vision: A Survey
Chunming He$^{*}$$^{\dagger}$, Yuqi Shen$^{*}$, Chengyu Fang$^{*}$, Fengyang Xiao, Longxiang Tang, Yulun Zhang, Wangmeng Zuo, Zhenhua Guo, Xiu Li$^{\dagger}$
- A curated list of awesome Diffusion Models (DMs) in low-level vision.

MultiCOS: Unlocking the Potential of Limited Multimodal Data in Camouflaged Object Segmentation
Chengyu Fang$^{*}$, Chunming He$^{*}$, Yuqi Shen, Chenyang Zhu, Yuelin Zhang, Fengyang Xiao, Longxiang Tang, Chubin Chen, Xiu Li$^{\dagger}$
- A novel framework that effectively leverages diverse data modalities to improve segmentation performance.

M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision
Che Liu$^{*}$, Zheng Jiang$^{*}$, Chengyu Fang$^{*}$, Heng Guo, Yan-Jie Zhou, Jiaqi Qu, Le Lu, Minfeng Xu$^{\dagger}$
- A unified visual encoder without any modality-specific customization for various medical visual modalities in 2D and 3D.
More Selected Publications
- Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning, CVPR 2026 [PDF] Chubin Chen$^{*}$, Sujie Hu$^{*}$, Jiashu Zhu, Meiqi Wu, Jintao Chen, Yanxun Li, Nisha Huang, Chengyu Fang, Jiahong Wu, Xiangxiang Chu, Xiu Li$^{\dagger}$
- Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement, arXiv 2025 [PDF] Yuqi Shen$^{*}$, Fengyang Xiao$^{*}$, Sujie Hu, Youwei Pang, Yifan Pu, Chengyu Fang, Xiu Li, Chunming He
- Segment Concealed Objects with Incomplete Supervision, TPAMI 2025 [PDF] Chunming He$^{*}$, Kai Li$^{*}$, Yachao Zhang, Ziyun Yang, Youwei Pang, Longxiang Tang, Chengyu Fang, Yulun Zhang, Linghe Kong, Xiu Li, Sina Farsiu$^{\dagger}$
- UnfoldIR: Rethinking Deep Unfolding Network in Illumination Degradation Image Restoration, arXiv 2025 [PDF] Chunming He$^{*}$, Rihan Zhang$^{*}$, Fengyang Xiao, Chengyu Fang, Longxiang Tang, Yulun Zhang, Sina Farsiu$^{\dagger}$
- RUN: Reversible Unfolding Network for Concealed Object Segmentation, ICML 2025 [PDF] Chunming He, Rihan Zhang, Fengyang Xiao, Chengyu Fang, Longxiang Tang, Yulun Zhang, Linghe Kong, Deng-Ping Fan, Kai Li, Sina Farsiu$^{\dagger}$
- InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences, ICLR 2025 [PDF] Chenyang Zhu$^{*}$, Kai Li$^{*}$$^{\dagger}$, Yue Ma$^{*}$, Longxiang Tang, Chengyu Fang, Chubin Chen, Qifeng Chen, Xiu Li$^{\dagger}$
- A Survey of Camouflaged Object Detection and Beyond, CAAI AIR 2024 [PDF] Fengyang Xiao$^{*}$, Sujie Hu$^{*}$, Yuqi Shen, Chengyu Fang, Jinfa Huang, Chunming He$^{\dagger}$, Longxiang Tang, Ziyun Yang, Xiu Li$^{\dagger}$
- A Unified Framework for Microscopy Defocus Deblur with Multi-Pyramid Transformer and Contrastive Learning, CVPR 2024 [PDF] Yuelin Zhang, Pengyu Zheng, Wanquan Yan, Chengyu Fang, Shing Shin Cheng$^{\dagger}$
You can find more papers on my Google Scholar.
📖 Teaching
- 2023.09 - 2024.01 & 2024.09 - 2025.01, Teaching Assistant for Frontiers of AI Technology and Industrial Applications, Tsinghua University.
💻 Internships
- 2025.04 - Present, Research Scientist Intern, DAMO Academy, Alibaba Group.
- 2023.07 - 2024.08, Research Assistant, Prof. Xiu Li’s research group at Tsinghua University.
🎖 Honors and Awards
- 2024.06 Chongqing Outstanding Graduates
- 2024.04 Chongqing Merit Student
- 2023.12 Southwest University Outstanding Student Representative
- 2023.04 Chongqing Advanced Individual for Innovation Capability
📃 Scholarships
- 2023.12 National Scholarship
- 2022.12 National Scholarship
- 2023.12 Xiaomi Corporation Special Scholarship
- 2023.12 Southwest University Special Scholarship
- 2022.07 Professor Qiu Yuhui Scholarship
- 2022.07 Pisen Electronics Co. Ltd Scholarship
- 2021.10 Southwest University First Prize Scholarship
🏁 Competition
- 2023.08 🏅1st Prize of “Texas Instruments Cup” 2023 National Undergraduate Electronic Design Contest
- 2023.08 🏅1st Prize of “China Software Cup” University Student Software Design Competition
- 2023.08 🏅1st Prize of China University Student Embedded Chip and System Design Competition
- 2023.04 🏅1st Prize of 2023 China University Robot Competition (RoboMaster RMUL)
- 2022.08 🏅️1st Prize of “China Software Cup” University Student Software Design Competition
- 2022.12 🏅1st Prize of 2022 China University Robot Competition (RoboMaster RMUL)
- 2023.06 🥈2nd Prize in China Robotics and Artificial Intelligence Competition
- 2022.08 🥈2nd Prize of “China Software Cup” University Student Software Design Competition
- 2022.06 🥈2nd Prize of 2022 China University Robot Competition (RoboMaster RMUT)
- 2023.08 🥉3rd Prize in Chinese Collegiate Computing Competition
🧑‍🤝‍🧑 My Friends, Collaborators, and Long-term Cooperative Professors
- Chunming He@Duke, Longxiang Tang@HKUST, Yuelin Zhang@CUHK, Assoc. Prof. Yulun Zhang@SJTU.