☕️ About Me

I am a second-year master's student at the Institute of Data and Information, Tsinghua University, under the supervision of Prof. Xiu Li. I earned my B.Eng. in Software Engineering from Southwest University in 2024, graduating with the honors of “Special Scholarship” and “Outstanding Student Representative”. Currently, I am conducting research on Medical AI and Multimodal Large Language Models at the Alibaba DAMO Academy as a research scientist intern. My research interests include real-world-oriented computer vision and MLLM post-training.

I am currently seeking PhD positions for Fall 2027 and am open to all forms of collaboration.

🔥 News

2026.01: Two papers accepted to ICLR 2026. Congrats to Zheng Jiang and Heng Guo!
2025.06: One paper accepted to IEEE TPAMI (IF=20.8). Congrats to Chunming!
2025.02: One paper accepted to IEEE TPAMI (IF=20.8). Congrats to Chunming and Yuqi!
2025.02: One paper selected as a Spotlight Paper at ICLR 2025.
2025.01: Two papers accepted to ICLR 2025. Congrats to Chunming and Chenyang!
2024.09: One paper accepted to NeurIPS 2024 as a Spotlight Paper.

📝 Selected Publications

${*}$ Equal contribution, ${\dagger}$ Corresponding author

Recommendation

ICLR 2026

Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models

Chengyu Fang$^{*}$, Heng Guo$^{*}$, Zheng Jiang, Chunming He, Xiu Li$^{\dagger}$, Minfeng Xu$^{\dagger}$

Photon is a variable-length 3D medical VQA framework with instruction-conditioned token scheduling and surrogate gradients, achieving adaptive acceleration and state-of-the-art performance.

NeurIPS 2024 Spotlight⭐️

Real-world Image Dehazing with Coherence-based Pseudo Labeling and Cooperative Unfolding Network

Chengyu Fang$^{*}$, Chunming He$^{*}$$^{\dagger}$, Fengyang Xiao, Yulun Zhang$^{\dagger}$, Longxiang Tang, Yuelin Zhang, Kai Li, and Xiu Li$^{\dagger}$

The cooperative unfolding network (CORUN) and the first plug-and-play iterative mean-teacher framework (Colabator) for real-world image dehazing.

ICLR 2025 Spotlight⭐

Reti-Diff: Illumination Degradation Image Restoration with Retinex-based Latent Diffusion Model

Chunming He$^{*}$, Chengyu Fang$^{*}$$^{\dagger}$, Yulun Zhang$^{\dagger}$, Kai Li, Longxiang Tang, Chengyu You, Fengyang Xiao, Zhenhua Guo, Xiu Li and Sina Farsiu$^{\dagger}$

The first latent diffusion model-based method for illumination degradation image restoration, with strong generalizability and promising performance in downstream tasks.

arXiv 2026

PRISM: Rethinking Scattered Atmosphere Reconstruction as a Unified Understanding and Generation Model for Real-world Dehazing

Chengyu Fang, Chunming He, Yuelin Zhang, Chubin Chen, Chenyang Zhu, Longxiang Tang, and Xiu Li$^{\dagger}$

PRISM is a real-world dehazing framework that jointly reconstructs clear scenes and scattering variables, while bridging the sim2real domain gap through selective self-distillation and self-reinforcing prior.

ICLR 2026

Annotation-Free Medical Visual Reasoning via Agentic Reinforcement Learning

Zheng Jiang$^{*}$, Heng Guo$^{*}$, Chengyu Fang$^{*}$, Changchen Xiao, Xinyang Hu, Lifeng Sun$^{\dagger}$, Minfeng Xu$^{\dagger}$

MedVR is the first end-to-end reinforcement learning framework that integrates visual and textual reasoning for medical VLMs, eliminating the need for costly intermediate supervision.

TPAMI 2025

Diffusion Models in Low-Level Vision: A Survey

Chunming He$^{*}$$^{\dagger}$, Yuqi Shen$^{*}$, Chengyu Fang$^{*}$, Fengyang Xiao, Longxiang Tang, Yulun Zhang, Wangmeng Zuo, Zhenhua Guo, Xiu Li$^{\dagger}$

A curated list of awesome Diffusion Models (DMs) in low-level vision.

arXiv 2025

MultiCOS: Unlocking the Potential of Limited Multimodal Data in Camouflaged Object Segmentation

Chengyu Fang$^{*}$, Chunming He$^{*}$, Yuqi Shen, Chenyang Zhu, Yuelin Zhang, Fengyang Xiao, Longxiang Tang, Chubin Chen, Xiu Li$^{\dagger}$

A novel framework that effectively leverages diverse data modalities to improve segmentation performance.

arXiv 2025

M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision

Che Liu$^{*}$, Zheng Jiang$^{*}$, Chengyu Fang$^{*}$, Heng Guo, Yan-Jie Zhou, Jiaqi Qu, Le Lu, Minfeng Xu$^{\dagger}$

A unified visual encoder without any modality-specific customization for various medical visual modalities in 2D and 3D.

More Selected Publications

Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning, CVPR 2026 [PDF] Chubin Chen$^{*}$, Sujie Hu$^{*}$, Jiashu Zhu, Meiqi Wu, Jintao Chen, Yanxun Li, Nisha Huang, Chengyu Fang, Jiahong Wu, Xiangxiang Chu, Xiu Li$^{\dagger}$
Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement, arXiv 2025 [PDF] Yuqi Shen$^{*}$, Fengyang Xiao$^{*}$, Sujie Hu, Youwei Pang, Yifan Pu, Chengyu Fang, Xiu Li, Chunming He
Segment Concealed Objects with Incomplete Supervision, TPAMI 2025 [PDF] Chunming He$^{*}$, Kai Li$^{*}$, Yachao Zhang, Ziyun Yang, Youwei Pang, Longxiang Tang, Chengyu Fang, Yulun Zhang, Linghe Kong, Xiu Li, Sina Farsiu$^{\dagger}$
UnfoldIR: Rethinking Deep Unfolding Network in Illumination Degradation Image Restoration, arXiv 2025 [PDF] Chunming He$^{*}$, Rihan Zhang$^{*}$, Fengyang Xiao, Chengyu Fang, Longxiang Tang, Yulun Zhang, Sina Farsiu$^{\dagger}$
RUN: Reversible Unfolding Network for Concealed Object Segmentation, ICML 2025 [PDF] Chunming He, Rihan Zhang, Fengyang Xiao, Chengyu Fang, Longxiang Tang, Yulun Zhang, Linghe Kong, Deng-Ping Fan, Kai Li, Sina Farsiu$^{\dagger}$
InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences, ICLR 2025 [PDF] Chenyang Zhu$^{*}$, Kai Li$^{*}$$^{\dagger}$, Yue Ma$^{*}$, Longxiang Tang, Chengyu Fang, Chubin Chen, Qifeng Chen, Xiu Li$^{\dagger}$
A Survey of Camouflaged Object Detection and Beyond, CAAI AIR 2024 [PDF] Fengyang Xiao$^{*}$, Sujie Hu$^{*}$, Yuqi Shen, Chengyu Fang, Jinfa Huang, Chunming He$^{\dagger}$, Longxiang Tang, Ziyun Yang, Xiu Li$^{\dagger}$
A Unified Framework for Microscopy Defocus Deblur with Multi-Pyramid Transformer and Contrastive Learning, CVPR 2024 [PDF] Yuelin Zhang, Pengyu Zheng, Wanquan Yan, Chengyu Fang, Shing Shin Cheng$^{\dagger}$

You can find more papers on my Google Scholar.

📖 Teaching

2023.09 - 2024.01 & 2024.09 - 2025.01, Teaching Assistant for Frontiers of AI technology and industrial applications, Tsinghua University.

💻 Internships

2025.04 - Present, Research Scientist Intern, DAMO Academy, Alibaba Group.
2023.07 - 2024.08, Research Assistant, Prof. Xiu Li’s research group at Tsinghua University.

🎖 Honors and Awards

2024.06 Chongqing Outstanding Graduates
2024.04 Chongqing Merit Student
2023.12 Southwest University Outstanding Student Representative
2023.04 Chongqing Advanced Individual for Innovation Capability

📃 Scholarships

2023.12 National Scholarship
2022.12 National Scholarship
2023.12 Xiaomi Corporation Special Scholarship
2023.12 Southwest University Special Scholarship
2022.07 Professor Qiu Yuhui Scholarship
2022.07 Pisen Electronics Co. Ltd Scholarship
2021.10 Southwest University First Prize Scholarship

🏁 Competition

2023.08 🏅1st Prize of “Texas Instruments Cup” 2023 National Undergraduate Electronic Design Contest
2023.08 🏅1st Prize of “China Software Cup” University Student Software Design Competition
2023.08 🏅1st Prize of China University Student Embedded Chip and System Design Competition
2023.04 🏅1st Prize of 2023 China University Robot Competition (RoboMaster RMUL)
2022.08 🏅️1st Prize of “China Software Cup” University Student Software Design Competition
2022.12 🏅1st Prize of 2022 China University Robot Competition (RoboMaster RMUL)
2023.06 🥈2nd Prize of China Robotics and Artificial Intelligence Competition
2022.08 🥈2nd Prize of “China Software Cup” University Student Software Design Competition
2022.06 🥈2nd Prize of 2022 China University Robot Competition (RoboMaster RMUT)
2023.08 🥉3rd Prize of Chinese Collegiate Computing Competition

🧑‍🤝‍🧑 My Friends, Collaborators, and Long-term Cooperative Professors

Chunming He@Duke, Longxiang Tang@HKUST, Yuelin Zhang@CUHK, Assoc. Prof. Yulun Zhang@SJTU.