I am a PhD student at The Hong Kong University of Science and Technology, supervised by
Prof. Ying-Cong Chen and
Prof. Qifeng Chen. Feel free to ping me via Email or WeChat if you are interested in working with me.
Selected Preprints
DVD: Deterministic Video Depth Estimation with Generative Priors[arXiv]
Show, Don't Tell: Morphing Latent Reasoning into Image Generation[arXiv]
Go with Your Gut: Scaling Confidence for Autoregressive Image Generation[arXiv]
Selected Publications
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative ModelsCVPR 2026[Paper]
Hierarchical Fine-grained Preference Optimization for Physically Plausible Video GenerationNeurIPS 2025[Paper]
FineQuest: Adaptive Knowledge-Assisted Sports Video Understanding via Agent-of-Thoughts ReasoningMM 2025 (Oral)[Paper]
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video ModelsICML 2025[Paper]
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning StabilizationAAAI 2025[Paper]
FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERsMM 2024[Paper]