Research Homepage

Building Reliable and Efficient Multimodal AI

I am a Ph.D. student in Electrical and Computer Engineering at Seoul National University, working with the Intelligent Computing Lab. My research focuses on multimodal language-vision systems, diffusion-based generation, and robust visual representation learning.

View Publications Open CV Google Scholar

Publications

Latest Publication

2026

Research Focus

Multimodal Language-Vision Models

Understanding and reducing object hallucinations with uncertainty-aware token analysis.

Diffusion Model Efficiency

Improving speed and memory efficiency with adaptive sampling and lightweight optimization.

Text-to-Image Alignment

Studying token geometry and cross-attention behavior for stronger semantic binding.

Practical Robustness

Designing methods that remain effective under limited compute and real-world constraints.

Recent News

Mar 12, 2026

3 Papers accepted to CVPR 2026 (2 first-authored, 1 co-authored)! See you in Denver, Colorado!

Oct 02, 2025

Received BK21Four - SNU Fellowship of Research Excellence ($7,000).

Sep 25, 2025

Our paper "On Epistemic Uncertainty of Visual Tokens~" was accepted to NeurIPS 2025 (poster).

Paper Project

Aug 19, 2025

Released RALU: region-adaptive latent sampling for accelerated diffusion transformers.

Paper Code

May 01, 2025

Skrr was accepted to ICML 2025 (poster).

Paper

Apr 26, 2025

Harmonization for a black-box deep learning model was selected as Summa cum laude at ISMRM 2025.

Apr 01, 2025

Efficient Personalization of Quantized Diffusion Model without Backpropagation was accepted to EDGE@CVPR 2025.

Mar 29, 2025

Released TeeMo on geometric properties of text token embeddings for stronger semantic binding.

Paper

Feb 27, 2025

Efficient Personalization of Quantized Diffusion Model without Backpropagation was accepted to CVPR 2025 (poster).

Paper Project Code

Feb 01, 2025

Harmonization for a black-box deep learning model was accepted to ISMRM 2025 as an oral presentation.

Hoigi Seo