ant-research/Awesome-AIGC-Image-Video-Detection

Awesome AIGC Image/Video Detection

Overview

A curated collection of the latest research and resources on AI-Generated Image and Video Detection. This repository encompasses datasets, benchmarks, research papers, and practical detection tools.

🚀🚀🚀 Contributions are welcome! If you find any missing papers, datasets, or tools, feel free to open an issue or submit a pull request.

🔥 Hot Events


Benchmarks & Datasets

Modality Legend: [I] Image | [V] Video | [M] Multi-modal

Annotation Type Legend: Au: Authenticity | Ex: Explainability | Lo: Localization

Benchmark Paper Venue & Year Modality Notes Real Source Fake Source/Generator Annotation Scale Download
SciFigDetect SciFigDetect: A Benchmark for AI-Generated Scientific Figure Detection Arxiv 2026 [I] Scientific Figure Detection Nano Banana Pro, GPT-image-1.5 Au 150K SciFigDetect
ActivityForensics ActivityForensics: A Comprehensive Benchmark for Localizing Manipulated Activity in Videos CVPR 2026 [V] Action-level AIGC in videos Au 6K ActivityForensics
MintVid VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning Arxiv 2026 [V] OpenVid, VFHQ, HDTF, TikTok Jimeng3.0-Pro, Seedance, Kling2.5-Turbo, Sora2, TikTok, Youtube, etc. Au 4K MintVid
AIGVDBench Your One-Stop Solution for AI-Generated Video Detection CVPR 2026 [V] OpenVid-HD 31 generation models Au 440K AIGVDBench
HydraFake Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning ICLR 2026(Oral) [I] FFHQ, VFHQ, CelebAHQ, FF++, etc. GPT-4o, HailuoAI, ICLight, InfiniteYou, etc. Au, Ex 100K HydraFake
BR-Gen Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach AAAI 2026 [I] Au, Lo 150K BR-Gen
HiResolution No Pixel Left Behind: A Detail-Preserving Architecture for Robust High-Resolution AI-Generated Image Detection ICLR 2026 [I] Au 50K HiRes-50K
AIGI-Now AlignGemini: Generalizable AI-Generated Image Detection Through Task-Model Alignment Arxiv 2026 [I] COCO Nano Banana, GPT-4o, Jimeng, Kling, Minimax, etc. Au 18K AIGI-Now
RealChain Beyond Artifacts: Real-Centric Envelope Modeling for Reliable AI-Generated Image Detection Arxiv 2026 [I] Au 14K RealChain
GenVidBench GenVidBench: A 6-Million Benchmark for AI-Generated Video Detection AAAI 2026 [V] Au 6M GenVidBench
Skyra Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning CVPR 2026 [V] Au, Ex, Lo 4K ViF-CoT-4K
So-Fake-Set So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection Arxiv 2025 [I] F30k, WIDER, FFHQ, CelebA, OpenImages, COCO, OpenForensics Qwen-image, GPT-4o, Nano Banana, Seedream3.0, Ideogram3.0, etc. Au 2M+ So-Fake-Set, So-Fake-OOD
GenBuster++ BusterX++: Towards Unified Cross-Modal AI-Generated Content Detection and Explanation with MLLM Arxiv 2025 [M] Au 4K GenBuster++
GenBuster BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation Arxiv 2025 [V] Au 200K GenBuster-200K
AIGIBench Is Artificial Intelligence Generated Image Detection a Solved Problem? NeurIPS 2025 [I] FFHQ, CelebA-HQ, Open Images V7 Common generators & SocialRF, CommunityAI Au 200K AIGIBench
Ivy-Fake IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection Arxiv 2025 [M] Au, Ex 150K Ivy-Fake
AEGIS AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences ACM MM 2025 [V] Vript (YouTube, TikTok), DVF, YouTube (self-collected) Stable Video Diffusion, CogVideoX-5B, I2VGen-XL, Pika, KLing, Sora Au, Ex 10K+ AEGIS
NeXT-IMDL NeXT-IMDL: Build Benchmark for Next-Generation Image Manipulation Detection & Localization Arxiv 2025 [I] Flickr30k, COCO, OpenImages V7 SD2-Inpainting, SDXL-Inpainting, FLUX-Inpainting, etc. Au, Lo 558K NeXT-IMDL
ARForensics D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection ICCV 2025 [I] ImageNet Infinity, Janus_Pro, RAR, Switti, VAR, LlamaGen, Open_MAGVIT2 Au 300K ARForensics
OpenSDI OpenSDI: Spotting Diffusion-Generated Images in the Open World CVPR 2025 [I] Megalith-10M SD1.5, SD2.1, SDXL, SD3, Flux.1 Au, Lo 300K OpenSDI
Community Forensics Community Forensics: Using Thousands of Generators to Train Fake Image Detectors CVPR 2025 [I] LAION, ImageNet, COCO, FFHQ, CelebA, MetFaces, AFHQ, etc. 4803 generators (Latent Diffusion, GAN, Autoregressive, Pixel Diffusion, Commercial) Au 2.7M Community Forensics
FakeClue Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation NeurIPS 2025 [I] Au, Ex 100K FakeClue
XAIGID-RewardBench Explainable AI-Generated Image Detection RewardBench NeurIPS 2025 Workshop [I] COCO-2017 Imagen 4, Flux.1 Dev, Bagel, etc. Au, Ex 3K XAIGID-RewardBench
RewardData Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs Arxiv 2025 [V] Au, Ex 4.3K RewardData
OpenFake OPENFAKE: An Open Dataset and Platform Toward Real-World Deepfake Detection Arxiv 2025 [I] LAION-400M SD 1.5/2.1/XL/3.5, Flux 1.0-dev/1.1-Pro/Schnell, Midjourney v6/v7, DALL·E 3, Imagen 3/4, GPT Image 1, Ideogram 3.0, Grok-2, HiDream-I1, Recraft v3, Chroma, and 10 community LoRA/finetune variants Au ~4M OPENFAKE
Video Reality Test Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Arxiv 2025 [V] YouTube ASMR (social media) Veo3.1-Fast, Sora2, Wan2.2-A14B, Wan2.2-5B, OpenSora-V2, HunyuanVideo, StepVideo Au 149 real + dynamic fake Video Reality Test
DDL DDL: A Dataset for Interpretable Deepfake Detection and Localization in Real-World Scenarios Arxiv 2025 [M] Au 367K DDL
DiffSeg30k DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection Arxiv 2025 [I] COCO SD2, SD3.5, SDXL, Flux.1, Glide, Kolors, HunyuanDiT1.1, Kandinsky 2.2 Au, Lo 30K DiffSeg30k
FakeParts FakeParts: a New Family of AI-Generated DeepFakes Arxiv 2025 [V] Au, Lo 81K FakeParts
ForensicHub ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization NeurIPS 2025 [I] ProGAN, StyleGAN, LDM, SDv1.4, SDv1.5, SDv2, SDXL, SD-ControlNet, MidJourney, ADM, GLIDE, VQDM, BigGAN Au, Lo 23 datasets, 42 models ForensicHub
LOKI LOKI: A Comprehensive Synthetic Data Detection Benchmark Using Large Multimodal Models ICLR 2025 [M] SORA, Keling, Open-Sora, FLUX, Midjourney, Stable Diffusion, Nerf-based, Gaussian-based, GPT-4o, Qwen-Max, Llama 3.1-405B, MusicGen, AudioLDM2... Au, Ex 18K LOKI
Chameleon A Sanity Check for AI-Generated Image Detection ICLR 2025 [I] Unsplash Midjourney, DALLE-3, Stable Diffusion (various LoRA fine-tuned) Au 26K Chameleon
WildFake WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection AAAI 2025 [I] Au 3.7M WildFake
WildRF Real-Time Deepfake Detection in the Real-World Arxiv 2024 [I] Reddit, X (Twitter), Facebook (real images) Reddit, X (Twitter), Facebook (social media deepfakes) Au WildRF
AIGCDetectBenchmark PatchCraft: Exploring Texture Patch for Efficient AI-generated Image Detection Arxiv 2024 [I] Au 100K AIGCDetectionBenchMark
GenVideo DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark Arxiv 2024 [V] Au 2.3M GenVideo
DRCT DRCT: Diffusion Reconstruction Contrastive Training towards Universal Detection of Diffusion Generated Images ICML 2024 [I] MSCOCO LDM, SDv1.4, SDv1.5, SDv2, SDXL, SD-ControlNet Au 2M DRCT-2M
GenImage GenImage: A Million-Scale Benchmark for Detecting AI-Generated Image NeurIPS 2023 [I] ImageNet, Wukong MidJourney, SDv1.4, SDv1.5, ADM, GLIDE, VQDM, BigGAN Au 2.7M GenImage
DF40 DF40: Toward Next-Generation Deepfake Detection NeurIPS 2024 [I], [V] Au 0.1M+ videos, 1M+ images DF40
Forensics-Bench Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models CVPR 2025 [I], [V], [M] Various public datasets GAN, Diffusion, VAE, RNN, Encoder-Decoder, Graphics-based Au, Lo 63K Forensics-Bench
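The two legends above work as a small tagging schema for the table. As a minimal sketch, a few of the rows can be encoded and filtered by modality and annotation type. The `Benchmark` record, `parse_tags`, and `select` are hypothetical names used only for illustration and are not part of this repository.

```python
from dataclasses import dataclass

# Tag meanings copied from the two legends above.
MODALITY = {"I": "Image", "V": "Video", "M": "Multi-modal"}
ANNOTATION = {"Au": "Authenticity", "Ex": "Explainability", "Lo": "Localization"}


@dataclass(frozen=True)
class Benchmark:
    """Hypothetical record for one table row (illustration only)."""
    name: str
    modality: frozenset
    annotations: frozenset


def parse_tags(cell, legend):
    """Split a cell such as '[I], [V]' or 'Au, Lo' into a set of known tags."""
    # Force a comma after every ']' so space-separated tags also split cleanly.
    parts = cell.replace("]", "],").split(",")
    tags = [p.strip(" []") for p in parts]
    tags = [t for t in tags if t]
    unknown = [t for t in tags if t not in legend]
    if unknown:
        raise ValueError(f"unknown tags: {unknown}")
    return frozenset(tags)


def select(benchmarks, modality=None, annotation=None):
    """Return names of benchmarks that carry the requested tags."""
    return [
        b.name
        for b in benchmarks
        if (modality is None or modality in b.modality)
        and (annotation is None or annotation in b.annotations)
    ]


# A handful of rows transcribed from the table above.
ROWS = [
    ("GenImage", "[I]", "Au"),
    ("GenVidBench", "[V]", "Au"),
    ("OpenSDI", "[I]", "Au, Lo"),
    ("Skyra", "[V]", "Au, Ex, Lo"),
    ("Forensics-Bench", "[I], [V], [M]", "Au, Lo"),
]
BENCHMARKS = [
    Benchmark(name, parse_tags(mod, MODALITY), parse_tags(ann, ANNOTATION))
    for name, mod, ann in ROWS
]

# Video benchmarks that also provide localization annotations.
print(select(BENCHMARKS, modality="V", annotation="Lo"))
```

With the five rows shown, the final call lists Skyra and Forensics-Bench.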



Research Papers

💡 Note: Papers are sorted by year (descending) within each category.
Modality Legend: [I] Image | [V] Video | [M] Multi-modal

MLLM-Based

This category focuses on utilizing Multimodal Large Language Models (MLLMs) like GPT-4V, LLaVA, or Qwen-VL to detect AI-generated content. These methods often provide natural language explanations (explainability) alongside binary detection.

Title Venue & Year Modality Highlights/Keywords Code
VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning Arxiv 2026 [V] Perception Pretext RL, Fact-based Reasoning, MintVid Dataset GitHub
Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning ICLR 2026(Oral) [I] Pattern-aware Reasoning, HydraFake Dataset Github
DF-LLaVA: Unlocking MLLMs for Synthetic Image Detection via Knowledge Injection and Conflict-Driven Self-Reflection Arxiv 2026 [I] Knowledge Injection, Self-Reflection N/A
DocShield: Towards AI Document Safety via Evidence-Grounded Agentic Reasoning Arxiv 2026 [M] Agentic Framework, Document Safety N/A
VidGuard-R1: AI-Generated Video Detection and Explanation via Reasoning MLLMs and RL ICLR 2026 [V] Multi-stage RL, Video Detection Dataset GitHub
FakeXplain: AI-Generated Image Detection via Human-Aligned Grounded Reasoning ICLR 2026 [I] Grounded Reasoning, Human-annotated Dataset N/A
AlignGemini: Generalizable AI-Generated Image Detection Through Task-Model Alignment Arxiv 2026 [I] Decoupling (Semantic & Pixel), AIGI-Now Dataset N/A
Zoom-In to Sort AI-Generated Images Out ICLR 2026 [I] Thinking with Images, MagniFake Dataset N/A
AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection Arxiv 2026 [I] Agentic framework Github
EvoGuard: An Extensible Agentic RL-based Framework for Practical and Evolving AI-Generated Image Detection Arxiv 2026 [I] Agentic Framework, Method Ensembling N/A
VIGIL: Part-Grounded Structured Reasoning for Generalizable Deepfake Detection Arxiv 2026 [I] Part-centric Forensic, OmniFake Dataset Project
GenVideoLens: Where LVLMs Fall Short in AI-Generated Video Detection? Arxiv 2026 [V] GenVideoLens benchmark N/A
Semantic Visual Anomaly Detection and Reasoning in AI-Generated Images ICLR 2026 [I] Semantic Anomaly Reasoning, AnomReason Dataset N/A
Fake-HR1: Rethinking Reasoning of Vision Language Model for Synthetic Image Detection Arxiv 2026 [I] Hybrid-Reasoning, Dual-mode Dataset N/A
MIRAGE: Towards AI-Generated Image Detection in the Wild Arxiv 2025 [I] Human Curation Dataset, Heuristic-to-Analytic Reasoning N/A
BusterX++: Towards Unified Cross-Modal AI-Generated Content Detection and Explanation with MLLM Arxiv 2025 [M] RL Post-training, Cross-Modal, Thinking Reward Mechanism Github
BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation Arxiv 2025 [V] GenBuster-200K Dataset, Cold Start + RL Training Github
REVEAL: Reasoning-enhanced Forensic Evidence Analysis for Explainable AI-generated Image Detection Arxiv 2025 [I] Chain-of-Evidence, Expert-grounded RL N/A
Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation NeurIPS 2025 [I] FakeClue Dataset, Fine-grained Artifact Clues, Artifact Explanation GitHub
AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models ICCV 2025 [I] Holmes-Set, Multi-Expert Jury, 3-Stage Training Pipeline Github
LEGION: Learning to Ground and Explain for Synthetic Image Detection ICCV 2025 [I] SynthScars Dataset, Defender & Controller, Image Refinement GitHub
Seeing Before Reasoning: A Unified Framework for Generalizable and Explainable Fake Image Detection Arxiv 2025 [I] Perception & Reasoning, ExplainFake-Bench N/A
SIDA: Social Media Image Deepfake Detection, Localization, and Explanation CVPR 2025 [I] SID-Set, Mask Prediction, Social Media Context Github
FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models ICLR 2025 [I] Explainable IFDL, Domain Tag-guided, Multi-modal Localization GitHub
VidGuard-R1: AI-Generated Video Detection and Explanation via Reasoning MLLMs and RL ICLR 2026 [V] GRPO, Time Artifacts, Quality Evolutionary Videos N/A
FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics Arxiv 2025 [I] FakeChain Dataset, FakeInstruct, Trace Evidence N/A
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors Arxiv 2024 [I] VQA, InstructBLIP, Soft Prompt-tuning, Zero-shot GitHub

Classification-Based

This category includes supervised learning approaches that train neural networks (CNNs, ViTs, etc.) specifically to classify authentic vs. AI-generated content. They usually focus on robustness, generalization, and feature extraction.

Title Venue & Year Modality Highlights/Keywords Code
Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach AAAI 2026 [I] Localized AIGC Detection, Forgery Amplification, Scene-aware Local Forgery GitHub
Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale ICLR 2026 [V] Native scale video processing, Massive realistic video dataset, Preserves subtle generation artifacts N/A
Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection TMM 2026 [I] Continual Learning, Kronecker-Factored Approximate Curvature N/A
Simplicity Prevails: The Emergence of Generalizable AIGI Detection in Visual Foundation Models Arxiv 2026 [I] Linear Probe, Vision Foundation Models, Emergent Forensic Capability N/A
MIRROR: Manifold Ideal Reference ReconstructOR for Generalizable AI-Generated Image Detection Arxiv 2026 [I] Manifold Reconstruction, Memory Bank, Human-AIGI Benchmark GitHub
No Pixel Left Behind: A Detail-Preserving Architecture for Robust High-Resolution AI-Generated Image Detection ICLR 2026 [I] Detail-preserving dual-path architecture, Multi-task learning, HiRes-50K benchmark N/A
All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning ICLR 2026 [I] Random Patch Replacement, Patch-wise Contrastive Learning N/A
OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild Arxiv 2025 [I] Mixture-of-Experts, Semantic-Artifact Decoupling, Mirage Dataset GitHub
DINO-Detect: A Simple yet Effective Framework for Blur-Robust AI-Generated Image Detection Arxiv 2025 [I] Blur Robustness, Knowledge Distillation, DINOv3 Github
D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection ICCV 2025 [I] Discrete Distribution Discrepancy-aware Transformer, Vector Quantized Variational AutoEncoder Github
Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation NeurIPS 2025 [V] Wavelet-band Augmentation, Forensic Frequency Artifacts, Single-generator Generalization GitHub
AI-Generated Video Detection via Perceptual Straightening NeurIPS 2025 [V] Perceptual Straightening, DINOv2, Temporal Curvature GitHub
Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection NeurIPS 2025 [V] Normalized Spatiotemporal Gradient (NSG), Maximum Mean Discrepancy (MMD) Github
Dual Data Alignment Makes AI-Generated Image Detector Easier Generalizable NeurIPS 2025 (Spotlight) [I] Dual-domain Alignment, Frequency-level Bias, VAE Reconstruction GitHub
Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection ICML 2025 (Oral) [I] SVD Orthogonal Subspace, Asymmetry Phenomenon, Parameter-efficient Fine-tuning GitHub
Any-Resolution AI-Generated Image Detection by Spectral Learning CVPR 2025 [I] Spectral Context Attention, Frequency Reconstruction, OOD Detection Github
A Bias-Free Training Paradigm for More General AI-generated Image Detection CVPR 2025 [I] Bias-Free, Semantic Alignment, Stable Diffusion Self-conditioning Github
Forensics Adapter: Adapting CLIP for Generalizable Face Forgery Detection CVPR 2025 [I] CLIP, Blending Boundaries, Forgery-aware Prompt Learning Github
Towards a Universal Synthetic Video Detector: From Face or Background Manipulations to Fully AI-Generated Content CVPR 2025 [V] SigLIP-So400M, Attention-Diversity Loss, Full-frame Manipulations N/A
Exploring Unbiased Deepfake Detection via Token-Level Shuffling and Mixing AAAI 2025 [I] Token-Level Shuffling, Contrastive Loss, Bias Mitigation N/A
DIP: Diffusion Learning of Inconsistency Pattern for General DeepFake Detection TMM 2025 [V] Direction-aware Attention, SpatioTemporal Invariant Loss N/A
Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Space Domain Learning AAAI 2024 [I] Frequency Domain, FFT, Frequency Conv Layer (FCL), Lightweight GitHub
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark Arxiv 2024 [V] Mamba, State Space Model, Long-range Spatiotemporal Inconsistency GitHub
Rethinking the Up-Sampling Operations in CNN-based Generative Network CVPR 2024 [I] Neighboring Pixel Relationships, Generalized Structural Artifacts Github
Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features Arxiv 2024 [V] GenVidDet, Optical Flow, Dual-Branch 3D Transformer N/A
FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake Detection Arxiv 2024 [I] Vulnerability-driven, Local Attention (L2-Att), Vision Transformer GitHub



Competitions

Competition Link Year Info
Robust AIGC Detection NTIRE 2026 Robust AI-Generated Image Detection in the Wild 2026 No restrictions on training data. Evaluated by ROC AUC on robust samples.
Robust Deepfake Detection NTIRE 2026 Robust Deepfake Detection Challenge 2026 No restrictions on training data.
The 6th Face Anti-spoofing Challenge The 6th Face Anti-Spoofing: Unified Physical-Digital Attacks Detection@ICCV2025 2025 No external data or pre-trained models allowed. Limited to a single DL model with under 100G FLOPs.
Detect AI vs. Human-Generated Images 2025 Women in AI (WAI) Kaggle Challenge 2025 Paired dataset of authentic and AI-generated images
The 5th Face Anti-spoofing Challenge 5th Chalearn Face Anti-spoofing Workshop and Challenge@CVPR2024 2024 UniAttackData+ for unified physical and digital attack detection.
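The NTIRE 2026 entry above is scored by ROC AUC. As a minimal sketch of that metric, the function below uses the rank-sum (Mann-Whitney U) formulation in pure Python. The label convention (1 = AI-generated, 0 = authentic) and the name `roc_auc` are assumptions made for illustration. The challenge's actual evaluation script and its definition of "robust samples" are not reproduced here.

```python
def roc_auc(labels, scores):
    """ROC AUC via the rank-sum (Mann-Whitney U) formulation.

    labels: 1 for AI-generated, 0 for authentic (convention assumed here).
    scores: higher means "more likely AI-generated".
    """
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have the same length")
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("both classes must be present")

    # Assign 1-based ranks by ascending score; tied scores share their average rank.
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1

    # U statistic of the positive class, normalized by the number of pos/neg pairs.
    pos_rank_sum = sum(r for r, y in zip(ranks, labels) if y == 1)
    u = pos_rank_sum - n_pos * (n_pos + 1) / 2
    return u / (n_pos * n_neg)


print(roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))  # 0.75
```

The result is the probability that a randomly chosen AI-generated sample scores higher than a randomly chosen authentic one, with ties counted as one half. This makes the metric independent of any decision threshold.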



Practical Detection Tools

  • Hive Moderation - Website
  • Tencent Zhuque AI Detection Assistant - Website
  • AI or Not - Website
  • Illuminarty - Website
  • Winston AI - Website
  • Is it AI? - Website
  • 中科睿鉴 (Zhongke Ruijian) - WeChat Mini Program (search for 睿鉴AI)



🏢 About Our Team

We are the Content Security Intelligence Team under Ant Group - Machine Intelligence. We are responsible for developing comprehensive content security and risk-mitigation capabilities for the Ant Group ecosystem, bridging the gap between rapidly evolving technologies and the urgent need for digital trust.

Why We Do It

In an era where synthetic media is increasingly sophisticated and pervasive, our research serves as a critical line of defense. By advancing AIGC detection technologies, we aim to:

  • Safeguard Digital Integrity: We provide essential defense mechanisms to protect the authenticity of visual content and combat the spread of misinformation in the digital space.
  • Empower Trust: Our solutions ensure the public can distinguish between genuine and synthetic media, fostering a more transparent and trustworthy digital ecosystem.
  • Industrial Application & Impact: We provide robust, scalable AIGC detection solutions for Ant Group’s diverse content platforms, including Lingguang, Jingtan, and many others.

🤝 Collaborators

We are honored to collaborate with esteemed researchers and scholars in the field of AI and Computer Vision. We deeply value these academic partnerships that drive our innovation:

  • Prof. Jun Wan (万军) | CASIA & UCAS
    • Research Interests: Biometrics, Face Anti-spoofing, Gesture Recognition, and Computer Vision.
    • [Homepage]
  • Prof. Jianfu Zhang (张健夫) | Shanghai Jiao Tong University
    • Research Interests: Computer Vision, Pattern Recognition, and Image/Video Analysis & Synthesis.
    • [Homepage]
  • Prof. Zhuosheng Zhang (张倬胜) | Shanghai Jiao Tong University
    • Research Interests: Natural Language Processing, Large Language Models, and Multi-modal Learning.
    • [Homepage]

📝 Academic Publications

  • VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning | (Submitted), 2026
    • Highlights: Detected AI-generated videos using perception pretext reinforcement learning to capture temporal inconsistencies.
    • [Paper] [Code]
  • Locate-Then-Examine: Grounded Region Reasoning Improves Detection of AI-Generated Images | CVPR'26, 2026
    • Highlights: Improved detection accuracy through a two-stage approach of localizing suspicious regions followed by detailed examination.
    • [Code]
  • GAMMA: Generalizable Alignment via Multi-task and Manipulation-Augmented Training for AI-Generated Image Detection | ICASSP'26, 2026
    • Highlights: Enhanced generalization through multi-task learning and manipulation-augmented training strategies.
    • [Paper]
  • FakeXplain: AI-Generated Image Detection via Human-Aligned Grounded Reasoning | ICLR'26, 2026
    • Highlights: Detected AI-generated images through human-aligned grounded reasoning, providing interpretable visual evidence.
    • [Paper] [Code]
  • Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning | ICLR'26 Oral, 2026
    • Highlights: Achieved generalizable deepfake detection through pattern-aware reasoning, improving robustness across diverse manipulation types.
    • [Paper] [Code]
  • Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection | IEEE TMM, 2025
    • Highlights: Proposed a continual learning framework that adapts to new generative models while mitigating catastrophic forgetting.
    • [Paper]
  • Towards explainable fake image detection with multi-modal large language models | ACM MM'25, 2025
    • Highlights: Leveraged multi-modal large language models to provide human-interpretable explanations for fake image detection.
    • [Paper]
  • WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection | AAAI'25 Oral, 2024
    • Highlights: Introduced the largest and most comprehensive AIGC image dataset at the time, providing a challenging benchmark for detection models.
    • [Paper]

🏆 Competition Achievements

  • 1st Place Winner | NTIRE 2026 Robust AI-Generated Image Detection in the Wild Challenge
    • Ranked first on ROC AUC in large-scale, real-world AI-generated image detection.
    • [Challenge Website]
  • 1st Place Winner | ICCV 2025 VQualA Challenge - Image Super-Resolution Generated Content Quality Assessment, 2025
    • Achieved top performance in the VQualA 2025 challenge focused on assessing the quality of super-resolution generated content.
    • [Paper 1] [Paper 2]
  • 1st Place Winner | CVPR 2024 Face Anti-Spoofing Challenge, 2024
    • Secured first place in the prestigious Face Anti-Spoofing Challenge at CVPR 2024, demonstrating state-of-the-art detection capabilities.
    • [Challenge Website]

🛠️ Open-Source Resources

  • WildFake - A large and comprehensive AIGC image detection dataset.
  • GenVideo - A large and comprehensive AIGC video detection dataset.
  • HydraFake - A large-scale challenging dataset for AI-generated image detection.
  • MintVid - A comprehensive video dataset for AIGC detection research.

✉️ Contact Us

For questions or collaborations, please contact:
