A curated collection of the latest research and resources on AI-Generated Image and Video Detection. This repository encompasses datasets, benchmarks, research papers, and practical detection tools.
🚀🚀🚀Contributions are welcome! If you find any missing papers, datasets, or tools, feel free to open an issue or submit a pull request.
- Hot Events
- Benchmarks & Datasets
- Research Papers
- Competitions
- Practical Detection Tools
- About Our Team
-
炸裂!技术力究极恐怖!米哈游蔡浩宇首款AI生成视频大模型LPM1.0曝光!自定义虚拟角色?支持多语言+无限真实+无限时长,实现自由对话、唱歌、表演等效果
-
A photo of Iran’s bombed schoolgirl graveyard went viral. Why did AI say it wasn’t real?
-
How AI Content Detection is Being Weaponized in the Iran War
-
Trump unveils his vision for 'Gaza Riviera' with AI video featuring belly dancers and luxury yachts
Modality Legend:
[I]Image |[V]Video |[M]Multi-modal
Annotation Type Legend:
Au: Authenticity |Ex: Explainability |Lo: Localization
| Benchmark | Paper | Venue & Year | Modality | Notes | Real Source | Fake Source/Generator | Annotation | Scale | Download |
|---|---|---|---|---|---|---|---|---|---|
| SciFigDetect | SciFigDetect: A Benchmark for AI-Generated Scientific Figure Detection | Arxiv 2026 | [I] |
Scientific Figure Detection | Nano Banana Pro, GPT-image-1.5 | Au |
150K | SciFigDetect | |
| ActivityForensics | ActivityForensics: A Comprehensive Benchmark for Localizing Manipulated Activity in Videos | CVPR 2026 | [V] |
Action-level AIGC in videos | Au |
6K | ActivityForensics | ||
| MintVid | VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning | Arxiv 2026 | [V] |
OpenVid, VFHQ, HDTF, TikTok | Jimeng3.0-Pro, Seedance, Kling2.5-Turbo, Sora2, TikTok, Youtube, etc. | Au |
4K | MintVid | |
| AIGVDBench | Your One-Stop Solution for AI-Generated Video Detection | CVPR 2026 | [V] |
OpenVid-HD | 31 generation models | Au |
440k | AIGVDBench | |
| HydraFake | Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning | ICLR 2026(Oral) | [I] |
FFHQ, VFHQ, CelebAHQ, FF++, etc. | GPT-4o, HailuoAI, ICLight, InfiniteYou, etc. | Au, Ex |
100K | HydraFake | |
| BR-Gen | Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach | AAAI 2026 | [I] |
Au, Lo |
150K | BR-Gen | |||
| HiResolution | No Pixel Left Behind: A Detail-Preserving Architecture for Robust High-Resolution AI-Generated Image Detection | ICLR 2026 | [I] |
Au |
50K | HiRes-50K | |||
| AIGI-Now | AlignGemini: Generalizable AI-Generated Image Detection Through Task-Model Alignment | Arxiv 2026 | [I] |
COCO | Nano Banana, GPT-4o, Jimeng, Kling, Minimax, etc. | Au |
18K | AIGI-Now | |
| RealChain | Beyond Artifacts: Real-Centric Envelope Modeling for Reliable AI-Generated Image Detection | Arxiv 2026 | [I] |
Au |
14K | RealChain | |||
| GenVidBench | GenVidBench: A 6-Million Benchmark for AI-Generated Video Detection | AAAI 2026 | [V] |
Au |
6M | GenVidBench | |||
| Skyra | Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning | CVPR 2026 | [V] |
Au, Ex, Lo |
4K | ViF-CoT-4K | |||
| So-Fake-Set | So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection | Arxiv 2025 | [I] |
F30k, WIDER, FFHQ, CelebA, OpenImages, COCO, OpenForensics | Qwen-image, GPT-4o, Nano Banana, Seedream3.0, Ideogram3.0, etc. | Au |
2M+ | So-Fake-Set So-Fake-OOD |
|
| GenBuster++ | BusterX++: Towards Unified Cross-Modal AI-Generated Content Detection and Explanation with MLLM | Arxiv 2025 | [M] |
Au |
4K | GenBuster++ | |||
| GenBuster | BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation | Arxiv 2025 | [I] |
Au |
200K | GenBuster-200K | |||
| AIGIBench | Is Artificial Intelligence Generated Image Detection a Solved Problem? | NeurIPS 2025 | [I] |
FFHQ, CelebA-HQ, Open Images V7 | Common generators & SocialRF, CommunityAI | Au |
200K | AIGIBench | |
| Ivy-Fake | IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection | Arxiv 2025 | [M] |
Au, Ex |
150K | Ivy-Fake | |||
| AEGIS | AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences | ACM MM 2025 | [V] |
Vript (YouTube, TikTok), DVF, YouTube (self-collected) | Stable Video Diffusion, CogVideoX-5B, I2VGen-XL, Pika, KLing, Sora | Au, Ex |
10K+ | AEGIS | |
| NeXT-IMDL | NeXT-IMDL: Build Benchmark for Next-Generation Image Manipulation Detection & Localization | Arxiv 2025 | [I] |
Flickr30k, COCO, OpenImages V7 | SD2-Inpainting, SDXL-Inpainting, FLUX-Inpainting, etc. | Au, Lo |
558K | NeXT-IMDL | |
| ARForensics | D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection | ICCV 2025 | [I] |
ImageNet | Infinity, Janus_Pro, RAR, Switti, VAR, LlamaGen, Open_MAGVIT2 | Au |
300k | ARForensics | |
| OpenSDI | OpenSDI: Spotting Diffusion-Generated Images in the Open World | CVPR 2025 | [I] |
Megalith-10M | SD1.5, SD2.1, SDXL, SD3, Flux.1 | Au, Lo |
300K | OpenSDI | |
| Community Forensics | Community Forensics: Using Thousands of Generators to Train Fake Image Detectors | CVPR 2025 | [I] |
LAION, ImageNet, COCO, FFHQ, CelebA, MetFaces, AFHQ, etc. | 4803 generators (Latent Diffusion, GAN, Autoregressive, Pixel Diffusion, Commercial) | Au |
2.7M | Community Forensics | |
| FakeClue | Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation | NeurIPS 2025 | [I] |
Au, Ex |
100K | FakeClue | |||
| XAIGID-RewardBench | Explainable AI-Generated Image Detection RewardBench | NeurIPS 2025 Workshop | [I] |
COCO-2017 | Imagen 4, Flux.1 Dev, Bagel, etc. | Au, Ex |
3K | XAIGID-RewardBench | |
| RewardData | Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs | Arxiv 2025 | [V] |
Au, Ex |
4.3K | RewardData | |||
| OpenFake | OPENFAKE: An Open Dataset and Platform Toward Real-World Deepfake Detection | Arxiv 2025 | [I] |
LAION-400M | SD 1.5/2.1/XL/3.5, Flux 1.0-dev/1.1-Pro/Schnell, Midjourney v6/v7, DALL·E 3, Imagen 3/4, GPT Image 1, Ideogram 3.0, Grok-2, HiDream-I1, Recraft v3, Chroma, and 10 community LoRA/finetune variants | Au |
~4M | OPENFAKE | |
| Video Reality Test | Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? | Arxiv 2025 | [V] |
YouTube ASMR (social media) | Veo3.1-Fast, Sora2, Wan2.2-A14B, Wan2.2-5B, OpenSora-V2, HunyuanVideo, StepVideo | Au |
149 real + dynamic fake | Video Reality Test | |
| DDL | DDL: A Dataset for Interpretable Deepfake Detection and Localization in Real-World Scenarios | Arxiv 2025 | [M] |
Au |
367K | DDL | |||
| DiffSeg30k | DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection | Arxiv 2025 | [I] |
COCO | SD2, SD3.5, SDXL, Flux.1, Glide, Kolors, HunyuanDiT1.1, Kandinsky 2.2 | Au, Lo |
30K | DiffSeg30k | |
| FakeParts | FakeParts: a New Family of AI-Generated DeepFakes | Arxiv 2025 | [V] |
Au, Lo |
81K | FakeParts | |||
| ForensicHub | ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization | NeurIPS 2025 | [I] |
ProGAN, StyleGAN, LDM, SDv1.4, SDv1.5, SDv2, SDXL, SD-ControlNet, MidJourney, ADM, GLIDE, VQDM, BigGAN | Au, Lo |
23 datasets 42 models |
ForensicHub | ||
| LOKI | LOKI: A Comprehensive Synthetic Data Detection Benchmark Using Large Multimodal Models | ICLR 2025 | [M] |
SORA, Keling, Open-Sora, FLUX, Midjourney, Stable Diffusion, Nerf-based, Gaussian-based, GPT-4o, Qwen-Max, Llama 3.1-405B, MusicGen, AudioLDM2... | Au, Ex |
18K | LOKI | ||
| Chameleon | A Sanity Check for AI-Generated Image Detection | ICLR 2025 | [I] |
Unsplash | Midjourney, DALLE-3, Stable Diffusion (various LoRA fine-tuned) | Au |
26K | Chameleon | |
| WildFake | WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection | AAAI 2025 | [I] |
Au |
3.7M | WildFake | |||
| WildRF | Real-Time Deepfake Detection in the Real-World | Arxiv 2024 | [I] |
Reddit, X (Twitter), Facebook (real images) | Reddit, X (Twitter), Facebook (social media deepfakes) | Au |
WidlRF | ||
| AIGCDetectBenchmark | PatchCraft: Exploring Texture Patch for Efficient AI-generated Image Detection | Arxiv 2024 | [I] |
Au |
100K | AIGCDetectionBenchMark | |||
| GenVideo | DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark | Arxiv 2024 | [V] |
Au |
2.3M | GenVideo | |||
| DRCT | Drct: Diffusion reconstruction contrastive training towards universal detection of diffusion generated images | ICML 2024 | [I] |
MSCOCO | LDM, SDv1.4, SDv1.5, SDv2, SDXL, SD-ControlNet | Au |
2M | DRCT-2M | |
| GenImage | GenImage: A Million-Scale Benchmark for Detecting AI-Generated Image | NeurIPS 2023 | [I] |
ImageNet, Wukong | MidJourney, SDv1.4, SDv1.5, ADM, GLIDE, VQDM, BigGAN | Au |
2.7M | GenImage | |
| DF40 | DF40: Toward Next-Generation Deepfake Detection | NeurIPS 2024 | [I] [V] |
Au |
0.1M+ videos, 1M+ images | DF40 | |||
| Forensics-Bench | Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models | CVPR 2025 | [I], [V], [M] |
Various public datasets | GAN, Diffusion, VAE, RNN, Encoder-Decoder, Graphics-based | Au, Lo |
63K | Forensics-Bench |
💡 Note: Papers are sorted by year (descending) within each category.
Modality Legend:[I]Image |[V]Video |[M]Multi-modal
This category focuses on utilizing Multimodal Large Language Models (MLLMs) like GPT-4V, LLaVA, or Qwen-VL to detect AI-generated content. These methods often provide natural language explanations (explainability) alongside binary detection.
This category includes supervised learning approaches that train neural networks (CNNs, ViTs, etc.) specifically to classify authentic vs. AI-generated content. They usually focus on robustness, generalization, and feature extraction.
| Competition | Link | Year | Info |
|---|---|---|---|
| Robust AIGC Detection | NTIRE 2026 Robust AI-Generated Image Detection in the Wild | 2026 | No restrictions on training data. Evaluate ROC AUC metrics on robust samples. |
| Robust Deepfake Detection | NTIRE 2026 Robust Deepfake Detection Challenge | 2026 | No restrictions on training data. |
| The 6th Face Anti-spoofing Challenge | The 6th Face Anti-Spoofing: Unified Physical-Digital Attacks Detection@ICCV2025 | 2025 | No external data or pre-trained models allowed. Limited to a single DL model with under 100G FLOPs. |
| Detect AI vs. Human-Generated Images | 2025 Women in AI (WAI) Kaggle Challenge | 2025 | Paired dataset of authentic and AI-generated images |
| The 5th Face Anti-spoofing Challenge | 5th Chalearn Face Anti-spoofing Workshop and Challenge@CVPR2024 | 2024 | UniAttackData+ for unified physical and digital attack detection. |
- Hive Moderation - Website
- Tencent Zhuque AI Detection Assistant - Website
- AI or Not - Website
- Illuminarty - Website
- Winston AI - Website
- Is it AI? - Website
- 中科睿鉴 (Zhongke Ruijian) - 微信小程序搜索 睿鉴AI
We are the Content Security Intelligence Team under Ant Group - Machine Intelligence. We are responsible for developing comprehensive content security and risk-mitigation capabilities for the Ant Group ecosystem, bridging the gap between rapidly evolving technologies and the urgent need for digital trust.
In an era where synthetic media is increasingly sophisticated and pervasive, our research serves as a critical line of defense. By advancing AIGC detection technologies, we aim to:
- Safeguard Digital Integrity: We provide essential defense mechanisms to protect the authenticity of visual content and combat the spread of misinformation in the digital space.
- Empower Trust: Our solutions ensure the public can distinguish between genuine and synthetic media, fostering a more transparent and trustworthy digital ecosystem.
- Industrial Application & Impact: We provide robust, scalable aigc detection solutions for Ant Group’s diverse content platforms, including Lingguang, Jingtan, and many others.
We are honored to collaborate with esteemed researchers and scholars in the field of AI and Computer Vision. We deeply value these academic partnerships that drive our innovation:
- Prof. Jun Wan (万军) | CASIA & UCAS
- Research Interests: Biometrics, Face Anti-spoofing, Gesture Recognition, and Computer Vision.
- [Homepage]
- Prof. Jianfu Zhang (张健夫) | Shanghai Jiao Tong University
- Research Interests: Computer Vision, Pattern Recognition, and Image/Video Analysis & Synthesis.
- [Homepage]
- Prof. Zhuosheng Zhang (张倬胜) | Shanghai Jiao Tong University
- Research Interests: Natural Language Processing, Large Language Models, and Multi-modal Learning.
- [Homepage]
- VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning | (Submitted), 2026
- Locate-Then-Examine: Grounded Region Reasoning Improves Detection of AI-Generated Images | CVPR'26, 2026
- Highlights: Improved detection accuracy through a two-stage approach of localizing suspicious regions followed by detailed examination.
- [Code]
- GAMMA: Generalizable Alignment via Multi-task and Manipulation-Augmented Training for AI-Generated Image Detection | ICASSP'26, 2026
- Highlights: Enhanced generalization through multi-task learning and manipulation-augmented training strategies.
- [Paper]
- FakeXplain: AI-Generated Image Detection via Human-Aligned Grounded Reasoning | ICLR'26, 2026
- Veritas: Generalizable deepfake detection via pattern-aware reasoning | ICLR'26 Oral, 2026
- Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection | IEEE TMM, 2025
- Highlights: Proposed a continual learning framework that adapts to new generative models while mitigating catastrophic forgetting.
- [Paper]
- Towards explainable fake image detection with multi-modal large language models | ACM MM'25, 2025
- Highlights: Leveraged multi-modal large language models to provide human-interpretable explanations for fake image detection.
- [Paper]
- WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection | AAAI'25 Oral, 2024
- Highlights: Introduced the largest and most comprehensive AIGC image dataset at the time, providing a challenging benchmark for detection models.
- [Paper]
- 1st Place Winner | NTIRE 2026 Robust AI-Generated Image Detection in the Wild Challenge
- Secured the top rank in ROC AUC for delivering superior performance in large-scale, real-world AI-generated image detection.
- [Challenge Website]
- 1st Place Winner | ICCV 2025 VQualA Challenge - Image Super-Resolution Generated Content Quality Assessment, 2025
- 1st Place Winner | CVPR 2024 Face Anti-Spoofing Challenge, 2024
- Secured first place in the prestigious Face Anti-Spoofing Challenge at CVPR 2024, demonstrating state-of-the-art detection capabilities.
- [Challenge Website]
- WildFake - A large and comprehensive AIGC image detection dataset.
- GenVideo - A large and comprehensive AIGC video detection dataset.
- HydraFake - A large-scale challenging dataset for AI-generated image detection.
- MintVid - A comprehensive video dataset for AIGC detection research.
For questions or collaborations, please contact:
- Zijian Yu: [email protected]
- Hao Tan: [email protected]
- Jun Lan: [email protected]