Awesome AIGC Image/Video Detection

A curated collection of the latest research and resources on AI-Generated Image and Video Detection. This repository encompasses datasets, benchmarks, research papers, and practical detection tools.

🚀🚀🚀Contributions are welcome! If you find any missing papers, datasets, or tools, feel free to open an issue or submit a pull request.

🔥 Hot Events

Benchmarks & Datasets

Modality Legend: [I] Image | [V] Video | [M] Multi-modal

Annotation Type Legend: Au: Authenticity | Ex: Explainability | Lo: Localization

Benchmark	Paper	Venue & Year	Modality	Notes	Real Source	Fake Source/Generator	Annotation	Scale	Download
SciFigDetect	SciFigDetect: A Benchmark for AI-Generated Scientific Figure Detection	Arxiv 2026	`[I]`	Scientific Figure Detection		Nano Banana Pro, GPT-image-1.5	`Au`	150K	SciFigDetect
ActivityForensics	ActivityForensics: A Comprehensive Benchmark for Localizing Manipulated Activity in Videos	CVPR 2026	`[V]`	Action-level AIGC in videos			`Au`	6K	ActivityForensics
MintVid	VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning	Arxiv 2026	`[V]`		OpenVid, VFHQ, HDTF, TikTok	Jimeng3.0-Pro, Seedance, Kling2.5-Turbo, Sora2, TikTok, Youtube, etc.	`Au`	4K	MintVid
AIGVDBench	Your One-Stop Solution for AI-Generated Video Detection	CVPR 2026	`[V]`		OpenVid-HD	31 generation models	`Au`	440k	AIGVDBench
HydraFake	Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning	ICLR 2026(Oral)	`[I]`		FFHQ, VFHQ, CelebAHQ, FF++, etc.	GPT-4o, HailuoAI, ICLight, InfiniteYou, etc.	`Au`, `Ex`	100K	HydraFake
BR-Gen	Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach	AAAI 2026	`[I]`				`Au`, `Lo`	150K	BR-Gen
HiResolution	No Pixel Left Behind: A Detail-Preserving Architecture for Robust High-Resolution AI-Generated Image Detection	ICLR 2026	`[I]`				`Au`	50K	HiRes-50K
AIGI-Now	AlignGemini: Generalizable AI-Generated Image Detection Through Task-Model Alignment	Arxiv 2026	`[I]`		COCO	Nano Banana, GPT-4o, Jimeng, Kling, Minimax, etc.	`Au`	18K	AIGI-Now
RealChain	Beyond Artifacts: Real-Centric Envelope Modeling for Reliable AI-Generated Image Detection	Arxiv 2026	`[I]`				`Au`	14K	RealChain
GenVidBench	GenVidBench: A 6-Million Benchmark for AI-Generated Video Detection	AAAI 2026	`[V]`				`Au`	6M	GenVidBench
Skyra	Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning	CVPR 2026	`[V]`				`Au`, `Ex`, `Lo`	4K	ViF-CoT-4K
So-Fake-Set	So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection	Arxiv 2025	`[I]`		F30k, WIDER, FFHQ, CelebA, OpenImages, COCO, OpenForensics	Qwen-image, GPT-4o, Nano Banana, Seedream3.0, Ideogram3.0, etc.	`Au`	2M+	So-Fake-Set So-Fake-OOD
GenBuster++	BusterX++: Towards Unified Cross-Modal AI-Generated Content Detection and Explanation with MLLM	Arxiv 2025	`[M]`				`Au`	4K	GenBuster++
GenBuster	BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation	Arxiv 2025	`[I]`				`Au`	200K	GenBuster-200K
AIGIBench	Is Artificial Intelligence Generated Image Detection a Solved Problem?	NeurIPS 2025	`[I]`		FFHQ, CelebA-HQ, Open Images V7	Common generators & SocialRF, CommunityAI	`Au`	200K	AIGIBench
Ivy-Fake	IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection	Arxiv 2025	`[M]`				`Au`, `Ex`	150K	Ivy-Fake
AEGIS	AEGIS: Authenticity Evaluation Benchmark for AI-Generated Video Sequences	ACM MM 2025	`[V]`		Vript (YouTube, TikTok), DVF, YouTube (self-collected)	Stable Video Diffusion, CogVideoX-5B, I2VGen-XL, Pika, KLing, Sora	`Au`, `Ex`	10K+	AEGIS
NeXT-IMDL	NeXT-IMDL: Build Benchmark for Next-Generation Image Manipulation Detection & Localization	Arxiv 2025	`[I]`		Flickr30k, COCO, OpenImages V7	SD2-Inpainting, SDXL-Inpainting, FLUX-Inpainting, etc.	`Au`, `Lo`	558K	NeXT-IMDL
ARForensics	D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection	ICCV 2025	`[I]`		ImageNet	Infinity, Janus_Pro, RAR, Switti, VAR, LlamaGen, Open_MAGVIT2	`Au`	300k	ARForensics
OpenSDI	OpenSDI: Spotting Diffusion-Generated Images in the Open World	CVPR 2025	`[I]`		Megalith-10M	SD1.5, SD2.1, SDXL, SD3, Flux.1	`Au`, `Lo`	300K	OpenSDI
Community Forensics	Community Forensics: Using Thousands of Generators to Train Fake Image Detectors	CVPR 2025	`[I]`		LAION, ImageNet, COCO, FFHQ, CelebA, MetFaces, AFHQ, etc.	4803 generators (Latent Diffusion, GAN, Autoregressive, Pixel Diffusion, Commercial)	`Au`	2.7M	Community Forensics
FakeClue	Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation	NeurIPS 2025	`[I]`				`Au`, `Ex`	100K	FakeClue
XAIGID-RewardBench	Explainable AI-Generated Image Detection RewardBench	NeurIPS 2025 Workshop	`[I]`		COCO-2017	Imagen 4, Flux.1 Dev, Bagel, etc.	`Au`, `Ex`	3K	XAIGID-RewardBench
RewardData	Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs	Arxiv 2025	`[V]`				`Au`, `Ex`	4.3K	RewardData
OpenFake	OPENFAKE: An Open Dataset and Platform Toward Real-World Deepfake Detection	Arxiv 2025	`[I]`		LAION-400M	SD 1.5/2.1/XL/3.5, Flux 1.0-dev/1.1-Pro/Schnell, Midjourney v6/v7, DALL·E 3, Imagen 3/4, GPT Image 1, Ideogram 3.0, Grok-2, HiDream-I1, Recraft v3, Chroma, and 10 community LoRA/finetune variants	`Au`	~4M	OPENFAKE
Video Reality Test	Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?	Arxiv 2025	`[V]`		YouTube ASMR (social media)	Veo3.1-Fast, Sora2, Wan2.2-A14B, Wan2.2-5B, OpenSora-V2, HunyuanVideo, StepVideo	`Au`	149 real + dynamic fake	Video Reality Test
DDL	DDL: A Dataset for Interpretable Deepfake Detection and Localization in Real-World Scenarios	Arxiv 2025	`[M]`				`Au`	367K	DDL
DiffSeg30k	DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection	Arxiv 2025	`[I]`		COCO	SD2, SD3.5, SDXL, Flux.1, Glide, Kolors, HunyuanDiT1.1, Kandinsky 2.2	`Au`, `Lo`	30K	DiffSeg30k
FakeParts	FakeParts: a New Family of AI-Generated DeepFakes	Arxiv 2025	`[V]`				`Au`, `Lo`	81K	FakeParts
ForensicHub	ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and Localization	NeurIPS 2025	`[I]`			ProGAN, StyleGAN, LDM, SDv1.4, SDv1.5, SDv2, SDXL, SD-ControlNet, MidJourney, ADM, GLIDE, VQDM, BigGAN	`Au`, `Lo`	23 datasets 42 models	ForensicHub
LOKI	LOKI: A Comprehensive Synthetic Data Detection Benchmark Using Large Multimodal Models	ICLR 2025	`[M]`			SORA, Keling, Open-Sora, FLUX, Midjourney, Stable Diffusion, Nerf-based, Gaussian-based, GPT-4o, Qwen-Max, Llama 3.1-405B, MusicGen, AudioLDM2...	`Au`, `Ex`	18K	LOKI
Chameleon	A Sanity Check for AI-Generated Image Detection	ICLR 2025	`[I]`		Unsplash	Midjourney, DALLE-3, Stable Diffusion (various LoRA fine-tuned)	`Au`	26K	Chameleon
WildFake	WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection	AAAI 2025	`[I]`				`Au`	3.7M	WildFake
WildRF	Real-Time Deepfake Detection in the Real-World	Arxiv 2024	`[I]`		Reddit, X (Twitter), Facebook (real images)	Reddit, X (Twitter), Facebook (social media deepfakes)	`Au`		WidlRF
AIGCDetectBenchmark	PatchCraft: Exploring Texture Patch for Efficient AI-generated Image Detection	Arxiv 2024	`[I]`				`Au`	100K	AIGCDetectionBenchMark
GenVideo	DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark	Arxiv 2024	`[V]`				`Au`	2.3M	GenVideo
DRCT	Drct: Diffusion reconstruction contrastive training towards universal detection of diffusion generated images	ICML 2024	`[I]`		MSCOCO	LDM, SDv1.4, SDv1.5, SDv2, SDXL, SD-ControlNet	`Au`	2M	DRCT-2M
GenImage	GenImage: A Million-Scale Benchmark for Detecting AI-Generated Image	NeurIPS 2023	`[I]`		ImageNet, Wukong	MidJourney, SDv1.4, SDv1.5, ADM, GLIDE, VQDM, BigGAN	`Au`	2.7M	GenImage
DF40	DF40: Toward Next-Generation Deepfake Detection	NeurIPS 2024	`[I]` `[V]`				`Au`	0.1M+ videos, 1M+ images	DF40
Forensics-Bench	Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models	CVPR 2025	`[I]`, `[V]`, `[M]`		Various public datasets	GAN, Diffusion, VAE, RNN, Encoder-Decoder, Graphics-based	`Au`, `Lo`	63K	Forensics-Bench

⬆ Back to Top

Research Papers

💡 Note: Papers are sorted by year (descending) within each category.
Modality Legend: [I] Image | [V] Video | [M] Multi-modal

MLLM-Based

This category focuses on utilizing Multimodal Large Language Models (MLLMs) like GPT-4V, LLaVA, or Qwen-VL to detect AI-generated content. These methods often provide natural language explanations (explainability) alongside binary detection.

Title	Venue & Year	Modality	Highlights/Keywords	Code
VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning	Arxiv 2026	`[V]`	Perception Pretext RL, Fact-based Reasoning, MintVid Dataset	GitHub
Veritas: Generalizable deepfake detection via pattern-aware reasoning	ICLR 2026(Oral)	`[I]`	Pattern-aware Reasoning, HydraFake Dataset	Github
DF-LLaVA: Unlocking MLLMs for Synthetic Image Detection via Knowledge Injection and Conflict-Driven Self-Reflection	Arxiv 2026	`[I]`	Knowledge Injection, Self-Reflection	N/A
DocShield: Towards AI Document Safety via Evidence-Grounded Agentic Reasoning	Arxiv 2026	`[M]`	Agentic Framework, Document Safety	N/A
VidGuard-R1: AI-Generated Video Detection and Explanation via Reasoning MLLMs and RL	ICLR 2026	`[V]`	Multi-stage RL, Video Detection Dataset	GitHub
FakeXplain: AI-Generated Image Detection via Human-Aligned Grounded Reasoning	ICLR 2026	`[I]`	Grounded Reasoning, Human-annotated Dataset	N/A
AlignGemini: Generalizable AI-Generated Image Detection Through Task-Model Alignment	Arxiv 2026	`[I]`	Decoupling (Semantic & Pixel), AIGI-Now Dataset	N/A
Zoom-In to Sort AI-Generated Images Out	ICLR 2026	`[I]`	Thinking with Images, MagniFake Dataset	N/A
AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection	Arxiv 2026	`[I]`	Agentic framework	Github
EvoGuard: An Extensible Agentic RL-based Framework for Practical and Evolving AI-Generated Image Detection	Arxiv 2026	`[I]`	Agentic Framework, Method Ensembling	N/A
VIGIL: Part-Grounded Structured Reasoning for Generalizable Deepfake Detection	Arxiv 2026	`[I]`	Part-centric Forensic, OmniFake Dataset	Project
GenVideoLens: Where LVLMs Fall Short in AI-Generated Video Detection?	Arxiv 2026	`[V]`	GenVideoLens benchmark	N/A
Semantic Visual Anomaly Detection and Reasoning in AI-Generated Images	ICLR 2026	`[I]`	Semantic Anomaly Reasoning, AnomReason Dataset	N/A
FAKE-HR1: RETHINKING REASONING OF VISION LANGUAGE MODEL FOR SYNTHETIC IMAGE DETECTION	Arxiv 2026	`[I]`	Hybrid-Reasoning, Dual-mode Dataset	N/A
MIRAGE: Towards AI-Generated Image Detection in the Wild	Arxiv 2025	`[I]`	Human Curation Dataset, Heuristic-to-Analytic Reasoning	N/A
BusterX++: Towards Unified Cross-Modal AI-Generated Content Detection and Explanation with MLLM	Arxiv 2025	`[M]`	RL Post-training, Cross-Modal, Thinking Reward Mechanism	Github
BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation	Arxiv 2025	`[V]`	GenBuster-200K Dataset, Cold Start + RL Training	Github
REVEAL: Reasoning-enhanced Forensic Evidence Analysis for Explainable AI-generated Image Detection	Arxiv 2025	`[I]`	Chain-of-Evidence, Expert-grounded RL	N/A
Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation	NeurIPS 2025	`[I]`	FakeClue Dataset, Fine-grained Artifact Clues, Artifact Explanation	GitHub
AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models	ICCV 2025	`[I]`	Holmes-Set, Multi-Expert Jury, 3-Stage Training Pipeline	Github
LEGION: Learning to Ground and Explain for Synthetic Image Detection	ICCV 2025	`[I]`	SynthScars Dataset, Defender & Controller, Image Refinement	GitHub
Seeing Before Reasoning: A Unified Framework for Generalizable and Explainable Fake Image Detection	Arxiv 2025	`[I]`	Perception & Reasoning, ExplainFake-Bench	N/A
SIDA: Social Media Image Deepfake Detection, Localization, and Explanation	CVPR 2025	`[I]`	SID-Set, Mask Prediction, Social Media Context	Github
FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models	ICLR 2025	`[I]`	Explainable IFDL, Domain Tag-guided, Multi-modal Localization	GitHub
VidGuard-R1: AI-Generated Video Detection and Explanation via Reasoning MLLMs and RL	ICLR 2026	`[V]`	GRPO, Time Artifacts, Quality Evolutionary Videos	N/A
FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics	Arxiv 2025	`[I]`	FakeChain Dataset, FakeInstruct, Trace Evidence	N/A
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors	Arxiv 2024	`[I]`	VQA, InstructBLIP, Soft Prompt-tuning, Zero-shot	GitHub

Classification-Based

This category includes supervised learning approaches that train neural networks (CNNs, ViTs, etc.) specifically to classify authentic vs. AI-generated content. They usually focus on robustness, generalization, and feature extraction.

Title	Venue & Year	Modality	Highlights/Keywords	Code
Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach	AAAI 2026	`[I]`	Localized AIGC Detection, Forgery Amplification, Scene-aware Local Forgery	GitHub
Preserving Forgery Artifacts: AI-Generated Video Detection at Native Scale	ICLR 2026	`[V]`	Native scale video processing, Massive realistic video dataset, Preserves subtle generation artifacts	N/A
Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection	TMM 2026	`[I]`	Continual Learning, Kronecker-Factored Approximate Curvature	N/A
Simplicity Prevails: The Emergence of Generalizable AIGI Detection in Visual Foundation Models	Arxiv 2026	`[I]`	Linear Probe, Vision Foundation Models, Emergent Forensic Capability	N/A
MIRROR: Manifold Ideal Reference ReconstructOR for Generalizable AI-Generated Image Detection	Arxiv 2026	`[I]`	Manifold Reconstruction, Memory Bank, Human-AIGI Benchmark	GitHub
No Pixel Left Behind: A Detail-Preserving Architecture for Robust High-Resolution AI-Generated Image Detection	ICLR 2026	`[I]`	Detail-preserving dual-path architecture, Multi-task learning, HiRes-50K benchmark	N/A
All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning	ICLR 2026	`[I]`	Random Patch Replacement, Patch-wise Contrastive Learning	N/A
OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild	Arxiv 2025	`[I]`	Mixture-of-Experts, Semantic-Artifact Decoupling, Mirage Dataset	GitHub
DINO-Detect: A Simple yet Effective Framework for Blur-Robust AI-Generated Image Detection	Arxiv 2025	`[I]`	Blur Robustness, Knowledge Distillation, DINOv3	Github
D3QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection	ICCV 2025	`[I]`	Discrete Distribution Discrepancy-aware Transformer, Vector Quantized Variational AutoEncoder	Github
Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation	NeurIPS 2025	`[V]`	Wavelet-band Augmentation, Forensic Frequency Artifacts, Single-generator Generalization	GitHub
AI-Generated Video Detection via Perceptual Straightening	NeurIPS 2025	`[V]`	Perceptual Straightening, DINOv2, Temporal Curvature	GitHub
Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection	NeurIPS 2025	`[V]`	Normalized Spatiotemporal Gradient (NSG), Maximum Mean Discrepancy (MMD)	Github
Dual Data Alignment Makes AI-Generated Image Detector Easier Generalizable	NeurIPS 202 (Spotlight)	`[I]`	Dual-domain Alignment, Frequency-level Bias, VAE Reconstruction	GitHub
Orthogonal Subspace Decomposition for Generalizable AI-Generated Image Detection	ICML 2025 (Oral)	`[I]`	SVD Orthogonal Subspace, Asymmetry Phenomenon, Parameter-efficient Fine-tuning	GitHub
Any-Resolution AI-Generated Image Detection by Spectral Learning	CVPR 2025	`[I]`	Spectral Context Attention, Frequency Reconstruction, OOD Detection	Github
A Bias-Free Training Paradigm for More General AI-generated Image Detection	CVPR 2025	`[I]`	Bias-Free, Semantic Alignment, Stable Diffusion Self-conditioning	Github
Forensics Adapter: Adapting CLIP for Generalizable Face Forgery Detection	CVPR 2025	`[I]`	CLIP, Blending Boundaries, Forgery-aware Prompt Learning	Github
Towards a Universal Synthetic Video Detector: From Face or Background Manipulations to Fully AI-Generated Content	CVPR 2025	`[V]`	SigLIP-So400M, Attention-Diversity Loss, Full-frame Manipulations	N/A
Exploring Unbiased Deepfake Detection via Token-Level Shuffling and Mixing	AAAI 2025	`[I]`	Token-Level Shuffling, Contrastive Loss, Bias Mitigation	N/A
DIP: Diffusion Learning of Inconsistency Pattern for General DeepFake Detection	TMM 2025	`[V]`	Direction-aware Attention, SpatioTemporal Invariant Loss	N/A
Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Space Domain Learning	AAAI 2024	`[I]`	Frequency Domain, FFT, Frequency Conv Layer (FCL), Lightweight	GitHub
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark	Arxiv 2024	`[V]`	Mamba, State Space Model, Long-range Spatiotemporal Inconsistency	GitHub
Rethinking the Up-Sampling Operations in CNN-based Generative Network	CVPR 2024	`[I]`	Neighboring Pixel Relationships, Generalized Structural Artifacts	Github
Distinguish Any Fake Videos: Unleashing the Power of Large-scale Data and Motion Features	Arxiv 2024	`[V]`	GenVidDet, Optical Flow, Dual-Branch 3D Transformer	N/A
FakeFormer: Efficient Vulnerability-Driven Transformers for Generalisable Deepfake Detection	Arxiv 2024	`[I]`	Vulnerability-driven, Local Attention (L2-Att), Vision Transformer	GitHub

⬆ Back to Top

Competitions

Competition	Link	Year	Info
Robust AIGC Detection	NTIRE 2026 Robust AI-Generated Image Detection in the Wild	2026	No restrictions on training data. Evaluate ROC AUC metrics on robust samples.
Robust Deepfake Detection	NTIRE 2026 Robust Deepfake Detection Challenge	2026	No restrictions on training data.
The 6th Face Anti-spoofing Challenge	The 6th Face Anti-Spoofing: Unified Physical-Digital Attacks Detection@ICCV2025	2025	No external data or pre-trained models allowed. Limited to a single DL model with under 100G FLOPs.
Detect AI vs. Human-Generated Images	2025 Women in AI (WAI) Kaggle Challenge	2025	Paired dataset of authentic and AI-generated images
The 5th Face Anti-spoofing Challenge	5th Chalearn Face Anti-spoofing Workshop and Challenge@CVPR2024	2024	UniAttackData+ for unified physical and digital attack detection.

⬆ Back to Top

Practical Detection Tools

Hive Moderation - Website
Tencent Zhuque AI Detection Assistant - Website
AI or Not - Website
Illuminarty - Website
Winston AI - Website
Is it AI? - Website
中科睿鉴 (Zhongke Ruijian) - 微信小程序搜索 睿鉴AI

⬆ Back to Top

🏢 About Our Team

We are the Content Security Intelligence Team under Ant Group - Machine Intelligence. We are responsible for developing comprehensive content security and risk-mitigation capabilities for the Ant Group ecosystem, bridging the gap between rapidly evolving technologies and the urgent need for digital trust.

Why We Do It

In an era where synthetic media is increasingly sophisticated and pervasive, our research serves as a critical line of defense. By advancing AIGC detection technologies, we aim to:

Safeguard Digital Integrity: We provide essential defense mechanisms to protect the authenticity of visual content and combat the spread of misinformation in the digital space.
Empower Trust: Our solutions ensure the public can distinguish between genuine and synthetic media, fostering a more transparent and trustworthy digital ecosystem.
Industrial Application & Impact: We provide robust, scalable aigc detection solutions for Ant Group’s diverse content platforms, including Lingguang, Jingtan, and many others.

🤝 Collaborators

We are honored to collaborate with esteemed researchers and scholars in the field of AI and Computer Vision. We deeply value these academic partnerships that drive our innovation:

Prof. Jun Wan (万军) | CASIA & UCAS
- Research Interests: Biometrics, Face Anti-spoofing, Gesture Recognition, and Computer Vision.
- [Homepage]
Prof. Jianfu Zhang (张健夫) | Shanghai Jiao Tong University
- Research Interests: Computer Vision, Pattern Recognition, and Image/Video Analysis & Synthesis.
- [Homepage]
Prof. Zhuosheng Zhang (张倬胜) | Shanghai Jiao Tong University
- Research Interests: Natural Language Processing, Large Language Models, and Multi-modal Learning.
- [Homepage]

📝 Academic Publications

VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning | (Submitted), 2026
- Highlights: Detected AI-generated videos using perception pretext reinforcement learning to capture temporal inconsistencies.
- [Paper] [Code]
Locate-Then-Examine: Grounded Region Reasoning Improves Detection of AI-Generated Images | CVPR'26, 2026
- Highlights: Improved detection accuracy through a two-stage approach of localizing suspicious regions followed by detailed examination.
- [Code]
GAMMA: Generalizable Alignment via Multi-task and Manipulation-Augmented Training for AI-Generated Image Detection | ICASSP'26, 2026
- Highlights: Enhanced generalization through multi-task learning and manipulation-augmented training strategies.
- [Paper]
FakeXplain: AI-Generated Image Detection via Human-Aligned Grounded Reasoning | ICLR'26, 2026
- Highlights: Detected AI-generated images through human-aligned grounded reasoning, providing interpretable visual evidence.
- [Paper] [Code]
Veritas: Generalizable deepfake detection via pattern-aware reasoning | ICLR'26 Oral, 2026
- Highlights: Achieved generalizable deepfake detection through pattern-aware reasoning, improving robustness across diverse manipulation types.
- [Paper] [Code]
Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection | IEEE TMM, 2025
- Highlights: Proposed a continual learning framework that adapts to new generative models while mitigating catastrophic forgetting.
- [Paper]
Towards explainable fake image detection with multi-modal large language models | ACM MM'25, 2025
- Highlights: Leveraged multi-modal large language models to provide human-interpretable explanations for fake image detection.
- [Paper]
WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection | AAAI'25 Oral, 2024
- Highlights: Introduced the largest and most comprehensive AIGC image dataset at the time, providing a challenging benchmark for detection models.
- [Paper]

🏆 Competition Achievements

1st Place Winner | NTIRE 2026 Robust AI-Generated Image Detection in the Wild Challenge
- Secured the top rank in ROC AUC for delivering superior performance in large-scale, real-world AI-generated image detection.
- [Challenge Website]
1st Place Winner | ICCV 2025 VQualA Challenge - Image Super-Resolution Generated Content Quality Assessment, 2025
- Achieved top performance in the VQualA 2025 challenge focused on assessing the quality of super-resolution generated content.
- [Paper 1] [Paper 2]
1st Place Winner | CVPR 2024 Face Anti-Spoofing Challenge, 2024
- Secured first place in the prestigious Face Anti-Spoofing Challenge at CVPR 2024, demonstrating state-of-the-art detection capabilities.
- [Challenge Website]

🛠️ Open-Source Resources

WildFake - A large and comprehensive AIGC image detection dataset.
- [ModelScope]
GenVideo - A large and comprehensive AIGC video detection dataset.
- [ModelScope]
HydraFake - A large-scale challenging dataset for AI-generated image detection.
- [ModelScope]
MintVid - A comprehensive video dataset for AIGC detection research.
- [ModelScope]

✉️ Contact Us

For questions or collaborations, please contact:

⬆ Back to Top

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
icon.png		icon.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome AIGC Image/Video Detection

Contents

🔥 Hot Events

Benchmarks & Datasets

Research Papers

MLLM-Based

Classification-Based

Competitions

Practical Detection Tools

🏢 About Our Team

Why We Do It

🤝 Collaborators

📝 Academic Publications

🏆 Competition Achievements

🛠️ Open-Source Resources

✉️ Contact Us

Star History

About

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Awesome AIGC Image/Video Detection

Contents

🔥 Hot Events

Benchmarks & Datasets

Research Papers

MLLM-Based

Classification-Based

Competitions

Practical Detection Tools

🏢 About Our Team

Why We Do It

🤝 Collaborators

📝 Academic Publications

🏆 Competition Achievements

🛠️ Open-Source Resources

✉️ Contact Us

Star History

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!