VRAG-DFD: Verifiable Retrieval-Augmentation for MLLM-based Deepfake Detection

VRAG-DFD is a framework that introduces Verifiable Retrieval-Augmented Generation (RAG) into the Deepfake Detection (DFD) domain. By combining professional forensic knowledge retrieval with Reinforcement Learning (GRPO), we empower Multi-modal Large Language Models (MLLMs) to perform expert-level forensic analysis with critical reasoning.

Overview

Phase 1: deepfake_RAG

This phase involves building the Forensic Knowledge Database (FKD) and generating retrieval results for test sets.

Installation

git clone https://github.com/abigcatcat/VRAG-DFD.git
cd VRAG-DFD/deepfake_RAG
pip install -r requirements.txt

Build the Forensic Database

We use the annotated FF++ dataset to create the vector database.

Configure your paths and parameters in deepfake_RAG/config.py:

DATASET_CONFIG = {
    'json_path': "/path/to/yourjson",  # The corresponding JSON file for the dataset used to build the database
    'image_root': '/path/to/your/images',  # Root directory for images
    'max_images_per_video': 32,  # Number of sampled frames per video
    'batch_size': 32,
}

MODEL_CONFIG = {
    'model_path': '/path/to/pth',  # Path to the pre-trained retrievel model
}

Run the build command:

python demo.py --mode build

Generate Offline Retrieval Results

Retrieve forgery evidence for public datasets (e.g., Celeb-DF v1/v2) to generate JSON files for the MLLM.

Update parameters in deepfake_RAG/config.py:

DATASET_CONFIG = {
    'json_path': "/path/to/test_dataset.json",  # The JSON file corresponding to the test set.
    'image_root': '/path/to/test/images',  # Directory for test images
}

DATABASE_CONFIG = {
    'save_dir': './deepfake_rag_database',  # Path to the constructed database
}

Run the build command:

python demo.py --mode test --output_file <output_path/filename>.json

Phase 2: MLLM Training & Evaluation

The training follows a three-stage progressive strategy to cultivate critical reasoning.

Installation

cd VRAG-DFD
pip install -r requirements.txt

Training Pipeline The training data is located in datasets_json/, and execution scripts are in scripts_run/.

Stage	Training Data	Execution Script	Description
Stage 1	`stage1_train.json`	`bash scripts_run/stage1.sh`	Visual Alignment
Stage 2	`rag_finetune_data.json`	`bash scripts_run/stage2.sh`	Forensic SFT
Stage 3	`rag_grpo_data.json`	`bash scripts_run/stage3.sh`	Critical RL (GRPO)

Inference & Evaluation

Step A: Format Processing Convert Phase 1 retrieval results into the training-compatible format:

python utils/process.py --input <path_to_retrieval_json> --output <formatted_json>

Step B: Batch Inference Run the batch evaluation script:

bash scripts_run/eval_batch.sh

Step C: Metrics Calculation Calculate AUC and Acc:

python utils/get_metrics.py --results <prediction_file>.json

Citation

If you use our dataset, code or find VRAG-DFD useful, please cite our paper in your work as:

@article{han2026vragdfd,
  title={VRAG-DFD: Verifiable Retrieval-Augmentation for MLLM-based Deepfake Detection},
  author={Hui Han and Shunli Wang and Yandan Zhao and Taiping Yao and Shouhong Ding},
  journal={arXiv preprint arXiv:2604.13660},
  year={2026},
  url={https://arxiv.org/abs/2604.13660}
}

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
annotations		annotations
asset		asset
deepfake_RAG		deepfake_RAG
ms-swift		ms-swift
scripts_run		scripts_run
utils		utils
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VRAG-DFD: Verifiable Retrieval-Augmentation for MLLM-based Deepfake Detection

Overview

Contents

Phase 1: deepfake_RAG

Installation

Build the Forensic Database

Generate Offline Retrieval Results

Phase 2: MLLM Training & Evaluation

Installation

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

VRAG-DFD: Verifiable Retrieval-Augmentation for MLLM-based Deepfake Detection

Overview

Contents

Phase 1: deepfake_RAG

Installation

Build the Forensic Database

Generate Offline Retrieval Results

Phase 2: MLLM Training & Evaluation

Installation

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages