Related papers: Towards Interactive Deepfake Analysis

Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics

DeepFakes, which refer to AI-generated media content, have become an increasing concern due to their use as a means for disinformation. Detecting DeepFakes is currently solved with programmed machine learning algorithms. In this work, we…

Artificial Intelligence · Computer Science 2024-06-12 Shan Jia , Reilin Lyu , Kangran Zhao , Yize Chen , Zhiyuan Yan , Yan Ju , Chuanbo Hu , Xin Li , Baoyuan Wu , Siwei Lyu

Beyond Static Datasets: A Deep Interaction Approach to LLM Evaluation

Large Language Models (LLMs) have made progress in various real-world tasks, which stimulates requirements for the evaluation of LLMs. Existing LLM evaluation methods are mainly supervised signal-based which depends on static datasets and…

Computation and Language · Computer Science 2023-09-11 Jiatong Li , Rui Li , Qi Liu

COCO is "ALL'' You Need for Visual Instruction Fine-tuning

Multi-modal Large Language Models (MLLMs) are increasingly prominent in the field of artificial intelligence. Visual instruction fine-tuning (IFT) is a vital process for aligning MLLMs' output with user's intentions. High-quality and…

Computer Vision and Pattern Recognition · Computer Science 2024-01-18 Xiaotian Han , Yiqi Wang , Bohan Zhai , Quanzeng You , Hongxia Yang

Explainable Deepfake Detection with RL Enhanced Self-Blended Images

Most prior deepfake detection methods lack explainable outputs. With the growing interest in multimodal large language models (MLLMs), researchers have started exploring their use in interpretable deepfake detection. However, a major…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Ning Jiang , Dingheng Zeng , Yanhong Liu , Haiyang Yi , Shijie Yu , Minghe Weng , Haifeng Shen , Ying Li

Enhancing Non-English Capabilities of English-Centric Large Language Models through Deep Supervision Fine-Tuning

Large language models (LLMs) have demonstrated significant progress in multilingual language understanding and generation. However, due to the imbalance in training data, their capabilities in non-English languages are limited. Recent…

Computation and Language · Computer Science 2025-03-06 Wenshuai Huo , Xiaocheng Feng , Yichong Huang , Chengpeng Fu , Baohang Li , Yangfan Ye , Zhirui Zhang , Dandan Tu , Duyu Tang , Yunfei Lu , Hui Wang , Bing Qin

Investigating the Viability of Employing Multi-modal Large Language Models in the Context of Audio Deepfake Detection

While Vision-Language Models (VLMs) and Multimodal Large Language Models (MLLMs) have shown strong generalisation in detecting image and video deepfakes, their use for audio deepfake detection remains largely unexplored. In this work, we…

Sound · Computer Science 2026-01-05 Akanksha Chuchra , Shukesh Reddy , Sudeepta Mishra , Abhijit Das , Abhinav Dhall

HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model

Instruction tuning is widely used to improve a pre-trained Multimodal Large Language Model (MLLM) by training it on curated task-specific datasets, enabling better comprehension of human instructions. However, it is infeasible to collect…

Computation and Language · Computer Science 2025-05-30 Haiyang Guo , Fanhu Zeng , Ziwei Xiang , Fei Zhu , Da-Han Wang , Xu-Yao Zhang , Cheng-Lin Liu

Assessment Framework for Deepfake Detection in Real-world Situations

Detecting digital face manipulation in images and video has attracted extensive attention due to the potential risk to public trust. To counteract the malicious usage of such techniques, deep learning-based deepfake detection methods have…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Yuhang Lu , Touradj Ebrahimi

Fake-in-Facext: Towards Fine-Grained Explainable DeepFake Analysis

The advancement of Multimodal Large Language Models (MLLMs) has bridged the gap between vision and language tasks, enabling the implementation of Explainable DeepFake Analysis (XDFA). However, current methods suffer from a lack of…

Computer Vision and Pattern Recognition · Computer Science 2025-10-24 Lixiong Qin , Yang Zhang , Mei Wang , Jiani Hu , Weihong Deng , Weiran Xu

FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking

Automatic fact-checking plays a crucial role in combating the spread of misinformation. Large Language Models (LLMs) and Instruction-Following variants, such as InstructGPT and Alpaca, have shown remarkable performance in various natural…

Computation and Language · Computer Science 2023-09-04 Tsun-Hin Cheung , Kin-Man Lam

Interactive Evaluation of Large Language Models for Multi-Requirement Software Engineering Tasks

Standard single-turn, static benchmarks fall short in evaluating the nuanced capabilities of Large Language Models (LLMs) on complex tasks such as software engineering. In this work, we propose a novel interactive evaluation framework that…

Artificial Intelligence · Computer Science 2025-08-27 Dimitrios Rontogiannis , Maxime Peyrard , Nicolas Baldwin , Martin Josifoski , Robert West , Dimitrios Gunopulos

Multi-modal Preference Alignment Remedies Degradation of Visual Instruction Tuning on Language Models

Multi-modal large language models (MLLMs) are expected to support multi-turn queries of interchanging image and text modalities in production. However, the current MLLMs trained with visual-question-answering (VQA) datasets could suffer…

Computation and Language · Computer Science 2024-11-06 Shengzhi Li , Rongyu Lin , Shichao Pei

SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms

The rapid advancement of deep generative models has significantly improved the realism of synthetic media, presenting both opportunities and security challenges. While deepfake technology has valuable applications in entertainment and…

Machine Learning · Computer Science 2025-06-09 Arnesh Batra , Anushk Kumar , Jashn Khemani , Arush Gumber , Arhan Jain , Somil Gupta

Mining the Explainability and Generalization: Fact Verification Based on Self-Instruction

Fact-checking based on commercial LLMs has become mainstream. Although these methods offer high explainability, it falls short in accuracy compared to traditional fine-tuning approaches, and data security is also a significant concern. In…

Computation and Language · Computer Science 2024-05-24 Guangyao Lu , Yulin Liu

Adaptive Meta-Learning for Robust Deepfake Detection: A Multi-Agent Framework to Data Drift and Model Generalization

Pioneering advancements in artificial intelligence, especially in genAI, have enabled significant possibilities for content creation, but also led to widespread misinformation and false content. The growing sophistication and realism of…

Artificial Intelligence · Computer Science 2024-11-14 Dinesh Srivasthav P , Badri Narayan Subudhi

MLLM-DataEngine: An Iterative Refinement Approach for MLLM

Despite the great advance of Multimodal Large Language Models (MLLMs) in both instruction dataset building and benchmarking, the independence of training and evaluation makes current MLLMs hard to further improve their capability under the…

Machine Learning · Computer Science 2023-09-12 Zhiyuan Zhao , Linke Ouyang , Bin Wang , Siyuan Huang , Pan Zhang , Xiaoyi Dong , Jiaqi Wang , Conghui He

Supervised Fine-Tuning as Inverse Reinforcement Learning

The prevailing approach to aligning Large Language Models (LLMs) typically relies on human or AI feedback and assumes access to specific types of preference datasets. In our work, we question the efficacy of such datasets and explore…

Machine Learning · Computer Science 2024-03-19 Hao Sun

FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant

The rapid advancement of deepfake technologies has sparked widespread public concern, particularly as face forgery poses a serious threat to public information security. However, the unknown and diverse forgery techniques, varied facial…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Zhengchao Huang , Bin Xia , Zicheng Lin , Zhun Mou , Wenming Yang , Jiaya Jia

Leveraging Deep Learning Approaches for Deepfake Detection: A Review

Conspicuous progression in the field of machine learning and deep learning have led the jump of highly realistic fake media, these media oftentimes referred as deepfakes. Deepfakes are fabricated media which are generated by sophisticated…

Machine Learning · Computer Science 2023-04-05 Aniruddha Tiwari , Rushit Dave , Mounika Vanamala

EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language Models

In recent years, instruction tuning has gained increasing attention and emerged as a crucial technique to enhance the capabilities of Large Language Models (LLMs). To construct high-quality instruction datasets, many instruction processing…

Computation and Language · Computer Science 2024-06-25 Yixin Ou , Ningyu Zhang , Honghao Gui , Ziwen Xu , Shuofei Qiao , Yida Xue , Runnan Fang , Kangwei Liu , Lei Li , Zhen Bi , Guozhou Zheng , Huajun Chen