Related papers: Machine Learning Model Attribution Challenge

Artificial Interrogation for Attributing Language Models

This paper presents solutions to the Machine Learning Model Attribution challenge (MLMAC) collectively organized by MITRE, Microsoft, Schmidt-Futures, Robust-Intelligence, Lincoln-Network, and Huggingface community. The challenge provides…

Computation and Language · Computer Science · 2022-11-22 · Farhan Dhanani, Muhammad Rafi

Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large Language Models

The wide applicability and adaptability of generative large language models (LLMs) has enabled their rapid adoption. While the pre-trained models can perform many tasks, such models are often fine-tuned to improve their performance on…

Computation and Language · Computer Science · 2023-06-16 · Myles Foley, Ambrish Rawat, Taesung Lee, Yufang Hou, Gabriele Picco, Giulio Zizzo

Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models

Large language models (LLMs) have shown impressive results while requiring little or no direct supervision. Further, there is mounting evidence that LLMs may have potential in information-seeking scenarios. We believe the ability of an LLM…

Computation and Language · Computer Science · 2023-02-14 · Bernd Bohnet, Vinh Q. Tran, Pat Verga, Roee Aharoni, Daniel Andor, Livio Baldini Soares, Massimiliano Ciaramita, Jacob Eisenstein, Kuzman Ganchev, Jonathan Herzig, Kai Hui, Tom Kwiatkowski, Ji Ma, Jianmo Ni, Lierni Sestorain Saralegui, Tal Schuster, William W. Cohen, Michael Collins, Dipanjan Das, Donald Metzler, Slav Petrov, Kellie Webster

Attribution-Guided Continual Learning for Large Language Models

Large language models (LLMs) often suffer from catastrophic forgetting in continual learning: after learning new tasks sequentially, they perform worse on earlier tasks. Existing methods mitigate catastrophic forgetting by data replay,…

Machine Learning · Computer Science · 2026-05-08 · Yazheng Liu, Yuxuan Wan, Rui Xu, Xi Zhang, Sihong Xie, Hui Xiong

AttributionBench: How Hard is Automatic Attribution Evaluation?

Modern generative search engines enhance the reliability of large language model (LLM) responses by providing cited evidence. However, evaluating the answer's attribution, i.e., whether every claim within the generated responses is fully…

Computation and Language · Computer Science · 2024-02-26 · Yifei Li, Xiang Yue, Zeyi Liao, Huan Sun

Automatic Evaluation of Attribution by Large Language Models

A recent focus of large language model (LLM) development, as exemplified by generative search engines, is to incorporate external references to generate and support its claims. However, evaluating the attribution, i.e., verifying whether…

Computation and Language · Computer Science · 2023-10-10 · Xiang Yue, Boshi Wang, Ziru Chen, Kai Zhang, Yu Su, Huan Sun

Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering

With the enhancement in the field of generative artificial intelligence (AI), contextual question answering has become extremely relevant. Attributing model generations to the input source document is essential to ensure trustworthiness and…

Computation and Language · Computer Science · 2024-05-29 · Anirudh Phukan, Shwetha Somasundaram, Apoorv Saxena, Koustava Goswami, Balaji Vasan Srinivasan

Large Language Models as Attribution Regularizers for Efficient Model Training

Large Language Models (LLMs) have demonstrated remarkable performance across diverse domains. However, effectively leveraging their vast knowledge for training smaller downstream models remains an open challenge, especially in domains like…

Machine Learning · Computer Science · 2025-07-28 · Davor Vukadin, Marin Šilić, Goran Delač

Attribute or Abstain: Large Language Models as Long Document Assistants

LLMs can help humans working with long documents, but are known to hallucinate. Attribution can increase trust in LLM responses: The LLM provides evidence that supports its response, which enhances verifiability. Existing approaches to…

Computation and Language · Computer Science · 2024-10-24 · Jan Buchmann, Xiao Liu, Iryna Gurevych

ViTaB-A: Evaluating Multimodal Large Language Models on Visual Table Attribution

Multimodal Large Language Models (mLLMs) are often used to answer questions in structured data such as tables in Markdown, JSON, and images. While these models can often give correct answers, users also need to know where those answers come…

Computation and Language · Computer Science · 2026-02-18 · Yahia Alqurnawi, Preetom Biswas, Anmol Rao, Tejas Anvekar, Chitta Baral, Vivek Gupta

Document Attribution: Examining Citation Relationships using Large Language Models

As Large Language Models (LLMs) are increasingly applied to document-based tasks - such as document summarization, question answering, and information extraction - where user requirements focus on retrieving information from provided…

Information Retrieval · Computer Science · 2025-05-13 · Vipula Rawte, Ryan A. Rossi, Franck Dernoncourt, Nedim Lipka

A Human-Centric Framework for Data Attribution in Large Language Models

In the current Large Language Model (LLM) ecosystem, creators have little agency over how their data is used, and LLM users may find themselves unknowingly plagiarizing existing sources. Attribution of LLM-generated text to LLM input data…

Computers and Society · Computer Science · 2026-05-11 · Amelie Wührl, Mattes Ruckdeschel, Kyle Lo, Anna Rogers

Enhancing Answer Attribution for Faithful Text Generation with Large Language Models

The increasing popularity of Large Language Models (LLMs) in recent years has changed the way users interact with and pose questions to AI-based conversational systems. An essential aspect for increasing the trustworthiness of generated LLM…

Computation and Language · Computer Science · 2024-10-23 · Juraj Vladika, Luca Mülln, Florian Matthes

Fine-tuning Large Language Model for Automated Algorithm Design

The integration of large language models (LLMs) into automated algorithm design has shown promising potential. A prevalent approach embeds LLMs within search routines to iteratively generate and refine candidate algorithms. However, most…

Machine Learning · Computer Science · 2026-05-20 · Fei Liu, Rui Zhang, Xi Lin, Zhichao Lu, Qingfu Zhang

CHALLENGER: Training with Attribution Maps

We show that utilizing attribution maps for training neural networks can improve regularization of models and thus increase performance. Regularization is key in deep learning, especially when training complex models on relatively small…

Machine Learning · Computer Science · 2022-05-31 · Christian Tomani, Daniel Cremers

An Explorative Study on Distributed Computing Techniques in Training and Inference of Large Language Models

Large language models (LLM) are advanced AI systems trained on extensive textual data, leveraging deep learning techniques to understand and generate human-like language. Today's LLMs with billions of parameters are so huge that hardly any…

Distributed, Parallel, and Cluster Computing · Computer Science · 2025-10-14 · Sheikh Azizul Hakim, Saem Hasan

Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges

Accurate attribution of authorship is crucial for maintaining the integrity of digital content, improving forensic investigations, and mitigating the risks of misinformation and plagiarism. Addressing the imperative need for proper…

Computers and Society · Computer Science · 2026-05-27 · Baixiang Huang, Canyu Chen, Kai Shu

Evaluating Fine-Tuning Efficiency of Human-Inspired Learning Strategies in Medical Question Answering

Fine-tuning Large Language Models (LLMs) incurs considerable training costs, driving the need for data-efficient training with optimised data ordering. Human-inspired strategies offer a solution by organising data based on human learning…

Computation and Language · Computer Science · 2024-11-06 · Yushi Yang, Andrew M. Bean, Robert McCraith, Adam Mahdi

How Fine-Tuning Allows for Effective Meta-Learning

Representation learning has been widely studied in the context of meta-learning, enabling rapid learning of new tasks through shared representations. Recent works such as MAML have explored using fine-tuning-based metrics, which measure the…

Machine Learning · Computer Science · 2021-05-06 · Kurtland Chua, Qi Lei, Jason D. Lee

LLM attribution analysis across different fine-tuning strategies and model scales for automated code compliance

Existing research on large language models (LLMs) for automated code compliance has primarily focused on performance, treating the models as black boxes and overlooking how training decisions affect their interpretive behavior. This paper…

Computation and Language · Computer Science · 2026-04-20 · Jack Wei Lun Shi, Minghao Dang, Wawan Solihin, Justin K. W. Yeoh