English
Related papers

Related papers: Machine Learning Model Attribution Challenge

200 papers

This paper presents solutions to the Machine Learning Model Attribution challenge (MLMAC) collectively organized by MITRE, Microsoft, Schmidt-Futures, Robust-Intelligence, Lincoln-Network, and Huggingface community. The challenge provides…

Computation and Language · Computer Science 2022-11-22 Farhan Dhanani , Muhammad Rafi

The wide applicability and adaptability of generative large language models (LLMs) has enabled their rapid adoption. While the pre-trained models can perform many tasks, such models are often fine-tuned to improve their performance on…

Computation and Language · Computer Science 2023-06-16 Myles Foley , Ambrish Rawat , Taesung Lee , Yufang Hou , Gabriele Picco , Giulio Zizzo

Large language models (LLMs) have shown impressive results while requiring little or no direct supervision. Further, there is mounting evidence that LLMs may have potential in information-seeking scenarios. We believe the ability of an LLM…

Large language models (LLMs) often suffer from catastrophic forgetting in continual learning: after learning new tasks sequentially, they perform worse on earlier tasks. Existing methods mitigate catastrophic forgetting by data replay,…

Machine Learning · Computer Science 2026-05-08 Yazheng Liu , Yuxuan Wan , Rui Xu , Xi Zhang , Sihong Xie , Hui Xiong

Modern generative search engines enhance the reliability of large language model (LLM) responses by providing cited evidence. However, evaluating the answer's attribution, i.e., whether every claim within the generated responses is fully…

Computation and Language · Computer Science 2024-02-26 Yifei Li , Xiang Yue , Zeyi Liao , Huan Sun

A recent focus of large language model (LLM) development, as exemplified by generative search engines, is to incorporate external references to generate and support its claims. However, evaluating the attribution, i.e., verifying whether…

Computation and Language · Computer Science 2023-10-10 Xiang Yue , Boshi Wang , Ziru Chen , Kai Zhang , Yu Su , Huan Sun

With the enhancement in the field of generative artificial intelligence (AI), contextual question answering has become extremely relevant. Attributing model generations to the input source document is essential to ensure trustworthiness and…

Computation and Language · Computer Science 2024-05-29 Anirudh Phukan , Shwetha Somasundaram , Apoorv Saxena , Koustava Goswami , Balaji Vasan Srinivasan

Large Language Models (LLMs) have demonstrated remarkable performance across diverse domains. However, effectively leveraging their vast knowledge for training smaller downstream models remains an open challenge, especially in domains like…

Machine Learning · Computer Science 2025-07-28 Davor Vukadin , Marin Šilić , Goran Delač

LLMs can help humans working with long documents, but are known to hallucinate. Attribution can increase trust in LLM responses: The LLM provides evidence that supports its response, which enhances verifiability. Existing approaches to…

Computation and Language · Computer Science 2024-10-24 Jan Buchmann , Xiao Liu , Iryna Gurevych

Multimodal Large Language Models (mLLMs) are often used to answer questions in structured data such as tables in Markdown, JSON, and images. While these models can often give correct answers, users also need to know where those answers come…

Computation and Language · Computer Science 2026-02-18 Yahia Alqurnawi , Preetom Biswas , Anmol Rao , Tejas Anvekar , Chitta Baral , Vivek Gupta

As Large Language Models (LLMs) are increasingly applied to document-based tasks - such as document summarization, question answering, and information extraction - where user requirements focus on retrieving information from provided…

Information Retrieval · Computer Science 2025-05-13 Vipula Rawte , Ryan A. Rossi , Franck Dernoncourt , Nedim Lipka

In the current Large Language Model (LLM) ecosystem, creators have little agency over how their data is used, and LLM users may find themselves unknowingly plagiarizing existing sources. Attribution of LLM-generated text to LLM input data…

Computers and Society · Computer Science 2026-05-11 Amelie Wührl , Mattes Ruckdeschel , Kyle Lo , Anna Rogers

The increasing popularity of Large Language Models (LLMs) in recent years has changed the way users interact with and pose questions to AI-based conversational systems. An essential aspect for increasing the trustworthiness of generated LLM…

Computation and Language · Computer Science 2024-10-23 Juraj Vladika , Luca Mülln , Florian Matthes

The integration of large language models (LLMs) into automated algorithm design has shown promising potential. A prevalent approach embeds LLMs within search routines to iteratively generate and refine candidate algorithms. However, most…

Machine Learning · Computer Science 2026-05-20 Fei Liu , Rui Zhang , Xi Lin , Zhichao Lu , Qingfu Zhang

We show that utilizing attribution maps for training neural networks can improve regularization of models and thus increase performance. Regularization is key in deep learning, especially when training complex models on relatively small…

Machine Learning · Computer Science 2022-05-31 Christian Tomani , Daniel Cremers

Large language models (LLM) are advanced AI systems trained on extensive textual data, leveraging deep learning techniques to understand and generate human-like language. Today's LLMs with billions of parameters are so huge that hardly any…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-14 Sheikh Azizul Hakim , Saem Hasan

Accurate attribution of authorship is crucial for maintaining the integrity of digital content, improving forensic investigations, and mitigating the risks of misinformation and plagiarism. Addressing the imperative need for proper…

Computers and Society · Computer Science 2026-05-27 Baixiang Huang , Canyu Chen , Kai Shu

Fine-tuning Large Language Models (LLMs) incurs considerable training costs, driving the need for data-efficient training with optimised data ordering. Human-inspired strategies offer a solution by organising data based on human learning…

Computation and Language · Computer Science 2024-11-06 Yushi Yang , Andrew M. Bean , Robert McCraith , Adam Mahdi

Representation learning has been widely studied in the context of meta-learning, enabling rapid learning of new tasks through shared representations. Recent works such as MAML have explored using fine-tuning-based metrics, which measure the…

Machine Learning · Computer Science 2021-05-06 Kurtland Chua , Qi Lei , Jason D. Lee

Existing research on large language models (LLMs) for automated code compliance has primarily focused on performance, treating the models as black boxes and overlooking how training decisions affect their interpretive behavior. This paper…

Computation and Language · Computer Science 2026-04-20 Jack Wei Lun Shi , Minghao Dang , Wawan Solihin , Justin K. W. Yeoh
‹ Prev 1 2 3 10 Next ›