Related papers: EEE-QA: Exploring Effective and Efficient Question…

EfficientQA : a RoBERTa Based Phrase-Indexed Question-Answering System

State-of-the-art extractive question-answering models achieve superhuman performances on the SQuAD benchmark. Yet, they are unreasonably heavy and need expensive GPU computing to answer questions in a reasonable time. Thus, they cannot be…

Computation and Language · Computer Science 2025-03-11 Sofian Chaybouti , Achraf Saghe , Aymen Shabou

A Comparative Study of Transformer-Based Language Models on Extractive Question Answering

Question Answering (QA) is a task in natural language processing that has seen considerable growth after the advent of transformers. There has been a surge in QA datasets that have been proposed to challenge natural language processing…

Computation and Language · Computer Science 2021-10-08 Kate Pearce , Tiffany Zhan , Aneesh Komanduri , Justin Zhan

Improved and Efficient Conversational Slot Labeling through Question Answering

Transformer-based pretrained language models (PLMs) offer unmatched performance across the majority of natural language understanding (NLU) tasks, including a body of question answering (QA) tasks. We hypothesize that improvements in QA…

Computation and Language · Computer Science 2022-04-06 Gabor Fuisz , Ivan Vulić , Samuel Gibbons , Inigo Casanueva , Paweł Budzianowski

PEA: Enhancing LLM Performance on Computational-Reasoning Tasks

Large Language Models (LLMs) have exhibited remarkable capabilities across diverse domains, prompting investigations into their potential as generic reasoning engines. While recent studies have explored inference-time computation to enhance…

Artificial Intelligence · Computer Science 2025-02-18 Zi Wang , Shiwei Weng , Mohannad Alhanahnah , Somesh Jha , Tom Reps

QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism

While large language models (LLMs) have made notable advancements in natural language processing, they continue to struggle with processing extensive text. Memory mechanism offers a flexible solution for managing long contexts, utilizing…

Computation and Language · Computer Science 2024-09-27 Bo Wang , Heyan Huang , Yixin Cao , Jiahao Ying , Wei Tang , Chong Feng

Memory Centric Power Allocation for Multi-Agent Embodied Question Answering

This paper considers multi-agent embodied question answering (MA-EQA), which aims to query robot teams on what they have seen over a long horizon. In contrast to existing edge resource management methods that emphasize sensing,…

Robotics · Computer Science 2026-04-21 Chengyang Li , Shuai Wang , Kejiang Ye , Weijie Yuan , Boyu Zhou , Yik-Chung Wu , Chengzhong Xu , Huseyin Arslan

Maximizing Efficiency of Language Model Pre-training for Learning Representation

Pre-trained language models in the past years have shown exponential growth in model parameters and compute time. ELECTRA is a novel approach for improving the compute efficiency of pre-trained language models (e.g. BERT) based on masked…

Computation and Language · Computer Science 2021-10-14 Junmo Kang , Suwon Shin , Jeonghwan Kim , Jaeyoung Jo , Sung-Hyon Myaeng

Crafting Interpretable Embeddings by Asking LLMs Questions

Large language models (LLMs) have rapidly improved text embeddings for a growing array of natural-language processing tasks. However, their opaqueness and proliferation into scientific domains such as neuroscience have created a growing…

Computation and Language · Computer Science 2024-05-28 Vinamra Benara , Chandan Singh , John X. Morris , Richard Antonello , Ion Stoica , Alexander G. Huth , Jianfeng Gao

Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering

Retrieval augmented language models have recently become the standard for knowledge intensive tasks. Rather than relying purely on latent semantics within the parameters of large neural models, these methods enlist a semi-parametric memory…

Computation and Language · Computer Science 2023-01-24 Wenhu Chen , Pat Verga , Michiel de Jong , John Wieting , William Cohen

Beyond Prompting: An Efficient Embedding Framework for Open-Domain Question Answering

Large language models have recently pushed open domain question answering (ODQA) to new frontiers. However, prevailing retriever-reader pipelines often depend on multiple rounds of prompt level instructions, leading to high computational…

Computation and Language · Computer Science 2025-09-23 Zhanghao Hu , Hanqi Yan , Qinglin Zhu , Zhenyi Shen , Yulan He , Lin Gui

Towards leveraging LLMs for Conditional QA

This study delves into the capabilities and limitations of Large Language Models (LLMs) in the challenging domain of conditional question-answering. Utilizing the Conditional Question Answering (CQA) dataset and focusing on generative…

Computation and Language · Computer Science 2023-12-05 Syed-Amad Hussain , Parag Pravin Dakle , SaiKrishna Rallabandi , Preethi Raghavan

A Comparative Study of Pre-trained Encoders for Low-Resource Named Entity Recognition

Pre-trained language models (PLM) are effective components of few-shot named entity recognition (NER) approaches when augmented with continued pre-training on task-specific out-of-domain data or fine-tuning on in-domain data. However, their…

Computation and Language · Computer Science 2022-04-12 Yuxuan Chen , Jonas Mikkelsen , Arne Binder , Christoph Alt , Leonhard Hennig

SPOT: Knowledge-Enhanced Language Representations for Information Extraction

Knowledge-enhanced pre-trained models for language representation have been shown to be more effective in knowledge base construction tasks (i.e.,~relation extraction) than language models such as BERT. These knowledge-enhanced language…

Computation and Language · Computer Science 2022-10-25 Jiacheng Li , Yannis Katsis , Tyler Baldwin , Ho-Cheol Kim , Andrew Bartko , Julian McAuley , Chun-Nan Hsu

Self-Convinced Prompting: Few-Shot Question Answering with Repeated Introspection

While large language models (LLMs) such as ChatGPT and PaLM have demonstrated remarkable performance in various language understanding and generation tasks, their capabilities in complex reasoning and intricate knowledge utilization still…

Computation and Language · Computer Science 2023-10-11 Haodi Zhang , Min Cai , Xinhe Zhang , Chen Jason Zhang , Rui Mao , Kaishun Wu

Exploring Question Understanding and Adaptation in Neural-Network-Based Question Answering

The last several years have seen intensive interest in exploring neural-network-based models for machine comprehension (MC) and question answering (QA). In this paper, we approach the problems by closely modelling questions in a neural…

Computation and Language · Computer Science 2017-03-28 Junbei Zhang , Xiaodan Zhu , Qian Chen , Lirong Dai , Si Wei , Hui Jiang

Texts or Images? A Fine-grained Analysis on the Effectiveness of Input Representations and Models for Table Question Answering

In table question answering (TQA), tables are encoded as either texts or images. Prior work suggests that passing images of tables to multi-modal large language models (MLLMs) performs comparably to or even better than using textual input…

Computation and Language · Computer Science 2025-05-21 Wei Zhou , Mohsen Mesgar , Heike Adel , Annemarie Friedrich

Question Answering Infused Pre-training of General-Purpose Contextualized Representations

We propose a pre-training objective based on question answering (QA) for learning general-purpose contextual representations, motivated by the intuition that the representation of a phrase in a passage should encode all questions that the…

Computation and Language · Computer Science 2022-03-17 Robin Jia , Mike Lewis , Luke Zettlemoyer

LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs

The rapid development of large language models (LLM) has greatly enhanced everyday applications. While many FPGA-based accelerators, with flexibility for fine-grained data control, exhibit superior speed and energy efficiency compared to…

Hardware Architecture · Computer Science 2026-03-24 Zifan He , Shengyu Ye , Rui Ma , Yang Wang , Jason Cong

Collaborative Large Language Model Inference via Resource-Aware Parallel Speculative Decoding

The growing demand for on-device large language model (LLM) inference highlights the need for efficient mobile edge computing (MEC) solutions, especially in resource-constrained settings. Speculative decoding offers a promising solution by…

Machine Learning · Computer Science 2025-12-01 Jungyeon Koh , Hyun Jong Yang

Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute

Retrieval-augmented language models such as Fusion-in-Decoder are powerful, setting the state of the art on a variety of knowledge-intensive tasks. However, they are also expensive, due to the need to encode a large number of retrieved…

Computation and Language · Computer Science 2023-06-06 Michiel de Jong , Yury Zemlyanskiy , Nicholas FitzGerald , Joshua Ainslie , Sumit Sanghai , Fei Sha , William Cohen