English
Related papers

Related papers: Effect of Tuned Parameters on a LSA MCQ Answering …

200 papers

Large Language Models (LLMs) have become essential in many Natural Language Processing (NLP) tasks, leveraging extensive pre-training and fine-tuning to achieve high accuracy. However, like humans, LLMs exhibit biases, particularly…

Computation and Language · Computer Science 2025-10-23 Bianca Raimondi , Maurizio Gabbrielli

The fast-growing amount of information on the Internet makes the research in automatic document summarization very urgent. It is an effective solution for information overload. Many approaches have been proposed based on different…

Computation and Language · Computer Science 2018-08-01 Kamal Al-Sabahi , Zuping Zhang , Jun Long , Khaled Alwesabi

In education, the capability of generating human-like text of Large Language Models (LLMs) inspired work on how they can increase the efficiency of learning and teaching. We study the affordability of these models for educators and students…

Computation and Language · Computer Science 2025-03-06 Bianca Raimondi , Saverio Giallorenzo , Maurizio Gabbrielli

Large language models (LLMs) acquire substantial world knowledge during pre-training, which is further shaped by post-training techniques such as supervised fine-tuning (SFT). However, the impact of SFT on a model's knowledge remains…

Computation and Language · Computer Science 2026-02-11 Junjie Ye , Yuming Yang , Yang Nan , Shuo Li , Qi Zhang , Tao Gui , Xuanjing Huang , Peng Wang , Zhongchao Shi , Jianping Fan

Both latent semantic analysis (LSA) and correspondence analysis (CA) are dimensionality reduction techniques that use singular value decomposition (SVD) for information retrieval. Theoretically, the results of LSA display both the…

Information Retrieval · Computer Science 2023-09-15 Qianqian Qi , David J. Hessen , Peter G. M. van der Heijden

As large language models (LLMs) are increasingly deployed in multi-turn dialogue and other sustained interactive scenarios, it is essential to understand how extended context affects their performance. Popular benchmarks, focusing primarily…

Computation and Language · Computer Science 2025-06-03 Robert Hankache , Kingsley Nketia Acheampong , Liang Song , Marek Brynda , Raad Khraishi , Greig A. Cowan

This paper investigates and validates the impact of fine-tuning on large language model performance, focusing on parameter-efficient methods (LoRA and QLoRA). We evaluate model capabilities across three key domains: (1) commonsense…

Computation and Language · Computer Science 2025-06-13 Qingda , Mai

Multiple-choice question answering (MCQA) is a key competence of performant transformer language models that is tested by mainstream benchmarks. However, recent evidence shows that models can have quite a range of performance, particularly…

Computation and Language · Computer Science 2025-03-11 Sarah Wiegreffe , Oyvind Tafjord , Yonatan Belinkov , Hannaneh Hajishirzi , Ashish Sabharwal

The integration of speech into Large Language Models (LLMs) has substantially expanded their capabilities, but often at the cost of weakening their core textual competence. This degradation limits the ability of speech-enabled LLMs to fully…

Computation and Language · Computer Science 2025-09-30 Chao Wang , Rui-Chen Zheng , Yang Ai , Zhen-Hua Ling

The current research has two aims. First, to demonstrate the utility conditional nonparametric multilevel latent class analysis (NP-MLCA) for multi-site program evaluation using an empirical dataset. Second, to investigate how…

This study examines how user-provided suggestions affect Large Language Models (LLMs) in a simulated educational context, where sycophancy poses significant risks. Testing five different LLMs from the OpenAI GPT-4o and GPT-4.1 model classes…

Computation and Language · Computer Science 2025-06-13 Chuck Arvin

Low-rank approximations, of the weight and feature space can enhance the performance of deep learning models, whether in terms of improving generalization or reducing the latency of inference. However, there is no clear consensus yet on…

Computation and Language · Computer Science 2024-05-24 Arnav Chavan , Nahush Lele , Deepak Gupta

Semantic parsing is a key NLP task that maps natural language to structured meaning representations. As in many other NLP tasks, SOTA performance in semantic parsing is now attained by fine-tuning a large pretrained language model (PLM).…

Computation and Language · Computer Science 2022-03-08 Weiqi Sun , Haidar Khan , Nicolas Guenon des Mesnards , Melanie Rubino , Konstantine Arkoudas

This paper explores the impact of extending input lengths on the capabilities of Large Language Models (LLMs). Despite LLMs advancements in recent times, their performance consistency across different input lengths is not well understood.…

Computation and Language · Computer Science 2024-07-11 Mosh Levy , Alon Jacoby , Yoav Goldberg

Parameter-efficient tuning aims at updating only a small subset of parameters when adapting a pretrained model to downstream tasks. In this work, we introduce PASTA, in which we only modify the special token representations (e.g., [SEP] and…

Computation and Language · Computer Science 2023-02-15 Xiaocong Yang , James Y. Huang , Wenxuan Zhou , Muhao Chen

The rapid advancement of Large Language Models (LLMs) has established standardized evaluation benchmarks as the primary instrument for model comparison. Yet, their reliability is increasingly questioned due to sensitivity to shallow…

Computation and Language · Computer Science 2026-02-20 Bogdan Kostić , Conor Fallon , Julian Risch , Alexander Löser

Automated short answer scoring (ASAS) is shifting from discriminative, fine-tuned models to large language models (LLMs) used in few-shot settings. This paradigm leverages LLMs broad world knowledge and ease of deployment, but limited…

Computation and Language · Computer Science 2026-05-26 Abigail Victoria Gurin Schleifer , Moriah Ariely , Beata Beigman Klebanov , Asaf Salman , Giora Alexandron

Recent advances in large language model (LLM) pruning have shown state-of-the-art (SotA) compression results in post-training and retraining-free settings while maintaining high predictive performance. However, previous research mainly…

Computation and Language · Computer Science 2025-11-18 Simon Kurz , Jian-Jia Chen , Lucie Flek , Zhixue Zhao

Proprietary Large Language Models (LLMs), such as ChatGPT, have garnered significant attention due to their exceptional capabilities in handling a diverse range of tasks. Recent studies demonstrate that open-sourced smaller foundational…

Computation and Language · Computer Science 2023-10-10 Yue Zhang , Leyang Cui , Deng Cai , Xinting Huang , Tao Fang , Wei Bi

Second language acquisition (SLA) modeling is to predict whether second language learners could correctly answer the questions according to what they have learned. It is a fundamental building block of the personalized learning system and…

Computation and Language · Computer Science 2020-09-01 Yong Hu , Heyan Huang , Tian Lan , Xiaochi Wei , Yuxiang Nie , Jiarui Qi , Liner Yang , Xian-Ling Mao
‹ Prev 1 2 3 10 Next ›