Related papers: Effect of Tuned Parameters on a LSA MCQ Answering …

Exploiting Primacy Effect To Improve Large Language Models

Large Language Models (LLMs) have become essential in many Natural Language Processing (NLP) tasks, leveraging extensive pre-training and fine-tuning to achieve high accuracy. However, like humans, LLMs exhibit biases, particularly…

Computation and Language · Computer Science 2025-10-23 Bianca Raimondi , Maurizio Gabbrielli

An Enhanced Latent Semantic Analysis Approach for Arabic Document Summarization

The fast-growing amount of information on the Internet makes the research in automatic document summarization very urgent. It is an effective solution for information overload. Many approaches have been proposed based on different…

Computation and Language · Computer Science 2018-08-01 Kamal Al-Sabahi , Zuping Zhang , Jun Long , Khaled Alwesabi

Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs

In education, the capability of generating human-like text of Large Language Models (LLMs) inspired work on how they can increase the efficiency of learning and teaching. We study the affordability of these models for educators and students…

Computation and Language · Computer Science 2025-03-06 Bianca Raimondi , Saverio Giallorenzo , Maurizio Gabbrielli

Analyzing the Effects of Supervised Fine-Tuning on Model Knowledge from Token and Parameter Levels

Large language models (LLMs) acquire substantial world knowledge during pre-training, which is further shaped by post-training techniques such as supervised fine-tuning (SFT). However, the impact of SFT on a model's knowledge remains…

Computation and Language · Computer Science 2026-02-11 Junjie Ye , Yuming Yang , Yang Nan , Shuo Li , Qi Zhang , Tao Gui , Xuanjing Huang , Peng Wang , Zhongchao Shi , Jianping Fan

Improving information retrieval through correspondence analysis instead of latent semantic analysis

Both latent semantic analysis (LSA) and correspondence analysis (CA) are dimensionality reduction techniques that use singular value decomposition (SVD) for information retrieval. Theoretically, the results of LSA display both the…

Information Retrieval · Computer Science 2023-09-15 Qianqian Qi , David J. Hessen , Peter G. M. van der Heijden

Evaluating the Sensitivity of LLMs to Prior Context

As large language models (LLMs) are increasingly deployed in multi-turn dialogue and other sustained interactive scenarios, it is essential to understand how extended context affects their performance. Popular benchmarks, focusing primarily…

Computation and Language · Computer Science 2025-06-03 Robert Hankache , Kingsley Nketia Acheampong , Liang Song , Marek Brynda , Raad Khraishi , Greig A. Cowan

Slimming Down LLMs Without Losing Their Minds

This paper investigates and validates the impact of fine-tuning on large language model performance, focusing on parameter-efficient methods (LoRA and QLoRA). We evaluate model capabilities across three key domains: (1) commonsense…

Computation and Language · Computer Science 2025-06-13 Qingda , Mai

Answer, Assemble, Ace: Understanding How LMs Answer Multiple Choice Questions

Multiple-choice question answering (MCQA) is a key competence of performant transformer language models that is tested by mainstream benchmarks. However, recent evidence shows that models can have quite a range of performance, particularly…

Computation and Language · Computer Science 2025-03-11 Sarah Wiegreffe , Oyvind Tafjord , Yonatan Belinkov , Hannaneh Hajishirzi , Ashish Sabharwal

Understanding Textual Capability Degradation in Speech LLMs via Parameter Importance Analysis

The integration of speech into Large Language Models (LLMs) has substantially expanded their capabilities, but often at the cost of weakening their core textual competence. This degradation limits the ability of speech-enabled LLMs to fully…

Computation and Language · Computer Science 2025-09-30 Chao Wang , Rui-Chen Zheng , Yang Ai , Zhen-Hua Ling

Classification Accuracy and Parameter Estimation in Multilevel Contexts: A Study of Conditional Nonparametric Multilevel Latent Class Analysis

The current research has two aims. First, to demonstrate the utility conditional nonparametric multilevel latent class analysis (NP-MLCA) for multi-site program evaluation using an empirical dataset. Second, to investigate how…

Methodology · Statistics 2020-11-02 Chi Chang , Kimberly Kelly , M. Lee Van Horn , Richard T. Houang , Joseph Gardiner , Laurie Van Egeren , Heng-Chieh Wu

"Check My Work?": Measuring Sycophancy in a Simulated Educational Context

This study examines how user-provided suggestions affect Large Language Models (LLMs) in a simulated educational context, where sycophancy poses significant risks. Testing five different LLMs from the OpenAI GPT-4o and GPT-4.1 model classes…

Computation and Language · Computer Science 2025-06-13 Chuck Arvin

Surgical Feature-Space Decomposition of LLMs: Why, When and How?

Low-rank approximations, of the weight and feature space can enhance the performance of deep learning models, whether in terms of improving generalization or reducing the latency of inference. However, there is no clear consensus yet on…

Computation and Language · Computer Science 2024-05-24 Arnav Chavan , Nahush Lele , Deepak Gupta

Unfreeze with Care: Space-Efficient Fine-Tuning of Semantic Parsing Models

Semantic parsing is a key NLP task that maps natural language to structured meaning representations. As in many other NLP tasks, SOTA performance in semantic parsing is now attained by fine-tuning a large pretrained language model (PLM).…

Computation and Language · Computer Science 2022-03-08 Weiqi Sun , Haidar Khan , Nicolas Guenon des Mesnards , Melanie Rubino , Konstantine Arkoudas

Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

This paper explores the impact of extending input lengths on the capabilities of Large Language Models (LLMs). Despite LLMs advancements in recent times, their performance consistency across different input lengths is not well understood.…

Computation and Language · Computer Science 2024-07-11 Mosh Levy , Alon Jacoby , Yoav Goldberg

Parameter-Efficient Tuning with Special Token Adaptation

Parameter-efficient tuning aims at updating only a small subset of parameters when adapting a pretrained model to downstream tasks. In this work, we introduce PASTA, in which we only modify the special token representations (e.g., [SEP] and…

Computation and Language · Computer Science 2023-02-15 Xiaocong Yang , James Y. Huang , Wenxuan Zhou , Muhao Chen

Same Meaning, Different Scores: Lexical and Syntactic Sensitivity in LLM Evaluation

The rapid advancement of Large Language Models (LLMs) has established standardized evaluation benchmarks as the primary instrument for model comparison. Yet, their reliability is increasingly questioned due to sensitivity to shallow…

Computation and Language · Computer Science 2026-02-20 Bogdan Kostić , Conor Fallon , Julian Risch , Alexander Löser

Quality-Conditioned Agreement in Automated Short Answer Scoring: Mid-Range Degradation and the Impact of Task-Specific Adaptation

Automated short answer scoring (ASAS) is shifting from discriminative, fine-tuned models to large language models (LLMs) used in few-shot settings. This paradigm leverages LLMs broad world knowledge and ease of deployment, but limited…

Computation and Language · Computer Science 2026-05-26 Abigail Victoria Gurin Schleifer , Moriah Ariely , Beata Beigman Klebanov , Asaf Salman , Giora Alexandron

On the Limitations of Language Targeted Pruning: Investigating the Calibration Language Impact in Multilingual LLM Pruning

Recent advances in large language model (LLM) pruning have shown state-of-the-art (SotA) compression results in post-training and retraining-free settings while maintaining high predictive performance. However, previous research mainly…

Computation and Language · Computer Science 2025-11-18 Simon Kurz , Jian-Jia Chen , Lucie Flek , Zhixue Zhao

Multi-Task Instruction Tuning of LLaMa for Specific Scenarios: A Preliminary Study on Writing Assistance

Proprietary Large Language Models (LLMs), such as ChatGPT, have garnered significant attention due to their exceptional capabilities in handling a diverse range of tasks. Recent studies demonstrate that open-sourced smaller foundational…

Computation and Language · Computer Science 2023-10-10 Yue Zhang , Leyang Cui , Deng Cai , Xinting Huang , Tao Fang , Wei Bi

Multi-task Learning for Low-resource Second Language Acquisition Modeling

Second language acquisition (SLA) modeling is to predict whether second language learners could correctly answer the questions according to what they have learned. It is a fundamental building block of the personalized learning system and…

Computation and Language · Computer Science 2020-09-01 Yong Hu , Heyan Huang , Tian Lan , Xiaochi Wei , Yuxiang Nie , Jiarui Qi , Liner Yang , Xian-Ling Mao