Related papers: Prototype-Based Dynamic Steering for Large Languag…

200 papers

Chain-of-thought (CoT) prompting has been extended to large audio-language models (LALMs) to elicit reasoning, yet enhancing its effectiveness without training remains challenging. We study inference-time model steering as a training-free…

Sound · Computer Science 2026-03-17 Lok-Lam Ieong, Chia-Chien Chen, Chih-Kai Yang, Yu-Han Huang, An-Yu Cheng, Hung-yi Lee

Chain-of-Thought (CoT) prompting has improved the reasoning performance of large language models (LLMs), but it remains unclear why it works and whether it is the unique mechanism for triggering reasoning in large language models. In this…

Computation and Language · Computer Science 2026-01-14 Zhenghao He, Guangzhi Xiong, Bohan Liu, Sanchit Sinha, Aidong Zhang

Recent work on activation and latent steering has demonstrated that modifying internal representations can effectively guide large language models (LLMs) toward improved reasoning and efficiency without additional training. However, most…

Machine Learning · Computer Science 2026-01-07 Tuc Nguyen, Thai Le

Large language models can be steered at inference time through prompting or activation interventions, but activation steering methods often underperform compared to prompt-based approaches. We propose a framework that formulates prompt…

Computation and Language · Computer Science 2026-05-06 Geert Heyman, Frederik Vandeputte

Large language models (LLMs) demonstrate strong chain-of-thought (CoT) reasoning abilities, while smaller models (<= 3B parameters) significantly underperform on multi-step reasoning tasks. Based on empirical analyses of the Qwen-2.5 model…

Artificial Intelligence · Computer Science 2026-05-29 Yang Ouyang, Shuhang Lin, Jung-Eun Kim

Reinforcement learning (RL) finetuning has become a key technique for enhancing the reasoning abilities of large language models (LLMs). However, its effectiveness critically depends on the selection of training data. Recent advances…

Machine Learning · Computer Science 2026-03-12 Yixiu Mao, Yun Qu, Qi Wang, Heming Zou, Xiangyang Ji

Large language models (LLMs) can perform complex reasoning by generating intermediate reasoning steps. Providing these steps for prompting demonstrations is called chain-of-thought (CoT) prompting. CoT prompting has two major paradigms. One…

Computation and Language · Computer Science 2022-10-10 Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola

Large Language Models (LLMs) often rely on long chain-of-thought (CoT) reasoning to solve complex tasks. While effective, these trajectories are frequently inefficient, leading to high latency from excessive token generation, or unstable…

This paper investigates how Large Language Models (LLMs) represent non-English tokens -- a question that remains underexplored despite recent progress. We propose a lightweight intervention method using representation steering, where a…

Computation and Language · Computer Science 2025-08-27 Omar Mahmoud, Buddhika Laknath Semage, Thommen George Karimpanal, Santu Rana

Large Language Models (LLMs) are important tools for reasoning and problem-solving, while they often operate passively, answering questions without actively discovering new ones. This limitation reduces their ability to simulate human-like…

Computational Engineering, Finance, and Science · Computer Science 2025-09-26 Hong Su

Current large-language models (LLMs) typically adopt a fixed reasoning strategy, either simple or complex, for all questions, regardless of their difficulty. This neglect of variation in task and reasoning process complexity leads to an…

Computation and Language · Computer Science 2025-05-27 Yi Wang, Junxiao Liu, Shimao Zhang, Jiajun Chen, Shujian Huang

Activation steering methods enable inference-time control of large language model (LLM) behavior without retraining, but current approaches face a fundamental trade-off: sample-efficient methods suboptimally capture steering signals from…

Machine Learning · Computer Science 2026-03-09 Kartik Sharma, Rakshit S. Trivedi

Large Language Models (LLMs) have demonstrated remarkable performance across diverse tasks and exhibited impressive reasoning abilities by applying zero-shot Chain-of-Thought (CoT) prompting. However, due to the evolving nature of sentence…

Computation and Language · Computer Science 2024-02-09 Feihu Jin, Yifan Liu, Ying Tan

Role-playing has emerged as an effective technique for enhancing the reasoning capabilities of large language models (LLMs). However, existing methods primarily rely on prompt engineering, which often lacks stability and interpretability.…

Computation and Language · Computer Science 2025-09-30 Anyi Wang, Dong Shu, Yifan Wang, Yunpu Ma, Mengnan Du

Reflection, the ability of large language models (LLMs) to evaluate and revise their own reasoning, has been widely used to improve performance on complex reasoning tasks. Yet, most prior work emphasizes designing reflective prompting…

Machine Learning · Computer Science 2025-12-12 Fu-Chieh Chang, Yu-Ting Lee, Pei-Yuan Wu

Activation steering methods control large language model (LLM) behavior by modifying internal activations at inference time. However, most existing activation steering methods rely on a fixed steering strength, leading to either…

Computation and Language · Computer Science 2025-10-16 Arthur Vogels, Benjamin Wong, Yann Choho, Annabelle Blangero, Milan Bhan

Large Language Models employing extended chain-of-thought (CoT) reasoning often suffer from the overthinking phenomenon, generating excessive and redundant reasoning steps that increase computational costs while potentially degrading…

Computation and Language · Computer Science 2026-04-14 Aryasomayajula Ram Bharadwaj

We propose cognitive prompting as a novel approach to guide problem-solving in large language models (LLMs) through structured, human-like cognitive operations, such as goal clarification, decomposition, filtering, abstraction, and pattern…

Computation and Language · Computer Science 2024-12-03 Oliver Kramer, Jill Baumann

Preference alignment is a critical step in making Large Language Models (LLMs) useful and aligned with (human) preferences. Existing approaches such as Reinforcement Learning from Human Feedback or Direct Preference Optimization typically…

Computation and Language · Computer Science 2025-09-30 Lucio La Cava, Andrea Tagarelli

Large language models (LLMs) have shown impressive abilities in leveraging pretrained knowledge through prompting, but they often struggle with unseen tasks, particularly in data-scarce scenarios. While cross-task in-context learning offers…

Computation and Language · Computer Science 2025-07-18 Xinyu Tang, Zhihao Lv, Xiaoxue Cheng, Junyi Li, Wayne Xin Zhao, Zujie Wen, Zhiqiang Zhang, Jun Zhou