Related papers: Prototype-Based Dynamic Steering for Large Languag…

Nudging Hidden States: Training-Free Model Steering for Chain-of-Thought Reasoning in Large Audio-Language Models

Chain-of-thought (CoT) prompting has been extended to large audio-language models (LALMs) to elicit reasoning, yet enhancing its effectiveness without training remains challenging. We study inference-time model steering as a training-free…

Sound · Computer Science 2026-03-17 Lok-Lam Ieong, Chia-Chien Chen, Chih-Kai Yang, Yu-Han Huang, An-Yu Cheng, Hung-yi Lee

Reasoning Beyond Chain-of-Thought: A Latent Computational Mode in Large Language Models

Chain-of-Thought (CoT) prompting has improved the reasoning performance of large language models (LLMs), but it remains unclear why it works and whether it is the unique mechanism for triggering reasoning in large language models. In this…

Computation and Language · Computer Science 2026-01-14 Zhenghao He, Guangzhi Xiong, Bohan Liu, Sanchit Sinha, Aidong Zhang

ATLAS: Adaptive Test-Time Latent Steering with External Verifiers for Enhancing LLMs Reasoning

Recent work on activation and latent steering has demonstrated that modifying internal representations can effectively guide large language models (LLMs) toward improved reasoning and efficiency without additional training. However, most…

Machine Learning · Computer Science 2026-01-07 Tuc Nguyen, Thai Le

Steer Like the LLM: Activation Steering that Mimics Prompting

Large language models can be steered at inference time through prompting or activation interventions, but activation steering methods often underperform compared to prompt-based approaches. We propose a framework that formulates prompt…

Computation and Language · Computer Science 2026-05-06 Geert Heyman, Frederik Vandeputte

DenseSteer: Steering Small Language Models towards Dense Math Reasoning

Large language models (LLMs) demonstrate strong chain-of-thought (CoT) reasoning abilities, while smaller models (<= 3B parameters) significantly underperform on multi-step reasoning tasks. Based on empirical analyses of the Qwen-2.5 model…

Artificial Intelligence · Computer Science 2026-05-29 Yang Ouyang, Shuhang Lin, Jung-Eun Kim

Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models

Reinforcement learning (RL) finetuning has become a key technique for enhancing the reasoning abilities of large language models (LLMs). However, its effectiveness critically depends on the selection of training data. Recent advances…

Machine Learning · Computer Science 2026-03-12 Yixiu Mao, Yun Qu, Qi Wang, Heming Zou, Xiangyang Ji

Automatic Chain of Thought Prompting in Large Language Models

Large language models (LLMs) can perform complex reasoning by generating intermediate reasoning steps. Providing these steps for prompting demonstrations is called chain-of-thought (CoT) prompting. CoT prompting has two major paradigms. One…

Computation and Language · Computer Science 2022-10-10 Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola

Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

Large Language Models (LLMs) often rely on long chain-of-thought (CoT) reasoning to solve complex tasks. While effective, these trajectories are frequently inefficient, leading to high latency from excessive token generation, or unstable…

Computation and Language · Computer Science 2026-01-21 Zhenyu Zhang, Xiaoxia Wu, Zhongzhu Zhou, Qingyang Wu, Yineng Zhang, Pragaash Ponnusamy, Harikaran Subbaraj, Jue Wang, Shuaiwen Leon Song, Ben Athiwaratkun

Improving Multilingual Language Models by Aligning Representations through Steering

This paper investigates how Large Language Models (LLMs) represent non-English tokens -- a question that remains underexplored despite recent progress. We propose a lightweight intervention method using representation steering, where a…

Computation and Language · Computer Science 2025-08-27 Omar Mahmoud, Buddhika Laknath Semage, Thommen George Karimpanal, Santu Rana

Difference-Guided Reasoning: A Temporal-Spatial Framework for Large Language Models

Large Language Models (LLMs) are important tools for reasoning and problem-solving, while they often operate passively, answering questions without actively discovering new ones. This limitation reduces their ability to simulate human-like…

Computational Engineering, Finance, and Science · Computer Science 2025-09-26 Hong Su

PATS: Process-Level Adaptive Thinking Mode Switching

Current large-language models (LLMs) typically adopt a fixed reasoning strategy, either simple or complex, for all questions, regardless of their difficulty. This neglect of variation in task and reasoning process complexity leads to an…

Computation and Language · Computer Science 2025-05-27 Yi Wang, Junxiao Liu, Shimao Zhang, Jiajun Chen, Shujian Huang

COLD-Steer: Steering Large Language Models via In-Context One-step Learning Dynamics

Activation steering methods enable inference-time control of large language model (LLM) behavior without retraining, but current approaches face a fundamental trade-off: sample-efficient methods suboptimally capture steering signals from…

Machine Learning · Computer Science 2026-03-09 Kartik Sharma, Rakshit S. Trivedi

Zero-Shot Chain-of-Thought Reasoning Guided by Evolutionary Algorithms in Large Language Models

Large Language Models (LLMs) have demonstrated remarkable performance across diverse tasks and exhibited impressive reasoning abilities by applying zero-shot Chain-of-Thought (CoT) prompting. However, due to the evolving nature of sentence…

Computation and Language · Computer Science 2024-02-09 Feihu Jin, Yifan Liu, Ying Tan

Improving LLM Reasoning through Interpretable Role-Playing Steering

Role-playing has emerged as an effective technique for enhancing the reasoning capabilities of large language models (LLMs). However, existing methods primarily rely on prompt engineering, which often lacks stability and interpretability.…

Computation and Language · Computer Science 2025-09-30 Anyi Wang, Dong Shu, Yifan Wang, Yunpu Ma, Mengnan Du

Unveiling the Latent Directions of Reflection in Large Language Models

Reflection, the ability of large language models (LLMs) to evaluate and revise their own reasoning, has been widely used to improve performance on complex reasoning tasks. Yet, most prior work emphasizes designing reflective prompting…

Machine Learning · Computer Science 2025-12-12 Fu-Chieh Chang, Yu-Ting Lee, Pei-Yuan Wu

In-Distribution Steering: Balancing Control and Coherence in Language Model Generation

Activation steering methods control large language model (LLM) behavior by modifying internal activations at inference time. However, most existing activation steering methods rely on a fixed steering strength, leading to either…

Computation and Language · Computer Science 2025-10-16 Arthur Vogels, Benjamin Wong, Yann Choho, Annabelle Blangero, Milan Bhan

STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning

Large Language Models employing extended chain-of-thought (CoT) reasoning often suffer from the overthinking phenomenon, generating excessive and redundant reasoning steps that increase computational costs while potentially degrading…

Computation and Language · Computer Science 2026-04-14 Aryasomayajula Ram Bharadwaj

Unlocking Structured Thinking in Language Models with Cognitive Prompting

We propose cognitive prompting as a novel approach to guide problem-solving in large language models (LLMs) through structured, human-like cognitive operations, such as goal clarification, decomposition, filtering, abstraction, and pattern…

Computation and Language · Computer Science 2024-12-03 Oliver Kramer, Jill Baumann

Toward Preference-aligned Large Language Models via Residual-based Model Steering

Preference alignment is a critical step in making Large Language Models (LLMs) useful and aligned with (human) preferences. Existing approaches such as Reinforcement Learning from Human Feedback or Direct Preference Optimization typically…

Computation and Language · Computer Science 2025-09-30 Lucio La Cava, Andrea Tagarelli

Enhancing Cross-task Transfer of Large Language Models via Activation Steering

Large language models (LLMs) have shown impressive abilities in leveraging pretrained knowledge through prompting, but they often struggle with unseen tasks, particularly in data-scarce scenarios. While cross-task in-context learning offers…

Computation and Language · Computer Science 2025-07-18 Xinyu Tang, Zhihao Lv, Xiaoxue Cheng, Junyi Li, Wayne Xin Zhao, Zujie Wen, Zhiqiang Zhang, Jun Zhou