Related papers: Reasoning-Based Software Testing

Search-based Software Testing Driven by Domain Knowledge: Reflections and New Perspectives

Search-based Software Testing (SBST) can automatically generate test cases to search for requirements violations. Unlike manual test case development, it can generate a substantial number of test cases in a limited time. However, SBST does…

Software Engineering · Computer Science 2025-12-12 Federico Formica , Mark Lawford , Claudio Menghi

Towards a Model of Testers' Cognitive Processes: Software Testing as a Problem Solving Approach

Software testing is a complex, intellectual activity based (at least) on analysis, reasoning, decision making, abstraction and collaboration performed in a highly demanding environment. Naturally, it uses and allocates multiple cognitive…

Software Engineering · Computer Science 2020-12-10 Eduard Enoiu , Gerald Tukseferi , Robert Feldt

REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once

Recent Large Reasoning Models (LRMs) have achieved remarkable progress on task-specific benchmarks, yet their evaluation methods remain constrained by isolated problem-solving paradigms. Existing benchmarks predominantly assess…

Computation and Language · Computer Science 2025-07-16 Zhuoshi Pan , Qizhi Pei , Yu Li , Qiyao Sun , Zinan Tang , H. Vicky Zhao , Conghui He , Lijun Wu

Instance Space Analysis of Search-Based Software Testing

Search-based software testing (SBST) is now a mature area, with numerous techniques developed to tackle the challenging task of software testing. SBST techniques have shown promising results and have been successfully applied in the…

Software Engineering · Computer Science 2023-12-06 Neelofar Neelofar , Kate Smith-Miles , Mario Andres Munoz , Aldeida Aleti

Requirements-Driven Automated Software Testing: A Systematic Review

Automated software testing has significant potential to enhance efficiency and reliability within software development processes. However, its broader adoption faces considerable challenges, particularly concerning alignment between test…

Software Engineering · Computer Science 2025-08-26 Fanyu Wang , Chetan Arora , Chakkrit Tantithamthavorn , Kaicheng Huang , Aldeida Aleti

Applications of Causality and Causal Inference in Software Engineering

Causal inference is a study of causal relationships between events and the statistical study of inferring these relationships through interventions and other statistical techniques. Causal reasoning is any line of work toward determining…

Software Engineering · Computer Science 2023-04-03 Patrick Chadbourne , Nasir Eisty

Language-Conditioned Robotic Manipulation with Fast and Slow Thinking

The language-conditioned robotic manipulation aims to transfer natural language instructions into executable actions, from simple pick-and-place to tasks requiring intent recognition and visual reasoning. Inspired by the dual process theory…

Robotics · Computer Science 2024-02-02 Minjie Zhu , Yichen Zhu , Jinming Li , Junjie Wen , Zhiyuan Xu , Zhengping Che , Chaomin Shen , Yaxin Peng , Dong Liu , Feifei Feng , Jian Tang

STaR: Bootstrapping Reasoning With Reasoning

Generating step-by-step "chain-of-thought" rationales improves language model performance on complex reasoning tasks like mathematics or commonsense question-answering. However, inducing language model rationale generation currently…

Machine Learning · Computer Science 2022-05-23 Eric Zelikman , Yuhuai Wu , Jesse Mu , Noah D. Goodman

Testing the Untestable? An Empirical Study on the Testing Process of LLM-Powered Software Systems

Background: Software systems powered by large language models are becoming a routine part of everyday technologies, supporting applications across a wide range of domains. In software engineering, many studies have focused on how LLMs…

Software Engineering · Computer Science 2025-08-05 Cleyton Magalhaes , Italo Santos , Brody Stuart-Verner , Ronnie de Souza Santos

Native Reasoning Models: Training Language Models to Reason on Unverifiable Data

The prevailing paradigm for training large reasoning models--combining Supervised Fine-Tuning (SFT) with Reinforcement Learning with Verifiable Rewards (RLVR)--is fundamentally constrained by its reliance on high-quality, human-annotated…

Machine Learning · Computer Science 2026-03-24 Yuanfu Wang , Zhixuan Liu , Xiangtian Li , Chaochao Lu , Chao Yang

Establishing trust in automated reasoning

Since its beginnings in the 1940s, automated reasoning by computers has become a tool of ever growing importance in scientific research. So far, the rules underlying automated reasoning have mainly been formulated by humans, in the form of…

Computers and Society · Computer Science 2025-10-20 Konrad Hinsen

Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations

Inductive reasoning, a cornerstone of human cognition, enables generalization from limited data but hasn't yet been fully achieved by large language models (LLMs). While modern LLMs excel at reasoning tasks, their ability to maintain stable…

Artificial Intelligence · Computer Science 2025-05-29 Chunyang Li , Weiqi Wang , Tianshi Zheng , Yangqiu Song

ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning

Large language models show improved downstream task performance when prompted to generate step-by-step reasoning to justify their final answers. These reasoning steps greatly improve model interpretability and verification, but objectively…

Computation and Language · Computer Science 2023-09-13 Olga Golovneva , Moya Chen , Spencer Poff , Martin Corredor , Luke Zettlemoyer , Maryam Fazel-Zarandi , Asli Celikyilmaz

Representation Improvement in Latent Space for Search-Based Testing of Autonomous Robotic Systems

Testing autonomous robotic systems, such as self-driving cars and unmanned aerial vehicles, is challenging due to their interaction with highly unpredictable environments. A common practice is to first conduct simulation-based testing,…

Neural and Evolutionary Computing · Computer Science 2025-03-27 Dmytro Humeniuk , Foutse Khomh

Leveraging Time Distortion for seamless Navigation into Data Space-Time Continuum

Intelligent software systems continuously analyze their surrounding environment and accordingly adapt their internal state. Depending on the criticality index of the situation, the system should dynamically focus or widen its analysis and…

Software Engineering · Computer Science 2014-07-18 Thomas Hartmann , Francois Fouquet , Yves Le Traon , Brice Morin

CRISP: Complex Reasoning with Interpretable Step-based Plans

Recent advancements in large language models (LLMs) underscore the need for stronger reasoning capabilities to solve complex problems effectively. While Chain-of-Thought (CoT) reasoning has been a step forward, it remains insufficient for…

Computation and Language · Computer Science 2025-07-14 Matan Vetzler , Koren Lazar , Guy Uziel , Eran Hirsch , Ateret Anaby-Tavor , Leshem Choshen

Reasoning in machine vision: learning to think fast and slow

Reasoning is a hallmark of human intelligence, enabling adaptive decision-making in complex and unfamiliar scenarios. In contrast, machine intelligence remains bound to training data, lacking the ability to dynamically refine solutions at…

Computer Vision and Pattern Recognition · Computer Science 2025-06-30 Shaheer U. Saeed , Yipei Wang , Veeru Kasivisvanathan , Brian R. Davidson , Matthew J. Clarkson , Yipeng Hu , Daniel C. Alexander

RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner

The reasoning abilities of large language models (LLMs) have improved with chain-of-thought (CoT) prompting, allowing models to solve complex tasks stepwise. However, training CoT capabilities requires detailed reasoning data, which is…

Artificial Intelligence · Computer Science 2025-04-11 Fu-Chieh Chang , Yu-Ting Lee , Hui-Ying Shih , Yi Hsuan Tseng , Pei-Yuan Wu

Testing Research Software: A Survey

Background: Research software plays an important role in solving real-life problems, empowering scientific innovations, and handling emergency situations. Therefore, the correctness and trustworthiness of research software are of absolute…

Software Engineering · Computer Science 2022-07-27 Nasir U. Eisty , Jeffrey C. Carver

Reward Reasoning Model

Reward models play a critical role in guiding large language models toward outputs that align with human expectations. However, an open challenge remains in effectively utilizing test-time compute to enhance reward model performance. In…

Computation and Language · Computer Science 2025-05-21 Jiaxin Guo , Zewen Chi , Li Dong , Qingxiu Dong , Xun Wu , Shaohan Huang , Furu Wei