软件工程 — Scifaro

Beyond Objects

A core principle of object orientation -- that the functionality of a system can be partitioned amongst objects that correspond to individuals in the problem domain -- has influenced how software has been specified, designed and implemented…

软件工程 · 计算机科学 2026-06-25 Daniel Jackson

Smaller Models, Unexpected Costs: Trade-offs in LLM Quantization for Automated Program Repair

Language Models (LLMs) are powerful toolsand have been increasingly adopted for complex software engineering tasks. As the number of parameters increases, results can often be improved, but this also imposes substantialmemory requirements.…

软件工程 · 计算机科学 2026-06-25 Fernando Vallecillos-Ruiz , Giordano d'Aloisio , Max Hort , Luca Traini , Antinisca Di Marco , Leon Moonen

On the Reproducibility of Quantum Software Defect Datasets: A Case Study of Bugs4Q

The reproducibility of software defect datasets is essential for obtaining reliable and comparable research results. Zhu et al. have shown that defect datasets such as Defects4J suffer from reproduction failures (i.e., reported bugs become…

软件工程 · 计算机科学 2026-06-25 Haruto Ohto , Yuta Ishimoto , Shinsuke Matsumoto , Shinji Kusumoto

ATGBuilder: Feature-Assisted Graph Learning for Activity Transition Graph Construction with Seed Supervision

Android applications are organized around activities that provide visual Graphical User Interface (GUI) containers that host the UI and handle user interaction events. Activity Transition Graphs (ATGs) have been widely used to model apps'…

软件工程 · 计算机科学 2026-06-25 Chenhui Cui , Zixiang Xian , Danyu Li , Tao Li , Rubing Huang , Dave Towey , Shikai Guo , Jiakun Liu

The Spec Growth Engine: Spec-Anchored, Code-Coupled, Drift-Enforced Architecture for AI-Assisted Software Development

AI coding agents dramatically accelerate implementation speed but introduce two structural failure modes that existing spec-driven approaches do not fully solve: (1) context explosion -- the agent must reason over an entire repository at…

软件工程 · 计算机科学 2026-06-25 Hartwig Grabowski

Cleaning Logs for Downstream Tasks (Registered Report)

Background: Software systems generate logs during execution to record critical events and runtime information for troubleshooting and monitoring. However, in practice, logs often contain significant amounts of redundant and irrelevant…

软件工程 · 计算机科学 2026-06-25 Zahra G. Yazdi , Van-Hoang Le , Nyyti Saarimäki , Donghwan Shin , Domenico Bianculli , Lionel Briand

How Much Static Structure Do Code Agents Need? A Study of Deterministic Anchoring

LLM-based code agents navigate repositories through keyword search but miss the structural relationships, such as call graphs, inheritance hierarchies, and configuration dependencies, that define how software actually works. This makes…

软件工程 · 计算机科学 2026-06-25 Zhihao Lin , Mingyi Zhou , Yizhuo Yang , Li Li

To Run or Not to Run: Analyzing the Cost-Effectiveness of Code Execution in LLM-Based Program Repair

LLM-based agents for program repair are increasingly built on a "generate-run-revise" paradigm, iteratively executing tests to evaluate and refine patches. This execution-based approach has become standard practice in state-of-the-art…

软件工程 · 计算机科学 2026-06-25 Zhihao Lin , Junhua Zhu , Mingyi Zhou , Xin Wang , Zhensu Sun , Renyu Yang , David Lo , Li Li

CLIR: Liveness-Driven and Structure-Aware Fuzzing for the Cranelift Compiler

Modern compilers are complex software systems that must correctly translate high-level programming languages into machine code across multiple architectures. Cranelift, a fast and modern compiler backend originally developed for WebAssembly…

软件工程 · 计算机科学 2026-06-25 Shangtong Cao , Tianlei Song , Qiuping Yi , Tianyu Chen , Guoai Xu , Ningyu He , Haoyu Wang

Are LLMs Ready for Anti-Pattern Detection in Microservice Architectures?

Microservice systems are prone to recurrent architectural anti-patterns (APs) that hinder maintainability, evolvability, and operational quality. Most existing AP detection approaches rely on static analysis and handcrafted rules, which can…

软件工程 · 计算机科学 2026-06-25 Marco De Luca , Domenico Amalfitano , Porfirio Tramontana , Anna Rita Fasolino

A Deterministic Control Plane for LLM Coding Agents

LLM coding harnesses grant agents broad file and shell access, yet the configuration layer that steers them -- rules files, agent definitions, IDE-specific markdown -- is largely unmanaged. A prevalence study of 10,008 public GitHub…

软件工程 · 计算机科学 2026-06-25 Padmaraj Madatha

Knowledge-Based Pull Requests: A Trusted Workflow for Agent-Mediated Knowledge Collaboration

AI coding agents are changing the bottleneck in software collaboration: code is increasingly cheap, while understanding intent, negotiating scope, and governing long-term project responsibility remain costly. This paper proposes…

软件工程 · 计算机科学 2026-06-25 Xinyu Zhang , Weiwei Sun

Quantum Mutant Equivalence via Transpilation

Mutation testing evaluates test suite quality by introducing artificial faults (mutants) and checking whether tests detect (kill) them. A central challenge is the equivalent mutant problem: some mutants are syntactically different from the…

软件工程 · 计算机科学 2026-06-25 José Campos , Andriy Miranskyy

ConcoLixir: Reactive LLM Discovery Oracles for Python Concolic Testing

Concolic testing combines concrete execution with symbolic constraint solving, but Python programs expose recurring limits. Library calls can cause symbolic variables to downgrade to concrete values. Regular expressions, checksums, parsers,…

软件工程 · 计算机科学 2026-06-25 Dong Chen , Chih-Duo Hong , Fang Yu

Same Scrutiny, More Time: Eye Tracking Insights into Reviewing LLM-Labelled Code

Modern software development increasingly involves the use of large language models (LLMs) to generate code. Despite their rapid advancement, LLMs remain prone to errors and hallucinations, emphasizing the importance of careful code…

软件工程 · 计算机科学 2026-06-25 Ranim Khojah , Francisco Gomes de Oliveira Neto , Mazen Mohamad , Julian Frattini , Philipp Leitner

Evaluation-Strategy Gap in Fault Diagnosis of Deep Learning Programs

Deep Learning (DL) programs can fail during training for many reasons, and diagnosing the cause is a costly and time-consuming maintenance task. Techniques for diagnosing such failures are commonly assessed using within-program…

软件工程 · 计算机科学 2026-06-25 Sigma Jahan

An Empirical Study of LLM-Generated Specifications for VeriFast

Static verification tools can assure industrial scale software, but require significant human labor to write specifications. This is particularly true of static verifiers based on separation logic (SL verifiers), which excel at verifying…

软件工程 · 计算机科学 2026-06-25 Wen Fan , Minh Tran , Sanya Dod , Xin Hu , Marilyn Rego , Danning Xie , Jenna DiVincenzo , Lin Tan

Towards Safety-Aware Mutation Testing for Autonomous Driving Systems

Simulation-based testing is essential for ensuring the safety of Autonomous Driving Systems (ADS), yet the community lacks a systematic criterion for determining when we can safely stop additional test scenario generation. Existing coverage…

软件工程 · 计算机科学 2026-06-24 Donghwan Shin

Augmentation with Dilution: A Large-Scale Empirical Study of Human Contributor Ecosystems After AI Coding Agent Adoption

AI coding agents are penetrating open-source software development at an unprecedented pace, yet existing research predominantly treats human contributors as a static backdrop rather than as the subject of inquiry. This paper presents the…

软件工程 · 计算机科学 2026-06-24 Weixing Zhang , Bowen Jiang , Anne Koziolek

Orchestrating Black-Box Schema Converters: An Empirical Study of Automated, Quality-Ranked Conversion Across Heterogeneous Schema Languages

Modern software systems routinely need the same data model in several schema languages: a model may exist as JSON Schema for a web API, as XSD for data exchange, and as SHACL for a knowledge graph. Keeping these representations consistent…

软件工程 · 计算机科学 2026-06-24 Felix Neubauer , Giridhar Chinnikkaramadom Govindan , Jürgen Pleiss , Benjamin Uekermann