软件工程 — Scifaro

Towards Better Linux Kernel Fault Localization: Leveraging Contrastive Reasoning and Hierarchical Context Analysis

Debugging the Linux kernel remains a formidable challenge due to its vast codebase, complex architecture, and low-level programming intricacies. Effective fault localization (FL) is thus essential for efficient kernel debugging and…

软件工程 · 计算机科学 2026-07-01 Haichi Wang , Ruiguo Yu , Yesong Pang , Yingquan Zhao , Junjie Chen , Jiajun Jiang , Zan Wang

A Methodology for Investigating AI Patterns Prevalence in Software Repositories

As Artificial Intelligence(AI)-based applications take off, a clear understanding of AI patterns can uplift the quality of AI applications. Many AI patterns have been proposed in the literature; however, their prevalence in real-life code…

软件工程 · 计算机科学 2026-07-01 Srinath Perera , Hasinthaka Piyumal , Frank Leymann , Rania Khalaf

Rise From The Ashes: LLM-based Static Analysis for Deep Learning Framework Bugs

Deep learning (DL) frameworks are critical AI infrastructures that often hide bugs with serious security implications. While dynamic approaches such as fuzzing are effective in uncovering these bugs, they require real test execution and…

软件工程 · 计算机科学 2026-07-01 Shaoyu Yang , Haifeng Lin , Chunrong Fang , Xiang Chen , Wei Cheng , Jiawei Liu , Yiyu Zhang , Hongyu Liu , Zhenyu Chen

Auditing Empirical Comparisons in Quantum Software

Empirical quantum-software papers often report that one compiler, optimizer, backend, or ansatz outperforms another. Such comparisons are not properties of a tool alone: they can change with benchmark scope, circuit construction,…

软件工程 · 计算机科学 2026-07-01 Boshuai Ye , Peng Liang , Maryam Tavassoli Sabzevari , Arif Ali Khan

Large Language Models for Multi-Lingual Equivalent Mutant Detection: An Extended Empirical Study

Mutation testing is a powerful technique for ensuring software quality. However, the presence of equivalent mutants introduces unnecessary costs and biases, limiting its practical effectiveness. Although numerous equivalent mutant detection…

软件工程 · 计算机科学 2026-07-01 Honglin Shu , Zhao Tian , Dong Wang , Junji Yu , Jiazhe Zhang , Xuejie Cao , Junjie Chen , Yasutaka Kamei

Social coding platforms such as GitHub host millions of repositories, yet many suffer from high mortality rates. Despite this, several survival factors remain poorly understood. Human capital is widely recognized as essential. Social…

软件工程 · 计算机科学 2026-07-01 Mohit Kaushik , Kuljit Kaur Chahal

BT-APE: A Computationally Light Backtracking Approach to Automatic Prompt Engineering for Requirements Classification

Large language models (LLMs) are increasingly applied to requirements engineering (RE) tasks, yet the prompts guiding them are typically designed manually through trial and error, yielding inconsistent and suboptimal results. Automated…

软件工程 · 计算机科学 2026-07-01 Mohammad Amin Zadenoori , Waad Alhoshan , Jacek Dąbrowski , Liping Zhao , Alessio Ferrari

Registry-Governed Agent Lifecycle:Completing EDDOps with Evaluation-DrivenRegistration, Promotion, and Retirement on AWS AgentCore

Enterprise adoption of LLM agents requires model selection methods that balance quality, reliability, safety, latency, and cost. Evaluation-Driven Development and Operations (EDDOps) positions evaluation as a continuous governing function…

软件工程 · 计算机科学 2026-07-01 Richard Kang , Vincent Wang

The Illusion of Safety: Multi-Tier Verification of AI vs. Human C++ Code

Large language models increasingly generate C++, a memory-unsafe language where a single overlooked violation can become an exploitable bug. Yet most security evaluations of AI-generated code rely on static analysis alone, which flags…

软件工程 · 计算机科学 2026-06-30 Saif Mahmud , Fadul Sikder , Yuede Ji , Zhang Haotian , Jeff , Lei

AlgoBench: Benchmarking Algorithmic Adaptation in Code Generation

High pass rates on established programming benchmarks such as HumanEval and LiveCodeBench do not always show whether a model can reason about algorithms. Many fixed benchmarks eventually become part of the public training ecosystem through…

软件工程 · 计算机科学 2026-06-30 Xinyuan Song , Zekun Cai , Liang Zhao

A Quantitative Framework for Estimating System Complexity and Cost via Component Interface Analysis

This paper introduces a formal modeling framework designed to estimate the complexity and cost associated with system changes induced by external requirements. We model a system as a directed graph of couplings, capturing the intricate…

软件工程 · 计算机科学 2026-06-30 Ken Y. Chan

SWE-Router: Routing in Multi-turn Agentic Software Engineering Tasks

Large language models (LLMs) embedded in multi-turn agentic harnesses are reshaping software engineering (SWE), but routing every task to a frontier model is wasteful when many issues admit cheap fixes. Existing LLM routers operate on the…

软件工程 · 计算机科学 2026-06-30 Seongho Son , Sangwoong Yoon , Jiahua Tang , Shuhan Wang , Lorenz Wolf , Ilija Bogunovic

CoCoMUT: A Tool for Code-Context Mining and Automated Dataset Generation

Software-engineering assistants often need method-level context beyond an isolated body, including enclosing-class information, documentation, callers, callees, type hierarchy, and structural characteristics. Manually collecting this…

软件工程 · 计算机科学 2026-06-30 Alessandro Botta , Shiven Garisa , Jaya Vardhini Akurathi , Ahsanul Ameen Sabit , Trey Woodlief , Soneya Binta Hossain

Interface-Variant Dynamics in Software Ecosystems: Resolver-Induced Selection and Adoption in Package Graphs

Compatibility research usually treats an interface change as a local writer-reader decision. Distributed software stacks make that decision population structured: an RPC, telemetry, middleware, or service-contract variant is introduced by…

软件工程 · 计算机科学 2026-06-30 Faruk Alpay , Baris Basaran

JETO-Bench: A Reproducible Benchmark for Execution Time Improvement Patches in Java

Automated fixing of performance issues is gaining increasing attention. However, existing benchmarks of execution time improvement patches are fixed datasets that target Python, C++, or .NET and cannot be extended to new patches according…

软件工程 · 计算机科学 2026-06-30 Khashayar Etemadi , Zhendong Su

Do Machines Struggle Where Humans Do? LLM and Human Comprehension of Obfuscated Code

While code obfuscation impairs human code comprehension, it remains unclear if large language models share these failure modes. Building directly on a recent human study of program comprehension under code obfuscation, we evaluate whether…

软件工程 · 计算机科学 2026-06-30 Jack Le , Anh H. N. Nguyen , Tien N. Nguyen

AdaTrans: Automated C to Rust Transformation via Error-Adaptive Repair

The automated transformation of C code to Rust is challenging due to Rust's strict ownership and borrowing semantics. While Large Language Models (LLMs) show promise, they often produce code that violates these rules or relies on unsafe…

软件工程 · 计算机科学 2026-06-30 Xiaofan Liu , Zecan Li , Zhuang Zhao , Ziqi Shuai , Yanming Yang , Qi Xin , Jifeng Xuan

ScratchWorld: Evaluating If World Models Compute Executable Consequences

World-model evaluations often score a predicted future by overlap with a target state or observation. In sparse-change worlds, this can turn copied persistent state into apparent accuracy. We introduce ScratchWorld, an offline diagnostic…

软件工程 · 计算机科学 2026-06-30 Yufeng Lin , Jialu Zhang

Digital Sovereignty as a Quality Attribute for Software Architectures

Digital sovereignty (DS) is an increasingly important concept and political agenda throughout the world, including in the European Union (EU). However, the concept is also regrettably vague. With this critical point in mind, the paper…

软件工程 · 计算机科学 2026-06-30 Jukka Ruohonen , Justin Stark , Scott Wilkie , Mikkel Baun Kjærgaard

From Failure to Alignment: A Requirements Engineering Framework for Machine Learning Systems

Organisations designing, developing, and deploying machine learning systems (MLS) need to be able to check that these systems are trustworthy, and communicate this clearly to their stakeholders, be they different categories of users,…

软件工程 · 计算机科学 2026-06-30 Amel Bennaceur , Gopi Krishnan Rajbahadur , Prince Mercy , Bashar Nuseibeh , Faeq Alrimawi