Related papers: Compiled AI: Deterministic Code Generation for LLM…

Compiling Prompts, Not Crafting Them: A Reproducible Workflow for AI-Assisted Evidence Synthesis

Large language models (LLMs) offer significant potential to accelerate systematic literature reviews (SLRs), yet current approaches often rely on brittle, manually crafted prompts that compromise reliability and reproducibility. This…

Computation and Language · Computer Science 2025-09-03 Teo Susnjak

Towards automated data analysis: A guided framework for LLM-based risk estimation

Large Language Models (LLMs) are increasingly integrated into critical decision-making pipelines, a trend that raises the demand for robust and automated data analysis. Current approaches to dataset risk analysis are limited to manual…

Artificial Intelligence · Computer Science 2026-05-28 Panteleimon Rodis

Security and Quality in LLM-Generated Code: A Multi-Language, Multi-Model Analysis

Artificial Intelligence (AI)-driven code generation tools are increasingly used throughout the software development lifecycle to accelerate coding tasks. However, the security of AI-generated code using Large Language Models (LLMs) remains…

Cryptography and Security · Computer Science 2026-03-10 Mohammed Kharma , Soohyeon Choi , Mohammed AlKhanafseh , David Mohaisen

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

The rapidly growing demand for high-quality data in Large Language Models (LLMs) has intensified the need for scalable, reliable, and semantically rich data preparation pipelines. However, current practices remain dominated by ad-hoc…

Machine Learning · Computer Science 2025-12-19 Hao Liang , Xiaochen Ma , Zhou Liu , Zhen Hao Wong , Zhengyang Zhao , Zimo Meng , Runming He , Chengyu Shen , Qifeng Cai , Zhaoyang Han , Meiyi Qiang , Yalin Feng , Tianyi Bai , Zewei Pan , Ziyi Guo , Yizhen Jiang , Jingwen Deng , Qijie You , Peichao Lai , Tianyu Guo , Chi Hsu Tsai , Hengyi Feng , Rui Hu , Wenkai Yu , Junbo Niu , Bohan Zeng , Ruichuan An , Lu Ma , Jihao Huang , Yaowei Zheng , Conghui He , Linpeng Tang , Bin Cui , Weinan E , Wentao Zhang

Enhancing LLM Code Generation: A Systematic Evaluation of Multi-Agent Collaboration and Runtime Debugging for Improved Accuracy, Reliability, and Latency

The use of large language models (LLMs) for automated code generation has emerged as a significant focus within AI research. As these pretrained models continue to evolve, their ability to understand and generate complex code structures has…

Software Engineering · Computer Science 2025-05-06 Nazmus Ashrafi , Salah Bouktif , Mohammed Mediani

STELP: Secure Transpilation and Execution of LLM-Generated Programs

Rapid evolution of Large Language Models (LLMs) has achieved major advances in reasoning, planning, and function-calling capabilities. Multi-agentic collaborative frameworks using such LLMs place them at the center of solving software…

Software Engineering · Computer Science 2026-01-16 Swapnil Shinde , Sahil Wadhwa , Andy Luo , Akshay Gupta , Mohammad Shahed Sorower

Towards Single-System Illusion in Software-Defined Vehicles -- Automated, AI-Powered Workflow

We propose a novel model- and feature-based approach to development of vehicle software systems, where the end architecture is not explicitly defined. Instead, it emerges from an iterative process of search and optimization given certain…

Software Engineering · Computer Science 2024-03-22 Krzysztof Lebioda , Viktor Vorobev , Nenad Petrovic , Fengjunjie Pan , Vahid Zolfaghari , Alois Knoll

Automated Vulnerability Validation and Verification: A Large Language Model Approach

Software vulnerabilities remain a critical security challenge, providing entry points for attackers into enterprise networks. Despite advances in security practices, the lack of high-quality datasets capturing diverse exploit behavior…

Cryptography and Security · Computer Science 2025-11-17 Alireza Lotfi , Charalampos Katsis , Elisa Bertino

A Blueprint Architecture of Compound AI Systems for Enterprise

Large Language Models (LLMs) have showcased remarkable capabilities surpassing conventional NLP challenges, creating opportunities for use in production use cases. Towards this goal, there is a notable shift to building compound AI systems,…

Databases · Computer Science 2024-06-04 Eser Kandogan , Sajjadur Rahman , Nikita Bhutani , Dan Zhang , Rafael Li Chen , Kushan Mitra , Sairam Gurajada , Pouya Pezeshkpour , Hayate Iso , Yanlin Feng , Hannah Kim , Chen Shen , Jin Wang , Estevam Hruschka

Generating Automotive Code: Large Language Models for Software Development and Verification in Safety-Critical Systems

Developing safety-critical automotive software presents significant challenges due to increasing system complexity and strict regulatory demands. This paper proposes a novel framework integrating Generative Artificial Intelligence (GenAI)…

Software Engineering · Computer Science 2025-06-05 Sven Kirchner , Alois C. Knoll

Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks

Large Language Models (LLMs) have shown remarkable capabilities in code generation tasks, yet they face significant limitations in handling complex, long-context programming challenges and demonstrating complex compositional reasoning…

Artificial Intelligence · Computer Science 2025-01-14 Amr Almorsi , Mohanned Ahmed , Walid Gomaa

Rethinking Autonomy: Preventing Failures in AI-Driven Software Engineering

The integration of Large Language Models (LLMs) into software engineering has revolutionized code generation, enabling unprecedented productivity through promptware and autonomous AI agents. However, this transformation introduces…

Software Engineering · Computer Science 2025-08-19 Satyam Kumar Navneet , Joydeep Chandra

Generative AI for CAD Automation: Leveraging Large Language Models for 3D Modelling

Large Language Models (LLMs) are revolutionizing industries by enhancing efficiency, scalability, and innovation. This paper investigates the potential of LLMs in automating Computer-Aided Design (CAD) workflows, by integrating FreeCAD with…

Human-Computer Interaction · Computer Science 2025-08-05 Sumit Kumar , Sarthak Kapoor , Harsh Vardhan , Yao Zhao

An Extensive Evaluation of PDDL Capabilities in off-the-shelf LLMs

In recent advancements, large language models (LLMs) have exhibited proficiency in code generation and chain-of-thought reasoning, laying the groundwork for tackling automatic formal planning tasks. This study evaluates the potential of…

Artificial Intelligence · Computer Science 2025-02-28 Kaustubh Vyas , Damien Graux , Sébastien Montella , Pavlos Vougiouklis , Ruofei Lai , Keshuang Li , Yang Ren , Jeff Z. Pan

An Interpretable Automated Mechanism Design Framework with Large Language Models

Mechanism design has long been a cornerstone of economic theory, with traditional approaches relying on mathematical derivations. Recently, automated approaches, including differentiable economics with neural networks, have emerged for…

Machine Learning · Computer Science 2025-02-19 Jiayuan Liu , Mingyu Guo , Vincent Conitzer

FlowCompile: An Optimizing Compiler for Structured LLM Workflows

Structured LLM workflows, where specialized LLM sub-agents execute according to a predefined graph, have become a powerful abstraction for solving complex tasks. Optimizing such workflows, i.e., selecting configurations for each sub-agent…

Computation and Language · Computer Science 2026-05-14 Junyan Li , Zhang-Wei Hong , Maohao Shen , Yang Zhang , Chuang Gan

CodableLLM: Automating Decompiled and Source Code Mapping for LLM Dataset Generation

The generation of large, high-quality datasets for code understanding and generation remains a significant challenge, particularly when aligning decompiled binaries with their original source code. To address this, we present CodableLLM, a…

Software Engineering · Computer Science 2025-07-31 Dylan Manuel , Paul Rad

AI-Augmented CI/CD Pipelines: From Code Commit to Production with Autonomous Decisions

Modern software delivery has accelerated from quarterly releases to multiple deployments per day. While CI/CD tooling has matured, human decision points interpreting flaky tests, choosing rollback strategies, tuning feature flags, and…

Software Engineering · Computer Science 2026-02-24 Mohammad Baqar , Saba Naqvi , Rajat Khanda

Type-Constrained Code Generation with Language Models

Large language models (LLMs) have achieved notable success in code generation. However, they still frequently produce uncompilable output because their next-token inference procedure does not model formal aspects of code. Although…

Machine Learning · Computer Science 2025-05-09 Niels Mündler , Jingxuan He , Hao Wang , Koushik Sen , Dawn Song , Martin Vechev

Auto-repair without test cases: How LLMs fix compilation errors in large industrial embedded code

The co-development of hardware and software in industrial embedded systems frequently leads to compilation errors during continuous integration (CI). Automated repair of such failures is promising, but existing techniques rely on test…

Software Engineering · Computer Science 2025-10-16 Han Fu , Sigrid Eldh , Kristian Wiklund , Andreas Ermedahl , Philipp Haller , Cyrille Artho