Related papers: ParEVO: Synthesizing Code for Irregular Data: High…

Can Large Language Models Write Parallel Code?

Large language models are increasingly becoming a popular tool for software development. Their ability to model and generate source code has been demonstrated in a variety of contexts, including code completion, summarization, translation,…

Distributed, Parallel, and Cluster Computing · Computer Science · 2024-05-15 · Daniel Nichols, Joshua H. Davis, Zhaojun Xie, Arjun Rajaram, Abhinav Bhatele

RefEvo: Agentic Design with Co-Evolutionary Verification for Agile Reference Model Generation

As the complexity of System-on-Chip (SoC) designs grows, the shift-left paradigm necessitates the rapid development of high-fidelity reference models (typically written in SystemC) for early architecture exploration and verification. While…

Software Engineering · Computer Science · 2026-04-28 · Yifan Zhang, Jianmin Ye, Jiahao Yang, Xi Wang

GEVO-ML: Optimizing Machine Learning Code with Evolutionary Computation

Parallel accelerators, such as GPUs, are key enablers for large-scale Machine Learning (ML) applications. However, ML model developers often lack detailed knowledge of the underlying system architectures, while system programmers usually do…

Machine Learning · Computer Science · 2023-10-17 · Jhe-Yu Liou, Stephanie Forrest, Carole-Jean Wu

Understanding the Power of Evolutionary Computation for GPU Code Optimization

Achieving high performance for GPU codes requires developers to have significant knowledge in parallel programming and GPU architectures, and in-depth understanding of the application. This combination makes it challenging to find…

Software Engineering · Computer Science · 2022-08-29 · Jhe-Yu Liou, Muaaz Awan, Steven Hofmeyr, Stephanie Forrest, Carole-Jean Wu

PaREM: A Novel Approach for Parallel Regular Expression Matching

Regular expression matching is essential for many applications, such as finding patterns in text, exploring substrings in large DNA sequences, or lexical analysis. However, sequential regular expression matching may be time-prohibitive for…

Formal Languages and Automata Theory · Computer Science · 2015-06-30 · Suejb Memeti, Sabri Pllana

KCoEvo: A Knowledge Graph Augmented Framework for Evolutionary Code Generation

Code evolution is inevitable in modern software development. Changes to third-party APIs frequently break existing code and complicate maintenance, posing practical challenges for developers. While large language models (LLMs) have shown…

Software Engineering · Computer Science · 2026-03-10 · Jiazhen Kang, Yuchen Lu, Chen Jiang, Jinrui Liu, Tianhao Zhang, Bo Jiang, Ningyuan Sun, Tongtong Wu, Guilin Qi

Parsel: Algorithmic Reasoning with Language Models by Composing Decompositions

Despite recent success in large language model (LLM) reasoning, LLMs struggle with hierarchical multi-step reasoning tasks like generating complex programs. For these tasks, humans often start with a high-level algorithmic design and…

Computation and Language · Computer Science · 2023-05-30 · Eric Zelikman, Qian Huang, Gabriel Poesia, Noah D. Goodman, Nick Haber

CoupleEvo: Evolving Heuristics for Coupled Optimization Problems Using Large Language Models

Many real-world optimization problems consist of multiple tightly coupled subproblems whose solutions must be coordinated to achieve high overall performance. However, existing large language model driven automated heuristic design…

Neural and Evolutionary Computing · Computer Science · 2026-05-08 · Thomas Bömer, Bastian Amberg, Max Disselnmeyer, Anne Meyer

CodeEvo: Interaction-Driven Synthesis of Code-centric Data through Hybrid and Iterative Feedback

Acquiring high-quality instruction-code pairs is essential for training Large Language Models (LLMs) for code generation. Manually curated data is expensive and inherently limited in scale, motivating the development of code-centric…

Software Engineering · Computer Science · 2025-07-31 · Qiushi Sun, Jinyang Gong, Lei Li, Qipeng Guo, Fei Yuan

HSEvo: Elevating Automatic Heuristic Design with Diversity-Driven Harmony Search and Genetic Algorithm Using LLMs

Automatic Heuristic Design (AHD) is an active research area due to its utility in solving complex search and NP-hard combinatorial optimization problems in the real world. The recent advancements in Large Language Models (LLMs) introduce…

Neural and Evolutionary Computing · Computer Science · 2024-12-20 · Pham Vu Tuan Dat, Long Doan, Huynh Thi Thanh Binh

DHEvo: Data-Algorithm Based Heuristic Evolution for Generalizable MILP Solving

Primal heuristics play a critical role in improving the efficiency of mixed integer programming (MILP) solvers. As large language models (LLMs) have demonstrated superior code generation abilities, recent MILP works are devoted to…

Neural and Evolutionary Computing · Computer Science · 2025-07-22 · Zhihao Zhang, Siyuan Li, Chenxi Li, Feifan Liu, Mengjing Chen, Kai Li, Tao Zhong, Bo An, Peng Liu

PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation

The autoregressive nature of large language models (LLMs) fundamentally limits inference speed, as each forward pass generates only a single token and is often bottlenecked by memory bandwidth. Speculative decoding has emerged as a…

Machine Learning · Computer Science · 2025-12-02 · Zihao An, Huajun Bai, Ziqiong Liu, Dong Li, Emad Barsoum

InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding

Large language models (LLMs) with long sequences begin to power more and more fundamentally new applications we use every day. Existing methods for long-sequence LLM training are neither efficient nor compatible with commonly-used training…

Distributed, Parallel, and Cluster Computing · Computer Science · 2024-01-23 · Qiaoling Chen, Diandian Gu, Guoteng Wang, Xun Chen, YingTong Xiong, Ting Huang, Qinghao Hu, Xin Jin, Yonggang Wen, Tianwei Zhang, Peng Sun

AutoEvoEval: An Automated Framework for Evolving Close-Ended LLM Evaluation Data

Large language models (LLMs) have shown remarkable performance on various tasks, but existing evaluation benchmarks are often static and insufficient to fully assess their robustness and generalization in realistic scenarios. Prior work…

Computation and Language · Computer Science · 2025-07-01 · JiaRu Wu, Mingwei Liu

Understanding Robustness of Model Editing in Code LLMs

Large language models (LLMs) for code are increasingly used in software development, but they remain static after pretraining while APIs and software libraries continue to evolve. Model editing offers a lightweight alternative to retraining…

Software Engineering · Computer Science · 2026-05-11 · Vinaik Chhetri, Moghis Fereidouni, A. B. Siddique, Umar Farooq

HumanEvo: An Evolution-aware Benchmark for More Realistic Evaluation of Repository-level Code Generation

To evaluate the repository-level code generation capabilities of Large Language Models (LLMs) in complex real-world software development scenarios, many evaluation methods have been developed. These methods typically leverage contextual…

Software Engineering · Computer Science · 2025-03-19 · Dewu Zheng, Yanlin Wang, Ensheng Shi, Ruikai Zhang, Yuchi Ma, Hongyu Zhang, Zibin Zheng

SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios

Existing benchmarks for AI coding agents focus on isolated, single-issue tasks such as fixing a bug or adding a small feature. However, real-world software engineering is a long-horizon endeavor: developers interpret high-level…

Software Engineering · Computer Science · 2026-05-25 · Tue Le, Minh V. T. Thai, Dung Nguyen Manh, Huy Phan Nhat, Nghi D. Q. Bui

RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation

Large Language Models (LLMs) have become pivotal tools for automating code generation in software development. However, these models face significant challenges in producing version-aware code for rapidly evolving languages like Rust, where…

Software Engineering · Computer Science · 2025-03-24 · Linxi Liang, Jing Gong, Mingwei Liu, Chong Wang, Guangsheng Ou, Yanlin Wang, Xin Peng, Zibin Zheng

Increasing LLM Coding Capabilities through Diverse Synthetic Coding Tasks

Large language models (LLMs) have shown impressive promise in code generation, yet their progress remains limited by the shortage of large-scale datasets that are both diverse and well-aligned with human reasoning. Most existing resources…

Machine Learning · Computer Science · 2025-10-28 · Amal Abed, Ivan Lukic, Jörg K. H. Franke, Frank Hutter

COEVO: Co-Evolutionary Framework for Joint Functional Correctness and PPA Optimization in LLM-Based RTL Generation

LLM-based RTL code generation methods increasingly target both functional correctness and PPA quality, yet existing approaches universally decouple the two objectives, optimizing PPA only after correctness is fully achieved. Whether through…

Artificial Intelligence · Computer Science · 2026-04-20 · Heng Ping, Peiyu Zhang, Shixuan Li, Wei Yang, Anzhe Cheng, Shukai Duan, Xiaole Zhang, Paul Bogdan