Related papers: Fork, Explore, Commit: OS Primitives for Agentic E…

Toward Systems Foundations for Agentic Exploration

Agentic exploration, letting LLM-powered agents branch, backtrack, and search across many execution paths, demands systems support well beyond today's pass-at-k resets. Our benchmark of six snapshot/restore mechanisms shows that generic…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-08 Jiakai Xu , Tianle Zhou , Eugene Wu , Kostis Kaffes

BranchBench: Aligning Database Branching with Agentic Demands

Branchable databases are evolving from developer tools to infrastructure for agentic workloads characterized by speculative mutations and non-linear state exploration. Traditional RDBMS mechanisms such as nested transactions do not provide…

Databases · Computer Science 2026-04-21 Elaine Ang , Sam Weldon , In Keun Kim , Kevin Durand , Kostis Kaffes , Eugene Wu

Agentic Repository Mining: A Multi-Task Evaluation

Mining software repositories often requires classifying artifacts like commits, reviews, code lines, or entire repositories into categories. Human labeling is expensive and error-prone; limited context frequently leads to misclassifications…

Software Engineering · Computer Science 2026-05-07 Johannes Härtel

AgentOS: From Application Silos to a Natural Language-Driven Data Ecosystem

The rapid emergence of open-source, locally hosted intelligent agents marks a critical inflection point in human-computer interaction. Systems such as OpenClaw demonstrate that Large Language Model (LLM)-based agents can autonomously…

Artificial Intelligence · Computer Science 2026-03-13 Rui Liu , Tao Zhe , Dongjie Wang , Zijun Yao , Kunpeng Liu , Yanjie Fu , Huan Liu , Jian Pei

Sandlock: Confining AI Agent Code with Unprivileged Linux Primitives

AI agents increasingly run untrusted code on developer machines: shell commands generated by language models, third-party scripts retrieved at runtime, and tool plugins of unknown provenance. Existing isolation mechanisms impose tradeoffs…

Cryptography and Security · Computer Science 2026-05-27 Cong Wang , Yusheng Zheng

ANCHOR: Branch-Point Data Generation for GUI Agents

End-to-end GUI agents for real desktop environments require large amounts of high-quality interaction data, yet collecting human demonstrations is expensive and existing synthetic pipelines often suffer from limited task diversity or noisy,…

Artificial Intelligence · Computer Science 2026-04-14 Jinbiao Wei , Yilun Zhao , Kangqi Ni , Arman Cohan

AgentCgroup: Understanding and Controlling OS Resources of AI Agents

AI agents are increasingly deployed in multi-tenant cloud environments, where they execute diverse tool calls within sandboxed containers, each call with distinct resource demands and rapid fluctuations. We present a systematic…

Operating Systems · Computer Science 2026-02-24 Yusheng Zheng , Jiakun Fan , Quanzhi Fu , Yiwei Yang , Wei Zhang , Andi Quinn

FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Coworking AI agents operating within local file systems are rapidly emerging as a paradigm in human-AI interaction; however, effective personalization remains limited by severe data constraints, as strict privacy barriers and the difficulty…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Shuai Liu , Shulin Tian , Kairui Hu , Yuhao Dong , Zhe Yang , Bo Li , Jingkang Yang , Chen Change Loy , Ziwei Liu

Inject, Fork, Compare: Defining an Interaction Vocabulary for Multi-Agent Simulation Platforms

LLM-based multi-agent simulations are a rapidly growing field of research, but current simulations often lack clear modes for interaction and analysis, limiting the "what if" scenarios researchers are able to investigate. In this demo, we…

Multiagent Systems · Computer Science 2025-09-18 HwiJoon Lee , Martina Di Paola , Yoo Jin Hong , Quang-Huy Nguyen , Joseph Seering

Interpretable Context Methodology: Folder Structure as Agentic Architecture

Current approaches to AI agent orchestration typically involve building multi-agent frameworks that manage context passing, memory, error handling, and step coordination through code. These frameworks work well for complex, concurrent…

Artificial Intelligence · Computer Science 2026-03-19 Jake Van Clief , David McDermott

MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents

Developing AI agents powered by large language models (LLMs) faces significant challenges in achieving true Turing completeness and adaptive, code-driven evolution. Current approaches often generate code independently of its runtime…

Software Engineering · Computer Science 2024-09-25 Ming Zhu , Yi Zhou

Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation

The automation of scientific discovery represents a critical milestone in Artificial Intelligence (AI) research. However, existing agentic systems for science suffer from two fundamental limitations: rigid, pre-programmed workflows that…

Artificial Intelligence · Computer Science 2025-10-20 Ed Li , Junyu Ren , Xintian Pan , Cat Yan , Chuanhao Li , Dirk Bergemann , Zhuoran Yang

SciFi: A Safe, Lightweight, User-Friendly, and Fully Autonomous Agentic AI Workflow for Scientific Applications

Recent advances in agentic AI have enabled increasingly autonomous workflows, but existing systems still face substantial challenges in achieving reliable deployment in real-world scientific research. In this work, we present a safe,…

Artificial Intelligence · Computer Science 2026-04-16 Qibin Liu , Julia Gonski

Context Branching for LLM Conversations: A Version Control Approach to Exploratory Programming

Large Language Models (LLMs) have become integral to software engineering workflows, yet their effectiveness degrades significantly in multi-turn conversations. Recent studies demonstrate an average 39% performance drop when instructions…

Software Engineering · Computer Science 2025-12-17 Bhargav Chickmagalur Nanjundappa , Spandan Maaheshwari

Everything is Context: Agentic File System Abstraction for Context Engineering

Generative AI (GenAI) has reshaped software system design by introducing foundation models as pre-trained subsystems that redefine architectures and operations. The emerging challenge is no longer model fine-tuning but context…

Software Engineering · Computer Science 2025-12-08 Xiwei Xu , Robert Mao , Quan Bai , Xuewu Gu , Yechao Li , Liming Zhu

Context is all you need: Towards autonomous model-based process design using agentic AI in flowsheet simulations

Agentic AI systems integrating large language models (LLMs) with reasoning and tooluse capabilities are transforming various domains - in particular, software development. In contrast, their application in chemical process flowsheet…

Artificial Intelligence · Computer Science 2026-03-16 Pascal Schäfer , Lukas J. Krinke , Martin Wlotzka , Norbert Asprion

The Next-Generation OS Process Abstraction

Operating Systems are built upon a set of abstractions to provide resource management and programming APIs for common functionality, such as synchronization, communication, protection, and I/O. The process abstraction is the bridge across…

Operating Systems · Computer Science 2022-05-26 Rodrigo Siqueira , Nelson Lago , Fabio Kon , Dejan Milojičić

TClone: Low-Latency Forking of Live GUI Environments for Computer-Use Agents

Computer-use agents increasingly operate inside live personal workspaces, where their actions can modify files, applications, GUI state, credentials, and authenticated sessions. This creates a tension between safety and quality: agents need…

Operating Systems · Computer Science 2026-05-19 Yutong Huang , Vikranth Srivatsa , Alex Asch , Hansin Tushar Patwa , Yiying Zhang

{\mu}Fork: Supporting POSIX fork Within a Single-Address-Space OS

Single-address-space operating systems have well-known lightweightness benefits that result from their central design idea: the kernel and applications share a unique address space. This model makes these operating systems (OSes)…

Operating Systems · Computer Science 2025-09-16 John Alistair Kressel , Hugo Lefeuvre , Pierre Olivier

Don't Let AI Agents YOLO Your Files: Shifting Information and Control to Filesystems for Agent Safety and Autonomy

AI coding agents operate directly on users' filesystems, where they regularly corrupt data, delete files, and leak secrets. Current approaches force a tradeoff between safety and autonomy: unrestricted access risks harm, while frequent…

Operating Systems · Computer Science 2026-04-17 Shawn Wanxiang Zhong , Junxuan Liao , Jing Liu , Mai Zheng , Andrea C. Arpaci-Dusseau , Remzi H. Arpaci-Dusseau