Operating Systems - Scifaro

WIO: Upload-Enabled Computational Storage on CXL SSDs

The widening gap between processor speed and storage latency has made data movement a dominant bottleneck in modern systems. Two lines of storage-layer innovation attempted to close this gap: persistent memory shortened the latency…

Operating Systems · Computer Science 2026-04-06 Yiwei Yang, Yanpeng Hu, Yusheng Zheng, Estabon Ramos, Jianchang Su, Andi Quinn, Wei Zhang

HACache: Leveraging Read Performance with Cache in a Heterogeneous Array

In cost-sensitive deployments, RAID arrays may combine SSDs with different performance levels. Such heterogeneity arises when aging SSDs degrade yet remain usable, or when failed drives are replaced with new devices of explicitly better…

Operating Systems · Computer Science 2026-04-03 Jialin Liu, Liang Shi, Dingcui Yu

DAXFS: A Lock-Free Shared Filesystem for CXL Disaggregated Memory

CXL (Compute Express Link) enables multiple hosts to share byte-addressable memory with hardware cache coherence, but no existing filesystem exploits this for lock-free multi-host coordination. We present DaxFS, a Linux filesystem for CXL…

Operating Systems · Computer Science 2026-04-03 Cong Wang, Yiwei Yang, Yusheng Zheng

StepCache: Step-Level Reuse with Lightweight Verification and Selective Patching for LLM Serving

We address LLM serving workloads where repeated requests share a common solution structure but differ in localized constraints, such as output schema, variable names, or numeric constants. Prior caching approaches typically reuse either…

Operating Systems · Computer Science 2026-04-01 Azam Nouri

GateANN: I/O-Efficient Filtered Vector Search on SSDs

We present GateANN, an I/O-efficient SSD-based graph ANNS system that supports filtered vector search on an unmodified graph index. Existing SSD-based systems either waste I/O by post-filtering, or require expensive filter-aware index…

Operating Systems · Computer Science 2026-03-27 Nakyung Lee, Soobin Cho, Jiwoong Park, Gyuyeong Kim

From Imperative to Declarative: Towards LLM-friendly OS Interfaces for Boosted Computer-Use Agents

Computer-use agents (CUAs) powered by large language models (LLMs) have emerged as a promising approach to automating computer tasks, yet they struggle with the existing human-oriented OS interfaces: graphical user interfaces (GUIs). GUIs…

Operating Systems · Computer Science 2026-03-26 Yuan Wang, Mingyu Li, Haibo Chen

Wayfinder: Automated Operating System Specialization

Specializing an OS to optimize the performance of a particular application is typically a manual process that requires great expertise. Specialization through configuration lends itself well to automation; however, it is challenging due to…

Operating Systems · Computer Science 2026-03-25 Alexander Jung, Cezar Crăciunoiu, Nikolaos Karaolidis, Hugo Lefeuvre, Daniel Oñoro Rubio, Felipe Huici, Charalampos Rotsos, Pierre Olivier

Tock: From Research to Securing 10 Million Computers

Tock began 10 years ago as a research operating system developed by academics to help other academics build urban sensing applications. By leveraging a new language (Rust) and new hardware protection mechanisms, Tock enabled…

Operating Systems · Computer Science 2026-03-25 Leon Schuermann, Brad Campbell, Branden Ghena, Philip Levis, Amit Levy, Pat Pannuto

2DIO: A Cache-Accurate Storage Microbenchmark

We introduce 2DIO, a microbenchmark creating cache-accurate, stressful I/O traces. While existing tools are limited to generating traces with well-behaved, concave hit ratio curves, 2DIO produces ones with tunable complex cache behaviors,…

Operating Systems · Computer Science 2026-03-23 Yirong Wang, Isaac Khor, Peter Desnoyers

Fork, Explore, Commit: OS Primitives for Agentic Exploration

AI agents increasingly perform agentic exploration: pursuing multiple solution paths in parallel and committing only the successful one. Because each exploration path may modify files and spawn processes, agents require isolated…

Operating Systems · Computer Science 2026-03-20 Cong Wang, Yusheng Zheng

AppFlow: Memory Scheduling for Cold Launch of Large Apps on Mobile and Vehicle Systems

GB-scale large apps like on-device LLMs and rich media editors are becoming the next-generation trend, but their heavy memory and I/O demands, especially during multitasking, cause devices to reclaim or kill processes, turning warm apps…

Operating Systems · Computer Science 2026-03-19 Xiaochen Li, Sicong Liu, Bin Guo, Yu Ouyang, Fengmin Wu, Yuan Xu, Zhiwen Yu

Idiosyncrasies of Programmable Caching Engines

Programmable caching engines like CacheLib are widely used in production systems to support diverse workloads in multi-tenant environments. CacheLib's design focuses on performance, portability, and configurability, allowing applications to…

Operating Systems · Computer Science 2026-03-17 José Peixoto, Alexis Gonzalez, Janki Bhimani, Raju Rangaswami, Cláudia Brito, João Paulo, Ricardo Macedo

AgentRM: An OS-Inspired Resource Manager for LLM Agent Systems

Large Language Model (LLM) agent systems have experienced rapid adoption across diverse domains, yet they suffer from critical user experience problems that limit their practical deployment. Through an empirical analysis of over 40,000…

Operating Systems · Computer Science 2026-03-16 Jianshu She

ThunderAgent: A Simple, Fast and Program-Aware Agentic Inference System

Large language models (LLMs) are now used to power complex multi-turn agentic workflows. Existing systems run agentic inference by loosely assembling isolated components: an LLM inference engine (e.g., vLLM) and a tool orchestrator (e.g.,…

Operating Systems · Computer Science 2026-03-12 Hao Kang, Ziyang Li, Xinyu Yang, Weili Xu, Yinfang Chen, Junxiong Wang, Beidi Chen, Tushar Krishna, Chenfeng Xu, Simran Arora

Ensuring Data Freshness in Multi-Rate Task Chains Scheduling

In safety-critical autonomous systems, data freshness presents a fundamental design challenge. While the Logical Execution Time (LET) paradigm ensures compositional determinism, it often does so at the cost of injected latency, degrading…

Operating Systems · Computer Science 2026-03-11 José Luis Conradi Hoffmann, Antônio Augusto Fröhlich

The Missing Memory Hierarchy: Demand Paging for LLM Context Windows

The context window of a large language model is not memory. It is L1 cache: a small, fast, expensive resource that the field treats as the entire memory system. There is no L2, no virtual memory, no paging. Every tool definition, every…

Operating Systems · Computer Science 2026-03-11 Tony Mason

OBASE: Object-Based Address-Space Engineering to Improve Memory Tiering

Hardware and OS mechanisms for memory tiering are widely deployed, yet datacenters still overprovision DRAM. The root cause is hotness fragmentation: allocators place objects by size rather than access pattern, so hot and cold objects…

Operating Systems · Computer Science 2026-03-03 Vinay Banakar, Suli Yang, Kan Wu, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, Kimberly Keeton

Exploiting Dependency and Parallelism: Real-Time Scheduling and Analysis for GPU Tasks

With the rapid advancement of Artificial Intelligence, the Graphics Processing Unit (GPU) has become increasingly essential across a growing number of safety-critical application domains. Applying a GPU is indispensable for parallel…

Operating Systems · Computer Science 2026-02-25 Yuanhai Zhang, Songyang He, Ruizhe Gou, Mingyue Cui, Boyang Li, Shuai Zhao, Kai Huang

AgentCgroup: Understanding and Controlling OS Resources of AI Agents

AI agents are increasingly deployed in multi-tenant cloud environments, where they execute diverse tool calls within sandboxed containers, each call with distinct resource demands and rapid fluctuations. We present a systematic…

Operating Systems · Computer Science 2026-02-24 Yusheng Zheng, Jiakun Fan, Quanzhi Fu, Yiwei Yang, Wei Zhang, Andi Quinn

BYOS: Knowledge-driven Large Language Models Bring Your Own Operating System More Excellent

Operating system (OS) kernel tuning is a critical yet challenging problem for performance optimization, due to the large configuration space, complex interdependencies among configuration options, and the rapid evolution of kernel versions.…

Operating Systems · Computer Science 2026-02-13 Hongyu Lin, Yuchen Li, Haoran Luo, Kaichun Yao, Libo Zhang, Zhenghong Lin, Mingjie Xing, Yanjun Wu, Carl Yang