操作系统 — Scifaro

FlexInfer: Breaking Memory Constraint via Flexible and Efficient Offloading for On-Device LLM Inference

Large Language Models (LLMs) face challenges for on-device inference due to high memory demands. Traditional methods to reduce memory usage often compromise performance and lack adaptability. We propose FlexInfer, an optimized offloading…

操作系统 · 计算机科学 2025-03-07 Hongchao Du , Shangyu Wu , Arina Kharlamova , Nan Guan , Chun Jason Xue

TUNA: Tuning Unstable and Noisy Cloud Applications

Autotuning plays a pivotal role in optimizing the performance of systems, particularly in large-scale cloud deployments. One of the main challenges in performing autotuning in the cloud arises from performance variability. We first…

操作系统 · 计算机科学 2025-03-07 Johannes Freischuetz , Konstantinos Kanellis , Brian Kroth , Shivaram Venkataraman

CHRONOS: Compensating Hardware Related Overheads with Native Multi Timer Support for Real-Time Operating Systems

The management of timing constraints in a real-time operating system (RTOS) is usually realized through a global tick counter. This counter acts as the foundational time unit for all tasks in the systems. In order to establish a connection…

操作系统 · 计算机科学 2025-03-04 Kay Heider , Christian Hakert , Kuan-Hsun Chen , Jian-Jia Chen

Scalable and Accurate Application-Level Crash-Consistency Testing via Representative Testing

Crash consistency is essential for applications that must persist data. Crash-consistency testing has been commonly applied to find crash-consistency bugs in applications. The crash-state space grows exponentially as the number of…

操作系统 · 计算机科学 2025-03-04 Yile Gu , Ian Neal , Jiexiao Xu , Shaun Christopher Lee , Ayman Said , Musa Haydar , Jacob Van Geffen , Rohan Kadekodi , Andrew Quinn , Baris Kasikci

Taming and Controlling Performance and Energy Trade-offs Automatically in Network Applications

In this paper, we demonstrate that a server running a single latency-sensitive application can be treated as a black box to reduce energy consumption while meeting an SLA target. We find that when the mean offered load is stable, one can…

操作系统 · 计算机科学 2025-02-24 Han Dong , Yara Awad , Sanjay Arora , Orran Krieger , Jonathan Appavoo

Ariadne: A Hotness-Aware and Size-Adaptive Compressed Swap Technique for Fast Application Relaunch and Reduced CPU Usage on Mobile Devices

Growing application memory demands and concurrent usage are making mobile device memory scarce. When memory pressure is high, current mobile systems use a RAM-based compressed swap scheme (called ZRAM) to compress unused execution-related…

操作系统 · 计算机科学 2025-02-19 Yu Liang , Aofeng Shen , Chun Jason Xue , Riwei Pan , Haiyu Mao , Nika Mansouri Ghiasi , Qingcai Jiang , Rakesh Nadig , Lei Li , Rachata Ausavarungnirun , Mohammad Sadrosadati , Onur Mutlu

Phoenix -- A Novel Technique for Performance-Aware Orchestration of Thread and Page Table Placement in NUMA Systems

The emergence of symmetric multi-processing (SMP) systems with non-uniform memory access (NUMA) has prompted extensive research on process and data placement to mitigate the performance impact of NUMA on applications. However, existing…

操作系统 · 计算机科学 2025-02-19 Mohammad Siavashi , Alireza Sanaee , Mohsen Sharifi , Gianni Antichi

EDM: An Ultra-Low Latency Ethernet Fabric for Memory Disaggregation

Achieving low remote memory access latency remains the primary challenge in realizing memory disaggregation over Ethernet within the datacenters. We present EDM that attempts to overcome this challenge using two key ideas. First, while…

操作系统 · 计算机科学 2025-02-17 Weigao Su , Vishal Shrivastav

Analyzing Configuration Dependencies of File Systems

File systems play an essential role in modern society for managing precious data. To meet diverse needs, they often support many configuration parameters. Such flexibility comes at the price of additional complexity which can lead to subtle…

操作系统 · 计算机科学 2025-02-12 Tabassum Mahmud , Om Rameshwar Gatla , Duo Zhang , Carson Love , Ryan Bumann , Varun S Girimaji , Mai Zheng

Automatic ISA analysis for Secure Context Switching

Instruction set architectures are complex, with hundreds of registers and instructions that can modify dozens of them during execution, variably on each instance. Prose-style ISA specifications struggle to capture these intricacies of the…

操作系统 · 计算机科学 2025-02-11 Neelu S. Kalani , Thomas Bourgeat , Guerney D. H. Hunt , Wojciech Ozga

Cache is King: Smart Page Eviction with eBPF

The page cache is a central part of an OS. It reduces repeated accesses to storage by deciding which pages to retain in memory. As a result, the page cache has a significant impact on the performance of many applications. However, its…

操作系统 · 计算机科学 2025-02-06 Tal Zussman , Ioannis Zarkadas , Jeremy Carin , Andrew Cheng , Hubertus Franke , Jonas Pfefferle , Asaf Cidon

CORD: Co-design of Resource Allocation and Deadline Decomposition with Generative Profiling

As multicore hardware is becoming increasingly common in real-time systems, traditional scheduling techniques that assume a single worst-case execution time for a task are no longer adequate, since they ignore the impact of shared resources…

操作系统 · 计算机科学 2025-01-16 Robert Gifford , Abby Eisenklam , Georgiy A. Bondar , Yifan Cai , Tushar Sial , Linh Thi Xuan Phan , Abhishek Halder

Symbol Resolution MatRs: Make it Fast and Observable with Stable Linking

Dynamic linking is the standard mechanism for using external dependencies since it enables code reuse, streamlines software updates, and reduces disk/network use. Dynamic linking waits until runtime to calculate an application's relocation…

操作系统 · 计算机科学 2025-01-14 Farid Zakaria , Andrew Quinn , Thomas R. W. Scogland

ByteFS: System Support for (CXL-based) Memory-Semantic Solid-State Drives

Unlike non-volatile memory that resides on the processor memory bus, memory-semantic solid-state drives (SSDs) support both byte and block access granularity via PCIe or CXL interconnects. They provide scalable memory capacity using NAND…

操作系统 · 计算机科学 2025-01-10 Shaobo Li , Yirui Eric Zhou , Hao Ren , Jian Huang

Exploiting Application-to-Architecture Dependencies for Designing Scalable OS

With the advent of hundreds of cores on a chip to accelerate applications, the operating system (OS) needs to exploit the existing parallelism provided by the underlying hardware resources to determine the right amount of processes to be…

操作系统 · 计算机科学 2025-01-07 Yao Xiao , Nikos Kanakaris , Anzhe Cheng , Chenzhong Yin , Nesreen K. Ahmed , Shahin Nazarian , Andrei Irimia , Paul Bogdan

Combining Type Checking and Formal Verification for Lightweight OS Correctness

This paper reports our experience of providing lightweight correctness guarantees to an open-source Rust OS, Theseus. First, we report new developments in intralingual design that leverage Rust's type system to enforce additional invariants…

操作系统 · 计算机科学 2025-01-03 Ramla Ijaz , Kevin Boos , Lin Zhong

Revisiting Cache Freshness for Emerging Real-Time Applications

Caching is widely used in industry to improve application performance by reducing data-access latency and taking the load off the backend infrastructure. TTLs have become the de-facto mechanism used to keep cached data reasonably fresh…

操作系统 · 计算机科学 2024-12-31 Ziming Mao , Rishabh Iyer , Scott Shenker , Ion Stoica

Interference-free Operating System: A 6 Years' Experience in Mitigating Cross-Core Interference in Linux

Real-time operating systems employ spatial and temporal isolation to guarantee predictability and schedulability of real-time systems on multi-core processors. Any unbounded and uncontrolled cross-core performance interference poses a…

操作系统 · 计算机科学 2024-12-25 Zhaomeng Deng , Ziqi Zhang , Ding Li , Yao Guo , Yunfeng Ye , Yuxin Ren , Ning Jia , Xinwei Hu

Optimizing System Memory Bandwidth with Micron CXL Memory Expansion Modules on Intel Xeon 6 Processors

High-Performance Computing (HPC) and Artificial Intelligence (AI) workloads typically demand substantial memory bandwidth and, to a degree, memory capacity. CXL memory expansion modules, also known as CXL "type-3" devices, enable…

操作系统 · 计算机科学 2024-12-18 Rohit Sehgal , Vishal Tanna , Vinicius Petrucci , Anil Godbole

Revitalising the Single Batch Environment: A 'Quest' to Achieve Fairness and Efficiency

In the realm of computer systems, efficient utilisation of the CPU (Central Processing Unit) has always been a paramount concern. Researchers and engineers have long sought ways to optimise process execution on the CPU, leading to the…

操作系统 · 计算机科学 2024-12-18 Supriya Manna , Krishna Siva Prasad Mudigonda