Related papers: SplitFS: Reducing Software Overhead in File System…

Persistent Memory File Systems: A Survey

Persistent Memory (PM) is non-volatile byte-addressable memory that offers read and write latencies in the order of magnitude smaller than flash storage, such as SSDs. This survey discusses how file systems address the most prominent…

Operating Systems · Computer Science 2023-10-05 Wiebe van Breukelen , Animesh Trivedi

SplitCom: Communication-efficient Split Federated Fine-tuning of LLMs via Temporal Compression

Federated fine-tuning of on-device large language models (LLMs) mitigates privacy concerns by preventing raw data sharing. However, the intensive computational and memory demands pose significant challenges for resource-constrained edge…

Networking and Internet Architecture · Computer Science 2026-02-13 Tao Li , Yulin Tang , Yiyang Song , Cong Wu , Xihui Liu , Pan Li , Xianhao Chen

Shared-PIM: Enabling Concurrent Computation and Data Flow for Faster Processing-in-DRAM

Processing-in-Memory (PIM) enhances memory with computational capabilities, potentially solving energy and latency issues associated with data transfer between memory and processors. However, managing concurrent computation and data flow…

Hardware Architecture · Computer Science 2025-05-09 Ahmed Mamdouh , Haoran Geng , Michael Niemier , Xiaobo Sharon Hu , Dayane Reis

SplitFT: An Adaptive Federated Split Learning System For LLMs Fine-Tuning

Federated Split Learning has been identified as an efficient approach to address the computational resource constraints of clients in classical federated learning, while guaranteeing data privacy for distributed model training across data…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-30 Yimeng Shan , Zhaorui Zhang , Sheng Di , Yu Liu , Xiaoyi Lu , Benben Liu

Persistent Memory Objects: Fast and Easy Crash Consistency for Persistent Memory

DIMM-compatible persistent memory unites memory and storage. Prior works utilize persistent memory either by combining the filesystem with direct access on memory mapped files or by managing it as a collection of objects while abolishing…

Operating Systems · Computer Science 2022-04-08 Derrick Greenspan , Naveed Ul Mustafa , Zoran Kolega , Mark Heinrich , Yan Solihin

Efficient Memory Partitioning in Software Defined Hardware

As programmers turn to software-defined hardware (SDH) to maintain a high level of productivity while programming hardware to run complex algorithms, heavy-lifting must be done by the compiler to automatically partition on-chip arrays. In…

Hardware Architecture · Computer Science 2022-03-31 Matthew Feldman , Tian Zhao , Kunle Olukotun

CFS: A Distributed File System for Large Scale Container Platforms

We propose CFS, a distributed file system for large scale container platforms. CFS supports both sequential and random file accesses with optimized storage for both large files and small files, and adopts different replication protocols for…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-11-11 Haifeng Liu , Wei Ding , Yuan Chen , Weilong Guo , Shuoran Liu , Tianpeng Li , Mofei Zhang , Jianxing Zhao , Hongyin Zhu , Zhengyi Zhu

SplitBrain: Hybrid Data and Model Parallel Deep Learning

The recent success of deep learning applications has coincided with those widely available powerful computational resources for training sophisticated machine learning models with huge datasets. Nonetheless, training large models such as…

Machine Learning · Computer Science 2022-01-03 Farley Lai , Asim Kadav , Erik Kruus

SimplePIM: A Software Framework for Productive and Efficient Processing-in-Memory

Data movement between memory and processors is a major bottleneck in modern computing systems. The processing-in-memory (PIM) paradigm aims to alleviate this bottleneck by performing computation inside memory chips. Real PIM hardware (e.g.,…

Hardware Architecture · Computer Science 2023-10-04 Jinfan Chen , Juan Gómez-Luna , Izzat El Hajj , Yuxin Guo , Onur Mutlu

ByteFS: System Support for (CXL-based) Memory-Semantic Solid-State Drives

Unlike non-volatile memory that resides on the processor memory bus, memory-semantic solid-state drives (SSDs) support both byte and block access granularity via PCIe or CXL interconnects. They provide scalable memory capacity using NAND…

Operating Systems · Computer Science 2025-01-10 Shaobo Li , Yirui Eric Zhou , Hao Ren , Jian Huang

Revamping Storage Class Memory With Hardware Automated Memory-Over-Storage Solution

Large persistent memories such as NVDIMM have been perceived as a disruptive memory technology, because they can maintain the state of a system even after a power failure and allow the system to recover quickly. However, overheads incurred…

Hardware Architecture · Computer Science 2021-06-29 Jie Zhang , Miryeong Kwon , Donghyun Gouk , Sungjoon Koh , Nam Sung Kim , Mahmut Taylan Kandemir , Myoungsoo Jung

FedSplitX: Federated Split Learning for Computationally-Constrained Heterogeneous Clients

Foundation models (FMs) have demonstrated remarkable performance in machine learning but demand extensive training data and computational resources. Federated learning (FL) addresses the challenges posed by FMs, especially related to data…

Machine Learning · Computer Science 2023-10-24 Jiyun Shin , Jinhyun Ahn , Honggu Kang , Joonhyuk Kang

Assise: Performance and Availability via NVM Colocation in a Distributed File System

The adoption of very low latency persistent memory modules (PMMs) upends the long-established model of disaggregated file system access. Instead, by colocating computation and PMM storage, we can provide applications much higher I/O…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-06-03 Thomas E. Anderson , Marco Canini , Jongyul Kim , Dejan Kostić , Youngjin Kwon , Simon Peter , Waleed Reda , Henry N. Schuh , Emmett Witchel

MOD: Minimally Ordered Durable Datastructures for Persistent Memory

Persistent Memory (PM) makes possible recoverable applications that can preserve application progress across system reboots and power failures. Actual recoverability requires careful ordering of cacheline flushes, currently done in two…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-09-02 Swapnil Haria , Mark D. Hill , Michael M. Swift

SplitSim: Large-Scale Simulations for Evaluating Network Systems Research

When physical testbeds are out of reach for evaluating a networked system, we frequently turn to simulation. In today's datacenter networks, bottlenecks are rarely at the network protocol level, but instead in end-host software or hardware…

Networking and Internet Architecture · Computer Science 2024-02-09 Hejing Li , Praneeth Balasubramanian , Marvin Meiers , Jialin Li , Antoine Kaufmann

OpenFPM: A scalable open framework for particle and particle-mesh codes on parallel computers

Scalable and efficient numerical simulations continue to gain importance, as computation is firmly established as the third pillar of discovery, alongside theory and experiment. Meanwhile, the performance of computing hardware grows through…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-06-26 Pietro Incardona , Antonio Leo , Yaroslav Zaluzhnyi , Rajesh Ramaswamy , Ivo F. Sbalzarini

PIM-STM: Software Transactional Memory for Processing-In-Memory Systems

Processing-In-Memory (PIM) is a novel approach that augments existing DRAM memory chips with lightweight logic. By allowing to offload computations to the PIM system, this architecture allows for circumventing the data-bottleneck problem…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-01-18 André Lopes , Daniel Castro , Paolo Romano

Optimizing Split Points for Error-Resilient SplitFed Learning

Recent advancements in decentralized learning, such as Federated Learning (FL), Split Learning (SL), and Split Federated Learning (SplitFed), have expanded the potentials of machine learning. SplitFed aims to minimize the computational…

Artificial Intelligence · Computer Science 2024-05-31 Chamani Shiranthika , Parvaneh Saeedi , Ivan V. Bajić

Basic Performance Measurements of the Intel Optane DC Persistent Memory Module

Scalable nonvolatile memory DIMMs will finally be commercially available with the release of the Intel Optane DC Persistent Memory Module (or just "Optane DC PMM"). This new nonvolatile DIMM supports byte-granularity accesses with access…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-08-13 Joseph Izraelevitz , Jian Yang , Lu Zhang , Juno Kim , Xiao Liu , Amirsaman Memaripour , Yun Joon Soh , Zixuan Wang , Yi Xu , Subramanya R. Dulloor , Jishen Zhao , Steven Swanson

SplitNN-driven Vertical Partitioning

In this work, we introduce SplitNN-driven Vertical Partitioning, a configuration of a distributed deep learning method called SplitNN to facilitate learning from vertically distributed features. SplitNN does not share raw data or model…

Machine Learning · Computer Science 2020-08-11 Iker Ceballos , Vivek Sharma , Eduardo Mugica , Abhishek Singh , Alberto Roman , Praneeth Vepakomma , Ramesh Raskar