Related papers: Milliscale: Fast Commit on Low-Latency Object Stor…

Blitzcrank: Fast Semantic Compression for In-memory Online Transaction Processing

We present BLITZCRANK, a high-speed semantic compressor designed for OLTP databases. Previous solutions are inadequate for compressing row-stores: they suffer from either low compression factor due to a coarse compression granularity or…

Databases · Computer Science 2024-07-01 Yiming Qiao , Yihan Gao , Huanchen Zhang

RefineDetLite: A Lightweight One-stage Object Detection Framework for CPU-only Devices

Previous state-of-the-art real-time object detectors have been reported on GPUs which are extremely expensive for processing massive data and in resource-restricted scenarios. Therefore, high efficiency object detectors on CPU-only devices…

Computer Vision and Pattern Recognition · Computer Science 2020-09-10 Chen Chen , Mengyuan Liu , Xiandong Meng , Wanpeng Xiao , Qi Ju

TokenScale: Timely and Accurate Autoscaling for Disaggregated LLM Serving with Token Velocity

The architectural shift to prefill/decode (PD) disaggregation in LLM serving improves resource utilization but struggles with the bursty nature of modern workloads. Existing autoscaling policies, often retrofitted from monolithic systems…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-04 Ruiqi Lai , Hongrui Liu , Chengzhi Lu , Zonghao Liu , Siyu Cao , Siyang Shao , Yixin Zhang , Luo Mai , Dmitrii Ustiugov

L-Store: A Real-time OLTP and OLAP System

Arguably data is the new natural resource in the enterprise world with an unprecedented degree of proliferation. But to derive real-time actionable insights from the data, it is important to bridge the gap between managing the data that is…

Databases · Computer Science 2017-02-28 Mohammad Sadoghi , Souvik Bhattacherjee , Bishwaranjan Bhattacharjee , Mustafa Canim

OptCon: An Adaptable SLA-Aware Consistency Tuning Framework for Quorum-based Stores

Users of distributed datastores that employ quorum-based replication are burdened with the choice of a suitable client-centric consistency setting for each storage operation. The above matching choice is difficult to reason about as it…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-17 Subhajit Sidhanta , Wojciech Golab , Supratik Mukhopadhyay , Saikat Basu

Asymmetry-aware Scalable Locking

The pursuit of power-efficiency is popularizing asymmetric multicore processors (AMP) such as ARM big.LITTLE, Apple M1 and recent Intel Alder Lake with big and little cores. However, we find that existing scalable locks fail to scale on AMP…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-12-30 Nian Liu , Jinyu Gu , Dahai Tang , Kenli Li , Binyu Zang , Haibo Chen

OmniSparse: Training-Aware Fine-Grained Sparse Attention for Long-Video MLLMs

Existing sparse attention methods primarily target inference-time acceleration by selecting critical tokens under predefined sparsity patterns. However, they often fail to bridge the training-inference gap and lack the capacity for…

Computer Vision and Pattern Recognition · Computer Science 2025-11-20 Feng Chen , Yefei He , Shaoxuan He , Yuanyu He , Jing Liu , Lequan Lin , Akide Liu , Zhaoyang Li , Jiyuan Zhang , Zhenbang Sun , Bohan Zhuang , Qi Wu

PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity

Multimodal large language models (MLLMs) have demonstrated strong general-purpose capabilities in open-world visual comprehension. However, most existing MLLMs primarily focus on holistic, scene-level understanding, often overlooking the…

Computer Vision and Pattern Recognition · Computer Science 2025-11-04 Yuqian Yuan , Wenqiao Zhang , Xin Li , Shihao Wang , Kehan Li , Wentong Li , Jun Xiao , Lei Zhang , Beng Chin Ooi

OPTIMUM-DERAM: Highly Consistent, Scalable, and Secure Multi-Object Memory using RLNC

This paper introduces OPTIMUM-DERAM, a highly consistent, scalable, secure, and decentralized shared memory solution. Traditional distributed shared memory implementations offer multi-object support by multi-threading a single object memory…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-01-21 Nicolas Nicolaou , Kishori M. Konwar , Moritz Grundei , Aleksandr Bezobchuk , Muriel Médard , Sriram Vishwanath

Low-Latency and Low-Complexity MLSE for Short-Reach Optical Interconnects

To meet the high-speed, low-latency, and low-complexity demand for optical interconnects, simplified maximum likelihood sequence estimation (MLSE) is proposed in this paper. Simplified MLSE combines computational simplification and reduced…

Information Theory · Computer Science 2026-01-28 Mengqi Guo , Ji Zhou , Haide Wang , Changyuan Yu , Xiangjun Xin , Liangchuan Li

Scale-aware Pixel-wise Object Proposal Networks

Object proposal is essential for current state-of-the-art object detection pipelines. However, the existing proposal methods generally fail in producing results with satisfying localization accuracy. The case is even worse for small objects…

Computer Vision and Pattern Recognition · Computer Science 2016-07-26 Zequn Jie , Xiaodan Liang , Jiashi Feng , Wen Feng Lu , Eng Hock Francis Tay , Shuicheng Yan

A Study on Performance and Power Efficiency of Dense Non-Volatile Caches in Multi-Core Systems

In this paper, we present a novel cache design based on Multi-Level Cell Spin-Transfer Torque RAM (MLC STTRAM) that can dynamically adapt the set capacity and associativity to use efficiently the full potential of MLC STTRAM. We exploit the…

Hardware Architecture · Computer Science 2017-06-13 Amin Jadidi , Mohammad Arjomand , Mahmut T. Kandemir , Chita R. Das

Elevating commodity storage with the SALSA host translation layer

To satisfy increasing storage demands in both capacity and performance, industry has turned to multiple storage technologies, including Flash SSDs and SMR disks. These devices employ a translation layer that conceals the idiosyncrasies of…

Operating Systems · Computer Science 2019-01-11 Nikolas Ioannou , Kornilios Kourtis , Ioannis Koltsidas

MSACT: Multistage Spatial Alignment for Stable Low-Latency Fine Manipulation

Real-world fine manipulation, particularly in bimanual manipulation, typically requires low-latency control and stable visual localization, while collecting large-scale data is costly and limited demonstrations may lead to localization…

Robotics · Computer Science 2026-05-04 Xianbo Cai , Hideyuki Ichiwara , Masaki Yoshikawa , Tetsuya Ogata

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

The computational challenges of Large Language Model (LLM) inference remain a significant barrier to their widespread deployment, especially as prompt lengths continue to increase. Due to the quadratic complexity of the attention…

Computation and Language · Computer Science 2024-10-31 Huiqiang Jiang , Yucheng Li , Chengruidong Zhang , Qianhui Wu , Xufang Luo , Surin Ahn , Zhenhua Han , Amir H. Abdi , Dongsheng Li , Chin-Yew Lin , Yuqing Yang , Lili Qiu

Optimizing Video Object Detection via a Scale-Time Lattice

High-performance object detection relies on expensive convolutional networks to compute features, often leading to significant challenges in applications, e.g. those that require detecting objects from video streams in real time. The key to…

Computer Vision and Pattern Recognition · Computer Science 2018-04-17 Kai Chen , Jiaqi Wang , Shuo Yang , Xingcheng Zhang , Yuanjun Xiong , Chen Change Loy , Dahua Lin

Transactional Partitioning: A New Abstraction for Main-Memory Databases

The growth in variety and volume of OLTP (Online Transaction Processing) applications poses a challenge to OLTP systems to meet performance and cost demands in the existing hardware landscape. These applications are highly interactive…

Databases · Computer Science 2017-01-17 Vivek Shah

MileStone: A Multi-Objective Compiler Phase Ordering Framework for Graph-based IR-Level Optimization

Compiler phase ordering has a strong effect on program performance. Finding an effective sequence of passes is still a difficult task because the search space is large and execution time, code size and energy consumption often conflict.…

Programming Languages · Computer Science 2026-05-25 Amirhosein Sadr , Mehran Alidoost Nia

OnlineSplatter: Pose-Free Online 3D Reconstruction for Free-Moving Objects

Free-moving object reconstruction from monocular video remains challenging, particularly without reliable pose or depth cues and under arbitrary object motion. We introduce OnlineSplatter, a novel online feed-forward framework generating…

Computer Vision and Pattern Recognition · Computer Science 2025-10-24 Mark He Huang , Lin Geng Foo , Christian Theobalt , Ying Sun , De Wen Soh

Efficient Proactive Caching for Supporting Seamless Mobility

We present a distributed proactive caching approach that exploits user mobility information to decide where to proactively cache data to support seamless mobility, while efficiently utilizing cache storage using a congestion pricing scheme.…

Networking and Internet Architecture · Computer Science 2014-04-21 Vasilios A. Siris , Xenofon Vasilakos , George C. Polyzos