Related papers: Efficient Representative Subset Selection over Sli…

200 papers

Streaming computation plays an important role in large-scale data analysis. The sliding window model is a model of streaming computation which also captures the recency of the data. In this model, data arrives one item at a time, but only…

Data Structures and Algorithms · Computer Science 2021-11-01 Alessandro Epasto, Mohammad Mahdian, Vahab Mirrokni, Peilin Zhong

The sliding window model of computation captures scenarios in which data are continually arriving in the form of a stream, and only the most recent $w$ items are used for analysis. In this setting, an algorithm needs to accurately track…

Cryptography and Security · Computer Science 2024-06-13 Yiping Wang, Yanhao Wang, Cen Chen

Self-speculative decoding (SSD) accelerates LLM inference by skipping layers to create an efficient draft model, yet existing methods often rely on static heuristics that ignore the dynamic computational overhead of attention in…

Machine Learning · Computer Science 2026-02-25 Seongjin Cha, Gyuwan Kim, Dongsu Han, Tao Yang, Insu Han

Selecting a small set of representatives from a large database is important in many applications such as multi-criteria decision making, web search, and recommendation. The $k$-regret minimizing set ($k$-RMS) problem was recently proposed…

Databases · Computer Science 2021-06-30 Yanhao Wang, Yuchen Li, Raymond Chi-Wing Wong, Kian-Lee Tan

Semi-Supervised Learning (SSL) has become a preferred paradigm in many deep learning tasks, which reduces the need for human labor. Previous studies primarily focus on effectively utilising the labelled and unlabeled data to improve…

Machine Learning · Computer Science 2024-10-29 Qian Shao, Jiangrui Kang, Qiyuan Chen, Zepeng Li, Hongxia Xu, Yiwen Cao, Jiajuan Liang, Jian Wu

In this work, we first propose a parallel batch switching algorithm called Small-Batch Queue-Proportional Sampling (SB-QPS). Compared to other batch switching algorithms, SB-QPS significantly reduces the batch size without sacrificing the…

Networking and Internet Architecture · Computer Science 2020-10-20 Jingfan Meng, Long Gong, Jun Xu

3D models surpass 2D models in CT/MRI segmentation by effectively capturing inter-slice relationships. However, the added depth dimension substantially increases memory consumption. While patch-based training alleviates memory constraints,…

Image and Video Processing · Electrical Eng. & Systems 2025-06-30 Young Seok Jeon, Hongfei Yang, Huazhu Fu, Mengling Feng

The sliding window model of computation captures scenarios in which data is arriving continuously, but only the latest $w$ elements should be used for analysis. The goal is to design algorithms that update the solution efficiently with each…

Data Structures and Algorithms · Computer Science 2020-10-26 Michele Borassi, Alessandro Epasto, Silvio Lattanzi, Sergei Vassilvitskii, Morteza Zadimoghaddam

Keyword Spotting (KWS) systems with small footprint models deployed on edge devices face significant accuracy and robustness challenges due to domain shifts caused by varying noise and recording conditions. To address this, we propose a…

Sound · Computer Science 2026-01-23 Prakash Dhungana, Sayed Ahmad Salehi

Multi-dimensional data streams, prevalent in applications like IoT, financial markets, and real-time analytics, pose significant challenges due to their high velocity, unbounded nature, and complex inter-dimensional dependencies. Sliding…

Machine Learning · Computer Science 2025-07-10 Abolfazl Zarghani, Sadegh Abedi

In this paper we study the extraction of representative elements in the data stream model in the form of submodular maximization. Different from the previous work on streaming submodular maximization, we are interested only in the recent…

Data Structures and Algorithms · Computer Science 2016-11-02 Jiecao Chen, Huy L. Nguyen, Qin Zhang

Data subset selection aims to find a smaller yet informative subset of a large dataset that can approximate the full-dataset training, addressing challenges associated with training neural networks on large-scale datasets. However, existing…

Machine Learning · Computer Science 2024-06-06 Hoyong Choi, Nohyun Ki, Hye Won Chung

Clustering is an important technique for identifying structural information in large-scale data analysis, where the underlying dataset may be too large to store. In many applications, recent data can provide more accurate information and…

Data Structures and Algorithms · Computer Science 2023-11-02 David P. Woodruff, Peilin Zhong, Samson Zhou

In this work, we consider the Submodular Maximization under Knapsack (SMK) constraint problem over the ground set of size $n$. The problem recently attracted a lot of attention due to its applications in various domains of combination…

Data Structures and Algorithms · Computer Science 2024-05-22 Canh V. Pham

Video super-resolution (VSR) is the task of restoring high-resolution frames from a sequence of low-resolution inputs. Different from single image super-resolution, VSR can utilize frames' temporal information to reconstruct results with…

Image and Video Processing · Electrical Eng. & Systems 2022-08-25 Wenyi Lian, Wenjing Lian

The problem of selecting a small-size representative summary of a large dataset is a cornerstone of machine learning, optimization and data science. Motivated by applications to recommendation systems and other scenarios with query-limited…

Data Structures and Algorithms · Computer Science 2019-10-15 Dmitrii Avdiukhin, Grigory Yaroslavtsev, Samson Zhou

Machine learning systems have been extensively used as auxiliary tools in domains that require critical decision-making, such as healthcare and criminal justice. The explainability of decisions is crucial for users to develop trust on these…

Artificial Intelligence · Computer Science 2023-02-10 Chen Peng, Zhengqi Dai, Guangping Xia, Yajie Niu, Yihui Lei

Trajectory mining has attracted significant attention. This paper addresses the Top-k Representative Similar Subtrajectory Query (TRSSQ) problem, which aims to find the k most representative subtrajectories similar to a query. Existing…

Databases · Computer Science 2025-07-09 Mingchang Ge, Liping Wang, Xuemin Lin, Yuang Zhang, Kunming Wang

The Max-k-Cut problem is a fundamental combinatorial optimization challenge that generalizes the classic NP-complete Max-Cut problem. While relaxation techniques are commonly employed to tackle Max-k-Cut, they often lack guarantees of…

Optimization and Control · Mathematics 2025-06-12 Yeqing Qiu, Ye Xue, Akang Wang, Yiheng Wang, Qingjiang Shi, Zhi-Quan Luo

Massive volumes of data continuously generated on social platforms have become an important information source for users. A primary method to obtain fresh and valuable information from social streams is social search. Although there…

Social and Information Networks · Computer Science 2019-03-25 Yanhao Wang, Yuchen Li, Kian-Lee Tan