Related papers: tsdownsample: high-performance time series downsam…

Data Point Selection for Line Chart Visualization: Methodological Assessment and Evidence-Based Guidelines

Time series visualization plays a crucial role in identifying patterns and extracting insights across various domains. However, as datasets continue to grow in size, visualizing them effectively becomes challenging. Downsampling, which…

Human-Computer Interaction · Computer Science 2023-04-04 Jonas Van Der Donckt, Jeroen Van Der Donckt, Michael Rademaker, Sofie Van Hoecke

Plotly-Resampler: Effective Visual Analytics for Large Time Series

Visual analytics is arguably the most important step in getting acquainted with your data. This is especially the case for time series, as this data type is hard to describe and cannot be fully understood when using for example summary…

Human-Computer Interaction · Computer Science 2022-07-19 Jonas Van Der Donckt, Jeroen Van Der Donckt, Emiel Deprost, Sofie Van Hoecke

MinMaxLTTB: Leveraging MinMax-Preselection to Scale LTTB

Visualization plays an important role in analyzing and exploring time series data. To facilitate efficient visualization of large datasets, downsampling has emerged as a well-established approach. This work concentrates on LTTB…

Human-Computer Interaction · Computer Science 2023-05-02 Jeroen Van Der Donckt, Jonas Van Der Donckt, Michael Rademaker, Sofie Van Hoecke

Diversity Subsampling: Custom Subsamples from Large Data Sets

Subsampling from a large data set is useful in many supervised learning contexts to provide a global view of the data based on only a fraction of the observations. Diverse (or space-filling) subsampling is an appealing subsampling approach…

Methodology · Statistics 2023-11-27 Boyang Shang, Daniel W. Apley, Sanjay Mehrotra

Learning to Upsample by Learning to Sample

We present DySample, an ultra-lightweight and effective dynamic upsampler. While impressive performance gains have been witnessed from recent kernel-based dynamic upsamplers such as CARAFE, FADE, and SAPA, they introduce much workload,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-30 Wenze Liu, Hao Lu, Hongtao Fu, Zhiguo Cao

STULL: Unbiased Online Sampling for Visual Exploration of Large Spatiotemporal Data

Online sampling-supported visual analytics is increasingly important, as it allows users to explore large datasets with acceptable approximate answers at interactive rates. However, existing online spatiotemporal sampling techniques are…

Databases · Computer Science 2020-09-01 Guizhen Wang, Jingjing Guo, Mingjie Tang, José Florencio de Queiroz Neto, Calvin Yau, Anas Daghistani, Morteza Karimzadeh, Walid G. Aref, David S. Ebert

Downstream: efficient cross-platform algorithms for fixed-capacity stream downsampling

Due to ongoing accrual over long durations, a defining characteristic of real-world data streams is the requirement for rolling, often real-time, mechanisms to coarsen or summarize stream history. One common data structure for this purpose…

Data Structures and Algorithms · Computer Science 2025-06-17 Connor Yang, Joey Wagner, Emily Dolson, Luis Zaman, Matthew Andres Moreno

GuidedSampling: Steering LLMs Towards Diverse Candidate Solutions at Inference-Time

Repeated Sampling (RS) is a simple inference-time algorithm that has been shown to improve model performance on complex tasks. Although it is an effective way of scaling inference time, it often struggles to generate diverse solution…

Artificial Intelligence · Computer Science 2026-02-17 Divij Handa, Mihir Parmar, Aswin RRV, Md Nayem Uddin, Hamid Palangi, Chitta Baral

Structured Downsampling for Fast, Memory-efficient Curation of Online Data Streams

Operations over data streams typically hinge on efficient mechanisms to aggregate or summarize history on a rolling basis. For high-volume data streams, it is critical to manage state in a manner that is fast and memory efficient --…

Data Structures and Algorithms · Computer Science 2024-09-24 Matthew Andres Moreno, Luis Zaman, Emily Dolson

dynsight: an Open Python Platform for Simulation and Experimental Trajectory Data Analysis

The study of complex many-body systems via analysis of the trajectories of the units that dynamically move and interact within them is a non-trivial task. The workflow for extracting meaningful information from the raw trajectory data is…

Materials Science · Physics 2025-10-31 Simone Martino, Matteo Becchi, Andrew Tarzia, Daniele Rapetti, Giovanni M. Pavan

TSPulse: Tiny Pre-Trained Models with Disentangled Representations for Rapid Time-Series Analysis

Time-series tasks often benefit from signals expressed across multiple representation spaces (e.g., time vs. frequency) and at varying abstraction levels (e.g., local patterns vs. global semantics). However, existing pre-trained time-series…

Machine Learning · Computer Science 2026-03-05 Vijay Ekambaram, Subodh Kumar, Arindam Jati, Sumanta Mukherjee, Tomoya Sakai, Pankaj Dayama, Wesley M. Gifford, Jayant Kalagnanam

Continuous Simulation Data Stream: A dynamical timescale-dependent output scheme for simulations

Exa-scale simulations are on the horizon but almost no new design for the output has been proposed in recent years. In simulations using individual time steps, the traditional snapshots are over-resolving particles/cells with large time…

Instrumentation and Methods for Astrophysics · Physics 2022-10-25 Loic Hausammann, Pedro Gonnet, Matthieu Schaller

Enhancing Algorithm Performance Understanding through tsMorph: Generating Semi-Synthetic Time Series for Robust Forecasting Evaluation

Time series forecasting is a subject of significant scientific and industrial importance. Despite the widespread utilization of forecasting methods, there is a dearth of research aimed at comprehending the conditions under which these…

Machine Learning · Computer Science 2024-10-23 Moisés Santos, André de Carvalho, Carlos Soares

GPGPU Linear Complexity t-SNE Optimization

The t-distributed Stochastic Neighbor Embedding (tSNE) algorithm has become in recent years one of the most used and insightful techniques for the exploratory data analysis of high-dimensional data. tSNE reveals clusters of high-dimensional…

Machine Learning · Computer Science 2019-08-12 Nicola Pezzotti, Julian Thijssen, Alexander Mordvintsev, Thomas Hollt, Baldur van Lew, Boudewijn P. F. Lelieveldt, Elmar Eisemann, Anna Vilanova

Visualization-Aware Sampling for Very Large Databases

Interactive visualizations are crucial in ad hoc data exploration and analysis. However, with the growing number of massive datasets, generating visualizations in interactive timescales is increasingly challenging. One approach for…

Databases · Computer Science 2017-01-25 Yongjoo Park, Michael Cafarella, Barzan Mozafari

SIMPLE: Disaggregating Sampling from GPU Inference into a Decision Plane for Faster Distributed LLM Serving

As large language models (LLMs) scale out with tensor parallelism (TP) and pipeline parallelism (PP) and production stacks have aggressively optimized the data plane (attention/GEMM and KV cache), sampling, the decision plane that turns…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-02 Bohan Zhao, Zane Cao, Yongchao He

Optimal Dataset Size for Recommender Systems: Evaluating Algorithms' Performance via Downsampling

This thesis investigates dataset downsampling as a strategy to optimize energy efficiency in recommender systems while maintaining competitive performance. With increasing dataset sizes posing computational and environmental challenges,…

Information Retrieval · Computer Science 2025-02-17 Ardalan Arabzadeh

Modular Deep Learning for Multivariate Time-Series: Decoupling Imputation and Downstream Tasks

Missing values are pervasive in large-scale time-series data, posing challenges for reliable analysis and decision-making. Many neural architectures have been designed to model and impute the complex and heterogeneous missingness patterns…

Machine Learning · Computer Science 2026-02-26 Joseph Arul Raj, Linglong Qian, Zina Ibrahim

TiVy: Time Series Visual Summary for Scalable Visualization

Visualizing multiple time series presents fundamental tradeoffs between scalability and visual clarity. Time series capture the behavior of many large-scale real-world processes, from stock market trends to urban activities. Users often…

Graphics · Computer Science 2025-07-30 Gromit Yeuk-Yin Chan, Luis Gustavo Nonato, Themis Palpanas, Cláudio T. Silva, Juliana Freire

Learning to Downsample for Segmentation of Ultra-High Resolution Images

Many computer vision systems require low-cost segmentation algorithms based on deep learning, either because of the enormous size of input images or limited computational budget. Common solutions uniformly downsample the input images to…

Computer Vision and Pattern Recognition · Computer Science 2022-08-19 Chen Jin, Ryutaro Tanno, Thomy Mertzanidou, Eleftheria Panagiotaki, Daniel C. Alexander