Related papers: Multivariate Pointwise Information-Driven Data Sam…

200 papers

With the increasing computational power of current supercomputers, the size of data produced by scientific simulations is rapidly growing. To reduce the storage footprint and facilitate scalable post-hoc analyses of such scientific data…

Machine Learning · Computer Science 2021-04-14 Subhashis Hazarika, Ayan Biswas, Phillip J. Wolfram, Earl Lawrence, Nathan Urban

Multivariate spatial data plays an important role in computational science and engineering simulations. The potential features and hidden relationships in multivariate data can assist scientists to gain an in-depth understanding of a…

Human-Computer Interaction · Computer Science 2019-08-30 Xiangyang He, Yubo Tao, Qirui Wang, Hai Lin

Subsampling is one of the popular methods to balance statistical efficiency and computational efficiency in the big data era. Most approaches aim at selecting informative or representative sample points to achieve good overall information…

Methodology · Statistics 2024-07-10 Haolin Chen, Holger Dette, Jun Yu

In the era of burgeoning data generation, managing and storing large-scale time-varying datasets poses significant challenges. With the rise of supercomputing capabilities, the volume of data produced has soared, intensifying storage and…

Computer Vision and Pattern Recognition · Computer Science 2023-10-04 Humayra Tasnim, Soumya Dutta, Melanie Moses

Prompted by modern technologies in data acquisition, the statistical analysis of spatially distributed function-valued quantities has attracted a lot of attention in recent years. In particular, combinations of functional variables and…

Methodology · Statistics 2023-07-12 Matthias Eckardt, Carles Comas, Jorge Mateu

Statistical topic models efficiently facilitate the exploration of large-scale data sets. Many models have been developed and broadly used to summarize the semantic structure in news, science, social media, and digital humanities. However,…

Machine Learning · Computer Science 2016-12-02 Jian Tang, Cheng Li, Ming Zhang, Qiaozhu Mei

Interactive visualizations are crucial in ad hoc data exploration and analysis. However, with the growing number of massive datasets, generating visualizations in interactive timescales is increasingly challenging. One approach for…

Databases · Computer Science 2017-01-25 Yongjoo Park, Michael Cafarella, Barzan Mozafari

In the present work we have selected a collection of statistical and mathematical tools useful for the exploration of multivariate data and we present them in a form that is meant to be particularly accessible to a classically trained…

Statistics Theory · Mathematics 2010-09-01 Magnus Fontes

The extensive adoption of Deep Neural Networks has led to their increased utilization in challenging scientific visualization tasks. Recent advancements in building compressed data models using implicit neural representations have shown…

Machine Learning · Computer Science 2025-10-20 Abhay Kumar Dwivedi, Shanu Saklani, Soumya Dutta

The influx of massive amounts of data from current and upcoming cosmological surveys necessitates compression schemes that can efficiently summarize the data with minimal loss of information. We introduce a method that leverages the…

Cosmology and Nongalactic Astrophysics · Physics 2023-12-18 Aizhan Akhmetzhanova, Siddharth Mishra-Sharma, Cora Dvorkin

While advances in computing resources have made processing enormous amounts of data possible, human ability to identify patterns in such data has not scaled accordingly. Efficient computational methods for condensing and simplifying data…

Information Retrieval · Computer Science 2020-04-03 Yike Liu, Tara Safavi, Abhilash Dighe, Danai Koutra

The basic objective of data visualization is to provide an efficient graphical display for summarizing and reasoning about quantitative information. During the last decades, political science has accumulated a large corpus of various kinds…

Graphics · Computer Science 2010-08-09 Andrei Zinovyev

Class imbalance and distributional differences in large datasets present significant challenges for classification tasks in machine learning, often leading to biased models and poor predictive performance for minority classes. This work…

Machine Learning · Statistics 2024-12-20 Alex Mak, Shubham Sahoo, Shivani Pandey, Yidan Yue, Linglong Kong

Modern scientific simulations, observations, and large-scale experiments generate data at volumes that often exceed the limits of storage, processing, and analysis. This challenge drives the development of data reduction methods that…

Machine Learning · Computer Science 2025-11-18 Minh Vu, Andrey Lokhov

The continuous and rapid growth of highly interconnected datasets, which are both voluminous and complex, calls for the development of adequate processing and analytical techniques. One method for condensing and simplifying such datasets is…

Databases · Computer Science 2020-05-13 Angela Bonifati, Stefania Dumbrava, Haridimos Kondylakis

The overview-driven visual analysis of large-scale dynamic graphs poses a major challenge. We propose Multiscale Snapshots, a visual analytics approach to analyze temporal summaries of dynamic graphs at multiple temporal scales. First, we…

Human-Computer Interaction · Computer Science 2020-09-17 Eren Cakmak, Udo Schlegel, Dominik Jäckle, Daniel Keim, Tobias Schreck

Complex systems are fascinating because their rich macroscopic properties emerge from the interaction of many simple parts. Understanding the building principles of these emergent phenomena in nature requires assessing natural complex…

Neurons and Cognition · Quantitative Biology 2022-11-17 Anna Levina, Viola Priesemann, Johannes Zierenberg

The ubiquitous availability of computing devices and the widespread use of the internet have generated a large amount of data continuously. Therefore, the amount of available information on any given topic is far beyond humans' processing…

Artificial Intelligence · Computer Science 2023-07-11 Samira Ghodratnama

Data-driven approaches to sequence-to-sequence modelling have been successfully applied to short text summarization of news articles. Such models are typically trained on input-summary pairs consisting of only a single or a few sentences,…

Computation and Language · Computer Science 2018-04-25 Nikola I. Nikolov, Michael Pfeiffer, Richard H. R. Hahnloser

Visualizations support rapid analysis of scientific datasets, allowing viewers to glean aggregate information (e.g., the mean) within split-seconds. While prior research has explored this ability in conventional charts, it is unclear if…

Human-Computer Interaction · Computer Science 2024-06-21 Victor A. Mateevitsi, Michael E. Papka, Khairi Reda