Related papers: Dynamic statistical inference in massive datastrea…

200 papers

Modern longitudinal data, for example from wearable devices, measures biological signals on a fixed set of participants at a diverging number of time points. Traditional statistical methods are not equipped to handle the computational…

Methodology · Statistics 2023-03-23 Lan Luo, Jingshen Wang, Emily C. Hector

Click-through rate (CTR) prediction is an essential task in industrial applications such as video recommendation. Recently, deep learning models have been proposed to learn the representation of users' overall interests, while ignoring the…

Machine Learning · Computer Science 2020-01-10 Shu-Ting Shi, Wenhao Zheng, Jun Tang, Qing-Guo Chen, Yao Hu, Jianke Zhu, Ming Li

Given a stream of entries in a multi-aspect data setting, i.e., entries having multiple dimensions, how can we detect anomalous activities in an unsupervised manner? For example, in the intrusion detection setting, existing work seeks to…

Machine Learning · Computer Science 2021-06-09 Siddharth Bhatia, Arjit Jain, Pan Li, Ritesh Kumar, Bryan Hooi

In many real-world scenarios, distribution shifts exist in the streaming data across time steps. Many complex sequential data can be effectively divided into distinct regimes that exhibit persistent dynamics. Discovering the shifted…

Machine Learning · Computer Science 2023-09-07 Weijieying Ren, Tianxiang Zhao, Wei Qin, Kunpeng Liu

Data collection at a massive scale is becoming ubiquitous in a wide variety of settings, from vast offline databases to streaming real-time information. Learning algorithms deployed in such contexts must rely on single-pass inference, where…

Methodology · Statistics 2012-01-27 Christoforos Anagnostopoulos, Robert B. Gramacy

Given real-time sensor data streams obtained from machines, how can we continuously predict when a machine failure will occur? This work aims to continuously forecast the timing of future events by analyzing multi-sensor data streams. A key…

Machine Learning · Computer Science 2026-01-16 Kota Nakamura, Koki Kawabata, Yasuko Matsubara, Yasushi Sakurai

As a typical Cyber-Physical System (CPS), smart water distribution networks require monitoring of underground water pipes with high sample rates for precise data analysis and water network control. Due to poor underground wireless channel…

Social and Information Networks · Computer Science 2017-03-30 Sokratis Kartakis, Shusen Yang, Julie A. McCann

Anomaly detection is critical for finding suspicious behavior in innumerable systems. We need to detect anomalies in real-time, i.e. determine if an incoming entity is anomalous or not, as soon as we receive it, to minimize the effects of…

Machine Learning · Computer Science 2023-01-31 Siddharth Bhatia

We propose an online debiased lasso (ODL) method for statistical inference in high-dimensional linear models with streaming data. The proposed ODL consists of an efficient computational algorithm for streaming data and approximately normal…

Statistics Theory · Mathematics 2021-08-20 Ruijian Han, Lan Luo, Yuanyuan Lin, Jian Huang

A growing number of applications that generate massive streams of data need intelligent data processing and online analysis. Real-time surveillance systems, telecommunication systems, sensor networks and other dynamic environments are such…

Databases · Computer Science 2011-05-11 Mahnoosh Kholghi, Mohammadreza Keyvanpour

In a data stream management system (DSMS), users register continuous queries, and receive result updates as data arrive and expire. We focus on applications with real-time constraints, in which the user must receive each result update…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-04-24 Tom Z. J. Fu, Jianbing Ding, Richard T. B. Ma, Marianne Winslett, Yin Yang, Zhenjie Zhang

The complex dynamics of physical systems can often be modeled with stochastic differential equations. However, computational constraints inhibit the estimation of dynamics from large time-series datasets. I present a method for estimating…

Data Analysis, Statistics and Probability · Physics 2023-11-02 William Davis

While differentially private synthetic data generation has been explored extensively in the literature, how to update this data in the future if the underlying private data changes is much less understood. We propose an algorithmic…

Cryptography and Security · Computer Science 2024-09-04 Girish Kumar, Thomas Strohmer, Roman Vershynin

Data streams (streaming data) consist of transiently observed, evolving in time, multidimensional data sequences that challenge our computational and/or inferential capabilities. In this paper we propose user friendly approaches for robust…

Applications · Statistics 2015-01-20 Daniel Kosiorowski

High-dimensional streaming data are becoming increasingly ubiquitous in many fields. They often lie in multiple low-dimensional subspaces, and the manifold structures may change abruptly on the time scale due to pattern shift or occurrence…

Machine Learning · Statistics 2022-04-13 Ruiyu Xu, Jianguo Wu, Xiaowei Yue, Yongxiang Li

This paper proposes a novel dynamic Hierarchical Dirichlet Process topic model that considers the dependence between successive observations. Conventional posterior inference algorithms for this kind of model require processing of the…

Machine Learning · Statistics 2016-06-29 Olga Isupova, Danil Kuzin, Lyudmila Mihaylova

Many techniques, such as model compression, have been developed to make Deep Neural Network (DNN) inference more efficient. Nevertheless, DNNs still lack excellent run-time dynamic inference capability to enable users trade-off accuracy…

Computer Vision and Pattern Recognition · Computer Science 2020-09-15 Li Yang, Zhezhi He, Yu Cao, Deliang Fan

Day-to-day traffic dynamics are widely used to model flow evolution due to travelers' learning and adjustment behavior, yet empirical analysis of these models often relies on descriptive calibration with limited inferential content. This…

Optimization and Control · Mathematics 2026-05-05 Minghui Wu, Yafeng Yin, Jerome P. Lynch, Zhichen Liu

Uncertain data streams have been widely generated in many Web applications. The uncertainty in data streams makes anomaly detection from sensor data streams far more challenging. In this paper, we present a novel framework that supports…

Artificial Intelligence · Computer Science 2016-07-21 Jiangang Ma, Le Sun, Hua Wang, Yanchun Zhang, Uwe Aickelin

Dynamic statistical process monitoring methods have been widely studied and applied in modern industrial processes. These methods aim to extract the most predictable temporal information and develop the corresponding dynamic monitoring…

Methodology · Statistics 2022-11-10 Wei Fan, Qinqin Zhu, Shaojun Ren, Liang Zhang, Fengqi Si