English
Related papers

Related papers: An Online Sparse Streaming Feature Selection Algor…

200 papers

The processing of high-dimensional streaming data commonly utilizes online streaming feature selection (OSFS) techniques. However, practical implementations often face challenges with data incompleteness due to equipment failures and…

Machine Learning · Computer Science 2025-11-26 Ruiyang Xu

Traditional feature selections need to know the feature space before learning, and online streaming feature selection (OSFS) is proposed to process streaming features on the fly. Existing methods divide features into relevance or…

Machine Learning · Computer Science 2023-03-01 RuiYang Xu , Di Wu , Xin Luo

In real-world applications involving high-dimensional streaming data, online streaming feature selection (OSFS) is widely adopted. Yet, practical deployments frequently face data incompleteness due to sensor failures or technical…

Neural and Evolutionary Computing · Computer Science 2025-08-29 Ruiyang Xu

Current AI/ML methods for data-driven engineering use models that are mostly trained offline. Such models can be expensive to build in terms of communication and computing cost, and they rely on data that is collected over extended periods…

Machine Learning · Computer Science 2021-12-16 Xiaoxuan Wang , Rolf Stadler

Data-driven functions for operation and management often require measurements collected through monitoring for model training and prediction. The number of data sources can be very large, which requires a significant communication and…

Machine Learning · Computer Science 2020-10-29 Xiaoxuan Wang , Forough Shahab Samani , Rolf Stadler

Online Streaming Feature Selection (OSFS) is a sequential learning problem where individual features across all samples are made available to algorithms in a streaming fashion. In this work, firstly, we assert that OSFS's main assumption of…

Machine Learning · Computer Science 2020-03-17 Salimeh Yasaei Sekeh , Madan Ravi Ganesh , Shurjo Banerjee , Jason J. Corso , Alfred O. Hero

Streaming feature selection techniques have become essential in processing real-time data streams, as they facilitate the identification of the most relevant attributes from continuously updating information. Despite their performance,…

Machine Learning · Computer Science 2024-06-21 Zhangling Duan , Tianci Li , Xingyu Wu , Zhaolong Ling , Jingye Yang , Zhaohong Jia

As an emerging research direction, online streaming feature selection deals with sequentially added dimensions in a feature space while the number of data instances is fixed. Online streaming feature selection provides a new, complementary…

Machine Learning · Computer Science 2016-10-24 Kui Yu , Wei Ding , Xindong Wu

Online selection of dynamic features has attracted intensive interest in recent years. However, existing online feature selection methods evaluate features individually and ignore the underlying structure of feature stream. For instance, in…

Computer Vision and Pattern Recognition · Computer Science 2016-08-23 Jing Wang , Meng Wang , Peipei Li , Luoqi Liu , Zhongqiu Zhao , Xuegang Hu , Xindong Wu

Sparse regression has been a popular approach to perform variable selection and enhance the prediction accuracy and interpretability of the resulting statistical model. Existing approaches focus on offline regularized regression, while the…

Machine Learning · Statistics 2023-01-03 Shuoguang Yang , Yuhao Yan , Xiuneng Zhu , Qiang Sun

In the era of big data, it is becoming common to have data with multiple modalities or coming from multiple sources, known as "multi-view data". Multi-view data are usually unlabeled and come from high-dimensional spaces (such as language…

Machine Learning · Computer Science 2016-09-28 Weixiang Shao , Lifang He , Chun-Ta Lu , Xiaokai Wei , Philip S. Yu

Screening feature selection methods are often used as a preprocessing step for reducing the number of variables before training step. Traditional screening methods only focus on dealing with complete high dimensional datasets. Modern…

Machine Learning · Statistics 2021-04-08 Mingyuan Wang , Adrian Barbu

Online learning, where feature spaces can change over time, offers a flexible learning paradigm that has attracted considerable attention. However, it still faces three significant challenges. First, the heterogeneity of real-world data…

Machine Learning · Computer Science 2025-07-17 Shengda Zhuo , Di Wu , Yi He , Shuqiang Huang , Xindong Wu

Its crux lies in the optimization of a tradeoff between accuracy and fairness of resultant models on the selected feature subset. The technical challenge of our setting is twofold: 1) streaming feature inputs, such that an informative…

Machine Learning · Computer Science 2024-08-26 Leizhen Zhang , Lusi Li , Di Wu , Sheng Chen , Yi He

Streaming tensor factorization is a powerful tool for processing high-volume and multi-way temporal data in Internet networks, recommender systems and image/video data analysis. In many applications the full tensor is not known, but instead…

Signal Processing · Electrical Eng. & Systems 2018-09-06 Cole Hawkins , Zheng Zhang

We present a sparse analogue to stochastic gradient descent that is guaranteed to perform well under similar conditions to the lasso. In the linear regression setup with irrepresentable noise features, our algorithm recovers the support set…

Statistics Theory · Mathematics 2014-12-16 Jacob Steinhardt , Stefan Wager , Percy Liang

In this paper we propose a computationally efficient algorithm for on-line variable selection in multivariate regression problems involving high dimensional data streams. The algorithm recursively extracts all the latent factors of a…

Machine Learning · Statistics 2009-02-10 Brian McWilliams , Giovanni Montana

We propose an online debiased lasso (ODL) method for statistical inference in high-dimensional linear models with streaming data. The proposed ODL consists of an efficient computational algorithm for streaming data and approximately normal…

Statistics Theory · Mathematics 2021-08-20 Ruijian Han , Lan Luo , Yuanyuan Lin , Jian Huang

This paper addresses the prevalent issue of label shift in an online setting with missing labels, where data distributions change over time and obtaining timely labels is challenging. While existing methods primarily focus on adjusting or…

Machine Learning · Computer Science 2024-11-01 Ruihan Wu , Siddhartha Datta , Yi Su , Dheeraj Baby , Yu-Xiang Wang , Kilian Q. Weinberger

The amount of data in our society has been exploding in the era of big data today. In this paper, we address several open challenges of big data stream classification, including high volume, high velocity, high dimensionality, high…

Machine Learning · Computer Science 2015-07-28 Dayong Wang , Pengcheng Wu , Peilin Zhao , Steven C. H. Hoi
‹ Prev 1 2 3 10 Next ›