Related papers: Hybrid Forest: A Concept Drift Aware Data Stream M…

200 papers

Dealing with memory and time constraints is a current challenge when learning from data streams with a massive amount of data. Many algorithms have been proposed to handle these difficulties, among them the Very Fast Decision Tree (VFDT)…

Learning from data streams is an increasingly important topic in data mining, machine learning, and artificial intelligence in general. A major focus in the data stream literature is on designing methods that can deal with concept drift, a…

Machine Learning · Computer Science 2018-10-05 Jesse Read

IoT Big Data requires new machine learning methods able to scale to large volumes of data arriving at high speed. Decision trees are popular machine learning models since they are very effective, yet easy to interpret and visualize. In the…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-07-29 Nicolas Kourtellis, Gianmarco De Francisci Morales, Albert Bifet, Arinto Murdopo

Data stream learning is a very relevant paradigm because of the increasing number of real-world scenarios generating data at high velocities and in unbounded sequences. Stream learning aims at developing models that can process instances as they…

Machine Learning · Computer Science 2024-10-29 Aurora Esteban, Alberto Cano, Amelia Zafra, Sebastián Ventura

Learning from data streams is among the most vital fields of contemporary data mining. The online analysis of information coming from those potentially unbounded data sources allows for designing reactive up-to-date models capable of…

Machine Learning · Computer Science 2020-10-16 Łukasz Korycki, Bartosz Krawczyk

Big Data streams are being generated faster, in larger volumes, and more commonly than before. In this scenario, Hoeffding Trees are an established method for classification. Several extensions exist, including high-performing ensemble setups such as…

Machine Learning · Computer Science 2015-11-04 Diego Marrón, Jesse Read, Albert Bifet, Nacho Navarro

Data streams are ubiquitous in modern business and society. In practice, data streams may evolve over time and cannot be stored indefinitely. Effective and transparent machine learning on data streams is thus often challenging. Hoeffding…

Machine Learning · Computer Science 2022-09-08 Johannes Haug, Klaus Broelemann, Gjergji Kasneci

The last decade has seen a surge of interest in adaptive learning algorithms for data stream classification, with applications ranging from predicting ozone level peaks and learning stock market indicators to detecting computer security…

Machine Learning · Statistics 2018-08-13 Ali Pesaranghader, Herna Viktor, Eric Paquet

Hoeffding trees are the state-of-the-art method in decision tree learning for evolving data streams. Because of their efficiency, these very fast decision trees are used in many real applications where data is created in real time. In this…

Machine Learning · Computer Science 2020-10-19 Chaitanya Manapragada, Geoffrey I Webb, Mahsa Salehi, Albert Bifet

Database research can help machine learning performance in many ways. One way is to design better data structures. This paper combines the use of incremental computation and sequential and probabilistic filtering to enable "forgetful"…

Machine Learning · Computer Science 2022-12-16 Zhehu Yuan, Yinqi Sun, Dennis Shasha

State-of-the-art data stream mining has long drawn from ensembles of the Very Fast Decision Tree, a seminal algorithm honored with the 2015 KDD Test-of-Time Award. However, the emergence of large tabular models, i.e., transformers designed…

Machine Learning · Computer Science 2025-12-16 Afonso Lourenço, João Gama, Eric P. Xing, Goreti Marreiros

We propose soft Hoeffding trees (SoHoT) as a new differentiable and transparent model for possibly infinite and changing data streams. Stream mining algorithms such as Hoeffding trees grow based on the incoming data stream, but they…

Machine Learning · Computer Science 2025-09-24 Kirsten Köbschall, Lisa Hartung, Stefan Kramer

The Internet of Things generates massive data streams, with edge computing emerging as a key enabler for online IoT applications and 5G networks. Edge solutions facilitate real-time machine learning inference, but also require continuous…

Machine Learning · Computer Science 2025-12-09 Afonso Lourenço, João Rodrigo, João Gama, Goreti Marreiros

The systems monitoring the location of public transport vehicles rely on wireless transmission. The location readings from GPS-based devices are received with some latency caused by periodical data transmission and temporal problems…

Networking and Internet Architecture · Computer Science 2018-02-28 Maciej Grzenda, Karolina Kwasiborska, Tomasz Zaremba

This paper explores the concepts of modelling a software development project as a process that results in the creation of a continuous stream of data. In terms of the Jazz repository used in this research, one aspect of that stream of data…

Software Engineering · Computer Science 2014-07-24 Andy M. Connor, Jacqui Finlay, Russel Pears

Data stream mining aims at extracting meaningful knowledge from continually evolving data streams, addressing the challenges posed by nonstationary environments, particularly concept drift, which refers to a change in the underlying data…

Machine Learning · Computer Science 2025-01-03 Kleanthis Malialis, Jin Li, Christos G. Panayiotou, Marios M. Polycarpou

Machine learning software accounts for a significant amount of energy consumed in data centers. These algorithms are usually optimized towards predictive performance, i.e. accuracy, and scalability. This is the case of data stream mining…

Machine Learning · Computer Science 2018-08-06 Eva García-Martín, Niklas Lavesson, Håkan Grahn, Emiliano Casalicchio, Veselka Boeva

In many real-world scenarios, we often deal with streaming data that is sequentially collected over time. Due to the non-stationary nature of the environment, the streaming data distribution may change in unpredictable ways, which is known…

Machine Learning · Computer Science 2022-06-07 Wendi Li, Xiao Yang, Weiqing Liu, Yingce Xia, Jiang Bian

The amount of real-time communication between agents in an information system has increased rapidly since the beginning of the decade. This is because the use of these systems, e.g., social media, has become commonplace in today's society.…

Machine Learning · Computer Science 2020-07-13 Christoph Raab, Moritz Heusinger, Frank-Michael Schleif

A fundamental issue for statistical classification models in a streaming environment is that the joint distribution between predictor and response variables changes over time (a phenomenon known as concept drift), such that their…

Machine Learning · Statistics 2019-02-11 Shujian Yu, Zubin Abraham, Heng Wang, Mohak Shah, Yantao Wei, José C. Príncipe