Related papers: Correct classification for big/smart/fast data mac…

200 papers

We introduce a method for computing immediately human interpretable yet accurate classifiers from tabular data. The classifiers obtained are short Boolean formulas, computed via first discretizing the original data and then using feature…

Machine Learning · Computer Science 2024-09-19 Reijo Jaakkola, Tomi Janhunen, Antti Kuusisto, Masood Feyzbakhsh Rankooh, Miikka Vilander

Interpretability and explainability are among the most important challenges of modern artificial intelligence, being mentioned even in various legislative sources. In this article, we develop a method for extracting immediately human…

Machine Learning · Computer Science 2024-06-04 Reijo Jaakkola, Tomi Janhunen, Antti Kuusisto, Masood Feyzbakhsh Rankooh, Miikka Vilander

We present an algorithm for classification tasks on big data. Experiments conducted as part of this study indicate that the algorithm can be as accurate as ensemble methods such as random forests or gradient boosted trees. Unlike ensemble…

Machine Learning · Statistics 2017-10-27 Rajiv Sambasivan, Sourish Das

Boolean matrix factorisation aims to decompose a binary data matrix into an approximate Boolean product of two low rank, binary matrices: one containing meaningful patterns, the other quantifying how the observations can be expressed as a…

Machine Learning · Statistics 2017-02-28 Tammo Rukat, Chris C. Holmes, Michalis K. Titsias, Christopher Yau

Time-series data classification is central to the analysis and control of autonomous systems, such as robots and self-driving cars. Temporal logic-based learning algorithms have been proposed recently as classifiers of such data. However,…

Machine Learning · Computer Science 2022-07-08 Erfan Aasi, Cristian Ioan Vasile, Mahroo Bahreinian, Calin Belta

Training deep learning models and performing hyperparameter tuning can be computationally demanding and time-consuming. Meanwhile, traditional machine learning methods like gradient-boosting algorithms remain the preferred choice for most…

Machine Learning · Computer Science 2024-02-23 David Bonet, Daniel Mas Montserrat, Xavier Giró-i-Nieto, Alexander G. Ioannidis

In today's data driven world, storing, processing, and gleaning insights from large-scale data are major challenges. Data compression is often required in order to store large amounts of high-dimensional data, and thus, efficient inference…

Machine Learning · Statistics 2018-09-11 Denali Molitor, Deanna Needell

Deep learning is computationally intensive, with significant efforts focused on reducing arithmetic complexity, particularly regarding energy consumption dominated by data movement. While existing literature emphasizes inference, training…

Machine Learning · Statistics 2025-06-09 Van Minh Nguyen, Cristian Ocampo, Aymen Askri, Louis Leconte, Ba-Hien Tran

In recent years, machine learning has begun automating decision making in fields as varied as college admissions, credit lending, and criminal sentencing. The socially sensitive nature of some of these applications together with increasing…

Machine Learning · Computer Science 2021-07-06 Connor Lawless, Oktay Gunluk

Classification of datasets into two or more distinct classes is an important machine learning task. Many methods are able to classify binary classification tasks with a very high accuracy on test data, but cannot provide any easily…

Machine Learning · Computer Science 2020-08-26 Yashesh Dhebar, Sparsh Gupta, Kalyanmoy Deb

Many classification applications require accurate probability estimates in addition to good class separation but often classifiers are designed focusing only on the latter. Calibration is the process of improving probability estimates by…

Machine Learning · Computer Science 2020-01-31 Tuomo Alasalmi, Jaakko Suutala, Heli Koskimäki, Juha Röning

Identifying informative components in binary data is an essential task in many research areas, including life sciences, social sciences, and recommendation systems. Boolean matrix factorization (BMF) is a family of methods that performs…

Machine Learning · Computer Science 2024-07-03 Sebastian Dalleiger, Jilles Vreeken, Michael Kamp

Data classification is a major machine learning paradigm, which has been widely applied to solve a large number of real-world problems. Traditional data classification techniques consider only physical features (e.g., distance, similarity,…

Machine Learning · Computer Science 2020-11-12 Esteban Vilca, Liang Zhao

Addressing the interpretability problem of NMF on Boolean data, Boolean Matrix Factorization (BMF) uses Boolean algebra to decompose the input into low-rank Boolean factor matrices. These matrices are highly interpretable and very useful in…

Machine Learning · Computer Science 2023-07-18 Sebastian Dalleiger, Jilles Vreeken

Matrix factorization is a key tool in data analysis; its applications include recommender systems, correlation analysis, signal processing, among others. Binary matrices are a particular case which has received significant attention for…

Machine Learning · Statistics 2019-01-30 Ignacio Ramirez

We study the application of large language models to zero-shot and few-shot classification of tabular data. We prompt the large language model with a serialization of the tabular data to a natural-language string, together with a short…

Computation and Language · Computer Science 2023-03-20 Stefan Hegselmann, Alejandro Buendia, Hunter Lang, Monica Agrawal, Xiaoyi Jiang, David Sontag

Big Data concerns large-volume, growing data sets that are complex and have multiple autonomous sources. Earlier technologies could not handle the storage and processing of such large data, which gave rise to the Big Data concept. This is a…

Machine Learning · Computer Science 2015-03-26 Praful Koturwar, Sheetal Girase, Debajyoti Mukhopadhyay

A central topic in functional data analysis is how to design an optimal decision rule, based on training samples, to classify a data function. We exploit the optimal classification problem when data functions are Gaussian processes. Sharp…

Methodology · Statistics 2021-09-14 Shuoyang Wang, Zuofeng Shang, Guanqun Cao, Jun Liu

The goal of Boolean Matrix Factorization (BMF) is to approximate a given binary matrix as the product of two low-rank binary factor matrices, where the product of the factor matrices is computed under the Boolean algebra. While the problem…

Machine Learning · Computer Science 2020-12-08 Pauli Miettinen, Stefan Neumann
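
As an illustration of the Boolean product mentioned in the abstract above, here is a minimal Python sketch. It is not taken from the listed paper: the function names `boolean_product` and `reconstruction_error` and the small matrices are hypothetical, chosen only to show how a product under Boolean algebra (where 1 + 1 = 1) differs from the ordinary matrix product.

```python
import numpy as np

def boolean_product(A, B):
    """Boolean matrix product: entry (i, j) is OR over k of (A[i, k] AND B[k, j])."""
    A = np.asarray(A, dtype=bool)
    B = np.asarray(B, dtype=bool)
    # The integer product counts the k for which both entries are 1;
    # any positive count means the OR is true.
    return (A.astype(int) @ B.astype(int)) > 0

def reconstruction_error(X, A, B):
    """Number of cells where the Boolean product of A and B disagrees with X."""
    return int(np.sum(np.asarray(X, dtype=bool) != boolean_product(A, B)))

# Hypothetical 3x3 binary matrix and a rank-2 Boolean factorisation of it.
X = np.array([[1, 1, 0],
              [1, 1, 1],
              [0, 1, 1]])
A = np.array([[1, 0],
              [1, 1],
              [0, 1]])
B = np.array([[1, 1, 0],
              [0, 1, 1]])

print(reconstruction_error(X, A, B))  # cells where the factorisation misses X
print((A @ B)[1, 1])                  # ordinary product at the overlapping cell
```

In this toy case the two patterns in `B` overlap in the middle cell, so the ordinary product gives 2 there while the Boolean product gives 1, which is why the Boolean factorisation can match `X` exactly.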

Massive classification, a classification task defined over a vast number of classes (hundreds of thousands or even millions), has become an essential part of many real-world systems, such as face recognition. Existing methods, including the…

Computer Vision and Pattern Recognition · Computer Science 2018-01-08 Xingcheng Zhang, Lei Yang, Junjie Yan, Dahua Lin