Related papers: Random Forest Kernel for High-Dimension Low Sample…

A Novel Random Forest Dissimilarity Measure for Multi-View Learning

Multi-view learning is a learning task in which data is described by several concurrent representations. Its main challenge is most often to exploit the complementarities between these representations to help solve a…

Machine Learning · Computer Science 2020-07-07 Hongliu Cao , Simon Bernard , Robert Sabourin , Laurent Heutte

Random Forest for Dissimilarity-based Multi-view Learning

Many classification problems are naturally multi-view in the sense their data are described through multiple heterogeneous descriptions. For such tasks, dissimilarity strategies are effective ways to make the different descriptions…

Machine Learning · Computer Science 2020-07-17 Simon Bernard , Hongliu Cao , Robert Sabourin , Laurent Heutte

Feature space reduction method for ultrahigh-dimensional, multiclass data: Random forest-based multiround screening (RFMS)

In recent years, numerous screening methods have been published for ultrahigh-dimensional data that contain hundreds of thousands of features; however, most of these features cannot handle data with thousands of classes. Prediction models…

Machine Learning · Computer Science 2026-02-06 Gergely Hanczár , Marcell Stippinger , Dávid Hanák , Marcell T. Kurbucz , Olivér M. Törteli , Ágnes Chripkó , Zoltán Somogyvári

Distributional Random Forests: Heterogeneity Adjustment and Multivariate Distributional Regression

Random Forest (Breiman, 2001) is a successful and widely used regression and classification algorithm. Part of its appeal and reason for its versatility is its (implicit) construction of a kernel-type weighting function on training data,…

Machine Learning · Statistics 2022-10-13 Domagoj Ćevid , Loris Michel , Jeffrey Näf , Nicolai Meinshausen , Peter Bühlmann

Supervised classification methods applied to airborne hyperspectral images: Comparative study using mutual information

Nowadays, the hyperspectral remote sensing imagery HSI becomes an important tool to observe the Earth's surface, detect the climatic changes and many other applications. The classification of HSI is one of the most challenging tasks due to…

Computer Vision and Pattern Recognition · Computer Science 2022-10-28 Hasna Nhaila , Asma Elmaizi , Elkebir Sarhrouni , Ahmed Hammouch

Heterogeneous Random Forest

Random forest (RF) stands out as a highly favored machine learning approach for classification problems. The effectiveness of RF hinges on two key factors: the accuracy of individual trees and the diversity among them. In this study, we…

Machine Learning · Computer Science 2024-10-28 Ye-eun Kim , Seoung Yun Kim , Hyunjoong Kim

The classification for High-dimension low-sample size data

Huge amount of applications in various fields, such as gene expression analysis or computer vision, undergo data sets with high-dimensional low-sample-size (HDLSS), which has putted forward great challenges for standard statistical and…

Machine Learning · Computer Science 2022-06-07 Liran Shen , Meng Joo Er , Qingbo Yin

Boosting SISSO Performance on Small Sample Datasets by Using Random Forests Prescreening for Complex Feature Selection

In materials science, data-driven methods accelerate material discovery and optimization while reducing costs and improving success rates. Symbolic regression is a key to extracting material descriptors from large datasets, in particular…

Machine Learning · Computer Science 2024-10-01 Xiaolin Jiang , Guanqi Liu , Jiaying Xie , Zhenpeng Hu

Random Similarity Forests

The wealth of data being gathered about humans and their surroundings drives new machine learning applications in various fields. Consequently, more and more often, classifiers are trained using not only numerical data but also complex data…

Machine Learning · Computer Science 2022-04-13 Maciej Piernik , Dariusz Brzezinski , Pawel Zawadzki

Similarity plays a fundamental role in many areas, including data mining, machine learning, statistics and various applied domains. Inspired by the success of ensemble methods and the flexibility of trees, we propose to learn a similarity…

Machine Learning · Computer Science 2019-08-29 Donghui Yan , Songxiang Gu , Ying Xu , Zhiwei Qin

An Approximation Method for Fitted Random Forests

Random Forests (RF) is a popular machine learning method for classification and regression problems. It involves a bagging application to decision tree models. One of the primary advantages of the Random Forests model is the reduction in…

Machine Learning · Statistics 2022-07-06 Sai K Popuri

Data Dependent Kernel Approximation using Pseudo Random Fourier Features

Kernel methods are powerful and flexible approach to solve many problems in machine learning. Due to the pairwise evaluations in kernel methods, the complexity of kernel computation grows as the data size increases; thus the applicability…

Machine Learning · Computer Science 2017-11-28 Bharath Bhushan Damodaran , Nicolas Courty , Philippe-Henri Gosselin

Random matrices meet machine learning: a large dimensional analysis of LS-SVM

This article proposes a performance analysis of kernel least squares support vector machines (LS-SVMs) based on a random matrix approach, in the regime where both the dimension of data $p$ and their number $n$ grow large at the same rate.…

Machine Learning · Statistics 2016-09-09 Zhenyu Liao , Romain Couillet

Machine Learning Techniques for Pattern Recognition in High-Dimensional Data Mining

This paper proposes a frequent pattern data mining algorithm based on support vector machine (SVM), aiming to solve the performance bottleneck of traditional frequent pattern mining algorithms in high-dimensional and sparse data…

Machine Learning · Computer Science 2024-12-23 Pochun Li

A Foray into Parallel Optimisation Algorithms for High Dimension Low Sample Space Generalized Distance Weighted Discrimination problems

In many modern data sets, High dimension low sample size (HDLSS) data is prevalent in many fields of studies. There has been an increased focus recently on using machine learning and statistical methods to mine valuable information out of…

Optimization and Control · Mathematics 2023-05-23 Srivathsan Amruth , Xin Yee Lam

Support vector machines/relevance vector machine for remote sensing classification: A review

Kernel-based machine learning algorithms are based on mapping data from the original input feature space to a kernel feature space of higher dimensionality to solve a linear problem in that space. Over the last decade, kernel based…

Computer Vision and Pattern Recognition · Computer Science 2011-01-18 Mahesh Pal

Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?

The rise of Large Vision-Language Models (LVLMs) has significantly advanced video understanding. However, efficiently processing long videos remains a challenge due to the ``Sampling Dilemma'': low-density sampling risks missing critical…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Tianyuan Qu , Longxiang Tang , Bohao Peng , Senqiao Yang , Bei Yu , Jiaya Jia

Deep Distributed Random Samplings for Supervised Learning: An Alternative to Random Forests?

In (\cite{zhang2014nonlinear,zhang2014nonlinear2}), we have viewed machine learning as a coding and dimensionality reduction problem, and further proposed a simple unsupervised dimensionality reduction method, entitled deep distributed…

Machine Learning · Computer Science 2015-01-29 Xiao-Lei Zhang

Models Parametric Analysis via Adaptive Kernel Learning

Any applied mathematical model contains parameters. The paper proposes to use kernel learning for the parametric analysis of the model. The approach consists in setting a distribution on the parameter space, obtaining a finite training…

Optimization and Control · Mathematics 2025-01-27 Vladimir Norkin , Alois Pichler

Deep Representational Similarity Learning for analyzing neural signatures in task-based fMRI dataset

Similarity analysis is one of the crucial steps in most fMRI studies. Representational Similarity Analysis (RSA) can measure similarities of neural signatures generated by different cognitive states. This paper develops Deep…

Image and Video Processing · Electrical Eng. & Systems 2020-10-06 Muhammad Yousefnezhad , Jeffrey Sawalha , Alessandro Selvitella , Daoqiang Zhang