Related papers: Random forests for binary geospatial data

200 papers

Random forest (RF) is one of the most popular methods for estimating regression functions. The local nature of the RF algorithm, based on intra-node means and variances, is ideal when errors are i.i.d. For dependent error processes like…

Machine Learning · Statistics 2021-06-29 Arkajyoti Saha, Sumanta Basu, Abhirup Datta

Analysis of geospatial data has traditionally been model-based, with a mean model, customarily specified as a linear regression on the covariates, and a covariance model, encoding the spatial dependence. We relax the strong assumption of…

Machine Learning · Statistics 2024-05-28 Wentao Zhan, Abhirup Datta

Random Forests [Breiman:2001] (RF) are a fully non-parametric statistical method requiring no distributional assumptions on covariate relation to the response. RF are a robust, nonlinear technique that optimizes predictive accuracy by…

Computation · Statistics 2016-12-30 John Ehrlinger

In this paper, we propose Random Forests by Random Weights (RF-RW), a theoretically grounded and practically effective alternative RF modelling for nonlinear time series data, where existing RF-based approaches struggle to adequately…

Methodology · Statistics 2025-11-18 Shihao Zhang, Zudi Lu, Chao Zheng

To obtain a probabilistic model for a dependent variable based on some set of explanatory variables, a distributional approach is often adopted where the parameters of the distribution are linked to regressors. In many classical models this…

Methodology · Statistics 2020-01-14 Lisa Schlosser, Torsten Hothorn, Reto Stauffer, Achim Zeileis

We propose generalized random forests, a method for non-parametric statistical estimation based on random forests (Breiman, 2001) that can be used to fit any quantity of interest identified as the solution to a set of local moment…

Methodology · Statistics 2018-04-06 Susan Athey, Julie Tibshirani, Stefan Wager

Random forest (Leo Breiman 2001a) (RF) is a non-parametric statistical method requiring no distributional assumptions on covariate relation to the response. RF is a robust, nonlinear technique that optimizes predictive accuracy by fitting…

Computation · Statistics 2016-12-30 John Ehrlinger

We propose a computationally efficient alternative to generalized random forests (GRFs) for estimating heterogeneous effects in large dimensions. While GRFs rely on a gradient-based splitting criterion, which in large dimensions is…

Machine Learning · Statistics 2025-06-18 David Fleischer, David A. Stephens, Archer Y. Yang

Random forests (RFs) are among the most popular supervised learning algorithms due to their nonlinear flexibility and ease-of-use. However, as black box models, they can only be interpreted via algorithmically-defined feature importance…

Methodology · Statistics 2025-05-26 Abhineet Agarwal, Ana M. Kenney, Yan Shuo Tan, Tiffany M. Tang, Bin Yu

Random Forest (Breiman, 2001) is a successful and widely used regression and classification algorithm. Part of its appeal and reason for its versatility is its (implicit) construction of a kernel-type weighting function on training data,…

Machine Learning · Statistics 2022-10-13 Domagoj Ćevid, Loris Michel, Jeffrey Näf, Nicolai Meinshausen, Peter Bühlmann

Random forests is a common non-parametric regression technique which performs well for mixed-type unordered data and irrelevant features, while being robust to monotonic variable transformations. Standard random forests, however, do not…

Computation · Statistics 2019-06-19 Taylor Pospisil, Ann B. Lee

Random forests are considered one of the best out-of-the-box classification and regression algorithms due to their high level of predictive performance with relatively little tuning. Pairwise proximities can be computed from a trained…

Machine Learning · Statistics 2023-03-02 Jake S. Rhodes, Adele Cutler, Kevin R. Moon

We propose methods to improve the forecasts from generalized autoregressive score (GAS) models (Creal et al., 2013; Harvey, 2013) by localizing their parameters using decision trees and random forests. These methods avoid the curse of…

Econometrics · Economics 2023-05-31 Andrew J. Patton, Yasin Simsek

Gaussian processes (GPs) produce good probabilistic models of functions, but most GP kernels require $O((n+m)n^2)$ time, where $n$ is the number of data points and $m$ the number of predictive locations. We present a new kernel that allows…

Machine Learning · Computer Science 2023-04-03 Michael K. Cohen, Samuel Daulton, Michael A. Osborne

Data fields sampled on irregularly spaced points arise in many applications in the sciences and engineering. For regular grids, Convolutional Neural Networks (CNNs) have been successfully used to gain benefits from weight sharing and…

Machine Learning · Computer Science 2023-02-28 Nathaniel Trask, Ravi G. Patel, Ben J. Gross, Paul J. Atzberger

The standard regression tree method applied to observations within clusters poses both methodological and implementation challenges. Effectively leveraging these data requires methods that account for both individual-level and sample-level…

Methodology · Statistics 2025-03-05 Jeremiah Allis, Xin Jin, Riddhi Ghosh

Environmental data may be "large" due to number of records, number of covariates, or both. Random forests has a reputation for good predictive performance when using many covariates with nonlinear relationships, whereas spatial regression,…

Applications · Statistics 2018-12-27 Eric W. Fox, Jay M. Ver Hoef, Anthony R. Olsen

Random forests is a state-of-the-art supervised machine learning method which behaves well in high-dimensional settings although some limitations may happen when $p$, the number of predictors, is much larger than the number of observations…

Methodology · Statistics 2019-02-01 Louis Capitaine, Robin Genuer, Rodolphe Thiébaut

We propose a random-effects approach to missing values for generalized linear mixed model (GLMM) analysis. The method converts a GLMM with missing covariates to another GLMM without missing covariates. The standard GLMM analysis tools for…

Methodology · Statistics 2026-01-01 Thuan Nguyen, Jiangshan Zhang, Jiming Jiang

Random forest (RF) missing data algorithms are an attractive approach for dealing with missing data. They have the desirable properties of being able to handle mixed types of missing data, they are adaptive to interactions and nonlinearity,…

Machine Learning · Statistics 2017-01-23 Fei Tang, Hemant Ishwaran