Related papers: An Innovative Algorithm For Robust, Interactive, P…

Distributed Simultaneous Inference in Generalized Linear Models via Confidence Distribution

We propose a distributed method for simultaneous inference for datasets with sample size much larger than the number of covariates, i.e., N >> p, in the generalized linear models framework. When such datasets are too big to be analyzed…

Methodology · Statistics 2020-07-23 Lu Tang , Ling Zhou , Peter X. -K. Song

Piecewise regression mixture for simultaneous functional data clustering and optimal segmentation

This paper introduces a novel mixture model-based approach for simultaneous clustering and optimal segmentation of functional data which are curves presenting regime changes. The proposed model consists in a finite mixture of piecewise…

Methodology · Statistics 2014-05-02 Faicel Chamroukhi

LCEN: A Nonlinear, Interpretable Feature Selection and Machine Learning Algorithm

Interpretable models can have advantages over black-box models, and interpretability is essential for the application of machine learning in critical settings, such as aviation or medicine. This article introduces the LASSO-Clip-EN (LCEN)…

Machine Learning · Computer Science 2025-12-02 Pedro Seber , Richard D. Braatz

K-Plane Regression

In this paper, we present a novel algorithm for piecewise linear regression which can learn continuous as well as discontinuous piecewise linear functions. The main idea is to repeatedly partition the data and learn a liner model in in each…

Machine Learning · Computer Science 2014-09-10 Naresh Manwani , P. S. Sastry

PLSO: A generative framework for decomposing nonstationary time-series into piecewise stationary oscillatory components

To capture the slowly time-varying spectral content of real-world time-series, a common paradigm is to partition the data into approximately stationary intervals and perform inference in the time-frequency domain. However, this approach…

Methodology · Statistics 2021-06-15 Andrew H. Song , Demba Ba , Emery N. Brown

Distributed Sparse Feature Selection in Communication-Restricted Networks

This paper aims to propose and theoretically analyze a new distributed scheme for sparse linear regression and feature selection. The primary goal is to learn the few causal features of a high-dimensional dataset based on noisy observations…

Machine Learning · Statistics 2021-11-05 Hanie Barghi , Amir Najafi , Seyed Abolfazl Motahari

Extensions of Morse-Smale Regression with Application to Actuarial Science

The problem of subgroups is ubiquitous in scientific research (ex. disease heterogeneity, spatial distributions in ecology...), and piecewise regression is one way to deal with this phenomenon. Morse-Smale regression offers a way to…

Machine Learning · Statistics 2017-08-22 Colleen M. Farrelly

A Better Alternative to Piecewise Linear Time Series Segmentation

Time series are difficult to monitor, summarize and predict. Segmentation organizes time series into few intervals having uniform characteristics (flatness, linearity, modality, monotonicity and so on). For scalability, we require fast…

Databases · Computer Science 2007-05-23 Daniel Lemire

A Comprehensive Approach to Universal Piecewise Nonlinear Regression Based on Trees

In this paper, we investigate adaptive nonlinear regression and introduce tree based piecewise linear regression algorithms that are highly efficient and provide significantly improved performance with guaranteed upper bounds in an…

Machine Learning · Computer Science 2013-12-30 N. Denizcan Vanli , Suleyman S. Kozat

Pairwise Difference Estimation of High Dimensional Partially Linear Model

This paper proposes a regularized pairwise difference approach for estimating the linear component coefficient in a partially linear model, with consistency and exact rates of convergence obtained in high dimensions under mild scaling…

Statistics Theory · Mathematics 2018-01-15 Fang Han , Zhao Ren , Yuxin Zhu

Computationally efficient segmentation for non-stationary time series with oscillatory patterns

We propose a novel approach for change-point detection and parameter learning in multivariate non-stationary time series exhibiting oscillatory behaviour. We approximate the process through a piecewise function defined by a sum of…

Methodology · Statistics 2026-02-02 Nicolas Bianco , Lorenzo Cappello

Contributions to Robust and Efficient Methods for Analysis of High Dimensional Data

A ubiquitous feature of data of our era is their extra-large sizes and dimensions. Analyzing such high-dimensional data poses significant challenges, since the feature dimension is often much larger than the sample size. This thesis…

Statistics Theory · Mathematics 2025-09-11 Kai Yang

Efficient Regularized Piecewise-Linear Regression Trees

We present a detailed analysis of the class of regression decision tree algorithms which employ a regulized piecewise-linear node-splitting criterion and have regularized linear models at the leaves. From a theoretic standpoint, based on…

Machine Learning · Computer Science 2019-07-02 Leonidas Lefakis , Oleksandr Zadorozhnyi , Gilles Blanchard

Discovering Frequent Gradual Itemsets with Imprecise Data

The gradual patterns that model the complex co-variations of attributes of the form "The more/less X, The more/less Y" play a crucial role in many real world applications where the amount of numerical data to manage is important, this is…

Machine Learning · Computer Science 2020-05-25 Michaël Chirmeni Boujike , Jerry Lonlac , Norbert Tsopze , Engelbert Mephu Nguifo

Nonparametric Distribution Regression Re-calibration

A key challenge in probabilistic regression is ensuring that predictive distributions accurately reflect true empirical uncertainty. Minimizing overall prediction error often encourages models to prioritize informativeness over calibration,…

Machine Learning · Statistics 2026-02-17 Ádám Jung , Domokos M. Kelen , András A. Benczúr

Improved probabilistic regression using diffusion models

Probabilistic regression models the entire predictive distribution of a response variable, offering richer insights than classical point estimates and directly allowing for uncertainty quantification. While diffusion-based generative models…

Machine Learning · Computer Science 2025-10-07 Carlo Kneissl , Christopher Bülte , Philipp Scholl , Gitta Kutyniok

A Nonparametric Approach for Multiple Change Point Analysis of Multivariate Data

Change point analysis has applications in a wide variety of fields. The general problem concerns the inference of a change in distribution for a set of time-ordered observations. Sequential detection is an online version in which new data…

Methodology · Statistics 2013-10-16 David S. Matteson , Nicholas A. James

Optimal Piecewise Local-Linear Approximations

Existing works on "black-box" model interpretation use local-linear approximations to explain the predictions made for each data instance in terms of the importance assigned to the different features for arriving at the prediction. These…

Machine Learning · Computer Science 2019-08-28 Kartik Ahuja , William Zame , Mihaela van der Schaar

A Probabilistic Framework for Temporal Distribution Generalization in Industry-Scale Recommender Systems

Temporal distribution shift (TDS) erodes the long-term accuracy of recommender systems, yet industrial practice still relies on periodic incremental training, which struggles to capture both stable and transient patterns. Existing…

Machine Learning · Computer Science 2025-11-27 Yuxuan Zhu , Cong Fu , Yabo Ni , Anxiang Zeng , Yuan Fang

Error analysis of regularized trigonometric linear regression with unbounded sampling: a statistical learning viewpoint

The effectiveness of non-parametric, kernel-based methods for function estimation comes at the price of high computational complexity, which hinders their applicability in adaptive, model-based control. Motivated by approximation techniques…

Statistics Theory · Mathematics 2023-03-17 Anna Scampicchio , Elena Arcari , Melanie N. Zeilinger