Related papers: Repeated Observations for Classification

On deconvolution with repeated measurements

In a large class of statistical inverse problems it is necessary to suppose that the transformation that is inverted is known. Although, in many applications, it is unrealistic to make this assumption, the problem is often insoluble without…

Statistics Theory · Mathematics 2008-12-18 Aurore Delaigle , Peter Hall , Alexander Meister

Density deconvolution from repeated measurements without symmetry assumption on the errors

We consider deconvolution from repeated observations with unknown error distribution. So far, this model has mostly been studied under the additional assumption that the errors are symmetric. We construct an estimator for the non-symmetric…

Statistics Theory · Mathematics 2014-07-15 Johanna Kappus , Fabienne Comte

Deconvolution of repeated measurements corrupted by unknown noise

Recent advances have demonstrated the possibility of solving the deconvolution problem without prior knowledge of the noise distribution. In this paper, we study the repeated measurements model, where information is derived from multiple…

Statistics Theory · Mathematics 2024-09-04 Jérémie Capitao-Miniconi , Elisabeth Gassiat , Luc Lehéricy

On Supervised Classification of Feature Vectors with Independent and Non-Identically Distributed Elements

In this paper, we investigate the problem of classifying feature vectors with mutually independent but non-identically distributed elements. First, we show the importance of this problem. Next, we propose a classifier and derive an…

Machine Learning · Computer Science 2021-09-01 Farzad Shahrivari , Nikola Zlatanov

Iterative Thresholding for Demixing Structured Superpositions in High Dimensions

We consider the demixing problem of two (or more) high-dimensional vectors from nonlinear observations when the number of such observations is far less than the ambient dimension of the underlying vectors. Specifically, we demonstrate an…

Machine Learning · Statistics 2017-01-25 Mohammadreza Soltani , Chinmay Hegde

Linear classification methods for multivariate repeated measures data -- a simulation study

Researchers in the behavioral and social sciences use linear discriminant analysis (LDA) for predictions of group membership (classification) and for identifying the variables most relevant to group separation among a set of continuous…

Methodology · Statistics 2025-05-28 Ricarda Graf , Marina Zeldovich , Sarah Friedrich

Joint Dimensionality Reduction for Two Feature Vectors

Many machine learning problems, especially multi-modal learning problems, have two sets of distinct features (e.g., image and text features in news story classification, or neuroimaging data and neurocognitive data in cognitive science…

Machine Learning · Statistics 2016-11-01 Yanjun Li , Yoram Bresler

Linear regression with unmatched data: a deconvolution perspective

Consider the regression problem where the response $Y\in\mathbb{R}$ and the covariate $X\in\mathbb{R}^d$ for $d\geq 1$ are \textit{unmatched}. Under this scenario, we do not have access to pairs of observations from the distribution of $(X,…

Statistics Theory · Mathematics 2023-09-19 Mona Azadkia , Fadoua Balabdaoui

Classification with many classes: challenges and pluses

The objective of the paper is to study accuracy of multi-class classification in high-dimensional setting, where the number of classes is also large ("large $L$, large $p$, small $n$" model). While this problem arises in many practical…

Statistics Theory · Mathematics 2019-07-18 Felix Abramovich , Marianna Pensky

Determination of class-specific variables in nonparametric multiple-class classification

As technology advanced, collecting data via automatic collection devices become popular, thus we commonly face data sets with lengthy variables, especially when these data sets are collected without specific research goals beforehand. It…

Machine Learning · Statistics 2022-05-10 Wan-Ping Nicole Chen , Yuan-chin Ivan Chang

Statistical Inference in Classification of High-dimensional Gaussian Mixture

We consider the classification problem of a high-dimensional mixture of two Gaussians with general covariance matrices. Using the replica method from statistical physics, we investigate the asymptotic behavior of a general class of…

Machine Learning · Statistics 2024-10-29 Hanwen Huang , Peng Zeng

Compressed Anomaly Detection with Multiple Mixed Observations

We consider a collection of independent random variables that are identically distributed, except for a small subset which follows a different, anomalous distribution. We study the problem of detecting which random variables in the…

Information Theory · Computer Science 2018-06-21 Natalie Durgin , Rachel Grotheer , Chenxi Huang , Shuang Li , Anna Ma , Deanna Needell , Jing Qin

Distributionally Robust Feature Selection

We study the problem of selecting limited features to observe such that models trained on them can perform well simultaneously across multiple subpopulations. This problem has applications in settings where collecting each feature is…

Machine Learning · Computer Science 2025-10-27 Maitreyi Swaroop , Tamar Krishnamurti , Bryan Wilder

Regression for partially observed variables and nonparametric quantiles of conditional probabilities

Efficient estimation under bias sampling, censoring or truncation is a difficult question which has been partially answered and the usual estimators are not always consistent. Several biased designs are considered for models with variables…

Statistics Theory · Mathematics 2007-10-22 Odile Pons

Training Deep Neural Networks to Detect Repeatable 2D Features Using Large Amounts of 3D World Capture Data

Image space feature detection is the act of selecting points or parts of an image that are easy to distinguish from the surrounding image region. By combining a repeatable point detection with a descriptor, parts of an image can be matched…

Computer Vision and Pattern Recognition · Computer Science 2019-12-11 Alexander Mai , Joseph Menke , Allen Yang

A Mathematical Framework for Feature Selection from Real-World Data with Non-Linear Observations

In this paper, we study the challenge of feature selection based on a relatively small collection of sample pairs $\{(x_i, y_i)\}_{1 \leq i \leq m}$. The observations $y_i \in \mathbb{R}$ are thereby supposed to follow a noisy single-index…

Machine Learning · Statistics 2016-12-28 Martin Genzel , Gitta Kutyniok

P-values for classification

Let $(X,Y)$ be a random variable consisting of an observed feature vector $X\in \mathcal{X}$ and an unobserved class label $Y\in \{1,2,...,L\}$ with unknown joint distribution. In addition, let $\mathcal{D}$ be a training data set…

Statistics Theory · Mathematics 2008-06-26 Lutz Duembgen , Bernd-Wolfgang Igl , Axel Munk

Nonparametric Density Estimation for Spatial Data with Wavelets

Nonparametric density estimators are studied for $d$-dimensional, strongly spatial mixing data which is defined on a general $N$-dimensional lattice structure. We consider linear and nonlinear hard thresholded wavelet estimators which are…

Statistics Theory · Mathematics 2017-12-27 Johannes T. N. Krebs

Longitudinal Support Vector Machines for High Dimensional Time Series

We consider the problem of learning a classifier from observed functional data. Here, each data-point takes the form of a single time-series and contains numerous features. Assuming that each such series comes with a binary label, the…

Machine Learning · Computer Science 2020-02-25 Kristiaan Pelckmans , Hong-Li Zeng

Classification with Costly Features as a Sequential Decision-Making Problem

This work focuses on a specific classification problem, where the information about a sample is not readily available, but has to be acquired for a cost, and there is a per-sample budget. Inspired by real-world use-cases, we analyze average…

Machine Learning · Computer Science 2020-03-05 Jaromír Janisch , Tomáš Pevný , Viliam Lisý