Related papers: Learning partially ranked data based on graph regu…

Penalized pairwise pseudo likelihood for variable selection with nonignorable missing data

The regularization approach for variable selection was well developed for a completely observed data set in the past two decades. In the presence of missing values, this approach needs to be tailored to different missing data mechanisms. In…

Methodology · Statistics 2017-07-31 Jiwei Zhao , Yang Yang , Yang Ning

Full-semiparametric-likelihood-based inference for non-ignorable missing data

During the past few decades, missing-data problems have been studied extensively, with a focus on the ignorable missing case, where the missing probability depends only on observable quantities. By contrast, research into non-ignorable…

Methodology · Statistics 2019-08-06 Yukun Liu , Pengfei Li , Jing Qin

A multi-class approach for ranking graph nodes: models and experiments with incomplete data

After the phenomenal success of the PageRank algorithm, many researchers have extended the PageRank approach to ranking graphs with richer structures beside the simple linkage structure. In some scenarios we have to deal with…

Numerical Analysis · Mathematics 2018-11-15 Gianna M. Del Corso , Francesco Romani

On the Optimal Recovery of Graph Signals

Learning a smooth graph signal from partially observed data is a well-studied task in graph-based machine learning. We consider this task from the perspective of optimal recovery, a mathematical framework for learning a function from…

Machine Learning · Computer Science 2023-05-31 Simon Foucart , Chunyang Liao , Nate Veldt

Estimation and imputation in Probabilistic Principal Component Analysis with Missing Not At Random data

Missing Not At Random (MNAR) values lead to significant biases in the data, since the probability of missingness depends on the unobserved values.They are ''not ignorable'' in the sense that they often require defining a model for the…

Statistics Theory · Mathematics 2020-06-11 Aude Sportisse , Claire Boyer , Julie Josse

On missing label patterns in semi-supervised learning

We investigate model based classification with partially labelled training data. In many biostatistical applications, labels are manually assigned by experts, who may leave some observations unlabelled due to class uncertainty. We analyse…

Methodology · Statistics 2019-04-08 Daniel Ahfock , Geoffrey J. McLachlan

Partially Observed Functional Data: The Case of Systematically Missing Parts

New estimators for the mean and the covariance function for partially observed functional data are proposed using a detour via the fundamental theorem of calculus. The new estimators allow for a consistent estimation of the mean and…

Methodology · Statistics 2018-08-01 Dominik Liebl , Stefan Rameseder

The asymptotics of ranking algorithms

We consider the predictive problem of supervised ranking, where the task is to rank sets of candidate items returned in response to queries. Although there exist statistical procedures that come with guarantees of consistency in this…

Statistics Theory · Mathematics 2013-11-27 John C. Duchi , Lester Mackey , Michael I. Jordan

Sequential identification of nonignorable missing data mechanisms

With nonignorable missing data, likelihood-based inference should be based on the joint distribution of the study variables and their missingness indicators. These joint models cannot be estimated from the data alone, thus requiring the…

Statistics Theory · Mathematics 2017-01-06 Mauricio Sadinle , Jerome P. Reiter

Imputation and low-rank estimation with Missing Not At Random data

Missing values challenge data analysis because many supervised and unsupervised learning methods cannot be applied directly to incomplete data. Matrix completion based on low-rank assumptions are very powerful solution for dealing with…

Machine Learning · Statistics 2020-01-30 Aude Sportisse , Claire Boyer , Julie Josse

Semiparametric Inference of the Complier Average Causal Effect with Nonignorable Missing Outcomes

Noncompliance and missing data often occur in randomized trials, which complicate the inference of causal effects. When both noncompliance and missing data are present, previous papers proposed moment and maximum likelihood estimators for…

Methodology · Statistics 2014-09-04 Hua Chen , Peng Ding , Zhi Geng , Xiao-Hua Zhou

Estimation of Network structures from partially observed Markov random fields

We consider the estimation of high-dimensional network structures from partially observed Markov random field data using a penalized pseudo-likelihood approach. We fit a misspecified model obtained by ignoring the missing data problem. We…

Statistics Theory · Mathematics 2011-08-16 Yves F. Atchade

Low-Rank Approximations of Nonseparable Panel Models

We provide estimation methods for nonseparable panel models based on low-rank factor structure approximations. The factor structures are estimated by matrix-completion methods to deal with the computational challenges of principal component…

Econometrics · Economics 2021-03-05 Iván Fernández-Val , Hugo Freeman , Martin Weidner

Estimation of Classification Rules from Partially Classified Data

We consider the situation where the observed sample contains some observations whose class of origin is known (that is, they are classified with respect to the g underlying classes of interest), and where the remaining observations in the…

Machine Learning · Statistics 2020-04-15 Geoffrey J. McLachlan , Daniel Ahfock

Is completeness necessary? Estimation in nonidentified linear models

Modern data analysis depends increasingly on estimating models via flexible high-dimensional or nonparametric machine learning methods, where the identification of structural parameters is often challenging and untestable. In linear…

Statistics Theory · Mathematics 2026-01-21 Andrii Babii , Jean-Pierre Florens

Statistical Inference with Different Missing-data Mechanisms

When data are missing due to at most one cause from some time to next time, we can make sampling distribution inferences about the parameter of the data by modeling the missing-data mechanism correctly. Proverbially, in case its mechanism…

Methodology · Statistics 2014-07-21 Kosuke Morikawa , Yutaka Kano

Fine-Grained Dynamic Framework for Bias-Variance Joint Optimization on Data Missing Not at Random

In most practical applications such as recommendation systems, display advertising, and so forth, the collected data often contains missing values and those missing values are generally missing-not-at-random, which deteriorates the…

Machine Learning · Computer Science 2024-05-27 Mingming Ha , Xuewen Tao , Wenfang Lin , Qionxu Ma , Wujiang Xu , Linxun Chen

Matrix Completion under Low-Rank Missing Mechanism

Matrix completion is a modern missing data problem where both the missing structure and the underlying parameter are high dimensional. Although missing structure is a key component to any missing data problems, existing matrix completion…

Machine Learning · Statistics 2020-03-23 Xiaojun Mao , Raymond K. W. Wong , Song Xi Chen

Identifiability of Subgroup Causal Effects in Randomized Experiments with Nonignorable Missing Covariates

Although randomized experiments are widely regarded as the gold standard for estimating causal effects, missing data of the pretreatment covariates makes it challenging to estimate the subgroup causal effects. When the missing data…

Statistics Theory · Mathematics 2014-01-08 Peng Ding , Zhi Geng

Semiparametric response model with nonignorable nonresponse

How to deal with nonignorable response is often a challenging problem encountered in statistical analysis with missing data. Parametric model assumption for the response mechanism is often made and there is no way to validate the model…

Methodology · Statistics 2018-10-31 Masatoshi Uehara , Jae Kwang Kim