English
Related papers

Related papers: Statistical Estimation from Dependent Data

200 papers

The standard linear and logistic regression models assume that the response variables are independent, but share the same linear relationship to their corresponding vectors of covariates. The assumption that the response variables are…

Machine Learning · Computer Science 2019-10-09 Constantinos Daskalakis , Nishanth Dikkala , Ioannis Panageas

We investigate the problem of statistical inference for logistic regression with high-dimensional covariates in settings where dependence among individuals is induced by an underlying Markov random field. Going beyond the pairwise…

Statistics Theory · Mathematics 2026-03-23 Josh Miles , Sohom Bhattacharya

In many supervised learning tasks, the entities to be labeled are related to each other in complex ways and their labels are not independent. For example, in hypertext classification, the labels of linked pages are highly correlated. A…

Machine Learning · Computer Science 2013-01-07 Ben Taskar , Pieter Abbeel , Daphne Koller

Logistic regression is key method for modeling the probability of a binary outcome based on a collection of covariates. However, the classical formulation of logistic regression relies on the independent sampling assumption, which is often…

Statistics Theory · Mathematics 2024-09-25 Somabha Mukherjee , Ziang Niu , Sagnik Halder , Bhaswar B. Bhattacharya , George Michailidis

Gaussian mixture models are widely used to model data generated from multiple latent sources. Despite its popularity, most theoretical research assumes that the labels are either independent and identically distributed, or follows a Markov…

Statistics Theory · Mathematics 2025-10-09 Seunghyun Lee , Rajarshi Mukherjee , Sumit Mukherjee

In this paper, we focus on the problem of statistical dependence estimation using characteristic functions. We propose a statistical dependence measure, based on the maximum-norm of the difference between joint and product-marginal…

Machine Learning · Computer Science 2022-08-18 Povilas Daniušis , Shubham Juneja , Lukas Kuzma , Virginijus Marcinkevičius

Recently, graph (network) data is an emerging research area in artificial intelligence, machine learning and statistics. In this work, we are interested in whether node's labels (people's responses) are affected by their neighbor's features…

Methodology · Statistics 2022-10-12 Haixiang Zhang , Yingjun Deng , Alan J. X. Guo , Qing-Hu Hou , Ou Wu

In a large social network whose members harbor binary sentiments towards an issue, we investigate the asymptotic accuracy of sentiment detection. We model the user sentiments by an Ising Markov random field model and allow the user…

Social and Information Networks · Computer Science 2017-10-10 Tian Tong , Rohit Negi

The Ising model is a celebrated example of a Markov random field, introduced in statistical physics to model ferromagnetism. This is a discrete exponential family with binary outcomes, where the sufficient statistic involves a quadratic…

Statistics Theory · Mathematics 2021-09-08 Somabha Mukherjee

Measuring conditional dependencies among the variables of a network is of great interest to many disciplines. This paper studies some shortcomings of the existing dependency measures in detecting direct causal influences or their lack of…

Machine Learning · Statistics 2017-06-05 Jalal Etesami , Kun Zhang , Negar Kiyavash

We present a novel deep learning method for estimating time-dependent parameters in Markov processes through discrete sampling. Departing from conventional machine learning, our approach reframes parameter approximation as an optimization…

Dependency networks (Heckerman et al., 2000) provide a flexible framework for modeling complex systems with many variables by combining independently learned local conditional distributions through pseudo-Gibbs sampling. Despite their…

Machine Learning · Computer Science 2026-04-02 Kazuya Takabatake , Shotaro Akaho

Existing methods for differentiable structure learning in discrete data typically assume that the data are generated from specific structural equation models. However, these assumptions may not align with the true data-generating process,…

Machine Learning · Computer Science 2025-10-28 Chang Deng , Bryon Aragam

The assumption that data samples are independent and identically distributed (iid) is standard in many areas of statistics and machine learning. Nevertheless, in some settings, such as social networks, infectious disease modeling, and…

Methodology · Statistics 2019-02-06 Eli Sherman , Ilya Shpitser

This paper deals with variable selection in multivariate linear regression model when the data are observations on a spatial domain being a grid of sites in $\mathbb{Z}^d$ with $d\geqslant 2$. We use a criterion that allows to characterize…

Statistics Theory · Mathematics 2023-05-23 Jean Roland Ebende Penda , Stéphane Bouka , Guy Martial Nkiet

Dependency networks (Heckerman et al., 2000) are potential probabilistic graphical models for systems comprising a large number of variables. Like Bayesian networks, the structure of a dependency network is represented by a directed graph,…

Machine Learning · Computer Science 2021-07-05 Kazuya Takabatake , Shotaro Akaho

An inductive probabilistic classification rule must generally obey the principles of Bayesian predictive inference, such that all observed and unobserved stochastic quantities are jointly modeled and the parameter uncertainty is fully…

Machine Learning · Statistics 2015-03-25 Henrik Nyman , Jie Xiong , Johan Pensar , Jukka Corander

Statistical dependence measures like mutual information is ideal for analyzing autoencoders, but it can be ill-posed for deterministic, static, noise-free networks. We adopt the variational (Gaussian) formulation that makes dependence among…

Machine Learning · Computer Science 2026-03-24 Bo Hu , Jose C Principe

We propose a simple and efficient method for ranking features in multi-label classification. The method produces a ranking of features showing their relevance in predicting labels, which in turn allows to choose a final subset of features.…

Machine Learning · Computer Science 2016-02-25 Paweł Teisseyre

The most basic assumption used in statistical learning theory is that training data and test data are drawn from the same underlying distribution. Unfortunately, in many applications, the "in-domain" test data is drawn from a distribution…

Machine Learning · Computer Science 2011-09-30 H. Daume , D. Marcu
‹ Prev 1 2 3 10 Next ›