Related papers: Deducing Truth from Correlation

Multi-Agent Fact Checking

We formulate the problem of fake news detection using distributed fact-checkers (agents) with unknown reliability. The stream of news/statements is modeled as an independent and identically distributed binary source (to represent true and…

Optimization and Control · Mathematics 2025-03-05 Ashwin Verma , Soheil Mohajer , Behrouz Touri

Unsupervised Ensemble Learning with Dependent Classifiers

In unsupervised ensemble learning, one obtains predictions from multiple sources or classifiers, yet without knowing the reliability and expertise of each source, and with no labeled data to assess it. The task is to combine these possibly…

Machine Learning · Computer Science 2016-02-24 Ariel Jaffe , Ethan Fetaya , Boaz Nadler , Tingting Jiang , Yuval Kluger

Towards efficient representation identification in supervised learning

Humans have a remarkable ability to disentangle complex sensory inputs (e.g., image, text) into simple factors of variation (e.g., shape, color) without much supervision. This ability has inspired many works that attempt to solve the…

Machine Learning · Computer Science 2024-12-25 Kartik Ahuja , Divyat Mahajan , Vasilis Syrgkanis , Ioannis Mitliagkas

Instance Optimal Learning

We consider the following basic learning task: given independent draws from an unknown distribution over a discrete support, output an approximation of the distribution that is as accurate as possible in $\ell_1$ distance (i.e. total…

Machine Learning · Computer Science 2015-11-12 Gregory Valiant , Paul Valiant

Representation Dependence in Probabilistic Inference

Non-deductive reasoning systems are often {\em representation dependent}: representing the same situation in two different ways may cause such a system to return two different answers. Some have viewed this as a significant problem. For…

Artificial Intelligence · Computer Science 2007-05-23 Joseph Y. Halpern , Daphne Koller

Total Empiricism: Learning from Data

Statistical analysis is an important tool to distinguish systematic from chance findings. Current statistical analyses rely on distributional assumptions reflecting the structure of some underlying model, which if not met lead to problems…

Statistics Theory · Mathematics 2023-11-15 Orestis Loukas , Ho Ryun Chung

Learning to Acquire Information

We consider the problem of diagnosis where a set of simple observations are used to infer a potentially complex hidden hypothesis. Finding the optimal subset of observations is intractable in general, thus we focus on the problem of active…

Artificial Intelligence · Computer Science 2017-07-12 Yewen Pu , Leslie P Kaelbling , Armando Solar-Lezama

Distributed Inference with Sparse and Quantized Communication

We consider the problem of distributed inference where agents in a network observe a stream of private signals generated by an unknown state, and aim to uniquely identify this state from a finite set of hypotheses. We focus on scenarios…

Systems and Control · Electrical Eng. & Systems 2021-09-01 Aritra Mitra , John A. Richards , Saurabh Bagchi , Shreyas Sundaram

Inferring deterministic causal relations

We consider two variables that are related to each other by an invertible function. While it has previously been shown that the dependence structure of the noise can provide hints to determine which of the two variables is the cause, we…

Machine Learning · Computer Science 2012-03-19 Povilas Daniusis , Dominik Janzing , Joris Mooij , Jakob Zscheischler , Bastian Steudel , Kun Zhang , Bernhard Schoelkopf

Distributed Learning with Partial Information Sharing

This work studies the distributed learning process on a network of agents. Agents make partial observation about an unknown hypothesis and iteratively share their beliefs over a set of possible hypotheses with their neighbors to learn the…

Systems and Control · Electrical Eng. & Systems 2024-11-19 P Raghavendra Rao , Pooja Vyavahare

An Unsupervised Bayesian Neural Network for Truth Discovery in Social Networks

The problem of estimating event truths from conflicting agent opinions in a social network is investigated. An autoencoder learns the complex relationships between event truths, agent reliabilities and agent observations. A Bayesian network…

Machine Learning · Computer Science 2021-01-26 Jielong Yang , Wee Peng Tay

True and false discoveries with independent and sequential e-values

In this paper we use e-values in the context of multiple hypothesis testing assuming that the base tests produce independent, or sequential, e-values. Our simulation and empirical studies and theoretical considerations suggest that, under…

Methodology · Statistics 2024-08-14 Vladimir Vovk , Ruodu Wang

Dependency Decomposition and a Reject Option for Explainable Models

Deploying machine learning models in safety-related do-mains (e.g. autonomous driving, medical diagnosis) demands for approaches that are explainable, robust against adversarial attacks and aware of the model uncertainty. Recent deep…

Computer Vision and Pattern Recognition · Computer Science 2020-12-14 Jan Kronenberger , Anselm Haselhoff

Certain and Approximately Certain Models for Statistical Learning

Real-world data is often incomplete and contains missing values. To train accurate models over real-world datasets, users need to spend a substantial amount of time and resources imputing and finding proper values for missing data items. In…

Machine Learning · Statistics 2024-03-05 Cheng Zhen , Nischal Aryal , Arash Termehchy , Alireza Aghasi , Amandeep Singh Chabada

Distributed Learning with Infinitely Many Hypotheses

We consider a distributed learning setup where a network of agents sequentially access realizations of a set of random variables with unknown distributions. The network objective is to find a parametrized distribution that best describes…

Optimization and Control · Mathematics 2016-05-10 Angelia Nedić , Alex Olshevsky , César Uribe

Evaluating Factuality in Generation with Dependency-level Entailment

Despite significant progress in text generation models, a serious limitation is their tendency to produce text that is factually inconsistent with information in the input. Recent work has studied whether textual entailment systems can be…

Computation and Language · Computer Science 2020-10-23 Tanya Goyal , Greg Durrett

Partial Information Sharing over Social Learning Networks

This work addresses the problem of sharing partial information within social learning strategies. In traditional social learning, agents solve a distributed multiple hypothesis testing problem by performing two operations at each instant:…

Signal Processing · Electrical Eng. & Systems 2022-12-07 Virginia Bordignon , Vincenzo Matta , Ali H. Sayed

A New Approach to Distributed Hypothesis Testing and Non-Bayesian Learning: Improved Learning Rate and Byzantine-Resilience

We study a setting where a group of agents, each receiving partially informative private signals, seek to collaboratively learn the true underlying state of the world (from a finite set of hypotheses) that generates their joint observation…

Systems and Control · Electrical Eng. & Systems 2019-07-09 Aritra Mitra , John A. Richards , Shreyas Sundaram

Learning Independent Causal Mechanisms

Statistical learning relies upon data sampled from a distribution, and we usually do not care what actually generated it in the first place. From the point of view of causal modeling, the structure of each distribution is induced by…

Machine Learning · Computer Science 2018-09-11 Giambattista Parascandolo , Niki Kilbertus , Mateo Rojas-Carulla , Bernhard Schölkopf

Detecting hidden confounding in observational data using multiple environments

A common assumption in causal inference from observational data is that there is no hidden confounding. Yet it is, in general, impossible to verify this assumption from a single dataset. Under the assumption of independent causal mechanisms…

Methodology · Statistics 2023-11-07 Rickard K. A. Karlsson , Jesse H. Krijthe