Related papers: Inference for Error-Prone Count Data: Estimation u…

Beyond MLE: Convex Learning for Text Generation

Maximum likelihood estimation (MLE) is a statistical method used to estimate the parameters of a probability distribution that best explain the observed data. In the context of text generation, MLE is often used to train generative language…

Computation and Language · Computer Science 2023-10-27 Chenze Shao , Zhengrui Ma , Min Zhang , Yang Feng

Convolutional Maximum Mean Discrepancy for Inference in Noisy Data

Modern data analyses frequently encounter settings where samples of variables are contaminated by measurement error. Ignoring measurement noise can substantially degrade statistical inference, while existing correction techniques are often…

Methodology · Statistics 2026-04-15 Ritwik Vashistha , Jeff M. Phillips , Abhra Sarkar , Arya Farahi

Penalized Likelihood Methods for Modeling Count Data

The paper considers parameter estimation in count data models using penalized likelihood methods. The motivating data consists of multiple independent count variables with a moderate sample size per variable. The data were collected during…

Methodology · Statistics 2026-04-15 Minh Thu Bui , Cornelis J. Potgieter , Akihito Kamata

Inference problems in binary regression model with misclassified responses

Misclassification of binary responses, if ignored, may severely bias the maximum likelihood estimators (MLE) of regression parameters. For such data, a binary regression model incorporating misclassification probabilities is extensively…

Statistics Theory · Mathematics 2020-09-28 Arindam Chatterjee , Tathagata Bandyopadhyay , Sumanta Adhya

Accurate inference in negative binomial regression

Negative binomial regression is commonly employed to analyze overdispersed count data. With small to moderate sample sizes, the maximum likelihood estimator of the dispersion parameter may be subject to a significant bias, that in turn…

Methodology · Statistics 2020-11-06 Euloge Clovis Kenne Pagui , Alessandra Salvan , Nicola Sartori

Calibrate to Discriminate: Improve In-Context Learning with Label-Free Comparative Inference

While in-context learning with large language models (LLMs) has shown impressive performance, we have discovered a unique miscalibration behavior where both correct and incorrect predictions are assigned the same level of confidence. We…

Computation and Language · Computer Science 2024-10-04 Wei Cheng , Tianlu Wang , Yanmin Ji , Fan Yang , Keren Tan , Yiyu Zheng

Generalizing Fault Detection Against Domain Shifts Using Stratification-Aware Cross-Validation

Incipient anomalies present milder symptoms compared to severe ones, and are more difficult to detect and diagnose due to their close resemblance to normal operating conditions. The lack of incipient anomaly examples in the training data…

Machine Learning · Computer Science 2020-08-21 Yingshui Tan , Baihong Jin , Qiushi Cui , Xiangyu Yue , Alberto Sangiovanni Vincentelli

CONTESTS: a Framework for Consistency Testing of Span Probabilities in Language Models

Although language model scores are often treated as probabilities, their reliability as probability estimators has mainly been studied through calibration, overlooking other aspects. In particular, it is unclear whether language models…

Computation and Language · Computer Science 2024-10-01 Eitan Wagner , Yuli Slavutsky , Omri Abend

A Multi-faceted Analysis of Cognitive Abilities: Evaluating Prompt Methods with Large Language Models on the CONSORT Checklist

Despite the rapid expansion of Large Language Models (LLMs) in healthcare, robust and explainable evaluation of their ability to assess clinical trial reporting according to CONSORT standards remains an open challenge. In particular,…

Artificial Intelligence · Computer Science 2026-02-26 Sohyeon Jeon , Hyung-Chul Lee

Can Vision-Language Models Count? A Synthetic Benchmark and Analysis of Attention-Based Interventions

Recent research suggests that Vision Language Models (VLMs) often rely on inherent biases learned during training when responding to queries about visual properties of images. These biases are exacerbated when VLMs are asked highly specific…

Computer Vision and Pattern Recognition · Computer Science 2026-04-06 Saurav Sengupta , Nazanin Moradinasab , Jiebei Liu , Donald E. Brown

Context-aware learning for generative models

This work studies the class of algorithms for learning with side-information that emerge by extending generative models with embedded context-related variables. Using finite mixture models (FMM) as the prototypical Bayesian network, we show…

Machine Learning · Statistics 2020-08-17 Serafeim Perdikis , Robert Leeb , Ricardo Chavarriaga , José del R. Millán

Regression-aware Inference with LLMs

Large language models (LLMs) have shown strong results on a range of applications, including regression and scoring tasks. Typically, one obtains outputs from an LLM via autoregressive sampling from the model's output distribution. We show…

Computation and Language · Computer Science 2024-11-04 Michal Lukasik , Harikrishna Narasimhan , Aditya Krishna Menon , Felix Yu , Sanjiv Kumar

Counterfactual Maximum Likelihood Estimation for Training Deep Networks

Although deep learning models have driven state-of-the-art performance on a wide array of tasks, they are prone to spurious correlations that should not be learned as predictive clues. To mitigate this problem, we propose a causality-based…

Machine Learning · Computer Science 2021-10-27 Xinyi Wang , Wenhu Chen , Michael Saxon , William Yang Wang

Fine-Tuning Flow Matching via Maximum Likelihood Estimation of Reconstructions

Flow Matching (FM) models achieve remarkable results in generative tasks. Building upon diffusion models, FM's simulation-free training paradigm enables simplicity and efficiency but introduces a train-inference gap: model outputs cannot be…

Machine Learning · Computer Science 2026-01-30 Zhaoyi Li , Jingtao Ding , Yong Li , Shihua Li

Getting a CLUE: A Method for Explaining Uncertainty Estimates

Both uncertainty estimation and interpretability are important factors for trustworthy machine learning systems. However, there is little work at the intersection of these two areas. We address this gap by proposing a novel method for…

Machine Learning · Statistics 2021-03-19 Javier Antorán , Umang Bhatt , Tameem Adel , Adrian Weller , José Miguel Hernández-Lobato

On the consistency of supervised learning with missing values

In many application settings, the data have missing entries which make analysis challenging. An abundant literature addresses missing values in an inferential framework: estimating parameters and their variance from incomplete tables. Here,…

Machine Learning · Statistics 2024-03-22 Julie Josse , Jacob M. Chen , Nicolas Prost , Erwan Scornet , Gaël Varoquaux

The information of attribute uncertainties: what convolutional neural networks can learn about errors in input data

Errors in measurements are key to weighting the value of data, but are often neglected in Machine Learning (ML). We show how Convolutional Neural Networks (CNNs) are able to learn about the context and patterns of signal and noise, leading…

Machine Learning · Computer Science 2021-08-11 Natália V. N. Rodrigues , L. Raul Abramo , Nina S. Hirata

Distributed Estimation, Information Loss and Exponential Families

Distributed learning of probabilistic models from multiple data repositories with minimum communication is increasingly important. We study a simple communication-efficient learning framework that first calculates the local maximum…

Machine Learning · Statistics 2014-10-13 Qiang Liu , Alexander Ihler

A connection between the pattern classification problem and the General Linear Model for statistical inference

A connection between the General Linear Model (GLM) in combination with classical statistical inference and the machine learning (MLE)-based inference is described in this paper. Firstly, the estimation of the GLM parameters is expressed as…

Machine Learning · Statistics 2022-02-10 Juan Manuel Gorriz , SIPBA group , John Suckling

Estimation Beyond Data Reweighting: Kernel Method of Moments

Moment restrictions and their conditional counterparts emerge in many areas of machine learning and statistics ranging from causal inference to reinforcement learning. Estimators for these tasks, generally called methods of moments, include…

Machine Learning · Computer Science 2023-06-14 Heiner Kremer , Yassine Nemmour , Bernhard Schölkopf , Jia-Jie Zhu