Related papers: A Criterion for Aggregation Error for Multivariate…

Regionalization of Multiscale Spatial Processes using a Criterion for Spatial Aggregation Error

The modifiable areal unit problem and the ecological fallacy are known problems that occur when modeling multiscale spatial processes. We investigate how these forms of spatial aggregation error can guide a regionalization over a spatial…

Methodology · Statistics 2015-12-11 Jonathan R. Bradley , Christopher K. Wikle , Scott H. Holan

MAGE: Multimodal Alignment and Generation Enhancement via Bridging Visual and Semantic Spaces

In the latest advancements in multimodal learning, effectively addressing the spatial and semantic losses of visual data after encoding remains a critical challenge. This is because the performance of large multimodal models is positively…

Computer Vision and Pattern Recognition · Computer Science 2025-07-30 Shaojun E , Yuchen Yang , Jiaheng Wu , Yan Zhang , Tiejun Zhao , Ziyan Chen

CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training

Despite significant work on low-bit quantization-aware training (QAT), there is still an accuracy gap between such techniques and native training. To address this, we introduce CAGE (Curvature-Aware Gradient Estimation), a new QAT method…

Machine Learning · Computer Science 2025-11-11 Soroush Tabesh , Mher Safaryan , Andrei Panferov , Alexandra Volkova , Dan Alistarh

Unified Micromechanics Theory of Composites

We consider the matrix composite materials (CM) of either random (statistically homogeneous or inhomogeneous), periodic, or deterministic (neither random nor periodic) structures. CMs exhibit linear or nonlinear behavior, coupled or…

Classical Physics · Physics 2025-12-23 Valeriy A. Buryachenko

Optimizing Multi-Scale Representations to Detect Effect Heterogeneity Using Earth Observation and Computer Vision: Applications to Two Anti-Poverty RCTs

Earth Observation (EO) data are increasingly used in policy analysis by enabling granular estimation of conditional average treatment effects (CATE). However, a challenge in EO-based causal inference is determining the scale of the input…

Machine Learning · Statistics 2025-03-18 Fucheng Warren Zhu , Connor T. Jerzak , Adel Daoud

Extending confidence calibration to generalised measures of variation

We propose the Variation Calibration Error (VCE) metric for assessing the calibration of machine learning classifiers. The metric can be viewed as an extension of the well-known Expected Calibration Error (ECE) which assesses the…

Machine Learning · Computer Science 2026-02-16 Andrew Thompson , Vivek Desai

Estimating Complier Average Causal Effects with Mixtures of Experts

Treatment non-compliance, where individuals deviate from their assigned experimental conditions, frequently complicates the estimation of causal effects. To address this, we introduce a novel learning framework based on a mixture of experts…

Methodology · Statistics 2025-06-25 François Grolleau , Céline Béji , Raphaël Porcher , François Petit

Goal-oriented error estimation and adaptivity in MsFEM computations

We introduce a goal-oriented strategy for multiscale computations performed using the Multiscale Finite Element Method (MsFEM). In a previous work, we have shown how to use, in the MsFEM framework, the concept of Constitutive Relation Error…

Numerical Analysis · Mathematics 2019-08-02 Ludovic Chamoin , Frederic Legoll

CARE: Confounder-Aware Aggregation for Reliable LLM Evaluation

LLM-as-a-judge ensembles are the standard paradigm for scalable evaluation, but their aggregation mechanisms suffer from a fundamental flaw: they implicitly assume that judges provide independent estimates of true quality. However, in…

Machine Learning · Computer Science 2026-03-03 Jitian Zhao , Changho Shin , Tzu-Heng Huang , Satya Sai Srinath Namburi GNVV , Frederic Sala

Ensemble Estimation of Generalized Mutual Information with Applications to Genomics

Mutual information is a measure of the dependence between random variables that has been used successfully in myriad applications in many fields. Generalized mutual information measures that go beyond classical Shannon mutual information…

Information Theory · Computer Science 2021-07-30 Kevin R. Moon , Kumar Sricharan , Alfred O. Hero

DALE: Differential Accumulated Local Effects for efficient and accurate global explanations

Accumulated Local Effect (ALE) is a method for accurately estimating feature effects, overcoming fundamental failure modes of previously-existed methods, such as Partial Dependence Plots. However, ALE's approximation, i.e. the method for…

Machine Learning · Computer Science 2022-10-11 Vasilis Gkolemis , Theodore Dalamagas , Christos Diou

Analysis of Gaussian Spatial Models with Covariate Measurement Error

Uncertainty is an inherent characteristic of biological and geospatial data which is almost made by measurement error in the observed values of the quantity of interest. Ignoring measurement error can lead to biased estimates and inflated…

Applications · Statistics 2018-11-16 Vahid Tadayon

CAKE: A Scalable Commonsense-Aware Framework For Multi-View Knowledge Graph Completion

Knowledge graphs store a large number of factual triples while they are still incomplete, inevitably. The previous knowledge graph completion (KGC) models predict missing links between entities merely relying on fact-view data, ignoring the…

Artificial Intelligence · Computer Science 2022-04-19 Guanglin Niu , Bo Li , Yongfei Zhang , Shiliang Pu

Automatic Metric Validation for Grammatical Error Correction

Metric validation in Grammatical Error Correction (GEC) is currently done by observing the correlation between human and metric-induced rankings. However, such correlation studies are costly, methodologically troublesome, and suffer from…

Computation and Language · Computer Science 2018-05-08 Leshem Choshen , Omri Abend

Maximum Likelihood Estimation of Spatially Varying Coefficient Models for Large Data with an Application to Real Estate Price Prediction

In regression models for spatial data, it is often assumed that the marginal effects of covariates on the response are constant over space. In practice, this assumption might often be questionable. In this article, we show how a Gaussian…

Methodology · Statistics 2020-11-13 Jakob A. Dambon , Fabio Sigrist , Reinhard Furrer

Using the Mean Absolute Percentage Error for Regression Models

We study in this paper the consequences of using the Mean Absolute Percentage Error (MAPE) as a measure of quality for regression models. We show that finding the best model under the MAPE is equivalent to doing weighted Mean Absolute Error…

Machine Learning · Statistics 2015-06-16 Arnaud De Myttenaere , Boris Golden , Bénédicte Le Grand , Fabrice Rossi

RHALE: Robust and Heterogeneity-aware Accumulated Local Effects

Accumulated Local Effects (ALE) is a widely-used explainability method for isolating the average effect of a feature on the output, because it handles cases with correlated features well. However, it has two limitations. First, it does not…

Machine Learning · Computer Science 2023-09-21 Vasilis Gkolemis , Theodore Dalamagas , Eirini Ntoutsi , Christos Diou

Variational Autoencoder for Generating Broader-Spectrum prior Proposals in Markov chain Monte Carlo Methods

This study uses a Variational Autoencoder method to enhance the efficiency and applicability of Markov Chain Monte Carlo (McMC) methods by generating broader-spectrum prior proposals. Traditional approaches, such as the Karhunen-Lo\`eve…

Machine Learning · Computer Science 2025-07-02 Marcio Borges , Felipe Pereira , Michel Tosin

Tree-aggregated regression for compositional data with measurement errors

High-dimensional compositional covariates, often derived from count data, are subject to measurement error and are frequently analyzed after aggregation along a prespecified tree to improve interpretability in applications such as…

Methodology · Statistics 2026-05-18 Zhenghan Li , Tianying Wang

Coherent Disaggregation and Uncertainty Quantification for Spatially Misaligned Data

Spatial misalignment arises when datasets are aggregated or collected at different spatial scales, leading to information loss. We develop a Bayesian disaggregation framework that links misaligned data to a continuous-domain model through…

Methodology · Statistics 2025-12-16 Man Ho Suen , Mark Naylor , Finn Lindgren