Related papers: Factorizable Joint Shift in Multinomial Classifica…

Factorizable joint shift revisited

Factorizable joint shift (FJS) represents a type of distribution shift (or dataset shift) that comprises both covariate and label shift. Recently, it has been observed that FJS actually arises from consecutive label and covariate (or vice…

Machine Learning · Computer Science 2026-04-30 Dirk Tasche

Domain Adaptation with Factorizable Joint Shift

Existing domain adaptation (DA) usually assumes the domain shift comes from either the covariates or the labels. However, in real-world applications, samples selected from different domains could have biases in both the covariates and the…

Machine Learning · Computer Science 2022-04-12 Hao He , Yuzhe Yang , Hao Wang

Sparse joint shift in multinomial classification

Sparse joint shift (SJS) was recently proposed as a tractable model for general dataset shift which may cause changes to the marginal distributions of features and labels as well as the posterior probabilities and the class-conditional…

Machine Learning · Statistics 2024-06-25 Dirk Tasche

Invariance assumptions for class distribution estimation

We study the problem of class distribution estimation under dataset shift. On the training dataset, both features and class labels are observed while on the test dataset only the features can be observed. The task then is the estimation of…

Machine Learning · Computer Science 2023-11-30 Dirk Tasche

FedSM: Robust Semantics-Guided Feature Mixup for Bias Reduction in Federated Learning with Long-Tail Data

Federated Learning (FL) enables collaborative model training across decentralized clients without sharing private data. However, FL suffers from biased global models due to non-IID and long-tail data distributions. We propose…

Machine Learning · Computer Science 2026-01-08 Jingrui Zhang , Yimeng Xu , Shujie Li , Feng Liang , Haihan Duan , Yanjie Dong , Victor C. M. Leung , Xiping Hu

Towards a More Reliable Interpretation of Machine Learning Outputs for Safety-Critical Systems using Feature Importance Fusion

When machine learning supports decision-making in safety-critical systems, it is important to verify and understand the reasons why a particular output is produced. Although feature importance calculation approaches assist in…

Machine Learning · Statistics 2020-09-14 Divish Rengasamy , Benjamin Rothwell , Grazziela Figueredo

PatchMix Augmentation to Identify Causal Features in Few-shot Learning

The task of Few-shot learning (FSL) aims to transfer the knowledge learned from base categories with sufficient labelled data to novel categories with scarce known information. It is currently an important research question and has great…

Computer Vision and Pattern Recognition · Computer Science 2022-11-30 Chengming Xu , Chen Liu , Xinwei Sun , Siqian Yang , Yabiao Wang , Chengjie Wang , Yanwei Fu

Mechanistic Interpretation of Machine Learning Inference: A Fuzzy Feature Importance Fusion Approach

With the widespread use of machine learning to support decision-making, it is increasingly important to verify and understand the reasons why a particular output is produced. Although post-training feature importance approaches assist this…

Machine Learning · Computer Science 2021-10-25 Divish Rengasamy , Jimiama M. Mase , Mercedes Torres Torres , Benjamin Rothwell , David A. Winkler , Grazziela P. Figueredo

Feature Importance Disparities for Data Bias Investigations

It is widely held that one cause of downstream bias in classifiers is bias present in the training data. Rectifying such biases may involve context-dependent interventions such as training separate models on subgroups, removing features…

Machine Learning · Computer Science 2024-06-04 Peter W. Chang , Leor Fishman , Seth Neel

Using Machine Learning to Test Causal Hypotheses in Conjoint Analysis

Conjoint analysis is a popular experimental design used to measure multidimensional preferences. Researchers examine how varying a factor of interest, while controlling for other relevant factors, influences decision-making. Currently,…

Methodology · Statistics 2024-11-20 Dae Woong Ham , Kosuke Imai , Lucas Janson

Federated Learning with Classifier Shift for Class Imbalance

Federated learning aims to learn a global model collaboratively while the training data belongs to different clients and is not allowed to be exchanged. However, the statistical heterogeneity challenge on non-IID data, such as class…

Machine Learning · Computer Science 2023-04-12 Yunheng Shen , Haoxiang Wang , Hairong Lv

Fused Lasso for Feature Selection using Structural Information

Feature selection has been proven a powerful preprocessing step for high-dimensional data analysis. However, most state-of-the-art methods tend to overlook the structural correlation information between pairwise samples, which may…

Machine Learning · Computer Science 2019-07-02 Lu Bai , Lixin Cui , Yue Wang , Philip S. Yu , Edwin R. Hancock

Significance Analysis for Pairwise Variable Selection in Classification

The goal of this article is to select important variables that can distinguish one class of data from another. A marginal variable selection method ranks the marginal effects for classification of individual variables, and is a useful and…

Methodology · Statistics 2014-02-19 Xingye Qiao , Yufeng Liu , J. S. Marron

FedShift: Robust Federated Learning Aggregation Scheme in Resource Constrained Environment via Weight Shifting

Federated Learning (FL) commonly relies on a central server to coordinate training across distributed clients. While effective, this paradigm suffers from significant communication overhead, impacting overall training efficiency. To…

Machine Learning · Computer Science 2026-02-12 Jungwon Seo , Minhoe Kim , Chunming Rong

Effect of covariate shift on multi-class classification of Fermi-LAT sources

Probabilistic classification of unassociated Fermi-LAT sources using machine learning methods has an implicit assumption that the distributions of associated and unassociated sources are the same as a function of source parameters, which is…

High Energy Astrophysical Phenomena · Physics 2024-01-04 Dmitry V. Malyshev

Factorized Fusion Shrinkage for Dynamic Relational Data

Modern data science applications often involve complex relational data with dynamic structures. An abrupt change in such dynamic relational data is typically observed in systems that undergo regime changes due to interventions. In such a…

Methodology · Statistics 2024-07-16 Peng Zhao , Anirban Bhattacharya , Debdeep Pati , Bani K. Mallick

Label-shift robust federated feature screening for high-dimensional classification

Distributed and federated learning are important tools for high-dimensional classification of large datasets. To reduce computational costs and overcome the curse of dimensionality, feature screening plays a pivotal role in eliminating…

Machine Learning · Statistics 2025-06-03 Qi Qin , Erbo Li , Xingxiang Li , Yifan Sun , Wu Wang , Chen Xu

Zoom and Shift are All You Need

Feature alignment serves as the primary mechanism for fusing multimodal data. We put forth a feature alignment approach that achieves full integration of multimodal information. This is accomplished via an alternating process of shifting…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Jiahao Qin

Featurized-Decomposition Join: Low-Cost Semantic Joins with Guarantees

Large Language Models (LLMs) are being increasingly used within data systems to process large datasets with text fields. A broad class of such tasks involves a semantic join-joining two tables based on a natural language predicate per pair…

Databases · Computer Science 2025-12-08 Sepanta Zeighami , Shreya Shankar , Aditya Parameswaran

Bayesian Joint Additive Factor Models for Multiview Learning

It is increasingly common to collect data of multiple different types on the same set of samples. Our focus is on studying relationships between such multiview features and responses. A motivating application arises in the context of…

Machine Learning · Statistics 2026-01-26 Niccolo Anceschi , Federico Ferrari , David B. Dunson , Himel Mallick