Related papers: Statistical Learning from Attribution Sets

Unbiased Learning-to-Rank with Biased Feedback

Implicit feedback (e.g., clicks, dwell times, etc.) is an abundant source of data in human-interactive systems. While implicit feedback has many advantages (e.g., it is inexpensive to collect, user centric, and timely), its inherent biases…

Information Retrieval · Computer Science 2016-08-17 Thorsten Joachims, Adith Swaminathan, Tobias Schnabel

Statistical Learning from Biased Training Samples

With the deluge of digitized information in the Big Data era, massive datasets are becoming increasingly available for learning predictive models. However, in many practical situations, the poor control of the data acquisition processes may…

Machine Learning · Statistics 2022-11-02 Stephan Clémençon, Pierre Laforgue

Where Not to Learn: Prior-Aligned Training with Subset-based Attribution Constraints for Reliable Decision-Making

Reliable models should not only predict correctly, but also justify decisions with acceptable evidence. Yet conventional supervised learning typically provides only class-level labels, allowing models to achieve high accuracy through…

Computer Vision and Pattern Recognition · Computer Science 2026-05-20 Ruoyu Chen, Shangquan Sun, Xiaoqing Guo, Sanyi Zhang, Kangwei Liu, Shiming Liu, Zhangcheng Wang, Qunli Zhang, Hua Zhang, Xiaochun Cao

Attribution Modeling Increases Efficiency of Bidding in Display Advertising

Predicting click and conversion probabilities when bidding on ad exchanges is at the core of the programmatic advertising industry. Two separated lines of previous works respectively address i) the prediction of user conversion probability…

Machine Learning · Statistics 2017-07-24 Eustache Diemert, Julien Meynet, Pierre Galland, Damien Lefortier

Challenges and approaches to privacy preserving post-click conversion prediction

Online advertising has typically been more personalized than offline advertising, through the use of machine learning models and real-time auctions for ad targeting. One specific task, predicting the likelihood of conversion (i.e., the…

Machine Learning · Computer Science 2022-02-01 Conor O'Brien, Arvind Thiagarajan, Sourav Das, Rafael Barreto, Chetan Verma, Tim Hsu, James Neufield, Jonathan J Hunt

Representation Learning with Statistical Independence to Mitigate Bias

Presence of bias (in datasets or tasks) is inarguably one of the most critical challenges in machine learning applications that has alluded to pivotal debates in recent years. Such challenges range from spurious associations between…

Computer Vision and Pattern Recognition · Computer Science 2020-11-23 Ehsan Adeli, Qingyu Zhao, Adolf Pfefferbaum, Edith V. Sullivan, Li Fei-Fei, Juan Carlos Niebles, Kilian M. Pohl

Privacy Aware Learning

We study statistical risk minimization problems under a privacy model in which the data is kept confidential even from the learner. In this local privacy framework, we establish sharp upper and lower bounds on the convergence rates of…

Machine Learning · Statistics 2013-10-11 John C. Duchi, Michael I. Jordan, Martin J. Wainwright

Constrained Learning with Non-Convex Losses

Though learning has become a core component of modern information processing, there is now ample evidence that it can lead to biased, unsafe, and prejudiced systems. The need to impose requirements on learning is therefore paramount,…

Machine Learning · Computer Science 2022-10-20 Luiz F. O. Chamon, Santiago Paternain, Miguel Calvo-Fullana, Alejandro Ribeiro

Reconsidering Learning Objectives in Unbiased Recommendation with Unobserved Confounders

This work studies the problem of learning unbiased algorithms from biased feedback for recommendation. We address this problem from a novel distribution shift perspective. Recent works in unbiased recommendation have advanced the…

Information Retrieval · Computer Science 2023-10-05 Teng Xiao, Zhengyu Chen, Suhang Wang

Ensemble Ranking Model with Multiple Pretraining Strategies for Web Search

An effective ranking model usually requires a large amount of training data to learn the relevance between documents and queries. User clicks are often used as training data since they can indicate relevance and are cheap to collect, but…

Information Retrieval · Computer Science 2023-02-21 Xiaojie Sun, Lulu Yu, Yiting Wang, Keping Bi, Jiafeng Guo

Learning the Truth From Only One Side of the Story

Learning under one-sided feedback (i.e., where we only observe the labels for examples we predicted positively on) is a fundamental problem in machine learning -- applications include lending and recommendation systems. Despite this, there…

Machine Learning · Computer Science 2020-10-14 Heinrich Jiang, Qijia Jiang, Aldo Pacchiano

Contextual Preference Distribution Learning

Decision-making problems often feature uncertainty stemming from heterogeneous and context-dependent human preferences. To address this, we propose a sequential learning-and-optimization pipeline to learn preference distributions and…

Machine Learning · Computer Science 2026-03-19 Benjamin Hudson, Laurent Charlin, Emma Frejinger

Implicit Look-alike Modelling in Display Ads: Transfer Collaborative Filtering to CTR Estimation

User behaviour targeting is essential in online advertising. Compared with sponsored search keyword targeting and contextual advertising page content targeting, user behaviour targeting builds users' interest profiles via tracking their…

Machine Learning · Computer Science 2016-01-12 Weinan Zhang, Lingxi Chen, Jun Wang

Learning from Imperfect Demonstrations via Adversarial Confidence Transfer

Existing learning from demonstration algorithms usually assume access to expert demonstrations. However, this assumption is limiting in many real-world applications since the collected demonstrations may be suboptimal or even consist of…

Robotics · Computer Science 2022-03-03 Zhangjie Cao, Zihan Wang, Dorsa Sadigh

Online Set Learning from Precision and Recall Feedback

We consider the problem of learning an unknown subset $N_{\text{target}}$ of a domain in an online setting. In each round $t$, the learner predicts a set of items $N_t$ and receives one of two types of feedback, each with equal probability:…

Machine Learning · Computer Science 2026-05-12 Lee Cohen, Yishay Mansour, Shay Moran, Han Shao

Unbiased Learning to Rank with Unbiased Propensity Estimation

Learning to rank with biased click data is a well-known challenge. A variety of methods has been explored to debias click data for learning to rank such as click models, result interleaving and, more recently, the unbiased learning-to-rank…

Information Retrieval · Computer Science 2018-04-25 Qingyao Ai, Keping Bi, Cheng Luo, Jiafeng Guo, W. Bruce Croft

A Primer on Domain Adaptation

Standard supervised machine learning assumes that the distribution of the source samples used to train an algorithm is the same as the one of the target samples on which it is supposed to make predictions. However, as any data scientist…

Machine Learning · Computer Science 2020-02-12 Pirmin Lemberger, Ivan Panico

Predicting Choice with Set-Dependent Aggregation

Providing users with alternatives to choose from is an essential component in many online platforms, making the accurate prediction of choice vital to their success. A renewed interest in learning choice models has led to significant…

Machine Learning · Computer Science 2020-01-22 Nir Rosenfeld, Kojin Oshiba, Yaron Singer

An Offline Metric for the Debiasedness of Click Models

A well-known problem when learning from user clicks are inherent biases prevalent in the data, such as position or trust bias. Click models are a common method for extracting information from user clicks, such as document relevance in web…

Information Retrieval · Computer Science 2024-12-17 Romain Deffayet, Philipp Hager, Jean-Michel Renders, Maarten de Rijke

Distributionally Robust Learning for Multi-source Unsupervised Domain Adaptation

Empirical risk minimization often performs poorly when the distribution of the target domain differs from those of source domains. To address such potential distribution shifts, we develop an unsupervised domain adaptation approach that…

Machine Learning · Statistics 2025-03-25 Zhenyu Wang, Peter Bühlmann, Zijian Guo