Related papers: Pessimistic Evaluation

Distributionally-Informed Recommender System Evaluation

Current practice for evaluating recommender systems typically focuses on point estimates of user-oriented effectiveness metrics or business metrics, sometimes combined with additional metrics for considerations such as diversity and…

Information Retrieval · Computer Science 2023-09-13 Michael D. Ekstrand , Ben Carterette , Fernando Diaz

The Fault in Our Recommendations: On the Perils of Optimizing the Measurable

Recommendation systems are widespread, and through customized recommendations, promise to match users with options they will like. To that end, data on engagement is collected and used. Most recommendation systems are ranking-based, where…

Information Retrieval · Computer Science 2024-05-08 Omar Besbes , Yash Kanoria , Akshit Kumar

User-centered Evaluation of Popularity Bias in Recommender Systems

Recommendation and ranking systems are known to suffer from popularity bias; the tendency of the algorithm to favor a few popular items while under-representing the majority of other items. Prior research has examined various approaches for…

Information Retrieval · Computer Science 2021-03-12 Himan Abdollahpouri , Masoud Mansoury , Robin Burke , Bamshad Mobasher , Edward Malthouse

Information-Theoretic Measures for Objective Evaluation of Classifications

This work presents a systematic study of objective evaluations of abstaining classifications using Information-Theoretic Measures (ITMs). First, we define objective measures for which they do not depend on any free parameter. This…

Computer Vision and Pattern Recognition · Computer Science 2012-08-16 Bao-Gang Hu , Ran He , XiaoTong Yuan

Bias in Evaluation Processes: An Optimization-Based Model

Biases with respect to socially-salient attributes of individuals have been well documented in evaluation processes used in settings such as admissions and hiring. We view such an evaluation process as a transformation of a distribution of…

Computers and Society · Computer Science 2023-10-27 L. Elisa Celis , Amit Kumar , Anay Mehrotra , Nisheeth K. Vishnoi

Citation Statistics

This is a report about the use and misuse of citation data in the assessment of scientific research. The idea that research assessment must be done using ``simple and objective'' methods is increasingly prevalent today. The ``simple and…

Methodology · Statistics 2009-10-20 Robert Adler , John Ewing , Peter Taylor

Principled Multi-Aspect Evaluation Measures of Rankings

Information Retrieval evaluation has traditionally focused on defining principled ways of assessing the relevance of a ranked list of documents with respect to a query. Several methods extend this type of evaluation beyond relevance, making…

Information Retrieval · Computer Science 2022-12-02 Maria Maistro , Lucas Chaves Lima , Jakob Grue Simonsen , Christina Lioma

A general notion of information-related complexity applicable to both natural and man-made systems is proposed. The overall approach is to explicitly consider a rational agent performing a certain task with a quantifiable degree of success.…

Data Analysis, Statistics and Probability · Physics 2013-01-18 Eugene Perevalov , David Grace

Equal Experience in Recommender Systems

We explore the fairness issue that arises in recommender systems. Biased data due to inherent stereotypes of particular groups (e.g., male students' average rating on mathematics is often higher than that on humanities, and vice versa for…

Machine Learning · Computer Science 2022-10-13 Jaewoong Cho , Moonseok Choi , Changho Suh

Estimating Error and Bias in Offline Evaluation Results

Offline evaluations of recommender systems attempt to estimate users' satisfaction with recommendations using static data from prior user interactions. These evaluations provide researchers and developers with first approximations of the…

Information Retrieval · Computer Science 2020-01-28 Mucun Tian , Michael D. Ekstrand

Evaluating Stochastic Rankings with Expected Exposure

We introduce the concept of \emph{expected exposure} as the average attention ranked items receive from users over repeated samples of the same query. Furthermore, we advocate for the adoption of the principle of equal expected exposure:…

Information Retrieval · Computer Science 2020-10-22 Fernando Diaz , Bhaskar Mitra , Michael D. Ekstrand , Asia J. Biega , Ben Carterette

The Importance of Pessimism in Fixed-Dataset Policy Optimization

We study worst-case guarantees on the expected return of fixed-dataset policy optimization algorithms. Our core contribution is a unified conceptual and mathematical framework for the study of algorithms in this regime. This analysis…

Artificial Intelligence · Computer Science 2020-12-01 Jacob Buckman , Carles Gelada , Marc G. Bellemare

Take a Fresh Look at Recommender Systems from an Evaluation Standpoint

Recommendation has become a prominent area of research in the field of Information Retrieval (IR). Evaluation is also a traditional research topic in this community. Motivated by a few counter-intuitive observations reported in recent…

Information Retrieval · Computer Science 2023-08-22 Aixin Sun

Critiquing-based Modeling of Subjective Preferences

Applications designed for entertainment and other non-instrumental purposes are challenging to optimize because the relationships between system parameters and user experience can be unclear. Ideally, we would crowdsource these design…

Human-Computer Interaction · Computer Science 2022-04-26 Alan Medlar , Jing Li , Yang Liu , Dorota Glowacka

On Optimistic versus Randomized Exploration in Reinforcement Learning

We discuss the relative merits of optimistic and randomized approaches to exploration in reinforcement learning. Optimistic approaches presented in the literature apply an optimistic boost to the value estimate at each state-action pair and…

Machine Learning · Statistics 2017-06-15 Ian Osband , Benjamin Van Roy

Recommender Systems (RS) often suffer from popularity bias, where a small set of popular items dominate the recommendation results due to their high interaction rates, leaving many less popular items overlooked. This phenomenon…

Information Retrieval · Computer Science 2025-05-27 Juno Prent , Masoud Mansoury

Recommender Systems Fairness Evaluation via Generalized Cross Entropy

Fairness in recommender systems has been considered with respect to sensitive attributes of users (e.g., gender, race) or items (e.g., revenue in a multistakeholder setting). Regardless, the concept has been commonly interpreted as some…

Information Retrieval · Computer Science 2019-08-20 Yashar Deldjoo , Vito Walter Anelli , Hamed Zamani , Alejandro Bellogin , Tommaso Di Noia

Evaluation Measures of Individual Item Fairness for Recommender Systems: A Critical Study

Fairness is an emerging and challenging topic in recommender systems. In recent years, various ways of evaluating and therefore improving fairness have emerged. In this study, we examine existing evaluation measures of fairness in…

Information Retrieval · Computer Science 2024-05-21 Theresia Veronika Rampisela , Maria Maistro , Tuukka Ruotsalo , Christina Lioma

A Formal Account of Effectiveness Evaluation and Ranking Fusion

This paper proposes a theoretical framework which models the information provided by retrieval systems in terms of Information Theory. The proposed framework allows to formalize: (i) system effectiveness as an information theoretic…

Information Retrieval · Computer Science 2018-09-17 Enrique Amigó , Fernando Giner , Stefano Mizzaro , Damiano Spina

Connecting User and Item Perspectives in Popularity Debiasing for Collaborative Recommendation

Recommender systems learn from historical users' feedback that is often non-uniformly distributed across items. As a consequence, these systems may end up suggesting popular items more than niche items progressively, even when the latter…

Information Retrieval · Computer Science 2020-10-06 Ludovico Boratto , Gianni Fenu , Mirko Marras