Mark D. Reid — Scifaro

Fast rates in statistical and online learning

The speed with which a learning algorithm converges as it is presented with more data is a central problem in machine learning --- a fast rate of convergence means less data is needed for the same level of performance. The pursuit of fast…

Machine Learning · Computer Science 2021-08-31 Tim van Erven, Peter D. Grünwald, Nishant A. Mehta, Mark D. Reid, Robert C. Williamson

Causal Bandits: Learning Good Interventions via Causal Inference

We study the problem of using causal models to improve the rate at which good interventions can be learned online in a stochastic environment. Our formalism combines multi-arm bandits and causal inference to model a novel type of bandit…

Machine Learning · Statistics 2016-06-13 Finnian Lattimore, Tor Lattimore, Mark D. Reid

Compliance-Aware Bandits

Motivated by clinical trials, we study bandits with observable non-compliance. At each step, the learner chooses an arm; afterwards, instead of observing only the reward, it also observes the action that took place. We show that such…

Machine Learning · Statistics 2016-02-10 Nicolás Della Penna, Mark D. Reid, David Balduzzi

Risk Dynamics in Trade Networks

We introduce a new framework to model interactions among agents which seek to trade to minimize their risk with respect to some future outcome. We quantify this risk using the concept of risk measures from finance, and introduce a class of…

Computer Science and Game Theory · Computer Science 2014-10-13 Rafael M. Frongillo, Mark D. Reid

Generalized Mixability via Entropic Duality

Mixability is a property of a loss which characterizes when fast convergence is possible in the game of prediction with expert advice. We show that a key property of mixability generalizes, and the exp and log operations present in the…

Machine Learning · Computer Science 2014-06-25 Mark D. Reid, Rafael M. Frongillo, Robert C. Williamson, Nishant Mehta

Generalised Mixability, Constant Regret, and Bayesian Updating

Mixability of a loss is known to characterise when constant regret bounds are achievable in games of prediction with expert advice through the use of Vovk's aggregating algorithm. We provide a new interpretation of mixability via convex…

Machine Learning · Computer Science 2014-03-12 Mark D. Reid, Rafael M. Frongillo, Robert C. Williamson

Bandit Market Makers

We introduce a modular framework for market making. It combines cost-function based automated market makers with bandit algorithms. We obtain worst-case profit guarantees relative to the best in hindsight within a class of natural…

Trading and Market Microstructure · Quantitative Finance 2013-08-05 Nicolas Della Penna, Mark D. Reid

AOSO-LogitBoost: Adaptive One-Vs-One LogitBoost for Multi-Class Problem

This paper presents an improvement to model learning when using multi-class LogitBoost for classification. Motivated by the statistical view, LogitBoost can be seen as additive tree regression. Two important factors in this setting are: 1)…

Machine Learning · Statistics 2012-07-05 Peng Sun, Mark D. Reid, Jie Zhou

Crowd & Prejudice: An Impossibility Theorem for Crowd Labelling without a Gold Standard

A common use of crowd sourcing is to obtain labels for a dataset. Several algorithms have been proposed to identify uninformative members of the crowd so that their labels can be disregarded and the cost of paying them avoided. One common…

Social and Information Networks · Computer Science 2012-04-17 Nicolás Della Penna, Mark D. Reid

Conditional Random Fields and Support Vector Machines: A Hybrid Approach

We propose a novel hybrid loss for multiclass and structured prediction problems that is a convex combination of log loss for Conditional Random Fields (CRFs) and a multiclass hinge loss for Support Vector Machines (SVMs). We provide a…

Machine Learning · Computer Science 2010-09-20 Qinfeng Shi, Mark D. Reid, Tiberio Caetano

Composite Binary Losses

We study losses for binary classification and class probability estimation and extend the understanding of them from margin losses to general composite losses which are the composition of a proper loss with a link function. We characterise…

Machine Learning · Statistics 2009-12-18 Mark D. Reid, Robert C. Williamson

Generalised Pinsker Inequalities

We generalise the classical Pinsker inequality which relates variational divergence to Kullback-Leibler divergence in two ways: we consider arbitrary f-divergences in place of KL divergence, and we assume knowledge of a sequence of values…

Information Theory · Computer Science 2009-06-09 Mark D. Reid, Robert C. Williamson

Information, Divergence and Risk for Binary Experiments

We unify f-divergences, Bregman divergences, surrogate loss bounds (regret bounds), proper scoring rules, matching losses, cost curves, ROC-curves and information. We do this by systematically studying integral and variational…

Machine Learning · Statistics 2009-01-06 Mark D. Reid, Robert C. Williamson