David Issa Mattos

Bayesian causal inference in automotive software engineering and online evaluation

Randomised field experiments, such as A/B testing, have long been the gold standard for evaluating software changes. In the automotive domain, running randomised field experiments is not always desired, possible, or even ethical. In the…

Software Engineering · Computer Science 2022-07-04 Yuchu Liu, David Issa Mattos, Jan Bosch, Helena Holmström Olsson, Jonn Lantz

Bayesian propensity score matching in automotive embedded software engineering

Randomised field experiments, such as A/B testing, have long been the gold standard for evaluating the value that new software brings to customers. However, running randomised field experiments is not always desired, possible or even…

Software Engineering · Computer Science 2022-07-04 Yuchu Liu, David Issa Mattos, Jan Bosch, Helena Holmström Olsson, Jonn Lantz

On the Use of Causal Graphical Models for Designing Experiments in the Automotive Domain

Randomized field experiments are the gold standard for evaluating the impact of software changes on customers. In the online domain, randomization has been the main tool to ensure exchangeability. However, due to the different deployment…

Software Engineering · Computer Science 2022-04-26 David Issa Mattos, Yuchu Liu

Size matters? Or not: A/B testing with limited sample in automotive embedded software

A/B testing is gaining attention in the automotive sector as a promising tool to measure causal effects from software changes. Different from the web-facing businesses, where A/B testing has been well-established, the automotive domain…

Software Engineering · Computer Science 2021-11-12 Yuchu Liu, David Issa Mattos, Jan Bosch, Helena Holmström Olsson, Jonn Lantz

Bayesian Paired-Comparison with the bpcs Package

This article introduces the bpcs R package (Bayesian Paired Comparison in Stan) and the statistical models implemented in the package. This package aims to facilitate the use of Bayesian models for paired comparison data in behavioral…

Methodology · Statistics 2021-09-21 David Issa Mattos, Érika Martins Silva Ramos

Statistical Models for the Analysis of Optimization Algorithms with Benchmark Functions

Frequentist statistical methods, such as hypothesis testing, are standard practice in papers that provide benchmark comparisons. Unfortunately, these methods have often been misused, e.g., without testing for their statistical test…

Methodology · Statistics 2021-05-18 David Issa Mattos, Jan Bosch, Helena Holmström Olsson

On the Assessment of Benchmark Suites for Algorithm Comparison

Benchmark suites, i.e. a collection of benchmark functions, are widely used in the comparison of black-box optimization algorithms. Over the years, research has identified many desired qualities for benchmark suites, such as diverse…

Neural and Evolutionary Computing · Computer Science 2021-04-16 David Issa Mattos, Lucas Ruud, Jan Bosch, Helena Holmström Olsson

Engineering for a Science-Centric Experimentation Platform

Netflix is an internet entertainment service that routinely employs experimentation to guide strategy around product innovations. As Netflix grew, it had the opportunity to explore increasingly specialized improvements to its service, which…

Software Engineering · Computer Science 2019-10-10 Nikos Diamantopoulos, Jeffrey Wong, David Issa Mattos, Ilias Gerostathopoulos, Matthew Wardrop, Tobias Mao, Colin McFarland