English
Related papers

Related papers: Problem Formulation and Fairness

200 papers

Data science has employed great research efforts in developing advanced analytics, improving data models and cultivating new algorithms. However, not many authors have come across the organizational and socio-technical challenges that arise…

Machine Learning · Computer Science 2022-01-17 Iñigo Martinez , Elisabeth Viles , Igor G. Olaizola

In response to public scrutiny of data-driven algorithms, the field of data science has adopted ethics training and principles. Although ethics can help data scientists reflect on certain normative aspects of their work, such efforts are…

Computers and Society · Computer Science 2022-02-02 Ben Green

Data-driven science is an emerging paradigm where scientific discoveries depend on the execution of computational AI models against rich, discipline-specific datasets. With modern machine learning frameworks, anyone can develop and execute…

Machine Learning · Computer Science 2022-08-09 Seth Ockerman , John Wu , Christopher Stewart

Data science requires time-consuming iterative manual activities. In particular, activities such as data selection, preprocessing, transformation, and mining, highly depend on iterative trial-and-error processes that could be sped-up…

While data science has emerged as a contentious new scientific field, enormous debates and discussions have been made on it why we need data science and what makes it as a science. In reviewing hundreds of pieces of literature which include…

Computers and Society · Computer Science 2020-07-01 Longbing Cao

Classification, a heavily-studied data-driven machine learning task, drives an increasing number of prediction systems involving critical human decisions such as loan approval and criminal risk assessment. However, classifiers often…

Machine Learning · Computer Science 2022-04-12 Maliha Tashfia Islam , Anna Fariha , Alexandra Meliou , Babak Salimi

Causal inference from observational data is the goal of many data analyses in the health and social sciences. However, academic statistics has often frowned upon data analyses with a causal objective. The introduction of the term "data…

Machine Learning · Statistics 2019-04-11 Miguel A. Hernán , John Hsu , Brian Healy

The trustworthiness of data science systems in applied and real-world settings emerges from the resolution of specific tensions through situated, pragmatic, and ongoing forms of work. Drawing on research in CSCW, critical data studies, and…

Computers and Society · Computer Science 2020-02-11 Samir Passi , Steven J. Jackson

A suite of impressive scientific discoveries have been driven by recent advances in artificial intelligence. These almost all result from training flexible algorithms to solve difficult optimization problems specified in advance by teams of…

Artificial Intelligence · Computer Science 2024-12-18 Ruairidh M. Battleday , Samuel J. Gershman

The field of data science currently enjoys a broad definition that includes a wide array of activities which borrow from many other established fields of study. Having such a vague characterization of a field in the early stages might be…

Other Statistics · Statistics 2021-05-14 Roger D. Peng , Hilary S. Parker

This manuscript provides a systemic and data-centric view of what we term essential data science, as a natural ecosystem with challenges and missions stemming from the fusion of data universe with its multiple combinations of the 5D…

Machine Learning · Computer Science 2026-01-14 Emilio Porcu , Roy El Moukari , Laurent Najman , Francisco Herrera , Horst Simon

Data analysis requires translating higher level questions and hypotheses into computable statistical models. We present a mixed-methods study aimed at identifying the steps, considerations, and challenges involved in operationalizing…

Other Computer Science · Computer Science 2021-04-08 Eunice Jun , Melissa Birchfield , Nicole de Moura , Jeffrey Heer , Rene Just

In this paper we argue that data science is a coherent and novel approach to empirical problems that, in its most general form, does not build understanding about phenomena. Within the new type of mathematization at work in data science,…

Other Statistics · Statistics 2021-03-31 Domenico Napoletani , Marco Panza , Daniele Struppa

In recent years, data science has become an indispensable part of our society. Over time, we have become reliant on this technology because of its opportunity to gain value and new insights from data in any field - business, socializing,…

Computers and Society · Computer Science 2020-10-28 Dinh-An Ho , Oya Beyan

Background. As artificial intelligence and AI-powered systems continue to grow, the role of data scientists has become essential in software development environments. Data scientists face challenges related to managing large volumes of data…

Software Engineering · Computer Science 2025-01-30 Matheus de Morais Leça , Ronnie de Souza Santos

Data and Science has stood out in the generation of results, whether in the projects of the scientific domain or business domain. CERN Project, Scientific Institutes, companies like Walmart, Google, Apple, among others, need data to present…

General Literature · Computer Science 2022-01-19 Rogerio Rossi

Citations are the cornerstone of knowledge propagation and the primary means of assessing the quality of research, as well as directing investments in science. Science is increasingly becoming "data-intensive", where large volumes of data…

Digital Libraries · Computer Science 2017-09-28 Gianmaria Silvello

The data science revolution has led to an increased interest in the practice of data analysis. While much has been written about statistical thinking, a complementary form of thinking that appears in the practice of data analysis is design…

Methodology · Statistics 2023-05-24 Lucy D'Agostino McGowan , Roger D. Peng , Stephanie C. Hicks

Society's capacity for algorithmic problem-solving has never been greater. Artificial Intelligence is now applied across more domains than ever, a consequence of powerful abstractions, abundant data, and accessible software. As capabilities…

Machine Learning · Statistics 2024-08-20 Kris Sankaran

A fundamental problem in the practice and teaching of data science is how to evaluate the quality of a given data analysis, which is different than the evaluation of the science or question underlying the data analysis. Previously, we…

Other Statistics · Statistics 2019-04-29 Stephanie C. Hicks , Roger D. Peng
‹ Prev 1 2 3 10 Next ›