Related papers: Reprowd: Crowdsourced Data Processing Made Reprodu…

The Reproducible Research Platform establishes a unified open science environment bridging data and software lifecycles across disciplines, from proposal to publication

Many research groups aspire to make data and code FAIR and reproducible, yet struggle because the data and code life cycles are disconnected, executable environments are often missing from published work, and technical skill requirements…

Digital Libraries · Computer Science 2025-12-09 Andreas P. Cuny , Henry Lütcke , Andrei-Valentin Plamadă , Antti Luomi , John Hennig , Matthew Baker , Fabian Rudolf , Bernd Rinn

Quo Vadis, HCOMP? A Review of 12 Years of Research at the Frontier of Human Computation and Crowdsourcing

The field of human computation and crowdsourcing has historically studied how tasks can be outsourced to humans. However, many tasks previously distributed to human crowds can today be completed by generative AI with human-level abilities,…

Computers and Society · Computer Science 2025-04-03 Jonas Oppenlaender , Ujwal Gadiraju , Simo Hosio

Optimizing Open-Ended Crowdsourcing: The Next Frontier in Crowdsourced Data Management

Crowdsourcing is the primary means to generate training data at scale, and when combined with sophisticated machine learning algorithms, crowdsourcing is an enabler for a variety of emergent automated applications impacting all spheres of…

Human-Computer Interaction · Computer Science 2016-10-19 Aditya Parameswaran , Akash Das Sarma , Vipul Venkataraman

Crowd-Powered Data Mining

Many data mining tasks cannot be completely addressed by auto- mated processes, such as sentiment analysis and image classification. Crowdsourcing is an effective way to harness the human cognitive ability to process these machine-hard…

Databases · Computer Science 2018-10-22 Chengliang Chai , Ju Fan , Guoliang Li , Jiannan Wang , Yudian Zheng

Crowdsourcing for Bioinformatics

Motivation: Bioinformatics is faced with a variety of problems that require human involvement. Tasks like genome annotation, image analysis, knowledge-base construction and protein structure determination all benefit from human input. In…

Quantitative Methods · Quantitative Biology 2013-07-01 Benjamin M. Good , Andrew I. Su

From Crowdsourcing to Crowdmining: Using Implicit Human Intelligence for Better Understanding of Crowdsourced Data

With the development of mobile social networks, more and more crowdsourced data are generated on the Web or collected from real-world sensing. The fragment, heterogeneous, and noisy nature of online/offline crowdsourced data, however, makes…

Human-Computer Interaction · Computer Science 2019-08-08 Bin Guo , Huihui Chen , Yan Liu , Chao Chen , Qi Han , Zhiwen Yu

CrowdHub: Extending crowdsourcing platforms for the controlled evaluation of tasks designs

We present CrowdHub, a tool for running systematic evaluations of task designs on top of crowdsourcing platforms. The goal is to support the evaluation process, avoiding potential experimental biases that, according to our empirical…

Human-Computer Interaction · Computer Science 2019-09-11 Jorge Ramírez , Simone Degiacomi , Davide Zanella , Marcos Baez , Fabio Casati , Boualem Benatallah

CrowdFusion: A Crowdsourced Approach on Data Fusion Refinement

Data fusion has played an important role in data mining because high-quality data is required in a lot of applications. As on-line data may be out-of-date and errors in the data may propagate with copying and referring between sources, it…

Databases · Computer Science 2017-02-03 Yunfan Chen , Lei Chen , Chen Jason Zhang

Reproducibility, Replicability, and Repeatability: A survey of reproducible research with a focus on high performance computing

Reproducibility is widely acknowledged as a fundamental principle in scientific research. Currently, the scientific community grapples with numerous challenges associated with reproducibility, often referred to as the ''reproducibility…

Software Engineering · Computer Science 2024-09-16 Benjamin A. Antunes , David R. C. Hill

Knowledge Learning with Crowdsourcing: A Brief Review and Systematic Perspective

Big data have the characteristics of enormous volume, high velocity, diversity, value-sparsity, and uncertainty, which lead the knowledge learning from them full of challenges. With the emergence of crowdsourcing, versatile information can…

Machine Learning · Computer Science 2022-06-22 Jing Zhang

Reproducibility in Research: Systems, Infrastructure, Culture

The reproduction and replication of research results has become a major issue for a number of scientific disciplines. In computer science and related computational disciplines such as systems biology, the challenges closely revolve around…

Software Engineering · Computer Science 2017-07-31 Tom Crick , Benjamin A. Hall , Samin Ishtiaq

Crowdsourcing: a new tool for policy-making?

Crowdsourcing is rapidly evolving and applied in situations where ideas, labour, opinion or expertise of large groups of people are used. Crowdsourcing is now used in various policy-making initiatives; however, this use has usually focused…

Computers and Society · Computer Science 2018-02-12 Araz Taeihagh

ReproServer: Making Reproducibility Easier and Less Intensive

Reproducibility in the computational sciences has been stymied because of the complex and rapidly changing computational environments in which modern research takes place. While many will espouse reproducibility as a value, the challenge of…

Software Engineering · Computer Science 2018-08-07 Remi Rampin , Fernando Chirigati , Vicky Steeves , Juliana Freire

Advancing computational reproducibility in the Dataverse data repository platform

Recent reproducibility case studies have raised concerns showing that much of the deposited research has not been reproducible. One of their conclusions was that the way data repositories store research data and code cannot fully facilitate…

Digital Libraries · Computer Science 2020-06-18 Ana Trisovic , Philip Durbin , Tania Schlatter , Gustavo Durand , Sonia Barbosa , Danny Brooke , Mercè Crosas

An Overview of Query Processing on Crowdsourced Databases

Crowd-sourcing is a powerful solution for finding correct answers to expensive and unanswered queries in databases, including those with uncertain and incomplete data. Attempts to use crowd-sourcing to exploit human abilities to process…

Databases · Computer Science 2022-04-19 Marwa B. Swidan , Ali A. Alwan , Yonis Gulzar , Abedallah Zaid Abualkishik

Reference environments: A universal tool for reproducibility in computational biology

The drive for reproducibility in the computational sciences has provoked discussion and effort across a broad range of perspectives: technological, legislative/policy, education, and publishing. Discussion on these topics is not new, but…

Quantitative Methods · Quantitative Biology 2018-10-10 Daniel G. Hurley , Joseph Cursons , Matthew Faria , David M. Budden , Vijay Rajagopal , Edmund J. Crampin

Reproducibility via Crowdsourced Reverse Engineering: A Neural Network Case Study With DeepMind's Alpha Zero

The reproducibility of scientific findings are an important hallmark of quality and integrity in research. The scientific method requires hypotheses to be subjected to the most crucial tests, and for the results to be consistent across…

Computers and Society · Computer Science 2019-09-11 Dustin Tanksley , Donald C. Wunsch

Easy, Reproducible and Quality-Controlled Data Collection with Crowdaq

High-quality and large-scale data are key to success for AI systems. However, large-scale data annotation efforts are often confronted with a set of common challenges: (1) designing a user-friendly annotation interface; (2) training enough…

Human-Computer Interaction · Computer Science 2020-10-15 Qiang Ning , Hao Wu , Pradeep Dasigi , Dheeru Dua , Matt Gardner , Robert L. Logan , Ana Marasovic , Zhen Nie

RoboCrowd: Scaling Robot Data Collection through Crowdsourcing

In recent years, imitation learning from large-scale human demonstrations has emerged as a promising paradigm for training robot policies. However, the burden of collecting large quantities of human demonstrations is significant in terms of…

Robotics · Computer Science 2025-05-22 Suvir Mirchandani , David D. Yuan , Kaylee Burns , Md Sazzad Islam , Tony Z. Zhao , Chelsea Finn , Dorsa Sadigh

Reproducible Workflow

Reproducibility has been consistently identified as an important component of scientific research. Although there is a general consensus on the importance of reproducibility along with the other commonly used 'R' terminology (i.e.,…

Information Theory · Computer Science 2020-12-29 Anirudh Prabhu , Peter Fox