English
Related papers

Related papers: A Streaming Algorithm for Crowdsourced Data Classi…

200 papers

Crowdsourcing utilizes the wisdom of crowds for collective classification via information (e.g., labels of an item) provided by labelers. Current crowdsourcing algorithms are mainly unsupervised methods that are unaware of the quality of…

Social and Information Networks · Computer Science 2016-11-15 Pin-Yu Chen , Chia-Wei Lien , Fu-Jen Chu , Pai-Shun Ting , Shin-Ming Cheng

We consider the problem of cost-optimal utilization of a crowdsourcing platform for binary, unsupervised classification of a collection of items, given a prescribed error threshold. Workers on the crowdsourcing platform are assumed to be…

Machine Learning · Computer Science 2022-07-06 Yashvardhan Didwania , Jayakrishnan Nair , N. Hemachandra

The evaluation of noisy binary classifiers on unlabeled data is treated as a streaming task: given a data sketch of the decisions by an ensemble, estimate the true prevalence of the labels as well as each classifier's accuracy on them. Two…

Machine Learning · Statistics 2023-09-11 Andrés Corrada-Emmanuel

A key challenge in crowdsourcing is inferring the ground truth from noisy and unreliable data. To do so, existing approaches rely on collecting redundant information from the crowd, and aggregating it with some probabilistic method.…

Machine Learning · Computer Science 2019-11-14 Edoardo Manino , Long Tran-Thanh , Nicholas R. Jennings

We consider crowdsourced labeling under a $d$-type worker-task specialization model, where each worker and task is associated with one particular type among a finite set of types and a worker provides a more reliable answer to tasks of the…

Human-Computer Interaction · Computer Science 2021-06-10 Doyeon Kim , Hye Won Chung

Microtask crowdsourcing has enabled dataset advances in social science and machine learning, but existing crowdsourcing schemes are too expensive to scale up with the expanding volume of data. To scale and widen the applicability of…

Human-Computer Interaction · Computer Science 2016-02-16 Ranjay Krishna , Kenji Hata , Stephanie Chen , Joshua Kravitz , David A. Shamma , Li Fei-Fei , Michael S. Bernstein

With the increased interest in machine learning and big data problems, the need for large amounts of labelled data has also grown. However, it is often infeasible to get experts to label all of this data, which leads many practitioners to…

Machine Learning · Computer Science 2021-05-31 Pierce Burke , Richard Klein

We introduce an unsupervised approach to efficiently discover the underlying features in a data set via crowdsourcing. Our queries ask crowd members to articulate a feature common to two out of three displayed examples. In addition we also…

Machine Learning · Statistics 2015-04-02 James Y. Zou , Kamalika Chaudhuri , Adam Tauman Kalai

In recent years crowdsourcing has become the method of choice for gathering labeled training data for learning algorithms. Standard approaches to crowdsourcing view the process of acquiring labeled data separately from the process of…

Machine Learning · Computer Science 2017-04-17 Pranjal Awasthi , Avrim Blum , Nika Haghtalab , Yishay Mansour

Crowdsourcing platforms emerged as popular venues for purchasing human intelligence at low cost for large volume of tasks. As many low-paid workers are prone to give noisy answers, a common practice is to add redundancy by assigning…

Machine Learning · Computer Science 2018-10-09 Jungseul Ok , Sewoong Oh , Yunhun Jang , Jinwoo Shin , Yung Yi

Crowdsourcing is an online outsourcing mode which can solve the current machine learning algorithm's urge need for massive labeled data. Requester posts tasks on crowdsourcing platforms, which employ online workers over the Internet to…

Human-Computer Interaction · Computer Science 2022-04-28 Guangyang Han , Sufang Li , Runmin Wang , Chunming Wu

Crowdsourcing offers a practical method for ranking and scoring large amounts of items. To investigate the algorithms and incentives that can be used in crowdsourcing quality evaluations, we built CrowdGrader, a tool that lets students…

Social and Information Networks · Computer Science 2013-08-27 Luca de Alfaro , Michael Shavlovsky

Crowdsourcing system has emerged as an effective platform for labeling data with relatively low cost by using non-expert workers. Inferring correct labels from multiple noisy answers on data, however, has been a challenging problem, since…

Human-Computer Interaction · Computer Science 2023-09-14 Doyeon Kim , Jeonghwan Lee , Hye Won Chung

Crowdsourcing has been widely used to efficiently obtain labeled datasets for supervised learning from large numbers of human resources at low cost. However, one of the technical challenges in obtaining high-quality results from…

Human-Computer Interaction · Computer Science 2023-02-28 Ryosuke Ueda , Koh Takeuchi , Hisashi Kashima

Crowdsourcing is a relatively economic and efficient solution to collect annotations from the crowd through online platforms. Answers collected from workers with different expertise may be noisy and unreliable, and the quality of annotated…

Machine Learning · Computer Science 2020-01-08 Jingzheng Tu , Guoxian Yu , Jun Wang , Carlotta Domeniconi , Xiangliang Zhang

The problem of "approximating the crowd" is that of estimating the crowd's majority opinion by querying only a subset of it. Algorithms that approximate the crowd can intelligently stretch a limited budget for a crowdsourcing task. We…

Social and Information Networks · Computer Science 2012-04-17 Seyda Ertekin , Haym Hirsh , Cynthia Rudin

Recent studies have shown that the labels collected from crowdworkers can be discriminatory with respect to sensitive attributes such as gender and race. This raises questions about the suitability of using crowdsourced data for further…

Artificial Intelligence · Computer Science 2019-03-04 Naman Goel , Boi Faltings

In this paper, a high-speed online neural network classifier based on extreme learning machines for multi-label classification is proposed. In multi-label classification, each of the input data sample belongs to one or more than one of the…

Machine Learning · Computer Science 2016-09-06 Rajasekar Venkatesan , Meng Joo Er , Mihika Dave , Mahardhika Pratama , Shiqian Wu

Classification involves the learning of the mapping function that associates input samples to corresponding target label. There are two major categories of classification problems: Single-label classification and Multi-label classification.…

Machine Learning · Computer Science 2016-09-06 Meng Joo Er , Rajasekar Venkatesan , Ning Wang

We introduce a novel algorithm to perform graph clustering in the edge streaming setting. In this model, the graph is presented as a sequence of edges that can be processed strictly once. Our streaming algorithm has an extremely low memory…

Machine Learning · Computer Science 2017-12-13 Alexandre Hollocou , Julien Maudet , Thomas Bonald , Marc Lelarge
‹ Prev 1 2 3 10 Next ›