English
Related papers

Related papers: Data Curation with Deep Learning [Vision]

200 papers

Over the past years, there has been many efforts to curate and increase the added value of the raw data. Data curation has been defined as activities and processes an analyst undertakes to transform the raw data into contextualized data and…

Information Retrieval · Computer Science 2020-07-20 Alireza Tabebordbar

Data cleaning is the initial stage of any machine learning project and is one of the most critical processes in data analysis. It is a critical step in ensuring that the dataset is devoid of incorrect or erroneous data. It can be done…

Databases · Computer Science 2021-09-16 Ga Young Lee , Lubna Alzamil , Bakhtiyar Doskenov , Arash Termehchy

Understanding and analyzing big data is firmly recognized as a powerful and strategic priority. For deeper interpretation of and better intelligence with big data, it is important to transform raw data (unstructured, semi-structured and…

Information Retrieval · Computer Science 2016-12-13 Seyed-Mehdi-Reza Beheshti , Alireza Tabebordbar , Boualem Benatallah , Reza Nouri

Data Cleaning refers to the process of detecting and fixing errors in the data. Human involvement is instrumental at several stages of this process, e.g., to identify and repair errors, to validate computed repairs, etc. There is currently…

Databases · Computer Science 2018-01-03 El Kindi Rezig , Mourad Ouzzani , Ahmed K. Elmagarmid , Walid G. Aref

Data-centric AI is at the center of a fundamental shift in software engineering where machine learning becomes the new software, powered by big data and computing infrastructure. Here software engineering needs to be re-thought where data…

Machine Learning · Computer Science 2022-12-27 Steven Euijong Whang , Yuji Roh , Hwanjun Song , Jae-Gil Lee

Over the past few years, we have seen fundamental breakthroughs in core problems in machine learning, largely driven by advances in deep neural networks. At the same time, the amount of data collected in a wide array of scientific domains…

Machine Learning · Computer Science 2020-03-27 Maithra Raghu , Eric Schmidt

Deep learning has recently become very popular on account of its incredible success in many complex data-driven applications, such as image classification and speech recognition. The database community has worked on data-driven applications…

Databases · Computer Science 2020-01-22 Wei Wang , Meihui Zhang , Gang Chen , H. V. Jagadish , Beng Chin Ooi , Kian-Lee Tan

Data curation is the process of making a dataset fit-for-use and archiveable. It is critical to data-intensive science because it makes complex data pipelines possible, makes studies reproducible, and makes data (re)usable. Yet the…

Data collection is a major bottleneck in machine learning and an active research topic in multiple communities. There are largely two reasons data collection has recently become a critical issue. First, as machine learning is becoming more…

Machine Learning · Computer Science 2019-08-13 Yuji Roh , Geon Heo , Steven Euijong Whang

Studies of dataset development in machine learning call for greater attention to the data practices that make model development possible and shape its outcomes. Many argue that the adoption of theory and practices from archives and data…

Computers and Society · Computer Science 2024-05-07 Eshta Bhardwaj , Harshit Gujral , Siyi Wu , Ciara Zogheib , Tegan Maharaj , Christoph Becker

Deep learning is one of the new and important branches in machine learning. Deep learning refers to a set of algorithms that solve various problems such as images and texts by using various machine learning algorithms in multi-layer neural…

Computer Vision and Pattern Recognition · Computer Science 2019-01-10 Yang Li , Sangwhan Cha

Data curation is the problem of how to collect and organize samples into a dataset that supports efficient learning. Despite the centrality of the task, little work has been devoted towards a large-scale, systematic comparison of various…

Computer Vision and Pattern Recognition · Computer Science 2024-10-08 Benjamin Feuer , Jiawei Xu , Niv Cohen , Patrick Yubeaton , Govind Mittal , Chinmay Hegde

Clustering is a fundamental machine learning task which has been widely studied in the literature. Classic clustering methods follow the assumption that data are represented as features in a vectorized form through various representation…

Machine Learning · Computer Science 2022-06-16 Sheng Zhou , Hongjia Xu , Zhuonan Zheng , Jiawei Chen , Zhao li , Jiajun Bu , Jia Wu , Xin Wang , Wenwu Zhu , Martin Ester

Deep Learning is one of the newest trends in Machine Learning and Artificial Intelligence research. It is also one of the most popular scientific research trends now-a-days. Deep learning methods have brought revolutionary advances in…

Machine Learning · Computer Science 2018-07-24 Matiur Rahman Minar , Jibon Naher

NLP community is currently investing a lot more research and resources into development of deep learning models than training data. While we have made a lot of progress, it is now clear that our models learn all kinds of spurious patterns,…

Computation and Language · Computer Science 2021-05-31 Anna Rogers

Advancements in artificial intelligence, machine learning, and deep learning have catalyzed the transformation of big data analytics and management into pivotal domains for research and application. This work explores the theoretical…

Scene classification, aiming at classifying a scene image to one of the predefined scene categories by comprehending the entire image, is a longstanding, fundamental and challenging problem in computer vision. The rise of large-scale…

Computer Vision and Pattern Recognition · Computer Science 2021-02-23 Delu Zeng , Minyu Liao , Mohammad Tavakolian , Yulan Guo , Bolei Zhou , Dewen Hu , Matti Pietikäinen , Li Liu

Deep learning, a branch of artificial intelligence, is a data-driven method that uses multiple layers of interconnected units or neurons to learn intricate patterns and representations directly from raw input data. Empowered by this…

Machine Learning · Computer Science 2025-07-28 Mohd Halim Mohd Noor , Ayokunle Olalekan Ige

Artificial intelligence has made remarkable progress in handling complex tasks, thanks to advances in hardware acceleration and machine learning algorithms. However, to acquire more accurate outcomes and solve more complex issues,…

Machine Learning · Computer Science 2023-09-12 Mohammad Dehghani , Zahra Yazdanparast

Self-supervised features are the cornerstone of modern machine learning systems. They are typically pre-trained on data collections whose construction and curation typically require extensive human effort. This manual process has some…

‹ Prev 1 2 3 10 Next ›