English
Related papers

Related papers: Open Data Quality

200 papers

Nowadays open data is entering the mainstream - it is free available for every stakeholder and is often used in business decision-making. It is important to be sure data is trustable and error-free as its quality problems can lead to huge…

Databases · Computer Science 2023-01-06 Anastasija Nikiforova

The digital transformation of our society is a constant challenge, as data is generated in almost every digital interaction. To use data effectively, it must be of high quality. This raises the question: what exactly is data quality? A…

Databases · Computer Science 2025-04-03 Markus Matoni , Arno Kesper , Gabriele Taentzer

Data quality describes the degree to which data meet specific requirements and are fit for use by humans and/or downstream tasks (e.g., artificial intelligence). Data quality can be assessed across multiple high-level concepts called…

Databases · Computer Science 2025-07-24 Vasileios Papastergios , Lisa Ehrlinger , Anastasios Gounaris

One of the most significant problems of Big Data is to extract knowledge through the huge amount of data. The usefulness of the extracted information depends strongly on data quality. In addition to the importance, data quality has recently…

Databases · Computer Science 2020-05-25 Mostafa Mirzaie , Behshid Behkamal , Samad Paydar

Data-oriented applications, their users, and even the law require data of high quality. Research has divided the rather vague notion of data quality into various dimensions, such as accuracy, consistency, and reputation. To achieve the goal…

Databases · Computer Science 2024-12-09 Sedir Mohammed , Lisa Ehrlinger , Hazar Harmouch , Felix Naumann , Divesh Srivastava

Data catalogs play a crucial role in modern data-driven organizations by facilitating the discovery, understanding, and utilization of diverse data assets. However, ensuring their quality and reliability is complex, especially in open and…

Information Retrieval · Computer Science 2025-07-18 Jorge Martinez-Gil

Data warehousing is continuously gaining importance as organizations are realizing the benefits of decision oriented data bases. However, the stumbling block to this rapid development is data quality issues at various stages of data…

Databases · Computer Science 2013-10-09 Vinay Kumar , Reema Thareja

In the distributed and dynamic framework of the Web, data quality is a big challenge. The Linked Open Data (LOD) provides an enormous amount of data, the quality of which is difficult to control. Quality is intrinsically a matter of usage,…

Databases · Computer Science 2021-07-14 Jacques Chabin , Mirian Halfeld-Ferrari , Béatrice Markhoff , Thanh Binh Nguyen

High-quality data is key to interpretable and trustworthy data analytics and the basis for meaningful data-driven decisions. In practical scenarios, data quality is typically associated with data preprocessing, profiling, and cleansing for…

Databases · Computer Science 2019-07-19 Lisa Ehrlinger , Elisa Rusz , Wolfram Wöß

Data quality is a key element for building and optimizing good learning models. Despite many attempts to characterize data quality, there is still a need for rigorous formalization and an efficient measure of the quality from available…

Machine Learning · Computer Science 2023-12-14 Jouseau Roxane , Salva Sébastien , Samir Chafik

This paper presents a framework for assessing data and metadata quality within Open Data portals. Although a few benchmark frameworks already exist for this purpose, they are not yet detailed enough in both breadth and depth to make valid…

Information Retrieval · Computer Science 2021-06-18 Lisa Wenige , Claus Stadler , Michael Martin , Richard Figura , Robert Sauter , Christopher W. Frank

Data is of high quality if it is fit for its intended use. The quality of data is influenced by the underlying data model and its quality. One major quality problem is the heterogeneity of data as quality aspects such as understandability…

Machine Learning · Computer Science 2021-11-15 Viola Wenz , Arno Kesper , Gabriele Taentzer

Software quality assurance has been a heated topic for several decades, but relatively few analyses were performed on open source software (OSS). As OSS has become very popular in our daily life, many researchers have been keen on the…

Software Engineering · Computer Science 2015-07-27 Jie Xu , Luiz Fernando Capretz , Danny Ho

In order to introduce an integrated research information system, this will provide scientific institutions with the necessary information on research activities and research results in assured quality. Since data collection, duplication,…

Databases · Computer Science 2019-01-23 Otmane Azeroual , Mohammad Abuosba

With the rapid increase of published open datasets, it is crucial to support the open data progress in smart cities while considering the open data quality. In the Czech Republic, and its National Open Data Catalogue (NODC), the open…

Databases · Computer Science 2023-03-06 Dasa Kusnirakova , Mouzhi Ge , Leonard Walletzky , Barbora Buhnova

Data completeness is an essential aspect of data quality, and has in turn a huge impact on the effective management of companies. For example, statistics are computed and audits are conducted in companies by implicitly placing the strong…

Databases · Computer Science 2013-06-10 Simon Razniewski , Marco Montali , Werner Nutt

This report discusses the issues of data quality in biobanks. It presents the state-of-the-art in data quality: the definition of data quality, the dimensions of data quality, and the quality management system for achieving or describing…

Computers and Society · Computer Science 2018-12-27 Suneth Ranasinghe , Horst Pichler , Johann Eder

A fundamental problem in the practice and teaching of data science is how to evaluate the quality of a given data analysis, which is different than the evaluation of the science or question underlying the data analysis. Previously, we…

Other Statistics · Statistics 2019-04-29 Stephanie C. Hicks , Roger D. Peng

Open data is an emerging paradigm to share large and diverse datasets -- primarily from governmental agencies, but also from other organizations -- with the goal to enable the exploitation of the data for societal, academic, and commercial…

Software Engineering · Computer Science 2012-02-09 Holger M. Kienle

Data is one of the most important assets of the information age, and its societal impact is undisputed. Yet, rigorous methods of assessing the quality of data are lacking. In this paper, we propose a formal definition for the quality of a…

Machine Learning · Computer Science 2020-05-13 Netanel Raviv , Siddharth Jain , Jehoshua Bruck
‹ Prev 1 2 3 10 Next ›