Related papers: DQSOps: Data Quality Scoring Operations Framework …

200 papers

Within data-driven artificial intelligence (AI) systems for industrial applications, ensuring the reliability of the incoming data streams is an integral part of trustworthy decision-making. An approach to assess data validity is data…

Databases · Computer Science 2024-08-14 Firas Bayram, Bestoun S. Ahmed, Erik Hallin

High-quality data is critical to train performant Machine Learning (ML) models, highlighting the importance of Data Quality Management (DQM). Existing DQM schemes often cannot satisfactorily improve ML performance because, by design, they…

Machine Learning · Computer Science 2021-06-11 Tianhao Wang, Yi Zeng, Ming Jin, Ruoxi Jia

DevOps is a modern software engineering paradigm that is gaining widespread adoption in industry. The goal of DevOps is to bring software changes into production with a high frequency and fast feedback cycles. This conflicts with software…

In regulated domains such as finance, the integrity and governance of data pipelines are critical - yet existing systems treat data quality control (QC) as an isolated preprocessing step rather than a first-class system component. We…

Computational Finance · Quantitative Finance 2025-12-08 Devender Saini, Bhavika Jain, Nitish Ujjwal, Philip Sommer, Dan Romuald Mbanga, Dhagash Mehta

While high data quality (DQ) is critical for analytics, compliance, and AI performance, data quality management (DQM) remains a complex, resource-intensive, and often manual process. This study investigates the extent to which existing…

Databases · Computer Science 2025-06-30 Heidi Carolina Tamm, Anastasija Nikiforova

We outline a comprehensive framework for artificial intelligence (AI) Application Operations (AIAppOps), based on real-world experiences from diverse organizations. Data-driven projects pose additional challenges to organizations due to…

Computers and Society · Computer Science 2026-01-13 Daniel Jönsson, Mattias Tiger, Stefan Ekberg, Daniel Jakobsson, Mattias Jonhede, Fredrik Viksten

DevOps is a quite effective approach for managing software development and operation, as confirmed by plenty of success stories in real applications and case studies. DevOps is now becoming the mainstream solution adopted by the software…

Software Engineering · Computer Science 2019-10-09 Antonio Capizzi, Salvatore Distefano, Manuel Mazzara

The proliferation of SQL for data processing has often occurred without the rigor of traditional software development, leading to siloed efforts, logic replication, and increased risk. This ad-hoc approach hampers data governance and makes…

Systems and Control · Electrical Eng. & Systems 2025-11-18 Dmytro Valiaiev

Big data analytics (BDA) applications use machine learning algorithms to extract valuable insights from large, fast, and heterogeneous data sources. New software engineering challenges for BDA applications include ensuring performance…

Software Engineering · Computer Science 2020-11-18 Camilo Castellanos, Carlos A. Varela, Dario Correal

Requirements engineering is known to be a key factor for the success of software projects. Within this discipline, goal-oriented requirements engineering approaches have proven especially suitable for projects where it is necessary…

Software Engineering · Computer Science 2009-06-18 Cristina Cachero, Jesús Pardillo

Quality requirements typically differ among software features, e.g., due to different usage contexts of the features, different impacts of related quality deficiencies onto overall user satisfaction, or long-term plans of the developing…

Software Engineering · Computer Science 2022-03-08 Philipp Haindl, Reinhold Plösch

This paper presents a theoretical framework for an AI-driven data quality monitoring system designed to address the challenges of maintaining data quality in high-volume environments. We examine the limitations of traditional methods in…

Artificial Intelligence (AI) has recently attracted a lot of attention, transitioning from research labs to a wide range of successful deployments in many fields, which is particularly true for Deep Learning (DL) techniques. Ultimately, DL…

Artificial Intelligence · Computer Science 2022-03-01 Lixuan Yang, Dario Rossi

Data-centric AI has shed light on the significance of data within the machine learning (ML) pipeline. Recognizing its significance, academia, industry, and government departments have suggested various NLP data research initiatives. While…

Databases · Computer Science 2023-06-27 Eujeong Choi, Chanjun Park

The increasing energy demands and carbon footprint of large-scale AI require intelligent workload management in globally distributed data centers. Yet progress is limited by the absence of benchmarks that realistically capture the interplay…

Poor data quality limits the advantageous power of Machine Learning (ML) and weakens high-performing ML software systems. Nowadays, data are more prone to the risk of poor quality due to their increasing volume and complexity. Therefore,…

Machine Learning · Computer Science 2025-02-20 Manal Rahal, Bestoun S. Ahmed, Gergely Szabados, Torgny Fornstedt, Jorgen Samuelsson

Approaches to enhancing data quality (DQ) are classified into two main categories: data- and process-driven. However, prior research has predominantly utilized batch data preprocessing within the data-driven framework, which often proves…

Human-Computer Interaction · Computer Science 2025-07-17 Hyein Hong, Sangbong Yoo, SeokHwan Choi, Jisue Kim, Seongbum Seo, Haneol Cho, Chansoo Kim, Yun Jang

Neural language models have achieved human level performance across several NLP datasets. However, recent studies have shown that these models are not truly learning the desired task; rather, their high performance is attributed to…

Computation and Language · Computer Science 2020-05-05 Swaroop Mishra, Anjana Arunkumar, Bhavdeep Sachdeva, Chris Bryan, Chitta Baral

The widespread adoption of big data has ushered in a new era of data-driven decision-making, transforming numerous industries and sectors. However, the efficacy of these decisions hinges on the quality of the underlying data. Poor data…

Artificial Intelligence · Computer Science 2024-05-08 Widad Elouataoui

The performance of machine learning models depends heavily on training data. The scarcity of large-scale, well-annotated datasets poses significant challenges in creating robust models. To address this, synthetic data generated through…

Computer Vision and Pattern Recognition · Computer Science 2025-10-09 Ayush Zenith, Arnold Zumbrun, Neel Raut, Jing Lin