Related papers: Data Synthesis for Testing Black-Box Machine Learn…

Outline of an Independent Systematic Blackbox Test for ML-based Systems

This article proposes a test procedure that can be used to test ML models and ML-based systems independently of the actual training process. In this way, the typical quality statements such as accuracy and precision of these models and…

Machine Learning · Computer Science 2024-06-21 Hans-Werner Wiesbrock, Jürgen Großmann

Testing Monotonicity of Machine Learning Models

Today, machine learning (ML) models are increasingly applied in decision making. This induces an urgent need for quality assurance of ML models with respect to (often domain-dependent) requirements. Monotonicity is one such requirement. It…

Machine Learning · Computer Science 2020-02-28 Arnab Sharma, Heike Wehrheim

A General Framework for Data-Use Auditing of ML Models

Auditing the use of data in training machine-learning (ML) models is an increasingly pressing challenge, as myriad ML practitioners routinely leverage the effort of content creators to train models without their permission. In this paper,…

Cryptography and Security · Computer Science 2025-01-28 Zonghao Huang, Neil Zhenqiang Gong, Michael K. Reiter

Beyond Model Collapse: Scaling Up with Synthesized Data Requires Verification

Large Language Models (LLMs) are increasingly trained on data generated by other LLMs, either because generated text and images become part of the pre-training corpus, or because synthesized data is used as a replacement for expensive…

Machine Learning · Computer Science 2024-10-28 Yunzhen Feng, Elvis Dohmatob, Pu Yang, Francois Charton, Julia Kempe

Bridging the Generalisation Gap: Synthetic Data Generation for Multi-Site Clinical Model Validation

Ensuring the generalisability of clinical machine learning (ML) models across diverse healthcare settings remains a significant challenge due to variability in patient demographics, disease prevalence, and institutional practices. Existing…

Machine Learning · Computer Science 2025-04-30 Bradley Segal, Joshua Fieggen, David Clifton, Lei Clifton

Testing Framework for Black-box AI Models

With widespread adoption of AI models for important decision making, ensuring reliability of such models remains an important challenge. In this paper, we present an end-to-end generic framework for testing AI Models which performs…

Machine Learning · Computer Science 2021-02-12 Aniya Aggarwal, Samiulla Shaikh, Sandeep Hans, Swastik Haldar, Rema Ananthanarayanan, Diptikalyan Saha

An empirical study of testing machine learning in the wild

Recently, machine and deep learning (ML/DL) algorithms have been increasingly adopted in many software systems. Due to their inductive nature, ensuring the quality of these systems remains a significant challenge for the research community.…

Software Engineering · Computer Science 2024-07-16 Moses Openja, Foutse Khomh, Armstrong Foundjem, Zhen Ming Jiang, Mouna Abidi, Ahmed E. Hassan

Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs

Training large language models (LLMs) for external tool usage is a rapidly expanding field, with recent research focusing on generating synthetic data to address the shortage of available data. However, the absence of systematic data…

Machine Learning · Computer Science 2024-09-27 Shadi Iskander, Nachshon Cohen, Zohar Karnin, Ori Shapira, Sofia Tolmach

Using Synthetic Data to estimate the True Error is theoretically and practically doable

Accurately evaluating model performance is crucial for deploying machine learning systems in real-world applications. Traditional methods often require a sufficiently large labeled test set to ensure a reliable evaluation. However, in many…

Machine Learning · Computer Science 2025-11-04 Hai Hoang Thanh, Duy-Tung Nguyen, Hung The Tran, Khoat Than

Model-checking Driven Black-box Testing Algorithms for Systems with Unspecified Components

Component-based software development has posed a serious challenge to system verification since externally obtained components could be a new source of system failures. This issue cannot be completely solved by either model-checking or…

Software Engineering · Computer Science 2016-08-31 Gaoyan Xie, Zhe Dang

Large Language Models Synergize with Automated Machine Learning

Recently, program synthesis driven by large language models (LLMs) has become increasingly popular. However, program synthesis for machine learning (ML) tasks still poses significant challenges. This paper explores a novel form of program…

Software Engineering · Computer Science 2024-09-10 Jinglue Xu, Jialong Li, Zhen Liu, Nagar Anthel Venkatesh Suryanarayanan, Guoyuan Zhou, Jia Guo, Hitoshi Iba, Kenji Tei

Towards Understanding Bias in Synthetic Data for Evaluation

Test collections are crucial for evaluating Information Retrieval (IR) systems. Creating a diverse set of user queries for these collections can be challenging, and obtaining relevance judgments, which indicate how well retrieved documents…

Information Retrieval · Computer Science 2025-10-07 Hossein A. Rahmani, Varsha Ramineni, Emine Yilmaz, Nick Craswell, Bhaskar Mitra

Generating Synthetic Satellite Imagery With Deep-Learning Text-to-Image Models -- Technical Challenges and Implications for Monitoring and Verification

Novel deep-learning (DL) architectures have reached a level where they can generate digital media, including photorealistic images, that are difficult to distinguish from real data. These technologies have already been used to generate…

Computer Vision and Pattern Recognition · Computer Science 2024-04-12 Tuong Vy Nguyen, Alexander Glaser, Felix Biessmann

Model Learning: A Survey on Foundation, Tools and Applications

The quality and correct functioning of software components embedded in electronic systems are of utmost concern especially for safety and mission-critical systems. Model-based testing and formal verification techniques can be employed to…

Formal Languages and Automata Theory · Computer Science 2019-01-08 Shahbaz Ali, Hailong Sun, Yongwang Zhao

Simulation as Reality? The Effectiveness of LLM-Generated Data in Open-ended Question Assessment

The advancement of Artificial Intelligence (AI) has created opportunities for e-learning, particularly in automated assessment systems that reduce educators' workload and provide timely feedback to students. However, developing effective…

Computers and Society · Computer Science 2025-02-11 Long Zhang, Meng Zhang, Wei Lin Wang, Yu Luo

AutoGeTS: Knowledge-based Automated Generation of Text Synthetics for Improving Text Classification

When developing text classification models for real-world applications, one major challenge is the difficulty of collecting sufficient data for all text classes. In this work, we address this challenge by utilizing large language models (LLMs)…

Computation and Language · Computer Science 2025-08-15 Chenhao Xue, Yuanzhe Jin, Adrian Carrasco-Revilla, Joyraj Chakraborty, Min Chen

Assessing Achievability of Queries and Constraints

Assessing and improving the quality of data in data-intensive systems are fundamental challenges that have given rise to numerous applications targeting transformation and cleaning of data. However, while schema design, data cleaning, and…

Databases · Computer Science 2017-12-12 Rada Chirkova, Jon Doyle, Juan L. Reutter

Accelerating Domain-Aware Electron Microscopy Analysis Using Deep Learning Models with Synthetic Data and Image-Wide Confidence Scoring

The integration of machine learning (ML) models enhances the efficiency, affordability, and reliability of feature detection in microscopy, yet their development and applicability are hindered by the dependency on scarce and often flawed…

Computer Vision and Pattern Recognition · Computer Science 2025-09-04 Matthew J. Lynch, Ryan Jacobs, Gabriella Bruno, Priyam Patki, Dane Morgan, Kevin G. Field

An Inductive Synthesis Framework for Verifiable Reinforcement Learning

Despite the tremendous advances that have been made in the last decade on developing useful machine-learning applications, their wider adoption has been hindered by the lack of strong assurance guarantees that can be made about their…

Machine Learning · Computer Science 2019-07-18 He Zhu, Zikang Xiong, Stephen Magill, Suresh Jagannathan

Synthetic Test Data Generation Using Recurrent Neural Networks: A Position Paper

Testing in production-like test environments is an essential part of quality assurance processes in many industries. Provisioning of such test environments, for information-intensive services, involves setting up databases that are…

Software Engineering · Computer Science 2024-07-09 Razieh Behjati, Erik Arisholm, Chao Tan, Margrethe M. Bedregal