Related papers: Hypothesis Testing for Validation and Certificatio…

A Theory of Black-Box Tests

The purpose of testing a system with respect to a requirement is to refute the hypothesis that the system satisfies the requirement. We build a theory of tests and refutation based on the elementary notions of satisfaction and refinement.…

Software Engineering · Computer Science 2020-06-19 Mohammad Torabi Dashti , David Basin

Testing for Overfitting

High complexity models are notorious in machine learning for overfitting, a phenomenon in which models well represent data but fail to generalize an underlying data generating process. A typical procedure for circumventing overfitting…

Machine Learning · Statistics 2025-03-11 James Schmidt

Some approximations in Model Checking and Testing

Model checking and testing are two areas with a similar goal: to verify that a system satisfies a property. They start with different hypothesis on the systems and develop many techniques with different notions of approximation, when an…

Logic in Computer Science · Computer Science 2013-04-19 M. C. Gaudel , R. Lassaigne , F. Magniez , M. de Rougemont

Interpreting Black Box Models via Hypothesis Testing

In science and medicine, model interpretations may be reported as discoveries of natural phenomena or used to guide patient treatments. In such high-stakes tasks, false discoveries may lead investigators astray. These applications would…

Machine Learning · Statistics 2020-08-18 Collin Burns , Jesse Thomason , Wesley Tansey

Hypothesis Testing the Circuit Hypothesis in LLMs

Large language models (LLMs) demonstrate surprising capabilities, but we do not understand how they are implemented. One hypothesis suggests that these capabilities are primarily executed by small subnetworks within the LLM, known as…

Artificial Intelligence · Computer Science 2024-10-18 Claudia Shi , Nicolas Beltran-Velez , Achille Nazaret , Carolina Zheng , Adrià Garriga-Alonso , Andrew Jesson , Maggie Makar , David M. Blei

Parametric Systems: Verification and Synthesis

In this paper we study possibilities of using hierarchical reasoning, symbol elimination and model generation for the verification of parametric systems, where the parameters can be constants or functions. Our goal is to automatically…

Logic in Computer Science · Computer Science 2019-10-14 Viorica Sofronie-Stokkermans

Quantum hypothesis testing in many-body systems

One of the key tasks in physics is to perform measurements in order to determine the state of a system. Often, measurements are aimed at determining the values of physical parameters, but one can also ask simpler questions, such as "is the…

Quantum Physics · Physics 2021-07-01 Jan de Boer , Victor Godet , Jani Kastikainen , Esko Keski-Vakkuri

Development and Realization of Validation Benchmarks

In the field of modeling, the word validation refers to simple comparisons between model outputs and experimental data. Usually, this comparison constitutes plotting the model results against data on the same axes to provide a visual…

Applications · Statistics 2021-06-11 Farid Mohammadi

Algorithm for Model Validation: Theory and Applications

Validation is often defined as the process of determining the degree to which a model is an accurate representation of the real world from the perspective of its intended uses. Validation is crucial as industries and governments depend…

Data Analysis, Statistics and Probability · Physics 2015-06-26 D. Sornette , A. B. Davis , K. Ide , K. R. Vixie , V. Pisarenko , J. R. Kamm

Role of Hypothesis Testing in Quantum Information

Recently, it is well recognized that hypothesis testing has deep relations with other topics in quantum information theory as well as in classical information theory. These relations enable us to derive precise evaluation in the…

Quantum Physics · Physics 2017-09-25 Masahito Hayashi

Combining closed-loop test generation and execution by means of model checking

Model checking is an established technique to formally verify automation systems which are required to be trusted. However, for sufficiently complex systems model checking becomes computationally infeasible. On the other hand, testing,…

Software Engineering · Computer Science 2019-07-30 Igor Buzhinsky , Valeriy Vyatkin

Validation of Approximate Likelihood and Emulator Models for Computationally Intensive Simulations

Complex phenomena in engineering and the sciences are often modeled with computationally intensive feed-forward simulations for which a tractable analytic likelihood does not exist. In these cases, it is sometimes necessary to estimate an…

Methodology · Statistics 2020-06-18 Niccolò Dalmasso , Ann B. Lee , Rafael Izbicki , Taylor Pospisil , Ilmun Kim , Chieh-An Lin

Hypothesis Formalization: Empirical Findings, Software Limitations, and Design Implications

Data analysis requires translating higher level questions and hypotheses into computable statistical models. We present a mixed-methods study aimed at identifying the steps, considerations, and challenges involved in operationalizing…

Other Computer Science · Computer Science 2021-04-08 Eunice Jun , Melissa Birchfield , Nicole de Moura , Jeffrey Heer , Rene Just

Sample Complexity of Composite Quantum Hypothesis Testing

This paper investigates symmetric composite binary quantum hypothesis testing (QHT), where the goal is to determine which of two uncertainty sets contains an unknown quantum state. While asymptotic error exponents for this problem are…

Quantum Physics · Physics 2026-04-13 Jacob Paul Simpson , Efstratios Palias , Sharu Theresa Jose

A General Framework for Verification and Control of Dynamical Models via Certificate Synthesis

An emerging branch of control theory specialises in certificate learning, concerning the specification of a desired (possibly complex) system behaviour for an autonomous or control model, which is then analytically verified by means of a…

Systems and Control · Electrical Eng. & Systems 2024-10-29 Alec Edwards , Andrea Peruffo , Alessandro Abate

A Framework for Proof-carrying Logical Transformations

In various provers and deductive verification tools, logical transformations are used extensively in order to reduce a proof task into a number of simpler tasks. Logical transformations are often part of the trusted base of such tools. In…

Logic in Computer Science · Computer Science 2021-07-07 Quentin Garchery

Hypothesis Testing in Imaging Inverse Problems

This paper proposes a framework for semantic hypothesis testing tailored to imaging inverse problems. Modern imaging methods struggle to support hypothesis testing, a core component of the scientific method that is essential for the…

Machine Learning · Statistics 2025-05-29 Yiming Xi , Konstantinos Zygalakis , Marcelo Pereyra

Sequential Tests of Statistical Hypotheses with Confidence Limits

In this paper, we propose a general method for testing composite hypotheses. Our idea is to use confidence limits to define stopping and decision rules. The requirements of operating characteristic function can be satisfied by adjusting the…

Statistics Theory · Mathematics 2012-02-10 Xinjia Chen

Quantum Conformance Test

We introduce a protocol addressing the conformance test problem, which consists in determining whether a process under test conforms to a reference one. We consider a process to be characterized by the set of end-product it produces, which…

Quantum Physics · Physics 2021-12-24 Giuseppe Ortolano , Pauline Boucher , Ivo Pietro Degiovanni , Elena Losero , Marco Genovese , Ivano Ruo Berchera

Optimal regularized hypothesis testing in statistical inverse problems

Testing of hypotheses is a well studied topic in mathematical statistics. Recently, this issue has also been addressed in the context of Inverse Problems, where the quantity of interest is not directly accessible but only after the…

Statistics Theory · Mathematics 2024-04-09 Remo Kretschmann , Daniel Wachsmuth , Frank Werner