Related papers: hyppo: A Multivariate Hypothesis Testing Python Pa…

Spey: smooth inference for reinterpretation studies

Statistical models serve as the cornerstone for hypothesis testing in empirical studies. This paper introduces a new cross-platform Python-based package designed to utilise different likelihood prescriptions via a flexible plug-in system.…

High Energy Physics - Phenomenology · Physics 2024-01-29 Jack Y. Araz

infotheory: A C++/Python package for multivariate information theoretic analysis

This paper introduces infotheory: a package written in C++ and usable from Python and C++, for multivariate information theoretic analyses of discrete and continuous data. This package allows the user to study the relationship…

Information Theory · Computer Science 2021-06-11 Madhavun Candadai, Eduardo J. Izquierdo

HighProbability determines which alternative hypotheses are sufficiently probable: Genomic applications include detection of differential gene expression

Many genomic experiments, notably microarray experiments seeking to detect differential gene expression, involve calculating a large number of p-values. This leads to the multiple testing problem: when the number of null hypotheses is…

Quantitative Methods · Quantitative Biology 2007-05-23 David R. Bickel

eipy: An Open-Source Python Package for Multi-modal Data Integration using Heterogeneous Ensembles

In this paper, we introduce eipy, an open-source Python package for developing effective, multi-modal heterogeneous ensembles for classification. eipy simultaneously provides both a rigorous and user-friendly framework for comparing and…

Machine Learning · Computer Science 2024-12-11 Jamie J. R. Bennett, Aviad Susman, Yan Chak Li, Gaurav Pandey

BayesPy: Variational Bayesian Inference in Python

BayesPy is an open-source Python software package for performing variational Bayesian inference. It is based on the variational message passing framework and supports conjugate exponential family models. By removing the tedious task of…

Machine Learning · Statistics 2015-06-08 Jaakko Luttinen

rigidPy: Rigidity Analysis in Python

rigidPy is a Python package that provides a set of tools necessary for studying rigidity and mechanical response in spring networks. It also includes suitable modules for generating new realizations of networks with applications in glassy…

Soft Condensed Matter · Physics 2022-03-02 Varda F. Hagh, Mahdi Sadjadi

Deepchecks: A Library for Testing and Validating Machine Learning Models and Data

This paper presents Deepchecks, a Python library for comprehensively validating machine learning models and data. Our goal is to provide an easy-to-use library comprising many checks related to various types of issues, such as model…

Machine Learning · Computer Science 2022-03-17 Shir Chorev, Philip Tannor, Dan Ben Israel, Noam Bressler, Itay Gabbay, Nir Hutnik, Jonatan Liberman, Matan Perlmutter, Yurii Romanyshyn, Lior Rokach

BFpack: Flexible Bayes Factor Testing of Scientific Theories in R

There has been a tremendous methodological development of Bayes factors for hypothesis testing in the social and behavioral sciences, and related fields. This development is due to the flexibility of the Bayes factor for testing multiple…

Computation · Statistics 2019-11-19 Joris Mulder, Xin Gu, Anton Olsson-Collentine, Andrew Tomarken, Florian Böing-Messing, Herbert Hoijtink, Marlyne Meijerink, Donald R. Williams, Janosch Menke, Jean-Paul Fox, Yves Rosseel, Eric-Jan Wagenmakers, Caspar van Lissa

Memento: Facilitating Effortless, Efficient, and Reliable ML Experiments

Running complex sets of machine learning experiments is challenging and time-consuming due to the lack of a unified framework. This leaves researchers forced to spend time implementing necessary features such as parallelization, caching,…

Machine Learning · Computer Science 2023-11-22 Zac Pullar-Strecker, Xinglong Chang, Liam Brydon, Ioannis Ziogas, Katharina Dost, Jörg Wicker

Extensions of Heterogeneity in Integration and Prediction (HIP) with R Shiny Application

Multiple data views measured on the same set of participants are becoming more common and have the potential to deepen our understanding of many complex diseases by analyzing these different views simultaneously. Equally important, many of…

Methodology · Statistics 2023-10-13 J. Butts, C. Wendt, R. Bowler, C. P. Hersh, Q. Long, L. Eberly, S. E. Safo

NApy: Efficient Statistics in Python for Large-Scale Heterogeneous Data with Enhanced Support for Missing Data

Existing Python libraries and tools lack the ability to efficiently compute statistical test results for large datasets in the presence of missing values. This presents an issue as soon as constraints on runtime and memory availability…

Mathematical Software · Computer Science 2025-05-02 Fabian Woller, Lis Arend, Christian Fuchsberger, Markus List, David B. Blumenthal

A New Framework of Multistage Hypothesis Tests

In this paper, we have established a general framework of multistage hypothesis tests which applies to arbitrarily many mutually exclusive and exhaustive composite hypotheses. Within the new framework, we have constructed specific…

Statistics Theory · Mathematics 2013-11-05 Xinjia Chen

HypoML: Visual Analysis for Hypothesis-based Evaluation of Machine Learning Models

In this paper, we present a visual analytics tool for enabling hypothesis-based evaluation of machine learning (ML) models. We describe a novel ML-testing framework that combines the traditional statistical hypothesis testing (commonly used…

Human-Computer Interaction · Computer Science 2020-08-28 Qianwen Wang, William Alexander, Jack Pegg, Huamin Qu, Min Chen

QuickMMCTest - Quick Multiple Monte Carlo Testing

Multiple hypothesis testing is widely used to evaluate scientific studies involving statistical tests. However, for many of these tests, p-values are not available and are thus often approximated using Monte Carlo tests such as permutation…

Applications · Statistics 2018-10-17 Axel Gandy, Georg Hahn

PyPOTS: A Python Toolkit for Machine Learning on Partially-Observed Time Series

PyPOTS is an open-source Python library dedicated to data mining and analysis on multivariate partially-observed time series with missing values. Particularly, it provides easy access to diverse algorithms categorized into five tasks:…

Machine Learning · Computer Science 2025-07-10 Wenjie Du, Yiyuan Yang, Linglong Qian, Jun Wang, Qingsong Wen

HolPy: Interactive Theorem Proving in Python

HolPy is an interactive theorem proving system implemented in Python. It uses higher-order logic as the logical foundation. Its main features include a pervasive use of macros in producing, checking, and storing proofs, a JSON-based format…

Logic in Computer Science · Computer Science 2020-01-28 Bohua Zhan

Compound p-Value Statistics for Multiple Testing Procedures

Many multiple testing procedures make use of the p-values from the individual pairs of hypothesis tests, and are valid if the p-value statistics are independent and uniformly distributed under the null hypotheses. However, it has recently…

Methodology · Statistics 2011-08-25 Joshua D. Habiger, Edsel A. Pena

Predictive Independence Testing, Predictive Conditional Independence Testing, and Predictive Graphical Modelling

Testing (conditional) independence of multivariate random variables is a task central to statistical inference and modelling in general, though unfortunately one for which to date there does not exist a practicable workflow. State-of-the-art…

Machine Learning · Statistics 2018-05-01 Samuel Burkart, Franz J Király

mlpy: Machine Learning Python

mlpy is a Python Open Source Machine Learning library built on top of NumPy/SciPy and the GNU Scientific Libraries. mlpy provides a wide range of state-of-the-art machine learning methods for supervised and unsupervised problems and it is…

Mathematical Software · Computer Science 2012-03-02 Davide Albanese, Roberto Visintainer, Stefano Merler, Samantha Riccadonna, Giuseppe Jurman, Cesare Furlanello

A Library for Implementing the Multiple Hypothesis Tracking Algorithm

The Multiple Hypothesis Tracking (MHT) algorithm is known to produce good results in difficult multi-target tracking situations. However, its implementation is not trivial, and is associated with a significant programming effort, code size…

Data Structures and Algorithms · Computer Science 2011-06-14 David Miguel Antunes, David Martins de Matos, José Gaspar