Related papers: A Tutorial on Statistically Sound Pattern Discover…

200 papers

Discovering statistically significant patterns from databases is an important and challenging problem. The main obstacle is the difficulty of accounting for selection bias, i.e., the bias arising from the fact that…

Machine Learning · Statistics 2016-03-10 Shinya Suzumura, Kazuya Nakagawa, Mahito Sugiyama, Koji Tsuda, Ichiro Takeuchi

Discriminative pattern mining is an essential task of data mining. This task aims to discover patterns that occur more frequently in one class than in other classes of a class-labeled dataset. This type of pattern is valuable in various…

Machine Learning · Computer Science 2019-06-05 Hoang Son Pham, Gwendal Virlet, Dominique Lavenier, Alexandre Termier

Data mining is the task of discovering interesting patterns from large amounts of data. There are many data mining tasks, such as classification, clustering, association rule mining, and sequential pattern mining. Sequential pattern mining…

Databases · Computer Science 2010-02-08 Mahdi Esmaeili, Fazekas Gabor

Statistical dependence between hypotheses poses a significant challenge to the stability of large scale multiple hypotheses testing. Ignoring it often results in an unacceptably large spread in the false positive proportion even though the…

Methodology · Statistics 2018-10-15 Sairam Rayaprolu, Zhiyi Chi

Pattern discovery algorithms in the music domain aim to find meaningful components in musical compositions. Over the years, although many algorithms have been developed for pattern discovery in music data, it remains a challenging task. To…

Sound · Computer Science 2020-10-26 Iris Ren, Anja Volk, Wouter Swierstra, Remco C. Veltkamp

The features of a logically sound approach to a theory of statistical reasoning are discussed. A particular approach that satisfies these criteria is reviewed. This is seen to involve selection of a model, model checking, elicitation of a…

Statistics Theory · Mathematics 2018-05-09 Luai Al-Labadi, Zeynep Baskurt, Michael Evans

Statistically significant patterns mining (SSPM) is an essential and challenging data mining task in the field of knowledge discovery in databases (KDD), in which each pattern is evaluated via a hypothesis test. Our study aims to introduce…

Methodology · Statistics 2020-08-26 Thien Q. Tran, Kazuto Fukuchi, Youhei Akimoto, Jun Sakuma

Conditional-independence-based discovery uses statistical tests to identify a graphical model that represents the independence structure of variables in a dataset. These tests, however, can be unreliable, and algorithms are sensitive to…

Machine Learning · Computer Science 2026-04-21 Philipp M. Faller, Dominik Janzing

The use of patterns in predictive models is a topic that has received a lot of attention in recent years. Pattern mining can help to obtain models for structured domains, such as graphs and sequences, and has been proposed as a means to…

Artificial Intelligence · Computer Science 2011-11-29 Björn Bringmann, Siegfried Nijssen, Albrecht Zimmermann

Physical theories that depend on many parameters or are tested against data from many different experiments pose unique challenges to statistical inference. Many models in particle physics, astrophysics and cosmology fall into one or both…

This paper formalizes a latent variable inference problem we call supervised pattern discovery, the goal of which is to find sets of observations that belong to a single "pattern." We discuss two versions of the problem and prove…

Machine Learning · Statistics 2014-02-10 Jonathan H. Huggins, Cynthia Rudin

Statistical analysis is an important tool to distinguish systematic from chance findings. Current statistical analyses rely on distributional assumptions reflecting the structure of some underlying model, which if not met lead to problems…

Statistics Theory · Mathematics 2023-11-15 Orestis Loukas, Ho Ryun Chung

We report our ongoing work about a new deep architecture working in tandem with a statistical test procedure for jointly training texts and their label descriptions for multi-label and multi-class classification tasks. A statistical…

Computation and Language · Computer Science 2019-06-18 Ahmad Aghaebrahimian, Mark Cieliebak

In this paper, we focus on the problem of statistical dependence estimation using characteristic functions. We propose a statistical dependence measure, based on the maximum-norm of the difference between joint and product-marginal…

Machine Learning · Computer Science 2022-08-18 Povilas Daniušis, Shubham Juneja, Lukas Kuzma, Virginijus Marcinkevičius

We describe a statistical hypothesis test for the presence of a signal. The test allows the researcher to fix the signal location and/or width a priori, or perform a search to find the signal region that maximizes the signal. The background…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Wolfgang Rolke, Angel Lopez

The robust detection of statistical dependencies between the components of a complex system is a key step in gaining a network-based understanding of the system. Because of their simplicity and low computation cost, pairwise statistics are…

Statistics Theory · Mathematics 2019-08-01 Antoine Messager, Nicos Georgiou, Luc Berthouze

Consider an experiment involving a potentially small number of subjects. Some random variables are observed on each subject: a high-dimensional one called the "observed" random variable, and a one-dimensional one called the "outcome" random…

Machine Learning · Statistics 2018-06-15 Tarun Yellamraju, Mireille Boutin

As advances in technology allow for the collection, storage, and analysis of vast amounts of data, the task of screening and assessing the significance of discovered patterns is becoming a major challenge in data mining applications. In…

Databases · Computer Science 2010-02-08 Adam Kirsch, Michael Mitzenmacher, Andrea Pietracaprina, Geppino Pucci, Eli Upfal, Fabio Vandin

User privacy can be compromised by matching user data traces to records of their previous behavior. The matching of the statistical characteristics of traces to prior user behavior has been widely studied. However, an adversary can also…

Cryptography and Security · Computer Science 2021-08-30 Bo Guan, Nazanin Takbiri, Dennis Goeckel, Amir Houmansadr, Hossein Pishro-Nik

Statistical modeling plays a fundamental role in understanding the underlying mechanism of massive data (statistical inference) and predicting the future (statistical prediction). Although all models are wrong, researchers try their best to…

Methodology · Statistics 2020-06-17 Hangjin Jiang