Related papers: Finding Statistically Significant Attribute Intera…

Interpreting Classifiers through Attribute Interactions in Datasets

In this work we present the novel ASTRID method for investigating which attribute interactions classifiers exploit when making predictions. Attribute interactions in classification tasks mean that two or more attributes together provide…

Machine Learning · Statistics 2017-07-25 Andreas Henelius, Kai Puolamäki, Antti Ukkonen

Algorithms for Efficient Mining of Statistically Significant Attribute Association Information

Knowledge of the association information between the attributes in a data set provides insight into the underlying structure of the data and explains the relationships (independence, synergy, redundancy) between the attributes and class (if…

Databases · Computer Science 2012-08-21 Pritam Chanda, Aidong Zhang, Murali Ramanathan

Mining Statistically Significant Attribute Associations in Attributed Graphs

Recently, graphs have been widely used to represent many different kinds of real world data or observations such as social networks, protein-protein networks, road networks, and so on. In many cases, each node in a graph is associated with…

Social and Information Networks · Computer Science 2016-09-28 Jihwan Lee, Keehwan Park, Sunil Prabhakar

Deep determinism and the assessment of mechanistic interaction between categorical and continuous variables

Our aim is to detect mechanistic interaction between the effects of two causal factors on a binary response, as an aid to identifying situations where the effects are mediated by a common mechanism. We propose a formalization of mechanistic…

Methodology · Statistics 2015-06-23 Carlo Berzuini, A. Philip Dawid

Statistical Inference for Qualitative Interactions with Applications to Precision Medicine and Differential Network Analysis

Qualitative interactions occur when a treatment effect or measure of association varies in sign by sub-population. Of particular interest in many biomedical settings are absence/presence qualitative interactions, which occur when an effect…

Methodology · Statistics 2020-10-20 Aaron Hudson, Ali Shojaie

Finding Statistically Significant Interactions between Continuous Features

The search for higher-order feature interactions that are statistically significantly associated with a class variable is of high relevance in fields such as Genetics or Healthcare, but the combinatorial explosion of the candidate space…

Machine Learning · Statistics 2019-05-13 Mahito Sugiyama, Karsten Borgwardt

Learning Interesting Categorical Attributes for Refined Data Exploration

This work proposes and evaluates a novel approach to determine interesting categorical attributes for lists of entities. Once identified, such categories are of immense value to allow constraining (filtering) a current view of a user to…

Databases · Computer Science 2017-11-30 Koninika Pal, Sebastian Michel

Statistical data mining for symbol associations in genomic databases

A methodology is proposed to automatically detect significant symbol associations in genomic databases. A new statistical test is proposed to assess the significance of a group of symbols when found in several genesets of a given database.…

Genomics · Quantitative Biology 2013-09-11 Bernard Ycart, Frédéric Pont, Jean-Jacques Fournié

Discovering Categorical Main and Interaction Effects Based on Association Rule Mining

With the growing size of data sets, feature selection becomes increasingly important. Taking interactions of original features into consideration will lead to extremely high dimension, especially when the features are categorical and…

Databases · Computer Science 2021-04-13 Qiuqiang Lin, Chuanhou Gao

Significance Analysis of High-Dimensional, Low-Sample Size Partially Labeled Data

Classification and clustering are both important topics in statistical learning. A natural question herein is whether predefined classes are really different from one another, or whether clusters are really there. Specifically, we may be…

Machine Learning · Statistics 2015-09-22 Qiyi Lu, Xingye Qiao

Mining Feature Relationships in Data

When faced with a new dataset, most practitioners begin by performing exploratory data analysis to discover interesting patterns and characteristics within data. Techniques such as association rule mining are commonly applied to uncover…

Machine Learning · Computer Science 2021-02-03 Andrew Lensen

friends.test: rank-based method for feature selection in interaction matrices

The analysis of the interaction matrix between two distinct sets is essential across diverse fields, from pharmacovigilance to transcriptomics. Not all interactions are equally informative: a marker gene associated with a few specific…

Quantitative Methods · Quantitative Biology 2026-01-21 Alexandra Suvorikova, Alexey Kroshnin, Dmirijs Lvovs, Vera Mukhina, Andrey Mironov, Elana J. Fertig, Ludmila Danilova, Alexander Favorov

STRive: An association rule-based system for the exploration of spatiotemporal categorical data

Effectively analyzing spatiotemporal data plays a central role in understanding real-world phenomena and informing decision-making. Capturing the interaction between spatial and temporal dimensions also helps explain the underlying…

Human-Computer Interaction · Computer Science 2025-09-04 Mauro Diaz, Luis Sante, Joel Perca, João Victor da Silva, Nivan Ferreira, Jorge Poco

A Statistical Approach to Set Classification by Feature Selection with Applications to Classification of Histopathology Images

Set classification problems arise when classification tasks are based on sets of observations as opposed to individual observations. In set classification, a classification rule is trained with $N$ sets of observations, where each set is…

Methodology · Statistics 2016-03-08 Sungkyu Jung, Xingye Qiao

Inferring individual attributes from search engine queries and auxiliary information

Internet data has surfaced as a primary source for investigation of different aspects of human behavior. A crucial step in such studies is finding a suitable cohort (i.e., a set of users) that shares a common trait of interest to…

Information Retrieval · Computer Science 2018-05-16 Luca Soldaini, Elad Yom-Tov

Characterizing Discriminative Patterns

Discriminative patterns are association patterns that occur with disproportionate frequency in some classes versus others, and have been studied under names such as emerging patterns and contrast sets. Such patterns have demonstrated…

Databases · Computer Science 2011-02-22 Gang Fang, Wen Wang, Benjamin Oatley, Brian Van Ness, Michael Steinbach, Vipin Kumar

Inference of Common Multidimensional Equally-Distributed Attributes

Given two relations containing multiple measurements - possibly with uncertainties - our objective is to find which sets of attributes from the first have a corresponding set on the second, using exclusively a sample of the data. This…

Databases · Computer Science 2022-07-20 Alejandro Alvarez-Ayllon, Manuel Palomo-Duarte, Juan-Manuel Dodero

Sequential Advantage Selection for Optimal Treatment Regimes

Variable selection for optimal treatment regime in a clinical trial or an observational study is getting more attention. Most existing variable selection techniques focused on selecting variables that are important for prediction, therefore…

Methodology · Statistics 2014-05-22 Ailin Fan, Wenbin Lu, Rui Song

DiSC: Differential Spectral Clustering of Features

Selecting subsets of features that differentiate between two conditions is a key task in a broad range of scientific domains. In many applications, the features of interest form clusters with similar effects on the data at hand. To recover…

Machine Learning · Computer Science 2022-11-11 Ram Dyuthi Sristi, Gal Mishne, Ariel Jaffe

Enhancing interpretability of rule-based classifiers through feature graphs

In domains where transparency and trustworthiness are crucial, such as healthcare, rule-based systems are widely used and often preferred over black-box models for decision support systems due to their inherent interpretability. However, as…

Machine Learning · Computer Science 2025-06-18 Christel Sirocchi, Damiano Verda