Related papers: Structural-Entropy-Based Sample Selection for Effi…

Multi-Relational Structural Entropy

Structural Entropy (SE) measures the structural information contained in a graph. Minimizing or maximizing SE helps to reveal or obscure the intrinsic structural patterns underlying graphs in an interpretable manner, finding applications in…

Social and Information Networks · Computer Science 2024-05-14 Yuwei Cao , Hao Peng , Angsheng Li , Chenyu You , Zhifeng Hao , Philip S Yu

Shapley Homology: Topological Analysis of Sample Influence for Neural Networks

Data samples collected for training machine learning models are typically assumed to be independent and identically distributed (iid). Recent research has demonstrated that this assumption can be problematic as it simplifies the manifold of…

Machine Learning · Computer Science 2019-10-16 Kaixuan Zhang , Qinglong Wang , Xue Liu , C. Lee Giles

Structural Entropy Guided Probabilistic Coding

Probabilistic embeddings have several advantages over deterministic embeddings as they map each data point to a distribution, which better describes the uncertainty and complexity of data. Many works focus on adjusting the distribution…

Artificial Intelligence · Computer Science 2024-12-16 Xiang Huang , Hao Peng , Li Sun , Hui Lin , Chunyang Liu , Jiang Cao , Philip S. Yu

Mitigating Label Noise on Graph via Topological Sample Selection

Despite the success of the carefully-annotated benchmarks, the effectiveness of existing graph neural networks (GNNs) can be considerably impaired in practice when the real-world graph data is noisily labeled. Previous explorations in…

Machine Learning · Computer Science 2024-08-30 Yuhao Wu , Jiangchao Yao , Xiaobo Xia , Jun Yu , Ruxin Wang , Bo Han , Tongliang Liu

SE-GSL: A General and Effective Graph Structure Learning Framework through Structural Entropy Optimization

Graph Neural Networks (GNNs) are de facto solutions to structural data learning. However, it is susceptible to low-quality and unreliable structure, which has been a norm rather than an exception in real-world graphs. Existing graph…

Machine Learning · Computer Science 2023-03-20 Dongcheng Zou , Hao Peng , Xiang Huang , Renyu Yang , Jianxin Li , Jia Wu , Chunyang Liu , Philip S. Yu

Causal Feature Selection via Transfer Entropy

Machine learning algorithms are designed to capture complex relationships between features. In this context, the high dimensionality of data often results in poor model performance, with the risk of overfitting. Feature selection, the…

Machine Learning · Computer Science 2023-10-18 Paolo Bonetti , Alberto Maria Metelli , Marcello Restelli

Swift Sampler: Efficient Learning of Sampler by 10 Parameters

Data selection is essential for training deep learning models. An effective data sampler assigns proper sampling probability for training data and helps the model converge to a good local minimum with high performance. Previous studies in…

Machine Learning · Computer Science 2024-10-10 Jiawei Yao , Chuming Li , Canran Xiao

A Cross-Entropy-based Method to Perform Information-based Feature Selection

From a machine learning point of view, identifying a subset of relevant features from a real data set can be useful to improve the results achieved by classification methods and to reduce their time and space complexity. To achieve this…

Machine Learning · Computer Science 2017-05-23 Pietro Cassara , Alessandro Rozza , Mirco Nanni

SEHFS: Structural Entropy-Guided High-Order Correlation Learning for Multi-View Multi-Label Feature Selection

In recent years, multi-view multi-label learning (MVML) has attracted extensive attention due to its close alignment to real-world scenarios. Information-theoretic methods have gained prominence for learning nonlinear correlations. However,…

Machine Learning · Computer Science 2026-03-04 Cheng Peng , Yonghao Li , Wanfu Gao , Jie Wen , Weiping Ding

Shapley effect estimation in reliability-oriented sensitivity analysis with correlated inputs by importance sampling

Reliability-oriented sensitivity analysis aims at combining both reliability and sensitivity analyses by quantifying the influence of each input variable of a numerical model on a quantity of interest related to its failure. In particular,…

Statistics Theory · Mathematics 2022-10-25 Julien Demange-Chryst , François Bachoc , Jérôme Morio

Entropy-Based Data Selection for Language Models

Modern language models (LMs) increasingly require two critical resources: computational resources and data resources. Data selection techniques can effectively reduce the amount of training data required for fine-tuning LMs. However, their…

Computation and Language · Computer Science 2026-02-20 Hongming Li , Yang Liu , Chao Huang

Mechanics Informatics: A paradigm for efficiently learning constitutive models

Efficient and accurate learning of constitutive laws is crucial for accurately predicting the mechanical behavior of materials under complex loading conditions. Accurate model calibration hinges on a delicate interplay between the…

Computational Engineering, Finance, and Science · Computer Science 2025-06-24 Royal C. Ihuaenyi , Wei Li , Martin Z. Bazant , Juner Zhu

Data-Efficient Training by Evolved Sampling

Data selection is designed to accelerate learning with preserved performance. To achieve this, a fundamental thought is to identify informative data samples with significant contributions to the training. In this work, we propose…

Machine Learning · Computer Science 2025-09-30 Ziheng Cheng , Zhong Li , Jiang Bian

Cross-entropy optimisation of importance sampling parameters for statistical model checking

Statistical model checking avoids the exponential growth of states associated with probabilistic model checking by estimating properties from multiple executions of a system and by giving results within confidence bounds. Rare properties…

Performance · Computer Science 2012-01-26 Cyrille Jégourel , Axel Legay , Sean Sedwards

Semi-Supervised Clustering via Structural Entropy with Different Constraints

Semi-supervised clustering techniques have emerged as valuable tools for leveraging prior information in the form of constraints to improve the quality of clustering outcomes. Despite the proliferation of such methods, the ability to…

Machine Learning · Computer Science 2023-12-19 Guangjie Zeng , Hao Peng , Angsheng Li , Zhiwei Liu , Runze Yang , Chunyang Liu , Lifang He

Structurally Diverse Sampling for Sample-Efficient Training and Comprehensive Evaluation

A growing body of research has demonstrated the inability of NLP models to generalize compositionally and has tried to alleviate it through specialized architectures, training schemes, and data augmentation, among other approaches. In this…

Computation and Language · Computer Science 2022-11-03 Shivanshu Gupta , Sameer Singh , Matt Gardner

An Entropy-Based Model for Hierarchical Learning

Machine learning is the dominant approach to artificial intelligence, through which computers learn from data and experience. In the framework of supervised learning, a necessity for a computer to learn from data accurately and efficiently…

Machine Learning · Statistics 2023-01-25 Amir R. Asadi

Neuro-Symbolic Entropy Regularization

In structured prediction, the goal is to jointly predict many output variables that together encode a structured object -- a path in a graph, an entity-relation triple, or an ordering of objects. Such a large output space makes learning…

Machine Learning · Computer Science 2022-01-28 Kareem Ahmed , Eric Wang , Kai-Wei Chang , Guy Van den Broeck

Entropy-Based Search Algorithm for Experimental Design

The scientific method relies on the iterated processes of inference and inquiry. The inference phase consists of selecting the most probable models based on the available data; whereas the inquiry phase consists of using what is known about…

Machine Learning · Statistics 2015-05-19 N. K. Malakar , K. H. Knuth

Quantifying Spatiotemporal Stability by means of Entropy: Approach and Motivations

Several studies demonstrate that there are critical differences between real wireless networks and simulation models. This finding has permitted to extract spatial and temporal properties for links and to provide efficient methods as biased…

Networking and Internet Architecture · Computer Science 2012-07-12 Mohamed-Haykel Zayani , Vincent Gauthier , Djamal Zeghlache