
Related papers: Structural-Entropy-Based Sample Selection for Effi…

200 papers

Structural Entropy (SE) measures the structural information contained in a graph. Minimizing or maximizing SE helps to reveal or obscure the intrinsic structural patterns underlying graphs in an interpretable manner, finding applications in…

Social and Information Networks · Computer Science 2024-05-14 Yuwei Cao, Hao Peng, Angsheng Li, Chenyu You, Zhifeng Hao, Philip S Yu

Data samples collected for training machine learning models are typically assumed to be independent and identically distributed (iid). Recent research has demonstrated that this assumption can be problematic as it simplifies the manifold of…

Machine Learning · Computer Science 2019-10-16 Kaixuan Zhang, Qinglong Wang, Xue Liu, C. Lee Giles

Probabilistic embeddings have several advantages over deterministic embeddings as they map each data point to a distribution, which better describes the uncertainty and complexity of data. Many works focus on adjusting the distribution…

Artificial Intelligence · Computer Science 2024-12-16 Xiang Huang, Hao Peng, Li Sun, Hui Lin, Chunyang Liu, Jiang Cao, Philip S. Yu

Despite the success of the carefully-annotated benchmarks, the effectiveness of existing graph neural networks (GNNs) can be considerably impaired in practice when the real-world graph data is noisily labeled. Previous explorations in…

Machine Learning · Computer Science 2024-08-30 Yuhao Wu, Jiangchao Yao, Xiaobo Xia, Jun Yu, Ruxin Wang, Bo Han, Tongliang Liu

Graph Neural Networks (GNNs) are de facto solutions to structural data learning. However, they are susceptible to low-quality and unreliable structure, which has been a norm rather than an exception in real-world graphs. Existing graph…

Machine Learning · Computer Science 2023-03-20 Dongcheng Zou, Hao Peng, Xiang Huang, Renyu Yang, Jianxin Li, Jia Wu, Chunyang Liu, Philip S. Yu

Machine learning algorithms are designed to capture complex relationships between features. In this context, the high dimensionality of data often results in poor model performance, with the risk of overfitting. Feature selection, the…

Machine Learning · Computer Science 2023-10-18 Paolo Bonetti, Alberto Maria Metelli, Marcello Restelli

Data selection is essential for training deep learning models. An effective data sampler assigns proper sampling probability for training data and helps the model converge to a good local minimum with high performance. Previous studies in…

Machine Learning · Computer Science 2024-10-10 Jiawei Yao, Chuming Li, Canran Xiao

From a machine learning point of view, identifying a subset of relevant features from a real data set can be useful to improve the results achieved by classification methods and to reduce their time and space complexity. To achieve this…

Machine Learning · Computer Science 2017-05-23 Pietro Cassara, Alessandro Rozza, Mirco Nanni

In recent years, multi-view multi-label learning (MVML) has attracted extensive attention due to its close alignment to real-world scenarios. Information-theoretic methods have gained prominence for learning nonlinear correlations. However,…

Machine Learning · Computer Science 2026-03-04 Cheng Peng, Yonghao Li, Wanfu Gao, Jie Wen, Weiping Ding

Reliability-oriented sensitivity analysis aims at combining both reliability and sensitivity analyses by quantifying the influence of each input variable of a numerical model on a quantity of interest related to its failure. In particular,…

Statistics Theory · Mathematics 2022-10-25 Julien Demange-Chryst, François Bachoc, Jérôme Morio

Modern language models (LMs) increasingly require two critical resources: computational resources and data resources. Data selection techniques can effectively reduce the amount of training data required for fine-tuning LMs. However, their…

Computation and Language · Computer Science 2026-02-20 Hongming Li, Yang Liu, Chao Huang

Efficient and accurate learning of constitutive laws is crucial for accurately predicting the mechanical behavior of materials under complex loading conditions. Accurate model calibration hinges on a delicate interplay between the…

Computational Engineering, Finance, and Science · Computer Science 2025-06-24 Royal C. Ihuaenyi, Wei Li, Martin Z. Bazant, Juner Zhu

Data selection is designed to accelerate learning with preserved performance. To achieve this, a fundamental thought is to identify informative data samples with significant contributions to the training. In this work, we propose…

Machine Learning · Computer Science 2025-09-30 Ziheng Cheng, Zhong Li, Jiang Bian

Statistical model checking avoids the exponential growth of states associated with probabilistic model checking by estimating properties from multiple executions of a system and by giving results within confidence bounds. Rare properties…

Performance · Computer Science 2012-01-26 Cyrille Jégourel, Axel Legay, Sean Sedwards

Semi-supervised clustering techniques have emerged as valuable tools for leveraging prior information in the form of constraints to improve the quality of clustering outcomes. Despite the proliferation of such methods, the ability to…

Machine Learning · Computer Science 2023-12-19 Guangjie Zeng, Hao Peng, Angsheng Li, Zhiwei Liu, Runze Yang, Chunyang Liu, Lifang He

A growing body of research has demonstrated the inability of NLP models to generalize compositionally and has tried to alleviate it through specialized architectures, training schemes, and data augmentation, among other approaches. In this…

Computation and Language · Computer Science 2022-11-03 Shivanshu Gupta, Sameer Singh, Matt Gardner

Machine learning is the dominant approach to artificial intelligence, through which computers learn from data and experience. In the framework of supervised learning, a necessity for a computer to learn from data accurately and efficiently…

Machine Learning · Statistics 2023-01-25 Amir R. Asadi

In structured prediction, the goal is to jointly predict many output variables that together encode a structured object -- a path in a graph, an entity-relation triple, or an ordering of objects. Such a large output space makes learning…

Machine Learning · Computer Science 2022-01-28 Kareem Ahmed, Eric Wang, Kai-Wei Chang, Guy Van den Broeck

The scientific method relies on the iterated processes of inference and inquiry. The inference phase consists of selecting the most probable models based on the available data; whereas the inquiry phase consists of using what is known about…

Machine Learning · Statistics 2015-05-19 N. K. Malakar, K. H. Knuth

Several studies demonstrate that there are critical differences between real wireless networks and simulation models. This finding has made it possible to extract spatial and temporal properties for links and to provide efficient methods as biased…

Networking and Internet Architecture · Computer Science 2012-07-12 Mohamed-Haykel Zayani, Vincent Gauthier, Djamal Zeghlache