Related papers: Optimizing Feature Selection for Binary Classifica…

200 papers

Multi-view datasets offer diverse forms of data that can enhance prediction models by providing complementary information. However, the use of multi-view data leads to an increase in high-dimensional data, which poses significant challenges…

Neural and Evolutionary Computing · Computer Science 2024-03-05 Vandad Imani , Carlos Sevilla-Salcedo , Elaheh Moradi , Vittorio Fortino , Jussi Tohka

Real world datasets often contain noisy labels, and learning from such datasets using standard classification approaches may not produce the desired performance. In this paper, we propose a Gaussian Mixture Discriminant Analysis (GMDA) with…

Machine Learning · Computer Science 2022-01-26 Jian-wei Liu , Zheng-ping Ren , Run-kun Lu , Xiong-lin Luo

Analyzing large datasets to select optimal features is one of the most important research areas in machine learning and data mining. This feature selection procedure involves dimensionality reduction which is crucial in enhancing the…

Neural and Evolutionary Computing · Computer Science 2024-09-24 Zhila Yaseen Taha , Abdulhady Abas Abdullah , Tarik A. Rashid

Feature selection is an important process in machine learning. It builds an interpretable and robust model by selecting the features that contribute the most to the prediction target. However, most mature feature selection algorithms,…

Machine Learning · Computer Science 2022-07-20 Zhifeng Qiu , Wanxin Zeng , Dahua Liao , Ning Gui

The feature subset selection problem aims at selecting the relevant subset of features to improve the performance of a Machine Learning (ML) algorithm on training data. Some features in data can be inherently noisy, costly to compute,…

Neural and Evolutionary Computing · Computer Science 2022-05-04 Ayaz Ur Rehman , Anas Nadeem , Muhammad Zubair Malik

Deep neural networks (DNNs) trained on large-scale datasets have exhibited significant performance in image classification. Many large-scale datasets are collected from websites, however they tend to contain inaccurate labels that are…

Computer Vision and Pattern Recognition · Computer Science 2019-04-23 Daiki Tanaka , Daiki Ikami , Toshihiko Yamasaki , Kiyoharu Aizawa

As an effective data preprocessing step, feature selection has shown its effectiveness to prepare high-dimensional data for many machine learning tasks. The proliferation of high dimension and huge volume big data, however, has brought…

Machine Learning · Computer Science 2019-03-01 Ning Gui , Danni Ge , Ziyin Hu

Training deep neural network (DNN) with noisy labels is practically challenging since inaccurate labels severely degrade the generalization ability of DNN. Previous efforts tend to handle part or full data in a unified denoising flow via…

Computer Vision and Pattern Recognition · Computer Science 2022-08-24 Boshen Zhang , Yuxi Li , Yuanpeng Tu , Jinlong Peng , Yabiao Wang , Cunlin Wu , Yang Xiao , Cairong Zhao

In few-shot learning (FSL), the labeled samples are scarce. Thus, label errors can significantly reduce classification accuracy. Since label errors are inevitable in realistic learning tasks, improving the robustness of the model in the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Nan Xiang , Lifeng Xing , Dequan Jin

Analysis of high dimensional noisy data is of essence across a variety of research fields. Feature selection techniques are designed to find the relevant feature subset that can facilitate classification or pattern detection. Traditional…

Machine Learning · Computer Science 2014-04-14 Bo Wang , Anna Goldenberg

Modern deep neural networks (DNNs) become frail when the datasets contain noisy (incorrect) class labels. Robust techniques in the presence of noisy labels can be categorized into two folds: developing noise-robust functions or using…

Machine Learning · Computer Science 2021-10-28 Taehyeon Kim , Jongwoo Ko , Sangwook Cho , Jinhwan Choi , Se-Young Yun

Falsely annotated samples, also known as noisy labels, can significantly harm the performance of deep learning models. Two main approaches for learning with noisy labels are global noise estimation and data filtering. Global noise…

Machine Learning · Computer Science 2025-07-31 Yuval Grinberg , Nimrod Harel , Jacob Goldberger , Ofir Lindenbaum

Deep Neural Networks (DNNs) have been shown to be susceptible to memorization or overfitting in the presence of noisily-labelled data. For the problem of robust learning under such noisy data, several algorithms have been proposed. A…

Machine Learning · Computer Science 2022-12-06 Deep Patel , P. S. Sastry

Foundation models (FMs) pretrained on large datasets have become fundamental for various downstream machine learning tasks, in particular in scenarios where obtaining perfectly labeled data is prohibitively expensive. In this paper, we…

Machine Learning · Computer Science 2025-08-04 Ecem Bozkurt , Antonio Ortega

Feature selection (FS) is an important research topic in machine learning. Usually, FS is modelled as a bi-objective optimization problem whose objectives are: 1) classification accuracy; 2) number of features. One of the main issues in…

Artificial Intelligence · Computer Science 2021-04-21 Yu Xue , Yihang Tang , Xin Xu , Jiayu Liang , Ferrante Neri

In recent years, deep neural networks (DNNs) have gained remarkable achievement in computer vision tasks, and the success of DNNs often depends greatly on the richness of data. However, the acquisition process of data and high-quality…

Computer Vision and Pattern Recognition · Computer Science 2024-04-08 Mengting Li , Chuang Zhu

Label noise is ubiquitous in various machine learning scenarios such as self-labeling with model predictions and erroneous data annotation. Many existing approaches are based on heuristics such as sample losses, which might not be flexible…

Machine Learning · Computer Science 2022-12-29 Zhihao Wang , Zongyu Lin , Peiqi Liu , Guidong ZHeng , Junjie Wen , Xianxin Chen , Yujun Chen , Zhilin Yang

The success of Deep Neural Network (DNN) models significantly depends on the quality of provided annotations. In medical image segmentation, for example, having multiple expert annotations for each data point is common to minimize…

Computer Vision and Pattern Recognition · Computer Science 2025-02-12 Asma Ahmed Hashmi , Aigerim Zhumabayeva , Nikita Kotelevskii , Artem Agafonov , Mohammad Yaqub , Maxim Panov , Martin Takáč

High quality labeled datasets have allowed deep learning to achieve impressive results on many sound analysis tasks. Yet, it is labor-intensive to accurately annotate large amount of audio data, and the dataset may contain noisy labels in…

Audio and Speech Processing · Electrical Eng. & Systems 2020-07-17 Boqing Zhu , Kele Xu , Qiuqiang Kong , Huaimin Wang , Yuxing Peng

Feature selection has remained a daunting challenge in machine learning and artificial intelligence, where increasingly complex, high-dimensional datasets demand principled strategies for isolating the most informative predictors. Despite…

Machine Learning · Statistics 2025-12-02 Mousam Sinha , Tirtha Sarathi Ghosh , Ridam Pal