Related papers: Consistent Feature Construction with Constrained G…

200 papers

Before any publication, data analysis of high-energy physics experiments must be validated. This validation is granted only if a perfect understanding of the data and the analysis process is demonstrated. Therefore, physicists prefer using…

Machine Learning · Computer Science 2019-12-18 Noëlie Cherrier, Maxime Defurne, Jean-Philippe Poli, Franck Sabatié

Machine learning models, such as neural networks, decision trees, random forests, and gradient boosting machines, accept a feature vector, and provide a prediction. These models learn in a supervised fashion where we provide feature vectors…

Machine Learning · Computer Science 2020-11-03 Jeff Heaton

Feature engineering is a crucial step in the process of predictive modeling. It involves the transformation of given feature space, typically using mathematical functions, with the objective of reducing the modeling error for a given…

Artificial Intelligence · Computer Science 2017-09-22 Udayan Khurana, Horst Samulowitz, Deepak Turaga

Complex networks theory has commonly been used for modelling and understanding the interactions taking place between the elements composing complex systems. More recently, the use of generative models has gained momentum, as they allow…

Physics and Society · Physics 2016-05-19 Massimiliano Zanin, Marco Correia, Pedro A. C. Sousa, Jorge Cruz

For classification problems, feature extraction is a crucial process which aims to find a suitable data representation that increases the performance of the machine learning algorithm. According to the curse of dimensionality theorem, the…

Machine Learning · Computer Science 2010-10-12 Ilknur Icke, Andrew Rosenberg

Despite the widely reported success of embedding-based machine learning methods on natural language processing tasks, the use of more easily interpreted engineered features remains common in fields such as cognitive impairment (CI)…

Machine Learning · Computer Science 2020-10-14 Benjamin Eyre, Aparna Balagopalan, Jekaterina Novikova

Feature-based format is the main data representation format used by machine learning algorithms. When the features do not properly describe the initial data, performance starts to degrade. Some algorithms address this problem by internally…

Artificial Intelligence · Computer Science 2015-12-18 Marian-Andrei Rizoiu, Julien Velcin, Stéphane Lallich

Recently, feature selection has become an increasingly important area of research due to the surge in high-dimensional datasets in all areas of modern life. A plethora of feature selection algorithms have been proposed, but it is difficult…

Neural and Evolutionary Computing · Computer Science 2019-10-24 Andrew Lensen, Bing Xue, Mengjie Zhang

Clustering is a difficult and widely-studied data mining task, with many varieties of clustering algorithms proposed in the literature. Nearly all algorithms use a similarity measure such as a distance metric (e.g. Euclidean distance) to…

Neural and Evolutionary Computing · Computer Science 2019-10-24 Andrew Lensen, Bing Xue, Mengjie Zhang

Combining multiple audio features can improve the performance of music tagging, but common deep learning-based feature fusion methods often lack interpretability. To address this problem, we propose a Genetic Programming (GP) pipeline that…

Formal methods apply algorithms based on mathematical principles to enhance the reliability of systems. It would only be natural to try to progress from verification, model checking or testing a system against its formal specification into…

Software Engineering · Computer Science 2014-02-28 Gal Katz, Doron Peled

Constructing high-quality features is critical to any quantitative data analysis. While feature engineering was historically addressed by carefully hand-crafting data representations based on domain expertise, deep neural networks (DNNs)…

Machine Learning · Computer Science 2025-02-25 Max Vargas, Reilly Cannon, Andrew Engel, Anand D. Sarwate, Tony Chiang

Computational protein design facilitates discovery of novel proteins with prescribed structure and functionality. Exciting designs were recently reported using novel data-driven methodologies that can be roughly divided into two categories:…

Biological Physics · Physics 2023-03-28 Cyril Malbranke, David Bikard, Simona Cocco, Rémi Monasson, Jérôme Tubiana

We introduce genetic algorithms as a means to estimate the accuracy required to discriminate among different models using experimental observables. We exemplify the technique in the context of the minimal supersymmetric standard model. If…

High Energy Physics - Phenomenology · Physics 2009-11-10 B. C. Allanach, D. Grellscheid, F. Quevedo

This document introduces basics in data preparation, feature selection and learning basics for high energy physics tasks. The emphasis is on feature selection by principal component analysis, information gain and significance measures for…

Data Analysis, Statistics and Probability · Physics 2008-03-18 Anselm Vossen

Feature selection (FS) is assumed to improve predictive performance and identify meaningful features in high-dimensional datasets. Surprisingly, small random subsets of features (0.02-1%) match or outperform the predictive performance of…

Machine Learning · Computer Science 2025-09-22 Bhavesh Neekhra, Debayan Gupta, Partha Pratim Chakrabarti

We propose and study a method for learning interpretable representations for the task of regression. Features are represented as networks of multi-type expression trees comprised of activation functions common in neural networks in addition…

Neural and Evolutionary Computing · Computer Science 2019-03-26 William La Cava, Tilak Raj Singh, James Taggart, Srinivas Suri, Jason H. Moore

Feature construction can substantially improve the accuracy of Machine Learning (ML) algorithms. Genetic Programming (GP) has been proven to be effective at this task by evolving non-linear combinations of input features. GP additionally…

Neural and Evolutionary Computing · Computer Science 2020-01-13 Marco Virgolin, Tanja Alderliesten, Peter A. N. Bosman

The research community continues to seek increasingly more advanced synthetic data generators to reliably evaluate the strengths and limitations of machine learning methods. This work aims to increase the availability of datasets…

Machine Learning · Computer Science 2026-01-30 Joanna Komorniczak

When humans perform inductive learning, they often enhance the process with background knowledge. With the increasing availability of well-formed collaborative knowledge bases, the performance of learning algorithms could be significantly…

Artificial Intelligence · Computer Science 2018-02-02 Lior Friedman, Shaul Markovitch