Related papers: Feature Engineering for Predictive Modeling using …

An Empirical Analysis of Feature Engineering for Predictive Modeling

Machine learning models, such as neural networks, decision trees, random forests, and gradient boosting machines, accept a feature vector, and provide a prediction. These models learn in a supervised fashion where we provide feature vectors…

Machine Learning · Computer Science 2020-11-03 Jeff Heaton

Self-Optimizing Feature Transformation

Feature transformation aims to extract a good representation (feature) space by mathematically transforming existing features. It is crucial to address the curse of dimensionality, enhance model generalization, overcome data sparsity, and…

Machine Learning · Computer Science 2022-12-26 Meng Xiao , Dongjie Wang , Min Wu , Kunpeng Liu , Hui Xiong , Yuanchun Zhou , Yanjie Fu

Augmenting data-driven models for energy systems through feature engineering: A Python framework for feature engineering

Data-driven modeling is an approach in energy systems modeling that has been gaining popularity. In data-driven modeling, machine learning methods such as linear regression, neural networks or decision-tree based methods are being applied.…

Machine Learning · Computer Science 2023-01-05 Sandra Wilfling

Learned Feature Importance Scores for Automated Feature Engineering

Feature engineering has demonstrated substantial utility for many machine learning workflows, such as in the small data regime or when distribution shifts are severe. Thus automating this capability can relieve much manual effort and…

Machine Learning · Computer Science 2024-06-07 Yihe Dong , Sercan Arik , Nathanael Yoder , Tomas Pfister

MACFE: A Meta-learning and Causality Based Feature Engineering Framework

Feature engineering has become one of the most important steps to improve model prediction performance, and to produce quality datasets. However, this process requires non-trivial domain-knowledge which involves a time-consuming process.…

Machine Learning · Computer Science 2022-07-11 Ivan Reyes-Amezcua , Daniel Flores-Araiza , Gilberto Ochoa-Ruiz , Andres Mendez-Vazquez , Eduardo Rodriguez-Tello

Feature Interaction Aware Automated Data Representation Transformation

Creating an effective representation space is crucial for mitigating the curse of dimensionality, enhancing model generalization, addressing data sparsity, and leveraging classical models more effectively. Recent advancements in automated…

Machine Learning · Computer Science 2024-01-17 Ehtesamul Azim , Dongjie Wang , Kunpeng Liu , Wei Zhang , Yanjie Fu

Projective Inference in High-dimensional Problems: Prediction and Feature Selection

This paper discusses predictive inference and feature selection for generalized linear models with scarce but high-dimensional data. We argue that in many cases one can benefit from a decision theoretically justified two-stage approach:…

Machine Learning · Statistics 2020-11-09 Juho Piironen , Markus Paasiniemi , Aki Vehtari

DIFER: Differentiable Automated Feature Engineering

Feature engineering, a crucial step of machine learning, aims to extract useful features from raw data to improve data quality. In recent years, great efforts have been devoted to Automated Feature Engineering (AutoFE) to replace expensive…

Machine Learning · Computer Science 2022-10-11 Guanghui Zhu , Zhuoer Xu , Xu Guo , Chunfeng Yuan , Yihua Huang

Reinforcement-Enhanced Autoregressive Feature Transformation: Gradient-steered Search in Continuous Space for Postfix Expressions

Feature transformation aims to generate new pattern-discriminative feature space from original features to improve downstream machine learning (ML) task performances. However, the discrete search space for the optimal feature explosively…

Machine Learning · Computer Science 2023-09-26 Dongjie Wang , Meng Xiao , Min Wu , Pengfei Wang , Yuanchun Zhou , Yanjie Fu

A reinforced learning approach to optimal design under model uncertainty

Optimal designs are usually model-dependent and likely to be sub-optimal if the postulated model is not correctly specified. In practice, it is common that a researcher has a list of candidate models at hand and a design has to be found…

Statistics Theory · Mathematics 2023-03-29 Mingyao Ai , Holger Dette , Zhengfu Liu , Jun Yu

Embedded Constrained Feature Construction for High-Energy Physics Data Classification

Before any publication, data analysis of high-energy physics experiments must be validated. This validation is granted only if a perfect understanding of the data and the analysis process is demonstrated. Therefore, physicists prefer using…

Machine Learning · Computer Science 2019-12-18 Noëlie Cherrier , Maxime Defurne , Jean-Philippe Poli , Franck Sabatié

Comprehensive Overview of Reward Engineering and Shaping in Advancing Reinforcement Learning Applications

The aim of Reinforcement Learning (RL) in real-world applications is to create systems capable of making autonomous decisions by learning from their environment through trial and error. This paper emphasizes the importance of reward…

Machine Learning · Computer Science 2024-12-31 Sinan Ibrahim , Mostafa Mostafa , Ali Jnadi , Hadi Salloum , Pavel Osinenko

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity

Reinforcement learning provides an automated framework for learning behaviors from high-level reward specifications, but in practice the choice of reward function can be crucial for good results -- while in principle the reward only needs…

Machine Learning · Computer Science 2022-10-19 Abhishek Gupta , Aldo Pacchiano , Yuexiang Zhai , Sham M. Kakade , Sergey Levine

REFRESH: Responsible and Efficient Feature Reselection Guided by SHAP Values

Feature selection is a crucial step in building machine learning models. This process is often achieved with accuracy as an objective, and can be cumbersome and computationally expensive for large-scale datasets. Several additional model…

Machine Learning · Computer Science 2024-03-15 Shubham Sharma , Sanghamitra Dutta , Emanuele Albini , Freddy Lecue , Daniele Magazzeni , Manuela Veloso

Toward Efficient Automated Feature Engineering

Automated Feature Engineering (AFE) refers to automatically generate and select optimal feature sets for downstream tasks, which has achieved great success in real-world applications. Current AFE methods mainly focus on improving the…

Machine Learning · Computer Science 2022-12-27 Kafeng Wang , Pengyang Wang , Chengzhong xu

FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies

Feature Transformation is crucial for classic machine learning that aims to generate feature combinations to enhance the performance of downstream tasks from a data-centric perspective. Current methodologies, such as manual expert-driven…

Machine Learning · Computer Science 2025-03-27 Tianqi He , Xiaohan Huang , Yi Du , Qingqing Long , Ziyue Qiao , Min Wu , Yanjie Fu , Yuanchun Zhou , Meng Xiao

Predicting purchasing intent: Automatic Feature Learning using Recurrent Neural Networks

We present a neural network for predicting purchasing intent in an Ecommerce setting. Our main contribution is to address the significant investment in feature engineering that is usually associated with state-of-the-art methods such as…

Machine Learning · Computer Science 2018-07-24 Humphrey Sheil , Omer Rana , Ronan Reilly

Fast Prediction of New Feature Utility

We study the new feature utility prediction problem: statistically testing whether adding a new feature to the data representation can improve predictive accuracy on a supervised learning task. In many applications, identifying new…

Machine Learning · Computer Science 2012-06-22 Hoyt Koepke , Mikhail Bilenko

Model predictive control-based value estimation for efficient reinforcement learning

Reinforcement learning suffers from limitations in real practices primarily due to the number of required interactions with virtual environments. It results in a challenging problem because we are implausible to obtain a local optimal…

Machine Learning · Computer Science 2024-10-28 Qizhen Wu , Kexin Liu , Lei Chen

A Theory of Feature Learning

Feature Learning aims to extract relevant information contained in data sets in an automated fashion. It is driving force behind the current deep learning trend, a set of methods that have had widespread empirical success. What is lacking…

Machine Learning · Statistics 2015-04-02 Brendan van Rooyen , Robert C. Williamson