Related papers: MACFE: A Meta-learning and Causality Based Feature…

Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning

Automated feature engineering (AFE) enables AI systems to autonomously construct high-utility representations from raw tabular data. However, existing AFE methods rely on statistical heuristics, yielding brittle features that fail under…

Artificial Intelligence · Computer Science 2026-02-19 Arun Vignesh Malarkkan , Wangyang Ying , Yanjie Fu

Characterizing classification datasets: a study of meta-features for meta-learning

Meta-learning is increasingly used to support the recommendation of machine learning algorithms and their configurations. Such recommendations are made based on meta-data, consisting of performance evaluations of algorithms on prior…

Machine Learning · Computer Science 2019-08-27 Adriano Rivolli , Luís P. F. Garcia , Carlos Soares , Joaquin Vanschoren , André C. P. L. F. de Carvalho

Toward Efficient Automated Feature Engineering

Automated Feature Engineering (AFE) refers to automatically generate and select optimal feature sets for downstream tasks, which has achieved great success in real-world applications. Current AFE methods mainly focus on improving the…

Machine Learning · Computer Science 2022-12-27 Kafeng Wang , Pengyang Wang , Chengzhong xu

Large Language Models for Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering

As the field of automated machine learning (AutoML) advances, it becomes increasingly important to incorporate domain knowledge into these systems. We present an approach for doing so by harnessing the power of large language models (LLMs).…

Artificial Intelligence · Computer Science 2023-10-02 Noah Hollmann , Samuel Müller , Frank Hutter

Causality-based Feature Selection: Methods and Evaluations

Feature selection is a crucial preprocessing step in data analytics and machine learning. Classical feature selection algorithms select features based on the correlations between predictive features and the class variable and do not attempt…

Machine Learning · Computer Science 2019-11-19 Kui Yu , Xianjie Guo , Lin Liu , Jiuyong Li , Hao Wang , Zhaolong Ling , Xindong Wu

Feature Engineering for Predictive Modeling using Reinforcement Learning

Feature engineering is a crucial step in the process of predictive modeling. It involves the transformation of given feature space, typically using mathematical functions, with the objective of reducing the modeling error for a given…

Artificial Intelligence · Computer Science 2017-09-22 Udayan Khurana , Horst Samulowitz , Deepak Turaga

IIFE: Interaction Information Based Automated Feature Engineering

Automated feature engineering (AutoFE) is the process of automatically building and selecting new features that help improve downstream predictive performance. While traditional feature engineering requires significant domain expertise and…

Machine Learning · Computer Science 2025-02-28 Tom Overman , Diego Klabjan , Jean Utke

LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers

Automated feature engineering plays a critical role in improving predictive model performance for tabular learning tasks. Traditional automated feature engineering methods are limited by their reliance on pre-defined transformations within…

Machine Learning · Computer Science 2026-05-12 Nikhil Abhyankar , Parshin Shojaee , Chandan K. Reddy

Causality and Robust Optimization

A decision-maker must consider cofounding bias when attempting to apply machine learning prediction, and, while feature selection is widely recognized as important process in data-analysis, it could cause cofounding bias. A causal Bayesian…

Machine Learning · Statistics 2020-03-02 Akihiro Yabe

Causal Feature Selection via Transfer Entropy

Machine learning algorithms are designed to capture complex relationships between features. In this context, the high dimensionality of data often results in poor model performance, with the risk of overfitting. Feature selection, the…

Machine Learning · Computer Science 2023-10-18 Paolo Bonetti , Alberto Maria Metelli , Marcello Restelli

OpenFE: Automated Feature Generation with Expert-level Performance

The goal of automated feature generation is to liberate machine learning experts from the laborious task of manual feature generation, which is crucial for improving the learning performance of tabular data. The major challenge in automated…

Machine Learning · Computer Science 2023-06-06 Tianping Zhang , Zheyu Zhang , Zhiyuan Fan , Haoyan Luo , Fengyuan Liu , Qian Liu , Wei Cao , Jian Li

DIFER: Differentiable Automated Feature Engineering

Feature engineering, a crucial step of machine learning, aims to extract useful features from raw data to improve data quality. In recent years, great efforts have been devoted to Automated Feature Engineering (AutoFE) to replace expensive…

Machine Learning · Computer Science 2022-10-11 Guanghui Zhu , Zhuoer Xu , Xu Guo , Chunfeng Yuan , Yihua Huang

A Feature-Driven Framework for Software Fault Prediction

Software fault prediction (SFP) is a critical task in software engineering, enabling early identification of faults in modules to improve software quality and reduce maintenance costs. This research investigates the combined effects of…

Software Engineering · Computer Science 2026-05-19 Ahmad Nauman Ghazi , Nagajyothi Devarapalli , Ashir Javeed , Sadi Alawadi , Fahed Alkhabbas , Khalid AlKharabsheh

From Dependence to Causation

Machine learning is the science of discovering statistical dependencies in data, and the use of those dependencies to perform predictions. During the last decade, machine learning has made spectacular progress, surpassing human performance…

Machine Learning · Statistics 2016-07-13 David Lopez-Paz

Metalearning for Feature Selection

A general formulation of optimization problems in which various candidate solutions may use different feature-sets is presented, encompassing supervised classification, automated program learning and other cases. A novel characterization of…

Machine Learning · Computer Science 2017-03-22 Ben Goertzel , Nil Geisweiller , Chris Poulin

"FRAME: Forward Recursive Adaptive Model Extraction-A Technique for Advance Feature Selection"

The challenges in feature selection, particularly in balancing model accuracy, interpretability, and computational efficiency, remain a critical issue in advancing machine learning methodologies. To address these complexities, this study…

Machine Learning · Computer Science 2026-01-06 Nachiket Kapure , Harsh Joshi , Parul Kumari , Rajeshwari Mistri , Manasi Mali

Federated Automated Feature Engineering

Automated feature engineering (AutoFE) is used to automatically create new features from original features to improve predictive performance without needing significant human intervention and domain expertise. Many algorithms exist for…

Machine Learning · Computer Science 2025-04-23 Tom Overman , Diego Klabjan

Fantastic Features and Where to Find Them: Detecting Cognitive Impairment with a Subsequence Classification Guided Approach

Despite the widely reported success of embedding-based machine learning methods on natural language processing tasks, the use of more easily interpreted engineered features remains common in fields such as cognitive impairment (CI)…

Machine Learning · Computer Science 2020-10-14 Benjamin Eyre , Aparna Balagopalan , Jekaterina Novikova

Bridging the Gap Between Data-Driven And Theory-Driven Modelling - Leveraging Causal Machine Learning for Integrative Modelling of Dynamical Systems

Classical machine learning techniques often struggle with overfitting and unreliable predictions when exposed to novel conditions. Introducing causality into the modelling process offers a promising way to mitigate these challenges by…

Computational Engineering, Finance, and Science · Computer Science 2025-05-28 David Zapata Gonzalez , Marcel Meyer , Oliver Mueller

AEFE: Automatic Embedded Feature Engineering for Categorical Features

The challenge of solving data mining problems in e-commerce applications such as recommendation system (RS) and click-through rate (CTR) prediction is how to make inferences by constructing combinatorial features from a large number of…

Machine Learning · Computer Science 2021-10-20 Zhenyuan Zhong , Jie Yang , Yacong Ma , Shoubin Dong , Jinlong Hu