Related papers: Model Reframing by Feature Context Change

Test cost and misclassification cost trade-off using reframing

Many solutions to cost-sensitive classification (and regression) rely on some or all of the following assumptions: we have complete knowledge about the cost context at training time, we can easily re-train whenever the cost context changes,…

Machine Learning · Computer Science 2013-05-31 Celestine Periale Maguedong-Djoumessi , José Hernández-Orallo

Cost-Sensitive Feature Selection of Data with Errors

In data mining applications, feature selection is an essential process since it reduces a model's complexity. The cost of obtaining the feature values must be taken into consideration in many domains. In this paper, we study the…

Machine Learning · Computer Science 2013-06-04 Hong Zhao , Fan Min , William Zhu

Classification with Costly Features as a Sequential Decision-Making Problem

This work focuses on a specific classification problem, where the information about a sample is not readily available, but has to be acquired for a cost, and there is a per-sample budget. Inspired by real-world use-cases, we analyze average…

Machine Learning · Computer Science 2020-03-05 Jaromír Janisch , Tomáš Pevný , Viliam Lisý

Cost-Sensitive Feature-Value Acquisition Using Feature Relevance

In many real-world machine learning problems, feature values are not readily available. To make predictions, some of the missing features have to be acquired, which can incur a cost in money, computational time, or human time, depending on…

Machine Learning · Computer Science 2019-12-20 Kimmo Kärkkäinen , Mohammad Kachuee , Orpaz Goldstein , Majid Sarrafzadeh

Scalable Decision-Focused Learning through Cost-Sensitive Regression

Many real-world combinatorial problems involve uncertain parameters, which can be predicted given contextual features and historical data. These `predict-then-optimize' or `contextual optimization' problems have gained significant…

Machine Learning · Computer Science 2026-05-19 Noah Schutte , Senne Berden , Tias Guns , Krzysztof Postek , Neil Yorke-Smith

Sharing pattern submodels for prediction with missing values

Missing values are unavoidable in many applications of machine learning and present challenges both during training and at test time. When variables are missing in recurring patterns, fitting separate pattern submodels have been proposed as…

Machine Learning · Computer Science 2023-11-27 Lena Stempfle , Ashkan Panahi , Fredrik D. Johansson

Feature Engineering for Predictive Modeling using Reinforcement Learning

Feature engineering is a crucial step in the process of predictive modeling. It involves the transformation of given feature space, typically using mathematical functions, with the objective of reducing the modeling error for a given…

Artificial Intelligence · Computer Science 2017-09-22 Udayan Khurana , Horst Samulowitz , Deepak Turaga

REFRESH: Responsible and Efficient Feature Reselection Guided by SHAP Values

Feature selection is a crucial step in building machine learning models. This process is often achieved with accuracy as an objective, and can be cumbersome and computationally expensive for large-scale datasets. Several additional model…

Machine Learning · Computer Science 2024-03-15 Shubham Sharma , Sanghamitra Dutta , Emanuele Albini , Freddy Lecue , Daniele Magazzeni , Manuela Veloso

Feature selection with test cost constraint

Feature selection is an important preprocessing step in machine learning and data mining. In real-world applications, costs, including money, time and other resources, are required to acquire the features. In some cases, there is a test…

Artificial Intelligence · Computer Science 2013-05-22 Fan Min , Qinghua Hu , William Zhu

Sampling-guided exploration of active feature selection policies

Determining the most appropriate features for machine learning predictive models is challenging regarding performance and feature acquisition costs. In particular, global feature choice is limited given that some features will only benefit…

Machine Learning · Computer Science 2026-03-17 Gabriel Bernardino , Anders Jonsson , Patrick Clarysse , Nicolas Duchateau

Estimation under Model Misspecification with Fake Features

We consider estimation under model misspecification where there is a model mismatch between the underlying system, which generates the data, and the model used during estimation. We propose a model misspecification framework which enables a…

Signal Processing · Electrical Eng. & Systems 2023-02-22 Martin Hellkvist , Ayça Özçelikkale , Anders Ahlén

Cost-sensitive Selection of Variables by Ensemble of Model Sequences

Many applications require the collection of data on different variables or measurements over many system performance metrics. We term those broadly as measures or variables. Often data collection along each measure incurs a cost, thus it is…

Methodology · Statistics 2021-11-30 Donghui Yan , Zhiwei Qin , Songxiang Gu , Haiping Xu , Ming Shao

Classification with Costly Features in Hierarchical Deep Sets

Classification with Costly Features (CwCF) is a classification problem that includes the cost of features in the optimization criteria. Individually for each sample, its features are sequentially acquired to maximize accuracy while…

Machine Learning · Computer Science 2024-07-17 Jaromír Janisch , Tomáš Pevný , Viliam Lisý

Opportunistic Learning: Budgeted Cost-Sensitive Learning from Data Streams

In many real-world learning scenarios, features are only acquirable at a cost constrained under a budget. In this paper, we propose a novel approach for cost-sensitive feature acquisition at the prediction-time. The suggested method…

Machine Learning · Computer Science 2019-02-19 Mohammad Kachuee , Orpaz Goldstein , Kimmo Karkkainen , Sajad Darabi , Majid Sarrafzadeh

Controlling Costs: Feature Selection on a Budget

The traditional framework for feature selection treats all features as costing the same amount. However, in reality, a scientist often has considerable discretion regarding which variables to measure, and the decision involves a tradeoff…

Methodology · Statistics 2023-02-14 Guo Yu , Daniela Witten , Jacob Bien

Cost-Sensitive Feature Selection by Optimizing F-Measures

Feature selection is beneficial for improving the performance of general machine learning tasks by extracting an informative subset from the high-dimensional features. Conventional feature selection methods usually ignore the class…

Computer Vision and Pattern Recognition · Computer Science 2019-04-05 Meng Liu , Chang Xu , Yong Luo , Chao Xu , Yonggang Wen , Dacheng Tao

Feature and Variable Selection in Classification

The amount of information in the form of features and variables avail- able to machine learning algorithms is ever increasing. This can lead to classifiers that are prone to overfitting in high dimensions, high di- mensional models do not…

Machine Learning · Computer Science 2014-02-12 Aaron Karper

It's All in the Mix: Wasserstein Classification and Regression with Mixed Features

Problem definition: A key challenge in supervised learning is data scarcity, which can cause prediction models to overfit to the training data and perform poorly out of sample. A contemporary approach to combat overfitting is offered by…

Optimization and Control · Mathematics 2025-10-10 Reza Belbasi , Aras Selvi , Wolfram Wiesemann

Cost-sensitive Feature Selection for Support Vector Machines

Feature Selection is a crucial procedure in Data Science tasks such as Classification, since it identifies the relevant variables, making thus the classification procedures more interpretable, cheaper in terms of measurement and more…

Machine Learning · Statistics 2024-01-17 Sandra Benítez-Peña , Rafael Blanquero , Emilio Carrizosa , Pepa Ramírez-Cobo

Tests for qualitative features in the random coefficients model

The random coefficients model is an extension of the linear regression model that allows for unobserved heterogeneity in the population by modeling the regression coefficients as random variables. Given data from this model, the statistical…

Methodology · Statistics 2018-03-15 Fabian Dunker , Konstantin Eckle , Katharina Proksch , Johannes Schmidt-Hieber