Related papers: Attribute Efficient Linear Regression with Data-De…

Optimal Algorithms for Ridge and Lasso Regression with Partially Observed Attributes

We consider the most common variants of linear regression, including Ridge, Lasso and Support-vector regression, in a setting where the learner is allowed to observe only a fixed number of attributes of each example at training time. We…

Machine Learning · Computer Science 2015-03-19 Elad Hazan , Tomer Koren

Linear Regression with Limited Observation

We consider the most common variants of linear regression, including Ridge, Lasso and Support-vector regression, in a setting where the learner is allowed to observe only a fixed number of attributes of each example at training time. We…

Machine Learning · Computer Science 2012-06-22 Elad Hazan , Tomer Koren

Efficient Learning with Partially Observed Attributes

We describe and analyze efficient algorithms for learning a linear predictor from examples when the learner can only view a few attributes of each training example. This is the case, for instance, in medical research, where each patient…

Machine Learning · Computer Science 2010-04-29 Nicolò Cesa-Bianchi , Shai Shalev-Shwartz , Ohad Shamir

Learning Feature Nonlinearities with Non-Convex Regularized Binned Regression

For various applications, the relations between the dependent and independent variables are highly nonlinear. Consequently, for large scale complex problems, neural networks and regression trees are commonly preferred over linear models…

Machine Learning · Computer Science 2017-05-23 Samet Oymak , Mehrdad Mahdavi , Jiasi Chen

Feature Adaptation for Sparse Linear Regression

Sparse linear regression is a central problem in high-dimensional statistics. We study the correlated random design setting, where the covariates are drawn from a multivariate Gaussian $N(0,\Sigma)$, and we seek an estimator with small…

Data Structures and Algorithms · Computer Science 2023-05-29 Jonathan Kelner , Frederic Koehler , Raghu Meka , Dhruv Rohatgi

High-dimensional regression with potential prior information on variable importance

There are a variety of settings where vague prior information may be available on the importance of predictors in high-dimensional regression settings. Examples include ordering on the variables offered by their empirical variances (which…

Methodology · Statistics 2022-05-20 Benjamin G. Stokell , Rajen D. Shah

Cooperative Thresholded Lasso for Sparse Linear Bandit

We present a novel approach to address the multi-agent sparse contextual linear bandit problem, in which the feature vectors have a high dimension $d$ whereas the reward function depends on only a limited set of features - precisely $s_0…

Machine Learning · Computer Science 2023-05-31 Haniyeh Barghi , Xiaotong Cheng , Setareh Maghsudi

Consistency of Extreme Learning Machines and Regression under Non-Stationarity and Dependence for ML-Enhanced Moving Objects

Supervised learning by extreme learning machines resp. neural networks with random weights is studied under a non-stationary spatial-temporal sampling design which especially addresses settings where an autonomous object moving in a…

Machine Learning · Statistics 2021-09-02 Ansgar Steland

Online Active Linear Regression via Thresholding

We consider the problem of online active learning to collect data for regression modeling. Specifically, we consider a decision maker with a limited experimentation budget who must efficiently learn an underlying linear population model.…

Machine Learning · Statistics 2016-12-22 Carlos Riquelme , Ramesh Johari , Baosen Zhang

Linear Time Feature Selection for Regularized Least-Squares

We propose a novel algorithm for greedy forward feature selection for regularized least-squares (RLS) regression and classification, also known as the least-squares support vector machine or ridge regression. The algorithm, which we call…

Machine Learning · Statistics 2010-03-19 Tapio Pahikkala , Antti Airola , Tapio Salakoski

Sample Efficient Stochastic Gradient Iterative Hard Thresholding Method for Stochastic Sparse Linear Regression with Limited Attribute Observation

We develop new stochastic gradient methods for efficiently solving sparse linear regression in a partial attribute observation setting, where learners are only allowed to observe a fixed number of actively chosen attributes per example at…

Optimization and Control · Mathematics 2018-12-04 Tomoya Murata , Taiji Suzuki

Adaptive Decision-Making with Constraints and Dependent Losses: Performance Guarantees and Applications to Online and Nonlinear Identification

We consider adaptive decision-making problems where an agent optimizes a cumulative performance objective by repeatedly choosing among a finite set of options. Compared to the classical prediction-with-expert-advice set-up, we consider…

Machine Learning · Computer Science 2023-04-10 Michael Muehlebach

Resource Constrained Structured Prediction

We study the problem of structured prediction under test-time budget constraints. We propose a novel approach applicable to a wide range of structured prediction problems in computer vision and natural language processing. Our approach…

Machine Learning · Statistics 2016-06-09 Tolga Bolukbasi , Kai-Wei Chang , Joseph Wang , Venkatesh Saligrama

Robust Learning of Trimmed Estimators via Manifold Sampling

We adapt a manifold sampling algorithm for the nonsmooth, nonconvex formulations of learning that arise when imposing robustness to outliers present in the training data. We demonstrate the approach on objectives based on trimmed loss.…

Optimization and Control · Mathematics 2018-07-10 Matt Menickelly , Stefan M. Wild

Online Budgeted Learning for Classifier Induction

In real-world machine learning applications, there is a cost associated with sampling of different features. Budgeted learning can be used to select which feature-values to acquire from each instance in a dataset, such that the best model…

Machine Learning · Computer Science 2019-03-14 Eran Fainman , Bracha Shapira , Lior Rokach , Yisroel Mirsky

High-dimensional Linear Regression for Dependent Data with Applications to Nowcasting

Recent research has focused on $\ell_1$ penalized least squares (Lasso) estimators for high-dimensional linear regressions in which the number of covariates $p$ is considerably larger than the sample size $n$. However, few studies have…

Statistics Theory · Mathematics 2022-05-05 Yuefeng Han , Ruey S. Tsay

Dependent relevance determination for smooth and structured sparse regression

In many problem settings, parameter vectors are not merely sparse but dependent in such a way that non-zero coefficients tend to cluster together. We refer to this form of dependency as "region sparsity." Classical sparse regression…

Machine Learning · Statistics 2019-01-28 Anqi Wu , Oluwasanmi Koyejo , Jonathan W. Pillow

Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient

This paper provides a statistical analysis of high-dimensional batch Reinforcement Learning (RL) using sparse linear function approximation. When there is a large number of candidate features, our result sheds light on the fact that…

Machine Learning · Computer Science 2020-11-10 Botao Hao , Yaqi Duan , Tor Lattimore , Csaba Szepesvári , Mengdi Wang

Regression modeling on stratified data with the lasso

We consider the estimation of regression models on strata defined using a categorical covariate, in order to identify interactions between this categorical covariate and the other predictors. A basic approach requires the choice of a…

Statistics Theory · Mathematics 2016-11-09 Edouard Ollier , Vivian Viallon

The Stochastic Replica Approach to Machine Learning: Stability and Parameter Optimization

We introduce a statistical physics inspired supervised machine learning algorithm for classification and regression problems. The method is based on the invariances or stability of predicted results when known data is represented as…

Machine Learning · Statistics 2018-11-19 Patrick Chao , Tahereh Mazaheri , Bo Sun , Nicholas B. Weingartner , Zohar Nussinov