Trevor Hastie — Scifaro

Enhancing a Risk Model by Adding Transient Statistical Factors

Estimating the covariance of asset returns, i.e., the risk model, is a key component of financial portfolio construction and evaluation. Most risk modeling approaches produce a factor model that decomposes the asset variability into two…

Applications · Statistics 2026-05-14 Alexandros E. Tzikas , Emmanuel J. Candès , Trevor Hastie , Stephen P. Boyd , Mykel J. Kochenderfer , Ronald N. Kahn

balnet: Pathwise Estimation of Covariate Balancing Propensity Scores

We present balnet, an R package for scalable pathwise estimation of covariate balancing propensity scores via logistic covariate balancing loss functions. Regularization paths are computed with Yang and Hastie (2024)'s generic elastic net…

Methodology · Statistics 2026-04-24 Erik Sverdrup , Trevor Hastie

Single-Asset Adaptive Leveraged Volatility Control

This paper introduces a methodology for constructing a market index composed of a liquid risky asset and a liquid risk-free asset that achieves a fixed target volatility. Existing volatility-targeting strategies typically scale portfolio…

Portfolio Management · Quantitative Finance 2026-04-01 Nikhil Devanathan , Dylan Rueter , Stephen Boyd , Emmanuel Candès , Trevor Hastie , Mykel J. Kochenderfer , Arpit Apoorv , David Soronow , Igor Zamkovsky

Adaptive Strategies for Pension Fund Management

This paper proposes a simulation-based framework for assessing and improving the performance of a pension fund management scheme. This framework is modular and allows the definition of customized performance metrics that are used to assess…

Optimization and Control · Mathematics 2026-03-17 Raphael Chinchilla , Thomas D. Rueter , Timothy R. McDade , Peter R. Fisher , Emmanuel Candes , Trevor Hastie , Stephen Boyd

Univariate-Guided Sparse Regression for Biobank-Scale High-Dimensional Omics Data

We present a scalable framework for computing polygenic risk scores (PRS) in high-dimensional genomic settings using the recently introduced Univariate-Guided Sparse Regression (uniLasso). UniLasso is a two-stage penalized regression…

Methodology · Statistics 2026-01-23 Joshua Richland , Tuomo Kiiskinen , William Wang , Sophia Lu , Balasubramanian Narasimhan , Trevor Hastie , Manuel Rivas , Robert Tibshirani

Factor Fitting, Rank Allocation, and Partitioning in Multilevel Low Rank Matrices

We consider multilevel low rank (MLR) matrices, defined as a row and column permutation of a sum of matrices, each one a block diagonal refinement of the previous one, with all blocks low rank given in factored form. MLR matrices extend low…

Machine Learning · Statistics 2025-10-27 Tetiana Parshakova , Trevor Hastie , Eric Darve , Stephen Boyd

Scalable solution to crossed random effects model with random slopes

The crossed random effects model is widely used, finding applications in various fields such as longitudinal studies, e-commerce, and recommender systems, among others. However, these models encounter scalability challenges, as the…

Methodology · Statistics 2025-10-21 Disha Ghandwani , Swarnadip Ghosh , Trevor Hastie , Art B. Owen

Fitting Multilevel Factor Models

We examine a special case of the multilevel factor model, with covariance given by multilevel low rank (MLR) matrix~\cite{parshakova2023factor}. We develop a novel, fast implementation of the expectation-maximization algorithm, tailored for…

Machine Learning · Statistics 2025-08-26 Tetiana Parshakova , Trevor Hastie , Stephen Boyd

Non-smooth optimization meets automated material model discovery

Automated material model discovery disrupts the tedious and time-consuming cycle of iteratively calibrating and modifying manually designed models. Non-smooth L1-norm regularization is the backbone of automated model discovery; however, the…

Computational Engineering, Finance, and Science · Computer Science 2025-07-15 Moritz Flaschel , Trevor Hastie , Ellen Kuhl

Univariate-Guided Sparse Regression

In this paper, we introduce ``UniLasso'' -- a novel statistical method for sparse regression. This two-stage approach preserves the signs of the univariate coefficients and leverages their magnitude. Both of these properties are attractive…

Methodology · Statistics 2025-06-26 Sourav Chatterjee , Trevor Hastie , Robert Tibshirani

Nuclear penalized multinomial regression with an application to predicting at bat outcomes in baseball

We propose the nuclear norm penalty as an alternative to the ridge penalty for regularized multinomial regression. This convex relaxation of reduced-rank multinomial regression has the advantage of leveraging underlying structure among the…

Machine Learning · Statistics 2025-06-09 Scott Powers , Trevor Hastie , Robert Tibshirani

Pre-validation Revisited

Pre-validation is a way to build prediction model with two datasets of significantly different feature dimensions. Previous work showed that the asymptotic distribution of the resulting test statistic for the pre-validated predictor…

Methodology · Statistics 2025-05-23 Jing Shang , Sourav Chatterjee , Trevor Hastie , Robert Tibshirani

A Statistical View of Column Subset Selection

We consider the problem of selecting a small subset of representative variables from a large dataset. In the computer science literature, this dimensionality reduction problem is typically formalized as Column Subset Selection (CSS).…

Methodology · Statistics 2025-05-20 Anav Sood , Trevor Hastie

Ridge Regularization: an Essential Concept in Data Science

Ridge or more formally $\ell_2$ regularization shows up in many areas of statistics and machine learning. It is one of those essential devices that any good data scientist needs to master for their craft. In this brief ridge fest I have…

Methodology · Statistics 2024-11-01 Trevor Hastie

A Fast Coordinate Descent Method for High-Dimensional Non-Negative Least Squares using a Unified Sparse Regression Framework

We develop theoretical results that establish a connection across various regression methods such as the non-negative least squares, bounded variable least squares, simplex constrained least squares, and lasso. In particular, we show in…

Computation · Statistics 2024-10-29 James Yang , Trevor Hastie

The mosaic permutation test: an exact and nonparametric goodness-of-fit test for factor models

Financial firms often rely on fundamental factor models to explain correlations among asset returns and manage risk. Yet after major events, e.g., COVID-19, analysts may reassess whether existing risk models continue to fit well:…

Methodology · Statistics 2024-09-30 Asher Spector , Rina Foygel Barber , Trevor Hastie , Ronald N. Kahn , Emmanuel Candès

Scalable recommender system based on factor analysis

Recommender systems have become crucial in the modern digital landscape, where personalized content, products, and services are essential for enhancing user experience. This paper explores statistical models for recommender systems,…

Methodology · Statistics 2024-08-13 Disha Ghandwani , Trevor Hastie

MMIL: A novel algorithm for disease associated cell type discovery

Single-cell datasets often lack individual cell labels, making it challenging to identify cells associated with disease. To address this, we introduce Mixture Modeling for Multiple Instance Learning (MMIL), an expectation maximization…

Quantitative Methods · Quantitative Biology 2024-06-13 Erin Craig , Timothy Keyes , Jolanda Sarno , Maxim Zaslavsky , Garry Nolan , Kara Davis , Trevor Hastie , Robert Tibshirani

A Fast and Scalable Pathwise-Solver for Group Lasso and Elastic Net Penalized Regression via Block-Coordinate Descent

We develop fast and scalable algorithms based on block-coordinate descent to solve the group lasso and the group elastic net for generalized linear models along a regularization path. Special attention is given when the loss is the usual…

Computation · Statistics 2024-05-15 James Yang , Trevor Hastie

Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank

Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals, underscoring a critical gap in genetic research. Here, we assess whether…

Machine Learning · Computer Science 2024-05-08 Thomas Le Menestrel , Erin Craig , Robert Tibshirani , Trevor Hastie , Manuel Rivas