Richard G. Baraniuk

Minimizing Collateral Damage in Activation Steering

Activation steering is a method for controlling Large Language Model (LLM) behavior by intervening in its internal representations to increase the alignment with a specific target feature direction. However, standard interventions, such as…

Machine Learning · Computer Science 2026-05-05 Tam Nguyen , Tu Anh Nguyen , Sina Alemohammad , Richard G. Baraniuk

Leakage and Second-Order Dynamics Improve Hippocampal RNN Replay

Biological neural networks (like the hippocampus) can internally generate "replay" resembling stimulus-driven activity. Recent computational models of replay use noisy recurrent neural networks (RNNs) trained to path-integrate. Replay in…

Machine Learning · Computer Science 2026-02-23 Josue Casco-Rodriguez , Nanda H. Krishna , Richard G. Baraniuk

Learning Context Matters: Measuring and Diagnosing Personalization Gaps in LLM-Based Instructional Design

The adoption of generative AI in education has accelerated dramatically in recent years, with Large Language Models (LLMs) increasingly integrated into learning environments in the hope of providing personalized support that enhances…

Computers and Society · Computer Science 2026-02-06 Johaun Hatchett , Debshila Basu Mallick , Brittany C. Bradford , Richard G. Baraniuk

Rates and architectures for learning geometrically non-trivial operators

Deep learning methods have proven capable of recovering operators between high-dimensional spaces, such as solution maps of PDEs and similar objects in mathematical physics, from very few training samples. This phenomenon of data-efficiency…

Machine Learning · Computer Science 2025-12-11 T. Mitchell Roddenberry , Leo Tzou , Ivan Dokmanić , Maarten V. de Hoop , Richard G. Baraniuk

Neon: Negative Extrapolation From Self-Training Improves Image Generation

Scaling generative AI models is bottlenecked by the scarcity of high-quality training data. The ease of synthesizing from a generative model suggests using (unverified) synthetic data to augment a limited corpus of real data for the purpose…

Graphics · Computer Science 2025-10-15 Sina Alemohammad , Zhangyang Wang , Richard G. Baraniuk

W4S4: WaLRUS Meets S4 for Long-Range Sequence Modeling

State Space Models (SSMs) have emerged as powerful components for sequence modeling, enabling efficient handling of long-range dependencies via linear recurrence and convolutional computation. However, their effectiveness depends heavily on…

Machine Learning · Computer Science 2025-06-10 Hossein Babaei , Mel White , Richard G. Baraniuk

WaLRUS: Wavelets for Long-range Representation Using SSMs

State-Space Models (SSMs) have proven to be powerful tools for modeling long-range dependencies in sequential data. While the recent method known as HiPPO has demonstrated strong performance, and formed the basis for machine learning models…

Image and Video Processing · Electrical Eng. & Systems 2025-05-20 Hossein Babaei , Mel White , Sina Alemohammad , Richard G. Baraniuk

SaFARi: State-Space Models for Frame-Agnostic Representation

State-Space Models (SSMs) have re-emerged as a powerful tool for online function approximation, and as the backbone of machine learning models for long-range dependent data. However, to date, only a few polynomial bases have been explored…

Machine Learning · Computer Science 2025-05-15 Hossein Babaei , Mel White , Sina Alemohammad , Richard G. Baraniuk

Improving Routing in Sparse Mixture of Experts with Graph of Tokens

Sparse Mixture of Experts (SMoE) has emerged as a key to achieving unprecedented scalability in deep learning. By activating only a small subset of parameters per sample, SMoE achieves an exponential increase in parameter counts while…

Machine Learning · Computer Science 2025-05-05 Tam Nguyen , Ngoc N. Tran , Khai Nguyen , Richard G. Baraniuk

MazeNet: An Accurate, Fast, and Scalable Deep Learning Solution for Steiner Minimum Trees

The Obstacle Avoiding Rectilinear Steiner Minimum Tree (OARSMT) problem, which seeks the shortest interconnection of a given number of terminals in a rectilinear plane while avoiding obstacles, is a critical task in integrated circuit…

Machine Learning · Computer Science 2025-04-01 Gabriel Díaz Ramos , Toros Arikan , Richard G. Baraniuk

Drawing Early-Bird Tickets: Towards More Efficient Training of Deep Networks

(Frankle & Carbin, 2019) shows that there exist winning tickets (small but critical subnetworks) for dense, randomly initialized networks, that can be trained alone to achieve comparable accuracies to the latter in a similar number of…

Machine Learning · Computer Science 2025-03-04 Haoran You , Chaojian Li , Pengfei Xu , Yonggan Fu , Yue Wang , Xiaohan Chen , Richard G. Baraniuk , Zhangyang Wang , Yingyan Celine Lin

Do LLMs Make Mistakes Like Students? Exploring Natural Alignment between Language Models and Human Error Patterns

Large Language Models (LLMs) have demonstrated remarkable capabilities in various educational tasks, yet their alignment with human learning patterns, particularly in predicting which incorrect options students are most likely to select in…

Computation and Language · Computer Science 2025-02-24 Naiming Liu , Shashank Sonkar , Richard G. Baraniuk

The Imitation Game for Educational AI

As artificial intelligence systems become increasingly prevalent in education, a fundamental challenge emerges: how can we verify if an AI truly understands how students think and reason? Traditional evaluation methods like measuring…

Artificial Intelligence · Computer Science 2025-02-24 Shashank Sonkar , Naiming Liu , Xinghe Chen , Richard G. Baraniuk

Learning Transferable Features for Implicit Neural Representations

Implicit neural representations (INRs) have demonstrated success in a variety of applications, including inverse problems and neural rendering. An INR is typically trained to capture one signal of interest, resulting in learned neural…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Kushal Vyas , Ahmed Imtiaz Humayun , Aniket Dashpute , Richard G. Baraniuk , Ashok Veeraraghavan , Guha Balakrishnan

Estimating the Number and Locations of Boundaries in Reverberant Environments with Deep Learning

Underwater acoustic environment estimation is a challenging but important task for remote sensing scenarios. Current estimation methods require high signal strength and a solution to the fragile echo labeling problem to be effective. In…

Sound · Computer Science 2024-11-06 Toros Arikan , Luca M. Chackalackal , Fatima Ahsan , Konrad Tittel , Andrew C. Singer , Gregory W. Wornell , Richard G. Baraniuk

LLM-based Cognitive Models of Students with Misconceptions

Accurately modeling student cognition is crucial for developing effective AI-driven educational technologies. A key challenge is creating realistic student models that satisfy two essential properties: (1) accurately replicating specific…

Human-Computer Interaction · Computer Science 2024-10-18 Shashank Sonkar , Xinghe Chen , Naiming Liu , Richard G. Baraniuk , Mrinmaya Sachan

Student Data Paradox and Curious Case of Single Student-Tutor Model: Regressive Side Effects of Training LLMs for Personalized Learning

The pursuit of personalized education has led to the integration of Large Language Models (LLMs) in developing intelligent tutoring systems. To better understand and adapt to individual student needs, including their misconceptions, LLMs…

Computation and Language · Computer Science 2024-10-08 Shashank Sonkar , Naiming Liu , Richard G. Baraniuk

Pedagogical Alignment of Large Language Models

Large Language Models (LLMs), when used in educational settings without pedagogical fine-tuning, often provide immediate answers rather than guiding students through the problem-solving process. This approach falls short of pedagogically…

Computation and Language · Computer Science 2024-10-08 Shashank Sonkar , Kangqi Ni , Sapana Chaudhary , Richard G. Baraniuk

Improving Fairness and Mitigating MADness in Generative Models

Generative models unfairly penalize data belonging to minority classes, suffer from model autophagy disorder (MADness), and learn biased estimates of the underlying distribution parameters. Our theoretical and empirical results show that…

Machine Learning · Computer Science 2024-10-07 Paul Mayer , Lorenzo Luzi , Ali Siahkoohi , Don H. Johnson , Richard G. Baraniuk

A Primal-Dual Framework for Transformers and Neural Networks

Self-attention is key to the remarkable success of transformers in sequence modeling tasks including many applications in natural language processing and computer vision. Like neural network layers, these attention mechanisms are often…

Machine Learning · Computer Science 2024-06-21 Tan M. Nguyen , Tam Nguyen , Nhat Ho , Andrea L. Bertozzi , Richard G. Baraniuk , Stanley J. Osher