Vahid Tarokh — Scifaro

Decoding Rewards in Competitive Games: Inverse Game Theory with Entropy Regularization

Estimating the unknown reward functions driving agents' behaviors is of central interest in inverse reinforcement learning and game theory. To tackle this problem, we develop a unified framework for reward function recovery in two-player…

Machine Learning · Computer Science 2026-05-20 Junyi Liao, Zihan Zhu, Ethan Fang, Zhuoran Yang, Vahid Tarokh

Rethinking Token Prediction: Tree-Structured Diffusion Language Model

Discrete diffusion language models have emerged as a competitive alternative to auto-regressive language models, but training them efficiently under limited parameter and memory budgets remains challenging. Modern architectures are…

Computation and Language · Computer Science 2026-04-07 Zihao Wu, Haoming Yang, Juncheng Dong, Vahid Tarokh

Boosting In-Context Learning in LLMs Through the Lens of Classical Supervised Learning

In-Context Learning (ICL) allows Large Language Models (LLMs) to adapt to new tasks with just a few examples, but their predictions often suffer from systematic biases, leading to unstable performance in classification. While calibration…

Machine Learning · Statistics 2026-03-05 Korel Gundem, Juncheng Dong, Dennis Zhang, Vahid Tarokh, Zhengling Qi

Learning in Context, Guided by Choice: A Reward-Free Paradigm for Reinforcement Learning with Transformers

In-context reinforcement learning (ICRL) leverages the in-context learning capabilities of transformer models (TMs) to efficiently generalize to unseen sequential decision-making tasks without parameter updates. However, existing ICRL…

Machine Learning · Computer Science 2026-02-10 Juncheng Dong, Bowen He, Moyang Guo, Ethan X. Fang, Zhuoran Yang, Vahid Tarokh

Score-based Metropolis-Hastings for Fractional Langevin Algorithms

Sampling from heavy-tailed and multimodal distributions is challenging when neither the target density nor the proposal density can be evaluated, as in $\alpha$-stable Lévy-driven fractional Langevin algorithms. While the target…

Machine Learning · Statistics 2026-02-03 Ahmed Aloui, Junyi Liao, Ali Hasan, Jose Blanchet, Vahid Tarokh

In-Context Reinforcement Learning From Suboptimal Historical Data

Transformer models have achieved remarkable empirical successes, largely due to their in-context learning capabilities. Inspired by this, we explore training an autoregressive transformer for in-context reinforcement learning (ICRL). In…

Machine Learning · Computer Science 2026-01-29 Juncheng Dong, Moyang Guo, Ethan X. Fang, Zhuoran Yang, Vahid Tarokh

CARE: Turning LLMs Into Causal Reasoning Expert

Large language models (LLMs) have recently demonstrated impressive capabilities across a range of reasoning and generation tasks. However, research studies have shown that LLMs lack the ability to identify causal relationships, a…

Machine Learning · Computer Science 2025-11-21 Juncheng Dong, Yiling Liu, Ahmed Aloui, Vahid Tarokh, David Carlson

Neuro-Logic Lifelong Learning

Solving Inductive Logic Programming (ILP) problems with neural networks is a key challenge in Neural-Symbolic Artificial Intelligence (AI). While most research has focused on designing novel network architectures for individual prob-…

Artificial Intelligence · Computer Science 2025-11-18 Bowen He, Xiaoan Xu, Alper Kamil Bozkurt, Vahid Tarokh, Juncheng Dong

Score-Based Quickest Change Detection and Fault Identification for Multi-Stream Signals

This paper introduces an approach to multi-stream quickest change detection and fault isolation for unnormalized and score-based statistical models. Traditional optimal algorithms in the quickest change detection literature require explicit…

Signal Processing · Electrical Eng. & Systems 2025-11-07 Wuxia Chen, Sean Moushegian, Vahid Tarokh, Taposh Banerjee

Conditional Score Learning for Quickest Change Detection in Markov Transition Kernels

We address the problem of quickest change detection in Markov processes with unknown transition kernels. The key idea is to learn the conditional score $\nabla_{\mathbf{y}} \log p(\mathbf{y}|\mathbf{x})$ directly from sample pairs $(…

Machine Learning · Computer Science 2025-11-07 Wuxia Chen, Taposh Banerjee, Vahid Tarokh

RASPNet: A Benchmark Dataset for Radar Adaptive Signal Processing Applications

We present a large-scale dataset called RASPNet for radar adaptive signal processing (RASP) applications to support the development of data-driven models within the adaptive radar community. RASPNet exceeds 16 TB in size and comprises 100…

Machine Learning · Computer Science 2025-11-05 Shyam Venkatasubramanian, Bosung Kang, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

A PDE-Informed Latent Diffusion Model for 2-m Temperature Downscaling

This work presents a physics-conditioned latent diffusion model tailored for dynamical downscaling of atmospheric data, with a focus on reconstructing high-resolution 2-m temperature fields. Building upon a pre-existing diffusion…

Machine Learning · Computer Science 2025-10-29 Paul Rosu, Muchang Bahng, Erick Jiang, Rico Zhu, Vahid Tarokh

Learn2Mix: Training Neural Networks Using Adaptive Data Integration

Accelerating model convergence in resource-constrained environments is essential for fast and efficient neural network training. This work presents learn2mix, a new training strategy that adaptively adjusts class proportions within batches,…

Machine Learning · Computer Science 2025-10-24 Shyam Venkatasubramanian, Vahid Tarokh

STARK: Strategic Team of Agents for Refining Kernels

The efficiency of GPU kernels is central to the progress of modern AI, yet optimizing them remains a difficult and labor-intensive task due to complex interactions between memory hierarchies, thread scheduling, and hardware-specific…

Artificial Intelligence · Computer Science 2025-10-21 Juncheng Dong, Yang Yang, Tao Liu, Yang Wang, Feng Qi, Vahid Tarokh, Kaushik Rangadurai, Shuang Yang

Reinforcement Learning-Based Optimization of CT Acquisition and Reconstruction Parameters Through Virtual Imaging Trials

Protocol optimization is critical in Computed Tomography (CT) to achieve high diagnostic image quality while minimizing radiation dose. However, due to the complex interdependencies among CT acquisition and reconstruction parameters,…

Machine Learning · Computer Science 2025-10-13 David Fenwick, Navid NaderiAlizadeh, Vahid Tarokh, Nicholas Felice, Darin Clark, Jayasai Rajagopal, Anuj Kapadia, Benjamin Wildman-Tobriner, Ehsan Samei, Ehsan Abadi

PASTA: A Unified Framework for Offline Assortment Learning

We study a broad class of assortment optimization problems in an offline and data-driven setting. In such problems, a firm lacks prior knowledge of the underlying choice model, and aims to determine an optimal assortment based on historical…

Machine Learning · Computer Science 2025-10-03 Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan X. Fang, Vahid Tarokh

Dual-Function Radar-Communication Beamforming with Outage Probability Metric

The integrated design of communication and sensing may offer a potential solution to address spectrum congestion. In this work, we develop a beamforming method for a dual-function radar-communication system, where the transmit signal is…

Signal Processing · Electrical Eng. & Systems 2025-08-07 Hossein Maleki, Carles Diaz-Vilor, Ali Pezeshki, Vahid Tarokh, Hamid Jafarkhani

Robust Score-Based Quickest Change Detection

Methods in the field of quickest change detection rapidly detect in real-time a change in the data-generating distribution of an online data stream. Existing methods have been able to detect this change point when the densities of the pre-…

Methodology · Statistics 2025-07-09 Sean Moushegian, Suya Wu, Enmao Diao, Jie Ding, Taposh Banerjee, Vahid Tarokh

Diffusion-Based Hypothesis Testing and Change-Point Detection

Score-based methods have recently seen increasing popularity in modeling and generation. Methods have been constructed to perform hypothesis testing and change-point detection with score functions, but these methods are in general not as…

Machine Learning · Statistics 2025-06-23 Sean Moushegian, Taposh Banerjee, Vahid Tarokh

Conditional Average Treatment Effect Estimation Under Hidden Confounders

One of the major challenges in estimating conditional potential outcomes and conditional average treatment effects (CATE) is the presence of hidden confounders. Since testing for hidden confounders cannot be accomplished only with…

Machine Learning · Computer Science 2025-06-17 Ahmed Aloui, Juncheng Dong, Ali Hasan, Vahid Tarokh