Lin — Scifaro

BlockBatch: Multi-Scale Consensus Decoding for Efficient Diffusion Language Model Inference

Diffusion language models (dLLMs) generate text by iteratively denoising multiple token positions in parallel, offering an attractive alternative to strictly autoregressive decoding. In practice, however, block-wise dLLM inference exposes a…

Machine Learning · Computer Science 2026-05-29 Xiaoyou Wu, Cheng-Jhih Shih, Binfei Ji, Yong Liu, Yingyan Lin

A Survey on Graph Neural Network Acceleration: Algorithms, Systems, and Customized Hardware

Graph neural networks (GNNs) are emerging for machine learning research on graph-structured data. GNNs achieve state-of-the-art performance on many tasks, but they face scalability challenges when it comes to real-world applications that…

Machine Learning · Computer Science 2026-04-02 Shichang Zhang, Atefeh Sohrabizadeh, Cheng Wan, Zijie Huang, Ziniu Hu, Yewen Wang, Yingyan Lin, Jason Cong, Yizhou Sun

From Inference Efficiency to Embodied Efficiency: Revisiting Efficiency Metrics for Vision-Language-Action Models

Vision-Language-Action (VLA) models have recently enabled embodied agents to perform increasingly complex tasks by jointly reasoning over visual, linguistic, and motor modalities. However, we find that the prevailing notion of…

Machine Learning · Computer Science 2026-03-20 Zhuofan Li, Hongkun Yang, Zhenyang Chen, Yangxuan Chen, Yingyan Lin, Chaojian Li

Nemotron-CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Pre-training datasets are typically collected from web content and lack inherent domain divisions. For instance, widely used datasets like Common Crawl do not include explicit domain labels, while manually curating labeled datasets such as…

Computation and Language · Computer Science 2025-12-02 Shizhe Diao, Yu Yang, Yonggan Fu, Xin Dong, Dan Su, Markus Kliegl, Zijia Chen, Peter Belcak, Yoshi Suhara, Hongxu Yin, Mostofa Patwary, Yingyan Lin, Jan Kautz, Pavlo Molchanov

Looking Forward: Challenges and Opportunities in Agentic AI Reliability

This chapter presents perspectives for challenges and future development in building reliable AI systems, particularly, agentic AI systems. Several open research problems related to mitigating the risks of cascading failures are discussed.…

Artificial Intelligence · Computer Science 2025-11-18 Liudong Xing, Janet Lin

From Failure Modes to Reliability Awareness in Generative and Agentic AI System

This chapter bridges technical analysis and organizational preparedness by tracing the path from layered failure modes to reliability awareness in generative and agentic AI systems. We first introduce an 11-layer failure stack, a structured…

Systems and Control · Electrical Eng. & Systems 2025-11-11 Janet Lin, Liangwei Zhang

Fewer Denoising Steps or Cheaper Per-Step Inference: Towards Compute-Optimal Diffusion Model Deployment

Diffusion models have shown remarkable success across generative tasks, yet their high computational demands challenge deployment on resource-limited platforms. This paper investigates a critical question for compute-optimal diffusion model…

Computer Vision and Pattern Recognition · Computer Science 2025-08-11 Zhenbang Du, Yonggan Fu, Lifu Wang, Jiayi Qian, Xiao Luo, Yingyan Lin

A3D-MoE: Acceleration of Large Language Models with Mixture of Experts via 3D Heterogeneous Integration

Conventional large language models (LLMs) are equipped with dozens of GB to TB of model parameters, making inference highly energy-intensive and costly as all the weights need to be loaded to onboard processing elements during computation.…

Hardware Architecture · Computer Science 2025-07-28 Wei-Hsing Huang, Janak Sharda, Cheng-Jhih Shih, Yuyao Kong, Faaiq Waqar, Pin-Jun Chen, Yingyan Lin, Shimeng Yu

3DGauCIM: Accelerating Static/Dynamic 3D Gaussian Splatting via Digital CIM for High Frame Rate Real-Time Edge Rendering

Dynamic 3D Gaussian splatting (3DGS) extends static 3DGS to render dynamic scenes, enabling AR/VR applications with moving objects. However, implementing dynamic 3DGS on edge devices faces challenges: (1) Loading all Gaussian parameters…

Hardware Architecture · Computer Science 2025-07-28 Wei-Hsing Huang, Cheng-Jhih Shih, Jian-Wei Su, Samuel Wade Wang, Vaidehi Garg, Yuyao Kong, Jen-Chun Tien, Nealson Li, Arijit Raychowdhury, Meng-Fan Chang, Yingyan Lin, Shimeng Yu

LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models

Recent advancements in Large Language Models (LLMs) have spurred interest in numerous applications requiring robust long-range capabilities, essential for processing extensive input contexts and continuously generating extended outputs. As…

Machine Learning · Computer Science 2025-07-22 Dachuan Shi, Yonggan Fu, Xiangchi Yuan, Zhongzhi Yu, Haoran You, Sixu Li, Xin Dong, Jan Kautz, Pavlo Molchanov, Yingyan Lin

Improvement Strategies for Few-Shot Learning in OCT Image Classification of Rare Retinal Diseases

This paper focuses on using few-shot learning to improve the accuracy of classifying OCT diagnosis images with major and rare classes. We used the GAN-based augmentation strategy as a baseline and introduced several novel methods to further…

Image and Video Processing · Electrical Eng. & Systems 2025-05-27 Cheng-Yu Tai, Ching-Wen Chen, Chi-Chin Wu, Bo-Chen Chiu, Cheng-Hung Lin, Cheng-Kai Lu, Jia-Kang Wang, Tzu-Lun Huang

Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training

Training diffusion models (DMs) requires substantial computational resources due to multiple forward and backward passes across numerous timesteps, motivating research into efficient training techniques. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2025-04-15 Lexington Whalen, Zhenbang Du, Haoran You, Chaojian Li, Sixu Li, Yingyan Lin

Impact of Stain Variation and Color Normalization for Prognostic Predictions in Pathology

In recent years, deep neural networks (DNNs) have demonstrated remarkable performance in pathology applications, potentially even outperforming expert pathologists due to their ability to learn subtle features from large datasets. One…

Image and Video Processing · Electrical Eng. & Systems 2024-09-16 Siyu Lin, Haowen Zhou, Richard J. Cote, Mark Watson, Ramaswamy Govindan, Changhuei Yang

Length-scale study in deep learning prediction for non-small cell lung cancer brain metastasis

Deep learning assisted digital pathology has the potential to impact clinical practice in significant ways. In recent studies, deep neural network (DNN) enabled analysis outperforms human pathologists. Increasing sizes and complexity of the…

Image and Video Processing · Electrical Eng. & Systems 2024-06-04 Haowen Zhou, Steven Lin, Mark Watson, Cory T. Bernadt, Oumeng Zhang, Ramaswamy Govindan, Richard J. Cote, Changhuei Yang

Improving Cancer Imaging Diagnosis with Bayesian Networks and Deep Learning: A Bayesian Deep Learning Approach

With recent advancements in the development of artificial intelligence applications using theories and algorithms in machine learning, many accurate models can be created to train and predict on given datasets. With the realization of the…

Machine Learning · Computer Science 2024-03-29 Pei Xi Lin

Some Critical Thinking on EV Battery Reliability: from Enhancement to Optimization -- comprehensive perspectives, lifecycle innovation, system cognation, and strategic insights

In the era of sustainable transportation, the significance of electric vehicles (EVs) and their battery technology is becoming increasingly paramount. This study addresses the critical aspect of EV battery reliability, an essential factor…

Systems and Control · Electrical Eng. & Systems 2024-01-11 Jing Lin, Christofer Silfvenius

NetDistiller: Empowering Tiny Deep Learning via In-Situ Distillation

Boosting the task accuracy of tiny neural networks (TNNs) has become a fundamental challenge for enabling the deployments of TNNs on edge devices which are constrained by strict limitations in terms of memory, computation, bandwidth, and…

Machine Learning · Computer Science 2023-11-01 Shunyao Zhang, Yonggan Fu, Shang Wu, Jyotikrishna Dass, Haoran You, Yingyan Lin

A multiplicative ergodic theorem for von Neumann algebra valued cocycles

The classical Multiplicative Ergodic Theorem (MET) of Oseledets is generalized here to cocycles taking values in a semi-finite von Neumann algebra. This allows for a continuous Lyapunov distribution.

Operator Algebras · Mathematics 2021-03-31 Lewis Bowen, Ben Hayes, Yuqing Lin

A Northern Ecliptic Survey for Solar System Science

Making an inventory of the Solar System is one of the four fundamental science requirements for the Large Synoptic Survey Telescope (LSST). The current baseline footprint for LSST's main Wide-Fast-Deep (WFD) Survey observes the sky below…

Earth and Planetary Astrophysics · Physics 2018-12-05 Megan E. Schwamb, Kathryn Volk, Hsing Wen Lin, Michael S. P. Kelley, Michele T. Bannister, Henry H. Hsieh, R. Lynne Jones, Michael Mommert, Colin Snodgrass, Darin Ragozzine, Steven R. Chesley, Scott S. Sheppard, Mario Juric, Marc W. Buie

The Effects of Filter Choice on Outer Solar System Science with LSST

Making an inventory of the Solar System is one of the four pillars that the requirements for the Large Synoptic Survey Telescope (LSST) are built upon. The choice between same-filter nightly pairs or different-filter nightly pairs in the…

Earth and Planetary Astrophysics · Physics 2018-12-04 Kathryn Volk, Megan E. Schwamb, Wes Fraser, Michael S. P. Kelley, Hsing Wen Lin, Darin Ragozzine, R. Lynne Jones, Colin Snodgrass, Michele T. Bannister