Related papers: DynamicEmbedding: Extending TensorFlow for Colossa…

Nimble: Efficiently Compiling Dynamic Neural Networks for Model Inference

Modern deep neural networks increasingly make use of features such as dynamic control flow, data structures and dynamic tensor shapes. Existing deep learning systems focus on optimizing and executing static neural networks which assume a…

Programming Languages · Computer Science 2021-03-15 Haichen Shen, Jared Roesch, Zhi Chen, Wei Chen, Yong Wu, Mu Li, Vin Sharma, Zachary Tatlock, Yida Wang

Bio-Inspired, Task-Free Continual Learning through Activity Regularization

The ability to sequentially learn multiple tasks without forgetting is a key skill of biological brains, whereas it represents a major challenge to the field of deep learning. To avoid catastrophic forgetting, various continual learning…

Neural and Evolutionary Computing · Computer Science 2022-12-09 Francesco Lässig, Pau Vilimelis Aceituno, Martino Sorbaro, Benjamin F. Grewe

Dynamic Sparse Learning: A Novel Paradigm for Efficient Recommendation

In the realm of deep learning-based recommendation systems, the increasing computational demands, driven by the growing number of users and items, pose a significant challenge to practical deployment. This challenge is primarily twofold:…

Information Retrieval · Computer Science 2024-02-06 Shuyao Wang, Yongduo Sui, Jiancan Wu, Zhi Zheng, Hui Xiong

Dynamic Control Flow in Large-Scale Machine Learning

Many recent machine learning models rely on fine-grained dynamic control flow for training and inference. In particular, models based on recurrent neural networks and on reinforcement learning depend on recurrence relations, data-dependent…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-05-09 Yuan Yu, Martín Abadi, Paul Barham, Eugene Brevdo, Mike Burrows, Andy Davis, Jeff Dean, Sanjay Ghemawat, Tim Harley, Peter Hawkins, Michael Isard, Manjunath Kudlur, Rajat Monga, Derek Murray, Xiaoqiang Zheng

Dynamically Modular and Sparse General Continual Learning

Real-world applications often require learning continuously from a stream of data under ever-changing conditions. When trying to learn from such non-stationary data, deep neural networks (DNNs) undergo catastrophic forgetting of previously…

Computer Vision and Pattern Recognition · Computer Science 2023-01-03 Arnav Varma, Elahe Arani, Bahram Zonooz

Dynamic Sparse Training for Deep Reinforcement Learning

Deep reinforcement learning (DRL) agents are trained through trial-and-error interactions with the environment. This leads to a long training time for dense neural networks to achieve good performance. Hence, prohibitive computation and…

Machine Learning · Computer Science 2022-05-09 Ghada Sokar, Elena Mocanu, Decebal Constantin Mocanu, Mykola Pechenizkiy, Peter Stone

Dynamic Sparse Graph for Efficient Deep Learning

We propose to execute deep neural networks (DNNs) with dynamic and sparse graph (DSG) structure for compressive memory and accelerative execution during both training and inference. The great success of DNNs motivates the pursuing of…

Machine Learning · Computer Science 2019-05-08 Liu Liu, Lei Deng, Xing Hu, Maohua Zhu, Guoqi Li, Yufei Ding, Yuan Xie

Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Tensorflow Pretrained Models

The application of TensorFlow pre-trained models in deep learning is explored, with an emphasis on practical guidance for tasks such as image classification and object detection. The study covers modern architectures, including ResNet,…

Machine Learning · Computer Science 2025-11-20 Keyu Chen, Ziqian Bi, Qian Niu, Junyu Liu, Benji Peng, Sen Zhang, Ming Liu, Xinyuan Song, Zekun Jiang, Tianyang Wang, Ming Li, Xuanhe Pan, Jiawei Xu, Jinlang Wang, Pohsun Feng

Compressed Learning of Deep Neural Networks for OpenCL-Capable Embedded Systems

Deep neural networks (DNNs) have been quite successful in solving many complex learning problems. However, DNNs tend to have a large number of learning parameters, leading to a large memory and computation requirement. In this paper, we…

Machine Learning · Computer Science 2019-05-21 Sangkyun Lee, Jeonghyun Lee

DynamicRetriever: A Pre-training Model-based IR System with Neither Sparse nor Dense Index

Web search provides a promising way for people to obtain information and has been extensively studied. With the surgence of deep learning and large-scale pre-training techniques, various neural information retrieval models are proposed and…

Information Retrieval · Computer Science 2022-03-02 Yujia Zhou, Jing Yao, Zhicheng Dou, Ledell Wu, Ji-Rong Wen

DeepCache: Accelerating Diffusion Models for Free

Diffusion models have recently gained unprecedented attention in the field of image synthesis due to their remarkable generative capabilities. Notwithstanding their prowess, these models often incur substantial computational costs,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Xinyin Ma, Gongfan Fang, Xinchao Wang

NeuroTrails: Training with Dynamic Sparse Heads as the Key to Effective Ensembling

Model ensembles have long been a cornerstone for improving generalization and robustness in deep learning. However, their effectiveness often comes at the cost of substantial computational overhead. To address this issue, state-of-the-art…

Machine Learning · Computer Science 2025-05-26 Bram Grooten, Farid Hasanov, Chenxiang Zhang, Qiao Xiao, Boqian Wu, Zahra Atashgahi, Ghada Sokar, Shiwei Liu, Lu Yin, Elena Mocanu, Mykola Pechenizkiy, Decebal Constantin Mocanu

Dynamic Model Pruning with Feedback

Deep neural networks often have millions of parameters. This can hinder their deployment to low-end devices, not only due to high memory requirements but also because of increased latency at inference. We propose a novel model compression…

Machine Learning · Computer Science 2020-06-15 Tao Lin, Sebastian U. Stich, Luis Barba, Daniil Dmitriev, Martin Jaggi

deepregression: a Flexible Neural Network Framework for Semi-Structured Deep Distributional Regression

In this paper we describe the implementation of semi-structured deep distributional regression, a flexible framework to learn conditional distributions based on the combination of additive regression models and deep networks. Our…

Machine Learning · Statistics 2022-03-11 David Rügamer, Chris Kolb, Cornelius Fritz, Florian Pfisterer, Philipp Kopper, Bernd Bischl, Ruolin Shen, Christina Bukas, Lisa Barros de Andrade e Sousa, Dominik Thalmeier, Philipp Baumann, Lucas Kook, Nadja Klein, Christian L. Müller

DeepCell: Self-Supervised Multiview Fusion for Circuit Representation Learning

We introduce DeepCell, a novel circuit representation learning framework that effectively integrates multiview information from both And-Inverter Graphs (AIGs) and Post-Mapping (PM) netlists. At its core, DeepCell employs a self-supervised…

Machine Learning · Computer Science 2025-07-09 Zhengyuan Shi, Chengyu Ma, Ziyang Zheng, Lingfeng Zhou, Hongyang Pan, Wentao Jiang, Fan Yang, Xiaoyan Yang, Zhufei Chu, Qiang Xu

The Thinking Pixel: Recursive Sparse Reasoning in Multimodal Diffusion Latents

Diffusion models have achieved success in high-fidelity data synthesis, yet their capacity for more complex, structured reasoning like text following tasks remains constrained. While advances in language models have leveraged strategies…

Computer Vision and Pattern Recognition · Computer Science 2026-04-29 Yuwei Sun, Yuxuan Yao, Hui Li, Siyu Zhu

Improving the Expressiveness of Deep Learning Frameworks with Recursion

Recursive neural networks have widely been used by researchers to handle applications with recursively or hierarchically structured data. However, embedded control flow deep learning frameworks such as TensorFlow, Theano, Caffe2, and MXNet…

Machine Learning · Computer Science 2018-09-05 Eunji Jeong, Joo Seong Jeong, Soojeong Kim, Gyeong-In Yu, Byung-Gon Chun

Disentangling Disentangled Representations: Towards Improved Latent Units via Diffusion Models

Disentangled representation learning (DRL) aims to break down observed data into core intrinsic factors for a profound understanding of the data. In real-world scenarios, manually defining and labeling these factors are non-trivial, making…

Machine Learning · Computer Science 2024-11-01 Youngjun Jun, Jiwoo Park, Kyobin Choo, Tae Eun Choi, Seong Jae Hwang

Learning Continually from Low-shot Data Stream

While deep learning has achieved remarkable results on various applications, it is usually data hungry and struggles to learn over non-stationary data stream. To solve these two limits, the deep learning model should not only be able to…

Machine Learning · Computer Science 2019-09-05 Canyu Le, Xihan Wei, Biao Wang, Lei Zhang, Zhonggui Chen

Semi-Supervised Semantic Segmentation of Cell Nuclei via Diffusion-based Large-Scale Pre-Training and Collaborative Learning

Automated semantic segmentation of cell nuclei in microscopic images is crucial for disease diagnosis and tissue microenvironment analysis. Nonetheless, this task presents challenges due to the complexity and heterogeneity of cells. While…

Image and Video Processing · Electrical Eng. & Systems 2023-08-10 Zhuchen Shao, Sourya Sengupta, Hua Li, Mark A. Anastasio