Related papers: Conditional Information Gain Trellis

Conditional Information Gain Networks

Deep neural network models owe their representational power to the high number of learnable parameters. It is often infeasible to run these largely parametrized deep models in limited resource environments, like mobile phones. Network…

Computer Vision and Pattern Recognition · Computer Science 2018-07-27 Ufuk Can Biçici , Cem Keskin , Lale Akarun

CIG: Exploration via Conditional Information Gain

Intrinsic rewards for exploration in reinforcement learning condition on different contexts: lifelong rewards score each transition against accumulated experience but ignore within-rollout redundancy; episodic rewards penalize…

Machine Learning · Computer Science 2026-05-21 Tim Joseph , Marcus Fechner , Philipp Stegmaier , Karam Daaboul , J. Marius Zöllner

Conditional Deep Learning for Energy-Efficient and Enhanced Pattern Recognition

Deep learning neural networks have emerged as one of the most powerful classification tools for vision related applications. However, the computational and energy requirements associated with such deep nets can be quite high, and hence…

Computer Vision and Pattern Recognition · Computer Science 2016-01-29 Priyadarshini Panda , Abhronil Sengupta , Kaushik Roy

Conditional computation in neural networks: principles and research trends

This article summarizes principles and ideas from the emerging area of applying \textit{conditional computation} methods to the design of neural networks. In particular, we focus on neural networks that can dynamically activate or…

Machine Learning · Computer Science 2024-07-09 Simone Scardapane , Alessandro Baiocchi , Alessio Devoto , Valerio Marsocci , Pasquale Minervini , Jary Pomponi

Controlling Computation versus Quality for Neural Sequence Models

Most neural networks utilize the same amount of compute for every example independent of the inherent complexity of the input. Further, methods that adapt the amount of computation to the example focus on finding a fixed inference-time…

Machine Learning · Computer Science 2020-04-17 Ankur Bapna , Naveen Arivazhagan , Orhan Firat

Confidence-gated training for efficient early-exit neural networks

Early-exit neural networks reduce inference cost by enabling confident predictions at intermediate layers. However, joint training often leads to gradient interference, with deeper classifiers dominating optimization. We propose…

Machine Learning · Computer Science 2026-01-12 Saad Mokssit , Ouassim Karrakchou , Alejandro Mousist , Mounir Ghogho

Conditional Channel Gated Networks for Task-Aware Continual Learning

Convolutional Neural Networks experience catastrophic forgetting when optimized on a sequence of learning problems: as they meet the objective of the current training examples, their performance on previous tasks drops drastically. In this…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Davide Abati , Jakub Tomczak , Tijmen Blankevoort , Simone Calderara , Rita Cucchiara , Babak Ehteshami Bejnordi

Distilling Critical Paths in Convolutional Neural Networks

Neural network compression and acceleration are widely demanded currently due to the resource constraints on most deployment targets. In this paper, through analyzing the filter activation, gradients, and visualizing the filters'…

Computer Vision and Pattern Recognition · Computer Science 2018-11-09 Fuxun Yu , Zhuwei Qin , Xiang Chen

Constraint Guided Gradient Descent: Guided Training with Inequality Constraints

Deep learning is typically performed by learning a neural network solely from data in the form of input-output pairs ignoring available domain knowledge. In this work, the Constraint Guided Gradient Descent (CGGD) framework is proposed that…

Artificial Intelligence · Computer Science 2022-06-15 Quinten Van Baelen , Peter Karsmakers

Conditional Generative Neural System for Probabilistic Trajectory Prediction

Effective understanding of the environment and accurate trajectory prediction of surrounding dynamic obstacles are critical for intelligent systems such as autonomous vehicles and wheeled mobile robotics navigating in complex scenarios to…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Jiachen Li , Hengbo Ma , Masayoshi Tomizuka

Conditional Computation in Neural Networks for faster models

Deep learning has become the state-of-art tool in many applications, but the evaluation and training of deep models can be time-consuming and computationally expensive. The conditional computation approach has been proposed to tackle this…

Machine Learning · Computer Science 2016-01-11 Emmanuel Bengio , Pierre-Luc Bacon , Joelle Pineau , Doina Precup

Convolutional Conditional Neural Processes

Neural processes are a family of models which use neural networks to directly parametrise a map from data sets to predictions. Directly parametrising this map enables the use of expressive neural networks in small-data problems where neural…

Machine Learning · Statistics 2024-08-20 Wessel P. Bruinsma

Batch-Shaping for Learning Conditional Channel Gated Networks

We present a method that trains large capacity neural networks with significantly improved accuracy and lower dynamic computational cost. We achieve this by gating the deep-learning architecture on a fine-grained-level. Individual…

Machine Learning · Computer Science 2020-04-06 Babak Ehteshami Bejnordi , Tijmen Blankevoort , Max Welling

The Tree Ensemble Layer: Differentiability meets Conditional Computation

Neural networks and tree ensembles are state-of-the-art learners, each with its unique statistical and computational advantages. We aim to combine these advantages by introducing a new layer for neural networks, composed of an ensemble of…

Machine Learning · Computer Science 2020-07-14 Hussein Hazimeh , Natalia Ponomareva , Petros Mol , Zhenyu Tan , Rahul Mazumder

cGANs with Conditional Convolution Layer

Conditional generative adversarial networks (cGANs) have been widely researched to generate class conditional images using a single generator. However, in the conventional cGANs techniques, it is still challenging for the generator to learn…

Computer Vision and Pattern Recognition · Computer Science 2020-04-09 Min-Cheol Sagong , Yong-Goo Shin , Yoon-Jae Yeo , Seung Park , Sung-Jea Ko

Conditional Generative Models for Learning Stochastic Processes

A framework to learn a multi-modal distribution is proposed, denoted as the Conditional Quantum Generative Adversarial Network (C-qGAN). The neural network structure is strictly within a quantum circuit and, as a consequence, is shown to…

Quantum Physics · Physics 2023-10-20 Salvatore Certo , Anh Pham , Nicolas Robles , Andrew Vlasic

Latent Constraints: Learning to Generate Conditionally from Unconditional Generative Models

Deep generative neural networks have proven effective at both conditional and unconditional modeling of complex data distributions. Conditional generation enables interactive control, but creating new controls often requires expensive…

Machine Learning · Computer Science 2017-12-25 Jesse Engel , Matthew Hoffman , Adam Roberts

Conditional Neural Processes

Deep neural networks excel at function approximation, yet they are typically trained from scratch for each new function. On the other hand, Bayesian methods, such as Gaussian Processes (GPs), exploit prior knowledge to quickly infer the…

Machine Learning · Computer Science 2018-07-05 Marta Garnelo , Dan Rosenbaum , Chris J. Maddison , Tiago Ramalho , David Saxton , Murray Shanahan , Yee Whye Teh , Danilo J. Rezende , S. M. Ali Eslami

Gradient Routing: Masking Gradients to Localize Computation in Neural Networks

Neural networks are trained primarily based on their inputs and outputs, without regard for their internal mechanisms. These neglected mechanisms determine properties that are critical for safety, like (i) transparency; (ii) the absence of…

Machine Learning · Computer Science 2024-12-02 Alex Cloud , Jacob Goldman-Wetzler , Evžen Wybitul , Joseph Miller , Alexander Matt Turner

Efficient distributional regression trees learning algorithms for calibrated non-parametric probabilistic forecasts

The perspective of developing trustworthy AI for critical applications in science and engineering requires machine learning techniques that are capable of estimating their own uncertainty. In the context of regression, instead of estimating…

Machine Learning · Computer Science 2026-05-14 Quentin Duchemin , Guillaume Obozinski