Related papers: Instance-Level Meta Normalization

U-Net Training with Instance-Layer Normalization

Normalization layers are essential in a Deep Convolutional Neural Network (DCNN). Various normalization methods have been proposed. The statistics used to normalize the feature maps can be computed at batch, channel, or instance level.…

Image and Video Processing · Electrical Eng. & Systems 2019-08-27 Xiao-Yun Zhou , Peichao Li , Zhao-Yang Wang , Guang-Zhong Yang

Meta-learning the Learning Trends Shared Across Tasks

Meta-learning stands for 'learning to learn' such that generalization to new tasks is achieved. Among these methods, Gradient-based meta-learning algorithms are a specific sub-class that excel at quick adaptation to new tasks with limited…

Machine Learning · Computer Science 2020-10-20 Jathushan Rajasegaran , Salman Khan , Munawar Hayat , Fahad Shahbaz Khan , Mubarak Shah

GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training

Normalization is known to help the optimization of deep neural networks. Curiously, different architectures require specialized normalization methods. In this paper, we study what normalization is effective for Graph Neural Networks (GNNs).…

Machine Learning · Computer Science 2021-06-14 Tianle Cai , Shengjie Luo , Keyulu Xu , Di He , Tie-Yan Liu , Liwei Wang

TaskNorm: Rethinking Batch Normalization for Meta-Learning

Modern meta-learning approaches for image classification rely on increasingly deep networks to achieve state-of-the-art performance, making batch normalization an essential component of meta-learning pipelines. However, the hierarchical…

Machine Learning · Statistics 2020-07-14 John Bronskill , Jonathan Gordon , James Requeima , Sebastian Nowozin , Richard E. Turner

Meta Networks

Neural networks have been successfully applied in applications with a large amount of labeled data. However, the task of rapid generalization on new concepts with small training data while preserving performances on previously learned ones…

Machine Learning · Computer Science 2017-06-09 Tsendsuren Munkhdalai , Hong Yu

MetaMix: Improved Meta-Learning with Interpolation-based Consistency Regularization

Model-Agnostic Meta-Learning (MAML) and its variants are popular few-shot classification methods. They train an initializer across a variety of sampled learning tasks (also known as episodes) such that the initialized model can adapt…

Computer Vision and Pattern Recognition · Computer Science 2020-10-13 Yangbin Chen , Yun Ma , Tom Ko , Jianping Wang , Qing Li

Analyzing the Training Dynamics of Image Restoration Transformers: A Revisit to Layer Normalization

This work analyzes the training dynamics of Image Restoration (IR) Transformers and uncovers a critical yet overlooked issue: conventional LayerNorm (LN) drives feature magnitudes to diverge to a million scale and collapses channel-wise…

Computer Vision and Pattern Recognition · Computer Science 2026-02-23 MinKyu Lee , Sangeek Hyun , Woojin Jun , Hyunjun Kim , Jiwoo Chung , Jae-Pil Heo

Gradient Agreement as an Optimization Objective for Meta-Learning

This paper presents a novel optimization method for maximizing generalization over tasks in meta-learning. The goal of meta-learning is to learn a model for an agent adapting rapidly when presented with previously unseen tasks. Tasks are…

Machine Learning · Computer Science 2018-10-19 Amir Erfan Eshratifar , David Eigen , Massoud Pedram

Normalized Convolutional Neural Network

We introduce a Normalized Convolutional Neural Layer, a novel approach to normalization in convolutional networks. Unlike conventional methods, this layer normalizes the rows of the im2col matrix during convolution, making it inherently…

Computer Vision and Pattern Recognition · Computer Science 2025-04-03 Dongsuk Kim , Geonhee Lee , Myungjae Lee , Shin Uk Kang , Dongmin Kim

Large-Scale Meta-Learning with Continual Trajectory Shifting

Meta-learning of shared initialization parameters has shown to be highly effective in solving few-shot learning tasks. However, extending the framework to many-shot scenarios, which may further enhance its practicality, has been relatively…

Machine Learning · Computer Science 2022-02-17 Jaewoong Shin , Hae Beom Lee , Boqing Gong , Sung Ju Hwang

Deep Adaptive Input Normalization for Time Series Forecasting

Deep Learning (DL) models can be used to tackle time series analysis tasks with great success. However, the performance of DL models can degenerate rapidly if the data are not appropriately normalized. This issue is even more apparent when…

Computational Finance · Quantitative Finance 2019-09-24 Nikolaos Passalis , Anastasios Tefas , Juho Kanniainen , Moncef Gabbouj , Alexandros Iosifidis

Meta Batch-Instance Normalization for Generalizable Person Re-Identification

Although supervised person re-identification (Re-ID) methods have shown impressive performance, they suffer from a poor generalization capability on unseen domains. Therefore, generalizable Re-ID has recently attracted growing attention.…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Seokeon Choi , Taekyung Kim , Minki Jeong , Hyoungseob Park , Changick Kim

Retrieval-Augmented Meta Learning for Low-Resource Text Classification

Meta learning have achieved promising performance in low-resource text classification which aims to identify target classes with knowledge transferred from source classes with sets of small tasks named episodes. However, due to the limited…

Computation and Language · Computer Science 2023-09-12 Rongsheng Li , Yangning Li , Yinghui Li , Chaiyut Luoyiching , Hai-Tao Zheng , Nannan Zhou , Hanjing Su

Understanding and Improving Layer Normalization

Layer normalization (LayerNorm) is a technique to normalize the distributions of intermediate layers. It enables smoother gradients, faster training, and better generalization accuracy. However, it is still unclear where the effectiveness…

Machine Learning · Computer Science 2019-11-19 Jingjing Xu , Xu Sun , Zhiyuan Zhang , Guangxiang Zhao , Junyang Lin

On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers

State-of-the-art rehearsal-free continual learning methods exploit the peculiarities of Vision Transformers to learn task-specific prompts, drastically reducing catastrophic forgetting. However, there is a tradeoff between the number of…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Thomas De Min , Massimiliano Mancini , Karteek Alahari , Xavier Alameda-Pineda , Elisa Ricci

In-Context Learning for MIMO Equalization Using Transformer-Based Sequence Models

Large pre-trained sequence models, such as transformer-based architectures, have been recently shown to have the capacity to carry out in-context learning (ICL). In ICL, a decision on a new input is made via a direct mapping of the input…

Information Theory · Computer Science 2024-01-23 Matteo Zecchin , Kai Yu , Osvaldo Simeone

An Optimization-Based Meta-Learning Model for MRI Reconstruction with Diverse Dataset

Purpose: This work aims at developing a generalizable MRI reconstruction model in the meta-learning framework. The standard benchmarks in meta-learning are challenged by learning on diverse task distributions. The proposed network learns…

Computer Vision and Pattern Recognition · Computer Science 2021-10-05 Wanyu Bian , Yunmei Chen , Xiaojing Ye , Qingchao Zhang

Decoder Choice Network for Meta-Learning

Meta-learning has been widely used for implementing few-shot learning and fast model adaptation. One kind of meta-learning methods attempt to learn how to control the gradient descent process in order to make the gradient-based learning…

Machine Learning · Computer Science 2019-11-20 Jialin Liu , Fei Chao , Longzhi Yang , Chih-Min Lin , Qiang Shen

Learning Regularization Parameters of Inverse Problems via Deep Neural Networks

In this work, we describe a new approach that uses deep neural networks (DNN) to obtain regularization parameters for solving inverse problems. We consider a supervised learning approach, where a network is trained to approximate the…

Numerical Analysis · Mathematics 2021-04-15 Babak Maboudi Afkham , Julianne Chung , Matthias Chung

Meta Dropout: Learning to Perturb Features for Generalization

A machine learning model that generalizes well should obtain low errors on unseen test examples. Thus, if we know how to optimally perturb training examples to account for test examples, we may achieve better generalization performance.…

Machine Learning · Computer Science 2022-02-15 Hae Beom Lee , Taewook Nam , Eunho Yang , Sung Ju Hwang