Related papers: MAXIM: Multi-Axis MLP for Image Processing

Learning Pixel-adaptive Multi-layer Perceptrons for Real-time Image Enhancement

Deep learning-based bilateral grid processing has emerged as a promising solution for image enhancement, inherently encoding spatial and intensity information while enabling efficient full-resolution processing through slicing operations.…

Computer Vision and Pattern Recognition · Computer Science 2025-07-17 Junyu Lou , Xiaorui Zhao , Kexuan Shi , Shuhang Gu

MLP-Mixer: An all-MLP Architecture for Vision

Convolutional Neural Networks (CNNs) are the go-to model for computer vision. Recently, attention-based networks, such as the Vision Transformer, have also become popular. In this paper we show that while convolutions and attention are both…

Computer Vision and Pattern Recognition · Computer Science 2021-06-14 Ilya Tolstikhin , Neil Houlsby , Alexander Kolesnikov , Lucas Beyer , Xiaohua Zhai , Thomas Unterthiner , Jessica Yung , Andreas Steiner , Daniel Keysers , Jakob Uszkoreit , Mario Lucic , Alexey Dosovitskiy

Image-to-Image MLP-mixer for Image Reconstruction

Neural networks are highly effective tools for image reconstruction problems such as denoising and compressive sensing. To date, neural networks for image reconstruction are almost exclusively convolutional. The most popular architecture is…

Computer Vision and Pattern Recognition · Computer Science 2022-02-07 Youssef Mansour , Kang Lin , Reinhard Heckel

AS-MLP: An Axial Shifted MLP Architecture for Vision

An Axial Shifted MLP architecture (AS-MLP) is proposed in this paper. Different from MLP-Mixer, where the global spatial feature is encoded for information flow through matrix transposition and one token-mixing MLP, we pay more attention to…

Computer Vision and Pattern Recognition · Computer Science 2022-03-18 Dongze Lian , Zehao Yu , Xing Sun , Shenghua Gao

MM-UNet: A Mixed MLP Architecture for Improved Ophthalmic Image Segmentation

Ophthalmic image segmentation serves as a critical foundation for ocular disease diagnosis. Although fully convolutional neural networks (CNNs) are commonly employed for segmentation, they are constrained by inductive biases and face…

Computer Vision and Pattern Recognition · Computer Science 2024-08-19 Zunjie Xiao , Xiaoqing Zhang , Risa Higashita , Jiang Liu

X-MLP: A Patch Embedding-Free MLP Architecture for Vision

Convolutional neural networks (CNNs) and vision transformers (ViT) have obtained great achievements in computer vision. Recently, the research of multi-layer perceptron (MLP) architectures for vision have been popular again. Vision MLPs are…

Computer Vision and Pattern Recognition · Computer Science 2023-07-04 Xinyue Wang , Zhicheng Cai , Chenglei Peng

On the Adaptability of Neural Network Image Super-Resolution

In this paper, we described and developed a framework for Multilayer Perceptron (MLP) to work on low level image processing, where MLP will be used to perform image super-resolution. Meanwhile, MLP are trained with different types of images…

Computer Vision and Pattern Recognition · Computer Science 2012-12-24 Kah Keong Chua , Yong Haur Tay

Toward Next-generation Medical Vision Backbones: Modeling Finer-grained Long-range Visual Dependency

Medical Image Computing (MIC) is a broad research topic covering both pixel-wise (e.g., segmentation, registration) and image-wise (e.g., classification, regression) vision tasks. Effective analysis demands models that capture both global…

Computer Vision and Pattern Recognition · Computer Science 2025-09-16 Mingyuan Meng

MixerGAN: An MLP-Based Architecture for Unpaired Image-to-Image Translation

While attention-based transformer networks achieve unparalleled success in nearly all language tasks, the large number of tokens (pixels) found in images coupled with the quadratic activation memory usage makes them prohibitive for problems…

Computer Vision and Pattern Recognition · Computer Science 2021-08-20 George Cazenavette , Manuel Ladron De Guevara

An Experimental Exploration of In-Memory Computing for Multi-Layer Perceptrons

In modern computer architectures, the performance of many memory-bound workloads (e.g., machine learning, graph processing, databases) is limited by the data movement bottleneck that emerges when transferring large amounts of data between…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-08-12 Pedro Carrinho , Hamid Moghadaspour , Oscar Ferraz , João Dinis Ferreira , Yann Falevoz , Vitor Silva , Gabriel Falcao

CS-Mixer: A Cross-Scale Vision MLP Model with Spatial-Channel Mixing

Despite their simpler information fusion designs compared with Vision Transformers and Convolutional Neural Networks, Vision MLP architectures have demonstrated strong performance and high data efficiency in recent research. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-28 Jonathan Cui , David A. Araujo , Suman Saha , Md. Faisal Kabir

MC-MLP:Multiple Coordinate Frames in all-MLP Architecture for Vision

In deep learning, Multi-Layer Perceptrons (MLPs) have once again garnered attention from researchers. This paper introduces MC-MLP, a general MLP-like backbone for computer vision that is composed of a series of fully-connected (FC) layers.…

Computer Vision and Pattern Recognition · Computer Science 2023-04-11 Zhimin Zhu , Jianguo Zhao , Tong Mu , Yuliang Yang , Mengyu Zhu

R2-MLP: Round-Roll MLP for Multi-View 3D Object Recognition

Recently, vision architectures based exclusively on multi-layer perceptrons (MLPs) have gained much attention in the computer vision community. MLP-like models achieve competitive performance on a single 2D image classification with less…

Computer Vision and Pattern Recognition · Computer Science 2022-11-22 Shuo Chen , Tan Yu , Ping Li

UniNet: Unified Architecture Search with Convolution, Transformer, and MLP

Recently, transformer and multi-layer perceptron (MLP) architectures have achieved impressive results on various vision tasks. A few works investigated manually combining those operators to design visual network architectures, and can…

Computer Vision and Pattern Recognition · Computer Science 2021-10-11 Jihao Liu , Hongsheng Li , Guanglu Song , Xin Huang , Yu Liu

MaxViT: Multi-Axis Vision Transformer

Transformers have recently gained significant attention in the computer vision community. However, the lack of scalability of self-attention mechanisms with respect to image size has limited their wide adoption in state-of-the-art vision…

Computer Vision and Pattern Recognition · Computer Science 2022-09-12 Zhengzhong Tu , Hossein Talebi , Han Zhang , Feng Yang , Peyman Milanfar , Alan Bovik , Yinxiao Li

Pay Attention to MLPs

Transformers have become one of the most important architectural innovations in deep learning and have enabled many breakthroughs over the past few years. Here we propose a simple network architecture, gMLP, based on MLPs with gating, and…

Machine Learning · Computer Science 2021-06-03 Hanxiao Liu , Zihang Dai , David R. So , Quoc V. Le

Mixing and Shifting: Exploiting Global and Local Dependencies in Vision MLPs

Token-mixing multi-layer perceptron (MLP) models have shown competitive performance in computer vision tasks with a simple architecture and relatively small computational cost. Their success in maintaining computation efficiency is mainly…

Computer Vision and Pattern Recognition · Computer Science 2022-02-15 Huangjie Zheng , Pengcheng He , Weizhu Chen , Mingyuan Zhou

Scaling MLPs: A Tale of Inductive Bias

In this work we revisit the most fundamental building block in deep learning, the multi-layer perceptron (MLP), and study the limits of its performance on vision tasks. Empirical insights into MLPs are important for multiple reasons. (1)…

Machine Learning · Computer Science 2023-10-04 Gregor Bachmann , Sotiris Anagnostidis , Thomas Hofmann

ResMLP: Feedforward networks for image classification with data-efficient training

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically…

Computer Vision and Pattern Recognition · Computer Science 2021-06-11 Hugo Touvron , Piotr Bojanowski , Mathilde Caron , Matthieu Cord , Alaaeldin El-Nouby , Edouard Grave , Gautier Izacard , Armand Joulin , Gabriel Synnaeve , Jakob Verbeek , Hervé Jégou

S$^2$-MLPv2: Improved Spatial-Shift MLP Architecture for Vision

Recently, MLP-based vision backbones emerge. MLP-based vision architectures with less inductive bias achieve competitive performance in image recognition compared with CNNs and vision Transformers. Among them, spatial-shift MLP (S$^2$-MLP),…

Computer Vision and Pattern Recognition · Computer Science 2021-08-03 Tan Yu , Xu Li , Yunfeng Cai , Mingming Sun , Ping Li