Related papers: Deformably-Scaled Transposed Convolution

Attention-based Image Upsampling

Convolutional layers are an integral part of many deep neural network solutions in computer vision. Recent work shows that replacing the standard convolution operation with mechanisms based on self-attention leads to improved performance on…

Computer Vision and Pattern Recognition · Computer Science 2020-12-21 Souvik Kundu , Hesham Mostafa , Sharath Nittur Sridhar , Sairam Sundaresan

DTC: A Deformable Transposed Convolution Module for Medical Image Segmentation

In medical image segmentation, particularly in UNet-like architectures, upsampling is primarily used to transform smaller feature maps into larger ones, enabling feature fusion between encoder and decoder features and supporting multi-scale…

Computer Vision and Pattern Recognition · Computer Science 2026-01-27 Chengkun Sun , Jinqian Pan , Renjie Liang , Zhengkang Fan , Xin Miao , Jiang Bian , Jie Xu

Efficient Smoothing of Dilated Convolutions for Image Segmentation

Dilated Convolutions have been shown to be highly useful for the task of image segmentation. By introducing gaps into convolutional filters, they enable the use of larger receptive fields without increasing the original kernel size. Even…

Computer Vision and Pattern Recognition · Computer Science 2019-03-20 Thomas Ziegler , Manuel Fritsche , Lorenz Kuhn , Konstantin Donhauser

Optimal Density Functions for Weighted Convolution in Learning Models

The paper introduces the weighted convolution, a novel approach to the convolution for signals defined on regular grids (e.g., 2D images) through the application of an optimal density function to scale the contribution of neighbouring…

Computer Vision and Pattern Recognition · Computer Science 2025-06-02 Simone Cammarasana , Giuseppe Patanè

Depth Edge Guided CNNs for Sparse Depth Upsampling

Guided sparse depth upsampling aims to upsample an irregularly sampled sparse depth map when an aligned high-resolution color image is given as guidance. Many neural networks have been designed for this task. However, they often ignore the…

Computer Vision and Pattern Recognition · Computer Science 2020-03-24 Yi Guo , Ji Liu

More complex encoder is not all you need

U-Net and its variants have been widely used in medical image segmentation. However, most current U-Net variants confine their improvement strategies to building more complex encoder, while leaving the decoder unchanged or adopting a simple…

Image and Video Processing · Electrical Eng. & Systems 2023-10-30 Weibin Yang , Longwei Xu , Pengwei Wang , Dehua Geng , Yusong Li , Mingyuan Xu , Zhiqi Dong

Lifting Layers: Analysis and Applications

The great advances of learning-based approaches in image processing and computer vision are largely based on deeply nested networks that compose linear transfer functions with suitable non-linearities. Interestingly, the most frequently…

Computer Vision and Pattern Recognition · Computer Science 2018-03-26 Peter Ochs , Tim Meinhardt , Laura Leal-Taixe , Michael Moeller

Transformer-based Learned Image Compression for Joint Decoding and Denoising

This work introduces a Transformer-based image compression system. It has the flexibility to switch between the standard image reconstruction and the denoising reconstruction from a single compressed bitstream. Instead of training separate…

Image and Video Processing · Electrical Eng. & Systems 2024-02-21 Yi-Hsin Chen , Kuan-Wei Ho , Shiau-Rung Tsai , Guan-Hsun Lin , Alessandro Gnutti , Wen-Hsiao Peng , Riccardo Leonardi

Learnable Sampling 3D Convolution for Video Enhancement and Action Recognition

A key challenge in video enhancement and action recognition is to fuse useful information from neighboring frames. Recent works suggest establishing accurate correspondences between neighboring frames before fusing temporal information.…

Computer Vision and Pattern Recognition · Computer Science 2020-11-24 Shuyang Gu , Jianmin Bao , Dong Chen

An Underparametrized Deep Decoder Architecture for Graph Signals

While deep convolutional architectures have achieved remarkable results in a gamut of supervised applications dealing with images and speech, recent works show that deep untrained non-convolutional architectures can also outperform…

Signal Processing · Electrical Eng. & Systems 2020-03-13 Samuel Rey , Antonio G. Marques , Santiago Segarra

Additive decomposition of one-dimensional signals using Transformers

One-dimensional signal decomposition is a well-established and widely used technique across various scientific fields. It serves as a highly valuable pre-processing step for data analysis. While traditional decomposition techniques often…

Machine Learning · Computer Science 2025-06-09 Samuele Salti , Andrea Pinto , Alessandro Lanza , Serena Morigi

Dense residual Transformer for image denoising

Image denoising is an important low-level computer vision task, which aims to reconstruct a noise-free and high-quality image from a noisy image. With the development of deep learning, convolutional neural network (CNN) has been gradually…

Computer Vision and Pattern Recognition · Computer Science 2022-05-17 Chao Yao , Shuo Jin , Meiqin Liu , Xiaojuan Ban

Learning Task-Specific Generalized Convolutions in the Permutohedral Lattice

Dense prediction tasks typically employ encoder-decoder architectures, but the prevalent convolutions in the decoder are not image-adaptive and can lead to boundary artifacts. Different generalized convolution operations have been…

Computer Vision and Pattern Recognition · Computer Science 2019-09-10 Anne S. Wannenwetsch , Martin Kiefel , Peter V. Gehler , Stefan Roth

Transposer: Universal Texture Synthesis Using Feature Maps as Transposed Convolution Filter

Conventional CNNs for texture synthesis consist of a sequence of (de)-convolution and up/down-sampling layers, where each layer operates locally and lacks the ability to capture the long-term structural dependency required by texture…

Computer Vision and Pattern Recognition · Computer Science 2020-07-15 Guilin Liu , Rohan Taori , Ting-Chun Wang , Zhiding Yu , Shiqiu Liu , Fitsum A. Reda , Karan Sapra , Andrew Tao , Bryan Catanzaro

Convolutional Occupancy Networks

Recently, implicit neural representations have gained popularity for learning-based 3D reconstruction. While demonstrating promising results, most implicit approaches are limited to comparably simple geometry of single objects and do not…

Computer Vision and Pattern Recognition · Computer Science 2020-08-04 Songyou Peng , Michael Niemeyer , Lars Mescheder , Marc Pollefeys , Andreas Geiger

Sparse is Enough in Scaling Transformers

Large Transformer models yield impressive results on many tasks, but are expensive to train, or even fine-tune, and so slow at decoding that their use and study becomes out of reach. We address this problem by leveraging sparsity. We study…

Machine Learning · Computer Science 2021-11-29 Sebastian Jaszczur , Aakanksha Chowdhery , Afroz Mohiuddin , Łukasz Kaiser , Wojciech Gajewski , Henryk Michalewski , Jonni Kanerva

Multi-Level Representation Learning for Deep Subspace Clustering

This paper proposes a novel deep subspace clustering approach which uses convolutional autoencoders to transform input images into new representations lying on a union of linear subspaces. The first contribution of our work is to insert…

Computer Vision and Pattern Recognition · Computer Science 2020-01-24 Mohsen Kheirandishfard , Fariba Zohrizadeh , Farhad Kamangar

Convolution-Free Medical Image Segmentation using Transformers

Like other applications in computer vision, medical image segmentation has been most successfully addressed using deep learning models that rely on the convolution operation as their main building block. Convolutions enjoy important…

Image and Video Processing · Electrical Eng. & Systems 2022-04-05 Davood Karimi , Serge Vasylechko , Ali Gholipour

Design of Efficient Convolutional Layers using Single Intra-channel Convolution, Topological Subdivisioning and Spatial "Bottleneck" Structure

Deep convolutional neural networks achieve remarkable visual recognition performance, at the cost of high computational complexity. In this paper, we have a new design of efficient convolutional layers based on three schemes. The 3D…

Computer Vision and Pattern Recognition · Computer Science 2017-01-25 Min Wang , Baoyuan Liu , Hassan Foroosh

Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction

While convolutional neural networks have shown a tremendous impact on various computer vision tasks, they generally demonstrate limitations in explicitly modeling long-range dependencies due to the intrinsic locality of the convolution…

Computer Vision and Pattern Recognition · Computer Science 2021-08-06 Guanglei Yang , Hao Tang , Mingli Ding , Nicu Sebe , Elisa Ricci