Related papers: Adaptive Patching for High-resolution Image Segmen…

Attention Toward Neighbors: A Context Aware Framework for High Resolution Image Segmentation

High-resolution image segmentation remains challenging and error-prone due to the enormous size of intermediate feature maps. Conventional methods avoid this problem by using patch based approaches where each patch is segmented…

Computer Vision and Pattern Recognition · Computer Science 2021-08-27 Fahim Faisal Niloy , M. Ashraful Amin , Amin Ahsan Ali , AKM Mahbubur Rahman

Processing Megapixel Images with Deep Attention-Sampling Models

Existing deep architectures cannot operate on very large signals such as megapixel images due to computational and memory constraints. To tackle this limitation, we propose a fully differentiable end-to-end trainable model that samples and…

Computer Vision and Pattern Recognition · Computer Science 2019-07-18 Angelos Katharopoulos , François Fleuret

APT: Improving Diffusion Models for High Resolution Image Generation with Adaptive Path Tracing

Latent Diffusion Models (LDMs) are generally trained at fixed resolutions, limiting their capability when scaling up to high-resolution images. While training-based approaches address this limitation by training on high-resolution datasets,…

Computer Vision and Pattern Recognition · Computer Science 2025-07-30 Sangmin Han , Jinho Jeong , Jinwoo Kim , Seon Joo Kim

Attention-Based Multimodal Image Matching

We propose an attention-based approach for multimodal image patch matching using a Transformer encoder attending to the feature maps of a multiscale Siamese CNN. Our encoder is shown to efficiently aggregate multiscale image embeddings…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Aviad Moreshet , Yosi Keller

Iterative Patch Selection for High-Resolution Image Recognition

High-resolution images are prevalent in various applications, such as autonomous driving and computer-aided diagnosis. However, training neural networks on such images is computationally challenging and easily leads to out-of-memory errors…

Computer Vision and Pattern Recognition · Computer Science 2023-03-08 Benjamin Bergner , Christoph Lippert , Aravindh Mahendran

Multi-dimension Transformer with Attention-based Filtering for Medical Image Segmentation

The accurate segmentation of medical images is crucial for diagnosing and treating diseases. Recent studies demonstrate that vision transformer-based methods have significantly improved performance in medical image segmentation, primarily…

Computer Vision and Pattern Recognition · Computer Science 2024-05-22 Wentao Wang , Xi Xiao , Mingjie Liu , Qing Tian , Xuanyao Huang , Qizhen Lan , Swalpa Kumar Roy , Tianyang Wang

Adaptive Morph-Patch Transformer for Aortic Vessel Segmentation

Accurate segmentation of aortic vascular structures is critical for diagnosing and treating cardiovascular diseases.Traditional Transformer-based models have shown promise in this domain by capturing long-range dependencies between vascular…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Zhenxi Zhang , Fuchen Zheng , Adnan Iltaf , Yifei Han , Zhenyu Cheng , Yue Du , Bin Li , Tianyong Liu , Shoujun Zhou

Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image Segmentation

We present a new encoder-decoder Vision Transformer architecture, Patcher, for medical image segmentation. Unlike standard Vision Transformers, it employs Patcher blocks that segment an image into large patches, each of which is further…

Image and Video Processing · Electrical Eng. & Systems 2023-05-31 Yanglan Ou , Ye Yuan , Xiaolei Huang , Stephen T. C. Wong , John Volpi , James Z. Wang , Kelvin Wong

Accelerating Vision Transformers with Adaptive Patch Sizes

Vision Transformers (ViTs) partition input images into uniformly sized patches regardless of their content, resulting in long input sequence lengths for high-resolution images. We present Adaptive Patch Transformers (APT), which addresses…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Rohan Choudhury , JungEun Kim , Jinhyung Park , Eunho Yang , László A. Jeni , Kris M. Kitani

HIPA: Hierarchical Patch Transformer for Single Image Super Resolution

Transformer-based architectures start to emerge in single image super resolution (SISR) and have achieved promising performance. Most existing Vision Transformers divide images into the same number of patches with a fixed size, which may…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Qing Cai , Yiming Qian , Jinxing Li , Jun Lv , Yee-Hong Yang , Feng Wu , David Zhang

Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter

In this work, we present Patch-Adapter, an effective framework for high-resolution text-guided image inpainting. Unlike existing methods limited to lower resolutions, our approach achieves 4K+ resolution while maintaining precise content…

Computer Vision and Pattern Recognition · Computer Science 2025-10-16 Jianhui Zhang , Sheng Cheng , Qirui Sun , Jia Liu , Wang Luyang , Chaoyu Feng , Chen Fang , Lei Lei , Jue Wang , Shuaicheng Liu

High-Resolution Deep Image Matting

Image matting is a key technique for image and video editing and composition. Conventionally, deep learning approaches take the whole input image and an associated trimap to infer the alpha matte using convolutional neural networks. Such…

Computer Vision and Pattern Recognition · Computer Science 2021-01-18 Haichao Yu , Ning Xu , Zilong Huang , Yuqian Zhou , Humphrey Shi

Masked Transformer for image Anomaly Localization

Image anomaly detection consists in detecting images or image portions that are visually different from the majority of the samples in a dataset. The task is of practical importance for various real-life applications like biomedical image…

Computer Vision and Pattern Recognition · Computer Science 2022-10-28 Axel De Nardin , Pankaj Mishra , Gian Luca Foresti , Claudio Piciarelli

Towards Hierarchical Regional Transformer-based Multiple Instance Learning

The classification of gigapixel histopathology images with deep multiple instance learning models has become a critical task in digital pathology and precision medicine. In this work, we propose a Transformer-based multiple instance…

Computer Vision and Pattern Recognition · Computer Science 2023-11-21 Josef Cersovsky , Sadegh Mohammadi , Dagmar Kainmueller , Johannes Hoehne

Fast and Accurate 3D Medical Image Segmentation with Data-swapping Method

Deep neural network models used for medical image segmentation are large because they are trained with high-resolution three-dimensional (3D) images. Graphics processing units (GPUs) are widely used to accelerate the trainings. However, the…

Machine Learning · Computer Science 2018-12-20 Haruki Imai , Samuel Matzek , Tung D. Le , Yasushi Negishi , Kiyokuni Kawachiya

Patch Pruning Strategy Based on Robust Statistical Measures of Attention Weight Diversity in Vision Transformers

Multi-head self-attention is a distinctive feature extraction mechanism of vision transformers that computes pairwise relationships among all input patches, contributing significantly to their high performance. However, it is known to incur…

Computer Vision and Pattern Recognition · Computer Science 2025-07-28 Yuki Igaue , Hiroaki Aizawa

Harnessing The Power of Attention For Patch-Based Biomedical Image Classification

Biomedical image analysis is of paramount importance for the advancement of healthcare and medical research. Although conventional convolutional neural networks (CNNs) are frequently employed in this domain, facing limitations in capturing…

Computer Vision and Pattern Recognition · Computer Science 2025-10-28 Gousia Habib , Shaima Qureshi , Malik ishfaq

DPT: Deformable Patch-based Transformer for Visual Recognition

Transformer has achieved great success in computer vision, while how to split patches in an image remains a problem. Existing methods usually use a fixed-size patch embedding which might destroy the semantics of objects. To address this…

Computer Vision and Pattern Recognition · Computer Science 2021-08-02 Zhiyang Chen , Yousong Zhu , Chaoyang Zhao , Guosheng Hu , Wei Zeng , Jinqiao Wang , Ming Tang

Locally Shifted Attention With Early Global Integration

Recent work has shown the potential of transformers for computer vision applications. An image is first partitioned into patches, which are then used as input tokens for the attention mechanism. Due to the expensive quadratic cost of the…

Computer Vision and Pattern Recognition · Computer Science 2021-12-23 Shelly Sheynin , Sagie Benaim , Adam Polyak , Lior Wolf

Biomedical Image Segmentation by Retina-like Sequential Attention Mechanism Using Only A Few Training Images

In this paper we propose a novel deep learning-based algorithm for biomedical image segmentation which uses a sequential attention mechanism able to shift the focus of attention across the image in a selective way, allowing subareas which…

Computer Vision and Pattern Recognition · Computer Science 2019-09-30 Shohei Hayashi , Bisser Raytchev , Toru Tamaki , Kazufumi Kaneda