Related papers: DPT: Deformable Patch-based Transformer for Visual…

200 papers

A novel template matching algorithm that can incorporate the concept of deformable parts is presented in this paper. Unlike the deformable part model (DPM) employed in object recognition, the proposed template-matching approach called…

Computer Vision and Pattern Recognition · Computer Science 2016-04-13 Hyungtae Lee, Heesung Kwon, Ryan M. Robinson, William D. Nothwang

The objective of dense material segmentation is to identify the material categories for every image pixel. Recent studies adopt image patches to extract material features. Although the trained networks can improve the segmentation…

Computer Vision and Pattern Recognition · Computer Science 2024-02-29 Yuwen Heng, Srinandan Dasmahapatra, Hansung Kim

Image anomaly detection consists in detecting images or image portions that are visually different from the majority of the samples in a dataset. The task is of practical importance for various real-life applications like biomedical image…

Computer Vision and Pattern Recognition · Computer Science 2022-10-28 Axel De Nardin, Pankaj Mishra, Gian Luca Foresti, Claudio Piciarelli

The key idea of current deep learning methods for dense prediction is to apply a model on a regular patch centered on each pixel to make pixel-wise predictions. These methods are limited in the sense that the patches are determined by…

Computer Vision and Pattern Recognition · Computer Science 2017-06-09 Jun Li, Yongjun Chen, Lei Cai, Ian Davidson, Shuiwang Ji

Transformer is beneficial for image denoising tasks since it can model long-range dependencies to overcome the limitations presented by inductive convolutional biases. However, directly applying the transformer structure to remove noise is…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Kangliang Liu, Xiangcheng Du, Sijie Liu, Yingbin Zheng, Xingjiao Wu, Cheng Jin

This paper proposes a novel video inpainting method. We make three main contributions: First, we extended previous Transformers with patch alignment by introducing Deformed Patch-based Homography (DePtH), which improves patch-level feature…

Computer Vision and Pattern Recognition · Computer Science 2022-09-29 Jiayin Cai, Changlin Li, Xin Tao, Chun Yuan, Yu-Wing Tai

Patch attacks, one of the most threatening forms of physical attack in adversarial examples, can lead networks to induce misclassification by modifying pixels arbitrarily in a continuous region. Certifiable patch defense can guarantee…

Computer Vision and Pattern Recognition · Computer Science 2022-03-17 Zhaoyu Chen, Bo Li, Jianghe Xu, Shuang Wu, Shouhong Ding, Wenqiang Zhang

We present a new encoder-decoder Vision Transformer architecture, Patcher, for medical image segmentation. Unlike standard Vision Transformers, it employs Patcher blocks that segment an image into large patches, each of which is further…

Image and Video Processing · Electrical Eng. & Systems 2023-05-31 Yanglan Ou, Ye Yuan, Xiaolei Huang, Stephen T. C. Wong, John Volpi, James Z. Wang, Kelvin Wong

Video transformers have recently emerged as an effective alternative to convolutional networks for action classification. However, most prior video transformers adopt either global space-time attention or hand-defined strategies to compare…

Computer Vision and Pattern Recognition · Computer Science 2022-04-01 Jue Wang, Lorenzo Torresani

DETR has been recently proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance. However, it suffers from slow convergence and limited feature spatial resolution, due to the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-19 Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, Jifeng Dai

In this paper we propose novel Deformable Part Networks (DPNs) to learn pose-invariant representations for 2D object recognition. In contrast to the state-of-the-art pose-aware networks such as CapsNet (Sabour et al., 2017) and STN…

Machine Learning · Statistics 2018-05-24 Ziming Zhang, Rongmei Lin, Alan Sullivan

Sparse-view computed tomography (CT) can greatly reduce radiation dose but suffers from severe image artifacts. Recently, deep learning-based methods for sparse-view CT reconstruction have attracted major attention.…

Image and Video Processing · Electrical Eng. & Systems 2022-11-21 Wenjun Xia, Wenxiang Cong, Ge Wang

Vision Transformers (ViTs) partition input images into uniformly sized patches regardless of their content, resulting in long input sequence lengths for high-resolution images. We present Adaptive Patch Transformers (APT), which addresses…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Rohan Choudhury, JungEun Kim, Jinhyung Park, Eunho Yang, László A. Jeni, Kris M. Kitani

Vision Transformers have achieved great success in computer vision, delivering exceptional performance across various tasks. However, their inherent reliance on sequential input enforces the manual partitioning of images into patch…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Changzhen Li, Jie Zhang, Yang Wei, Zhilong Ji, Jinfeng Bai, Shiguang Shan

Accurate segmentation of aortic vascular structures is critical for diagnosing and treating cardiovascular diseases. Traditional Transformer-based models have shown promise in this domain by capturing long-range dependencies between vascular…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Zhenxi Zhang, Fuchen Zheng, Adnan Iltaf, Yifei Han, Zhenyu Cheng, Yue Du, Bin Li, Tianyong Liu, Shoujun Zhou

Regular object detection methods output rectangle bounding boxes, which are unable to accurately describe the actual object shapes. Instance segmentation methods output pixel-level labels, which are computationally expensive for real-time…

Computer Vision and Pattern Recognition · Computer Science 2023-04-06 Yang Zheng, Oles Andrienko, Yonglei Zhao, Minwoo Park, Trung Pham

Transformers are transforming the landscape of computer vision, especially for recognition tasks. Detection transformers are the first fully end-to-end learning systems for object detection, while vision transformers are the first fully…

Computer Vision and Pattern Recognition · Computer Science 2021-11-30 Hwanjun Song, Deqing Sun, Sanghyuk Chun, Varun Jampani, Dongyoon Han, Byeongho Heo, Wonjae Kim, Ming-Hsuan Yang

Recently, numerous algorithms have been developed to tackle the problem of light field super-resolution (LFSR), i.e., super-resolving low-resolution light fields to gain high-resolution views. Despite delivering encouraging results, these…

Computer Vision and Pattern Recognition · Computer Science 2022-01-04 Shunzhou Wang, Tianfei Zhou, Yao Lu, Huijun Di

Vision transformer has demonstrated promising performance on challenging computer vision tasks. However, directly training the vision transformers may yield unstable and sub-optimal results. Recent works propose to improve the performance…

Computer Vision and Pattern Recognition · Computer Science 2021-06-14 Chengyue Gong, Dilin Wang, Meng Li, Vikas Chandra, Qiang Liu

Deep learning based object detectors are commonly deployed on mobile devices to solve a variety of tasks. For maximum accuracy, each detector is usually trained to solve one single specific task, and comes with a completely independent set…

Computer Vision and Pattern Recognition · Computer Science 2021-01-06 Keren Ye, Adriana Kovashka, Mark Sandler, Menglong Zhu, Andrew Howard, Marco Fornoni