Related papers: Patchy Image Structure Classification Using Multi-…

Adaptive Morph-Patch Transformer for Aortic Vessel Segmentation

Accurate segmentation of aortic vascular structures is critical for diagnosing and treating cardiovascular diseases.Traditional Transformer-based models have shown promise in this domain by capturing long-range dependencies between vascular…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Zhenxi Zhang , Fuchen Zheng , Adnan Iltaf , Yifei Han , Zhenyu Cheng , Yue Du , Bin Li , Tianyong Liu , Shoujun Zhou

SORT: Second-Order Response Transform for Visual Recognition

In this paper, we reveal the importance and benefits of introducing second-order operations into deep neural networks. We propose a novel approach named Second-Order Response Transform (SORT), which appends element-wise product transform to…

Computer Vision and Pattern Recognition · Computer Science 2017-09-15 Yan Wang , Lingxi Xie , Chenxi Liu , Ya Zhang , Wenjun Zhang , Alan Yuille

Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer

Segmentation of ultra-high resolution (UHR) images is a critical task with numerous applications, yet it poses significant challenges due to high spatial resolution and rich fine details. Recent approaches adopt a dual-branch architecture,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Haopeng Sun , Yingwei Zhang , Lumin Xu , Sheng Jin , Yiqiang Chen

Multiscale Vision Transformer With Deep Clustering-Guided Refinement for Weakly Supervised Object Localization

This work addresses the task of weakly-supervised object localization. The goal is to learn object localization using only image-level class labels, which are much easier to obtain compared to bounding box annotations. This task is…

Computer Vision and Pattern Recognition · Computer Science 2023-12-18 David Kim , Sinhae Cha , Byeongkeun Kang

A Universal Representation Transformer Layer for Few-Shot Image Classification

Few-shot classification aims to recognize unseen classes when presented with only a small number of samples. We consider the problem of multi-domain few-shot image classification, where unseen classes and examples come from diverse data…

Machine Learning · Computer Science 2020-09-04 Lu Liu , William Hamilton , Guodong Long , Jing Jiang , Hugo Larochelle

Cross-Modality Fusion Transformer for Multispectral Object Detection

Multispectral image pairs can provide the combined information, making object detection applications more reliable and robust in the open world. To fully exploit the different modalities, we present a simple yet effective cross-modality…

Image and Video Processing · Electrical Eng. & Systems 2022-10-05 Fang Qingyun , Han Dapeng , Wang Zhaokui

Mesh-SORT: Simple and effective location-wise tracker with lost management strategies

Multi-Object Tracking (MOT) has gained extensive attention in recent years due to its potential applications in traffic and pedestrian detection. We note that tracking by detection may suffer from errors generated by noise detectors, such…

Computer Vision and Pattern Recognition · Computer Science 2023-03-14 ZongTan Li

A Joint Morphological Profiles and Patch Tensor Change Detection for Hyperspectral Imagery

Multi-temporal hyperspectral images can be used to detect changed information, which has gradually attracted researchers' attention. However, traditional change detection algorithms have not deeply explored the relevance of spatial and…

Computer Vision and Pattern Recognition · Computer Science 2022-01-21 Zengfu Hou , Wei Li

Topology-Aware Uncertainty for Image Segmentation

Segmentation of curvilinear structures such as vasculature and road networks is challenging due to relatively weak signals and complex geometry/topology. To facilitate and accelerate large scale annotation, one has to adopt semi-automatic…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Saumya Gupta , Yikai Zhang , Xiaoling Hu , Prateek Prasanna , Chao Chen

Exploring Vision Transformers for Fine-grained Classification

Existing computer vision research in categorization struggles with fine-grained attributes recognition due to the inherently high intra-class variances and low inter-class variances. SOTA methods tackle this challenge by locating the most…

Computer Vision and Pattern Recognition · Computer Science 2021-07-01 Marcos V. Conde , Kerem Turgutlu

Transition Matrix Representation of Trees with Transposed Convolutions

How can we effectively find the best structures in tree models? Tree models have been favored over complex black box models in domains where interpretability is crucial for making irreversible decisions. However, searching for a tree…

Machine Learning · Computer Science 2022-02-23 Jaemin Yoo , Lee Sael

ModeT: Learning Deformable Image Registration via Motion Decomposition Transformer

The Transformer structures have been widely used in computer vision and have recently made an impact in the area of medical image registration. However, the use of Transformer in most registration networks is straightforward. These networks…

Computer Vision and Pattern Recognition · Computer Science 2024-03-26 Haiqiao Wang , Dong Ni , Yi Wang

Recent Advances in Embedding Methods for Multi-Object Tracking: A Survey

Multi-object tracking (MOT) aims to associate target objects across video frames in order to obtain entire moving trajectories. With the advancement of deep neural networks and the increasing demand for intelligent video analysis, MOT has…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Gaoang Wang , Mingli Song , Jenq-Neng Hwang

PI-Att: Topology Attention for Segmentation Networks through Adaptive Persistence Image Representation

Segmenting multiple objects (e.g., organs) in medical images often requires an understanding of their topology, which simultaneously quantifies the shape of the objects and their positions relative to each other. This understanding is…

Image and Video Processing · Electrical Eng. & Systems 2024-08-16 Mehmet Bahadir Erden , Sinan Unver , Ilke Ali Gurses , Rustu Turkay , Cigdem Gunduz-Demir

2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds

The commonly adopted detect-then-match approach to registration finds difficulties in the cross-modality cases due to the incompatible keypoint detection and inconsistent feature description. We propose, 2D3D-MATR, a detection-free method…

Computer Vision and Pattern Recognition · Computer Science 2023-08-15 Minhao Li , Zheng Qin , Zhirui Gao , Renjiao Yi , Chenyang Zhu , Yulan Guo , Kai Xu

Let Images Give You More:Point Cloud Cross-Modal Training for Shape Analysis

Although recent point cloud analysis achieves impressive progress, the paradigm of representation learning from a single modality gradually meets its bottleneck. In this work, we take a step towards more discriminative 3D point cloud…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Xu Yan , Heshen Zhan , Chaoda Zheng , Jiantao Gao , Ruimao Zhang , Shuguang Cui , Zhen Li

PointTrack++ for Effective Online Multi-Object Tracking and Segmentation

Multiple-object tracking and segmentation (MOTS) is a novel computer vision task that aims to jointly perform multiple object tracking (MOT) and instance segmentation. In this work, we present PointTrack++, an effective on-line framework…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Zhenbo Xu , Wei Zhang , Xiao Tan , Wei Yang , Xiangbo Su , Yuchen Yuan , Hongwu Zhang , Shilei Wen , Errui Ding , Liusheng Huang

Patch-based field-of-view matching in multi-modal images for electroporation-based ablations

Various multi-modal imaging sensors are currently involved at different steps of an interventional therapeutic work-flow. Cone beam computed tomography (CBCT), computed tomography (CT) or Magnetic Resonance (MR) images thereby provides…

Image and Video Processing · Electrical Eng. & Systems 2020-11-25 Luc Lafitte , Rémi Giraud , Cornel Zachiu , Mario Ries , Olivier Sutter , Antoine Petit , Olivier Seror , Clair Poignard , Baudouin Denis de Senneville

Local Intensity Order Transformation for Robust Curvilinear Object Segmentation

Segmentation of curvilinear structures is important in many applications, such as retinal blood vessel segmentation for early detection of vessel diseases and pavement crack segmentation for road condition evaluation and maintenance.…

Image and Video Processing · Electrical Eng. & Systems 2022-04-06 Tianyi Shi , Nicolas Boutry , Yongchao Xu , Thierry Géraud

Masked Transformer for image Anomaly Localization

Image anomaly detection consists in detecting images or image portions that are visually different from the majority of the samples in a dataset. The task is of practical importance for various real-life applications like biomedical image…

Computer Vision and Pattern Recognition · Computer Science 2022-10-28 Axel De Nardin , Pankaj Mishra , Gian Luca Foresti , Claudio Piciarelli