Related papers: Fine-grained Visual-textual Representation Learnin…

Classification-Specific Parts for Improving Fine-Grained Visual Categorization

Fine-grained visual categorization is a classification task for distinguishing categories with high intra-class and small inter-class variance. While global approaches aim at using the whole image for performing the classification,…

Computer Vision and Pattern Recognition · Computer Science 2020-07-07 Dimitri Korsch , Paul Bodesheim , Joachim Denzler

Fine-graind Image Classification via Combining Vision and Language

Fine-grained image classification is a challenging task due to the large intra-class variance and small inter-class variance, aiming at recognizing hundreds of sub-categories belonging to the same basic-level category. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2017-11-29 Xiangteng He , Yuxin Peng

Part-guided Relational Transformers for Fine-grained Visual Recognition

Fine-grained visual recognition is to classify objects with visually similar appearances into subcategories, which has made great progress with the development of deep CNNs. However, handling subtle differences between different…

Computer Vision and Pattern Recognition · Computer Science 2022-12-29 Yifan Zhao , Jia Li , Xiaowu Chen , Yonghong Tian

Fine-Grained Visual Entailment

Visual entailment is a recently proposed multimodal reasoning task where the goal is to predict the logical relationship of a piece of text to an image. In this paper, we propose an extension of this task, where the goal is to predict the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Christopher Thomas , Yipeng Zhang , Shih-Fu Chang

Self-Supervised Learning for Fine-Grained Image Classification

Fine-grained image classification involves identifying different subcategories of a class which possess very subtle discriminatory features. Fine-grained datasets usually provide bounding box annotations along with class labels to aid the…

Computer Vision and Pattern Recognition · Computer Science 2021-07-30 Farha Al Breiki , Muhammad Ridzuan , Rushali Grandhe

Fine-Grained Classification: Connecting Metadata via Cross-Contrastive Pre-Training

Fine-grained visual classification aims to recognize objects belonging to many subordinate categories of a supercategory, where appearance alone often fails to distinguish highly similar classes. We propose a unified framework that…

Computer Vision and Pattern Recognition · Computer Science 2025-10-21 Sumit Mamtani , Yash Thesia

Interpretable and Accurate Fine-grained Recognition via Region Grouping

We present an interpretable deep model for fine-grained visual recognition. At the core of our method lies the integration of region-based part discovery and attribution within a deep neural network. Our model is trained using image-level…

Computer Vision and Pattern Recognition · Computer Science 2020-05-22 Zixuan Huang , Yin Li

Democratizing Fine-grained Visual Recognition with Large Language Models

Identifying subordinate-level categories from images is a longstanding task in computer vision and is referred to as fine-grained visual recognition (FGVR). It has tremendous significance in real-world applications since an average…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Mingxuan Liu , Subhankar Roy , Wenjing Li , Zhun Zhong , Nicu Sebe , Elisa Ricci

Parsing Objects at a Finer Granularity: A Survey

Fine-grained visual parsing, including fine-grained part segmentation and fine-grained object recognition, has attracted considerable critical attention due to its importance in many real-world applications, e.g., agriculture, remote…

Computer Vision and Pattern Recognition · Computer Science 2024-02-23 Yifan Zhao , Jia Li , Yonghong Tian

R2-Trans:Fine-Grained Visual Categorization with Redundancy Reduction

Fine-grained visual categorization (FGVC) aims to discriminate similar subcategories, whose main challenge is the large intraclass diversities and subtle inter-class differences. Existing FGVC methods usually select discriminant regions…

Computer Vision and Pattern Recognition · Computer Science 2022-04-22 Yu Wang , Shuo Ye , Shujian Yu , Xinge You

Multi-View Active Fine-Grained Recognition

As fine-grained visual classification (FGVC) being developed for decades, great works related have exposed a key direction -- finding discriminative local regions and revealing subtle differences. However, unlike identifying visual contents…

Computer Vision and Pattern Recognition · Computer Science 2022-06-03 Ruoyi Du , Wenqing Yu , Heqing Wang , Dongliang Chang , Ting-En Lin , Yongbin Li , Zhanyu Ma

Improving Fine-grained Visual Understanding in VLMs through Text-Only Training

Visual-Language Models (VLMs) have become a powerful tool for bridging the gap between visual and linguistic understanding. However, the conventional learning approaches for VLMs often suffer from limitations, such as the high resource…

Computation and Language · Computer Science 2025-04-01 Dasol Choi , Guijin Son , Soo Yong Kim , Gio Paik , Seunghyeok Hong

Fine-Grained Categorization via CNN-Based Automatic Extraction and Integration of Object-Level and Part-Level Features

Fine-grained categorization can benefit from part-based features which reveal subtle visual differences between object categories. Handcrafted features have been widely used for part detection and classification. Although a recent trend…

Computer Vision and Pattern Recognition · Computer Science 2017-06-23 Ting Sun , Lin Sun , Dit-Yan Yeung

Fast Fine-grained Image Classification via Weakly Supervised Discriminative Localization

Fine-grained image classification is to recognize hundreds of subcategories in each basic-level category. Existing methods employ discriminative localization to find the key distinctions among subcategories. However, they generally have two…

Computer Vision and Pattern Recognition · Computer Science 2018-05-08 Xiangteng He , Yuxin Peng , Junjie Zhao

Fine-R1: Make Multi-modal LLMs Excel in Fine-Grained Visual Recognition by Chain-of-Thought Reasoning

Any entity in the visual world can be hierarchically grouped based on shared characteristics and mapped to fine-grained sub-categories. While Multi-modal Large Language Models (MLLMs) achieve strong performance on coarse-grained visual…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Hulingxiao He , Zijun Geng , Yuxin Peng

Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features

Fine-grained visual classification is a challenging task that recognizes the sub-classes belonging to the same meta-class. Large inter-class similarity and intra-class variance is the main challenge of this task. Most exiting methods try to…

Computer Vision and Pattern Recognition · Computer Science 2021-02-02 Dongliang Chang , Yixiao Zheng , Zhanyu Ma , Ruoyi Du , Kongming Liang

Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval

Fine-grained text-to-image retrieval aims to retrieve a fine-grained target image with a given text query. Existing methods typically assume that each training image is accurately depicted by its textual descriptions. However, textual…

Computer Vision and Pattern Recognition · Computer Science 2025-04-11 Zehong Ma , Hao Chen , Wei Zeng , Limin Su , Shiliang Zhang

Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features

Text contained in an image carries high-level semantics that can be exploited to achieve richer image understanding. In particular, the mere presence of text provides strong guiding content that should be employed to tackle a diversity of…

Computer Vision and Pattern Recognition · Computer Science 2020-01-15 Andres Mafla , Sounak Dey , Ali Furkan Biten , Lluis Gomez , Dimosthenis Karatzas

Self-Supervised Learning for Fine-Grained Visual Categorization

Recent research in self-supervised learning (SSL) has shown its capability in learning useful semantic representations from images for classification tasks. Through our work, we study the usefulness of SSL for Fine-Grained Visual…

Computer Vision and Pattern Recognition · Computer Science 2021-05-20 Muhammad Maaz , Hanoona Abdul Rasheed , Dhanalaxmi Gaddam

Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches

Fine-grained visual classification (FGVC) is much more challenging than traditional classification tasks due to the inherently subtle intra-class object variations. Recent works mainly tackle this problem by focusing on how to locate the…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Ruoyi Du , Dongliang Chang , Ayan Kumar Bhunia , Jiyang Xie , Zhanyu Ma , Yi-Zhe Song , Jun Guo