English
Related papers

Related papers: Fine-grained Visual-textual Representation Learnin…

200 papers

Fine-grained visual categorization is a classification task for distinguishing categories with high intra-class and small inter-class variance. While global approaches aim at using the whole image for performing the classification,…

Computer Vision and Pattern Recognition · Computer Science 2020-07-07 Dimitri Korsch , Paul Bodesheim , Joachim Denzler

Fine-grained image classification is a challenging task due to the large intra-class variance and small inter-class variance, aiming at recognizing hundreds of sub-categories belonging to the same basic-level category. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2017-11-29 Xiangteng He , Yuxin Peng

Fine-grained visual recognition is to classify objects with visually similar appearances into subcategories, which has made great progress with the development of deep CNNs. However, handling subtle differences between different…

Computer Vision and Pattern Recognition · Computer Science 2022-12-29 Yifan Zhao , Jia Li , Xiaowu Chen , Yonghong Tian

Visual entailment is a recently proposed multimodal reasoning task where the goal is to predict the logical relationship of a piece of text to an image. In this paper, we propose an extension of this task, where the goal is to predict the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Christopher Thomas , Yipeng Zhang , Shih-Fu Chang

Fine-grained image classification involves identifying different subcategories of a class which possess very subtle discriminatory features. Fine-grained datasets usually provide bounding box annotations along with class labels to aid the…

Computer Vision and Pattern Recognition · Computer Science 2021-07-30 Farha Al Breiki , Muhammad Ridzuan , Rushali Grandhe

Fine-grained visual classification aims to recognize objects belonging to many subordinate categories of a supercategory, where appearance alone often fails to distinguish highly similar classes. We propose a unified framework that…

Computer Vision and Pattern Recognition · Computer Science 2025-10-21 Sumit Mamtani , Yash Thesia

We present an interpretable deep model for fine-grained visual recognition. At the core of our method lies the integration of region-based part discovery and attribution within a deep neural network. Our model is trained using image-level…

Computer Vision and Pattern Recognition · Computer Science 2020-05-22 Zixuan Huang , Yin Li

Identifying subordinate-level categories from images is a longstanding task in computer vision and is referred to as fine-grained visual recognition (FGVR). It has tremendous significance in real-world applications since an average…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Mingxuan Liu , Subhankar Roy , Wenjing Li , Zhun Zhong , Nicu Sebe , Elisa Ricci

Fine-grained visual parsing, including fine-grained part segmentation and fine-grained object recognition, has attracted considerable critical attention due to its importance in many real-world applications, e.g., agriculture, remote…

Computer Vision and Pattern Recognition · Computer Science 2024-02-23 Yifan Zhao , Jia Li , Yonghong Tian

Fine-grained visual categorization (FGVC) aims to discriminate similar subcategories, whose main challenge is the large intraclass diversities and subtle inter-class differences. Existing FGVC methods usually select discriminant regions…

Computer Vision and Pattern Recognition · Computer Science 2022-04-22 Yu Wang , Shuo Ye , Shujian Yu , Xinge You

As fine-grained visual classification (FGVC) being developed for decades, great works related have exposed a key direction -- finding discriminative local regions and revealing subtle differences. However, unlike identifying visual contents…

Computer Vision and Pattern Recognition · Computer Science 2022-06-03 Ruoyi Du , Wenqing Yu , Heqing Wang , Dongliang Chang , Ting-En Lin , Yongbin Li , Zhanyu Ma

Visual-Language Models (VLMs) have become a powerful tool for bridging the gap between visual and linguistic understanding. However, the conventional learning approaches for VLMs often suffer from limitations, such as the high resource…

Computation and Language · Computer Science 2025-04-01 Dasol Choi , Guijin Son , Soo Yong Kim , Gio Paik , Seunghyeok Hong

Fine-grained categorization can benefit from part-based features which reveal subtle visual differences between object categories. Handcrafted features have been widely used for part detection and classification. Although a recent trend…

Computer Vision and Pattern Recognition · Computer Science 2017-06-23 Ting Sun , Lin Sun , Dit-Yan Yeung

Fine-grained image classification is to recognize hundreds of subcategories in each basic-level category. Existing methods employ discriminative localization to find the key distinctions among subcategories. However, they generally have two…

Computer Vision and Pattern Recognition · Computer Science 2018-05-08 Xiangteng He , Yuxin Peng , Junjie Zhao

Any entity in the visual world can be hierarchically grouped based on shared characteristics and mapped to fine-grained sub-categories. While Multi-modal Large Language Models (MLLMs) achieve strong performance on coarse-grained visual…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Hulingxiao He , Zijun Geng , Yuxin Peng

Fine-grained visual classification is a challenging task that recognizes the sub-classes belonging to the same meta-class. Large inter-class similarity and intra-class variance is the main challenge of this task. Most exiting methods try to…

Computer Vision and Pattern Recognition · Computer Science 2021-02-02 Dongliang Chang , Yixiao Zheng , Zhanyu Ma , Ruoyi Du , Kongming Liang

Fine-grained text-to-image retrieval aims to retrieve a fine-grained target image with a given text query. Existing methods typically assume that each training image is accurately depicted by its textual descriptions. However, textual…

Computer Vision and Pattern Recognition · Computer Science 2025-04-11 Zehong Ma , Hao Chen , Wei Zeng , Limin Su , Shiliang Zhang

Text contained in an image carries high-level semantics that can be exploited to achieve richer image understanding. In particular, the mere presence of text provides strong guiding content that should be employed to tackle a diversity of…

Computer Vision and Pattern Recognition · Computer Science 2020-01-15 Andres Mafla , Sounak Dey , Ali Furkan Biten , Lluis Gomez , Dimosthenis Karatzas

Recent research in self-supervised learning (SSL) has shown its capability in learning useful semantic representations from images for classification tasks. Through our work, we study the usefulness of SSL for Fine-Grained Visual…

Computer Vision and Pattern Recognition · Computer Science 2021-05-20 Muhammad Maaz , Hanoona Abdul Rasheed , Dhanalaxmi Gaddam

Fine-grained visual classification (FGVC) is much more challenging than traditional classification tasks due to the inherently subtle intra-class object variations. Recent works mainly tackle this problem by focusing on how to locate the…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Ruoyi Du , Dongliang Chang , Ayan Kumar Bhunia , Jiyang Xie , Zhanyu Ma , Yi-Zhe Song , Jun Guo
‹ Prev 1 2 3 10 Next ›