Related papers: Generative Modeling for Small-Data Object Detectio…

Transfer learning with generative models for object detection on limited datasets

The availability of data is limited in some fields, especially for object detection tasks, where it is necessary to have correctly labeled bounding boxes around each object. A notable example of such data scarcity is found in the domain of…

Computer Vision and Pattern Recognition · Computer Science · 2024-09-11 · Matteo Paiano, Stefano Martina, Carlotta Giannelli, Filippo Caruso

Oriented Bounding Boxes for Small and Freely Rotated Objects

A novel object detection method is presented that handles freely rotated objects of arbitrary sizes, including tiny objects as small as $2\times 2$ pixels. Such tiny objects appear frequently in remotely sensed images, and present a…

Computer Vision and Pattern Recognition · Computer Science · 2022-02-16 · Mohsen Zand, Ali Etemad, Michael Greenspan

Learning Scene-specific Object Detectors Based on a Generative-Discriminative Model with Minimal Supervision

One object class may show large variations due to diverse illuminations, backgrounds and camera viewpoints. Traditional object detection methods often perform worse under unconstrained video environments. To address this problem, many…

Computer Vision and Pattern Recognition · Computer Science · 2018-03-14 · Dapeng Luo, Zhipeng Zeng, Nong Sang, Xiang Wu, Longsheng Wei, Quanzheng Mou, Jun Cheng, Chen Luo

Open-Vocabulary Object Detection Using Captions

Despite the remarkable accuracy of deep neural networks in object detection, they are costly to train and scale due to supervision requirements. Particularly, learning more object categories typically requires proportionally more bounding…

Computer Vision and Pattern Recognition · Computer Science · 2021-03-16 · Alireza Zareian, Kevin Dela Rosa, Derek Hao Hu, Shih-Fu Chang

Active learning with version spaces for object detection

Given an image, we would like to learn to detect objects belonging to particular object categories. Common object detection methods train on large annotated datasets which are annotated in terms of bounding boxes that contain the object of…

Computer Vision and Pattern Recognition · Computer Science · 2016-11-30 · Soumya Roy, Vinay P. Namboodiri, Arijit Biswas

Perceptual Generative Adversarial Networks for Small Object Detection

Detecting small objects is notoriously challenging due to their low resolution and noisy representation. Existing object detection pipelines usually detect small objects through learning representations of all the objects at multiple…

Computer Vision and Pattern Recognition · Computer Science · 2017-06-21 · Jianan Li, Xiaodan Liang, Yunchao Wei, Tingfa Xu, Jiashi Feng, Shuicheng Yan

Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection

Generative image models are increasingly being used for training data augmentation in vision tasks. In the context of automotive object detection, methods usually focus on producing augmented frames that look as realistic as possible, for…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-25 · Jens Petersen, Davide Abati, Amirhossein Habibian, Auke Wiggers

Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes

Semantic segmentation of medical images is pivotal in applications like disease diagnosis and treatment planning. While deep learning has excelled in automating this task, a major hurdle is the need for numerous annotated segmentation…

Image and Video Processing · Electrical Eng. & Systems · 2024-09-02 · Li Zhang, Basu Jindal, Ahmed Alaa, Robert Weinreb, David Wilson, Eran Segal, James Zou, Pengtao Xie

Generative Shape Models: Joint Text Recognition and Segmentation with Very Little Training Data

We demonstrate that a generative model for object shapes can achieve state of the art results on challenging scene text recognition tasks, and with orders of magnitude fewer training images than required for competing discriminative…

Computer Vision and Pattern Recognition · Computer Science · 2016-11-10 · Xinghua Lou, Ken Kansky, Wolfgang Lehrach, CC Laan, Bhaskara Marthi, D. Scott Phoenix, Dileep George

Learning to detect and localize many objects from few examples

The current trend in object detection and localization is to learn predictions with high capacity deep neural networks trained on a very large amount of annotated data and using a high amount of processing power. In this work, we propose a…

Computer Vision and Pattern Recognition · Computer Science · 2016-11-18 · Bastien Moysset, Christoper Kermorvant, Christian Wolf

Adding New Categories in Object Detection Using Few-Shot Copy-Paste

Developing data-efficient instance detection models that can handle rare object categories remains a key challenge in computer vision. However, existing research often overlooks data collection strategies and evaluation metrics tailored to…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-15 · Boyang Deng, Meiyan Lin, Shoulun Long

Imagining the Unseen: Generative Location Modeling for Object Placement

Location modeling, or determining where non-existing objects could feasibly appear in a scene, has the potential to benefit numerous computer vision tasks, from automatic object insertion to scene creation in virtual reality. Yet, this…

Computer Vision and Pattern Recognition · Computer Science · 2025-10-08 · Jooyeol Yun, Davide Abati, Mohamed Omran, Jaegul Choo, Amirhossein Habibian, Auke Wiggers

Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models

Accurately controlling object count in text-to-image generation remains a key challenge. Supervised methods often fail, as training data rarely covers all count variations. Methods that manipulate the denoising process to add or remove…

Computer Vision and Pattern Recognition · Computer Science · 2025-06-06 · Oz Zafar, Yuval Cohen, Lior Wolf, Idan Schwartz

Knowledge Combination to Learn Rotated Detection Without Rotated Annotation

Rotated bounding boxes drastically reduce output ambiguity of elongated objects, making it superior to axis-aligned bounding boxes. Despite the effectiveness, rotated detectors are not widely employed. Annotating rotated bounding boxes is…

Computer Vision and Pattern Recognition · Computer Science · 2023-05-05 · Tianyu Zhu, Bryce Ferenczi, Pulak Purkait, Tom Drummond, Hamid Rezatofighi, Anton van den Hengel

Generative Residual Attention Network for Disease Detection

Accurate identification and localization of abnormalities from radiology images serve as a critical role in computer-aided diagnosis (CAD) systems. Building a highly generalizable system usually requires a large amount of data with…

Image and Video Processing · Electrical Eng. & Systems · 2021-10-26 · Euyoung Kim, Soochahn Lee, Kyoung Mu Lee

Generating Masks from Boxes by Mining Spatio-Temporal Consistencies in Videos

Segmenting objects in videos is a fundamental computer vision task. The current deep learning based paradigm offers a powerful, but data-hungry solution. However, current datasets are limited by the cost and human effort of annotating…

Computer Vision and Pattern Recognition · Computer Science · 2021-01-07 · Bin Zhao, Goutam Bhat, Martin Danelljan, Luc Van Gool, Radu Timofte

Segmentation-Based Bounding Box Generation for Omnidirectional Pedestrian Detection

We propose a segmentation-based bounding box generation method for omnidirectional pedestrian detection that enables detectors to tightly fit bounding boxes to pedestrians without omnidirectional images for training. Due to the wide angle…

Computer Vision and Pattern Recognition · Computer Science · 2023-06-06 · Masato Tamura, Tomoaki Yoshinaga

GenDet: Painting Colored Bounding Boxes on Images via Diffusion Model for Object Detection

This paper presents GenDet, a novel framework that redefines object detection as an image generation task. In contrast to traditional approaches, GenDet adopts a pioneering approach by leveraging generative modeling: it conditions on the…

Computer Vision and Pattern Recognition · Computer Science · 2026-01-13 · Chen Min, Chengyang Li, Fanjie Kong, Qi Zhu, Dawei Zhao, Liang Xiao

Assisting Scene Graph Generation with Self-Supervision

Research in scene graph generation has quickly gained traction in the past few years because of its potential to help in downstream tasks like visual question answering, image captioning, etc. Many interesting approaches have been proposed…

Computer Vision and Pattern Recognition · Computer Science · 2020-08-11 · Sandeep Inuganti, Vineeth N Balasubramanian

Weakly Supervised Object Detection with Segmentation Collaboration

Weakly supervised object detection aims at learning precise object detectors, given image category labels. In recent prevailing works, this problem is generally formulated as a multiple instance learning module guided by an image…

Computer Vision and Pattern Recognition · Computer Science · 2019-04-02 · Xiaoyan Li, Meina Kan, Shiguang Shan, Xilin Chen