Related papers: Learning Robust Object Recognition Using Composed …

Improving Classification of Occluded Objects through Scene Context

The presence of occlusions has provided substantial challenges to typically powerful object recognition algorithms. Additional sources of information can be extremely valuable to reduce errors caused by occlusions. Scene context is known to…

Computer Vision and Pattern Recognition · Computer Science 2025-10-31 Courtney M. King, Daniel D. Leeds, Damian Lyons, George Kalaitzis

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion

Analyzing complex scenes with Deep Neural Networks is a challenging task, particularly when images contain multiple objects that partially occlude each other. Existing approaches to image analysis mostly process objects independently and do…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Xiaoding Yuan, Adam Kortylewski, Yihong Sun, Alan Yuille

Compositional Scene Representation Learning via Reconstruction: A Survey

Visual scenes are composed of visual concepts and have the property of combinatorial explosion. An important reason for humans to efficiently learn from diverse visual scenes is the ability of compositional perception, and it is desirable…

Machine Learning · Computer Science 2023-06-16 Jinyang Yuan, Tonglin Chen, Bin Li, Xiangyang Xue

Synthesizing Training Data for Object Detection in Indoor Scenes

Detection of objects in cluttered indoor environments is one of the key enabling functionalities for service robots. The best performing object detection approaches in computer vision exploit deep Convolutional Neural Networks (CNN) to…

Computer Vision and Pattern Recognition · Computer Science 2017-09-11 Georgios Georgakis, Arsalan Mousavian, Alexander C. Berg, Jana Kosecka

Classification of Occluded Objects using Fast Recurrent Processing

Recurrent neural networks are powerful tools for handling incomplete data problems in computer vision, thanks to their significant generative capabilities. However, the computational demand for these algorithms is too high to work in real…

Computer Vision and Pattern Recognition · Computer Science 2015-05-07 Ozgur Yilmaz

Combining Commonsense Reasoning and Knowledge Acquisition to Guide Deep Learning in Robotics

Algorithms based on deep network models are being used for many pattern recognition and decision-making tasks in robotics and AI. Training these models requires a large labeled dataset and considerable computational resources, which are not…

Artificial Intelligence · Computer Science 2022-01-26 Mohan Sridharan, Tiago Mota

Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints

Visual scenes are extremely rich in diversity, not only because there are infinite combinations of objects and background, but also because the observations of the same scene may vary greatly with the change of viewpoints. When observing a…

Computer Vision and Pattern Recognition · Computer Science 2021-12-14 Jinyang Yuan, Bin Li, Xiangyang Xue

Robustness of Object Recognition under Extreme Occlusion in Humans and Computational Models

Most objects in the visual world are partially occluded, but humans can recognize them without difficulty. However, it remains unknown whether object recognition models like convolutional neural networks (CNNs) can handle real-world…

Computer Vision and Pattern Recognition · Computer Science 2019-06-05 Hongru Zhu, Peng Tang, Jeongho Park, Soojin Park, Alan Yuille

Recurrent Feedback Improves Recognition of Partially Occluded Objects

Recurrent connectivity in the visual cortex is believed to aid object recognition for challenging conditions such as occlusion. Here we investigate if and how artificial neural networks also benefit from recurrence. We compare architectures…

Computer Vision and Pattern Recognition · Computer Science 2021-04-22 Markus Roland Ernst, Jochen Triesch, Thomas Burwick

Scene Synthesis via Uncertainty-Driven Attribute Synchronization

Developing deep neural networks to generate 3D scenes is a fundamental problem in neural synthesis with immediate applications in architectural CAD, computer graphics, as well as in generating virtual robot training environments. This task…

Computer Vision and Pattern Recognition · Computer Science 2021-09-02 Haitao Yang, Zaiwei Zhang, Siming Yan, Haibin Huang, Chongyang Ma, Yi Zheng, Chandrajit Bajaj, Qixing Huang

Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition under Occlusion

Computer vision systems in real-world applications need to be robust to partial occlusion while also being explainable. In this work, we show that black-box deep convolutional neural networks (DCNNs) have only limited robustness to partial…

Computer Vision and Pattern Recognition · Computer Science 2020-06-30 Adam Kortylewski, Qing Liu, Angtian Wang, Yihong Sun, Alan Yuille

GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields

Deep generative models allow for photorealistic image synthesis at high resolutions. But for many applications, this is not enough: content creation also needs to be controllable. While several recent works investigate how to disentangle…

Computer Vision and Pattern Recognition · Computer Science 2021-04-30 Michael Niemeyer, Andreas Geiger

Biologically Inspired Visual System Architecture for Object Recognition in Autonomous Systems

Findings in recent years on the sensitivity of convolutional neural networks to additive noise, light conditions and to the wholeness of the training dataset, indicate that this technology still lacks the robustness needed for the…

Image and Video Processing · Electrical Eng. & Systems 2020-07-23 Dan Malowany, Hugo Guterman

Recurrent Connectivity Aids Recognition of Partly Occluded Objects

Feedforward convolutional neural networks are the prevalent model of core object recognition. For challenging conditions, such as occlusion, neuroscientists believe that the recurrent connectivity in the visual cortex aids object…

Computer Vision and Pattern Recognition · Computer Science 2019-09-16 Markus Roland Ernst, Jochen Triesch, Thomas Burwick

Investigating Object Compositionality in Generative Adversarial Networks

Deep generative models seek to recover the process with which the observed data was generated. They may be used to synthesize new samples or to subsequently extract representations. Successful approaches in the domain of images are driven…

Computer Vision and Pattern Recognition · Computer Science 2020-07-27 Sjoerd van Steenkiste, Karol Kurach, Jürgen Schmidhuber, Sylvain Gelly

Compositional Scene Understanding through Inverse Generative Modeling

Generative models have demonstrated remarkable abilities in generating high-fidelity visual content. In this work, we explore how generative models can further be used not only to synthesize visual content but also to understand the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Yanbo Wang, Justin Dauwels, Yilun Du

A Probabilistic Theory of Deep Learning

A grand challenge in machine learning is the development of computational algorithms that match or outperform humans in perceptual inference tasks that are complicated by nuisance variation. For instance, visual object recognition involves…

Machine Learning · Statistics 2015-04-03 Ankit B. Patel, Tan Nguyen, Richard G. Baraniuk

Synthetic Data Generation and Adaption for Object Detection in Smart Vending Machines

This paper presents an improved scheme for the generation and adaption of synthetic images for the training of deep Convolutional Neural Networks (CNNs) to perform the object detection task in smart vending machines. While generating…

Computer Vision and Pattern Recognition · Computer Science 2019-04-30 Kai Wang, Fuyuan Shi, Wenqi Wang, Yibing Nan, Shiguo Lian

Compositional Scene Modeling with Global Object-Centric Representations

The appearance of the same object may vary in different scene images due to perspectives and occlusions between objects. Humans can easily identify the same object, even if occlusions exist, by completing the occluded parts based on its…

Computer Vision and Pattern Recognition · Computer Science 2022-11-28 Tonglin Chen, Bin Li, Zhimeng Shen, Xiangyang Xue

Scene Text Synthesis for Efficient and Effective Deep Network Training

A large amount of annotated training images is critical for training accurate and robust deep network models but the collection of a large amount of annotated training images is often time-consuming and costly. Image synthesis alleviates…

Computer Vision and Pattern Recognition · Computer Science 2023-04-25 Changgong Zhang, Fangneng Zhan, Hongyuan Zhu, Shijian Lu