Related papers: ConceptVision: A Flexible Scene Classification Fra…

SegDiscover: Visual Concept Discovery via Unsupervised Semantic Segmentation

Visual concept discovery has long been deemed important to improve interpretability of neural networks, because a bank of semantically meaningful concepts would provide us with a starting point for building machine learning models that…

Computer Vision and Pattern Recognition · Computer Science 2022-04-26 Haiyang Huang , Zhi Chen , Cynthia Rudin

Interpretable Multi-View Clustering

Multi-view clustering has become a significant area of research, with numerous methods proposed over the past decades to enhance clustering accuracy. However, in many real-world applications, it is crucial to demonstrate a clear…

Machine Learning · Computer Science 2025-02-07 Mudi Jiang , Lianyu Hu , Zengyou He , Zhikui Chen

PerspectiveNet: Multi-View Perception for Dynamic Scene Understanding

Generating detailed descriptions from multiple cameras and viewpoints is challenging due to the complex and inconsistent nature of visual data. In this paper, we introduce PerspectiveNet, a lightweight yet efficient model for generating…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Vinh Nguyen

Scenarios: A New Representation for Complex Scene Understanding

The ability for computational agents to reason about the high-level content of real world scene images is important for many applications. Existing attempts at addressing the problem of complex scene understanding lack representational…

Computer Vision and Pattern Recognition · Computer Science 2018-02-20 Zachary A. Daniels , Dimitris N. Metaxas

Unified Perceptual Parsing for Scene Understanding

Humans recognize the visual world at multiple levels: we effortlessly categorize scenes and detect objects inside, while also identifying the textures and surfaces of the objects along with their different compositional parts. In this…

Computer Vision and Pattern Recognition · Computer Science 2018-07-27 Tete Xiao , Yingcheng Liu , Bolei Zhou , Yuning Jiang , Jian Sun

Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition

With the explosive growth of video data in real-world applications, a comprehensive representation of videos becomes increasingly important. In this paper, we address the problem of video scene recognition, whose goal is to learn a…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Xuzheng Yu , Chen Jiang , Wei Zhang , Tian Gan , Linlin Chao , Jianan Zhao , Yuan Cheng , Qingpei Guo , Wei Chu

A transition towards virtual representations of visual scenes

Visual scene understanding is a fundamental task in computer vision that aims to extract meaningful information from visual data. It traditionally involves disjoint and specialized algorithms for different tasks that are tailored for…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Américo Pereira , Pedro Carvalho , Luís Côrte-Real

CusConcept: Customized Visual Concept Decomposition with Diffusion Models

Enabling generative models to decompose visual concepts from a single image is a complex and challenging problem. In this paper, we study a new and challenging task, customized concept decomposition, wherein the objective is to leverage…

Computer Vision and Pattern Recognition · Computer Science 2024-10-02 Zhi Xu , Shaozhe Hao , Kai Han

Tagger: Deep Unsupervised Perceptual Grouping

We present a framework for efficient perceptual inference that explicitly reasons about the segmentation of its inputs and features. Rather than being trained for any specific segmentation, our framework learns the grouping process in an…

Computer Vision and Pattern Recognition · Computer Science 2016-11-29 Klaus Greff , Antti Rasmus , Mathias Berglund , Tele Hotloo Hao , Jürgen Schmidhuber , Harri Valpola

Knowledge-Aware Neuron Interpretation for Scene Classification

Although neural models have achieved remarkable performance, they still encounter doubts due to the intransparency. To this end, model prediction explanation is attracting more and more attentions. However, current methods rarely…

Computer Vision and Pattern Recognition · Computer Science 2024-01-30 Yong Guan , Freddy Lecue , Jiaoyan Chen , Ru Li , Jeff Z. Pan

Discovering Concepts in Learned Representations using Statistical Inference and Interactive Visualization

Concept discovery is one of the open problems in the interpretability literature that is important for bridging the gap between non-deep learning experts and model end-users. Among current formulations, concepts defines them by as a…

Machine Learning · Computer Science 2022-02-11 Adrianna Janik , Kris Sankaran

Classification with Conceptual Safeguards

We propose a new approach to promote safety in classification tasks with established concepts. Our approach -- called a conceptual safeguard -- acts as a verification layer for models that predict a target outcome by first predicting the…

Machine Learning · Computer Science 2024-11-08 Hailey Joren , Charles Marx , Berk Ustun

IntelliCap: Intelligent Guidance for Consistent View Sampling

Novel view synthesis from images, for example, with 3D Gaussian splatting, has made great progress. Rendering fidelity and speed are now ready even for demanding virtual reality applications. However, the problem of assisting humans in…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Ayaka Yasunaga , Hideo Saito , Dieter Schmalstieg , Shohei Mori

Dynamic Proposals for Efficient Object Detection

Object detection is a basic computer vision task to loccalize and categorize objects in a given image. Most state-of-the-art detection methods utilize a fixed number of proposals as an intermediate representation of object candidates, which…

Computer Vision and Pattern Recognition · Computer Science 2022-07-13 Yiming Cui , Linjie Yang , Ding Liu

Concept Lens: Visually Analyzing the Consistency of Semantic Manipulation in GANs

As applications of generative AI become mainstream, it is important to understand what generative models are capable of producing, and the extent to which one can predictably control their outputs. In this paper, we propose a visualization…

Human-Computer Interaction · Computer Science 2024-07-01 Sangwon Jeong , Mingwei Li , Matthew Berger , Shusen Liu

Dense Captioning with Joint Inference and Visual Context

Dense captioning is a newly emerging computer vision topic for understanding images with dense language descriptions. The goal is to densely detect visual concepts (e.g., objects, object parts, and interactions between them) from images,…

Computer Vision and Pattern Recognition · Computer Science 2017-08-09 Linjie Yang , Kevin Tang , Jianchao Yang , Li-Jia Li

Low-level Vision by Consensus in a Spatial Hierarchy of Regions

We introduce a multi-scale framework for low-level vision, where the goal is estimating physical scene values from image data---such as depth from stereo image pairs. The framework uses a dense, overlapping set of image regions at multiple…

Computer Vision and Pattern Recognition · Computer Science 2015-04-15 Ayan Chakrabarti , Ying Xiong , Steven J. Gortler , Todd Zickler

Improving Computer Vision Interpretability: Transparent Two-level Classification for Complex Scenes

Treating images as data has become increasingly popular in political science. While existing classifiers for images reach high levels of accuracy, it is difficult to systematically assess the visual features on which they base their…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Stefan Scholz , Nils B. Weidmann , Zachary C. Steinert-Threlkeld , Eda Keremoğlu , Bastian Goldlücke

Few-Shot Viewpoint Estimation

Viewpoint estimation for known categories of objects has been improved significantly thanks to deep networks and large datasets, but generalization to unknown categories is still very challenging. With an aim towards improving performance…

Computer Vision and Pattern Recognition · Computer Science 2019-08-02 Hung-Yu Tseng , Shalini De Mello , Jonathan Tremblay , Sifei Liu , Stan Birchfield , Ming-Hsuan Yang , Jan Kautz

PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples

We present PartComposer: a framework for part-level concept learning from single-image examples that enables text-to-image diffusion models to compose novel objects from meaningful components. Existing methods either struggle with…

Graphics · Computer Science 2025-09-16 Junyu Liu , R. Kenny Jones , Daniel Ritchie