Related papers: ODAM: Gradient-based instance-specific visual expl…

ODAM: Object Detection, Association, and Mapping using Posed RGB Video

Localizing objects and estimating their extent in 3D is an important step towards high-level 3D scene understanding, which has many applications in Augmented Reality and Robotics. We present ODAM, a system for 3D Object Detection,…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Kejie Li , Daniel DeTone , Steven Chen , Minh Vo , Ian Reid , Hamid Rezatofighi , Chris Sweeney , Julian Straub , Richard Newcombe

GAM: Explainable Visual Similarity and Classification via Gradient Activation Maps

We present Gradient Activation Maps (GAM) - a machinery for explaining predictions made by visual similarity and classification models. By gleaning localized gradient and activation information from multiple network layers, GAM offers…

Computer Vision and Pattern Recognition · Computer Science 2021-09-03 Oren Barkan , Omri Armstrong , Amir Hertz , Avi Caciularu , Ori Katz , Itzik Malkiel , Noam Koenigstein

OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving

To enhance autonomous driving safety in complex scenarios, various methods have been proposed to simulate LiDAR point cloud data. Nevertheless, these methods often face challenges in producing high-quality, diverse, and controllable…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Tianyi Yan , Junbo Yin , Xianpeng Lang , Ruigang Yang , Cheng-Zhong Xu , Jianbing Shen

Class-Discriminative Attention Maps for Vision Transformers

Importance estimators are explainability methods that quantify feature importance for deep neural networks (DNN). In vision transformers (ViT), the self-attention mechanism naturally leads to attention maps, which are sometimes interpreted…

Computer Vision and Pattern Recognition · Computer Science 2024-10-28 Lennart Brocki , Jakub Binda , Neo Christopher Chung

FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors

LiDAR-based 3D object detection has made impressive progress recently, yet most existing models are black-box, lacking interpretability. Previous explanation approaches primarily focus on analyzing image-based models and are not readily…

Computer Vision and Pattern Recognition · Computer Science 2024-05-22 Shuai Liu , Boyang Li , Zhiyu Fang , Mingyue Cui , Kai Huang

Object Instance Mining for Weakly Supervised Object Detection

Weakly supervised object detection (WSOD) using only image-level annotations has attracted growing attention over the past few years. Existing approaches using multiple instance learning easily fall into local optima, because such mechanism…

Computer Vision and Pattern Recognition · Computer Science 2020-02-05 Chenhao Lin , Siwen Wang , Dongqi Xu , Yu Lu , Wayne Zhang

Fusion-CAM: Integrating Gradient and Region-Based Class Activation Maps for Robust Visual Explanations

Interpreting the decision-making process of deep convolutional neural networks remains a central challenge in achieving trustworthy and transparent artificial intelligence. Explainable AI (XAI) techniques, particularly Class Activation Map…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Hajar Dekdegue , Moncef Garouani , Josiane Mothe , Jordan Bernigaud

Explaining YOLO: Leveraging Grad-CAM to Explain Object Detections

We investigate the problem of explainability for visual object detectors. Specifically, we demonstrate on the example of the YOLO object detector how to integrate Grad-CAM into the model architecture and analyze the results. We show how to…

Computer Vision and Pattern Recognition · Computer Science 2022-11-23 Armin Kirchknopf , Djordje Slijepcevic , Ilkay Wunderlich , Michael Breiter , Johannes Traxler , Matthias Zeppelzauer

Cyclic Graph Attentive Match Encoder (CGAME): A Novel Neural Network For OD Estimation

Origin-Destination Estimation plays an important role in the era of Intelligent Transportation. Nevertheless, as a under-determined problem, OD estimation confronts many challenges from cross-space inference to non-convex, non-linear…

Machine Learning · Computer Science 2022-08-24 Guanzhou Li , Yujing He , Jianping Wu , Duowei Li

Dynamic Object Aware LiDAR SLAM based on Automatic Generation of Training Data

Highly dynamic environments, with moving objects such as cars or humans, can pose a performance challenge for LiDAR SLAM systems that assume largely static scenes. To overcome this challenge and support the deployment of robots in real…

Robotics · Computer Science 2022-01-14 Patrick Pfreundschuh , Hubertus Franciscus Cornelis Hendrikx , Victor Reijgwart , Renaud Dubé , Roland Siegwart , Andrei Cramariuc

Decom--CAM: Tell Me What You See, In Details! Feature-Level Interpretation via Decomposition Class Activation Map

Interpretation of deep learning remains a very challenging problem. Although the Class Activation Map (CAM) is widely used to interpret deep model predictions by highlighting object location, it fails to provide insight into the salient…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Yuguang Yang , Runtang Guo , Sheng Wu , Yimi Wang , Juan Zhang , Xuan Gong , Baochang Zhang

Revisiting Out-of-Distribution Detection in LiDAR-based 3D Object Detection

LiDAR-based 3D object detection has become an essential part of automated driving due to its ability to localize and classify objects precisely in 3D. However, object detectors face a critical challenge when dealing with unknown foreground…

Computer Vision and Pattern Recognition · Computer Science 2024-04-25 Michael Kösel , Marcel Schreiber , Michael Ulrich , Claudius Gläser , Klaus Dietmayer

ALOOD: Exploiting Language Representations for LiDAR-based Out-of-Distribution Object Detection

LiDAR-based 3D object detection plays a critical role for reliable and safe autonomous driving systems. However, existing detectors often produce overly confident predictions for objects not belonging to known categories, posing significant…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Michael Kösel , Marcel Schreiber , Michael Ulrich , Claudius Gläser , Klaus Dietmayer

Detecting Out-of-Distribution Objects through Class-Conditioned Inpainting

Recent object detectors have achieved impressive accuracy in identifying objects seen during training. However, real-world deployment often introduces novel and unexpected objects, referred to as out-of-distribution (OOD) objects, posing…

Machine Learning · Computer Science 2025-11-20 Quang-Huy Nguyen , Jin Peng Zhou , Zhenzhen Liu , Khanh-Huyen Bui , Kilian Q. Weinberger , Wei-Lun Chao , Dung D. Le

Opti-CAM: Optimizing saliency maps for interpretability

Methods based on class activation maps (CAM) provide a simple mechanism to interpret predictions of convolutional neural networks by using linear combinations of feature maps as saliency maps. By contrast, masking-based methods optimize a…

Computer Vision and Pattern Recognition · Computer Science 2024-04-08 Hanwei Zhang , Felipe Torres , Ronan Sicre , Yannis Avrithis , Stephane Ayache

OccAM's Laser: Occlusion-based Attribution Maps for 3D Object Detectors on LiDAR Data

While 3D object detection in LiDAR point clouds is well-established in academia and industry, the explainability of these models is a largely unexplored field. In this paper, we propose a method to generate attribution maps for the detected…

Computer Vision and Pattern Recognition · Computer Science 2022-04-15 David Schinagl , Georg Krispel , Horst Possegger , Peter M. Roth , Horst Bischof

ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations

Object detection models typically rely on predefined categories, limiting their ability to identify novel objects in open-world scenarios. To overcome this constraint, we introduce ADAM: Autonomous Discovery and Annotation Model, a…

Computer Vision and Pattern Recognition · Computer Science 2025-06-11 Amirreza Rouhi , Solmaz Arezoomandan , Knut Peterson , Joseph T. Woods , David K. Han

Generalizing Adversarial Explanations with Grad-CAM

Gradient-weighted Class Activation Mapping (Grad- CAM), is an example-based explanation method that provides a gradient activation heat map as an explanation for Convolution Neural Network (CNN) models. The drawback of this method is that…

Computer Vision and Pattern Recognition · Computer Science 2022-04-13 Tanmay Chakraborty , Utkarsh Trehan , Khawla Mallat , Jean-Luc Dugelay

OAT: Object-Level Attention Transformer for Gaze Scanpath Prediction

Visual search is important in our daily life. The efficient allocation of visual attention is critical to effectively complete visual search tasks. Prior research has predominantly modelled the spatial allocation of visual attention in…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Yini Fang , Jingling Yu , Haozheng Zhang , Ralf van der Lans , Bertram Shi

Object-ABN: Learning to Generate Sharp Attention Maps for Action Recognition

In this paper we propose an extension of the Attention Branch Network (ABN) by using instance segmentation for generating sharper attention maps for action recognition. Methods for visual explanation such as Grad-CAM usually generate blurry…

Computer Vision and Pattern Recognition · Computer Science 2023-03-22 Tomoya Nitta , Tsubasa Hirakawa , Hironobu Fujiyoshi , Toru Tamaki