Related papers: Exploring Self-Attention for Visual Intersection C…

Use of First and Third Person Views for Deep Intersection Classification

We explore the problem of intersection classification using monocular on-board passive vision, with the goal of classifying traffic scenes with respect to road topology. We divide the existing approaches into two broad categories according…

Computer Vision and Pattern Recognition · Computer Science 2019-01-23 Koji Takeda, Kanji Tanaka

Unified Local and Global Attention Interaction Modeling for Vision Transformers

We present a novel method that extends the self-attention mechanism of a vision transformer (ViT) for more accurate object detection across diverse datasets. ViTs show strong capability for image understanding tasks such as object…

Computer Vision and Pattern Recognition · Computer Science 2024-12-30 Tan Nguyen, Coy D. Heldermon, Corey Toler-Franklin

SA-Det3D: Self-Attention Based Context-Aware 3D Object Detection

Existing point-cloud based 3D object detectors use convolution-like operators to process information in a local neighbourhood with fixed-weight kernels and aggregate global context hierarchically. However, non-local neural networks and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-23 Prarthana Bhattacharyya, Chengjie Huang, Krzysztof Czarnecki

Traffic Scene Small Target Detection Method Based on YOLOv8n-SPTS Model for Autonomous Driving

This paper focuses on the key issue in autonomous driving: small target recognition in dynamic perception. Existing algorithms suffer from poor detection performance due to missing small target information, scale imbalance, and occlusion.…

Computer Vision and Pattern Recognition · Computer Science 2025-12-11 Songhan Wu

Efficient Vision Transformer for Accurate Traffic Sign Detection

This research paper addresses the challenges associated with traffic sign detection in self-driving vehicles and driver assistance systems. The development of reliable and highly accurate algorithms is crucial for the widespread adoption of…

Computer Vision and Pattern Recognition · Computer Science 2023-11-03 Javad Mirzapour Kaleybar, Hooman Khaloo, Avaz Naghipour

Model Guided Road Intersection Classification

Understanding complex scenarios from in-vehicle cameras is essential for safely operating autonomous driving systems in densely populated areas. Among these, intersection areas are one of the most critical as they concentrate a considerable…

Computer Vision and Pattern Recognition · Computer Science 2021-11-25 Augusto Luis Ballardini, Álvaro Hernández, Miguel Ángel Sotelo

Deep Learning with Attention Mechanism for Predicting Driver Intention at Intersection

In this paper, a driver's intention prediction near a road intersection is proposed. Our approach uses a deep bidirectional Long Short-Term Memory (LSTM) with an attention mechanism model based on a hybrid-state system (HSS) framework. As…

Computer Vision and Pattern Recognition · Computer Science 2020-06-11 Abenezer Girma, Seifemichael Amsalu, Abrham Workineh, Mubbashar Khan, Abdollah Homaifar

Attention-based Multimodal Feature Representation Model for Micro-video Recommendation

In recommender systems, models mostly use a combination of embedding layers and multilayer feedforward neural networks. The high-dimensional sparse original features are downscaled in the embedding layer and then fed into the fully…

Information Retrieval · Computer Science 2022-05-19 Mohan Hasama, Jing Li

Drawing Attention to Detail: Pose Alignment through Self-Attention for Fine-Grained Object Classification

Intra-class variations in the open world lead to various challenges in classification tasks. To overcome these challenges, fine-grained classification was introduced, and many approaches were proposed. Some rely on locating and using…

Computer Vision and Pattern Recognition · Computer Science 2023-02-10 Salwa Al Khatib, Mohamed El Amine Boudjoghra, Jameel Hassan

Assessing the Impact of Attention and Self-Attention Mechanisms on the Classification of Skin Lesions

Attention mechanisms have raised significant interest in the research community, since they promise significant improvements in the performance of neural network architectures. However, in any specific problem, we still lack a principled…

Computer Vision and Pattern Recognition · Computer Science 2021-12-24 Rafael Pedro, Arlindo L. Oliveira

Probabilistic Multi-modal Trajectory Prediction with Lane Attention for Autonomous Vehicles

Trajectory prediction is crucial for autonomous vehicles. The planning system not only needs to know the current state of the surrounding objects but also their possible states in the future. As for vehicles, their trajectories are…

Robotics · Computer Science 2020-07-07 Chenxu Luo, Lin Sun, Dariush Dabiri, Alan Yuille

Self-Attention-Based Contextual Modulation Improves Neural System Identification

Convolutional neural networks (CNNs) have been shown to be state-of-the-art models for visual cortical neurons. Cortical neurons in the primary visual cortex are sensitive to contextual information mediated by extensive horizontal and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Isaac Lin, Tianye Wang, Shang Gao, Shiming Tang, Tai Sing Lee

Attention-Mechanism-based Tracking Method for Intelligent Internet of Vehicles

Vehicle tracking task plays an important role on the internet of vehicles and intelligent transportation system. Beyond the traditional GPS sensor, the image sensor can capture different kinds of vehicles, analyze their driving situation…

Computer Vision and Pattern Recognition · Computer Science 2018-11-08 Xu Kang, Bin Song, Jie Guo, Xiaojiang Du, Mohsen Guizani

Focal Self-attention for Local-Global Interactions in Vision Transformers

Recently, Vision Transformer and its variants have shown great promise on various computer vision tasks. The ability of capturing short- and long-range visual dependencies through self-attention is arguably the main source for the success.…

Computer Vision and Pattern Recognition · Computer Science 2021-07-02 Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Xiyang Dai, Bin Xiao, Lu Yuan, Jianfeng Gao

Explicitly Modeled Attention Maps for Image Classification

Self-attention networks have shown remarkable progress in computer vision tasks such as image classification. The main benefit of the self-attention mechanism is the ability to capture long-range feature interactions in attention-maps.…

Computer Vision and Pattern Recognition · Computer Science 2021-03-19 Andong Tan, Duc Tam Nguyen, Maximilian Dax, Matthias Nießner, Thomas Brox

TransVPR: Transformer-based place recognition with multi-level attention aggregation

Visual place recognition is a challenging task for applications such as autonomous driving navigation and mobile robot localization. Distracting elements presenting in complex scenes often lead to deviations in the perception of visual…

Computer Vision and Pattern Recognition · Computer Science 2022-04-14 Ruotong Wang, Yanqing Shen, Weiliang Zuo, Sanping Zhou, Nanning Zheng

Learning to Navigate Intersections with Unsupervised Driver Trait Inference

Navigation through uncontrolled intersections is one of the key challenges for autonomous vehicles. Identifying the subtle differences in hidden traits of other drivers can bring significant benefits when navigating in such environments. We…

Robotics · Computer Science 2022-03-02 Shuijing Liu, Peixin Chang, Haonan Chen, Neeloy Chakraborty, Katherine Driggs-Campbell

Exploring Contextual Representation and Multi-Modality for End-to-End Autonomous Driving

Learning contextual and spatial environmental representations enhances autonomous vehicle's hazard anticipation and decision-making in complex scenarios. Recent perception systems enhance spatial understanding with sensor fusion but often…

Robotics · Computer Science 2024-01-18 Shoaib Azam, Farzeen Munir, Ville Kyrki, Moongu Jeon, Witold Pedrycz

Attention Augmented Convolutional Networks

Convolutional networks have been the paradigm of choice in many computer vision applications. The convolution operation however has a significant weakness in that it only operates on a local neighborhood, thus missing global information.…

Computer Vision and Pattern Recognition · Computer Science 2020-09-11 Irwan Bello, Barret Zoph, Ashish Vaswani, Jonathon Shlens, Quoc V. Le

Joint Perception and Prediction for Autonomous Driving: A Survey

Perception and prediction modules are critical components of autonomous driving systems, enabling vehicles to navigate safely through complex environments. The perception module is responsible for perceiving the environment, including…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Lucas Dal'Col, Miguel Oliveira, Vítor Santos