Related papers: SemVecNet: Generalizable Vector Map Generation for…

ViT-BEVSeg: A Hierarchical Transformer Network for Monocular Birds-Eye-View Segmentation

Generating a detailed near-field perceptual model of the environment is an important and challenging problem in both self-driving vehicles and autonomous mobile robotics. A Bird Eye View (BEV) map, providing a panoptic representation, is a…

Computer Vision and Pattern Recognition · Computer Science 2022-06-01 Pramit Dutta , Ganesh Sistu , Senthil Yogamani , Edgar Galván , John McDonald

VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization

Bird's-eye-view (BEV) map layout estimation requires an accurate and full understanding of the semantics for the environmental elements around the ego car to make the results coherent and realistic. Due to the challenges posed by occlusion,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Yiwei Zhang , Jin Gao , Fudong Ge , Guan Luo , Bing Li , Zhaoxiang Zhang , Haibin Ling , Weiming Hu

MapDiffusion: Generative Diffusion for Vectorized Online HD Map Construction and Uncertainty Estimation in Autonomous Driving

Autonomous driving requires an understanding of the static environment from sensor data. Learned Bird's-Eye View (BEV) encoders are commonly used to fuse multiple inputs, and a vector decoder predicts a vectorized map representation from…

Computer Vision and Pattern Recognition · Computer Science 2025-07-30 Thomas Monninger , Zihan Zhang , Zhipeng Mo , Md Zafar Anwar , Steffen Staab , Sihao Ding

MV-Map: Offboard HD-Map Generation with Multi-view Consistency

While bird's-eye-view (BEV) perception models can be useful for building high-definition maps (HD-Maps) with less human labor, their results are often unreliable and demonstrate noticeable inconsistencies in the predicted HD-Maps from…

Computer Vision and Pattern Recognition · Computer Science 2023-10-10 Ziyang Xie , Ziqi Pang , Yu-Xiong Wang

AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction

Autonomous driving requires understanding infrastructure elements, such as lanes and crosswalks. To navigate safely, this understanding must be derived from sensor data in real-time and needs to be represented in vectorized form. Learned…

Computer Vision and Pattern Recognition · Computer Science 2025-12-04 Thomas Monninger , Md Zafar Anwar , Stanislaw Antol , Steffen Staab , Sihao Ding

VectorMapNet: End-to-end Vectorized HD Map Learning

Autonomous driving systems require High-Definition (HD) semantic maps to navigate around urban roads. Existing solutions approach the semantic mapping problem by offline manual annotation, which suffers from serious scalability issues.…

Computer Vision and Pattern Recognition · Computer Science 2023-06-27 Yicheng Liu , Tianyuan Yuan , Yue Wang , Yilun Wang , Hang Zhao

Generating Evidential BEV Maps in Continuous Driving Space

Safety is critical for autonomous driving, and one aspect of improving safety is to accurately capture the uncertainties of the perception system, especially knowing the unknown. Different from only providing deterministic or probabilistic…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Yunshuang Yuan , Hao Cheng , Michael Ying Yang , Monika Sester

LetsMap: Unsupervised Representation Learning for Semantic BEV Mapping

Semantic Bird's Eye View (BEV) maps offer a rich representation with strong occlusion reasoning for various decision making tasks in autonomous driving. However, most BEV mapping approaches employ a fully supervised learning paradigm that…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Nikhil Gosala , Kürsat Petek , B Ravi Kiran , Senthil Yogamani , Paulo Drews-Jr , Wolfram Burgard , Abhinav Valada

An Initial Study of Bird's-Eye View Generation for Autonomous Vehicles using Cross-View Transformers

Bird's-Eye View (BEV) maps provide a structured, top-down abstraction that is crucial for autonomous-driving perception. In this work, we employ Cross-View Transformers (CVT) for learning to map camera images to three BEV's channels - road,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Felipe Carlos dos Santos , Eric Aislan Antonelo , Gustavo Claudio Karl Couto

Enhancing Vectorized Map Perception with Historical Rasterized Maps

In autonomous driving, there is growing interest in end-to-end online vectorized map perception in bird's-eye-view (BEV) space, with an expectation that it could replace traditional high-cost offline high-definition (HD) maps. However, the…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Xiaoyu Zhang , Guangwei Liu , Zihao Liu , Ningyi Xu , Yunhui Liu , Ji Zhao

MapPrior: Bird's-Eye View Map Layout Estimation with Generative Models

Despite tremendous advancements in bird's-eye view (BEV) perception, existing models fall short in generating realistic and coherent semantic map layouts, and they fail to account for uncertainties arising from partial sensor information…

Computer Vision and Pattern Recognition · Computer Science 2023-08-25 Xiyue Zhu , Vlas Zyrianov , Zhijian Liu , Shenlong Wang

PathPainter: Transferring the Generalization Ability of Image Generation Models to Embodied Navigation

Bird's-eye-view (BEV) images have been widely demonstrated to provide valuable prior information for navigation. Given the global information provided by such views, two key challenges remain: how to fully exploit this information and how…

Robotics · Computer Science 2026-05-11 Yijin Wang , Yuru Tian , Xijie Huang , Weiqi Gai , Mo Zhu , Xin Zhou , Yuze Wu , Fei Gao

SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset

Bird's-eye view (BEV) perception has garnered significant attention in autonomous driving in recent years, in part because BEV representation facilitates multi-modal sensor fusion. BEV representation enables a variety of perception tasks…

Computer Vision and Pattern Recognition · Computer Science 2025-03-28 Goodarz Mehr , Azim Eskandarian

Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks

In this paper, we introduce Mask2Map, a novel end-to-end online HD map construction method designed for autonomous driving applications. Our approach focuses on predicting the class and ordered point set of map instances within a scene,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-13 Sehwan Choi , Jungho Kim , Hongjae Shin , Jun Won Choi

Learning to Generate Vectorized Maps at Intersections with Multiple Roadside Cameras

Vectorized maps are indispensable for precise navigation and the safe operation of autonomous vehicles. Traditional methods for constructing these maps fall into two categories: offline techniques, which rely on expensive, labor-intensive…

Computer Vision and Pattern Recognition · Computer Science 2025-07-14 Quanxin Zheng , Miao Fan , Shengtong Xu , Linghe Kong , Haoyi Xiong

MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning

In autonomous driving, high-definition (HD) maps and semantic maps in bird's-eye view (BEV) are essential for accurate localization, planning, and decision-making. This paper introduces an enhanced End-to-End model named MapFM for online…

Computer Vision and Pattern Recognition · Computer Science 2025-06-19 Leonid Ivanov , Vasily Yuryev , Dmitry Yudin

BEVMAPMATCH: Multimodal BEV Neural Map Matching for Robust Re-Localization of Autonomous Vehicles

Localization in GNSS-denied and GNSS-degraded environments is a challenge for the safe widespread deployment of autonomous vehicles. Such GNSS-challenged environments require alternative methods for robust localization. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2026-03-30 Shounak Sural , Ragunathan Rajkumar

BEVWorld: A Multimodal World Simulator for Autonomous Driving via Scene-Level BEV Latents

World models have attracted increasing attention in autonomous driving for their ability to forecast potential future scenarios. In this paper, we propose BEVWorld, a novel framework that transforms multimodal sensor inputs into a unified…

Computer Vision and Pattern Recognition · Computer Science 2025-05-01 Yumeng Zhang , Shi Gong , Kaixin Xiong , Xiaoqing Ye , Xiaofan Li , Xiao Tan , Fan Wang , Jizhou Huang , Hua Wu , Haifeng Wang

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe

Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending and drawing extensive attention both from industry and academia. Conventional approaches for most autonomous driving algorithms perform detection,…

Computer Vision and Pattern Recognition · Computer Science 2023-09-28 Hongyang Li , Chonghao Sima , Jifeng Dai , Wenhai Wang , Lewei Lu , Huijie Wang , Jia Zeng , Zhiqi Li , Jiazhi Yang , Hanming Deng , Hao Tian , Enze Xie , Jiangwei Xie , Li Chen , Tianyu Li , Yang Li , Yulu Gao , Xiaosong Jia , Si Liu , Jianping Shi , Dahua Lin , Yu Qiao

Street-View Image Generation from a Bird's-Eye View Layout

Bird's-Eye View (BEV) Perception has received increasing attention in recent years as it provides a concise and unified spatial representation across views and benefits a diverse set of downstream driving applications. At the same time,…

Computer Vision and Pattern Recognition · Computer Science 2024-02-14 Alexander Swerdlow , Runsheng Xu , Bolei Zhou