English
Related papers

Related papers: Masked Autoencoders for Point Cloud Self-supervise…

200 papers

Masked autoencoding has achieved great success for self-supervised learning in the image and language domains. However, mask based pretraining has yet to show benefits for point cloud understanding, likely due to standard backbones like…

Computer Vision and Pattern Recognition · Computer Science 2022-08-02 Haotian Liu , Mu Cai , Yong Jae Lee

Transformer-based Self-supervised Representation Learning methods learn generic features from unlabeled datasets for providing useful network initialization parameters for downstream tasks. Recently, self-supervised learning based upon…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Jincen Jiang , Xuequan Lu , Lizhi Zhao , Richard Dazeley , Meili Wang

Point cloud understanding aims to acquire robust and general feature representations from unlabeled data. Masked point modeling-based methods have recently shown significant performance across various downstream tasks. These pre-training…

Computer Vision and Pattern Recognition · Computer Science 2025-06-30 Yixin Zha , Chuxin Wang , Wenfei Yang , Tianzhu Zhang

Masked autoencoder has been widely explored in point cloud self-supervised learning, whereby the point cloud is generally divided into visible and masked parts. These methods typically include an encoder accepting visible patches…

Computer Vision and Pattern Recognition · Computer Science 2024-10-25 Xiangdong Zhang , Shaofeng Zhang , Junchi Yan

Masked auto-encoding is a popular and effective self-supervised learning approach to point cloud learning. However, most of the existing methods reconstruct only the masked points and overlook the local geometry information, which is also…

Computer Vision and Pattern Recognition · Computer Science 2022-07-08 Yabin Zhang , Jiehong Lin , Chenhang He , Yongwei Chen , Kui Jia , Lei Zhang

This paper tries to address a fundamental question in point cloud self-supervised learning: what is a good signal we should leverage to learn features from point clouds without annotations? To answer that, we introduce a point cloud…

Computer Vision and Pattern Recognition · Computer Science 2023-05-16 Xiaoyu Tian , Haoxi Ran , Yue Wang , Hang Zhao

The manual annotation for large-scale point clouds is still tedious and unavailable for many harsh real-world tasks. Self-supervised learning, which is used on raw and unlabeled data to pre-train deep neural networks, is a promising…

Computer Vision and Pattern Recognition · Computer Science 2022-10-31 Junsheng Zhou , Xin Wen , Baorui Ma , Yu-Shen Liu , Yue Gao , Yi Fang , Zhizhong Han

Masked autoencoding has become a successful pretraining paradigm for Transformer models for text, images, and, recently, point clouds. Raw automotive datasets are suitable candidates for self-supervised pre-training as they generally are…

Computer Vision and Pattern Recognition · Computer Science 2023-03-10 Georg Hess , Johan Jaxing , Elias Svensson , David Hagerman , Christoffer Petersson , Lennart Svensson

Masked Autoencoders (MAE) have shown great potentials in self-supervised pre-training for language and 2D image transformers. However, it still remains an open question on how to exploit masked autoencoding for learning 3D representations…

Computer Vision and Pattern Recognition · Computer Science 2022-10-17 Renrui Zhang , Ziyu Guo , Rongyao Fang , Bin Zhao , Dong Wang , Yu Qiao , Hongsheng Li , Peng Gao

In computer-aided design (CAD) community, the point cloud data is pervasively applied in reverse engineering, where the point cloud analysis plays an important role. While a large number of supervised learning methods have been proposed to…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Cheng Zhang , Jian Shi , Xuan Deng , Zizhao Wu

Recently, the self-supervised learning framework data2vec has shown inspiring performance for various modalities using a masked student-teacher approach. However, it remains open whether such a framework generalizes to the unique challenges…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Karim Knaebel , Jonas Schult , Alexander Hermans , Bastian Leibe

Point clouds provide a flexible and natural representation usable in countless applications such as robotics or self-driving cars. Recently, deep neural networks operating on raw point cloud data have shown promising results on supervised…

Machine Learning · Computer Science 2019-06-04 Jonathan Sauder , Bjarne Sievers

Masked Autoencoders (MAE) have demonstrated promising performance in self-supervised learning for both 2D and 3D computer vision. Nevertheless, existing MAE-based methods still have certain drawbacks. Firstly, the functional decoupling…

Computer Vision and Pattern Recognition · Computer Science 2023-10-06 Yang Liu , Chen Chen , Can Wang , Xulin King , Mengyuan Liu

Self-supervised methods have been proven effective for learning deep representations of 3D point cloud data. Although recent methods in this domain often rely on random masking of inputs, the results of this approach can be improved. We…

Computer Vision and Pattern Recognition · Computer Science 2023-07-12 Michał Szachniewicz , Wojciech Kozłowski , Michał Stypułkowski , Maciej Zięba

Recent deep networks that directly handle points in a point set, e.g., PointNet, have been state-of-the-art for supervised learning tasks on point clouds such as classification and segmentation. In this work, a novel end-to-end deep…

Computer Vision and Pattern Recognition · Computer Science 2018-04-04 Yaoqing Yang , Chen Feng , Yiru Shen , Dong Tian

Masked autoencoders (MAE) have recently been introduced to 3D self-supervised pretraining for point clouds due to their great success in NLP and computer vision. Unlike MAEs used in the image domain, where the pretext task is to restore…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Siming Yan , Yuqi Yang , Yuxiao Guo , Hao Pan , Peng-shuai Wang , Xin Tong , Yang Liu , Qixing Huang

We present Point-BERT, a new paradigm for learning Transformers to generalize the concept of BERT to 3D point cloud. Inspired by BERT, we devise a Masked Point Modeling (MPM) task to pre-train point cloud Transformers. Specifically, we…

Computer Vision and Pattern Recognition · Computer Science 2022-06-07 Xumin Yu , Lulu Tang , Yongming Rao , Tiejun Huang , Jie Zhou , Jiwen Lu

Recognizing human actions from point cloud videos has attracted tremendous attention from both academia and industry due to its wide applications like automatic driving, robotics, and so on. However, current methods for point cloud action…

Computer Vision and Pattern Recognition · Computer Science 2022-09-02 Xiaodong Chen , Wu Liu , Xinchen Liu , Yongdong Zhang , Jungong Han , Tao Mei

Recently, multi-modal masked autoencoders (MAE) has been introduced in 3D self-supervised learning, offering enhanced feature learning by leveraging both 2D and 3D data to capture richer cross-modal representations. However, these…

Computer Vision and Pattern Recognition · Computer Science 2025-07-29 Zhimin Chen , Xuewei Chen , Xiao Guo , Yingwei Li , Longlong Jing , Liang Yang , Bing Li

Point cloud is a crucial representation of 3D contents, which has been widely used in many areas such as virtual reality, mixed reality, autonomous driving, etc. With the boost of the number of points in the data, how to efficiently…

Computer Vision and Pattern Recognition · Computer Science 2022-08-05 Kang You , Pan Gao , Qing Li
‹ Prev 1 2 3 10 Next ›