English
Related papers

Related papers: Implicit Autoencoder for Point-Cloud Self-Supervis…

200 papers

The manual annotation for large-scale point clouds is still tedious and unavailable for many harsh real-world tasks. Self-supervised learning, which is used on raw and unlabeled data to pre-train deep neural networks, is a promising…

Computer Vision and Pattern Recognition · Computer Science 2022-10-31 Junsheng Zhou , Xin Wen , Baorui Ma , Yu-Shen Liu , Yue Gao , Yi Fang , Zhizhong Han

Masked Autoencoders (MAE) have shown great potentials in self-supervised pre-training for language and 2D image transformers. However, it still remains an open question on how to exploit masked autoencoding for learning 3D representations…

Computer Vision and Pattern Recognition · Computer Science 2022-10-17 Renrui Zhang , Ziyu Guo , Rongyao Fang , Bin Zhao , Dong Wang , Yu Qiao , Hongsheng Li , Peng Gao

Deep generative architectures provide a way to model not only images but also complex, 3-dimensional objects, such as point clouds. In this work, we present a novel method to obtain meaningful representations of 3D shapes that can be used…

Machine Learning · Computer Science 2019-05-03 Maciej Zamorski , Maciej Zięba , Piotr Klukowski , Rafał Nowak , Karol Kurach , Wojciech Stokowiec , Tomasz Trzciński

Transformer-based Self-supervised Representation Learning methods learn generic features from unlabeled datasets for providing useful network initialization parameters for downstream tasks. Recently, self-supervised learning based upon…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Jincen Jiang , Xuequan Lu , Lizhi Zhao , Richard Dazeley , Meili Wang

Learning 3D representation plays a critical role in masked autoencoder (MAE) based pre-training methods for point cloud, including single-modal and cross-modal based MAE. Specifically, although cross-modal MAE methods learn strong 3D…

Computer Vision and Pattern Recognition · Computer Science 2023-12-19 Yaohua Zha , Huizhen Ji , Jinmin Li , Rongsheng Li , Tao Dai , Bin Chen , Zhi Wang , Shu-Tao Xia

In the area of 3D shape analysis, the geometric properties of a shape have long been studied. Instead of directly extracting representative features using expert-designed descriptors or end-to-end deep neural networks, this paper is…

Computer Vision and Pattern Recognition · Computer Science 2021-12-22 Zongji Wang , Yunfei Liu , Feng Lu

Point cloud is a crucial representation of 3D contents, which has been widely used in many areas such as virtual reality, mixed reality, autonomous driving, etc. With the boost of the number of points in the data, how to efficiently…

Computer Vision and Pattern Recognition · Computer Science 2022-08-05 Kang You , Pan Gao , Qing Li

Masked autoencoders (MAE) have recently been introduced to 3D self-supervised pretraining for point clouds due to their great success in NLP and computer vision. Unlike MAEs used in the image domain, where the pretext task is to restore…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Siming Yan , Yuqi Yang , Yuxiao Guo , Hao Pan , Peng-shuai Wang , Xin Tong , Yang Liu , Qixing Huang

Masked Autoencoders (MAE) have demonstrated promising performance in self-supervised learning for both 2D and 3D computer vision. Nevertheless, existing MAE-based methods still have certain drawbacks. Firstly, the functional decoupling…

Computer Vision and Pattern Recognition · Computer Science 2023-10-06 Yang Liu , Chen Chen , Can Wang , Xulin King , Mengyuan Liu

We consider the problem of image representation for the tasks of unsupervised learning and semi-supervised learning. In those learning tasks, the raw image vectors may not provide enough representation for their intrinsic structures due to…

Machine Learning · Computer Science 2014-02-20 Yiyi Liao , Yue Wang , Yong Liu

Recently, multi-modal masked autoencoders (MAE) has been introduced in 3D self-supervised learning, offering enhanced feature learning by leveraging both 2D and 3D data to capture richer cross-modal representations. However, these…

Computer Vision and Pattern Recognition · Computer Science 2025-07-29 Zhimin Chen , Xuewei Chen , Xiao Guo , Yingwei Li , Longlong Jing , Liang Yang , Bing Li

We propose a deep autoencoder with graph topology inference and filtering to achieve compact representations of unorganized 3D point clouds in an unsupervised manner. Many previous works discretize 3D points to voxels and then use…

Computer Vision and Pattern Recognition · Computer Science 2020-02-19 Siheng Chen , Chaojing Duan , Yaoqing Yang , Duanshun Li , Chen Feng , Dong Tian

Recent deep networks that directly handle points in a point set, e.g., PointNet, have been state-of-the-art for supervised learning tasks on point clouds such as classification and segmentation. In this work, a novel end-to-end deep…

Computer Vision and Pattern Recognition · Computer Science 2018-04-04 Yaoqing Yang , Chen Feng , Yiru Shen , Dong Tian

Three-dimensional geometric data offer an excellent domain for studying representation learning and generative modeling. In this paper, we look at geometric data represented as point clouds. We introduce a deep AutoEncoder (AE) network with…

Computer Vision and Pattern Recognition · Computer Science 2018-06-13 Panos Achlioptas , Olga Diamanti , Ioannis Mitliagkas , Leonidas Guibas

Pre-training by numerous image data has become de-facto for robust 2D representations. In contrast, due to the expensive data acquisition and annotation, a paucity of large-scale 3D datasets severely hinders the learning for high-quality 3D…

Computer Vision and Pattern Recognition · Computer Science 2022-12-14 Renrui Zhang , Liuhui Wang , Yu Qiao , Peng Gao , Hongsheng Li

Point cloud is a fundamental 3D representation which is widely used in real world applications such as autonomous driving. As a newly-developed media format which is characterized by complexity and irregularity, point cloud creates a need…

Computer Vision and Pattern Recognition · Computer Science 2019-05-10 Wei Yan , Yiting shao , Shan Liu , Thomas H Li , Zhu Li , Ge Li

Shape priors learned from data are commonly used to reconstruct 3D objects from partial or noisy data. Yet no such shape priors are available for indoor scenes, since typical 3D autoencoders cannot handle their scale, complexity, or…

Computer Vision and Pattern Recognition · Computer Science 2020-03-23 Chiyu Max Jiang , Avneesh Sud , Ameesh Makadia , Jingwei Huang , Matthias Nießner , Thomas Funkhouser

Masked autoencoder has been widely explored in point cloud self-supervised learning, whereby the point cloud is generally divided into visible and masked parts. These methods typically include an encoder accepting visible patches…

Computer Vision and Pattern Recognition · Computer Science 2024-10-25 Xiangdong Zhang , Shaofeng Zhang , Junchi Yan

As a promising scheme of self-supervised learning, masked autoencoding has significantly advanced natural language processing and computer vision. Inspired by this, we propose a neat scheme of masked autoencoders for point cloud…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Yatian Pang , Wenxiao Wang , Francis E. H. Tay , Wei Liu , Yonghong Tian , Li Yuan

Learning structures of 3D shapes is a fundamental problem in the field of computer graphics and geometry processing. We present a simple yet interpretable unsupervised method for learning a new structural representation in the form of 3D…

Computer Vision and Pattern Recognition · Computer Science 2020-03-27 Nenglun Chen , Lingjie Liu , Zhiming Cui , Runnan Chen , Duygu Ceylan , Changhe Tu , Wenping Wang
‹ Prev 1 2 3 10 Next ›