Related papers: Point Cloud Pre-training with Diffusion Models

Point Cloud Resampling with Learnable Heat Diffusion

Generative diffusion models have shown empirical successes in point cloud resampling, generating a denser and more uniform distribution of points from sparse or noisy 3D point clouds by progressively refining noise into structure. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Wenqiang Xu , Wenrui Dai , Duoduo Xue , Ziyang Zheng , Chenglin Li , Junni Zou , Hongkai Xiong

Rethinking Point Clouds as Sequences: A Causal Next-Token Predictive Learning Framework

With the rapid progress of multimodal foundation models and predictive pre-training, an important open question is how to equip 3D point clouds with a pre-training paradigm that is better aligned with next-token and next-embedding learning.…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Yumeng Yao , Jingzhi Dong , Haowen Gu , Tao Chen , Zonghan Wu , Xiaoshui Huang , Yazhou Yao

Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models

With the overwhelming trend of mask image modeling led by MAE, generative pre-training has shown a remarkable potential to boost the performance of fundamental models in 2D vision. However, in 3D vision, the over-reliance on…

Computer Vision and Pattern Recognition · Computer Science 2023-09-08 Ziyi Wang , Xumin Yu , Yongming Rao , Jie Zhou , Jiwen Lu

Masked Discrimination for Self-Supervised Learning on Point Clouds

Masked autoencoding has achieved great success for self-supervised learning in the image and language domains. However, mask based pretraining has yet to show benefits for point cloud understanding, likely due to standard backbones like…

Computer Vision and Pattern Recognition · Computer Science 2022-08-02 Haotian Liu , Mu Cai , Yong Jae Lee

Diffusion Masked Pretraining for Dynamic Point Cloud

Dynamic point cloud pretraining is still dominated by masked reconstruction objectives. However, these objectives inherit two key limitations. Existing methods inject ground-truth tube centers as decoder positional embeddings, causing…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Zhuoyue Zhang , Jihua Zhu , Chaowei Fang , Jian Liu , Ajmal Saeed Mian

Point Discriminative Learning for Data-efficient 3D Point Cloud Analysis

3D point cloud analysis has drawn a lot of research attention due to its wide applications. However, collecting massive labelled 3D point cloud data is both time-consuming and labor-intensive. This calls for data-efficient learning methods.…

Computer Vision and Pattern Recognition · Computer Science 2023-01-23 Fayao Liu , Guosheng Lin , Chuan-Sheng Foo , Chaitanya K. Joshi , Jie Lin

P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting

Nowadays, pre-training big models on large-scale datasets has become a crucial topic in deep learning. The pre-trained models with high representation ability and transferability achieve a great success and dominate many downstream tasks in…

Computer Vision and Pattern Recognition · Computer Science 2022-10-13 Ziyi Wang , Xumin Yu , Yongming Rao , Jie Zhou , Jiwen Lu

Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling

We present Point-BERT, a new paradigm for learning Transformers to generalize the concept of BERT to 3D point cloud. Inspired by BERT, we devise a Masked Point Modeling (MPM) task to pre-train point cloud Transformers. Specifically, we…

Computer Vision and Pattern Recognition · Computer Science 2022-06-07 Xumin Yu , Lulu Tang , Yongming Rao , Tiejun Huang , Jie Zhou , Jiwen Lu

Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised Learning

Diffusion-based models, widely used in text-to-image generation, have proven effective in 2D representation learning. Recently, this framework has been extended to 3D self-supervised learning by constructing a conditional point generator…

Computer Vision and Pattern Recognition · Computer Science 2025-07-15 Yiyang Chen , Shanshan Zhao , Lunhao Duan , Changxing Ding , Dacheng Tao

PointDiffuse: A Dual-Conditional Diffusion Model for Enhanced Point Cloud Semantic Segmentation

Diffusion probabilistic models are traditionally used to generate colors at fixed pixel positions in 2D images. Building on this, we extend diffusion models to point cloud semantic segmentation, where point positions also remain fixed, and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-12 Yong He , Hongshan Yu , Mingtao Feng , Tongjia Chen , Zechuan Li , Anwaar Ulhaq , Saeed Anwar , Ajmal Saeed Mian

RealDiff: Real-world 3D Shape Completion using Self-Supervised Diffusion Models

Point cloud completion aims to recover the complete 3D shape of an object from partial observations. While approaches relying on synthetic shape priors achieved promising results in this domain, their applicability and generalizability to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Başak Melis Öcal , Maxim Tatarchenko , Sezer Karaoglu , Theo Gevers

Point Cloud Upsampling and Normal Estimation using Deep Learning for Robust Surface Reconstruction

The reconstruction of real-world surfaces is on high demand in various applications. Most existing reconstruction approaches apply 3D scanners for creating point clouds which are generally sparse and of low density. These points clouds will…

Computer Vision and Pattern Recognition · Computer Science 2021-03-01 Rajat Sharma , Tobias Schwandt , Christian Kunert , Steffen Urban , Wolfgang Broll

Inferring Neural Signed Distance Functions by Overfitting on Single Noisy Point Clouds through Finetuning Data-Driven based Priors

It is important to estimate an accurate signed distance function (SDF) from a point cloud in many computer vision applications. The latest methods learn neural SDFs using either a data-driven based or an overfitting-based strategy. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-28 Chao Chen , Yu-Shen Liu , Zhizhong Han

Data-Efficient Point Cloud Semantic Segmentation Pipeline for Unimproved Roads

In this case study, we present a data-efficient point cloud segmentation pipeline and training framework for robust segmentation of unimproved roads and seven other classes. Our method employs a two-stage training framework: first, a…

Image and Video Processing · Electrical Eng. & Systems 2025-08-29 Andrew Yarovoi , Christopher R. Valenta

Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes

In the recent years, the research community has witnessed growing use of 3D point cloud data for the high applicability in various real-world applications. By means of 3D point cloud, this modality enables to consider the actual size and…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Daichi Otsuka , Shinichi Mae , Ryosuke Yamada , Hirokatsu Kataoka

3D Point Cloud Pre-training with Knowledge Distillation from 2D Images

The recent success of pre-trained 2D vision models is mostly attributable to learning from large-scale datasets. However, compared with 2D image datasets, the current pre-training data of 3D point cloud is limited. To overcome this…

Computer Vision and Pattern Recognition · Computer Science 2022-12-20 Yuan Yao , Yuanhan Zhang , Zhenfei Yin , Jiebo Luo , Wanli Ouyang , Xiaoshui Huang

Surface Reconstruction from Point Clouds by Learning Predictive Context Priors

Surface reconstruction from point clouds is vital for 3D computer vision. State-of-the-art methods leverage large datasets to first learn local context priors that are represented as neural network-based signed distance functions (SDFs)…

Computer Vision and Pattern Recognition · Computer Science 2022-04-26 Baorui Ma , Yu-Shen Liu , Matthias Zwicker , Zhizhong Han

PCRDiffusion: Diffusion Probabilistic Models for Point Cloud Registration

We propose a new framework that formulates point cloud registration as a denoising diffusion process from noisy transformation to object transformation. During training stage, object transformation diffuses from ground-truth transformation…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Yue Wu , Yongzhe Yuan , Xiaolong Fan , Xiaoshui Huang , Maoguo Gong , Qiguang Miao

PointVST: Self-Supervised Pre-training for 3D Point Clouds via View-Specific Point-to-Image Translation

The past few years have witnessed the great success and prevalence of self-supervised representation learning within the language and 2D vision communities. However, such advancements have not been fully migrated to the field of 3D point…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Qijian Zhang , Junhui Hou

Point cloud completion via structured feature maps using a feedback network

In this paper, we tackle the challenging problem of point cloud completion from the perspective of feature learning. Our key observation is that to recover the underlying structures as well as surface details, given partial input, a…

Computer Vision and Pattern Recognition · Computer Science 2022-09-15 Zejia Su , Haibin Huang , Chongyang Ma , Hui Huang , Ruizhen Hu