Related papers: Point Cloud Learning with Transformer

PCT: Point cloud transformer

The irregular domain and lack of ordering make it challenging to design deep neural networks for point cloud processing. This paper presents a novel framework named Point Cloud Transformer(PCT) for point cloud learning. PCT is based on…

Computer Vision and Pattern Recognition · Computer Science 2021-06-08 Meng-Hao Guo , Jun-Xiong Cai , Zheng-Ning Liu , Tai-Jiang Mu , Ralph R. Martin , Shi-Min Hu

Dual Transformer for Point Cloud Analysis

Following the tremendous success of transformer in natural language processing and image understanding tasks, in this paper, we present a novel point cloud representation learning architecture, named Dual Transformer Network (DTNet), which…

Computer Vision and Pattern Recognition · Computer Science 2021-04-28 Xian-Feng Han , Yi-Fei Jin , Hui-Xian Cheng , Guo-Qiang Xiao

Cross-Level Cross-Scale Cross-Attention Network for Point Cloud Representation

Self-attention mechanism recently achieves impressive advancement in Natural Language Processing (NLP) and Image Processing domains. And its permutation invariance property makes it ideally suitable for point cloud processing. Inspired by…

Computer Vision and Pattern Recognition · Computer Science 2021-04-28 Xian-Feng Han , Zhang-Yue He , Jia Chen , Guo-Qiang Xiao

Transformers in 3D Point Clouds: A Survey

Transformers have been at the heart of the Natural Language Processing (NLP) and Computer Vision (CV) revolutions. The significant success in NLP and CV inspired exploring the use of Transformers in point cloud processing. However, how do…

Computer Vision and Pattern Recognition · Computer Science 2022-09-22 Dening Lu , Qian Xie , Mingqiang Wei , Kyle Gao , Linlin Xu , Jonathan Li

Surface Normal Estimation with Transformers

We propose the use of a Transformer to accurately predict normals from point clouds with noise and density variations. Previous learning-based methods utilize PointNet variants to explicitly extract multi-scale features at different input…

Computer Vision and Pattern Recognition · Computer Science 2024-01-12 Barry Shichen Hu , Siyun Liang , Johannes Paetzold , Huy H. Nguyen , Isao Echizen , Jiapeng Tang

3D Medical Point Transformer: Introducing Convolution to Attention Networks for Medical Point Cloud Analysis

General point clouds have been increasingly investigated for different tasks, and recently Transformer-based networks are proposed for point cloud analysis. However, there are barely related works for medical point clouds, which are…

Image and Video Processing · Electrical Eng. & Systems 2021-12-20 Jianhui Yu , Chaoyi Zhang , Heng Wang , Dingxin Zhang , Yang Song , Tiange Xiang , Dongnan Liu , Weidong Cai

Transfer Learning with Point Transformers

Point Transformers are near state-of-the-art models for classification, segmentation, and detection tasks on Point Cloud data. They utilize a self attention based mechanism to model large range spatial dependencies between multiple point…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Kartik Gupta , Rahul Vippala , Sahima Srivastava

PointCAT: Cross-Attention Transformer for point cloud

Transformer-based models have significantly advanced natural language processing and computer vision in recent years. However, due to the irregular and disordered structure of point cloud data, transformer-based models for 3D deep learning…

Computer Vision and Pattern Recognition · Computer Science 2023-04-07 Xincheng Yang , Mingze Jin , Weiji He , Qian Chen

Multi-modal Multi-task Pre-training for Improved Point Cloud Understanding

Recent advances in multi-modal pre-training methods have shown promising effectiveness in learning 3D representations by aligning multi-modal features between 3D shapes and their corresponding 2D counterparts. However, existing multi-modal…

Computer Vision and Pattern Recognition · Computer Science 2025-07-24 Liwen Liu , Weidong Yang , Lipeng Ma , Ben Fei

PointGPT: Auto-regressively Generative Pre-training from Point Clouds

Large language models (LLMs) based on the generative pre-training transformer (GPT) have demonstrated remarkable effectiveness across a diverse range of downstream tasks. Inspired by the advancements of the GPT, we present PointGPT, a novel…

Computer Vision and Pattern Recognition · Computer Science 2023-05-24 Guangyan Chen , Meiling Wang , Yi Yang , Kai Yu , Li Yuan , Yufeng Yue

PointResNet: Residual Network for 3D Point Cloud Segmentation and Classification

Point cloud segmentation and classification are some of the primary tasks in 3D computer vision with applications ranging from augmented reality to robotics. However, processing point clouds using deep learning-based algorithms is quite…

Computer Vision and Pattern Recognition · Computer Science 2022-11-22 Aadesh Desai , Saagar Parikh , Seema Kumari , Shanmuganathan Raman

3D Object Detection with Pointformer

Feature learning for 3D object detection from point clouds is very challenging due to the irregularity of 3D point cloud data. In this paper, we propose Pointformer, a Transformer backbone designed for 3D point clouds to learn features…

Computer Vision and Pattern Recognition · Computer Science 2021-06-23 Xuran Pan , Zhuofan Xia , Shiji Song , Li Erran Li , Gao Huang

Multi-scale Geometry-aware Transformer for 3D Point Cloud Classification

Self-attention modules have demonstrated remarkable capabilities in capturing long-range relationships and improving the performance of point cloud tasks. However, point cloud objects are typically characterized by complex, disordered, and…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Xian Wei , Muyu Wang , Shing-Ho Jonathan Lin , Zhengyu Li , Jian Yang , Arafat Al-Jawari , Xuan Tang

PU-Transformer: Point Cloud Upsampling Transformer

Given the rapid development of 3D scanners, point clouds are becoming popular in AI-driven machines. However, point cloud data is inherently sparse and irregular, causing significant difficulties for machine perception. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Shi Qiu , Saeed Anwar , Nick Barnes

Point Cloud Recognition with Position-to-Structure Attention Transformers

In this paper, we present Position-to-Structure Attention Transformers (PS-Former), a Transformer-based algorithm for 3D point cloud recognition. PS-Former deals with the challenge in 3D point cloud representation where points are not…

Computer Vision and Pattern Recognition · Computer Science 2022-10-06 Zheng Ding , James Hou , Zhuowen Tu

PointMT: Efficient Point Cloud Analysis with Hybrid MLP-Transformer Architecture

In recent years, point cloud analysis methods based on the Transformer architecture have made significant progress, particularly in the context of multimedia applications such as 3D modeling, virtual reality, and autonomous systems.…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Qiang Zheng , Chao Zhang , Jian Sun

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Point cloud is an important type of geometric data structure. Due to its irregular format, most researchers transform such data to regular 3D voxel grids or collections of images. This, however, renders data unnecessarily voluminous and…

Computer Vision and Pattern Recognition · Computer Science 2017-04-12 Charles R. Qi , Hao Su , Kaichun Mo , Leonidas J. Guibas

Transformers for Object Detection in Large Point Clouds

We present TransLPC, a novel detection model for large point clouds that is based on a transformer architecture. While object detection with transformers has been an active field of research, it has proved difficult to apply such models to…

Computer Vision and Pattern Recognition · Computer Science 2022-10-03 Felicia Ruppel , Florian Faion , Claudius Gläser , Klaus Dietmayer

Point Transformer

Self-attention networks have revolutionized natural language processing and are making impressive strides in image analysis tasks such as image classification and object detection. Inspired by this success, we investigate the application of…

Computer Vision and Pattern Recognition · Computer Science 2021-09-28 Hengshuang Zhao , Li Jiang , Jiaya Jia , Philip Torr , Vladlen Koltun

3DCTN: 3D Convolution-Transformer Network for Point Cloud Classification

Although accurate and fast point cloud classification is a fundamental task in 3D applications, it is difficult to achieve this purpose due to the irregularity and disorder of point clouds that make it challenging to achieve effective and…

Computer Vision and Pattern Recognition · Computer Science 2022-03-03 Dening Lu , Qian Xie , Linlin Xu , Jonathan Li