Related papers: Point Transformer

PointConvFormer: Revenge of the Point-based Convolution

We introduce PointConvFormer, a novel building block for point cloud based deep network architectures. Inspired by generalization theory, PointConvFormer combines ideas from point convolution, where filter weights are only based on relative…

Computer Vision and Pattern Recognition · Computer Science 2023-05-12 Wenxuan Wu , Li Fuxin , Qi Shan

PCT: Point cloud transformer

The irregular domain and lack of ordering make it challenging to design deep neural networks for point cloud processing. This paper presents a novel framework named Point Cloud Transformer(PCT) for point cloud learning. PCT is based on…

Computer Vision and Pattern Recognition · Computer Science 2021-06-08 Meng-Hao Guo , Jun-Xiong Cai , Zheng-Ning Liu , Tai-Jiang Mu , Ralph R. Martin , Shi-Min Hu

Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks

Many machine learning tasks such as multiple instance learning, 3D shape recognition, and few-shot image classification are defined on sets of instances. Since solutions to such problems do not depend on the order of elements of the set,…

Machine Learning · Computer Science 2019-05-28 Juho Lee , Yoonho Lee , Jungtaek Kim , Adam R. Kosiorek , Seungjin Choi , Yee Whye Teh

CpT: Convolutional Point Transformer for 3D Point Cloud Processing

We present CpT: Convolutional point Transformer - a novel deep learning architecture for dealing with the unstructured nature of 3D point cloud data. CpT is an improvement over existing attention-based Convolutions Neural Networks as well…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Chaitanya Kaul , Joshua Mitton , Hang Dai , Roderick Murray-Smith

Transfer Learning with Point Transformers

Point Transformers are near state-of-the-art models for classification, segmentation, and detection tasks on Point Cloud data. They utilize a self attention based mechanism to model large range spatial dependencies between multiple point…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Kartik Gupta , Rahul Vippala , Sahima Srivastava

GridFormer: Point-Grid Transformer for Surface Reconstruction

Implicit neural networks have emerged as a crucial technology in 3D surface reconstruction. To reconstruct continuous surfaces from discrete point clouds, encoding the input points into regular grid features (plane or volume) has been…

Computer Vision and Pattern Recognition · Computer Science 2024-01-05 Shengtao Li , Ge Gao , Yudong Liu , Yu-Shen Liu , Ming Gu

Point Cloud Learning with Transformer

Remarkable performance from Transformer networks in Natural Language Processing promote the development of these models in dealing with computer vision tasks such as image recognition and segmentation. In this paper, we introduce a novel…

Computer Vision and Pattern Recognition · Computer Science 2022-10-26 Qi Zhong , Xian-Feng Han

Transformer Meets Boundary Value Inverse Problems

A Transformer-based deep direct sampling method is proposed for electrical impedance tomography, a well-known severely ill-posed nonlinear boundary value inverse problem. A real-time reconstruction is achieved by evaluating the learned…

Machine Learning · Computer Science 2023-03-07 Ruchi Guo , Shuhao Cao , Long Chen

Points to Patches: Enabling the Use of Self-Attention for 3D Shape Recognition

While the Transformer architecture has become ubiquitous in the machine learning field, its adaptation to 3D shape recognition is non-trivial. Due to its quadratic computational complexity, the self-attention operator quickly becomes…

Computer Vision and Pattern Recognition · Computer Science 2022-04-11 Axel Berg , Magnus Oskarsson , Mark O'Connor

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Point cloud is an important type of geometric data structure. Due to its irregular format, most researchers transform such data to regular 3D voxel grids or collections of images. This, however, renders data unnecessarily voluminous and…

Computer Vision and Pattern Recognition · Computer Science 2017-04-12 Charles R. Qi , Hao Su , Kaichun Mo , Leonidas J. Guibas

Transformer in Transformer

Transformer is a new kind of neural architecture which encodes the input data as powerful features via the attention mechanism. Basically, the visual transformers first divide the input images into several local patches and then calculate…

Computer Vision and Pattern Recognition · Computer Science 2021-10-27 Kai Han , An Xiao , Enhua Wu , Jianyuan Guo , Chunjing Xu , Yunhe Wang

Patch Is Not All You Need

Vision Transformers have achieved great success in computer visions, delivering exceptional performance across various tasks. However, their inherent reliance on sequential input enforces the manual partitioning of images into patch…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Changzhen Li , Jie Zhang , Yang Wei , Zhilong Ji , Jinfeng Bai , Shiguang Shan

Learning point embedding for 3D data processing

Among 2D convolutional networks on point clouds, point-based approaches consume point clouds of fixed size directly. By analysis of PointNet, a pioneer in introducing deep learning into point sets, we reveal that current point-based methods…

Computer Vision and Pattern Recognition · Computer Science 2021-08-11 Zhenpeng Chen , Yuan li

Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions

Sorting is a fundamental operation of all computer systems, having been a long-standing significant research topic. Beyond the problem formulation of traditional sorting algorithms, we consider sorting problems for more abstract yet…

Machine Learning · Computer Science 2024-03-15 Jungtaek Kim , Jeongbeen Yoon , Minsu Cho

Transformer for Partial Differential Equations' Operator Learning

Data-driven learning of partial differential equations' solution operators has recently emerged as a promising paradigm for approximating the underlying solutions. The solution operators are usually parameterized by deep learning models…

Machine Learning · Computer Science 2023-05-01 Zijie Li , Kazem Meidani , Amir Barati Farimani

3D Object Detection with Pointformer

Feature learning for 3D object detection from point clouds is very challenging due to the irregularity of 3D point cloud data. In this paper, we propose Pointformer, a Transformer backbone designed for 3D point clouds to learn features…

Computer Vision and Pattern Recognition · Computer Science 2021-06-23 Xuran Pan , Zhuofan Xia , Shiji Song , Li Erran Li , Gao Huang

SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

Unsupervised object-centric learning aims to decompose scenes into interpretable object entities, termed slots. Slot-based auto-encoders stand out as a prominent method for this task. Within them, crucial aspects include guiding the encoder…

Computer Vision and Pattern Recognition · Computer Science 2024-04-08 Ioannis Kakogeorgiou , Spyros Gidaris , Konstantinos Karantzalos , Nikos Komodakis

Spatial Transformer for 3D Point Clouds

Deep neural networks are widely used for understanding 3D point clouds. At each point convolution layer, features are computed from local neighborhoods of 3D points and combined for subsequent processing in order to extract semantic…

Computer Vision and Pattern Recognition · Computer Science 2021-05-13 Jiayun Wang , Rudrasis Chakraborty , Stella X. Yu

Flexible Networks for Learning Physical Dynamics of Deformable Objects

Learning the physical dynamics of deformable objects with particle-based representation has been the objective of many computational models in machine learning. While several state-of-the-art models have achieved this objective in simulated…

Computer Vision and Pattern Recognition · Computer Science 2022-01-14 Jinhyung Park , DoHae Lee , In-Kwon Lee

PatchGT: Transformer over Non-trainable Clusters for Learning Graph Representations

Recently the Transformer structure has shown good performances in graph learning tasks. However, these Transformer models directly work on graph nodes and may have difficulties learning high-level information. Inspired by the vision…

Machine Learning · Computer Science 2023-04-11 Han Gao , Xu Han , Jiaoyang Huang , Jian-Xun Wang , Li-Ping Liu