Computer Vision and Pattern Recognition · Computer Science
Improved Robustness of Vision Transformer via PreLayerNorm in Patch Embedding
Bum Jun Kim, Hyeyeon Choi, Hyeonah Jang, Dong Gu Lee +2
2021-11-17
Machine Learning · Computer Science
Understanding and Improving Layer Normalization
Jingjing Xu, Xu Sun, Zhiyuan Zhang, Guangxiang Zhao +1
2019-11-19
Computation and Language · Computer Science
DeepNet: Scaling Transformers to 1,000 Layers
Hongyu Wang, Shuming Ma, Li Dong, Shaohan Huang +2
2022-03-02
Machine Learning · Computer Science
GeoNorm: Unify Pre-Norm and Post-Norm with Geodesic Optimization
Chuanyang Zheng, Jiankai Sun, Yihang Gao, Chi Wang +10
2026-01-30
Computer Vision and Pattern Recognition · Computer Science
Learning to Merge Tokens in Vision Transformers
Cedric Renggli, André Susano Pinto, Neil Houlsby, Basil Mustafa +2
2022-02-25
Computer Vision and Pattern Recognition · Computer Science
Masked Transformer for image Anomaly Localization
Axel De Nardin, Pankaj Mishra, Gian Luca Foresti, Claudio Piciarelli
2022-10-28
Computer Vision and Pattern Recognition · Computer Science
Three things everyone should know about Vision Transformers
Hugo Touvron, Matthieu Cord, Alaaeldin El-Nouby, Jakob Verbeek +1
2022-03-21
Computer Vision and Pattern Recognition · Computer Science
Vision Transformers with Patch Diversification
Chengyue Gong, Dilin Wang, Meng Li, Vikas Chandra +1
2021-06-14
Machine Learning · Computer Science
FlashNorm: Fast Normalization for Transformers
Nils Graef, Filip Makraduli, Andrew Wasielewski, Matthew Clapp
2026-04-28
Computer Vision and Pattern Recognition · Computer Science
Surface Normal Estimation with Transformers
Barry Shichen Hu, Siyun Liang, Johannes Paetzold, Huy H. Nguyen +2
2024-01-12
Computer Vision and Pattern Recognition · Computer Science
DDT: Dual-branch Deformable Transformer for Image Denoising
Kangliang Liu, Xiangcheng Du, Sijie Liu, Yingbin Zheng +2
2023-04-14
Computation and Language · Computer Science
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
Zhijian Zhuo, Yutao Zeng, Ya Wang, Sijun Zhang +4
2025-12-09
Computer Vision and Pattern Recognition · Computer Science
LayerShuffle: Enhancing Robustness in Vision Transformers by Randomizing Layer Execution Order
Matthias Freiberger, Peter Kun, Anders Sundnes Løvlie, Sebastian Risi
2024-12-09
Computer Vision and Pattern Recognition · Computer Science
Exploiting Layer Normalization Fine-tuning in Visual Transformer Foundation Models for Classification
Zhaorui Tan, Tan Pan, Kaizhu Huang, Weimiao Yu +7
2025-08-12
Computer Vision and Pattern Recognition · Computer Science
DuoFormer: Leveraging Hierarchical Representations by Local and Global Attention Vision Transformer
Xiaoya Tang, Bodong Zhang, Man Minh Ho, Beatrice S. Knudsen +1
2025-06-17
Computer Vision and Pattern Recognition · Computer Science
Patch Is Not All You Need
Changzhen Li, Jie Zhang, Yang Wei, Zhilong Ji +2
2023-08-22