Related papers: Transformer Wave Function for Quantum Long-Range m…

Transformer variational wave functions for frustrated quantum spin systems

The Transformer architecture has become the state-of-art model for natural language processing tasks and, more recently, also for computer vision tasks, thus defining the Vision Transformer (ViT) architecture. The key feature is the ability…

Disordered Systems and Neural Networks · Physics 2023-06-13 Luciano Loris Viteritti , Riccardo Rende , Federico Becca

Vision Transformer Neural Quantum States for Impurity Models

Transformer neural networks, known for their ability to recognize complex patterns in high-dimensional data, offer a promising framework for capturing many-body correlations in quantum systems. We employ an adapted Vision Transformer (ViT)…

Strongly Correlated Electrons · Physics 2024-08-26 Xiaodong Cao , Zhicheng Zhong , Yi Lu

Fine-tuning Vision Transformers for the Prediction of State Variables in Ising Models

Transformers are state-of-the-art deep learning models that are composed of stacked attention and point-wise, fully connected layers designed for handling sequential data. Transformers are not only ubiquitous throughout Natural Language…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Onur Kara , Arijit Sehanobish , Hector H Corzo

Quantum-Enhanced Vision Transformer for Flood Detection using Remote Sensing Imagery

Reliable flood detection is critical for disaster management, yet classical deep learning models often struggle with the high-dimensional, nonlinear complexities inherent in remote sensing data. To mitigate these limitations, we introduced…

Machine Learning · Computer Science 2026-03-17 Soumyajit Maity , Behzad Ghanbarian

Pattern Description of Quantum Phase Transitions in the Transverse Antiferromagnetic Ising Model with a Longitudinal Field

Despite of simplicity of the transverse antiferromagnetic Ising model with a uniform longitudinal field, its phases and involved quntum phase transitions (QPTs) are nontrivial in comparison to its ferromagnetic counterpart. For example,…

Statistical Mechanics · Physics 2025-11-19 Yun-Tong Yang , Hong-Gang Luo

Application of the Interface Approach in Quantum Ising Models

We investigate phase transitions in the Ising model and the ANNNI model in transverse field using the interface approach. The exact result of the Ising chain in a transverse field is reproduced. We find that apart from the interfacial…

Statistical Mechanics · Physics 2009-10-30 Parongama Sen

Efficiently Training Vision Transformers on Structural MRI Scans for Alzheimer's Disease Detection

Neuroimaging of large populations is valuable to identify factors that promote or resist brain disease, and to assist diagnosis, subtyping, and prognosis. Data-driven models such as convolutional neural networks (CNNs) have increasingly…

Image and Video Processing · Electrical Eng. & Systems 2023-03-16 Nikhil J. Dhinagar , Sophia I. Thomopoulos , Emily Laltoo , Paul M. Thompson

Scaling Vision Transformers

Attention-based neural networks such as the Vision Transformer (ViT) have recently attained state-of-the-art results on many computer vision benchmarks. Scale is a primary ingredient in attaining excellent results, therefore, understanding…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Xiaohua Zhai , Alexander Kolesnikov , Neil Houlsby , Lucas Beyer

Improving Vision Transformers by Revisiting High-frequency Components

The transformer models have shown promising effectiveness in dealing with various vision tasks. However, compared with training Convolutional Neural Network (CNN) models, training Vision Transformer (ViT) models is more difficult and relies…

Computer Vision and Pattern Recognition · Computer Science 2022-07-28 Jiawang Bai , Li Yuan , Shu-Tao Xia , Shuicheng Yan , Zhifeng Li , Wei Liu

Neural-network quantum state study of the long-range antiferromagnetic Ising chain

We investigate quantum phase transitions in the transverse field Ising chain with algebraically decaying long-range (LR) antiferromagnetic interactions using the variational Monte Carlo method with the restricted Boltzmann machine employed…

Statistical Mechanics · Physics 2024-06-14 Jicheol Kim , Dongkyu Kim , Dong-Hee Kim

CViT: Continuous Vision Transformer for Operator Learning

Operator learning, which aims to approximate maps between infinite-dimensional function spaces, is an important area in scientific machine learning with applications across various physical domains. Here we introduce the Continuous Vision…

Machine Learning · Computer Science 2025-02-18 Sifan Wang , Jacob H Seidman , Shyam Sankaran , Hanwen Wang , George J. Pappas , Paris Perdikaris

Intriguing Properties of Vision Transformers

Vision transformers (ViT) have demonstrated impressive performance across various machine vision problems. These models are based on multi-head self-attention mechanisms that can flexibly attend to a sequence of image patches to encode…

Computer Vision and Pattern Recognition · Computer Science 2021-11-29 Muzammal Naseer , Kanchana Ranasinghe , Salman Khan , Munawar Hayat , Fahad Shahbaz Khan , Ming-Hsuan Yang

Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers

Vision transformers (ViTs) have demonstrated their superior accuracy for computer vision tasks compared to convolutional neural networks (CNNs). However, ViT models are often computation-intensive for efficient deployment on…

Machine Learning · Computer Science 2024-07-26 Zhengang Li , Alec Lu , Yanyue Xie , Zhenglun Kong , Mengshu Sun , Hao Tang , Zhong Jia Xue , Peiyan Dong , Caiwen Ding , Yanzhi Wang , Xue Lin , Zhenman Fang

Sub-token ViT Embedding via Stochastic Resonance Transformers

Vision Transformer (ViT) architectures represent images as collections of high-dimensional vectorized tokens, each corresponding to a rectangular non-overlapping patch. This representation trades spatial granularity for embedding…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Dong Lao , Yangchao Wu , Tian Yu Liu , Alex Wong , Stefano Soatto

Quantum Embedding with Transformer for High-dimensional Data

Quantum embedding with transformers is a novel and promising architecture for quantum machine learning to deliver exceptional capability on near-term devices or simulators. The research incorporated a vision transformer (ViT) to advance…

Quantum Physics · Physics 2024-02-21 Hao-Yuan Chen , Yen-Jui Chang , Shih-Wei Liao , Ching-Ray Chang

Vision Transformers on the Edge: A Comprehensive Survey of Model Compression and Acceleration Strategies

In recent years, vision transformers (ViTs) have emerged as powerful and promising techniques for computer vision tasks such as image classification, object detection, and segmentation. Unlike convolutional neural networks (CNNs), which…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Shaibal Saha , Lanyu Xu

Interpreting vision transformers via residual replacement model

How do vision transformers (ViTs) represent and process the world? This paper addresses this long-standing question through the first systematic analysis of 6.6K features across all layers, extracted via sparse autoencoders, and by…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Jinyeong Kim , Junhyeok Kim , Yumin Shim , Joohyeok Kim , Sunyoung Jung , Seong Jae Hwang

How to Train Vision Transformer on Small-scale Datasets?

Vision Transformer (ViT), a radically different architecture than convolutional neural networks offers multiple advantages including design simplicity, robustness and state-of-the-art performance on many vision tasks. However, in contrast…

Computer Vision and Pattern Recognition · Computer Science 2022-10-14 Hanan Gani , Muzammal Naseer , Mohammad Yaqub

Vision Transformers for End-to-End Quark-Gluon Jet Classification from Calorimeter Images

Distinguishing between quark- and gluon-initiated jets is a critical and challenging task in high-energy physics, pivotal for improving new physics searches and precision measurements at the Large Hadron Collider. While deep learning,…

Computer Vision and Pattern Recognition · Computer Science 2025-06-19 Md Abrar Jahin , Shahriar Soudeep , Arian Rahman Aditta , M. F. Mridha , Nafiz Fahad , Md. Jakir Hossen

Data-Efficient Realized Volatility Forecasting with Vision Transformers

Recent work in financial machine learning has shown the virtue of complexity: the phenomenon by which deep learning methods capable of learning highly nonlinear relationships outperform simpler approaches in financial forecasting. While…

Machine Learning · Computer Science 2025-11-06 Emi Soroka , Artem Arzyn