Related papers: TOAST: Transformer Optimization using Adaptive and…

TOAST: Task-Oriented Adaptive Semantic Transmission over Dynamic Wireless Environments

The evolution toward 6G networks demands a fundamental shift from bit-centric transmission to semantic-aware communication that emphasizes task-relevant information. This work introduces TOAST (Task-Oriented Adaptive Semantic Transmission),…

Machine Learning · Computer Science 2025-06-30 Sheng Yun , Jianhua Pei , Ping Wang

TOAST: Transfer Learning via Attention Steering

Transfer learning involves adapting a pre-trained model to novel downstream tasks. However, we observe that current transfer learning methods often fail to focus on task-relevant features. In this work, we explore refocusing model attention…

Computer Vision and Pattern Recognition · Computer Science 2023-07-12 Baifeng Shi , Siyu Gai , Trevor Darrell , Xin Wang

Enhancing Transformer-Based Vision Models: Addressing Feature Map Anomalies Through Novel Optimization Strategies

Vision Transformers (ViTs) have demonstrated superior performance across a wide range of computer vision tasks. However, structured noise artifacts in their feature maps hinder downstream applications such as segmentation and depth…

Computer Vision and Pattern Recognition · Computer Science 2025-09-25 Sumit Mamtani

ToaSt: Token Channel Selection and Structured Pruning for Efficient ViT

Vision Transformers (ViTs) have achieved remarkable success across various vision tasks, yet their deployment is often hindered by prohibitive computational costs. While structured weight pruning and token compression have emerged as…

Computer Vision and Pattern Recognition · Computer Science 2026-02-19 Hyunchan Moon , Cheonjun Park , Steven L. Waslander

Adaptive Layer Selection for Efficient Vision Transformer Fine-Tuning

Recently, foundation models based on Vision Transformers (ViTs) have become widely available. However, their fine-tuning process is highly resource-intensive, and it hinders their adoption in several edge or low-energy applications. To this…

Computer Vision and Pattern Recognition · Computer Science 2024-08-19 Alessio Devoto , Federico Alvetreti , Jary Pomponi , Paolo Di Lorenzo , Pasquale Minervini , Simone Scardapane

Constraint-Informed Learning for Warm Starting Trajectory Optimization

Future spacecraft and surface robotic missions require increasingly capable autonomy stacks for exploring challenging and unstructured domains, and trajectory optimization will be a cornerstone of such autonomy stacks. However, the…

Robotics · Computer Science 2024-09-18 Julia Briden , Changrak Choi , Kyongsik Yun , Richard Linares , Abhishek Cauligi

TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation

Current referring expression comprehension algorithms can effectively detect or segment objects indicated by nouns, but how to understand verb reference is still under-explored. As such, we study the challenging problem of task oriented…

Computer Vision and Pattern Recognition · Computer Science 2022-10-20 Pengfei Li , Beiwen Tian , Yongliang Shi , Xiaoxue Chen , Hao Zhao , Guyue Zhou , Ya-Qin Zhang

BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference

Large-scale foundation models have demonstrated exceptional performance in language and vision tasks. However, the numerous dense matrix-vector operations involved in these large networks pose significant computational challenges during…

Machine Learning · Computer Science 2024-10-31 Changwoo Lee , Soo Min Kwon , Qing Qu , Hun-Seok Kim

RECAST: Reparameterized, Compact weight Adaptation for Sequential Tasks

Incremental learning aims to adapt to new sets of categories over time with minimal computational overhead. Prior work often addresses this task by training efficient task-specific adaptors that modify frozen layer weights or features to…

Computer Vision and Pattern Recognition · Computer Science 2025-03-17 Nazia Tasnim , Bryan A. Plummer

Adaptivity and Modularity for Efficient Generalization Over Task Complexity

Can transformers generalize efficiently on problems that require dealing with examples with different levels of difficulty? We introduce a new task tailored to assess generalization over different complexities and present results that…

Machine Learning · Computer Science 2023-10-16 Samira Abnar , Omid Saremi , Laurent Dinh , Shantel Wilson , Miguel Angel Bautista , Chen Huang , Vimal Thilak , Etai Littwin , Jiatao Gu , Josh Susskind , Samy Bengio

TeST: Test-time Self-Training under Distribution Shift

Despite their recent success, deep neural networks continue to perform poorly when they encounter distribution shifts at test time. Many recently proposed approaches try to counter this by aligning the model to the new distribution prior to…

Computer Vision and Pattern Recognition · Computer Science 2022-09-26 Samarth Sinha , Peter Gehler , Francesco Locatello , Bernt Schiele

NAT: Neural Architecture Transformer for Accurate and Compact Architectures

Designing effective architectures is one of the key factors behind the success of deep neural networks. Existing deep architectures are either manually designed or automatically searched by some Neural Architecture Search (NAS) methods.…

Machine Learning · Computer Science 2020-01-14 Yong Guo , Yin Zheng , Mingkui Tan , Qi Chen , Jian Chen , Peilin Zhao , Junzhou Huang

ResT: An Efficient Transformer for Visual Recognition

This paper presents an efficient multi-scale vision Transformer, called ResT, that capably served as a general-purpose backbone for image recognition. Unlike existing Transformer methods, which employ standard Transformer blocks to tackle…

Computer Vision and Pattern Recognition · Computer Science 2021-10-15 Qinglong Zhang , Yubin Yang

DEFT-VTON: Efficient Virtual Try-On with Consistent Generalised H-Transform

Diffusion models enable high-quality virtual try-on (VTO) with their established image synthesis abilities. Despite the extensive end-to-end training of large pre-trained models involved in current VTO methods, real-world applications often…

Computer Vision and Pattern Recognition · Computer Science 2025-09-18 Xingzi Xu , Qi Li , Shuwen Qiu , Julien Han , Karim Bouyarmane

LATST: Are Transformers Necessarily Complex for Time-Series Forecasting

Transformer-based architectures have achieved remarkable success in natural language processing and computer vision. However, their performance in multivariate long-term forecasting often falls short compared to simpler linear baselines.…

Machine Learning · Computer Science 2025-07-09 Dizhen Liang

Going deeper with Image Transformers

Transformers have been recently adapted for large scale image classification, achieving high scores shaking up the long supremacy of convolutional neural networks. However the optimization of image transformers has been little studied so…

Computer Vision and Pattern Recognition · Computer Science 2021-04-08 Hugo Touvron , Matthieu Cord , Alexandre Sablayrolles , Gabriel Synnaeve , Hervé Jégou

FAAST: Forward-Only Associative Learning via Closed-Form Fast Weights for Test-Time Supervised Adaptation

Adapting pretrained models typically involves a trade-off between the high training costs of backpropagation and the heavy inference overhead of memory-based or in-context learning. We propose FAAST, a forward-only associative adaptation…

Machine Learning · Computer Science 2026-05-11 Guangsheng Bao , Hongbo Zhang , Han Cui , Ke Sun , Yanbin Zhao , Juncai He , Yue Zhang

DeepViT: Towards Deeper Vision Transformer

Vision transformers (ViTs) have been successfully applied in image classification tasks recently. In this paper, we show that, unlike convolution neural networks (CNNs)that can be improved by stacking more convolutional layers, the…

Computer Vision and Pattern Recognition · Computer Science 2021-04-20 Daquan Zhou , Bingyi Kang , Xiaojie Jin , Linjie Yang , Xiaochen Lian , Zihang Jiang , Qibin Hou , Jiashi Feng

A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition

This paper presents a novel knowledge distillation neural architecture leveraging efficient transformer networks for effective image classification. Natural images display intricate arrangements encompassing numerous extraneous elements.…

Computer Vision and Pattern Recognition · Computer Science 2025-02-25 Dewan Tauhid Rahman , Yeahia Sarker , Antar Mazumder , Md. Shamim Anower

TART: Token-based Architecture Transformer for Neural Network Performance Prediction

In the realm of neural architecture design, achieving high performance is largely reliant on the manual expertise of researchers. Despite the emergence of Neural Architecture Search (NAS) as a promising technique for automating this…

Machine Learning · Computer Science 2025-01-07 Yannis Y. He