Related papers: TOAST: Transformer Optimization using Adaptive and…

200 papers

The evolution toward 6G networks demands a fundamental shift from bit-centric transmission to semantic-aware communication that emphasizes task-relevant information. This work introduces TOAST (Task-Oriented Adaptive Semantic Transmission),…

Machine Learning · Computer Science 2025-06-30 Sheng Yun, Jianhua Pei, Ping Wang

Transfer learning involves adapting a pre-trained model to novel downstream tasks. However, we observe that current transfer learning methods often fail to focus on task-relevant features. In this work, we explore refocusing model attention…

Computer Vision and Pattern Recognition · Computer Science 2023-07-12 Baifeng Shi, Siyu Gai, Trevor Darrell, Xin Wang

Vision Transformers (ViTs) have demonstrated superior performance across a wide range of computer vision tasks. However, structured noise artifacts in their feature maps hinder downstream applications such as segmentation and depth…

Computer Vision and Pattern Recognition · Computer Science 2025-09-25 Sumit Mamtani

Vision Transformers (ViTs) have achieved remarkable success across various vision tasks, yet their deployment is often hindered by prohibitive computational costs. While structured weight pruning and token compression have emerged as…

Computer Vision and Pattern Recognition · Computer Science 2026-02-19 Hyunchan Moon, Cheonjun Park, Steven L. Waslander

Recently, foundation models based on Vision Transformers (ViTs) have become widely available. However, their fine-tuning process is highly resource-intensive, and it hinders their adoption in several edge or low-energy applications. To this…

Computer Vision and Pattern Recognition · Computer Science 2024-08-19 Alessio Devoto, Federico Alvetreti, Jary Pomponi, Paolo Di Lorenzo, Pasquale Minervini, Simone Scardapane

Future spacecraft and surface robotic missions require increasingly capable autonomy stacks for exploring challenging and unstructured domains, and trajectory optimization will be a cornerstone of such autonomy stacks. However, the…

Robotics · Computer Science 2024-09-18 Julia Briden, Changrak Choi, Kyongsik Yun, Richard Linares, Abhishek Cauligi

Current referring expression comprehension algorithms can effectively detect or segment objects indicated by nouns, but how to understand verb reference is still under-explored. As such, we study the challenging problem of task oriented…

Computer Vision and Pattern Recognition · Computer Science 2022-10-20 Pengfei Li, Beiwen Tian, Yongliang Shi, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Ya-Qin Zhang

Large-scale foundation models have demonstrated exceptional performance in language and vision tasks. However, the numerous dense matrix-vector operations involved in these large networks pose significant computational challenges during…

Machine Learning · Computer Science 2024-10-31 Changwoo Lee, Soo Min Kwon, Qing Qu, Hun-Seok Kim

Incremental learning aims to adapt to new sets of categories over time with minimal computational overhead. Prior work often addresses this task by training efficient task-specific adaptors that modify frozen layer weights or features to…

Computer Vision and Pattern Recognition · Computer Science 2025-03-17 Nazia Tasnim, Bryan A. Plummer

Can transformers generalize efficiently on problems that require dealing with examples with different levels of difficulty? We introduce a new task tailored to assess generalization over different complexities and present results that…

Despite their recent success, deep neural networks continue to perform poorly when they encounter distribution shifts at test time. Many recently proposed approaches try to counter this by aligning the model to the new distribution prior to…

Computer Vision and Pattern Recognition · Computer Science 2022-09-26 Samarth Sinha, Peter Gehler, Francesco Locatello, Bernt Schiele

Designing effective architectures is one of the key factors behind the success of deep neural networks. Existing deep architectures are either manually designed or automatically searched by some Neural Architecture Search (NAS) methods.…

Machine Learning · Computer Science 2020-01-14 Yong Guo, Yin Zheng, Mingkui Tan, Qi Chen, Jian Chen, Peilin Zhao, Junzhou Huang

This paper presents an efficient multi-scale vision Transformer, called ResT, that capably serves as a general-purpose backbone for image recognition. Unlike existing Transformer methods, which employ standard Transformer blocks to tackle…

Computer Vision and Pattern Recognition · Computer Science 2021-10-15 Qinglong Zhang, Yubin Yang

Diffusion models enable high-quality virtual try-on (VTO) with their established image synthesis abilities. Despite the extensive end-to-end training of large pre-trained models involved in current VTO methods, real-world applications often…

Computer Vision and Pattern Recognition · Computer Science 2025-09-18 Xingzi Xu, Qi Li, Shuwen Qiu, Julien Han, Karim Bouyarmane

Transformer-based architectures have achieved remarkable success in natural language processing and computer vision. However, their performance in multivariate long-term forecasting often falls short compared to simpler linear baselines.…

Machine Learning · Computer Science 2025-07-09 Dizhen Liang

Transformers have recently been adapted for large-scale image classification, achieving high scores and shaking up the long supremacy of convolutional neural networks. However, the optimization of image transformers has been little studied so…

Computer Vision and Pattern Recognition · Computer Science 2021-04-08 Hugo Touvron, Matthieu Cord, Alexandre Sablayrolles, Gabriel Synnaeve, Hervé Jégou

Adapting pretrained models typically involves a trade-off between the high training costs of backpropagation and the heavy inference overhead of memory-based or in-context learning. We propose FAAST, a forward-only associative adaptation…

Machine Learning · Computer Science 2026-05-11 Guangsheng Bao, Hongbo Zhang, Han Cui, Ke Sun, Yanbin Zhao, Juncai He, Yue Zhang

Vision transformers (ViTs) have recently been successfully applied to image classification tasks. In this paper, we show that, unlike convolutional neural networks (CNNs) that can be improved by stacking more convolutional layers, the…

Computer Vision and Pattern Recognition · Computer Science 2021-04-20 Daquan Zhou, Bingyi Kang, Xiaojie Jin, Linjie Yang, Xiaochen Lian, Zihang Jiang, Qibin Hou, Jiashi Feng

This paper presents a novel knowledge distillation neural architecture leveraging efficient transformer networks for effective image classification. Natural images display intricate arrangements encompassing numerous extraneous elements.…

Computer Vision and Pattern Recognition · Computer Science 2025-02-25 Dewan Tauhid Rahman, Yeahia Sarker, Antar Mazumder, Md. Shamim Anower

In the realm of neural architecture design, achieving high performance is largely reliant on the manual expertise of researchers. Despite the emergence of Neural Architecture Search (NAS) as a promising technique for automating this…

Machine Learning · Computer Science 2025-01-07 Yannis Y. He