Related papers: Asynchronous Perception Machine For Efficient Test…

ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment

The advent of the Transformer architecture has propelled the growth of natural language processing (NLP) models, leading to remarkable achievements in numerous NLP tasks. Yet, the absence of specialized hardware like expansive GPU memory…

Distributed, Parallel, and Cluster Computing · Computer Science · 2024-03-18 · Xiaofeng Wu, Jia Rao, Wei Chen

FAPM: Fast Adaptive Patch Memory for Real-time Industrial Anomaly Detection

Feature embedding-based methods have shown exceptional performance in detecting industrial anomalies by comparing features of target images with normal images. However, some methods do not meet the speed requirements of real-time inference,…

Computer Vision and Pattern Recognition · Computer Science · 2023-03-27 · Donghyeong Kim, Chaewon Park, Suhwan Cho, Sangyoun Lee

Asynchronous Sharpness-Aware Minimization For Fast and Accurate Deep Learning

Sharpness-Aware Minimization (SAM) is an optimization method that improves generalization performance of machine learning models. Despite its superior generalization, SAM has not been actively used in real-world applications due to its…

Machine Learning · Computer Science · 2025-03-17 · Junhyuk Jo, Jihyun Lim, Sunwoo Lee

Adaptive Prompt Learning with SAM for Few-shot Scanning Probe Microscope Image Segmentation

The Segment Anything Model (SAM) has demonstrated strong performance in image segmentation of natural scene images. However, its effectiveness diminishes markedly when applied to specific scientific domains, such as Scanning Probe…

Computer Vision and Pattern Recognition · Computer Science · 2024-10-17 · Yao Shen, Ziwei Wei, Chunmeng Liu, Shuming Wei, Qi Zhao, Kaiyang Zeng, Guangyao Li

Adaptive Prototype Model for Attribute-based Multi-label Few-shot Action Recognition

In real-world action recognition systems, incorporating more attributes helps achieve a more comprehensive understanding of human behavior. However, using a single model to simultaneously recognize multiple attributes can lead to a decrease…

Computer Vision and Pattern Recognition · Computer Science · 2025-02-19 · Juefeng Xiao, Tianqi Xiang, Zhigang Tu

AMPNet: Asynchronous Model-Parallel Training for Dynamic Neural Networks

New types of machine learning hardware in development and entering the market hold the promise of revolutionizing deep learning in a manner as profound as GPUs. However, existing software frameworks and training algorithms for deep learning…

Machine Learning · Computer Science · 2017-06-23 · Alexander L. Gaunt, Matthew A. Johnson, Maik Riechert, Daniel Tarlow, Ryota Tomioka, Dimitrios Vytiniotis, Sam Webster

AToM: Amortized Text-to-Mesh using 2D Diffusion

We introduce Amortized Text-to-Mesh (AToM), a feed-forward text-to-mesh framework optimized across multiple text prompts simultaneously. In contrast to existing text-to-3D methods that often entail time-consuming per-prompt optimization and…

Computer Vision and Pattern Recognition · Computer Science · 2024-02-02 · Guocheng Qian, Junli Cao, Aliaksandr Siarohin, Yash Kant, Chaoyang Wang, Michael Vasilkovsky, Hsin-Ying Lee, Yuwei Fang, Ivan Skorokhodov, Peiye Zhuang, Igor Gilitschenski, Jian Ren, Bernard Ghanem, Kfir Aberman, Sergey Tulyakov

Time-Aware Feature Selection: Adaptive Temporal Masking for Stable Sparse Autoencoder Training

Understanding the internal representations of large language models is crucial for ensuring their reliability and safety, with sparse autoencoders (SAEs) emerging as a promising interpretability approach. However, current SAE training…

Machine Learning · Computer Science · 2025-10-13 · T. Ed Li, Junyu Ren

Beyond Human-prompting: Adaptive Prompt Tuning with Semantic Alignment for Anomaly Detection

Pre-trained Vision-Language Models (VLMs) have recently shown promise in detecting anomalies. However, previous approaches are fundamentally limited by their reliance on human-designed prompts and the lack of accessible anomaly samples,…

Computer Vision and Pattern Recognition · Computer Science · 2025-08-25 · Pi-Wei Chen, Jerry Chun-Wei Lin, Wei-Han Chen, Jia Ji, Zih-Ching Chen, Feng-Hao Yeh, Chao-Chun Chen

TTP: Test-Time Padding for Adversarial Detection and Robust Adaptation on Vision-Language Models

Vision-Language Models (VLMs), such as CLIP, have achieved impressive zero-shot recognition performance but remain highly susceptible to adversarial perturbations, posing significant risks in safety-critical scenarios. Previous…

Computer Vision and Pattern Recognition · Computer Science · 2026-03-24 · Zhiwei Li, Yitian Pang, Weining Wang, Zhenan Sun, Qi Li

Gated Associative Memory: A Parallel O(N) Architecture for Efficient Sequence Modeling

The Transformer architecture, underpinned by the self-attention mechanism, has become the de facto standard for sequence modeling tasks. However, its core computational primitive scales quadratically with sequence length (O(N^2)), creating…

Computation and Language · Computer Science · 2025-09-03 · Rishiraj Acharya

Towards Fine-Grained Robustness: Attention-Guided Test-Time Prompt Tuning for Vision-Language Models

Vision-Language Models (VLMs), such as CLIP, have achieved significant zero-shot performance on downstream tasks with various fine-tuning adaptation methods. However, recent studies have proven that adversarial attacks can significantly…

Computer Vision and Pattern Recognition · Computer Science · 2026-05-20 · Jia-Wei Hai, Yijun Wang, Xiu-Shen Wei

Accelerating Irregular Computations with Hardware Transactional Memory and Active Messages

We propose Atomic Active Messages (AAM), a mechanism that accelerates irregular graph computations on both shared- and distributed-memory machines. The key idea behind AAM is that hardware transactional memory (HTM) can be used for simple…

Distributed, Parallel, and Cluster Computing · Computer Science · 2020-11-02 · Maciej Besta, Torsten Hoefler

Advancing Textual Prompt Learning with Anchored Attributes

Textual-based prompt learning methods primarily employ multiple learnable soft prompts and hard class tokens in a cascading manner as text inputs, aiming to align image and text (category) spaces for downstream tasks. However, current…

Computer Vision and Pattern Recognition · Computer Science · 2025-07-22 · Zheng Li, Yibing Song, Ming-Ming Cheng, Xiang Li, Jian Yang

Machine Learning Enabled Scalable Performance Prediction of Scientific Codes

We present the Analytical Memory Model with Pipelines (AMMP) of the Performance Prediction Toolkit (PPT). PPT-AMMP takes high-level source code and hardware architecture parameters as input, predicts runtime of that code on the target…

Performance · Computer Science · 2020-11-16 · Gopinath Chennupati, Nandakishore Santhi, Phill Romero, Stephan Eidenbenz

MPM: Mutual Pair Merging for Efficient Vision Transformers

Decreasing sequence length is a common way to accelerate transformers, but prior token reduction work often targets classification and reports proxy metrics rather than end-to-end latency. For semantic segmentation, token reduction is…

Computer Vision and Pattern Recognition · Computer Science · 2026-04-08 · Simon Ravé, Pejman Rasti, David Rousseau

Analogical Trajectory Transfer

We study analogical trajectory transfer, where the goal is to translate motion trajectories in one 3D environment to a semantically analogous location in another. Such a capacity would enable machines to perform analogical spatial…

Computer Vision and Pattern Recognition · Computer Science · 2026-05-15 · Junho Kim, Eun Sun Lee, Gwangtak Bae, Seunggu Kang, Young Min Kim

Unsupervised representation learning with recognition-parametrised probabilistic models

We introduce a new approach to probabilistic unsupervised learning based on the recognition-parametrised model (RPM): a normalised semi-parametric hypothesis class for joint distributions over observed and latent variables. Under the key…

Machine Learning · Computer Science · 2023-04-21 · William I. Walker, Hugo Soulat, Changmin Yu, Maneesh Sahani

CAD: Memory Efficient Convolutional Adapter for Segment Anything

The foundation model for image segmentation, Segment Anything (SAM), has been actively researched in various fields since its proposal. Various methods have been proposed to adapt SAM to specific domains, with one notable approach…

Computer Vision and Pattern Recognition · Computer Science · 2024-09-25 · Joohyeok Kim, Joonhyeon Song, Seohwan Yun, Seongho Yoon, Sangmin Lee

Asymmetric Feature Maps with Application to Sketch Based Retrieval

We propose a novel concept of asymmetric feature maps (AFM), which allows multiple kernels to be evaluated between a query and database entries without increasing the memory requirements. To demonstrate the advantages of the AFM method, we…

Computer Vision and Pattern Recognition · Computer Science · 2017-04-14 · Giorgos Tolias, Ondřej Chum