Related papers: Asynchronous Perception Machine For Efficient Test…

200 papers

The advent of the Transformer architecture has propelled the growth of natural language processing (NLP) models, leading to remarkable achievements in numerous NLP tasks. Yet, the absence of specialized hardware like expansive GPU memory…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-18 Xiaofeng Wu, Jia Rao, Wei Chen

Feature embedding-based methods have shown exceptional performance in detecting industrial anomalies by comparing features of target images with normal images. However, some methods do not meet the speed requirements of real-time inference,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-27 Donghyeong Kim, Chaewon Park, Suhwan Cho, Sangyoun Lee

Sharpness-Aware Minimization (SAM) is an optimization method that improves generalization performance of machine learning models. Despite its superior generalization, SAM has not been actively used in real-world applications due to its…

Machine Learning · Computer Science 2025-03-17 Junhyuk Jo, Jihyun Lim, Sunwoo Lee

The Segment Anything Model (SAM) has demonstrated strong performance in image segmentation of natural scene images. However, its effectiveness diminishes markedly when applied to specific scientific domains, such as Scanning Probe…

Computer Vision and Pattern Recognition · Computer Science 2024-10-17 Yao Shen, Ziwei Wei, Chunmeng Liu, Shuming Wei, Qi Zhao, Kaiyang Zeng, Guangyao Li

In real-world action recognition systems, incorporating more attributes helps achieve a more comprehensive understanding of human behavior. However, using a single model to simultaneously recognize multiple attributes can lead to a decrease…

Computer Vision and Pattern Recognition · Computer Science 2025-02-19 Juefeng Xiao, Tianqi Xiang, Zhigang Tu

New types of machine learning hardware in development and entering the market hold the promise of revolutionizing deep learning in a manner as profound as GPUs. However, existing software frameworks and training algorithms for deep learning…

We introduce Amortized Text-to-Mesh (AToM), a feed-forward text-to-mesh framework optimized across multiple text prompts simultaneously. In contrast to existing text-to-3D methods that often entail time-consuming per-prompt optimization and…

Understanding the internal representations of large language models is crucial for ensuring their reliability and safety, with sparse autoencoders (SAEs) emerging as a promising interpretability approach. However, current SAE training…

Machine Learning · Computer Science 2025-10-13 T. Ed Li, Junyu Ren

Pre-trained Vision-Language Models (VLMs) have recently shown promise in detecting anomalies. However, previous approaches are fundamentally limited by their reliance on human-designed prompts and the lack of accessible anomaly samples,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-25 Pi-Wei Chen, Jerry Chun-Wei Lin, Wei-Han Chen, Jia Ji, Zih-Ching Chen, Feng-Hao Yeh, Chao-Chun Chen

Vision-Language Models (VLMs), such as CLIP, have achieved impressive zero-shot recognition performance but remain highly susceptible to adversarial perturbations, posing significant risks in safety-critical scenarios. Previous…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Zhiwei Li, Yitian Pang, Weining Wang, Zhenan Sun, Qi Li

The Transformer architecture, underpinned by the self-attention mechanism, has become the de facto standard for sequence modeling tasks. However, its core computational primitive scales quadratically with sequence length (O(N^2)), creating…

Computation and Language · Computer Science 2025-09-03 Rishiraj Acharya

Vision-Language Models (VLMs), such as CLIP, have achieved significant zero-shot performance on downstream tasks with various fine-tuning adaptation methods. However, recent studies have proven that adversarial attacks can significantly…

Computer Vision and Pattern Recognition · Computer Science 2026-05-20 Jia-Wei Hai, Yijun Wang, Xiu-Shen Wei

We propose Atomic Active Messages (AAM), a mechanism that accelerates irregular graph computations on both shared- and distributed-memory machines. The key idea behind AAM is that hardware transactional memory (HTM) can be used for simple…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-11-02 Maciej Besta, Torsten Hoefler

Textual-based prompt learning methods primarily employ multiple learnable soft prompts and hard class tokens in a cascading manner as text inputs, aiming to align image and text (category) spaces for downstream tasks. However, current…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Zheng Li, Yibing Song, Ming-Ming Cheng, Xiang Li, Jian Yang

We present the Analytical Memory Model with Pipelines (AMMP) of the Performance Prediction Toolkit (PPT). PPT-AMMP takes high-level source code and hardware architecture parameters as input, predicts runtime of that code on the target…

Performance · Computer Science 2020-11-16 Gopinath Chennupati, Nandakishore Santhi, Phill Romero, Stephan Eidenbenz

Decreasing sequence length is a common way to accelerate transformers, but prior token reduction work often targets classification and reports proxy metrics rather than end-to-end latency. For semantic segmentation, token reduction is…

Computer Vision and Pattern Recognition · Computer Science 2026-04-08 Simon Ravé, Pejman Rasti, David Rousseau

We study analogical trajectory transfer, where the goal is to translate motion trajectories in one 3D environment to a semantically analogous location in another. Such a capacity would enable machines to perform analogical spatial…

Computer Vision and Pattern Recognition · Computer Science 2026-05-15 Junho Kim, Eun Sun Lee, Gwangtak Bae, Seunggu Kang, Young Min Kim

We introduce a new approach to probabilistic unsupervised learning based on the recognition-parametrised model (RPM): a normalised semi-parametric hypothesis class for joint distributions over observed and latent variables. Under the key…

Machine Learning · Computer Science 2023-04-21 William I. Walker, Hugo Soulat, Changmin Yu, Maneesh Sahani

The foundation model for image segmentation, Segment Anything (SAM), has been actively researched in various fields since its proposal. Various studies have proposed adapting SAM to specific domains, with one notable approach…

Computer Vision and Pattern Recognition · Computer Science 2024-09-25 Joohyeok Kim, Joonhyeon Song, Seohwan Yun, Seongho Yoon, Sangmin Lee

We propose a novel concept of asymmetric feature maps (AFM), which allows multiple kernels to be evaluated between a query and database entries without increasing the memory requirements. To demonstrate the advantages of the AFM method, we…

Computer Vision and Pattern Recognition · Computer Science 2017-04-14 Giorgos Tolias, Ondřej Chum