English
Related papers

Related papers: MIST: Multiple Instance Spatial Transformer Networ…

200 papers

Weakly supervised video anomaly detection (WS-VAD) is to distinguish anomalies from normal events based on discriminative representations. Most existing works are limited in insufficient video representations. In this work, we develop a…

Computer Vision and Pattern Recognition · Computer Science 2021-04-06 Jia-Chang Feng , Fa-Ting Hong , Wei-Shi Zheng

Decreasing projection views to lower X-ray radiation dose usually leads to severe streak artifacts. To improve image quality from sparse-view data, a Multi-domain Integrative Swin Transformer network (MIST-net) was developed in this…

Image and Video Processing · Electrical Eng. & Systems 2022-04-18 Jiayi Pan , Heye Zhang , Weifei Wu , Zhifan Gao , Weiwen Wu

Although deep convolutional neural networks(CNNs) have achieved remarkable results on object detection and segmentation, pre- and post-processing steps such as region proposals and non-maximum suppression(NMS), have been required. These…

Computer Vision and Pattern Recognition · Computer Science 2016-05-12 Eunbyung Park , Alexander C. Berg

Masked Image Modeling (MIM) achieves outstanding success in self-supervised representation learning. Unfortunately, MIM models typically have huge computational burden and slow learning process, which is an inevitable obstacle for their…

Computer Vision and Pattern Recognition · Computer Science 2023-03-10 Haoqing Wang , Yehui Tang , Yunhe Wang , Jianyuan Guo , Zhi-Hong Deng , Kai Han

Neural Networks require large amounts of memory and compute to process high resolution images, even when only a small part of the image is actually informative for the task at hand. We propose a method based on a differentiable Top-K…

Computer Vision and Pattern Recognition · Computer Science 2021-04-08 Jean-Baptiste Cordonnier , Aravindh Mahendran , Alexey Dosovitskiy , Dirk Weissenborn , Jakob Uszkoreit , Thomas Unterthiner

Person Re-identification (ReID) is to identify the same person across different cameras. It is a challenging task due to the large variations in person pose, occlusion, background clutter, etc How to extract powerful features is a…

Computer Vision and Pattern Recognition · Computer Science 2017-10-19 Dangwei Li , Xiaotang Chen , Zhang Zhang , Kaiqi Huang

Spatial Transformer Networks (STN) can generate geometric transformations which modify input images to improve the classifier's performance. In this work, we combine the idea of STN with Reinforcement Learning (RL). To this end, we break…

Machine Learning · Computer Science 2021-06-29 Fatemeh Azimi , Federico Raue , Joern Hees , Andreas Dengel

We study a multiclass multiple instance learning (MIL) problem where the labels only suggest whether any instance of a class exists or does not exist in a training sample or example. No further information, e.g., the number of instances of…

Machine Learning · Statistics 2019-03-15 Xi-Lin Li

In this work, we consider the problem of instance-wise dynamic network model selection for multi-task learning. To this end, we propose an efficient approach to exploit a compact but accurate model in a backbone architecture for each…

Computer Vision and Pattern Recognition · Computer Science 2019-09-12 Chanho Ahn , Eunwoo Kim , Songhwai Oh

For visual object recognition tasks, the illumination variations can cause distinct changes in object appearance and thus confuse the deep neural network based recognition models. Especially for some rare illumination conditions, collecting…

Computer Vision and Pattern Recognition · Computer Science 2022-10-07 Haipeng Zhang , Zhong Cao , Ziang Yan , Changshui Zhang

Inverse problems are essential to imaging applications. In this paper, we propose a model-based deep learning network, named FISTA-Net, by combining the merits of interpretability and generality of the model-based Fast Iterative…

Image and Video Processing · Electrical Eng. & Systems 2021-01-26 Jinxi Xiang , Yonggui Dong , Yunjie Yang

We train deep residual networks with a stochastic variant of the nonlinear multigrid method MG/OPT. To build the multilevel hierarchy, we use the dynamical systems viewpoint specific to residual networks. We report significant speed-ups and…

Machine Learning · Computer Science 2021-08-10 Cyrill von Planta , Alena Kopanicakova , Rolf Krause

We propose a novel recurrent attentional structure to localize and recognize objects jointly. The network can learn to extract a sequence of local observations with detailed appearance and rough context, instead of sliding windows or…

Computer Vision and Pattern Recognition · Computer Science 2017-12-20 Jie Lyu , Zejian Yuan , Dapeng Chen

Transformer has been widely used for self-supervised pre-training in Natural Language Processing (NLP) and achieved great success. However, it has not been fully explored in visual self-supervised learning. Meanwhile, previous methods only…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Zhaowen Li , Zhiyang Chen , Fan Yang , Wei Li , Yousong Zhu , Chaoyang Zhao , Rui Deng , Liwei Wu , Rui Zhao , Ming Tang , Jinqiao Wang

Learning to count is a learning strategy that has been recently proposed in the literature for dealing with problems where estimating the number of object instances in a scene is the final objective. In this framework, the task of learning…

Computer Vision and Pattern Recognition · Computer Science 2015-06-01 Santi Seguí , Oriol Pujol , Jordi Vitrià

Learned inverse problem solvers exhibit remarkable performance in applications like image reconstruction tasks. These data-driven reconstruction methods often follow a two-step scheme. First, one trains the often neural network-based…

Robust self-training (RST) can augment the adversarial robustness of image classification models without significantly sacrificing models' generalizability. However, RST and other state-of-the-art defense approaches failed to preserve the…

Image and Video Processing · Electrical Eng. & Systems 2022-05-05 Shoukun Sun , Min Xian , Aleksandar Vakanski , Hossny Ghanem

Recent algorithms for image manipulation detection almost exclusively use deep network models. These approaches require either dense pixelwise groundtruth masks, camera ids, or image metadata to train the networks. On one hand, constructing…

Computer Vision and Pattern Recognition · Computer Science 2022-03-16 Susmit Agrawal , Prabhat Kumar , Siddharth Seth , Toufiq Parag , Maneesh Singh , Venkatesh Babu

There are classification tasks that take as inputs groups of images rather than single images. In order to address such situations, we introduce a nested multi-instance deep network. The approach is generic in that it is applicable to…

Machine Learning · Statistics 2018-08-31 Alexander Stec , Diego Klabjan , Jean Utke

We propose a new quantum neural network for image classification, which is able to classify the parity of the MNIST dataset with full resolution with a test accuracy of up to 97.5% without any classical pre-processing or post-processing.…

Quantum Physics · Physics 2025-05-22 Paolo Alessandro Xavier Tognini , Leonardo Banchi , Giacomo De Palma
‹ Prev 1 2 3 10 Next ›