Related papers: MIST: Multiple Instance Spatial Transformer Networ…

MIST: Multiple Instance Self-Training Framework for Video Anomaly Detection

Weakly supervised video anomaly detection (WS-VAD) is to distinguish anomalies from normal events based on discriminative representations. Most existing works are limited in insufficient video representations. In this work, we develop a…

Computer Vision and Pattern Recognition · Computer Science 2021-04-06 Jia-Chang Feng , Fa-Ting Hong , Wei-Shi Zheng

Multi-domain Integrative Swin Transformer network for Sparse-View Tomographic Reconstruction

Decreasing projection views to lower X-ray radiation dose usually leads to severe streak artifacts. To improve image quality from sparse-view data, a Multi-domain Integrative Swin Transformer network (MIST-net) was developed in this…

Image and Video Processing · Electrical Eng. & Systems 2022-04-18 Jiayi Pan , Heye Zhang , Weifei Wu , Zhifan Gao , Weiwen Wu

Learning to decompose for object detection and instance segmentation

Although deep convolutional neural networks(CNNs) have achieved remarkable results on object detection and segmentation, pre- and post-processing steps such as region proposals and non-maximum suppression(NMS), have been required. These…

Computer Vision and Pattern Recognition · Computer Science 2016-05-12 Eunbyung Park , Alexander C. Berg

Masked Image Modeling with Local Multi-Scale Reconstruction

Masked Image Modeling (MIM) achieves outstanding success in self-supervised representation learning. Unfortunately, MIM models typically have huge computational burden and slow learning process, which is an inevitable obstacle for their…

Computer Vision and Pattern Recognition · Computer Science 2023-03-10 Haoqing Wang , Yehui Tang , Yunhe Wang , Jianyuan Guo , Zhi-Hong Deng , Kai Han

Differentiable Patch Selection for Image Recognition

Neural Networks require large amounts of memory and compute to process high resolution images, even when only a small part of the image is actually informative for the task at hand. We propose a method based on a differentiable Top-K…

Computer Vision and Pattern Recognition · Computer Science 2021-04-08 Jean-Baptiste Cordonnier , Aravindh Mahendran , Alexey Dosovitskiy , Dirk Weissenborn , Jakob Uszkoreit , Thomas Unterthiner

Learning Deep Context-aware Features over Body and Latent Parts for Person Re-identification

Person Re-identification (ReID) is to identify the same person across different cameras. It is a challenging task due to the large variations in person pose, occlusion, background clutter, etc How to extract powerful features is a…

Computer Vision and Pattern Recognition · Computer Science 2017-10-19 Dangwei Li , Xiaotang Chen , Zhang Zhang , Kaiqi Huang

A Reinforcement Learning Approach for Sequential Spatial Transformer Networks

Spatial Transformer Networks (STN) can generate geometric transformations which modify input images to improve the classifier's performance. In this work, we combine the idea of STN with Reinforcement Learning (RL). To this end, we break…

Machine Learning · Computer Science 2021-06-29 Fatemeh Azimi , Federico Raue , Joern Hees , Andreas Dengel

A Multiclass Multiple Instance Learning Method with Exact Likelihood

We study a multiclass multiple instance learning (MIL) problem where the labels only suggest whether any instance of a class exists or does not exist in a training sample or example. No further information, e.g., the number of instances of…

Machine Learning · Statistics 2019-03-15 Xi-Lin Li

Deep Elastic Networks with Model Selection for Multi-Task Learning

In this work, we consider the problem of instance-wise dynamic network model selection for multi-task learning. To this end, we propose an efficient approach to exploit a compact but accurate model in a backbone architecture for each…

Computer Vision and Pattern Recognition · Computer Science 2019-09-12 Chanho Ahn , Eunwoo Kim , Songhwai Oh

Sill-Net: Feature Augmentation with Separated Illumination Representation

For visual object recognition tasks, the illumination variations can cause distinct changes in object appearance and thus confuse the deep neural network based recognition models. Especially for some rare illumination conditions, collecting…

Computer Vision and Pattern Recognition · Computer Science 2022-10-07 Haipeng Zhang , Zhong Cao , Ziang Yan , Changshui Zhang

FISTA-Net: Learning A Fast Iterative Shrinkage Thresholding Network for Inverse Problems in Imaging

Inverse problems are essential to imaging applications. In this paper, we propose a model-based deep learning network, named FISTA-Net, by combining the merits of interpretability and generality of the model-based Fast Iterative…

Image and Video Processing · Electrical Eng. & Systems 2021-01-26 Jinxi Xiang , Yonggui Dong , Yunjie Yang

Training of deep residual networks with stochastic MG/OPT

We train deep residual networks with a stochastic variant of the nonlinear multigrid method MG/OPT. To build the multilevel hierarchy, we use the dynamical systems viewpoint specific to residual networks. We report significant speed-ups and…

Machine Learning · Computer Science 2021-08-10 Cyrill von Planta , Alena Kopanicakova , Rolf Krause

Learning Fixation Point Strategy for Object Detection and Classification

We propose a novel recurrent attentional structure to localize and recognize objects jointly. The network can learn to extract a sequence of local observations with detailed appearance and rough context, instead of sliding windows or…

Computer Vision and Pattern Recognition · Computer Science 2017-12-20 Jie Lyu , Zejian Yuan , Dapeng Chen

MST: Masked Self-Supervised Transformer for Visual Representation

Transformer has been widely used for self-supervised pre-training in Natural Language Processing (NLP) and achieved great success. However, it has not been fully explored in visual self-supervised learning. Meanwhile, previous methods only…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Zhaowen Li , Zhiyang Chen , Fan Yang , Wei Li , Yousong Zhu , Chaoyang Zhao , Rui Deng , Liwei Wu , Rui Zhao , Ming Tang , Jinqiao Wang

Learning to count with deep object features

Learning to count is a learning strategy that has been recently proposed in the literature for dealing with problems where estimating the number of object instances in a scene is the final objective. In this framework, the task of learning…

Computer Vision and Pattern Recognition · Computer Science 2015-06-01 Santi Seguí , Oriol Pujol , Jordi Vitrià

Invertible residual networks in the context of regularization theory for linear inverse problems

Learned inverse problem solvers exhibit remarkable performance in applications like image reconstruction tasks. These data-driven reconstruction methods often follow a two-step scheme. First, one trains the often neural network-based…

Numerical Analysis · Mathematics 2023-12-21 Clemens Arndt , Alexander Denker , Sören Dittmer , Nick Heilenkötter , Meira Iske , Tobias Kluth , Peter Maass , Judith Nickel

MIRST-DM: Multi-Instance RST with Drop-Max Layer for Robust Classification of Breast Cancer

Robust self-training (RST) can augment the adversarial robustness of image classification models without significantly sacrificing models' generalizability. However, RST and other state-of-the-art defense approaches failed to preserve the…

Image and Video Processing · Electrical Eng. & Systems 2022-05-05 Shoukun Sun , Min Xian , Aleksandar Vakanski , Hossny Ghanem

SISL:Self-Supervised Image Signature Learning for Splicing Detection and Localization

Recent algorithms for image manipulation detection almost exclusively use deep network models. These approaches require either dense pixelwise groundtruth masks, camera ids, or image metadata to train the networks. On one hand, constructing…

Computer Vision and Pattern Recognition · Computer Science 2022-03-16 Susmit Agrawal , Prabhat Kumar , Siddharth Seth , Toufiq Parag , Maneesh Singh , Venkatesh Babu

Nested multi-instance classification

There are classification tasks that take as inputs groups of images rather than single images. In order to address such situations, we introduce a nested multi-instance deep network. The approach is generic in that it is applicable to…

Machine Learning · Statistics 2018-08-31 Alexander Stec , Diego Klabjan , Jean Utke

Solving MNIST with a globally trained Mixture of Quantum Experts

We propose a new quantum neural network for image classification, which is able to classify the parity of the MNIST dataset with full resolution with a test accuracy of up to 97.5% without any classical pre-processing or post-processing.…

Quantum Physics · Physics 2025-05-22 Paolo Alessandro Xavier Tognini , Leonardo Banchi , Giacomo De Palma