English
Related papers

Related papers: multi-patch aggregation models for resampling dete…

200 papers

In unmanned aerial systems, especially in complex environments, accurately detecting tiny objects is crucial. Resizing images is a common strategy to improve detection accuracy, particularly for small objects. However, simply enlarging…

Computer Vision and Pattern Recognition · Computer Science 2025-05-06 Luqi Gong , Haotian Chen , Yikun Chen , Tianliang Yao , Chao Li , Shuai Zhao , Guangjie Han

There are a variety of approaches to obtain a vast receptive field with convolutional neural networks (CNNs), such as pooling or striding convolutions. Most of these approaches were initially designed for image classification and later…

Computer Vision and Pattern Recognition · Computer Science 2020-06-16 Omid Hosseini Jafari , Carsten Rother

Most convolutional neural networks use some method for gradually downscaling the size of the hidden layers. This is commonly referred to as pooling, and is applied to reduce the number of parameters, improve invariance to certain…

Computer Vision and Pattern Recognition · Computer Science 2018-04-13 Faraz Saeedan , Nicolas Weber , Michael Goesele , Stefan Roth

Multi-vector embedding models have emerged as a powerful paradigm for document retrieval, preserving fine-grained visual and textual details through token-level representations. However, this expressiveness comes at a staggering cost:…

Information Retrieval · Computer Science 2026-01-13 Sungguk Cha , DongWook Kim , Mintae Kim , Youngsub Han , Byoung-Ki Jeon , Sangyeob Lee

Realistic image forgeries involve a combination of splicing, resampling, cloning, region removal and other methods. While resampling detection algorithms are effective in detecting splicing and resampling, copy-move detection algorithms…

Pooling layers are essential building blocks of convolutional neural networks (CNNs), to reduce computational overhead and increase the receptive fields of proceeding convolutional operations. Their goal is to produce downsampled volumes…

Computer Vision and Pattern Recognition · Computer Science 2022-12-05 Alexandros Stergiou , Ronald Poppe

Spatial downsampling layers are favored in convolutional neural networks (CNNs) to downscale feature maps for larger receptive fields and less memory consumption. However, for discriminative tasks, there is a possibility that these layers…

Computer Vision and Pattern Recognition · Computer Science 2019-08-28 Ziteng Gao , Limin Wang , Gangshan Wu

Image zooming or upsampling is a widely used tool in image processing and an essential step in many algorithms. Upsampling increases the number of pixels and introduces new information into the image, which can lead to numerical effects…

Numerical Analysis · Mathematics 2023-12-04 Bojan Crnković , Jerko Škifić , Tina Bosner

Pooling operations, which can be calculated at low cost and serve as a linear or nonlinear transfer function for data reduction, are found in almost every modern neural network. Countless modern approaches have already tackled replacing the…

Computer Vision and Pattern Recognition · Computer Science 2021-01-19 Wolfgang Fuhl , Enkelejda Kasneci

Non-regular sampling can reduce aliasing at the expense of noise. Recently, it has been shown that non-regular sampling can be carried out using a conventional regular imaging sensor when the surface of its individual pixels is partially…

Image and Video Processing · Electrical Eng. & Systems 2022-03-02 Simon Grosche , Jürgen Seiler , André Kaup

Pooling is a ubiquitous operation in image processing algorithms that allows for higher-level processes to collect relevant low-level features from a region of interest. Currently, max-pooling is one of the most commonly used operators in…

Computer Vision and Pattern Recognition · Computer Science 2020-11-09 Arash Akbarinia , Raquel Gil Rodríguez , C. Alejandro Parraga

In this paper, we present a novel deep learning-based approach for still image super-resolution, that unlike the mainstream models does not rely solely on the input low resolution image for high quality upsampling, and takes advantage of a…

Computer Vision and Pattern Recognition · Computer Science 2018-12-17 Farzad Toutounchi , Ebroul Izquierdo

Deep neural networks with alternating convolutional, max-pooling and decimation layers are widely used in state of the art architectures for computer vision. Max-pooling purposefully discards precise spatial information in order to create…

Computer Vision and Pattern Recognition · Computer Science 2016-04-19 Sina Honari , Jason Yosinski , Pascal Vincent , Christopher Pal

Image retargeting is the task of making images capable of being displayed on screens with different sizes. This work should be done so that high-level visual information and low-level features such as texture remain as intact as possible to…

Computer Vision and Pattern Recognition · Computer Science 2019-10-18 Mahdi Ahmadi , Nader Karimi , Shadrokh Samavi

In the framework of convolutional neural networks that lie at the heart of deep learning, downsampling is often performed with a max-pooling operation that only retains the element with maximum activation, while completely discarding the…

Computer Vision and Pattern Recognition · Computer Science 2018-04-17 Ashwani Kumar

State-of-the-art image segmentation algorithms generally consist of at least two successive and distinct computations: a boundary detection process that uses local image information to classify image locations as boundaries between objects,…

Computer Vision and Pattern Recognition · Computer Science 2016-11-03 Michał Januszewski , Jeremy Maitin-Shepard , Peter Li , Jörgen Kornfeld , Winfried Denk , Viren Jain

Pooling layers (e.g., max and average) may overlook important information encoded in the spatial arrangement of pixel intensity and/or feature values. We propose a novel lacunarity pooling layer that aims to capture the spatial…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Akshatha Mohan , Joshua Peeples

We introduce a novel loss max-pooling concept for handling imbalanced training data distributions, applicable as alternative loss layer in the context of deep neural networks for semantic image segmentation. Most real-world semantic…

Computer Vision and Pattern Recognition · Computer Science 2017-04-11 Samuel Rota Bulò , Gerhard Neuhold , Peter Kontschieder

Image classification is considered, and a hierarchical max-pooling model with additional local pooling is introduced. Here the additional local pooling enables the hierachical model to combine parts of the image which have a variable…

Computer Vision and Pattern Recognition · Computer Science 2021-06-10 Benjamin Walter

When a human matches two images, the viewer has a natural tendency to view the wide area around the target pixel to obtain clues of right correspondence. However, designing a matching cost function that works on a large window in the same…

Computer Vision and Pattern Recognition · Computer Science 2017-09-20 Haesol Park , Kyoung Mu Lee
‹ Prev 1 2 3 10 Next ›