Related papers: 4X4 Census Transform

Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs

We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few large convolutional kernels instead of a stack of small…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Xiaohan Ding, Xiangyu Zhang, Yizhuang Zhou, Jungong Han, Guiguang Ding, Jian Sun

Efficient and Invariant Convolutional Neural Networks for Dense Prediction

Convolutional neural networks have shown great success on feature extraction from raw input data such as images. Although convolutional neural networks are invariant to translations on the inputs, they are not invariant to other…

Computer Vision and Pattern Recognition · Computer Science 2018-09-05 Hongyang Gao, Shuiwang Ji

Hyper-Convolutions via Implicit Kernels for Medical Imaging

The convolutional neural network (CNN) is one of the most commonly used architectures for computer vision tasks. The key building block of a CNN is the convolutional kernel that aggregates information from the pixel neighborhood and shares…

Image and Video Processing · Electrical Eng. & Systems 2022-02-08 Tianyu Ma, Alan Q. Wang, Adrian V. Dalca, Mert R. Sabuncu

Deformable Kernel Networks for Joint Image Filtering

Joint image filters are used to transfer structural details from a guidance picture used as a prior to a target image, in tasks such as enhancing spatial resolution and suppressing noise. Previous methods based on convolutional neural…

Computer Vision and Pattern Recognition · Computer Science 2020-10-22 Beomjun Kim, Jean Ponce, Bumsub Ham

An Alternative Approach of Steganography using Reference Image

This paper aims to create a practical steganographic implementation for 4-bit images. The proposed technique converts a 4-bit image into a 4-shade grayscale image. This image acts as the reference image used to hide the text. Using this grayscale…

Multimedia · Computer Science 2014-08-17 Samir Kumar Bandyopadhyay, Indra Kanta Maitra

Convolutional Kernel Networks

An important goal in visual recognition is to devise image representations that are invariant to particular transformations. In this paper, we address this goal with a new type of convolutional neural network (CNN) whose invariance is…

Computer Vision and Pattern Recognition · Computer Science 2015-01-08 Julien Mairal, Piotr Koniusz, Zaid Harchaoui, Cordelia Schmid

MixModule: Mixed CNN Kernel Module for Medical Image Segmentation

Convolutional neural networks (CNNs) have been successfully applied to medical image classification, segmentation, and related tasks. Among the many CNN architectures, U-Net and its improved versions are widely used and achieve…

Computer Vision and Pattern Recognition · Computer Science 2020-02-27 Henry H. Yu, Xue Feng, Hao Sun, Ziwen Wang

Integrating Large Circular Kernels into CNNs through Neural Architecture Search

The square kernel is a standard unit for contemporary CNNs, as it fits well on the tensor computation for convolution operation. However, the retinal ganglion cells in the biological visual system have approximately concentric receptive…

Computer Vision and Pattern Recognition · Computer Science 2022-04-19 Kun He, Chao Li, Yixiao Yang, Gao Huang, John E. Hopcroft

XProspeCT: CT Volume Generation from Paired X-Rays

Computed tomography (CT) is a beneficial imaging tool for diagnostic purposes. CT scans provide detailed information concerning the internal anatomic structures of a patient, but present higher radiation dose and costs compared to X-ray…

Image and Video Processing · Electrical Eng. & Systems 2024-03-05 Benjamin Paulson, Joshua Goldshteyn, Sydney Balboni, John Cisler, Andrew Crisler, Natalia Bukowski, Julia Kalish, Theodore Colwell

MedNeXt: Transformer-driven Scaling of ConvNets for Medical Image Segmentation

There has been exploding interest in embracing Transformer-based architectures for medical image segmentation. However, the lack of large-scale annotated medical datasets makes achieving performance equivalent to that in natural images…

Image and Video Processing · Electrical Eng. & Systems 2024-06-04 Saikat Roy, Gregor Koehler, Constantin Ulrich, Michael Baumgartner, Jens Petersen, Fabian Isensee, Paul F. Jaeger, Klaus Maier-Hein

More for Less: Compact Convolutional Transformers Enable Robust Medical Image Classification with Limited Data

Transformers are very powerful tools for a variety of tasks across domains, from text generation to image captioning. However, transformers require substantial amounts of training data, which is often a challenge in biomedical settings,…

Computer Vision and Pattern Recognition · Computer Science 2023-07-04 Andrew Kean Gao

A DCT Approximation for Image Compression

An orthogonal approximation for the 8-point discrete cosine transform (DCT) is introduced. The proposed transformation matrix contains only zeros and ones; multiplications and bit-shift operations are absent. Close spectral behavior…

Multimedia · Computer Science 2014-02-26 R. J. Cintra, F. M. Bayer

Convolution with even-sized kernels and symmetric padding

Compact convolutional neural networks gain efficiency mainly through depthwise convolutions, expanded channels and complex topologies, which contrarily aggravate the training process. Besides, 3x3 kernels dominate the spatial representation…

Computer Vision and Pattern Recognition · Computer Science 2019-05-23 Shuang Wu, Guanrui Wang, Pei Tang, Feng Chen, Luping Shi

Rethinking Spatial Invariance of Convolutional Networks for Object Counting

Previous work generally holds that improving the spatial invariance of convolutional networks is the key to object counting. However, after verifying several mainstream counting networks, we surprisingly found that too strict pixel-level…

Computer Vision and Pattern Recognition · Computer Science 2022-08-19 Zhi-Qi Cheng, Qi Dai, Hong Li, JingKuan Song, Xiao Wu, Alexander G. Hauptmann

DCT-like Transform for Image Compression Requires 14 Additions Only

A low-complexity 8-point orthogonal approximate DCT is introduced. The proposed transform requires no multiplications or bit-shift operations. The derived fast algorithm requires only 14 additions, less than any existing DCT approximation.…

Multimedia · Computer Science 2017-02-06 F. M. Bayer, R. J. Cintra

A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

This work proposes the design of an optimal inverse discrete cosine transform (IDCT) that compensates for quantization error, for effective lossy image compression. The forward and inverse DCTs are designed as a pair in current image/video…

Multimedia · Computer Science 2021-02-02 Yifan Wang, Zhanxuan Mei, Chia-Yang Tsai, Ioannis Katsavounidis, C.-C. Jay Kuo

Blind Deconvolution for Color Images Using Normalized Quaternion Kernels

In this work, we address the challenging problem of blind deconvolution for color images. Existing methods often convert color images to grayscale or process each color channel separately, which overlooks the relationships between color…

Computer Vision and Pattern Recognition · Computer Science 2025-11-24 Yuming Yang, Michael K. Ng, Zhigang Jia, Wei Wang

Trans$^2$-CBCT: A Dual-Transformer Framework for Sparse-View CBCT Reconstruction

Cone-beam computed tomography (CBCT) using only a few X-ray projection views enables faster scans with lower radiation dose, but the resulting severe under-sampling causes strong artifacts and poor spatial coverage. We address these…

Image and Video Processing · Electrical Eng. & Systems 2025-06-25 Minmin Yang, Huantao Ren, Senem Velipasalar

A-CCNN: Adaptive CCNN for Density Estimation and Crowd Counting

Crowd counting, for estimating the number of people in a crowd using vision-based computer techniques, has attracted much interest in the research community. Although many attempts have been reported, real-world problems, such as huge…

Computer Vision and Pattern Recognition · Computer Science 2018-04-23 Saeed Amirgholipour Kasmani, Xiangjian He, Wenjing Jia, Dadong Wang, Michelle Zeibots

Computed Tomography Image Enhancement using 3D Convolutional Neural Network

Computed tomography (CT) is increasingly being used for cancer screening, such as early detection of lung cancer. However, CT studies have varying pixel spacing due to differences in acquisition parameters. Thick slice CTs have lower…

Computer Vision and Pattern Recognition · Computer Science 2018-07-19 Meng Li, Shiwen Shen, Wen Gao, William Hsu, Jason Cong