English
Related papers

Related papers: 4X4 Census Transform

200 papers

We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few large convolutional kernels instead of a stack of small…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Xiaohan Ding , Xiangyu Zhang , Yizhuang Zhou , Jungong Han , Guiguang Ding , Jian Sun

Convolutional neural networks have shown great success on feature extraction from raw input data such as images. Although convolutional neural networks are invariant to translations on the inputs, they are not invariant to other…

Computer Vision and Pattern Recognition · Computer Science 2018-09-05 Hongyang Gao , Shuiwang Ji

The convolutional neural network (CNN) is one of the most commonly used architectures for computer vision tasks. The key building block of a CNN is the convolutional kernel that aggregates information from the pixel neighborhood and shares…

Image and Video Processing · Electrical Eng. & Systems 2022-02-08 Tianyu Ma , Alan Q. Wang , Adrian V. Dalca , Mert R. Sabuncu

Joint image filters are used to transfer structural details from a guidance picture used as a prior to a target image, in tasks such as enhancing spatial resolution and suppressing noise. Previous methods based on convolutional neural…

Computer Vision and Pattern Recognition · Computer Science 2020-10-22 Beomjun Kim , Jean Ponce , Bumsub Ham

This paper is to create a practical steganographic implementation for 4-bit images.The proposed technique converts 4 bit image into 4 shaded Gray Scale image. This image will be act as reference image to hide the text. Using this grey scale…

Multimedia · Computer Science 2014-08-17 Samir Kumar Bandyopadhyay , Indra Kanta Maitra

An important goal in visual recognition is to devise image representations that are invariant to particular transformations. In this paper, we address this goal with a new type of convolutional neural network (CNN) whose invariance is…

Computer Vision and Pattern Recognition · Computer Science 2015-01-08 Julien Mairal , Piotr Koniusz , Zaid Harchaoui , Cordelia Schmid

Convolutional neural networks (CNNs) have been successfully applied to medical image classification, segmentation, and related tasks. Among the many CNNs architectures, U-Net and its improved versions based are widely used and achieve…

Computer Vision and Pattern Recognition · Computer Science 2020-02-27 Henry H. Yu , Xue Feng , Hao Sun , Ziwen Wang

The square kernel is a standard unit for contemporary CNNs, as it fits well on the tensor computation for convolution operation. However, the retinal ganglion cells in the biological visual system have approximately concentric receptive…

Computer Vision and Pattern Recognition · Computer Science 2022-04-19 Kun He , Chao Li , Yixiao Yang , Gao Huang , John E. Hopcroft

Computed tomography (CT) is a beneficial imaging tool for diagnostic purposes. CT scans provide detailed information concerning the internal anatomic structures of a patient, but present higher radiation dose and costs compared to X-ray…

Image and Video Processing · Electrical Eng. & Systems 2024-03-05 Benjamin Paulson , Joshua Goldshteyn , Sydney Balboni , John Cisler , Andrew Crisler , Natalia Bukowski , Julia Kalish , Theodore Colwell

There has been exploding interest in embracing Transformer-based architectures for medical image segmentation. However, the lack of large-scale annotated medical datasets make achieving performances equivalent to those in natural images…

Image and Video Processing · Electrical Eng. & Systems 2024-06-04 Saikat Roy , Gregor Koehler , Constantin Ulrich , Michael Baumgartner , Jens Petersen , Fabian Isensee , Paul F. Jaeger , Klaus Maier-Hein

Transformers are very powerful tools for a variety of tasks across domains, from text generation to image captioning. However, transformers require substantial amounts of training data, which is often a challenge in biomedical settings,…

Computer Vision and Pattern Recognition · Computer Science 2023-07-04 Andrew Kean Gao

An orthogonal approximation for the 8-point discrete cosine transform (DCT) is introduced. The proposed transformation matrix contains only zeros and ones; multiplications and bit-shift operations are absent. Close spectral behavior…

Multimedia · Computer Science 2014-02-26 R. J. Cintra , F. M. Bayer

Compact convolutional neural networks gain efficiency mainly through depthwise convolutions, expanded channels and complex topologies, which contrarily aggravate the training process. Besides, 3x3 kernels dominate the spatial representation…

Computer Vision and Pattern Recognition · Computer Science 2019-05-23 Shuang Wu , Guanrui Wang , Pei Tang , Feng Chen , Luping Shi

Previous work generally believes that improving the spatial invariance of convolutional networks is the key to object counting. However, after verifying several mainstream counting networks, we surprisingly found too strict pixel-level…

Computer Vision and Pattern Recognition · Computer Science 2022-08-19 Zhi-Qi Cheng , Qi Dai , Hong Li , JingKuan Song , Xiao Wu , Alexander G. Hauptmann

A low-complexity 8-point orthogonal approximate DCT is introduced. The proposed transform requires no multiplications or bit-shift operations. The derived fast algorithm requires only 14 additions, less than any existing DCT approximation.…

Multimedia · Computer Science 2017-02-06 F. M. Bayer , R. J. Cintra

The design of the optimal inverse discrete cosine transform (IDCT) to compensate the quantization error is proposed for effective lossy image compression in this work. The forward and inverse DCTs are designed in pair in current image/video…

Multimedia · Computer Science 2021-02-02 Yifan Wang , Zhanxuan Mei , Chia-Yang Tsai , Ioannis Katsavounidis , C. -C. Jay Kuo

In this work, we address the challenging problem of blind deconvolution for color images. Existing methods often convert color images to grayscale or process each color channel separately, which overlooking the relationships between color…

Computer Vision and Pattern Recognition · Computer Science 2025-11-24 Yuming Yang , Michael K. Ng , Zhigang Jia , Wei Wang

Cone-beam computed tomography (CBCT) using only a few X-ray projection views enables faster scans with lower radiation dose, but the resulting severe under-sampling causes strong artifacts and poor spatial coverage. We address these…

Image and Video Processing · Electrical Eng. & Systems 2025-06-25 Minmin Yang , Huantao Ren , Senem Velipasalar

Crowd counting, for estimating the number of people in a crowd using vision-based computer techniques, has attracted much interest in the research community. Although many attempts have been reported, real-world problems, such as huge…

Computer Vision and Pattern Recognition · Computer Science 2018-04-23 Saeed Amirgholipour Kasmani , Xiangjian He , Wenjing Jia , Dadong Wang , Michelle Zeibots

Computed tomography (CT) is increasingly being used for cancer screening, such as early detection of lung cancer. However, CT studies have varying pixel spacing due to differences in acquisition parameters. Thick slice CTs have lower…

Computer Vision and Pattern Recognition · Computer Science 2018-07-19 Meng Li , Shiwen Shen , Wen Gao , William Hsu , Jason Cong
‹ Prev 1 2 3 10 Next ›