Related papers: Efficient CNN-LSTM based Image Captioning using Ne…

Compressed Image Captioning using CNN-based Encoder-Decoder Framework

In today's world, image processing plays a crucial role across various fields, from scientific research to industrial applications. But one particularly exciting application is image captioning. The potential impact of effective image…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Md Alif Rahman Ridoy , M Mahmud Hasan , Shovon Bhowmick

Neural Image Captioning

In recent years, the biggest advances in major Computer Vision tasks, such as object recognition, handwritten-digit identification, facial recognition, and many others., have all come through the use of Convolutional Neural Networks (CNNs).…

Computation and Language · Computer Science 2019-07-05 Elaina Tan , Lakshay Sharma

Exploring Effective Mask Sampling Modeling for Neural Image Compression

Image compression aims to reduce the information redundancy in images. Most existing neural image compression methods rely on side information from hyperprior or context models to eliminate spatial redundancy, but rarely address the channel…

Computer Vision and Pattern Recognition · Computer Science 2023-06-12 Lin Liu , Mingming Zhao , Shanxin Yuan , Wenlong Lyu , Wengang Zhou , Houqiang Li , Yanfeng Wang , Qi Tian

Coreset-Based Neural Network Compression

We propose a novel Convolutional Neural Network (CNN) compression algorithm based on coreset representations of filters. We exploit the redundancies extant in the space of CNN weights and neuronal activations (across samples) in order to…

Computer Vision and Pattern Recognition · Computer Science 2018-07-27 Abhimanyu Dubey , Moitreya Chatterjee , Narendra Ahuja

Pre-Trained CNN Architecture for Transformer-Based Image Caption Generation Model

Automatic image captioning, a multifaceted task bridging computer vision and natural language processing, aims to generate descriptive textual content from visual input. While Convolutional Neural Networks (CNNs) and Long Short-Term Memory…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Amanuel Tafese Dufera

Neural Networks Compression for Language Modeling

In this paper, we consider several compression techniques for the language modeling problem based on recurrent neural networks (RNNs). It is known that conventional RNNs, e.g, LSTM-based networks in language modeling, are characterized with…

Machine Learning · Statistics 2019-04-09 Artem M. Grachev , Dmitry I. Ignatov , Andrey V. Savchenko

Image Captioning using Deep Stacked LSTMs, Contextual Word Embeddings and Data Augmentation

Image Captioning, or the automatic generation of descriptions for images, is one of the core problems in Computer Vision and has seen considerable progress using Deep Learning Techniques. We propose to use Inception-ResNet Convolutional…

Computer Vision and Pattern Recognition · Computer Science 2021-02-23 Sulabh Katiyar , Samir Kumar Borgohain

Image Captioning with Sparse Recurrent Neural Network

Recurrent Neural Network (RNN) has been widely used to tackle a wide variety of language generation problems and are capable of attaining state-of-the-art (SOTA) performance. However despite its impressive results, the large number of…

Computer Vision and Pattern Recognition · Computer Science 2019-10-29 Jia Huei Tan , Chee Seng Chan , Joon Huang Chuah

Compression strategies and space-conscious representations for deep neural networks

Recent advances in deep learning have made available large, powerful convolutional neural networks (CNN) with state-of-the-art performance in several real-world applications. Unfortunately, these large-sized models have millions of…

Machine Learning · Computer Science 2020-07-17 Giosuè Cataldo Marinò , Gregorio Ghidoli , Marco Frasca , Dario Malchiodi

CNN+CNN: Convolutional Decoders for Image Captioning

Image captioning is a challenging task that combines the field of computer vision and natural language processing. A variety of approaches have been proposed to achieve the goal of automatically describing an image, and recurrent neural…

Computer Vision and Pattern Recognition · Computer Science 2018-05-24 Qingzhong Wang , Antoni B. Chan

Boosting Neural Image Compression for Machines Using Latent Space Masking

Today, many image coding scenarios do not have a human as final intended user, but rather a machine fulfilling computer vision tasks on the decoded image. Thereby, the primary goal is not to keep visual quality but maintain the task…

Image and Video Processing · Electrical Eng. & Systems 2024-10-28 Kristian Fischer , Fabian Brand , André Kaup

On the Compression of Recurrent Neural Networks with an Application to LVCSR acoustic modeling for Embedded Speech Recognition

We study the problem of compressing recurrent neural networks (RNNs). In particular, we focus on the compression of RNN acoustic models, which are motivated by the goal of building compact and accurate speech recognition systems which can…

Computation and Language · Computer Science 2016-05-03 Rohit Prabhavalkar , Ouais Alsharif , Antoine Bruguier , Ian McGraw

Full Resolution Image Compression with Recurrent Neural Networks

This paper presents a set of full-resolution lossy image compression methods based on neural networks. Each of the architectures we describe can provide variable compression rates during deployment without requiring retraining of the…

Computer Vision and Pattern Recognition · Computer Science 2017-07-10 George Toderici , Damien Vincent , Nick Johnston , Sung Jin Hwang , David Minnen , Joel Shor , Michele Covell

Computationally Efficient Neural Image Compression

Image compression using neural networks have reached or exceeded non-neural methods (such as JPEG, WebP, BPG). While these networks are state of the art in ratedistortion performance, computational feasibility of these models remains a…

Image and Video Processing · Electrical Eng. & Systems 2019-12-19 Nick Johnston , Elad Eban , Ariel Gordon , Johannes Ballé

Compressing CNN models for resource-constrained systems by channel and layer pruning

Convolutional Neural Networks (CNNs) have achieved significant breakthroughs in various fields. However, these advancements have led to a substantial increase in the complexity and size of these networks. This poses a challenge when…

Machine Learning · Computer Science 2025-09-11 Ahmed Sadaqa , Di Liu

Neural Image Compression and Explanation

Explaining the prediction of deep neural networks (DNNs) and semantic image compression are two active research areas of deep learning with a numerous of applications in decision-critical systems, such as surveillance cameras, drones and…

Computer Vision and Pattern Recognition · Computer Science 2020-12-09 Xiang Li , Shihao Ji

On Efficient Neural Network Architectures for Image Compression

Recent advances in learning-based image compression typically come at the cost of high complexity. Designing computationally efficient architectures remains an open challenge. In this paper, we empirically investigate the impact of…

Image and Video Processing · Electrical Eng. & Systems 2024-06-18 Yichi Zhang , Zhihao Duan , Fengqing Zhu

An End-to-End Compression Framework Based on Convolutional Neural Networks

Deep learning, e.g., convolutional neural networks (CNNs), has achieved great success in image processing and computer vision especially in high level vision applications such as recognition and understanding. However, it is rarely used to…

Computer Vision and Pattern Recognition · Computer Science 2017-08-03 Feng Jiang , Wen Tao , Shaohui Liu , Jie Ren , Xun Guo , Debin Zhao

Coding for Computation: Efficient Compression of Neural Networks for Reconfigurable Hardware

As state of the art neural networks (NNs) continue to grow in size, their resource-efficient implementation becomes ever more important. In this paper, we introduce a compression scheme that reduces the number of computations required for…

Machine Learning · Computer Science 2025-04-25 Hans Rosenberger , Rodrigo Fischer , Johanna S. Fröhlich , Ali Bereyhi , Ralf R. Müller

Deep Image Compression via End-to-End Learning

We present a lossy image compression method based on deep convolutional neural networks (CNNs), which outperforms the existing BPG, WebP, JPEG2000 and JPEG as measured via multi-scale structural similarity (MS-SSIM), at the same bit rate.…

Image and Video Processing · Electrical Eng. & Systems 2018-06-06 Haojie Liu , Tong Chen , Qiu Shen , Tao Yue , Zhan Ma