Related papers: Latent Discretization for Continuous-time Sequence…

Deep Generative Video Compression

The usage of deep generative models for image compression has led to impressive performance gains over classical codecs while neural video compression is still in its infancy. Here, we propose an end-to-end, deep generative modeling…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Jun Han , Salvator Lombardo , Christopher Schroers , Stephan Mandt

DeepZip: Lossless Data Compression using Recurrent Neural Networks

Sequential data is being generated at an unprecedented pace in various forms, including text and genomic data. This creates the need for efficient compression mechanisms to enable better storage, transmission and processing of such data. To…

Computation and Language · Computer Science 2018-11-21 Mohit Goyal , Kedar Tatwawadi , Shubham Chandak , Idoia Ochoa

Fast Decoding in Sequence Models using Discrete Latent Variables

Autoregressive sequence models based on deep neural networks, such as RNNs, Wavenet and the Transformer attain state-of-the-art results on many tasks. However, they are difficult to parallelize and are thus slow at processing long…

Machine Learning · Computer Science 2018-06-11 Łukasz Kaiser , Aurko Roy , Ashish Vaswani , Niki Parmar , Samy Bengio , Jakob Uszkoreit , Noam Shazeer

Neural Joint Source-Channel Coding

For reliable transmission across a noisy communication channel, classical results from information theory show that it is asymptotically optimal to separate out the source and channel coding processes. However, this decomposition can fall…

Machine Learning · Computer Science 2019-05-15 Kristy Choi , Kedar Tatwawadi , Aditya Grover , Tsachy Weissman , Stefano Ermon

Relaxed Quantization for Discretized Neural Networks

Neural network quantization has become an important research area due to its great impact on deployment of large models on resource constrained devices. In order to train networks that can be effectively discretized without loss of…

Machine Learning · Computer Science 2018-10-05 Christos Louizos , Matthias Reisser , Tijmen Blankevoort , Efstratios Gavves , Max Welling

Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning

Recurrent neural networks have a strong inductive bias towards learning temporally compressed representations, as the entire history of a sequence is represented by a single vector. By contrast, Transformers have little inductive bias…

Machine Learning · Computer Science 2022-10-26 Aniket Didolkar , Kshitij Gupta , Anirudh Goyal , Nitesh B. Gundavarapu , Alex Lamb , Nan Rosemary Ke , Yoshua Bengio

Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation

Neural audio codecs, used as speech tokenizers, have demonstrated remarkable potential in the field of speech generation. However, to ensure high-fidelity audio reconstruction, neural audio codecs typically encode audio into long sequences…

Audio and Speech Processing · Electrical Eng. & Systems 2025-06-02 Wenrui Liu , Qian Chen , Wen Wang , Yafeng Chen , Jin Xu , Zhifang Guo , Guanrou Yang , Weiqin Li , Xiaoda Yang , Tao Jin , Minghui Fang , Jialong Zuo , Bai Jionghao , Zemin Liu

A Novel Sequential Coreset Method for Gradient Descent Algorithms

A wide range of optimization problems arising in machine learning can be solved by gradient descent algorithms, and a central question in this area is how to efficiently compress a large-scale dataset so as to reduce the computational…

Machine Learning · Computer Science 2022-10-11 Jiawei Huang , Ruomin Huang , Wenjie Liu , Nikolaos M. Freris , Hu Ding

Differentiable Fine-grained Quantization for Deep Neural Network Compression

Neural networks have shown great performance in cognitive tasks. When deploying network models on mobile devices with limited resources, weight quantization has been widely adopted. Binary quantization obtains the highest compression but…

Computer Vision and Pattern Recognition · Computer Science 2018-11-14 Hsin-Pai Cheng , Yuanjun Huang , Xuyang Guo , Yifei Huang , Feng Yan , Hai Li , Yiran Chen

Discretizing Logged Interaction Data Biases Learning for Decision-Making

Time series data that are not measured at regular intervals are commonly discretized as a preprocessing step. For example, data about customer arrival times might be simplified by summing the number of arrivals within hourly intervals,…

Machine Learning · Statistics 2018-10-09 Peter Schulam , Suchi Saria

Variance-based Gradient Compression for Efficient Distributed Deep Learning

Due to the substantial computational cost, training state-of-the-art deep neural networks for large-scale datasets often requires distributed training using multiple computation workers. However, by nature, workers need to frequently…

Machine Learning · Computer Science 2018-02-21 Yusuke Tsuzuku , Hiroto Imachi , Takuya Akiba

Neural Compression-Based Feature Learning for Video Restoration

How to efficiently utilize the temporal features is crucial, yet challenging, for video restoration. The temporal features usually contain various noisy and uncorrelated information, and they may interfere with the restoration of the…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Cong Huang , Jiahao Li , Bin Li , Dong Liu , Yan Lu

Learned Compression for Images and Point Clouds

Over the last decade, deep learning has shown great success at performing computer vision tasks, including classification, super-resolution, and style transfer. Now, we apply it to data compression to help build the next generation of…

Image and Video Processing · Electrical Eng. & Systems 2024-09-16 Mateen Ulhaq

Utilising Low Complexity CNNs to Lift Non-Local Redundancies in Video Coding

Digital media is ubiquitous and produced in ever-growing quantities. This necessitates a constant evolution of compression techniques, especially for video, in order to maintain efficient storage and transmission. In this work, we aim at…

Image and Video Processing · Electrical Eng. & Systems 2020-04-29 Jan P. Klopp , Liang-Gee Chen , Shao-Yi Chien

Differentiable Resolution Compression and Alignment for Efficient Video Classification and Retrieval

Optimizing video inference efficiency has become increasingly important with the growing demand for video analysis in various fields. Some existing methods achieve high efficiency by explicit discard of spatial or temporal information,…

Computer Vision and Pattern Recognition · Computer Science 2023-09-18 Rui Deng , Qian Wu , Yuke Li , Haoran Fu

Semantic Segmentation in Learned Compressed Domain

Most machine vision tasks (e.g., semantic segmentation) are based on images encoded and decoded by image compression algorithms (e.g., JPEG). However, these decoded images in the pixel domain introduce distortion, and they are optimized for…

Computer Vision and Pattern Recognition · Computer Science 2022-09-07 Jinming Liu , Heming Sun , Jiro Katto

Balancing Specialization, Generalization, and Compression for Detection and Tracking

We propose a method for specializing deep detectors and trackers to restricted settings. Our approach is designed with the following goals in mind: (a) Improving accuracy in restricted domains; (b) preventing overfitting to new domains and…

Computer Vision and Pattern Recognition · Computer Science 2019-09-26 Dotan Kaufman , Koby Bibas , Eran Borenstein , Michael Chertok , Tal Hassner

Discretization of Time Series Data

Data discretization, also known as binning, is a frequently used technique in computer science, statistics, and their applications to biological data analysis. We present a new method for the discretization of real-valued data into a finite…

Other Quantitative Biology · Quantitative Biology 2007-05-23 Elena S. Dimitrova , John J. McGee , Reinhard C. Laubenbacher

Dilated Temporal Fully-Convolutional Network for Semantic Segmentation of Motion Capture Data

Semantic segmentation of motion capture sequences plays a key part in many data-driven motion synthesis frameworks. It is a preprocessing step in which long recordings of motion capture sequences are partitioned into smaller segments.…

Computer Vision and Pattern Recognition · Computer Science 2018-07-17 Noshaba Cheema , Somayeh Hosseini , Janis Sprenger , Erik Herrmann , Han Du , Klaus Fischer , Philipp Slusallek

A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection

Training and deploying deepfake detection models on edge devices offers the advantage of maintaining data privacy and confidentiality by processing it close to its source. However, this approach is constrained by the limited computational…

Machine Learning · Computer Science 2025-05-01 Andreas Karathanasis , John Violos , Ioannis Kompatsiaris , Symeon Papadopoulos