Related papers: Query Time Optimized Deep Learning Based Video Inf…

Focus: Querying Large Video Datasets with Low Latency and Low Cost

Large volumes of videos are continuously recorded from cameras deployed for traffic control and surveillance with the goal of answering "after the fact" queries: identify video frames with objects of certain classes (cars, bags) from many…

Databases · Computer Science 2018-01-11 Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Paramvir Bahl, Matthai Philipose, Phillip B. Gibbons, Onur Mutlu

Accelerating Deep Learning Inference via Freezing

Over the last few years, Deep Neural Networks (DNNs) have become ubiquitous owing to their high accuracy on real-world tasks. However, this increase in accuracy comes at the cost of computationally expensive models leading to higher…

Machine Learning · Computer Science 2020-02-10 Adarsh Kumar, Arjun Balasubramanian, Shivaram Venkataraman, Aditya Akella

Focusing on What is Relevant: Time-Series Learning and Understanding using Attention

This paper is a contribution towards interpretability of the deep learning models in different applications of time-series. We propose a temporal attention layer that is capable of selecting the relevant information to perform various…

Computer Vision and Pattern Recognition · Computer Science 2018-06-25 Phongtharin Vinayavekhin, Subhajit Chaudhury, Asim Munawar, Don Joven Agravante, Giovanni De Magistris, Daiki Kimura, Ryuki Tachibana

Reduced-Order Modeling of Deep Neural Networks

We introduce a new method for speeding up the inference of deep neural networks. It is somewhat inspired by the reduced-order modeling techniques for dynamical systems. The cornerstone of the proposed method is the maximum volume algorithm.…

Machine Learning · Computer Science 2020-11-26 Julia Gusak, Talgat Daulbaev, Evgeny Ponomarev, Andrzej Cichocki, Ivan Oseledets

An automated approach for improving the inference latency and energy efficiency of pretrained CNNs by removing irrelevant pixels with focused convolutions

Computer vision often uses highly accurate Convolutional Neural Networks (CNNs), but these deep learning models are associated with ever-increasing energy and computation requirements. Producing more energy-efficient CNNs often requires…

Computer Vision and Pattern Recognition · Computer Science 2023-10-13 Caleb Tung, Nicholas Eliopoulos, Purvish Jajal, Gowri Ramshankar, Chen-Yun Yang, Nicholas Synovic, Xuecen Zhang, Vipin Chaudhary, George K. Thiruvathukal, Yung-Hsiang Lu

Dynamic Query Selection for Fast Visual Perceiver

Transformers have been matching deep convolutional networks for vision architectures in recent works. Most work is focused on getting the best results on large-scale benchmarks, and scaling laws seem to be the most successful strategy:…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Corentin Dancette, Matthieu Cord

Training Strategies for Vision Transformers for Object Detection

Vision-based Transformers have shown huge application in the perception module of autonomous driving in terms of predicting accurate 3D bounding boxes, owing to their strong capability in modeling long-range dependencies between the visual…

Computer Vision and Pattern Recognition · Computer Science 2023-04-06 Apoorv Singh

DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition

While transformers have shown great potential on video recognition with their strong capability of capturing long-range dependencies, they often suffer high computational costs induced by the self-attention to the huge number of 3D tokens.…

Computer Vision and Pattern Recognition · Computer Science 2022-11-23 Yuxuan Liang, Pan Zhou, Roger Zimmermann, Shuicheng Yan

CHAI: CacHe Attention Inference for text2video

Text-to-video diffusion models deliver impressive results but remain slow because of the sequential denoising of 3D latents. Existing approaches to speed up inference either require expensive model retraining or use heuristic-based step…

Computer Vision and Pattern Recognition · Computer Science 2026-02-19 Joel Mathew Cherian, Ashutosh Muralidhara Bharadwaj, Vima Gupta, Anand Padmanabha Iyer

THIA: Accelerating Video Analytics using Early Inference and Fine-Grained Query Planning

To efficiently process visual data at scale, researchers have proposed two techniques for lowering the computational overhead associated with the underlying deep learning models. The first approach consists of leveraging a specialized,…

Databases · Computer Science 2021-02-18 Jiashen Cao, Ramyad Hadidi, Joy Arulraj, Hyesoon Kim

FOSS: A Self-Learned Doctor for Query Optimizer

Various works have utilized deep learning to address the query optimization problem in database systems. They either learn to construct plans from scratch in a bottom-up manner or steer the plan generation behavior of traditional optimizer…

Databases · Computer Science 2024-08-15 Kai Zhong, Luming Sun, Tao Ji, Cuiping Li, Hong Chen

Using Learned Indexes to Improve Time Series Indexing Performance on Embedded Sensor Devices

Efficiently querying data on embedded sensor and IoT devices is challenging given the very limited memory and CPU resources. With the increasing volumes of collected data, it is critical to process, filter, and manipulate data on the edge…

Databases · Computer Science 2023-03-07 David Ding, Ivan Carvalho, Ramon Lawrence

Cascaded Deep Video Deblurring Using Temporal Sharpness Prior

We present a simple and effective deep convolutional neural network (CNN) model for video deblurring. The proposed algorithm mainly consists of optical flow estimation from intermediate latent frames and latent frame restoration steps. It…

Computer Vision and Pattern Recognition · Computer Science 2020-04-07 Jinshan Pan, Haoran Bai, Jinhui Tang

Efficient keyword spotting using time delay neural networks

This paper describes a novel method of live keyword spotting using a two-stage time delay neural network. The model is trained using transfer learning: initial training with phone targets from a large speech corpus is followed by training…

Audio and Speech Processing · Electrical Eng. & Systems 2018-08-29 Samuel Myer, Vikrant Singh Tomar

Cortex: A Compiler for Recursive Deep Learning Models

Optimizing deep learning models is generally performed in two steps: (i) high-level graph optimizations such as kernel fusion and (ii) low level kernel optimizations such as those found in vendor libraries. This approach often leaves…

Machine Learning · Computer Science 2021-03-08 Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C. Mowry

DeDelayed: Deleting Remote Inference Delay via On-Device Correction

Video comprises the vast majority of bits that are generated daily, and is the primary signal driving current innovations in robotics, remote sensing, and wearable technology. Yet, the most powerful video understanding models are too…

Image and Video Processing · Electrical Eng. & Systems 2026-04-03 Dan Jacobellis, Mateen Ulhaq, Fabien Racapé, Hyomin Choi, Neeraja J. Yadwadkar

Adaptive Neural Networks for Efficient Inference

We present an approach to adaptively utilize deep neural networks in order to reduce the evaluation time on new examples without loss of accuracy. Rather than attempting to redesign or approximate existing networks, we propose two schemes…

Machine Learning · Computer Science 2017-09-20 Tolga Bolukbasi, Joseph Wang, Ofer Dekel, Venkatesh Saligrama

Don't Trash your Intermediate Results, Cache 'em

In data warehouse and data mart systems, queries often take a long time to execute due to their complex nature. Query response times can be greatly improved by caching final/intermediate results of previous queries, and using them to answer…

Databases · Computer Science 2007-05-23 Prasan Roy, Krithi Ramamritham, S. Seshadri, Pradeep Shenoy, S. Sudarshan

Space-time Mixing Attention for Video Transformer

This paper is on video recognition using Transformers. Very recent attempts in this area have demonstrated promising results in terms of recognition accuracy, yet they have also been shown to induce, in many cases, significant computational…

Computer Vision and Pattern Recognition · Computer Science 2021-06-14 Adrian Bulat, Juan-Manuel Perez-Rua, Swathikiran Sudhakaran, Brais Martinez, Georgios Tzimiropoulos

EcoLens: Leveraging Multi-Objective Bayesian Optimization for Energy-Efficient Video Processing on Edge Devices

Video processing for real-time analytics in resource-constrained environments presents a significant challenge in balancing energy consumption and video semantics. This paper addresses the problem of energy-efficient video processing by…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Benjamin Civjan, Bo Chen, Ruixiao Zhang, Klara Nahrstedt