Related papers: A Coprocessor for Accelerating Visual Information …

A brief experience on journey through hardware developments for image processing and its applications on Cryptography

The importance of embedded applications on image and video processing,communication and cryptography domain has been taking a larger space in current research era. Improvement of pictorial information for betterment of human perception like…

Hardware Architecture · Computer Science 2013-01-14 Sangeet Saha , Chandrajit pal , Rourab paul , Satyabrata Maity , Suman Sau

Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases

Today's computing systems require moving data back-and-forth between computing resources (e.g., CPUs, GPUs, accelerators) and off-chip main memory so that computation can take place on the data. Unfortunately, this data movement is a major…

Hardware Architecture · Computer Science 2022-05-31 Geraldo F. Oliveira , Amirali Boroumand , Saugata Ghose , Juan Gómez-Luna , Onur Mutlu

Rethinking Co-design of Neural Architectures and Hardware Accelerators

Neural architectures and hardware accelerators have been two driving forces for the progress in deep learning. Previous works typically attempt to optimize hardware given a fixed model architecture or model architecture given fixed…

Machine Learning · Computer Science 2021-02-18 Yanqi Zhou , Xuanyi Dong , Berkin Akin , Mingxing Tan , Daiyi Peng , Tianjian Meng , Amir Yazdanbakhsh , Da Huang , Ravi Narayanaswami , James Laudon

Streaming-capable High-performance Architecture of Learned Image Compression Codecs

Learned image compression allows achieving state-of-the-art accuracy and compression ratios, but their relatively slow runtime performance limits their usage. While previous attempts on optimizing learned image codecs focused more on the…

Image and Video Processing · Electrical Eng. & Systems 2022-08-04 Fangzheng Lin , Heming Sun , Jiro Katto

ImaGen: A General Framework for Generating Memory- and Power-Efficient Image Processing Accelerators

Image processing algorithms are prime targets for hardware acceleration as they are commonly used in resource- and power-limited applications. Today's image processing accelerator designs make rigid assumptions about the algorithm…

Hardware Architecture · Computer Science 2023-04-10 Nisarg Ujjainkar , Jingwen Leng , Yuhao Zhu

AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization

In this study, we introduce a new problem raised by social media and photojournalism, named Image Address Localization (IAL), which aims to predict the readable textual address where an image was taken. Existing two-stage approaches involve…

Computer Vision and Pattern Recognition · Computer Science 2024-07-12 Shixiong Xu , Chenghao Zhang , Lubin Fan , Gaofeng Meng , Shiming Xiang , Jieping Ye

A Reconfigurable Convolution-in-Pixel CMOS Image Sensor Architecture

The separation of the data capture and analysis in modern vision systems has led to a massive amount of data transfer between the end devices and cloud computers, resulting in long latency, slow response, and high power consumption.…

Image and Video Processing · Electrical Eng. & Systems 2024-08-13 Ruibing Song , Kejie Huang , Zongsheng Wang , Haibin Shen

Striving for Faster and Better: A One-Layer Architecture with Auto Re-parameterization for Low-Light Image Enhancement

Deep learning-based low-light image enhancers have made significant progress in recent years, with a trend towards achieving satisfactory visual quality while gradually reducing the number of parameters and improving computational…

Computer Vision and Pattern Recognition · Computer Science 2025-02-28 Nan An , Long Ma , Guangchao Han , Xin Fan , RIsheng Liu

ConvPIM: Evaluating Digital Processing-in-Memory through Convolutional Neural Network Acceleration

Processing-in-memory (PIM) architectures are emerging to reduce data movement in data-intensive applications. These architectures seek to exploit the same physical devices for both information storage and logic, thereby dwarfing the…

Hardware Architecture · Computer Science 2023-05-09 Orian Leitersdorf , Ronny Ronen , Shahar Kvatinsky

Active Learning Pipeline for Brain Mapping in a High Performance Computing Environment

This paper describes a scalable active learning pipeline prototype for large-scale brain mapping that leverages high performance computing power. It enables high-throughput evaluation of algorithm results, which, after human review, are…

Image and Video Processing · Electrical Eng. & Systems 2020-06-29 Adam Michaleas , Lars A. Gjesteby , Michael Snyder , David Chavez , Meagan Ash , Matthew A. Melton , Damon G. Lamb , Sara N. Burke , Kevin J. Otto , Lee Kamentsky , Webster Guan , Kwanghun Chung , Laura J. Brattain

Neuromorphic-P2M: Processing-in-Pixel-in-Memory Paradigm for Neuromorphic Image Sensors

Edge devices equipped with computer vision must deal with vast amounts of sensory data with limited computing resources. Hence, researchers have been exploring different energy-efficient solutions such as near-sensor processing, in-sensor…

Image and Video Processing · Electrical Eng. & Systems 2023-01-24 Md Abdullah-Al Kaiser , Gourav Datta , Zixu Wang , Ajey P. Jacob , Peter A. Beerel , Akhilesh R. Jaiswal

Improving Continuous Sign Language Recognition with Adapted Image Models

The increase of web-scale weakly labelled image-text pairs have greatly facilitated the development of large-scale vision-language models (e.g., CLIP), which have shown impressive generalization performance over a series of downstream…

Computer Vision and Pattern Recognition · Computer Science 2024-04-15 Lianyu Hu , Tongkai Shi , Liqing Gao , Zekang Liu , Wei Feng

CODEBench: A Neural Architecture and Hardware Accelerator Co-Design Framework

Recently, automated co-design of machine learning (ML) models and accelerator architectures has attracted significant attention from both the industry and academia. However, most co-design frameworks either explore a limited search space or…

Hardware Architecture · Computer Science 2022-12-09 Shikhar Tuli , Chia-Hao Li , Ritvik Sharma , Niraj K. Jha

Efficient Convolutional Neural Networks for Pixelwise Classification on Heterogeneous Hardware Systems

This work presents and analyzes three convolutional neural network (CNN) models for efficient pixelwise classification of images. When using convolutional neural networks to classify single pixels in patches of a whole image, a lot of…

Computer Vision and Pattern Recognition · Computer Science 2015-09-14 Fabian Tschopp

Programming Heterogeneous Systems from an Image Processing DSL

Specialized image processing accelerators are necessary to deliver the performance and energy efficiency required by important applications in computer vision, computational photography, and augmented reality. But creating,…

Software Engineering · Computer Science 2016-11-01 Jing Pu , Steven Bell , Xuan Yang , Jeff Setter , Stephen Richardson , Jonathan Ragan-Kelley , Mark Horowitz

Uni-Render: A Unified Accelerator for Real-Time Rendering Across Diverse Neural Renderers

Recent advancements in neural rendering technologies and their supporting devices have paved the way for immersive 3D experiences, significantly transforming human interaction with intelligent devices across diverse applications. However,…

Graphics · Computer Science 2025-04-01 Chaojian Li , Sixu Li , Linrui Jiang , Jingqun Zhang , Yingyan Celine Lin

LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation

Pixel-wise semantic segmentation for visual scene understanding not only needs to be accurate, but also efficient in order to find any use in real-time application. Existing algorithms even though are accurate but they do not focus on…

Computer Vision and Pattern Recognition · Computer Science 2018-04-03 Abhishek Chaurasia , Eugenio Culurciello

Joint Architecture-Token-Bitwidth Multi-Axis Optimization of Vision Transformers for Semiconductor IC Packaging

Vision Transformers (ViTs) have achieved strong performance in visual recognition, yet their deployment in resource-constrained industrial environments remains limited. Some main challenges are their high computational cost, memory…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Phat Nguyen , Xue Geng , Kaixin Xu , Wang Zhe , Xulei Yang , Ngai-Man Cheung

AutoRAC: Automated Processing-in-Memory Accelerator Design for Recommender Systems

The performance bottleneck of deep-learning-based recommender systems resides in their backbone Deep Neural Networks. By integrating Processing-In-Memory~(PIM) architectures, researchers can reduce data movement and enhance energy…

Hardware Architecture · Computer Science 2025-05-19 Feng Cheng , Tunhou Zhang , Junyao Zhang , Jonathan Hao-Cheng Ku , Yitu Wang , Xiaoxuan Yang , Hai , Li , Yiran Chen

Reconfiguring the Imaging Pipeline for Computer Vision

Advancements in deep learning have ignited an explosion of research on efficient hardware for embedded computer vision. Hardware vision acceleration, however, does not address the cost of capturing and processing the image data that feeds…

Computer Vision and Pattern Recognition · Computer Science 2017-08-03 Mark Buckler , Suren Jayasuriya , Adrian Sampson