Related papers: Optimizing L1 cache for embedded systems through g…

Multi-objective optimization of energy consumption and execution time in a single level cache memory for embedded systems

Current embedded systems are specifically designed to run multimedia applications. These applications have a big impact on both performance and energy consumption. Both metrics can be optimized selecting the best cache configuration for a…

Neural and Evolutionary Computing · Computer Science 2023-02-23 Josefa Díaz Álvarez , José L. Risco-Martín , J. Manuel Colmenar

Configurable memory systems for embedded many-core processors

The memory system of a modern embedded processor consumes a large fraction of total system energy. We explore a range of different configuration options and show that a reconfigurable design can make better use of the resources available to…

Hardware Architecture · Computer Science 2016-01-08 Daniel Bates , Alex Chadwick , Robert Mullins

Evolutionary Design of the Memory Subsystem

The memory hierarchy has a high impact on the performance and power consumption in the system. Moreover, current embedded systems, included in mobile devices, are specifically designed to run multimedia applications, which are memory…

Hardware Architecture · Computer Science 2023-03-29 Josefa Díaz Álvarez , José L. Risco-Martín , J. Manuel Colmenar

Dominant block guided optimal cache size estimation to maximize IPC of embedded software

Embedded system software is highly constrained from performance, memory footprint, energy consumption and implementing cost view point. It is always desirable to obtain better Instructions per Cycle. Instruction cache has major contribution…

Performance · Computer Science 2013-12-10 Rajendra Patel , Arvind Rajawat

Mixed-Precision Embedding Using a Cache

In recommendation systems, practitioners observed that increase in the number of embedding tables and their sizes often leads to significant improvement in model performances. Given this and the business importance of these models to major…

Machine Learning · Computer Science 2020-10-26 Jie Amy Yang , Jianyu Huang , Jongsoo Park , Ping Tak Peter Tang , Andrew Tulloch

Performance-Optimum Superscalar Architecture for Embedded Applications

Embedded applications are widely used in portable devices such as wireless phones, personal digital assistants, laptops, etc. High throughput and real time requirements are especially important in such data-intensive tasks. Therefore,…

Hardware Architecture · Computer Science 2012-04-13 Mehdi Alipour , Mostafa E. Salehi

Design Space Exploration to Find the Optimum Cache and Register File Size for Embedded Applications

In the future, embedded processors must process more computation-intensive network applications and internet traffic and packet-processing tasks become heavier and sophisticated. Since the processor performance is severely related to the…

Hardware Architecture · Computer Science 2012-05-10 Mehdi Alipour , Mostafa E. Salehi , Hesamodin shojaei baghini

A Generative Caching System for Large Language Models

Caching has the potential to be of significant benefit for accessing large language models (LLMs) due to their high latencies which typically range from a small number of seconds to well over a minute. Furthermore, many LLMs charge money…

Databases · Computer Science 2025-03-25 Arun Iyengar , Ashish Kundu , Ramana Kompella , Sai Nandan Mamidi

Modular Grammatical Evolution for the Generation of Artificial Neural Networks

This paper presents a novel method, called Modular Grammatical Evolution (MGE), towards validating the hypothesis that restricting the solution space of NeuroEvolution to modular and simple neural networks enables the efficient generation…

Neural and Evolutionary Computing · Computer Science 2022-08-05 Khabat Soltanian , Ali Ebnenasir , Mohsen Afsharchi

An Ensemble Embedding Approach for Improving Semantic Caching Performance in LLM-based Systems

Semantic caching enhances the efficiency of large language model (LLM) systems by identifying semantically similar queries, storing responses once, and serving them for subsequent equivalent requests. However, existing semantic caching…

Machine Learning · Computer Science 2025-07-10 Shervin Ghaffari , Zohre Bahranifard , Mohammad Akbari

A methodology to automatically optimize dynamic memory managers applying grammatical evolution

Modern consumer devices must execute multimedia applications that exhibit high resource utilization. In order to efficiently execute these applications, the dynamic memory subsystem needs to be optimized. This complex task can be tackled in…

Hardware Architecture · Computer Science 2024-03-08 José L. Risco-Martín , J. Manuel Colmenar , J. Ignacio Hidalgo , Juan Lanchares , Josefa Díaz

Compositional Memory Systems for Multimedia Communicating Tasks

Conventional cache models are not suited for real-time parallel processing because tasks may flush each other's data out of the cache in an unpredictable manner. In this way the system is not compositional so the overall performance is…

Hardware Architecture · Computer Science 2011-11-09 A. M. Molnos , M. J. M. Heijligers , S. D. Cotofana , J. T. J. Van Eijndhoven

GASE: Generatively Augmented Sentence Encoding

We propose a training-free approach to improve sentence embeddings leveraging test-time compute by applying generative text models for data augmentation at inference time. Unlike conventional data augmentation that utilises synthetic…

Computation and Language · Computer Science 2025-09-09 Manuel Frank , Haithem Afli

Caching Historical Embeddings in Conversational Search

Rapid response, namely low latency, is fundamental in search applications; it is particularly so in interactive search sessions, such as those encountered in conversational settings. An observation with a potential to reduce latency asserts…

Information Retrieval · Computer Science 2022-11-28 Ophir Frieder , Ida Mele , Cristina Ioana Muntean , Franco Maria Nardini , Raffaele Perego , Nicola Tonellotto

Phase distance mapping: a phase-based cache tuning methodology for embedded systems

Networked embedded systems typically leverage a collection of low-power embedded systems (nodes) to collaboratively execute applications spanning diverse application domains (e.g., video, image processing, communication, etc.) with diverse…

Hardware Architecture · Computer Science 2016-02-16 Tosiron Adegbija , Ann Gordon-Ross , Arslan Munir

Automated Concatenation of Embeddings for Structured Prediction

Pretrained contextualized embeddings are powerful word representations for structured prediction tasks. Recent work found that better word representations can be obtained by concatenating different types of embeddings. However, the…

Computation and Language · Computer Science 2021-06-02 Xinyu Wang , Yong Jiang , Nguyen Bach , Tao Wang , Zhongqiang Huang , Fei Huang , Kewei Tu

Advancing Semantic Caching for LLMs with Domain-Specific Embeddings and Synthetic Data

This report investigates enhancing semantic caching effectiveness by employing specialized, fine-tuned embedding models. Semantic caching relies on embedding similarity rather than exact key matching, presenting unique challenges in…

Machine Learning · Computer Science 2025-04-04 Waris Gill , Justin Cechmanek , Tyler Hutcherson , Srijith Rajamohan , Jen Agarwal , Muhammad Ali Gulzar , Manvinder Singh , Benoit Dion

RESTORE: Graph Embedding Assessment Through Reconstruction

Following the success of Word2Vec embeddings, graph embeddings (GEs) have gained substantial traction. GEs are commonly generated and evaluated extrinsically on downstream applications, but intrinsic evaluations of the original graph…

Machine Learning · Computer Science 2023-09-06 Hong Yung Yip , Chidaksh Ravuru , Neelabha Banerjee , Shashwat Jha , Amit Sheth , Aman Chadha , Amitava Das

Evolutionary Optimisation of Real-Time Systems and Networks

The design space of networked embedded systems is very large, posing challenges to the optimisation of such platforms when it comes to support applications with real-time guarantees. Recent research has shown that a number of inter-related…

Performance · Computer Science 2020-07-21 Leandro Soares Indrusiak , Robert I. Davis , Piotr Dziurzanski

Probabilistic Grammatical Evolution

Grammatical Evolution (GE) is one of the most popular Genetic Programming (GP) variants, and it has been used with success in several problem domains. Since the original proposal, many enhancements have been proposed to GE in order to…

Neural and Evolutionary Computing · Computer Science 2021-03-16 Jessica Mégane , Nuno Lourenço , Penousal Machado