English
Related papers

Related papers: A Register Allocation Algorithm in the Presence of…

200 papers

In a compiler, an essential component is the register allocator. Two main algorithms have dominated implementations, graph coloring and linear scan, differing in how live values are modeled. Graph coloring uses an edge in an `interference…

Programming Languages · Computer Science 2020-11-12 Ian Rogers

Register allocation (mapping variables to processor registers or memory) and instruction scheduling (reordering instructions to increase instruction-level parallelism) are essential tasks for generating efficient assembly code in a…

Programming Languages · Computer Science 2019-06-10 Roberto Castañeda Lozano , Christian Schulte

Register allocation has long been formulated as a graph coloring problem, coloring the conflict graph with physical registers. Such a formulation does not fully capture the goal of the allocation, which is to minimize the traffic between…

Programming Languages · Computer Science 2012-02-27 Yin Wang , R. Kent Dybvig

The use and location of memory in integrated circuits plays a key factor in their performance. Memory requires large physical area, access times limit overall system performance and connectivity can result in large fan-out. Modern FPGA…

Hardware Architecture · Computer Science 2020-03-25 Alexander E. Beasley

Arising disruptive memory technologies continuously make their way into the memory hierarchy at various levels. Racetrack memory is one promising candidate for future memory due to the overall low energy consumption, access latency and high…

Hardware Architecture · Computer Science 2025-03-03 Christian Hakert , Shuo-Han Chen , Kay Heider , Roland Kühn , Yun-Chih Chen , Jens Teubner , Jian-Jia Chen

Bringing high-level machine learning models to efficient and well-suited machine implementations often invokes a bunch of tools, e.g.~code generators, compilers, and optimizers. Along such tool chains, abstractions have to be applied. This…

Machine Learning · Computer Science 2024-04-11 Daniel Biebert , Christian Hakert , Kuan-Hsun Chen , Jian-Jia Chen

This paper proposes an application mapping algorithm, BandMap, for coarse-grained reconfigurable array (CGRA), which allocates the bandwidth in PE array according to the transferring demands of data, especially the data with high spatial…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-10-11 Xiaobing Ni , Jiaheng Ruan , Mengke Ge , Wendi Sun , Song Chen , Yi Kang

In engineering applications sorting is an important and widely studied problem where execution speed and resources used for computation are of extreme importance, especially if we think about real time data processing. Most of the…

Hardware Architecture · Computer Science 2012-06-08 Rourab Paul , Suman Sau , Amlan Chakrabarti

In this paper, the acceleration of algorithms using a design of a field programmable gate array (FPGA) as a prototype of a static dataflow architecture is discussed. The static dataflow architecture using operators interconnected by…

Hardware Architecture · Computer Science 2015-03-13 Jorge Luiz e Silva , Joelmir Jose Lopes , Bruno de Abreu Silva , Antonio Carlos Fernandes da Silva

Register allocation is a much studied problem. A particularly important context for optimizing register allocation is within loops, since a significant fraction of the execution time of programs is often inside loop code. A variety of…

Programming Languages · Computer Science 2014-06-04 Lukasz Domagala , Fabrice Rastello , Sadayappan Ponnuswany , Duco Van Amstel

Data structures are a cornerstone of most modern programming languages. Whether they are provided via separate libraries, built into the language specification, or as part of the language's standard library -- data structures such as lists,…

Programming Languages · Computer Science 2025-03-03 Lukas Makor , Sebastian Kloibhofer , Peter Hofer , David Leopoldseder , Hanspeter Mössenböck

Efficient and real time segmentation of color images has a variety of importance in many fields of computer vision such as image compression, medical imaging, mapping and autonomous navigation. Being one of the most computationally…

Computer Vision and Pattern Recognition · Computer Science 2017-10-09 Roopal Nahar , Akanksha Baranwal , K. Madhava Krishna

Coarse-grained reconfigurable architectures aim to achieve both goals of high performance and flexibility. However, existing reconfigurable array architectures require many resources without considering the specific application domain.…

Hardware Architecture · Computer Science 2011-11-09 Yoonjin Kim , Mary Kiemb , Chulsoo Park , Jinyong Jung , Kiyoung Choi

Applications making excessive use of single-object based data structures (such as linked lists, trees, etc...) can see a drop in efficiency over a period of time due to the randomization of nodes in memory. This slow down is due to the…

Data Structures and Algorithms · Computer Science 2021-10-22 Dhruv Matani , Gaurav Menghani

Domain-specific accelerators are used in various computing systems ranging from edge devices to data centers. Coarse-grained reconfigurable arrays (CGRAs) represent an architectural midpoint between the flexibility of an FPGA and the…

Hardware Architecture · Computer Science 2023-01-04 Taeyoung Kong , Kalhan Koul , Priyanka Raina , Mark Horowitz , Christopher Torng

We study the problem of executing an application represented by a precedence task graph on a parallel machine composed of standard computing cores and accelerators. Contrary to most existing approaches, we distinguish the allocation and the…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-11-20 Marcos Amaris , Giorgio Lucarelli , Clément Mommessin , Denis Trystram

Registers are the fastest memory components within the GPU's complex memory hierarchy, accessed by names rather than addresses. They are managed entirely by the compiler through a process called register allocation, during which the…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-01-28 Deniz Elbek , Kamer Kaya

The architecture of a coarse-grained reconfigurable array (CGRA) processing element (PE) has a significant effect on the performance and energy efficiency of an application running on the CGRA. This paper presents an automated approach for…

Hardware Architecture · Computer Science 2021-04-30 Jackson Melchert , Kathleen Feng , Caleb Donovick , Ross Daly , Clark Barrett , Mark Horowitz , Pat Hanrahan , Priyanka Raina

Genetic Algorithms (GAs) are used to solve search and optimization problems in which an optimal solution can be found using an iterative process with probabilistic and non-deterministic transitions. However, depending on the problem's…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-01-23 Matheus F. Torquato , Marcelo A. C. Fernandes

Coarse-Grained Reconfigurable Arrays (CGRAs) are specialized accelerators commonly employed to boost performance in workloads with iterative structures. Existing research typically focuses on compiler or architecture optimizations aimed at…

Hardware Architecture · Computer Science 2025-08-28 Xiangfeng Liu , Zhe Jiang , Anzhen Zhu , Xiaomeng Han , Mingsong Lyu , Qingxu Deng , Nan Guan
‹ Prev 1 2 3 10 Next ›