Related papers: LLM-Aided Compilation for Tensor Accelerators

A High-Level Compiler Integration Approach for Deep Learning Accelerators Supporting Abstraction and Optimization

The growing adoption of domain-specific architectures in edge computing platforms for deep learning has highlighted the efficiency of hardware accelerators. However, integrating custom accelerators into modern machine learning (ML)…

Machine Learning · Computer Science 2025-07-08 Samira Ahmadifarsani , Daniel Mueller-Gritschneder , Ulf Schlichtmann

Code Transpilation for Hardware Accelerators

DSLs and hardware accelerators have proven to be very effective in optimizing computationally expensive workloads. In this paper, we propose a solution to the challenge of manually rewriting legacy or unoptimized code in domain-specific…

Programming Languages · Computer Science 2023-08-15 Yuto Nishida , Sahil Bhatia , Shadaj Laddad , Hasan Genc , Yakun Sophia Shao , Alvin Cheung

A Survey on Hardware Accelerators for Large Language Models

Large Language Models (LLMs) have emerged as powerful tools for natural language processing tasks, revolutionizing the field with their ability to understand and generate human-like text. As the demand for more sophisticated LLMs continues…

Hardware Architecture · Computer Science 2025-01-13 Christoforos Kachris

Autocomp: A Powerful and Portable Code Optimizer for Tensor Accelerators

Hardware accelerators, especially those designed for tensor processing, have become ubiquitous in today's computing landscape. However, even with significant efforts in building compilers, programming these tensor accelerators remains…

Programming Languages · Computer Science 2025-11-07 Charles Hong , Sahil Bhatia , Alvin Cheung , Yakun Sophia Shao

Exploring and Characterizing Large Language Models For Embedded System Development and Debugging

Large language models (LLMs) have shown remarkable abilities to generate code, however their ability to develop software for embedded systems, which requires cross-domain knowledge of hardware and software has not been studied. In this…

Software Engineering · Computer Science 2023-11-23 Zachary Englhardt , Richard Li , Dilini Nissanka , Zhihan Zhang , Girish Narayanswamy , Joseph Breda , Xin Liu , Shwetak Patel , Vikram Iyer

A Review on Proprietary Accelerators for Large Language Models

With the advancement of Large Language Models (LLMs), the importance of accelerators that efficiently process LLM computations has been increasing. This paper discusses the necessity of LLM accelerators and provides a comprehensive analysis…

Performance · Computer Science 2025-03-14 Sihyeong Park , Jemin Lee , Byung-Soo Kim , Seokhun Jeon

CompilerGPT: Leveraging Large Language Models for Analyzing and Acting on Compiler Optimization Reports

Current compiler optimization reports often present complex, technical information that is difficult for programmers to interpret and act upon effectively. This paper assesses the capability of large language models (LLM) to understand…

Programming Languages · Computer Science 2025-06-16 Peter Pirkelbauer , Chunhua Liao

Hardware Acceleration of LLMs: A comprehensive survey and comparison

Large Language Models (LLMs) have emerged as powerful tools for natural language processing tasks, revolutionizing the field with their ability to understand and generate human-like text. In this paper, we present a comprehensive survey of…

Hardware Architecture · Computer Science 2024-09-06 Nikoletta Koilia , Christoforos Kachris

A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators

Artificial intelligence (AI) methods have become critical in scientific applications to help accelerate scientific discovery. Large language models (LLMs) are being considered as a promising approach to address some of the challenging…

Performance · Computer Science 2023-10-10 Murali Emani , Sam Foreman , Varuni Sastry , Zhen Xie , Siddhisanket Raskar , William Arnold , Rajeev Thakur , Venkatram Vishwanath , Michael E. Papka

Meta Large Language Model Compiler: Foundation Models of Compiler Optimization

Large Language Models (LLMs) have demonstrated remarkable capabilities across a variety of software engineering and coding tasks. However, their application in the domain of code and compiler optimization remains underexplored. Training…

Programming Languages · Computer Science 2024-07-04 Chris Cummins , Volker Seeker , Dejan Grubisic , Baptiste Roziere , Jonas Gehring , Gabriel Synnaeve , Hugh Leather

New Solutions on LLM Acceleration, Optimization, and Application

Large Language Models (LLMs) have become extremely potent instruments with exceptional capacities for comprehending and producing human-like text in a wide range of applications. However, the increasing size and complexity of LLMs present…

Machine Learning · Computer Science 2024-06-18 Yingbing Huang , Lily Jiaxin Wan , Hanchen Ye , Manvi Jha , Jinghua Wang , Yuhong Li , Xiaofan Zhang , Deming Chen

How Can Large Language Models Help Humans in Design and Manufacturing?

The advancement of Large Language Models (LLMs), including GPT-4, provides exciting new opportunities for generative design. We investigate the application of this tool across the entire design and manufacturing workflow. Specifically, we…

Computation and Language · Computer Science 2023-07-28 Liane Makatura , Michael Foshey , Bohan Wang , Felix HähnLein , Pingchuan Ma , Bolei Deng , Megan Tjandrasuwita , Andrew Spielberg , Crystal Elaine Owens , Peter Yichen Chen , Allan Zhao , Amy Zhu , Wil J Norton , Edward Gu , Joshua Jacob , Yifei Li , Adriana Schulz , Wojciech Matusik

GPT-4 as an interface between researchers and computational software: improving usability and reproducibility

Large language models (LLMs) are playing an increasingly important role in science and engineering. For example, their ability to parse and understand human and computer languages makes them powerful interpreters and their use in…

Materials Science · Physics 2023-10-19 Juan C. Verduzco , Ethan Holbrook , Alejandro Strachan

GPT4AIGChip: Towards Next-Generation AI Accelerator Design Automation via Large Language Models

The remarkable capabilities and intricate nature of Artificial Intelligence (AI) have dramatically escalated the imperative for specialized AI accelerators. Nonetheless, designing these accelerators for various AI workloads remains both…

Machine Learning · Computer Science 2025-01-07 Yonggan Fu , Yongan Zhang , Zhongzhi Yu , Sixu Li , Zhifan Ye , Chaojian Li , Cheng Wan , Yingyan Celine Lin

LLM4VV: Developing LLM-Driven Testsuite for Compiler Validation

Large language models (LLMs) are a new and powerful tool for a wide span of applications involving natural language and demonstrate impressive code generation abilities. The goal of this work is to automatically generate tests and use these…

Artificial Intelligence · Computer Science 2024-03-12 Christian Munley , Aaron Jarmusch , Sunita Chandrasekaran

REASONING COMPILER: LLM-Guided Optimizations for Efficient Model Serving

While model serving has unlocked unprecedented capabilities, the high cost of serving large-scale models continues to be a significant barrier to widespread accessibility and rapid innovation. Compiler optimizations have long driven…

Machine Learning · Computer Science 2026-02-05 Annabelle Sujun Tang , Christopher Priebe , Rohan Mahapatra , Lianhui Qin , Hadi Esmaeilzadeh

A Tensor-Train Decomposition based Compression of LLMs on Group Vector Systolic Accelerator

Large language models (LLMs) are both storage-intensive and computation-intensive, posing significant challenges when deployed on resource-constrained hardware. As linear layers in LLMs are mainly resource consuming parts, this paper…

Hardware Architecture · Computer Science 2025-02-03 Sixiao Huang , Tintin Wang , Ang Li , Ao Shen , Kai Li , Keyao Jiang , Mingqiang Huang , Hao Yu

GFormer: Accelerating Large Language Models with Optimized Transformers on Gaudi Processors

Heterogeneous hardware like Gaudi processor has been developed to enhance computations, especially matrix operations for Transformer-based large language models (LLMs) for generative AI tasks. However, our analysis indicates that…

Hardware Architecture · Computer Science 2024-12-31 Chengming Zhang , Xinheng Ding , Baixi Sun , Xiaodong Yu , Weijian Zheng , Zhen Xie , Dingwen Tao

ML For Hardware Design Interpretability: Challenges and Opportunities

The increasing size and complexity of machine learning (ML) models have driven the growing need for custom hardware accelerators capable of efficiently supporting ML workloads. However, the design of such accelerators remains a…

Machine Learning · Computer Science 2025-04-15 Raymond Baartmans , Andrew Ensinger , Victor Agostinelli , Lizhong Chen

Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization

Large language models (LLMs) demonstrate outstanding performance in various tasks in machine learning and have thus become one of the most important workloads in today's computing landscape. However, deploying LLM inference poses challenges…

Machine Learning · Computer Science 2024-06-21 Jungi Lee , Wonbeom Lee , Jaewoong Sim