Related papers: DG-RePlAce: A Dataflow-Driven GPU-Accelerated Anal…

Timing-Driven Global Placement by Efficient Critical Path Extraction

Timing optimization during the global placement of integrated circuits has been a significant focus for decades, yet it remains a complex, unresolved issue. Recent analytical methods typically use pin-level timing information to adjust net…

Hardware Architecture · Computer Science 2025-03-18 Yunqi Shi, Siyuan Xu, Shixiong Kai, Xi Lin, Ke Xue, Mingxuan Yuan, Chao Qian

DiffPlace: A Conditional Diffusion Framework for Simultaneous VLSI Placement Beyond Sequential Paradigms

Chip placement, a critical step in the VLSI physical design flow, directly impacts performance, power, and routability. Traditional chip placement methods, relying on analytical optimization or sequential reinforcement learning (RL), face…

Hardware Architecture · Computer Science 2026-04-08 Kien Le Trung, Truong-Son Hy

TransPlace: Transferable Circuit Global Placement via Graph Neural Network

Global placement, a critical step in designing the physical layout of computer chips, is essential to optimize chip performance. Prior global placement methods optimize each circuit design individually from scratch. Their neglect of…

Machine Learning · Computer Science 2025-03-27 Yunbo Hou, Haoran Ye, Shuwen Yang, Yingxue Zhang, Siyuan Xu, Guojie Song

Recursive Learning-Based Virtual Buffering for Analytical Global Placement

Due to the skewed scaling of interconnect versus cell delay in modern technology nodes, placement with buffer porosity (i.e., cell density) awareness is essential for timing closure in physical synthesis flows. However, existing approaches…

Machine Learning · Computer Science 2025-08-01 Andrew B. Kahng, Yiting Liu, Zhiang Wang

RoutePlacer: An End-to-End Routability-Aware Placer with Graph Neural Network

Placement is a critical and challenging step of modern chip design, with routability being an essential indicator of placement quality. Current routability-oriented placers typically apply an iterative two-stage approach, wherein the first…

Machine Learning · Computer Science 2024-06-06 Yunbo Hou, Haoran Ye, Yingxue Zhang, Siyuan Xu, Guojie Song

The Power of Graph Signal Processing for Chip Placement Acceleration

Placement is a critical task with high computation complexity in VLSI physical design. Modern analytical placers formulate the placement objective as a nonlinear optimization task, which suffers a long iteration time. To accelerate and…

Machine Learning · Computer Science 2025-02-26 Yiting Liu, Hai Zhou, Jia Wang, Fan Yang, Xuan Zeng, Li Shang

Critical Path Aware Timing-Driven Global Placement for Large-Scale Heterogeneous FPGAs

Timing optimization during global placement is critical for achieving optimal circuit performance and remains a key challenge in modern Field Programmable Gate Array (FPGA) design. As FPGA designs scale and heterogeneous resources increase,…

Hardware Architecture · Computer Science 2025-12-02 He Jiang, Yi Guo, Shikai Guo, Huijiang Liu, Xiaochen Li, Ning Wang, Zhixiong Di

MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning

Placement is an essential task in modern chip design, aiming at placing millions of circuit modules on a 2D chip canvas. Unlike the human-centric solution, which requires months of intense effort by hardware engineers to produce a layout to…

Computer Vision and Pattern Recognition · Computer Science 2022-11-28 Yao Lai, Yao Mu, Ping Luo

Guiding Global Placement With Reinforcement Learning

Recent advances in GPU accelerated global and detail placement have reduced the time to solution by an order of magnitude. This advancement allows us to leverage data driven optimization (such as Reinforcement Learning) in an effort to…

Machine Learning · Computer Science 2021-09-07 Robert Kirby, Kolby Nottingham, Rajarshi Roy, Saad Godil, Bryan Catanzaro

GDP: Generalized Device Placement for Dataflow Graphs

Runtime and scalability of large neural networks can be significantly affected by the placement of operations in their dataflow graphs on suitable devices. With increasingly complex neural network architectures and heterogeneous device…

Machine Learning · Computer Science 2019-10-04 Yanqi Zhou, Sudip Roy, Amirali Abdolrashidi, Daniel Wong, Peter C. Ma, Qiumin Xu, Ming Zhong, Hanxiao Liu, Anna Goldie, Azalia Mirhoseini, James Laudon

Analytical Die-to-Die 3D Placement with Bistratal Wirelength Model and GPU Acceleration

In this paper, we present a new analytical 3D placement framework with a bistratal wirelength model for F2F-bonded 3D ICs with heterogeneous technology nodes based on the electrostatic-based density model. The proposed framework, enabled…

Hardware Architecture · Computer Science 2023-10-13 Peiyu Liao, Yuxuan Zhao, Dawei Guo, Yibo Lin, Bei Yu

An Efficient Stochastic Subgradient Method for the Global Placement Problem in Very Large-Scale Integration Circuits

The placement problem in Very Large-Scale Integration (VLSI) circuits is a critical step in chip design. Its primary goal is to optimize the wirelength of circuit components within a confined area while adhering to nonoverlapping…

Optimization and Control · Mathematics 2026-05-06 Yi-Shuang Yue, Yu-Hong Dai, Haijun Yu

On Joint Learning for Solving Placement and Routing in Chip Design

For its advantage in GPU acceleration and less dependency on human experts, machine learning has been an emerging tool for solving the placement and routing problems, as two critical steps in modern chip design flow. Being still in its…

Machine Learning · Computer Science 2021-12-28 Ruoyu Cheng, Junchi Yan

GAP-LA: GPU-Accelerated Performance-Driven Layer Assignment

Layer assignment is critical for global routing of VLSI circuits. It converts 2D routing paths into 3D routing solutions by determining the proper metal layer for each routing segment to minimize congestion and via count. As different…

Hardware Architecture · Computer Science 2026-01-08 Chunyuan Zhao, Zizheng Guo, Zuodong Zhang, Yibo Lin

Distributed Graph Layout for Scalable Small-world Network Analysis

The in-memory graph layout or organization has a considerable impact on the time and energy efficiency of distributed memory graph computations. It affects memory locality, inter-task load balance, communication time, and overall memory…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-01-04 George M Slota, Sivasankaran Rajamanickam, Kamesh Madduri

GMI-DRL: Empowering Multi-GPU Deep Reinforcement Learning with GPU Spatial Multiplexing

With the increasing popularity of robotics in industrial control and autonomous driving, deep reinforcement learning (DRL) raises the attention of various fields. However, DRL computation on the modern powerful GPU platform is still…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-06-20 Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Ang Li, Yufei Ding

A Faster, Lighter and Stronger Deep Learning-Based Approach for Place Recognition

Visual Place Recognition is an essential component of systems for camera localization and loop closure detection, and it has attracted widespread interest in multiple domains such as computer vision, robotics and AR/VR. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2022-11-29 Rui Huang, Ze Huang, Songzhi Su

RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection

Object detection plays a crucial role in smart video analysis, with applications ranging from autonomous driving and security to smart cities. However, achieving real-time object detection on edge devices presents significant challenges due…

Computer Vision and Pattern Recognition · Computer Science 2025-01-17 Jianrui Shi, Yong Zhao, Zeyang Cui, Xiaoming Shen, Minhang Zeng, Xiaojie Liu

DreamShard: Generalizable Embedding Table Placement for Recommender Systems

We study embedding table placement for distributed recommender systems, which aims to partition and place the tables on multiple hardware devices (e.g., GPUs) to balance the computation and communication costs. Although prior work has…

Machine Learning · Computer Science 2022-10-06 Daochen Zha, Louis Feng, Qiaoyu Tan, Zirui Liu, Kwei-Herng Lai, Bhargav Bhushanam, Yuandong Tian, Arun Kejariwal, Xia Hu

Placeto: Learning Generalizable Device Placement Algorithms for Distributed Machine Learning

We present Placeto, a reinforcement learning (RL) approach to efficiently find device placements for distributed neural network training. Unlike prior approaches that only find a device placement for a specific computation graph, Placeto…

Machine Learning · Computer Science 2019-06-24 Ravichandra Addanki, Shaileshh Bojja Venkatakrishnan, Shreyan Gupta, Hongzi Mao, Mohammad Alizadeh