English
Related papers

Related papers: A Prediction Packetizing Scheme for Reducing Chann…

200 papers

Rapid design space exploration in early design stage is critical to algorithm-architecture co-design for accelerators. In this work, a pre-RTL cycle-accurate accelerator simulator based on SystemC transaction-level modeling (TLM),…

Hardware Architecture · Computer Science 2020-07-30 Sunwoo Kim , Jooho Wang , Youngho Seo , Sanghun Lee , Yeji Park , Sungkyung Park , Chester Sungchung Park

This paper describes the parallel implementation of the TRANSIMS traffic micro-simulation. The parallelization method is domain decomposition, which means that each CPU of the parallel computer is responsible for a different geographical…

Computational Engineering, Finance, and Science · Computer Science 2016-08-31 Kai Nagel , Marcus Rickert

Owing to the rapid growth number of vehicles, urban traffic congestion has become more and more severe in the last decades. As an effective approach, Model Predictive Control (MPC) has been applied to urban traffic signal control system.…

Systems and Control · Computer Science 2018-11-12 Qiming Zou , Ke Lu , Yu Li

Traffic simulation software is becoming increasingly popular as more cities worldwide use it to better manage their crowded traffic networks. An important requirement for such software is the ability to produce accurate results in real…

Other Computer Science · Computer Science 2013-04-15 Salim Farah , Magdy Bayoumi

In this work, we optimize speculative sampling for parallel hardware accelerators to improve sampling speed. We notice that substantial portions of the intermediate matrices necessary for speculative sampling can be computed concurrently.…

Machine Learning · Computer Science 2024-10-04 Dominik Wagner , Seanie Lee , Ilja Baumann , Philipp Seeberger , Korbinian Riedhammer , Tobias Bocklet

This work introduces an integrated approach to optimizing urban traffic by combining predictive modeling of vehicle flow, adaptive traffic signal control, and a modular integration architecture through distributed messaging. Using real-time…

Systems and Control · Electrical Eng. & Systems 2025-05-20 Ismail Zrigui , Samira Khoulji , Mohamed Larbi Kerkeb

The present paper deals with the problem of improving the efficiency of large scale turbulent flow simulations. The high-fidelity methods for modelling turbulent flows become available for a wider range of applications thanks to the…

Computational Physics · Physics 2018-04-10 Boris Krasnopolsky

Transformers are central to advances in artificial intelligence (AI), excelling in fields ranging from computer vision to natural language processing. Despite their success, their large parameter count and computational demands challenge…

Hardware Architecture · Computer Science 2025-03-10 Qunyou Liu , Marina Zapater , David Atienza

Traffic is essential for many dynamic processes on real networks, such as internet and urban traffic systems. The transport efficiency of the traffic system can be improved by taking full advantage of the resources in the system. In this…

Adaptation and Self-Organizing Systems · Physics 2015-06-04 J. -Q. Dong , Z. -G. Huang , Z. Zhou , L. Huang , Z. -X. Wu , Y. Do , Y. -H. Wang

This article presents an automatic approach to quickly derive a good solution for hardware resource partition and task granularity for task-based parallel applications on heterogeneous many-core architectures. Our approach employs a…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-03-10 Peng Zhang , Jianbin Fang , Canqun Yang , Chun Huang , Tao Tang , Zheng Wang

Recent advances in random-walk particle-tracking have enabled direct simulation of mixing and reactions on particles by allowing the particles to interact with each other using a multi-point mass transfer scheme. The mass transfer scheme…

Computational Physics · Physics 2019-04-22 Nicholas B. Engdahl , Michael J. Schmidt , David A. Benson

Cooperative maneuver planning promises to significantly improve traffic efficiency at unsignalized intersections by leveraging connected automated vehicles. Previous works on this topic have been mostly developed for completely automated…

Robotics · Computer Science 2026-02-03 Marvin Klimke , Max Bastian Mertens , Benjamin Völz , Michael Buchholz

Parallel multiphysics simulations often suffer from load imbalances originating from the applied coupling of algorithms with spatially and temporally varying workloads. It is thus desirable to minimize these imbalances to reduce the time to…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-12-03 Christoph Rettinger , Ulrich Rüde

Quite a few algorithms have been proposed to optimize the transmission performance of Multipath TCP (MPTCP). However, existing MPTCP protocols are still far from satisfactory in lossy and ever-changing networks because of their loss-based…

Networking and Internet Architecture · Computer Science 2021-06-14 Jiangping Han , Yitao Xing , Kaiping Xue , David S. L. Wei , Guoliang Xue , Peilin Hong

Transformers have revolutionized AI in natural language processing and computer vision, but their large computation and memory demands pose major challenges for hardware acceleration. In practice, end-to-end throughput is often limited by…

Hardware Architecture · Computer Science 2026-03-20 Qunyou Liu , Marina Zapater , David Atienza

As the complexity of the scan algorithm is dependent on the number of design registers, large SoC scan designs can no longer be verified in RTL simulation unless partitioned into smaller sub-blocks. This paper proposes a methodology to…

Other Computer Science · Computer Science 2014-09-12 Bill Jason Tomas , Yingtao Jiang , Mei Yang

Witnessing the advancing scale and complexity of chip design and benefiting from high-performance computation technologies, the simulation of Very Large Scale Integration (VLSI) Circuits imposes an increasing requirement for acceleration…

Data Structures and Algorithms · Computer Science 2023-04-27 Weijie Fang , Yanggeng Fu , Jiaquan Gao , Longkun Guo , Gregory Gutin , Xiaoyan Zhang

This paper studies parallelization schemes for stochastic Vector Quantization algorithms in order to obtain time speed-ups using distributed resources. We show that the most intuitive parallelization scheme does not lead to better…

Machine Learning · Statistics 2012-05-14 Matthieu Durut , Benoît Patra , Fabrice Rossi

Networks-on-Chip (NoCs) used in commercial many-core processors typically incorporate priority arbitration. Moreover, they experience bursty traffic due to application workloads. However, most state-of-the-art NoC analytical performance…

Performance · Computer Science 2020-07-29 Sumit K. Mandal , Raid Ayoub , Michael Kishinevsky , Mohammad M. Islam , Umit Y. Ogras

This paper investigates co-scheduling algorithms for processing a set of parallel applications. Instead of executing each application one by one, using a maximum degree of parallelism for each of them, we aim at scheduling several…

Data Structures and Algorithms · Computer Science 2013-05-01 Guillaume Aupy , Manu Shantharam , Anne Benoit , Yves Robert , Padma Raghavan
‹ Prev 1 2 3 10 Next ›