Related papers: Value Function Based Performance Optimization of D…

200 papers

The paper presents an efficient real-time scheduling algorithm for intelligent real-time edge services, defined as those that perform machine intelligence tasks, such as voice recognition, LIDAR processing, or machine vision, on behalf of…

Machine Learning · Computer Science 2020-11-03 Shuochao Yao, Yifan Hao, Yiran Zhao, Huajie Shao, Dongxin Liu, Shengzhong Liu, Tianshi Wang, Jinyang Li, Tarek Abdelzaher

With the unprecedented proliferation of machine learning software, there is an ever-increasing need to generate efficient code for such applications. State-of-the-art deep-learning compilers like TVM and Halide incorporate a learning-based…

Machine Learning · Computer Science 2021-08-31 Shikhar Singh, Benoit Steiner, James Hegarty, Hugh Leather

As deep learning models nowadays are widely adopted by both cloud services and edge devices, reducing the latency of deep learning model inferences becomes crucial to provide efficient model serving. However, it is challenging to develop…

Machine Learning · Computer Science 2023-02-16 Yaoyao Ding, Cody Hao Yu, Bojian Zheng, Yizhi Liu, Yida Wang, Gennady Pekhimenko

After completing the design and training phases, deploying a deep learning model onto specific hardware is essential before practical implementation. Targeted optimizations are necessary to enhance the model's performance by reducing…

Human-Computer Interaction · Computer Science 2023-08-10 Laixin Xie, Chenyang Zhang, Ruofei Ma, Xing Jiang, Xingxing Xing, Wei Wan, Quan Li

We present a new algorithm to quickly generate high-performance GPU implementations of complex imaging and vision pipelines, directly from high-level Halide algorithm code. It is fully automatic, requiring no schedule templates or…

Programming Languages · Computer Science 2023-08-29 Luke Anderson, Andrew Adams, Karima Ma, Tzu-Mao Li, Tian Jin, Jonathan Ragan-Kelley

Distributed cloud environments hosting data-intensive applications often experience slowdowns due to network congestion, asymmetric bandwidth, and inter-node data shuffling. These factors are typically not captured by traditional host-level…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-21 Sankalpa Timilsina, Susmit Shannigrahi

In recent years, deep learning techniques have outperformed traditional models in many machine learning tasks. Deep neural networks have successfully been applied to address time series forecasting problems, which is a very important topic…

Machine Learning · Computer Science 2021-04-09 Pedro Lara-Benítez, Manuel Carranza-García, José C. Riquelme

We consider the problem of scheduling in constrained queueing networks with a view to minimizing packet delay. Modern communication systems are becoming increasingly complex, and are required to handle multiple types of traffic with widely…

Machine Learning · Computer Science 2021-05-04 Mohammani Zaki, Avi Mohan, Aditya Gopalan, Shie Mannor

Query performance prediction, the task of predicting the latency of a query, is one of the most challenging problems in database management systems. Existing approaches rely on features and performance models engineered by human experts, but…

Databases · Computer Science 2020-04-09 Ryan Marcus, Olga Papaemmanouil

Multilayer networks have seen a resurgence under the umbrella of deep learning. Current deep learning algorithms train the layers of the network sequentially, improving algorithmic performance as well as providing some regularization. We…

Machine Learning · Computer Science 2016-02-22 Ke Wu, Malik Magdon-Ismail

Minimizing job scheduling time is a fundamental issue in data center networks that has been extensively studied in recent years. The incoming jobs require different CPU and memory units, and span different numbers of time slots. The…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-11-21 Weijia Chen, Yuedong Xu, Xiaofeng Wu

Training deep learning models, particularly Transformer-based architectures such as Large Language Models (LLMs), demands substantial computational resources and extended training periods. While optimal configuration and infrastructure…

Machine Learning · Computer Science 2024-12-30 Alireza Pourali, Arian Boukani, Hamzeh Khazaei

Deep learning (DL) frameworks take advantage of GPUs to improve the speed of DL inference and training. Ideally, DL frameworks should be able to fully utilize the computation power of GPUs such that the running time depends on the amount of…

Machine Learning · Computer Science 2020-12-07 Woosuk Kwon, Gyeong-In Yu, Eunji Jeong, Byung-Gon Chun

Deep Learning has been recently recognized as one of the feasible solutions to effectively address combinatorial optimization problems, which are often considered important yet challenging in various research domains. In this work, we first…

Artificial Intelligence · Computer Science 2020-12-15 Hyunsung Lee, Michael Wang, Honguk Woo

The learning rate schedule is one of the most impactful aspects of neural network optimization, yet most schedules either follow simple parametric functions or react only to short-term training signals. None of them are supported by a…

Machine Learning · Computer Science 2025-09-30 Matt L. Sampson, Peter Melchior

In Large Language Model (LLM) inference, the output length of an LLM request is typically regarded as not known a priori. Consequently, most LLM serving systems employ a simple First-come-first-serve (FCFS) scheduling strategy, leading to…

Machine Learning · Computer Science 2024-08-29 Yichao Fu, Siqi Zhu, Runlong Su, Aurick Qiao, Ion Stoica, Hao Zhang

The manpower scheduling problem is a critical combinatorial optimization problem. Researching solutions to scheduling problems can improve the efficiency of companies, hospitals, and other work units. This paper proposes a new model…

Machine Learning · Computer Science 2021-05-11 Tianyu Liu, Lingyu Zhang

Deep learning is rapidly becoming a go-to tool for many artificial intelligence problems due to its ability to outperform other approaches and even humans at many problems. Despite its popularity we are still unable to accurately predict…

Machine Learning · Computer Science 2018-11-30 Daniel Justus, John Brennan, Stephen Bonner, Andrew Stephen McGough

Characterizing and predicting the training performance of modern machine learning (ML) workloads on compute systems with compute and communication spread between CPUs, GPUs, and network devices is not only the key to optimization and…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-11-27 Zhongyi Lin, Ning Sun, Pallab Bhattacharya, Xizhou Feng, Louis Feng, John D. Owens

Time series prediction with neural networks has been the focus of much research in the past few decades. Given the recent deep learning revolution, there has been much attention in using deep learning models for time series prediction, and…

Machine Learning · Computer Science 2021-06-08 Rohitash Chandra, Shaurya Goyal, Rishabh Gupta