English
Related papers

Related papers: Learning Generalizable Program and Architecture Re…

200 papers

In this paper, we provide a fine-grain machine learning-based method, PerfNetV2, which improves the accuracy of our previous work for modeling the neural network performance on a variety of GPU accelerators. Given an application, the…

Machine Learning · Computer Science 2020-12-02 Chuan-Chi Wang , Ying-Chiao Liao , Ming-Chang Kao , Wen-Yew Liang , Shih-Hao Hung

Predicting the performance of deep learning (DL) models, such as execution time and resource utilization, is crucial for Neural Architecture Search (NAS), DL cluster schedulers, and other technologies that advance deep learning. The…

Performance · Computer Science 2025-02-04 Xinlong Zhao , Jiande Sun , Jia Zhang , Sujuan Hou , Shuai Li , Tong Liu , Ke Liu

Deep learning models are widely used across computer vision and other domains. When working on the model induction, selecting the right architecture for a given dataset often relies on repetitive trial-and-error procedures. This procedure…

Machine Learning · Computer Science 2026-01-06 Yen-Chia Chen , Hsing-Kuo Pao , Hanjuan Huang

In self-supervised learning, a model is trained to solve a pretext task, using a data set whose annotations are created by a machine. The objective is to transfer the trained weights to perform a downstream task in the target domain. We…

Machine Learning · Computer Science 2021-10-22 Prathamesh Sonawane , Sparsh Drolia , Saqib Shamsi , Bhargav Jain

The remarkable growth and significant success of machine learning have expanded its applications into programming languages and program analysis. However, a key challenge in adopting the latest machine learning methods is the representation…

Programming Languages · Computer Science 2023-12-01 Ali TehraniJamsaz , Quazi Ishtiaque Mahmud , Le Chen , Nesreen K. Ahmed , Ali Jannesari

In contrast to numerous NLP and 2D vision foundational models, learning a 3D foundational model poses considerably greater challenges. This is primarily due to the inherent data variability and diversity of downstream tasks. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Haoyi Zhu , Honghui Yang , Xiaoyang Wu , Di Huang , Sha Zhang , Xianglong He , Hengshuang Zhao , Chunhua Shen , Yu Qiao , Tong He , Wanli Ouyang

Performance modelling of a deep learning application is essential to improve and quantify the efficiency of the model framework. However, existing performance models are mostly case-specific, with limited capability for the new deep…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-05-22 Tulasi Kavarakuntla , Liangxiu Han , Huw Lloyd , Annabel Latham , Anthony Kleerekoper , Samson B. Akintoye

This paper discusses the estimation of the generalization gap, the difference between generalization performance and training performance, for overparameterized models including neural networks. We first show that a functional variance, a…

Machine Learning · Statistics 2023-03-21 Akifumi Okuno , Keisuke Yano

Meta-learning, or learning-to-learn, seeks to design algorithms that can utilize previous experience to rapidly learn new skills or adapt to new environments. Representation learning -- a key tool for performing meta-learning -- learns a…

Machine Learning · Computer Science 2022-01-04 Nilesh Tripuraneni , Chi Jin , Michael I. Jordan

Federated recommendation systems employ federated learning techniques to safeguard user privacy by transmitting model parameters instead of raw user data between user devices and the central server. Nevertheless, the current federated…

Information Retrieval · Computer Science 2023-05-12 Sichun Luo , Yuanzhang Xiao , Xinyi Zhang , Yang Liu , Wenbo Ding , Linqi Song

Fine-tuning of self-supervised models is a powerful transfer learning method in a variety of fields, including speech processing, since it can utilize generic feature representations obtained from large amounts of unlabeled data.…

Multimedia · Computer Science 2022-12-07 Shinta Otake , Rei Kawakami , Nakamasa Inoue

Existing vehicle trajectory prediction models struggle with generalizability, prediction uncertainties, and handling complex interactions. It is often due to limitations like complex architectures customized for a specific dataset and…

Computer Vision and Pattern Recognition · Computer Science 2025-01-10 Kaouther Messaoud , Matthieu Cord , Alexandre Alahi

Accurate hardware performance models are critical to efficient code generation. They can be used by compilers to make heuristic decisions, by superoptimizers as a minimization objective, or by autotuners to find an optimal configuration for…

A key component of automated algorithm selection and configuration, which in most cases are performed using supervised machine learning (ML) methods is a good-performing predictive model. The predictive model uses the feature representation…

Graph embedding techniques, which learn low-dimensional representations of a graph, are achieving state-of-the-art performance in many graph mining tasks. Most existing embedding algorithms assign a single vector to each node, implicitly…

Social and Information Networks · Computer Science 2020-10-22 Jisung Yoon , Kai-Cheng Yang , Woo-Sung Jung , Yong-Yeol Ahn

Program representation learning is a fundamental task in software engineering applications. With the availability of "big code" and the development of deep learning techniques, various program representation learning models have been…

Software Engineering · Computer Science 2021-09-17 Siqi Han , DongXia Wang , Wanting Li , Xuesong Lu

Catching and attributing code change-induced performance regressions in production is hard; predicting them beforehand, even harder. A primer on automatically learning to predict performance regressions in software, this article gives an…

Software Engineering · Computer Science 2023-05-23 Moritz Beller , Hongyu Li , Vivek Nair , Vijayaraghavan Murali , Imad Ahmad , Jürgen Cito , Drew Carlson , Ari Aye , Wes Dyer

Model merging combines knowledge from task-specific models into a unified multi-task model to avoid joint training on all task data. However, current methods face challenges due to representation bias, which can interfere with tasks…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Marcin Osial , Daniel Marczak , Bartosz Zieliński

Predictive coding is a message-passing framework initially developed to model information processing in the brain, and now also topic of research in machine learning due to some interesting properties. One of such properties is the natural…

Machine Learning · Computer Science 2022-12-12 Billy Byiringiro , Tommaso Salvatori , Thomas Lukasiewicz

Learning universal graph representations across heterogeneous domains is difficult because graph datasets differ in topology, node-attribute semantics, feature dimensions, and even attribute availability. We propose GraphVec, a…

Machine Learning · Computer Science 2026-05-08 Qi Feng , Jicong Fan
‹ Prev 1 2 3 10 Next ›