Related papers: Learning Generalizable Program and Architecture Re…

Toward Accurate Platform-Aware Performance Modeling for Deep Neural Networks

In this paper, we provide a fine-grain machine learning-based method, PerfNetV2, which improves the accuracy of our previous work for modeling the neural network performance on a variety of GPU accelerators. Given an application, the…

Machine Learning · Computer Science 2020-12-02 Chuan-Chi Wang , Ying-Chiao Liao , Ming-Chang Kao , Wen-Yew Liang , Shih-Hao Hung

PerfSeer: An Efficient and Accurate Deep Learning Models Performance Predictor

Predicting the performance of deep learning (DL) models, such as execution time and resource utilization, is crucial for Neural Architecture Search (NAS), DL cluster schedulers, and other technologies that advance deep learning. The…

Performance · Computer Science 2025-02-04 Xinlong Zhao , Jiande Sun , Jia Zhang , Sujuan Hou , Shuai Li , Tong Liu , Ke Liu

Data Complexity-aware Deep Model Performance Forecasting

Deep learning models are widely used across computer vision and other domains. When working on the model induction, selecting the right architecture for a given dataset often relies on repetitive trial-and-error procedures. This procedure…

Machine Learning · Computer Science 2026-01-06 Yen-Chia Chen , Hsing-Kuo Pao , Hanjuan Huang

Self-Supervised Visual Representation Learning Using Lightweight Architectures

In self-supervised learning, a model is trained to solve a pretext task, using a data set whose annotations are created by a machine. The objective is to transfer the trained weights to perform a downstream task in the target domain. We…

Machine Learning · Computer Science 2021-10-22 Prathamesh Sonawane , Sparsh Drolia , Saqib Shamsi , Bhargav Jain

PERFOGRAPH: A Numerical Aware Program Graph Representation for Performance Optimization and Program Analysis

The remarkable growth and significant success of machine learning have expanded its applications into programming languages and program analysis. However, a key challenge in adopting the latest machine learning methods is the representation…

Programming Languages · Computer Science 2023-12-01 Ali TehraniJamsaz , Quazi Ishtiaque Mahmud , Le Chen , Nesreen K. Ahmed , Ali Jannesari

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

In contrast to numerous NLP and 2D vision foundational models, learning a 3D foundational model poses considerably greater challenges. This is primarily due to the inherent data variability and diversity of downstream tasks. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Haoyi Zhu , Honghui Yang , Xiaoyang Wu , Di Huang , Sha Zhang , Xianglong He , Hengshuang Zhao , Chunhua Shen , Yu Qiao , Tong He , Wanli Ouyang

A Generic Performance Model for Deep Learning in a Distributed Environment

Performance modelling of a deep learning application is essential to improve and quantify the efficiency of the model framework. However, existing performance models are mostly case-specific, with limited capability for the new deep…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-05-22 Tulasi Kavarakuntla , Liangxiu Han , Huw Lloyd , Annabel Latham , Anthony Kleerekoper , Samson B. Akintoye

A generalization gap estimation for overparameterized models via the Langevin functional variance

This paper discusses the estimation of the generalization gap, the difference between generalization performance and training performance, for overparameterized models including neural networks. We first show that a functional variance, a…

Machine Learning · Statistics 2023-03-21 Akifumi Okuno , Keisuke Yano

Provable Meta-Learning of Linear Representations

Meta-learning, or learning-to-learn, seeks to design algorithms that can utilize previous experience to rapidly learn new skills or adapt to new environments. Representation learning -- a key tool for performing meta-learning -- learns a…

Machine Learning · Computer Science 2022-01-04 Nilesh Tripuraneni , Chi Jin , Michael I. Jordan

PerFedRec++: Enhancing Personalized Federated Recommendation with Self-Supervised Pre-Training

Federated recommendation systems employ federated learning techniques to safeguard user privacy by transmitting model parameters instead of raw user data between user devices and the central server. Nevertheless, the current federated…

Information Retrieval · Computer Science 2023-05-12 Sichun Luo , Yuanzhang Xiao , Xinyi Zhang , Yang Liu , Wenbo Ding , Linqi Song

Parameter Efficient Transfer Learning for Various Speech Processing Tasks

Fine-tuning of self-supervised models is a powerful transfer learning method in a variety of fields, including speech processing, since it can utilize generic feature representations obtained from large amounts of unlabeled data.…

Multimedia · Computer Science 2022-12-07 Shinta Otake , Rei Kawakami , Nakamasa Inoue

Towards Generalizable Trajectory Prediction Using Dual-Level Representation Learning And Adaptive Prompting

Existing vehicle trajectory prediction models struggle with generalizability, prediction uncertainties, and handling complex interactions. It is often due to limitations like complex architectures customized for a specific dataset and…

Computer Vision and Pattern Recognition · Computer Science 2025-01-10 Kaouther Messaoud , Matthieu Cord , Alexandre Alahi

A Learned Performance Model for Tensor Processing Units

Accurate hardware performance models are critical to efficient code generation. They can be used by compilers to make heuristic decisions, by superoptimizers as a minimization objective, or by autotuners to find an optimal configuration for…

Performance · Computer Science 2021-03-19 Samuel J. Kaufman , Phitchaya Mangpo Phothilimthana , Yanqi Zhou , Charith Mendis , Sudip Roy , Amit Sabne , Mike Burrows

Assessing the Generalizability of a Performance Predictive Model

A key component of automated algorithm selection and configuration, which in most cases are performed using supervised machine learning (ML) methods is a good-performing predictive model. The predictive model uses the feature representation…

Machine Learning · Computer Science 2023-06-02 Ana Nikolikj , Gjorgjina Cenikj , Gordana Ispirova , Diederick Vermetten , Ryan Dieter Lang , Andries Petrus Engelbrecht , Carola Doerr , Peter Korošec , Tome Eftimov

Persona2vec: A Flexible Multi-role Representations Learning Framework for Graphs

Graph embedding techniques, which learn low-dimensional representations of a graph, are achieving state-of-the-art performance in many graph mining tasks. Most existing embedding algorithms assign a single vector to each node, implicitly…

Social and Information Networks · Computer Science 2020-10-22 Jisung Yoon , Kai-Cheng Yang , Woo-Sung Jung , Yong-Yeol Ahn

A Comparison of Code Embeddings and Beyond

Program representation learning is a fundamental task in software engineering applications. With the availability of "big code" and the development of deep learning techniques, various program representation learning models have been…

Software Engineering · Computer Science 2021-09-17 Siqi Han , DongXia Wang , Wanting Li , Xuesong Lu

Learning to Learn to Predict Performance Regressions in Production at Meta

Catching and attributing code change-induced performance regressions in production is hard; predicting them beforehand, even harder. A primer on automatically learning to predict performance regressions in software, this article gives an…

Software Engineering · Computer Science 2023-05-23 Moritz Beller , Hongyu Li , Vivek Nair , Vijayaraghavan Murali , Imad Ahmad , Jürgen Cito , Drew Carlson , Ari Aye , Wes Dyer

Parameter-Efficient Interventions for Enhanced Model Merging

Model merging combines knowledge from task-specific models into a unified multi-task model to avoid joint training on all task data. However, current methods face challenges due to representation bias, which can interfere with tasks…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Marcin Osial , Daniel Marczak , Bartosz Zieliński

Robust Graph Representation Learning via Predictive Coding

Predictive coding is a message-passing framework initially developed to model information processing in the brain, and now also topic of research in machine learning due to some interesting properties. One of such properties is the natural…

Machine Learning · Computer Science 2022-12-12 Billy Byiringiro , Tommaso Salvatori , Thomas Lukasiewicz

GraphVec: Cross-Domain Graph Vectorization for Graph-Level Representation Learning

Learning universal graph representations across heterogeneous domains is difficult because graph datasets differ in topology, node-attribute semantics, feature dimensions, and even attribute availability. We propose GraphVec, a…

Machine Learning · Computer Science 2026-05-08 Qi Feng , Jicong Fan