Programming Languages · Computer Science
AI Powered Compiler Techniques for DL Code Optimization
Sanket Tavarageri, Gagandeep Goyal, Sasikanth Avancha, Bharat Kaul +1
2021-04-13
Artificial Intelligence · Computer Science
Adaptive Neural Compilation
Rudy Bunel, Alban Desmaison, Pushmeet Kohli, Philip H. S. Torr +1
2016-05-27
Machine Learning · Computer Science
Automating Generation of Low Precision Deep Learning Operators
Meghan Cowan, Thierry Moreau, Tianqi Chen, Luis Ceze
2018-10-29
Programming Languages · Computer Science
Learning to Make Compiler Optimizations More Effective
Rahim Mammadli, Marija Selakovic, Felix Wolf, Michael Pradel
2021-03-01
Information Theory · Computer Science
Successive Refinement in Large-Scale Computation: Advancing Model Inference Applications
Homa Esfahanizadeh, Alejandro Cohen, Shlomo Shamai, Muriel Medard
2024-02-13
Distributed, Parallel, and Cluster Computing · Computer Science
Gensor: A Graph-based Construction Tensor Compilation Method for Deep Learning
Hangda Liu, Boyu Diao, Yu Yang, Wenxin Chen +2
2025-02-18
Programming Languages · Computer Science
PolyScientist: Automatic Loop Transformations Combined with Microkernels for Optimization of Deep Learning Primitives
Sanket Tavarageri, Alexander Heinecke, Sasikanth Avancha, Gagandeep Goyal +2
2020-02-07
Distributed, Parallel, and Cluster Computing · Computer Science
A Collaborative Filtering Approach for the Automatic Tuning of Compiler Optimisations
Stefano Cereda, Gianluca Palermo, Paolo Cremonesi, Stefano Doni
2020-05-12
Emerging Technologies · Computer Science
High-Quality Iterative Logic Compiler for In-Memory SIMD Computation with Tight Coupling of Synthesis and Scheduling
Xingyue Qian, Chenyang Lv, Zhezhi He, Weikang Qian
2024-12-04
Machine Learning · Computer Science
TCL: Enabling Fast and Efficient Cross-Hardware Tensor Program Optimization via Continual Learning
Chaoyao Shen, Linfeng Jiang, Yixian Shen, Tao Xu +4
2026-04-15
Machine Learning · Computer Science
Learning to Optimize Tensor Programs
Tianqi Chen, Lianmin Zheng, Eddie Yan, Ziheng Jiang +4
2019-01-10
Performance · Computer Science
A Learned Performance Model for Tensor Processing Units
Samuel J. Kaufman, Phitchaya Mangpo Phothilimthana, Yanqi Zhou, Charith Mendis +3
2021-03-19
Distributed, Parallel, and Cluster Computing · Computer Science
TIRAMISU: A Polyhedral Compiler for Dense and Sparse Deep Learning
Riyadh Baghdadi, Abdelkader Nadir Debbagh, Kamel Abdous, Fatima Zohra Benhamida +4
2020-05-11
Distributed, Parallel, and Cluster Computing · Computer Science
Time-Based Roofline for Deep Learning Performance Analysis
Yunsong Wang, Charlene Yang, Steven Farrell, Yan Zhang +2
2020-09-24
Machine Learning · Computer Science
PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR
Zixuan Ma, Haojie Wang, Jingze Xing, Liyan Zheng +6
2023-07-12
Machine Learning · Statistics
Data-Efficient Kernel Methods for Learning Differential Equations and Their Solution Operators: Algorithms and Error Analysis
Yasamin Jalalian, Juan Felipe Osorio Ramirez, Alexander Hsu, Bamdad Hosseini +1
2025-04-07
Machine Learning · Computer Science
Bring Your Own Codegen to Deep Learning Compiler
Zhi Chen, Cody Hao Yu, Trevor Morris, Jorn Tuyls +5
2021-05-10
Machine Learning · Computer Science
Latent Replay for Real-Time Continual Learning
Lorenzo Pellegrini, Gabriele Graffieti, Vincenzo Lomonaco, Davide Maltoni
2020-03-05
Distributed, Parallel, and Cluster Computing · Computer Science
Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning
Scott Cyphers, Arjun K. Bansal, Anahita Bhiwandiwalla, Jayaram Bobba +17
2018-01-31