Related papers: Modeling Layout Abstractions Using Integer Set Rel…

CuTe Layout Representation and Algebra

Modern architectures for high-performance computing and deep learning increasingly incorporate specialized tensor instructions, including tensor cores for matrix multiplication and hardware-optimized copy operations for multi-dimensional…

Mathematical Software · Computer Science 2026-03-04 Cris Cecka

Linear Layouts: Robust Code Generation of Efficient Tensor Computation Using $\mathbb{F}_2$

Efficient tensor computation is a cornerstone of modern deep learning (DL) workloads, yet existing approaches struggle to achieve flexible and performant design and implementation of tensor layouts -- mappings between logical tensors and…

Programming Languages · Computer Science 2026-03-09 Keren Zhou , Mario Lezcano , Adam Goucher , Akhmed Rakhmati , Jeff Niu , Justin Lebar , Pawel Szczerbuk , Peter Bell , Phil Tillet , Thomas Raoux , Zahi Moudallal

A general tensor-structured compression scheme for efficient large language models

Large language models (LLMs) are dominated by dense linear transformations, whose storage, memory and computational overheads hinder efficient adaptation and deployment while masking the functional impacts of structural simplification. Here…

Computation and Language · Computer Science 2026-05-26 Ying Lu , Peng-Fei Zhou , Qi-Xuan Fang , Pan Zhang , Shi-Ju Ran , Gang Su

Axe: A Simple Unified Layout Abstraction for Machine Learning Compilers

Scaling modern deep learning workloads demands coordinated placement of data and compute across device meshes, memory hierarchies, and heterogeneous accelerators. We present Axe Layout, a hardware-aware abstraction that maps logical tensor…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-01-30 Bohan Hou , Hongyi Jin , Guanjie Wang , Jinqi Chen , Yaxing Cai , Lijie Yang , Zihao Ye , Yaoyao Ding , Ruihang Lai , Tianqi Chen

Logic Tensor Networks for Semantic Image Interpretation

Semantic Image Interpretation (SII) is the task of extracting structured semantic descriptions from images. It is widely agreed that the combined use of visual data and background knowledge is of great importance for SII. Recently,…

Artificial Intelligence · Computer Science 2017-05-26 Ivan Donadello , Luciano Serafini , Artur d'Avila Garcez

A Unified Iteration Space Transformation Framework for Sparse and Dense Tensor Algebra

We address the problem of optimizing mixed sparse and dense tensor algebra in a compiler. We show that standard loop transformations, such as strip-mining, tiling, collapsing, parallelization and vectorization, can be applied to irregular…

Mathematical Software · Computer Science 2020-01-03 Ryan Senanayake , Fredrik Kjolstad , Changwan Hong , Shoaib Kamil , Saman Amarasinghe

On the Symmetries of Deep Learning Models and their Internal Representations

Symmetry is a fundamental tool in the exploration of a broad range of complex systems. In machine learning symmetry has been explored in both models and data. In this paper we seek to connect the symmetries arising from the architecture of…

Machine Learning · Computer Science 2023-03-27 Charles Godfrey , Davis Brown , Tegan Emerson , Henry Kvinge

Self-supervised Photographic Image Layout Representation Learning

In the domain of image layout representation learning, the critical process of translating image layouts into succinct vector forms is increasingly significant across diverse applications, such as image retrieval, manipulation, and…

Computer Vision and Pattern Recognition · Computer Science 2024-08-21 Zhaoran Zhao , Peng Lu , Xujun Peng , Wenhao Guo

CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval

Image-Text Retrieval (ITR) is challenging in bridging visual and lingual modalities. Contrastive learning has been adopted by most prior arts. Except for limited amount of negative image-text pairs, the capability of constrastive learning…

Computer Vision and Pattern Recognition · Computer Science 2026-03-27 Haoran Wang , Dongliang He , Wenhao Wu , Boyang Xia , Min Yang , Fu Li , Yunlong Yu , Zhong Ji , Errui Ding , Jingdong Wang

Multi-relational Learning Using Weighted Tensor Decomposition with Modular Loss

We propose a modular framework for multi-relational learning via tensor decomposition. In our learning setting, the training data contains multiple types of relationships among a set of objects, which we represent by a sparse three-mode…

Machine Learning · Computer Science 2013-06-04 Ben London , Theodoros Rekatsinas , Bert Huang , Lise Getoor

Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting

Language models have the ability to perform in-context learning (ICL), allowing them to flexibly adapt their behavior based on context. This contrasts with in-weights learning (IWL), where memorized information is encoded in model…

Computation and Language · Computer Science 2025-03-04 Suraj Anand , Michael A. Lepori , Jack Merullo , Ellie Pavlick

Layer-stacked Attention for Heterogeneous Network Embedding

The heterogeneous network is a robust data abstraction that can model entities of different types interacting in various ways. Such heterogeneity brings rich semantic information but presents nontrivial challenges in aggregating the…

Machine Learning · Computer Science 2020-09-18 Nhat Tran , Jean Gao

Learning Relational Representations with Auto-encoding Logic Programs

Deep learning methods capable of handling relational data have proliferated over the last years. In contrast to traditional relational learning methods that leverage first-order logic for representing such data, these deep learning methods…

Machine Learning · Computer Science 2020-03-25 Sebastijan Dumancic , Tias Guns , Wannes Meert , Hendrik Blockeel

Inference over Unseen Entities, Relations and Literals on Knowledge Graphs

In recent years, knowledge graph embedding models have been successfully applied in the transductive setting to tackle various challenging tasks including link prediction, and query answering. Yet, the transductive setting does not allow…

Machine Learning · Computer Science 2024-10-10 Caglar Demir , N'Dah Jean Kouagou , Arnab Sharma , Axel-Cyrille Ngonga Ngomo

Interpretable learning of effective dynamics for multiscale systems

The modeling and simulation of high-dimensional multiscale systems is a critical challenge across all areas of science and engineering. It is broadly believed that even with today's computer advances resolving all spatiotemporal scales…

Machine Learning · Statistics 2023-09-13 Emmanuel Menier , Sebastian Kaltenbach , Mouadh Yagoubi , Marc Schoenauer , Petros Koumoutsakos

Compressing Structured Tensor Algebra

Tensor algebra is a crucial component for data-intensive workloads such as machine learning and scientific computing. As the complexity of data grows, scientists often encounter a dilemma between the highly specialized dense tensor algebra…

Programming Languages · Computer Science 2024-07-19 Mahdi Ghorbani , Emilien Bauer , Tobias Grosser , Amir Shaikhha

Knowing Where and What: Unified Word Block Pretraining for Document Understanding

Due to the complex layouts of documents, it is challenging to extract information for documents. Most previous studies develop multimodal pre-trained models in a self-supervised way. In this paper, we focus on the embedding learning of word…

Computation and Language · Computer Science 2022-08-01 Song Tao , Zijian Wang , Tiantian Fan , Canjie Luo , Can Huang

Embodying computation in nonlinear perturbative metamaterials

Designing metamaterials that carry out advanced computations poses a significant challenge. A powerful design strategy splits the problem into two steps: First, encoding the desired functionality in a discrete or tight-binding model, and…

Mesoscale and Nanoscale Physics · Physics 2025-09-03 Sima Zahedi Fard , Paolo Tiso , Parisa Omidvar , Marc Serra-Garcia

Zero-Shot Learning via Latent Space Encoding

Zero-Shot Learning (ZSL) is typically achieved by resorting to a class semantic embedding space to transfer the knowledge from the seen classes to unseen ones. Capturing the common semantic characteristics between the visual modality and…

Computer Vision and Pattern Recognition · Computer Science 2018-04-23 Yunlong Yu , Zhong Ji , Jichang Guo , Zhongfei , Zhang

Domain Adaptation Meets Zero-Shot Learning: An Annotation-Efficient Approach to Multi-Modality Medical Image Segmentation

Due to the lack of properly annotated medical data, exploring the generalization capability of the deep model is becoming a public concern. Zero-shot learning (ZSL) has emerged in recent years to equip the deep model with the ability to…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Cheng Bian , Chenglang Yuan , Kai Ma , Shuang Yu , Dong Wei , Yefeng Zheng