Related papers: Encoding large information structures in linear al…

Encoding architecture algebra

Despite the wide variety of input types in machine learning, this diversity is often not fully reflected in their representations or model architectures, leading to inefficiencies throughout a model's lifecycle. This paper introduces an…

Machine Learning · Computer Science 2024-10-16 Stephane Bersier, Xinyi Chen-Lin

Interpreting Encoding and Decoding Models

Encoding and decoding models are widely used in systems, cognitive, and computational neuroscience to make sense of brain-activity data. However, the interpretation of their results requires care. Decoding models can help reveal whether…

Neurons and Cognition · Quantitative Biology 2019-04-29 Nikolaus Kriegeskorte, Pamela K. Douglas

Large-margin Learning of Compact Binary Image Encodings

The use of high-dimensional features has become a normal practice in many computer vision applications. The large dimension of these features is a limiting factor upon the number of data points which may be effectively stored and processed,…

Computer Vision and Pattern Recognition · Computer Science 2015-06-18 Sakrapee Paisitkriangkrai, Chunhua Shen, Anton van den Hengel

A Local Information Criterion for Dynamical Systems

Encoding a sequence of observations is an essential task with many applications. The encoding can become highly efficient when the observations are generated by a dynamical system. A dynamical system imposes regularities on the observations…

Machine Learning · Statistics 2018-05-29 Arash Mehrjou, Friedrich Solowjow, Sebastian Trimpe, Bernhard Schölkopf

Language Models Encode the Value of Numbers Linearly

Large language models (LLMs) have exhibited impressive competence in various tasks, but their internal mechanisms on mathematical problems are still under-explored. In this paper, we study a fundamental question: how language models encode…

Computation and Language · Computer Science 2024-11-15 Fangwei Zhu, Damai Dai, Zhifang Sui

Directional Statistics in Machine Learning: a Brief Review

The modern data analyst must cope with data encoded in various forms, vectors, matrices, strings, graphs, or more. Consequently, statistical and machine learning models tailored to different data encodings are important. We focus on data…

Machine Learning · Statistics 2016-05-03 Suvrit Sra

Learning and analyzing vector encoding of symbolic representations

We present a formal language with expressions denoting general symbol structures and queries which access information in those structures. A sequence-to-sequence network processing this language learns to encode symbol structures and query…

Artificial Intelligence · Computer Science 2018-03-13 Roland Fernandez, Asli Celikyilmaz, Rishabh Singh, Paul Smolensky

Encoding and Understanding Astrophysical Information in Large Language Model-Generated Summaries

Large Language Models have demonstrated the ability to generalize well at many levels across domains, modalities, and even shown in-context learning capabilities. This enables research questions regarding how they can be used to encode…

Computation and Language · Computer Science 2025-11-19 Kiera McCormick, Rafael Martínez-Galarza

Deep Tensor Encoding

Learning an encoding of feature vectors in terms of an over-complete dictionary or an information-geometric (Fisher vectors) construct is widespread in statistical signal processing and computer vision. In content based information…

Information Retrieval · Computer Science 2017-11-15 B Sengupta, E Vasquez, Y Qian

Feature Coding in the Era of Large Models: Dataset, Test Conditions, and Benchmark

Large models have achieved remarkable performance across various tasks, yet they incur significant computational costs and privacy concerns during both training and inference. Distributed deployment has emerged as a potential solution, but…

Multimedia · Computer Science 2025-09-03 Changsheng Gao, Yifan Ma, Qiaoxi Chen, Yenan Xu, Dong Liu, Weisi Lin

Learning Representation for Mixed Data Types with a Nonlinear Deep Encoder-Decoder Framework

Representation of data on mixed variables, numerical and categorical types to get suitable feature map is a challenging task as important information lies in a complex non-linear manifold. The feature transformation should be able to…

Machine Learning · Computer Science 2020-09-22 Saswata Sahoo, Souradip Chakraborty

Large Language Models: A Mathematical Formulation

Large language models (LLMs) process and predict sequences containing text to answer questions, and address tasks including document summarization, providing recommendations, writing software and solving quantitative problems. We provide a…

Numerical Analysis · Mathematics 2026-02-02 Ricardo Baptista, Andrew Stuart, Son Tran

On Learning Prediction-Focused Mixtures

Probabilistic models help us encode latent structures that both model the data and are ideally also useful for specific downstream tasks. Among these, mixture models and their time-series counterparts, hidden Markov models, identify…

Machine Learning · Computer Science 2021-10-29 Abhishek Sharma, Catherine Zeng, Sanjana Narayanan, Sonali Parbhoo, Finale Doshi-Velez

Structural Deep Encoding for Table Question Answering

Although Transformers-based architectures excel at processing textual information, their naive adaptation for tabular data often involves flattening the table structure. This simplification can lead to the loss of essential…

Computation and Language · Computer Science 2025-03-04 Raphaël Mouravieff, Benjamin Piwowarski, Sylvain Lamprier

Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation

In language processing, transformers benefit greatly from text being condensed. This is achieved through a larger vocabulary that captures word fragments instead of plain characters. This is often done with Byte Pair Encoding. In the…

Computer Vision and Pattern Recognition · Computer Science 2024-11-18 Tim Elsner, Paula Usinger, Julius Nehring-Wirxel, Gregor Kobsik, Victor Czech, Yanjiang He, Isaak Lim, Leif Kobbelt

A Self-Encoder for Learning Nearest Neighbors

We present the self-encoder, a neural network trained to guess the identity of each data sample. Despite its simplicity, it learns a very useful representation of data, in a self-supervised way. Specifically, the self-encoder learns to…

Machine Learning · Computer Science 2023-06-27 Armand Boschin, Thomas Bonald, Marc Jeanmougin

Deep Double Sparsity Encoder: Learning to Sparsify Not Only Features But Also Parameters

This paper emphasizes the significance to jointly exploit the problem structure and the parameter structure, in the context of deep modeling. As a specific and interesting example, we describe the deep double sparsity encoder (DDSE), which…

Machine Learning · Computer Science 2016-10-04 Zhangyang Wang, Thomas S. Huang

Explainable Predictive Modeling for Limited Spectral Data

Feature selection of high-dimensional labeled data with limited observations is critical for making powerful predictive modeling accessible, scalable, and interpretable for domain experts. Spectroscopy data, which records the interaction…

Machine Learning · Computer Science 2022-02-10 Frantishek Akulich, Hadis Anahideh, Manaf Sheyyab, Dhananjay Ambre

Compressing Large Sample Data for Discriminant Analysis

Large-sample data became prevalent as data acquisition became cheaper and easier. While a large sample size has theoretical advantages for many statistical methods, it presents computational challenges. Sketching, or compression, is a…

Machine Learning · Statistics 2020-05-11 Alexander F. Lapanowski, Irina Gaynanova

Learning-Time Encoding Shapes Unlearning in LLMs

As large language models (LLMs) are increasingly deployed in the real world, the ability to "unlearn", or remove specific pieces of knowledge post hoc, has become essential for a variety of reasons ranging from privacy regulations to…

Computation and Language · Computer Science 2025-06-19 Ruihan Wu, Konstantin Garov, Kamalika Chaudhuri