Related papers: Pre-training General Trajectory Embeddings with Ma…

Self-Supervised Learning via Maximum Entropy Coding

A mainstream type of current self-supervised learning methods pursues a general-purpose representation that can be well transferred to downstream tasks, typically by optimizing on a given pretext task such as instance discrimination. In…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Xin Liu, Zhongdao Wang, Yali Li, Shengjin Wang

UniTE: A Survey and Unified Pipeline for Pre-training Spatiotemporal Trajectory Embeddings

Spatiotemporal trajectories are sequences of timestamped locations, which enable a variety of analyses that in turn enable important real-world applications. It is common to map trajectories to vectors, called embeddings, before subsequent…

Machine Learning · Computer Science 2024-11-13 Yan Lin, Zeyu Zhou, Yicheng Liu, Haochen Lv, Haomin Wen, Tianyi Li, Yushuai Li, Christian S. Jensen, Shengnan Guo, Youfang Lin, Huaiyu Wan

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks

Embedding models have been crucial in enabling various downstream tasks such as semantic similarity, information retrieval, and clustering. Recently, there has been a surge of interest in developing universal text embedding models that can…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Ziyan Jiang, Rui Meng, Xinyi Yang, Semih Yavuz, Yingbo Zhou, Wenhu Chen

On Learning Informative Trajectory Embeddings for Imitation, Classification and Regression

In real-world sequential decision making tasks like autonomous driving, robotics, and healthcare, learning from observed state-action trajectories is critical for tasks like imitation, classification, and clustering. For example,…

Machine Learning · Computer Science 2025-01-20 Zichang Ge, Changyu Chen, Arunesh Sinha, Pradeep Varakantham

TrajTok: Adaptive Spatial Tokenization for Trajectory Representation Learning

Learning generalizable trajectory representations from raw GPS traces remains difficult because the data is continuous, noisy, and irregularly sampled. Spatial tokenization is also challenging: fine grids yield sparse cells with weak…

Machine Learning · Computer Science 2026-05-20 Zhen Xiong, Shang-Ling Hsu, Cyrus Shahabi

Towards Generalizable Representations of Mathematical Strategies

Pretrained encoders for mathematical texts have achieved significant improvements on various tasks such as formula classification and information retrieval. Yet they remain limited in representing and capturing student strategies for entire…

Computers and Society · Computer Science 2026-04-13 Siddhartha Pradhan, Ethan Prihar, Erin Ottmar

Embedding-based In-Context Prompt Training for Enhancing LLMs as Text Encoders

Large language models (LLMs) have been widely explored for embedding generation. While recent studies show that in-context learning (ICL) effectively enhances the representational capability of LLMs by prepending a few task-related…

Computation and Language · Computer Science 2026-05-05 Ailiang Lin, Zhuoyun Li, Keyu Mao, Kotaro Funakoshi, Manabu Okumura

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

Multimodal pretraining is an effective strategy for the trinity of goals of representation learning in autonomous robots: 1) extracting both local and global task progressions; 2) enforcing temporal consistency of visual representation; 3)…

Robotics · Computer Science 2024-05-27 Jianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan

Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval

Cross-modal video-text retrieval, a challenging task in the field of vision and language, aims at retrieving the corresponding instance given a sample from either modality. Existing approaches for this task all focus on how to design encoding…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Rui Zhao, Kecheng Zheng, Zheng-Jun Zha, Hongtao Xie, Jiebo Luo

Universal Embedding Function for Traffic Classification via QUIC Domain Recognition Pretraining: A Transfer Learning Success

Encrypted traffic classification (TC) methods must adapt to new protocols and extensions as well as to advancements in other machine learning fields. In this paper, we adopt a transfer learning setup best known from computer vision. We…

Machine Learning · Computer Science 2026-01-21 Jan Luxemburk, Karel Hynek, Richard Plný, Tomáš Čejka

Entropy-Driven Curriculum for Multi-Task Training in Human Mobility Prediction

The increasing availability of big mobility data from ubiquitous portable devices enables human mobility prediction through deep learning approaches. However, the diverse complexity of human mobility data impedes model training, leading to…

Machine Learning · Computer Science 2026-03-10 Tianye Fang, Xuanshu Luo, Martin Werner

Pre-training Contextual Location Embeddings in Personal Trajectories via Efficient Hierarchical Location Representations

Pre-training the embedding of a location generated from human mobility data has become a popular method for location based services. In practice, modeling the location embedding is too expensive, due to the large number of locations to be…

Artificial Intelligence · Computer Science 2023-10-03 Chung Park, Taesan Kim, Junui Hong, Minsung Choi, Jaegul Choo

Text and Code Embeddings by Contrastive Pre-Training

Text embeddings are useful features in many applications such as semantic search and computing text similarity. Previous work typically trains models customized for different use cases, varying in dataset choice, training objective and…

Computation and Language · Computer Science 2022-01-26 Arvind Neelakantan, Tao Xu, Raul Puri, Alec Radford, Jesse Michael Han, Jerry Tworek, Qiming Yuan, Nikolas Tezak, Jong Wook Kim, Chris Hallacy, Johannes Heidecke, Pranav Shyam, Boris Power, Tyna Eloundou Nekoul, Girish Sastry, Gretchen Krueger, David Schnurr, Felipe Petroski Such, Kenny Hsu, Madeleine Thompson, Tabarak Khan, Toki Sherbakov, Joanne Jang, Peter Welinder, Lilian Weng

Pre-training on Synthetic Driving Data for Trajectory Prediction

Accumulating substantial volumes of real-world driving data proves pivotal in the realm of trajectory forecasting for autonomous driving. Given the heavy reliance of current trajectory forecasting models on data-driven methodologies, we aim…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Yiheng Li, Seth Z. Zhao, Chenfeng Xu, Chen Tang, Chenran Li, Mingyu Ding, Masayoshi Tomizuka, Wei Zhan

Goal-Conditioned Variational Autoencoder Trajectory Primitives with Continuous and Discrete Latent Codes

Imitation learning is an intuitive approach for teaching motion to robotic systems. Although previous studies have proposed various methods to model demonstrated movement primitives, one of the limitations of existing methods is that the…

Robotics · Computer Science 2020-09-24 Takayuki Osa, Shuhei Ikemoto

Supervised Contextual Embeddings for Transfer Learning in Natural Language Processing Tasks

Pre-trained word embeddings are the primary method for transfer learning in several Natural Language Processing (NLP) tasks. Recent works have focused on using unsupervised techniques such as language modeling to obtain these embeddings. In…

Computation and Language · Computer Science 2019-07-01 Mihir Kale, Aditya Siddhant, Sreyashi Nag, Radhika Parik, Matthias Grabmair, Anthony Tomasic

GLID: Pre-training a Generalist Encoder-Decoder Vision Model

This paper proposes a GeneraLIst encoder-Decoder (GLID) pre-training method for better handling various downstream computer vision tasks. While self-supervised pre-training approaches, e.g., Masked Autoencoder, have shown success in…

Computer Vision and Pattern Recognition · Computer Science 2024-04-12 Jihao Liu, Jinliang Zheng, Yu Liu, Hongsheng Li

rETF-semiSL: Semi-Supervised Learning for Neural Collapse in Temporal Data

Deep neural networks for time series must capture complex temporal patterns, to effectively represent dynamic data. Self- and semi-supervised learning methods show promising results in pre-training large models, which -- when finetuned for…

Machine Learning · Computer Science 2025-08-15 Yuhan Xie, William Cappelletti, Mahsa Shoaran, Pascal Frossard

Pre-trained Transformer Uncovers Meaningful Patterns in Human Mobility Data

We empirically demonstrate that a transformer pre-trained on country-scale unlabeled human mobility data learns embeddings capable, through fine-tuning, of developing a deep understanding of the target geography and its corresponding…

Computers and Society · Computer Science 2024-12-13 Alameen Najjar

Word-Class Embeddings for Multiclass Text Classification

Pre-trained word embeddings encode general word semantics and lexical regularities of natural language, and have proven useful across many NLP tasks, including word sense disambiguation, machine translation, and sentiment analysis, to name…

Machine Learning · Computer Science 2021-09-22 Alejandro Moreo, Andrea Esuli, Fabrizio Sebastiani