Related papers: Behavior Structformer: Learning Players Representa…

player2vec: A Language Modeling Approach to Understand Player Behavior in Games

Methods for learning latent user representations from historical behavior logs have gained traction for recommendation tasks in e-commerce, content streaming, and other settings. However, this area still remains relatively underexplored in…

Machine Learning · Computer Science 2024-06-11 Tianze Wang , Maryam Honari-Jahromi , Styliani Katsarou , Olga Mikheeva , Theodoros Panagiotakopoulos , Sahar Asadi , Oleg Smirnov

ConvFormer: Revisiting Transformer for Sequential User Modeling

Sequential user modeling, a critical task in personalized recommender systems, focuses on predicting the next item a user would prefer, requiring a deep understanding of user behavior sequences. Despite the remarkable success of…

Artificial Intelligence · Computer Science 2023-10-10 Hao Wang , Jianxun Lian , Mingqi Wu , Haoxuan Li , Jiajun Fan , Wanyue Xu , Chaozhuo Li , Xing Xie

StructLayoutFormer:Conditional Structured Layout Generation via Structure Serialization and Disentanglement

Structured layouts are preferable in many 2D visual contents (\eg, GUIs, webpages) since the structural information allows convenient layout editing. Computational frameworks can help create structured layouts but require heavy labor input.…

Graphics · Computer Science 2025-10-31 Xin Hu , Pengfei Xu , Jin Zhou , Hongbo Fu , Hui Huang

Task Tokens: A Flexible Approach to Adapting Behavior Foundation Models

Recent advancements in imitation learning have led to transformer-based behavior foundation models (BFMs) that enable multi-modal, human-like control for humanoid agents. While excelling at zero-shot generation of robust behaviors, BFMs…

Machine Learning · Computer Science 2026-03-30 Ron Vainshtein , Zohar Rimon , Shie Mannor , Chen Tessler

Structural Biases for Improving Transformers on Translation into Morphologically Rich Languages

Machine translation has seen rapid progress with the advent of Transformer-based models. These models have no explicit linguistic structure built into them, yet they may still implicitly learn structured relationships by attending to…

Computation and Language · Computer Science 2022-08-15 Paul Soulos , Sudha Rao , Caitlin Smith , Eric Rosen , Asli Celikyilmaz , R. Thomas McCoy , Yichen Jiang , Coleman Haley , Roland Fernandez , Hamid Palangi , Jianfeng Gao , Paul Smolensky

Innovative tokenisation of structured data for LLM training

Data representation remains a fundamental challenge in machine learning, particularly when adapting sequence-based architectures like Transformers and Large Language Models (LLMs) for structured tabular data. Existing methods often fail to…

Machine Learning · Computer Science 2025-08-05 Kayvan Karim , Hani Ragab Hassen. Hadj Batatia

A Meta-Learning Perspective on Transformers for Causal Language Modeling

The Transformer architecture has become prominent in developing large causal language models. However, mechanisms to explain its capabilities are not well understood. Focused on the training process, here we establish a meta-learning view…

Machine Learning · Computer Science 2024-03-26 Xinbo Wu , Lav R. Varshney

ASFormer: Transformer for Action Segmentation

Algorithms for the action segmentation task typically use temporal models to predict what action is occurring at each frame for a minute-long daily activity. Recent studies have shown the potential of Transformer in modeling the relations…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Fangqiu Yi , Hongyu Wen , Tingting Jiang

Enhancing Transformers Through Conditioned Embedded Tokens

Transformers have transformed modern machine learning, driving breakthroughs in computer vision, natural language processing, and robotics. At the core of their success lies the attention mechanism, which enables the modeling of global…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Hemanth Saratchandran , Simon Lucey

Transformer-Based Behavioral Representation Learning Enables Transfer Learning for Mobile Sensing in Small Datasets

While deep learning has revolutionized research and applications in NLP and computer vision, this has not yet been the case for behavioral modeling and behavioral health applications. This is because the domain's datasets are smaller, have…

Machine Learning · Computer Science 2021-07-14 Mike A. Merrill , Tim Althoff

Behavior Transformers: Cloning $k$ modes with one stone

While behavior learning has made impressive progress in recent times, it lags behind computer vision and natural language processing due to its inability to leverage large, human-generated datasets. Human behaviors have wide variance,…

Machine Learning · Computer Science 2022-10-13 Nur Muhammad Mahi Shafiullah , Zichen Jeff Cui , Ariuntuya Altanzaya , Lerrel Pinto

On Linearizing Structured Data in Encoder-Decoder Language Models: Insights from Text-to-SQL

Structured data, prevalent in tables, databases, and knowledge graphs, poses a significant challenge in its representation. With the advent of large language models (LLMs), there has been a shift towards linearization-based methods, which…

Computation and Language · Computer Science 2024-04-04 Yutong Shao , Ndapa Nakashole

Transformer-based Topology Optimization

Topology optimization enables the design of highly efficient and complex structures, but conventional iterative methods, such as SIMP-based approaches, often suffer from high computational costs and sensitivity to initial conditions.…

Computational Engineering, Finance, and Science · Computer Science 2025-09-18 Aaron Lutheran , Srijan Das , Alireza Tabarraei

ProcessTransformer: Predictive Business Process Monitoring with Transformer Network

Predictive business process monitoring focuses on predicting future characteristics of a running process using event logs. The foresight into process execution promises great potentials for efficient operations, better resource management,…

Machine Learning · Computer Science 2021-04-05 Zaharah A. Bukhsh , Aaqib Saeed , Remco M. Dijkman

Semformer: Transformer Language Models with Semantic Planning

Next-token prediction serves as the dominant component in current neural language models. During the training phase, the model employs teacher forcing, which predicts tokens based on all preceding ground truth tokens. However, this approach…

Computation and Language · Computer Science 2024-10-28 Yongjing Yin , Junran Ding , Kai Song , Yue Zhang

TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

Transformers have become the predominant architecture in foundation models due to their excellent performance across various domains. However, the substantial cost of scaling these models remains a significant concern. This problem arises…

Machine Learning · Computer Science 2025-03-25 Haiyang Wang , Yue Fan , Muhammad Ferjad Naeem , Yongqin Xian , Jan Eric Lenssen , Liwei Wang , Federico Tombari , Bernt Schiele

Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary

Recent advances in explainable recommendations have explored the integration of language models to analyze natural language rationales for user-item interactions. Despite their potential, existing methods often rely on ID-based…

Machine Learning · Computer Science 2025-12-18 Xinshun Feng , Mingzhe Liu , Yi Qiao , Tongyu Zhu , Leilei Sun , Shuai Wang

Linear Classifiers that Encourage Constructive Adaptation

Machine learning systems are often used in settings where individuals adapt their features to obtain a desired outcome. In such settings, strategic behavior leads to a sharp loss in model performance in deployment. In this work, we aim to…

Machine Learning · Computer Science 2021-06-11 Yatong Chen , Jialu Wang , Yang Liu

Token Turing Machines

We propose Token Turing Machines (TTM), a sequential, autoregressive Transformer model with memory for real-world sequential visual understanding. Our model is inspired by the seminal Neural Turing Machine, and has an external memory…

Machine Learning · Computer Science 2023-04-14 Michael S. Ryoo , Keerthana Gopalakrishnan , Kumara Kahatapitiya , Ted Xiao , Kanishka Rao , Austin Stone , Yao Lu , Julian Ibarz , Anurag Arnab

1DFormer: a Transformer Architecture Learning 1D Landmark Representations for Facial Landmark Tracking

Recently, heatmap regression methods based on 1D landmark representations have shown prominent performance on locating facial landmarks. However, previous methods ignored to make deep explorations on the good potentials of 1D landmark…

Computer Vision and Pattern Recognition · Computer Science 2024-02-02 Shi Yin , Shijie Huan , Shangfei Wang , Jinshui Hu , Tao Guo , Bing Yin , Baocai Yin , Cong Liu