Related papers: Behavior Structformer: Learning Players Representa…

200 papers

Methods for learning latent user representations from historical behavior logs have gained traction for recommendation tasks in e-commerce, content streaming, and other settings. However, this area still remains relatively underexplored in…

Sequential user modeling, a critical task in personalized recommender systems, focuses on predicting the next item a user would prefer, requiring a deep understanding of user behavior sequences. Despite the remarkable success of…

Artificial Intelligence · Computer Science 2023-10-10 Hao Wang , Jianxun Lian , Mingqi Wu , Haoxuan Li , Jiajun Fan , Wanyue Xu , Chaozhuo Li , Xing Xie

Structured layouts are preferable in many 2D visual contents (e.g., GUIs, webpages) since the structural information allows convenient layout editing. Computational frameworks can help create structured layouts but require heavy labor input.…

Graphics · Computer Science 2025-10-31 Xin Hu , Pengfei Xu , Jin Zhou , Hongbo Fu , Hui Huang

Recent advancements in imitation learning have led to transformer-based behavior foundation models (BFMs) that enable multi-modal, human-like control for humanoid agents. While excelling at zero-shot generation of robust behaviors, BFMs…

Machine Learning · Computer Science 2026-03-30 Ron Vainshtein , Zohar Rimon , Shie Mannor , Chen Tessler

Machine translation has seen rapid progress with the advent of Transformer-based models. These models have no explicit linguistic structure built into them, yet they may still implicitly learn structured relationships by attending to…

Data representation remains a fundamental challenge in machine learning, particularly when adapting sequence-based architectures like Transformers and Large Language Models (LLMs) for structured tabular data. Existing methods often fail to…

Machine Learning · Computer Science 2025-08-05 Kayvan Karim , Hani Ragab Hassen , Hadj Batatia

The Transformer architecture has become prominent in developing large causal language models. However, mechanisms to explain its capabilities are not well understood. Focused on the training process, here we establish a meta-learning view…

Machine Learning · Computer Science 2024-03-26 Xinbo Wu , Lav R. Varshney

Algorithms for the action segmentation task typically use temporal models to predict what action is occurring at each frame for a minute-long daily activity. Recent studies have shown the potential of Transformer in modeling the relations…

Computer Vision and Pattern Recognition · Computer Science 2021-10-19 Fangqiu Yi , Hongyu Wen , Tingting Jiang

Transformers have transformed modern machine learning, driving breakthroughs in computer vision, natural language processing, and robotics. At the core of their success lies the attention mechanism, which enables the modeling of global…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Hemanth Saratchandran , Simon Lucey

While deep learning has revolutionized research and applications in NLP and computer vision, this has not yet been the case for behavioral modeling and behavioral health applications. This is because the domain's datasets are smaller, have…

Machine Learning · Computer Science 2021-07-14 Mike A. Merrill , Tim Althoff

While behavior learning has made impressive progress in recent times, it lags behind computer vision and natural language processing due to its inability to leverage large, human-generated datasets. Human behaviors have wide variance,…

Machine Learning · Computer Science 2022-10-13 Nur Muhammad Mahi Shafiullah , Zichen Jeff Cui , Ariuntuya Altanzaya , Lerrel Pinto

Structured data, prevalent in tables, databases, and knowledge graphs, poses a significant challenge in its representation. With the advent of large language models (LLMs), there has been a shift towards linearization-based methods, which…

Computation and Language · Computer Science 2024-04-04 Yutong Shao , Ndapa Nakashole

Topology optimization enables the design of highly efficient and complex structures, but conventional iterative methods, such as SIMP-based approaches, often suffer from high computational costs and sensitivity to initial conditions.…

Computational Engineering, Finance, and Science · Computer Science 2025-09-18 Aaron Lutheran , Srijan Das , Alireza Tabarraei

Predictive business process monitoring focuses on predicting future characteristics of a running process using event logs. The foresight into process execution promises great potential for efficient operations, better resource management,…

Machine Learning · Computer Science 2021-04-05 Zaharah A. Bukhsh , Aaqib Saeed , Remco M. Dijkman

Next-token prediction serves as the dominant component in current neural language models. During the training phase, the model employs teacher forcing, which predicts tokens based on all preceding ground truth tokens. However, this approach…

Computation and Language · Computer Science 2024-10-28 Yongjing Yin , Junran Ding , Kai Song , Yue Zhang

Transformers have become the predominant architecture in foundation models due to their excellent performance across various domains. However, the substantial cost of scaling these models remains a significant concern. This problem arises…

Recent advances in explainable recommendations have explored the integration of language models to analyze natural language rationales for user-item interactions. Despite their potential, existing methods often rely on ID-based…

Machine Learning · Computer Science 2025-12-18 Xinshun Feng , Mingzhe Liu , Yi Qiao , Tongyu Zhu , Leilei Sun , Shuai Wang

Machine learning systems are often used in settings where individuals adapt their features to obtain a desired outcome. In such settings, strategic behavior leads to a sharp loss in model performance in deployment. In this work, we aim to…

Machine Learning · Computer Science 2021-06-11 Yatong Chen , Jialu Wang , Yang Liu

We propose Token Turing Machines (TTM), a sequential, autoregressive Transformer model with memory for real-world sequential visual understanding. Our model is inspired by the seminal Neural Turing Machine, and has an external memory…

Recently, heatmap regression methods based on 1D landmark representations have shown prominent performance in locating facial landmarks. However, previous methods have not deeply explored the potential of 1D landmark…

Computer Vision and Pattern Recognition · Computer Science 2024-02-02 Shi Yin , Shijie Huan , Shangfei Wang , Jinshui Hu , Tao Guo , Bing Yin , Baocai Yin , Cong Liu