Related papers: Modularized Transfomer-based Ranking Framework

Personalized Re-ranking for Recommendation

Ranking is a core task in recommender systems, which aims at providing an ordered list of items to users. Typically, a ranking function is learned from the labeled dataset to optimize the global performance, which produces a ranking score…

Information Retrieval · Computer Science 2019-07-24 Changhua Pei , Yi Zhang , Yongfeng Zhang , Fei Sun , Xiao Lin , Hanxiao Sun , Jian Wu , Peng Jiang , Wenwu Ou

Position Information in Transformers: An Overview

Transformers are arguably the main workhorse in recent Natural Language Processing research. By definition a Transformer is invariant with respect to reordering of the input. However, language is inherently sequential and word order is…

Computation and Language · Computer Science 2021-09-10 Philipp Dufter , Martin Schmitt , Hinrich Schütze

Rethinking the Value of Transformer Components

Transformer becomes the state-of-the-art translation model, while it is not well studied how each intermediate component contributes to the model performance, which poses significant challenges for designing optimal architectures. In this…

Computation and Language · Computer Science 2020-11-10 Wenxuan Wang , Zhaopeng Tu

Efficient Transformers: A Survey

Transformer model architectures have garnered immense interest lately due to their effectiveness across a range of domains like language, vision and reinforcement learning. In the field of natural language processing for example,…

Machine Learning · Computer Science 2022-03-15 Yi Tay , Mostafa Dehghani , Dara Bahri , Donald Metzler

Language Ranker: A Lightweight Ranking framework for LLM Decoding

Conventional research on large language models (LLMs) has primarily focused on refining output distributions, while paying less attention to the decoding process that transforms these distributions into final responses. Recent advances,…

Computation and Language · Computer Science 2025-10-28 Chenheng Zhang , Tianqi Du , Jizhe Zhang , Mingqing Xiao , Yifei Wang , Yisen Wang , Zhouchen Lin

TransformerRanker: A Tool for Efficiently Finding the Best-Suited Language Models for Downstream Classification Tasks

Classification tasks in NLP are typically addressed by selecting a pre-trained language model (PLM) from a model hub, and fine-tuning it for the task at hand. However, given the very large number of PLMs that are currently available, a…

Computation and Language · Computer Science 2024-09-11 Lukas Garbas , Max Ploner , Alan Akbik

Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference

Pre-trained Transformer models like T5 and BART have advanced the state of the art on a wide range of text generation tasks. Compressing these models into smaller ones has become critically important for practical use. Common neural network…

Computation and Language · Computer Science 2023-06-06 Wangchunshu Zhou , Ronan Le Bras , Yejin Choi

Modular Deep Learning

Transfer learning has recently become the dominant paradigm of machine learning. Pre-trained models fine-tuned for downstream tasks achieve better performance with fewer labelled examples. Nonetheless, it remains unclear how to develop…

Machine Learning · Computer Science 2024-01-30 Jonas Pfeiffer , Sebastian Ruder , Ivan Vulić , Edoardo Maria Ponti

Reduced-order Modeling of Modular, Position-dependent Systems with Translating Interfaces

Many complex mechatronic systems consist of multiple interconnected dynamical subsystems, which are designed, developed, analyzed, and manufactured by multiple independent teams. To support such a design approach, a modular model framework…

Systems and Control · Electrical Eng. & Systems 2024-02-13 Robert A. Egelmeers , Lars A. L. Janssen , Rob H. B. Fey , Jasper Gerritsen , Nathan van de Wouw

Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems

Learning feature interaction is the critical backbone to building recommender systems. In web-scale applications, learning feature interaction is extremely challenging due to the sparse and large input feature space; meanwhile, manually…

Information Retrieval · Computer Science 2023-11-13 Huan Gui , Ruoxi Wang , Ke Yin , Long Jin , Maciej Kula , Taibai Xu , Lichan Hong , Ed H. Chi

Pretrained Transformers for Text Ranking: BERT and Beyond

The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in response to a query. Although the most common formulation of text ranking is search, instances of the task can also be found in many natural…

Information Retrieval · Computer Science 2021-08-20 Jimmy Lin , Rodrigo Nogueira , Andrew Yates

SORT: A Systematically Optimized Ranking Transformer for Industrial-scale Recommenders

While Transformers have achieved remarkable success in LLMs through superior scalability, their application in industrial-scale ranking models remains nascent, hindered by the challenges of high feature sparsity and low label density. In…

Information Retrieval · Computer Science 2026-03-05 Chunqi Wang , Bingchao Wu , Taotian Pang , Jiahao Wang , Jie Yang , Jia Liu , Hao Zhang , Hai Zhu , Lei Shen , Shizhun Wang , Bing Wang , Xiaoyi Zeng

Latent Structured Ranking

Many latent (factorized) models have been proposed for recommendation tasks like collaborative filtering and for ranking tasks like document or image retrieval and annotation. Common to all those methods is that during inference the items…

Machine Learning · Computer Science 2012-10-19 Jason Weston , John Blitzer

From Features to Transformers: Redefining Ranking for Scalable Impact

We present LiGR, a large-scale ranking framework developed at LinkedIn that brings state-of-the-art transformer-based modeling architectures into production. We introduce a modified transformer architecture that incorporates learned…

Machine Learning · Computer Science 2026-02-10 Fedor Borisyuk , Lars Hertel , Ganesh Parameswaran , Gaurav Srivastava , Sudarshan Srinivasa Ramanujam , Borja Ocejo , Peng Du , Andrei Akterskii , Neil Daftary , Shao Tang , Daqi Sun , Qiang Charles Xiao , Deepesh Nathani , Mohit Kothari , Yun Dai , Guoyao Li , Aman Gupta

Improving Transformer Models by Reordering their Sublayers

Multilayer transformer networks consist of interleaved self-attention and feedforward sublayers. Could ordering the sublayers in a different pattern lead to better performance? We generate randomly ordered transformers and train them with…

Computation and Language · Computer Science 2020-04-24 Ofir Press , Noah A. Smith , Omer Levy

Rankformer: A Graph Transformer for Recommendation based on Ranking Objective

Recommender Systems (RS) aim to generate personalized ranked lists for each user and are evaluated using ranking metrics. Although personalized ranking is a fundamental aspect of RS, this critical property is often overlooked in the design…

Information Retrieval · Computer Science 2025-03-24 Sirui Chen , Shen Han , Jiawei Chen , Binbin Hu , Sheng Zhou , Gang Wang , Yan Feng , Chun Chen , Can Wang

Transformers Meet Relational Databases

Transformer models have continuously expanded into all machine learning domains convertible to the underlying sequence-to-sequence representation, including tabular data. However, while ubiquitous, this representation restricts their…

Machine Learning · Computer Science 2025-07-24 Jakub Peleška , Gustav Šír

Ranking Micro-Influencers: a Novel Multi-Task Learning and Interpretable Framework

With the rise in use of social media to promote branded products, the demand for effective influencer marketing has increased. Brands are looking for improved ways to identify valuable influencers among a vast catalogue; this is even more…

Machine Learning · Computer Science 2021-12-02 Adam Elwood , Alberto Gasparin , Alessandro Rozza

RankUp: Towards High-rank Representations for Large Scale Advertising Recommender Systems

The scaling laws for recommender systems have been increasingly validated, where MetaFormer-based architectures consistently benefit from increased model depth, hidden dimensionality, and user behavior sequence length. However, whether…

Information Retrieval · Computer Science 2026-05-13 Jin Chen , Shangyu Zhang , Bin Hu , Chao Zhou , Junwei Pan , Gengsheng Xue , Wentao Ning , Gengyu Weng , Wang Zheng , Shaohua Liu , Zeen Xu , Chengyuan Mai , Shijie Quan , Tingyu Jiang , Lifeng Wang , Shudong Huang , Chengguo Yin , Haijie Gu , Jie Jiang

Addressing Some Limitations of Transformers with Feedback Memory

Transformers have been successfully applied to sequential, auto-regressive tasks despite being feedforward networks. Unlike recurrent neural networks, Transformers use attention to capture temporal relations while processing input tokens in…

Machine Learning · Computer Science 2021-01-26 Angela Fan , Thibaut Lavril , Edouard Grave , Armand Joulin , Sainbayar Sukhbaatar