Related papers: Goal-Conditioned Supervised Learning for Multi-Obj…

Multi-Objective Meta Learning

Meta learning with multiple objectives can be formulated as a Multi-Objective Bi-Level optimization Problem (MOBLP) where the upper-level subproblem is to solve several possible conflicting targets for the meta learner. However, existing…

Machine Learning · Computer Science 2021-02-16 Feiyang Ye , Baijiong Lin , Zhixiong Yue , Pengxin Guo , Qiao Xiao , Yu Zhang

Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL

Solving goal-conditioned tasks with sparse rewards using self-supervised learning is promising because of its simplicity and stability over current reinforcement learning (RL) algorithms. A recent work, called Goal-Conditioned Supervised…

Machine Learning · Computer Science 2022-02-15 Rui Yang , Yiming Lu , Wenzhe Li , Hao Sun , Meng Fang , Yali Du , Xiu Li , Lei Han , Chongjie Zhang

Autonomous Learning From Success and Failure: Goal-Conditioned Supervised Learning with Negative Feedback

Reinforcement learning faces significant challenges when applied to tasks characterized by sparse reward structures. Although imitation learning, within the domain of supervised learning, offers faster convergence, it relies heavily on…

Machine Learning · Computer Science 2025-09-04 Zeqiang Zhang , Fabian Wurzberger , Gerrit Schmid , Sebastian Gottwald , Daniel A. Braun

Goal-Conditioned Supervised Learning with Sub-Goal Prediction

Recently, a simple yet effective algorithm -- goal-conditioned supervised-learning (GCSL) -- was proposed to tackle goal-conditioned reinforcement-learning. GCSL is based on the principle of hindsight learning: by observing states visited…

Machine Learning · Computer Science 2023-05-18 Tom Jurgenson , Aviv Tamar

Multi-behavior Self-supervised Learning for Recommendation

Modern recommender systems often deal with a variety of user interactions, e.g., click, forward, purchase, etc., which requires the underlying recommender engines to fully understand and leverage multi-behavior data from users. Despite…

Information Retrieval · Computer Science 2023-05-30 Jingcao Xu , Chaokun Wang , Cheng Wu , Yang Song , Kai Zheng , Xiaowei Wang , Changping Wang , Guorui Zhou , Kun Gai

Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach

Machine learning problems with multiple objective functions appear either in learning with multiple criteria where learning has to make a trade-off between multiple performance metrics such as fairness, safety and accuracy; or, in…

Machine Learning · Computer Science 2024-03-20 Heshan Fernando , Han Shen , Miao Liu , Subhajit Chaudhury , Keerthiram Murugesan , Tianyi Chen

Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer

Securing long-term success is the ultimate aim of recommender systems, demanding strategies capable of foreseeing and shaping the impact of decisions on future user satisfaction. Current recommendation strategies grapple with two…

Information Retrieval · Computer Science 2025-01-14 Chongming Gao , Kexin Huang , Ziang Fei , Jiaju Chen , Jiawei Chen , Jianshan Sun , Shuchang Liu , Qingpeng Cai , Peng Jiang

Goal-Conditioned Supervised Learning for LLM Fine-Tuning

Large language models often require fine-tuning to better align their behavior with user intent at deployment. Existing approaches are commonly divided into online and offline paradigms. Online methods, such as RL-based alignment, can…

Machine Learning · Computer Science 2026-05-19 Shijun Li , Kaiwen Dong , Xiang Gao , Joydeep Ghosh

Offline Goal-Conditioned Reinforcement Learning for Safety-Critical Tasks with Recovery Policy

Offline goal-conditioned reinforcement learning (GCRL) aims at solving goal-reaching tasks with sparse rewards from an offline dataset. While prior work has demonstrated various approaches for agents to learn near-optimal policies, these…

Robotics · Computer Science 2024-03-05 Chenyang Cao , Zichen Yan , Renhao Lu , Junbo Tan , Xueqian Wang

Multi-Objective Recommendations: A Tutorial

Recommender systems (RecSys) have been well developed to assist user decision making. Traditional RecSys usually optimize a single objective (e.g., rating prediction errors or ranking quality) in the model. There is an emerging demand in…

Information Retrieval · Computer Science 2023-06-13 Yong Zheng , David , Wang

MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned Weighted Supervised Learning

Recently, a state-of-the-art family of algorithms, known as Goal-Conditioned Weighted Supervised Learning (GCWSL) methods, has been introduced to tackle challenges in offline goal-conditioned reinforcement learning (RL). GCWSL optimizes a…

Machine Learning · Computer Science 2024-12-23 Xing Lei , Xuetao Zhang , Donglin Wang

Unifying Goal-Conditioned RL and Unsupervised Skill Learning via Control-Maximization

Unsupervised pretraining has driven empirical advances in goal-conditioned reinforcement learning (GCRL), but its theoretical foundations remain poorly understood. In particular, an influential class of methods, mutual information skill…

Machine Learning · Computer Science 2026-05-08 Alireza Modirshanechi , Benjamin Eysenbach , Peter Dayan , Eric Schulz

Q-WSL: Optimizing Goal-Conditioned RL with Weighted Supervised Learning via Dynamic Programming

A novel class of advanced algorithms, termed Goal-Conditioned Weighted Supervised Learning (GCWSL), has recently emerged to tackle the challenges posed by sparse rewards in goal-conditioned reinforcement learning (RL). GCWSL consistently…

Machine Learning · Computer Science 2025-06-10 Xing Lei , Xuetao Zhang , Zifeng Zhuang , Donglin Wang

Inverse Multiobjective Optimization Through Online Learning

We study the problem of learning the objective functions or constraints of a multiobjective decision making model, based on a set of sequentially arrived decisions. In particular, these decisions might not be exact and possibly carry…

Machine Learning · Computer Science 2022-12-27 Chaosheng Dong , Yijia Wang , Bo Zeng

Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement Learning

In goal-conditioned reinforcement learning (GCRL), sparse rewards present significant challenges, often obstructing efficient learning. Although multi-step GCRL can boost this efficiency, it can also lead to off-policy biases in target…

Machine Learning · Computer Science 2023-11-30 Lisheng Wu , Ke Chen

Multi-Objective Recommender Systems: Survey and Challenges

Recommender systems can be characterized as software solutions that provide users convenient access to relevant content. Traditionally, recommender systems research predominantly focuses on developing machine learning algorithms that aim to…

Information Retrieval · Computer Science 2022-10-20 Dietmar Jannach

Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability

Goal-conditioned reinforcement learning (GCRL) refers to learning general-purpose skills that aim to reach diverse goals. In particular, offline GCRL only requires purely pre-collected datasets to perform training tasks without additional…

Machine Learning · Computer Science 2023-10-13 Hanlin Zhu , Amy Zhang

Multiobjectivization of Local Search: Single-Objective Optimization Benefits From Multi-Objective Gradient Descent

Multimodality is one of the biggest difficulties for optimization as local optima are often preventing algorithms from making progress. This does not only challenge local strategies that can get stuck. It also hinders meta-heuristics like…

Neural and Evolutionary Computing · Computer Science 2020-10-05 Vera Steinhoff , Pascal Kerschke , Pelin Aspar , Heike Trautmann , Christian Grimme

Objective Soups: Multilingual Multi-Task Modeling for Speech Processing

Training a single model for multilingual, multi-task speech processing (MSP) is severely hampered by conflicting objectives between tasks like speech recognition and translation. While multi-objective optimization (MOO) aims to align…

Audio and Speech Processing · Electrical Eng. & Systems 2025-08-14 A F M Saif , Lisha Chen , Xiaodong Cui , Songtao Lu , Brian Kingsbury , Tianyi Chen

Mixed Supervised Graph Contrastive Learning for Recommendation

Recommender systems (RecSys) play a vital role in online platforms, offering users personalized suggestions amidst vast information. Graph contrastive learning aims to learn from high-order collaborative filtering signals with unsupervised…

Information Retrieval · Computer Science 2024-04-29 Weizhi Zhang , Liangwei Yang , Zihe Song , Henry Peng Zou , Ke Xu , Yuanjie Zhu , Philip S. Yu