Related papers: GRASP: Gradient Realignment via Active Shared Perc…

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

A fundamental challenge in multiagent reinforcement learning is to learn beneficial behaviors in a shared environment with other simultaneously learning agents. In particular, each agent perceives the environment as effectively…

Machine Learning · Computer Science 2021-06-15 Dong-Ki Kim , Miao Liu , Matthew Riemer , Chuangchuang Sun , Marwa Abdulhai , Golnaz Habibi , Sebastian Lopez-Cot , Gerald Tesauro , Jonathan P. How

Parallel Stochastic Gradient-Based Planning for World Models

World models simulate environment dynamics from raw sensory inputs like video. However, using them for planning can be challenging due to the vast and unstructured search space. We propose a robust and highly parallelizable planner that…

Machine Learning · Computer Science 2026-02-03 Michael Psenka , Michael Rabbat , Aditi Krishnapriyan , Yann LeCun , Amir Bar

Learning Generalizable Risk-Sensitive Policies to Coordinate in Decentralized Multi-Agent General-Sum Games

While various multi-agent reinforcement learning methods have been proposed in cooperative settings, few works investigate how self-interested learning agents achieve mutual coordination in decentralized general-sum games and generalize…

Multiagent Systems · Computer Science 2023-01-05 Ziyi Liu , Xian Guo , Yongchun Fang

GauDP: Reinventing Multi-Agent Collaboration through Gaussian-Image Synergy in Diffusion Policies

Recently, effective coordination in embodied multi-agent systems has remained a fundamental challenge, particularly in scenarios where agents must balance individual perspectives with global environmental awareness. Existing approaches…

Robotics · Computer Science 2025-11-04 Ziye Wang , Li Kang , Yiran Qin , Jiahua Ma , Zhanglin Peng , Lei Bai , Ruimao Zhang

Gradient-Aware Model-based Policy Search

Traditional model-based reinforcement learning approaches learn a model of the environment dynamics without explicitly considering how it will be used by the agent. In the presence of misspecified model classes, this can lead to poor…

Machine Learning · Computer Science 2020-10-20 Pierluca D'Oro , Alberto Maria Metelli , Andrea Tirinzoni , Matteo Papini , Marcello Restelli

Convergence of Multi-Agent Learning with a Finite Step Size in General-Sum Games

Learning in a multi-agent system is challenging because agents are simultaneously learning and the environment is not stationary, undermining convergence guarantees. To address this challenge, this paper presents a new gradient-based…

Multiagent Systems · Computer Science 2019-03-08 Xinliang Song , Tonghan Wang , Chongjie Zhang

GRASP: Deterministic argument ranking in interaction graphs

Large language models are increasingly deployed as automated judges to evaluate the strength of arguments. As this role expands, their legitimacy depends on consistency, transparency, and the ability to separate argumentative structure from…

Machine Learning · Computer Science 2026-05-20 Diganta Misra , Antonio Orvieto , Rediet Abebe , Volkan Cevher

Convergence Analysis of Continuous-Time Distributed Stochastic Gradient Algorithms

In this paper, we propose a new framework to study distributed optimization problems with stochastic gradients by employing a multi-agent system with continuous-time dynamics. Here the goal of the agents is to cooperatively minimize the sum…

Systems and Control · Electrical Eng. & Systems 2026-02-10 Jianhua Sun , Kaihong Lu , Xin Yu

Gradient Shaping Beyond Clipping: A Functional Perspective on Update Magnitude Control

Gradient clipping is widely used to stabilize deep network training, but its formulation as a hard, fixed threshold limits flexibility and ignores gradient distribution dynamics. We propose SPAMP (Statistical Per-layer Adaptive Modulation…

Machine Learning · Computer Science 2025-10-03 Haochen You , Baojing Liu

GradMAP: Gradient-Based Multi-Agent Proximal Learning for Grid-Edge Flexibility

Coordinating large populations of grid-edge devices requires learning methods that remain fully decentralised in deployment while still respecting three-phase AC distribution-network physics. This paper proposes gradient-based multi-agent…

Machine Learning · Computer Science 2026-04-28 Yihong Zhou , Hongtai Zeng , Thomas Morstyn

GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning

Spatial reasoning, an important faculty of human cognition with many practical applications, is one of the core commonsense skills that is not purely language-based and, for satisfying (as opposed to optimal) solutions, requires some…

Artificial Intelligence · Computer Science 2025-01-20 Zhisheng Tang , Mayank Kejriwal

GRASP: Gated Regression-Aware Skill Proposer for Self-Improving LLM Agents

LLM agents acting in structured environments fail in operational rather than conversational ways, and reliability depends on procedural knowledge of the environment. Prior self-improvement methods accumulate natural-language guidance…

Artificial Intelligence · Computer Science 2026-05-29 Johannes Moll , Jean-Philippe Corbeil , Jiazhen Pan , Martin Hadamitzky , Daniel Rueckert , Lisa Adams , Keno Bressem

Fair Cooperation in Mixed-Motive Games via Conflict-Aware Gradient Adjustment

Multi-agent reinforcement learning in mixed-motive settings presents a fundamental challenge: agents must balance individual interests with collective goals, which are neither fully aligned nor strictly opposed. To address this, reward…

Multiagent Systems · Computer Science 2025-08-26 Woojun Kim , Katia Sycara

A Distributed Active Perception Strategy for Source Seeking and Level Curve Tracking

Algorithms for multi-agent systems to locate a source or to follow a desired level curve of spatially distributed scalar fields generally require sharing field measurements among the agents for gradient estimation. Yet, in this paper, we…

Systems and Control · Electrical Eng. & Systems 2021-04-08 Said Al-Abri , Fumin Zhang

Towards Fast Rates for Federated and Multi-Task Reinforcement Learning

We consider a setting involving $N$ agents, where each agent interacts with an environment modeled as a Markov Decision Process (MDP). The agents' MDPs differ in their reward functions, capturing heterogeneous objectives/tasks. The…

Machine Learning · Computer Science 2024-09-10 Feng Zhu , Robert W. Heath , Aritra Mitra

Decentralized Federated Learning with Gradient Tracking over Time-Varying Directed Networks

We investigate the problem of agent-to-agent interaction in decentralized (federated) learning over time-varying directed graphs, and, in doing so, propose a consensus-based algorithm called DSGTm-TV. The proposed algorithm incorporates…

Optimization and Control · Mathematics 2024-09-27 Duong Thuy Anh Nguyen , Su Wang , Duong Tung Nguyen , Angelia Nedich , H. Vincent Poor

Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning

Federated reinforcement learning (RL) enables collaborative decision making of multiple distributed agents without sharing local data trajectories. In this work, we consider a multi-task setting, in which each agent has its own private…

Machine Learning · Computer Science 2024-08-19 Tong Yang , Shicong Cen , Yuting Wei , Yuxin Chen , Yuejie Chi

GRASP: GRouped Activation Shared Parameterization for Parameter-Efficient Fine-Tuning and Robust Inference of Transformers

Parameter-efficient fine-tuning (PEFT) provides a scalable alternative to full-model adaptation by updating only a small subset of parameters in large pre-trained models. We introduce GRASP - GRouped Activation Shared Parameterization - a…

Machine Learning · Computer Science 2026-01-01 Malyaban Bal , Abhronil Sengupta

Towards Performatively Stable Equilibria in Decision-Dependent Games for Arbitrary Data Distribution Maps

In decision-dependent games, multiple players optimize their decisions under a data distribution that shifts with their joint actions, creating complex dynamics in applications like market pricing. A practical consequence of these dynamics…

Computer Science and Game Theory · Computer Science 2025-09-04 Guangzheng Zhong , Yang Liu , Jiming Liu

Grasp Stability Assessment Through Attention-Guided Cross-Modality Fusion and Transfer Learning

Extensive research has been conducted on assessing grasp stability, a crucial prerequisite for achieving optimal grasping strategies, including the minimum force grasping policy. However, existing works employ basic feature-level fusion…

Robotics · Computer Science 2023-08-03 Zhuangzhuang Zhang , Zhenning Zhou , Haili Wang , Zhinan Zhang , Huang Huang , Qixin Cao