English
Related papers

Related papers: GRASP: Gradient Realignment via Active Shared Perc…

200 papers

A fundamental challenge in multiagent reinforcement learning is to learn beneficial behaviors in a shared environment with other simultaneously learning agents. In particular, each agent perceives the environment as effectively…

World models simulate environment dynamics from raw sensory inputs like video. However, using them for planning can be challenging due to the vast and unstructured search space. We propose a robust and highly parallelizable planner that…

Machine Learning · Computer Science 2026-02-03 Michael Psenka , Michael Rabbat , Aditi Krishnapriyan , Yann LeCun , Amir Bar

While various multi-agent reinforcement learning methods have been proposed in cooperative settings, few works investigate how self-interested learning agents achieve mutual coordination in decentralized general-sum games and generalize…

Multiagent Systems · Computer Science 2023-01-05 Ziyi Liu , Xian Guo , Yongchun Fang

Recently, effective coordination in embodied multi-agent systems has remained a fundamental challenge, particularly in scenarios where agents must balance individual perspectives with global environmental awareness. Existing approaches…

Robotics · Computer Science 2025-11-04 Ziye Wang , Li Kang , Yiran Qin , Jiahua Ma , Zhanglin Peng , Lei Bai , Ruimao Zhang

Traditional model-based reinforcement learning approaches learn a model of the environment dynamics without explicitly considering how it will be used by the agent. In the presence of misspecified model classes, this can lead to poor…

Machine Learning · Computer Science 2020-10-20 Pierluca D'Oro , Alberto Maria Metelli , Andrea Tirinzoni , Matteo Papini , Marcello Restelli

Learning in a multi-agent system is challenging because agents are simultaneously learning and the environment is not stationary, undermining convergence guarantees. To address this challenge, this paper presents a new gradient-based…

Multiagent Systems · Computer Science 2019-03-08 Xinliang Song , Tonghan Wang , Chongjie Zhang

Large language models are increasingly deployed as automated judges to evaluate the strength of arguments. As this role expands, their legitimacy depends on consistency, transparency, and the ability to separate argumentative structure from…

Machine Learning · Computer Science 2026-05-20 Diganta Misra , Antonio Orvieto , Rediet Abebe , Volkan Cevher

In this paper, we propose a new framework to study distributed optimization problems with stochastic gradients by employing a multi-agent system with continuous-time dynamics. Here the goal of the agents is to cooperatively minimize the sum…

Systems and Control · Electrical Eng. & Systems 2026-02-10 Jianhua Sun , Kaihong Lu , Xin Yu

Gradient clipping is widely used to stabilize deep network training, but its formulation as a hard, fixed threshold limits flexibility and ignores gradient distribution dynamics. We propose SPAMP (Statistical Per-layer Adaptive Modulation…

Machine Learning · Computer Science 2025-10-03 Haochen You , Baojing Liu

Coordinating large populations of grid-edge devices requires learning methods that remain fully decentralised in deployment while still respecting three-phase AC distribution-network physics. This paper proposes gradient-based multi-agent…

Machine Learning · Computer Science 2026-04-28 Yihong Zhou , Hongtai Zeng , Thomas Morstyn

Spatial reasoning, an important faculty of human cognition with many practical applications, is one of the core commonsense skills that is not purely language-based and, for satisfying (as opposed to optimal) solutions, requires some…

Artificial Intelligence · Computer Science 2025-01-20 Zhisheng Tang , Mayank Kejriwal

LLM agents acting in structured environments fail in operational rather than conversational ways, and reliability depends on procedural knowledge of the environment. Prior self-improvement methods accumulate natural-language guidance…

Artificial Intelligence · Computer Science 2026-05-29 Johannes Moll , Jean-Philippe Corbeil , Jiazhen Pan , Martin Hadamitzky , Daniel Rueckert , Lisa Adams , Keno Bressem

Multi-agent reinforcement learning in mixed-motive settings presents a fundamental challenge: agents must balance individual interests with collective goals, which are neither fully aligned nor strictly opposed. To address this, reward…

Multiagent Systems · Computer Science 2025-08-26 Woojun Kim , Katia Sycara

Algorithms for multi-agent systems to locate a source or to follow a desired level curve of spatially distributed scalar fields generally require sharing field measurements among the agents for gradient estimation. Yet, in this paper, we…

Systems and Control · Electrical Eng. & Systems 2021-04-08 Said Al-Abri , Fumin Zhang

We consider a setting involving $N$ agents, where each agent interacts with an environment modeled as a Markov Decision Process (MDP). The agents' MDPs differ in their reward functions, capturing heterogeneous objectives/tasks. The…

Machine Learning · Computer Science 2024-09-10 Feng Zhu , Robert W. Heath , Aritra Mitra

We investigate the problem of agent-to-agent interaction in decentralized (federated) learning over time-varying directed graphs, and, in doing so, propose a consensus-based algorithm called DSGTm-TV. The proposed algorithm incorporates…

Optimization and Control · Mathematics 2024-09-27 Duong Thuy Anh Nguyen , Su Wang , Duong Tung Nguyen , Angelia Nedich , H. Vincent Poor

Federated reinforcement learning (RL) enables collaborative decision making of multiple distributed agents without sharing local data trajectories. In this work, we consider a multi-task setting, in which each agent has its own private…

Machine Learning · Computer Science 2024-08-19 Tong Yang , Shicong Cen , Yuting Wei , Yuxin Chen , Yuejie Chi

Parameter-efficient fine-tuning (PEFT) provides a scalable alternative to full-model adaptation by updating only a small subset of parameters in large pre-trained models. We introduce GRASP - GRouped Activation Shared Parameterization - a…

Machine Learning · Computer Science 2026-01-01 Malyaban Bal , Abhronil Sengupta

In decision-dependent games, multiple players optimize their decisions under a data distribution that shifts with their joint actions, creating complex dynamics in applications like market pricing. A practical consequence of these dynamics…

Computer Science and Game Theory · Computer Science 2025-09-04 Guangzheng Zhong , Yang Liu , Jiming Liu

Extensive research has been conducted on assessing grasp stability, a crucial prerequisite for achieving optimal grasping strategies, including the minimum force grasping policy. However, existing works employ basic feature-level fusion…

Robotics · Computer Science 2023-08-03 Zhuangzhuang Zhang , Zhenning Zhou , Haili Wang , Zhinan Zhang , Huang Huang , Qixin Cao
‹ Prev 1 2 3 10 Next ›