Computer Science

Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization

While Multi-Agent Systems (MAS) empower Large Language Models to tackle complex reasoning tasks through collaborative interaction, optimizing their dynamics remains a formidable challenge due to the discrete, non-differentiable nature of…

Multiagent Systems · Computer Science 2026-05-29 Wenwu Li , Yuran Song , Mingze Zhao , Bo Jin , Wenhao Li

BORA: Bridging Offline Reinforcement Learning and Online Residual Adaptation for Real-World Dexterous VLA Models

Vision-Language-Action (VLA) models have emerged as a promising paradigm for grounding visual-language understanding into real-world robotic manipulation. However, dexterous manipulation remains challenging for VLA policies due to…

Robotics · Computer Science 2026-05-29 Zhongxi Chen , Yifan Han , Yanming Shao , Huanming Liu , Congsheng Xu , Xiaoyu Chen , Yao Mu , Wenzhao Lian

ExDBSCAN: Explaining DBSCAN with Counterfactual Reasoning -- Additional Material

Clustering is an unsupervised technique for grouping data points by similarity. While explainability methods exist for supervised machine learning, they are not directly applicable to clustering, making it challenging to understand cluster…

Machine Learning · Computer Science 2026-05-29 Pernille Matthews , Lena Krieger , Tommaso Amico , Artur Zimek , Thomas Seidl , Ira Assent

TriSearch: Learning to Optimize Triangulations via Bistellar Flips

We introduce TriSearch, a reinforcement learning framework for optimizing objectives over triangulations of a polytope via bistellar flips. The key idea is a circuit-supported subtriangulation action representation: feasible flips are…

Machine Learning · Computer Science 2026-05-29 Yiran Wang , Guido Montúfar

When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

Long-horizon interactions require language models to manage accumulating information: when to update their state, when to preserve their state, and what to ignore. We study this challenge as \textbf{Contextual Belief Management (CBM)}:…

Artificial Intelligence · Computer Science 2026-05-29 Haoming Xu , Weihong Xu , Zongrui Li , Mengru Wang , Yunzhi Yao , Chiyu Wu , Jin Shang , Yu Gong , Shumin Deng

MarginGate: Sparse Margin-Triggered Verification for Batch-Invariant LLM Inference

Temperature-zero BF16 LLM inference is often treated as reproducible, yet the same request can emit different tokens when decoded alone or inside a larger batch. Existing fixes use batch-invariant operators or LLM-42's per-token…

Machine Learning · Computer Science 2026-05-29 Kexin Chu , Yang Zhou , Wei Zhang

D\'ej\`a View: Looping Transformers for Multi-View 3D Reconstruction

Recent feed-forward 3D reconstruction transformers have scaled to over a billion parameters, following the broader trend of increasing model capacity in computer vision. Yet emerging evidence suggests that contiguous transformer layers…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Alessandro Burzio , Tobias Fischer , Sven Elflein , Qunjie Zhou , Riccardo de Lutio , Jiawei Ren , Jiahui Huang , Shengyu Huang , Marc Pollefeys , Laura Leal-Taixé , Zan Gojcic , Haithem Turki

GRUFF: LLM Pronoun Fidelity, Reasoning, and Biases in German

Third-person singular pronouns have long been used to study stereotypical biases in language models and to test their abilities to reason about reference. More recently, the interplay between reasoning and bias has been investigated with…

Computation and Language · Computer Science 2026-05-29 Fabian Mewes , Anne Lauscher , Vagrant Gautam

Faithful Embeddings of Irregular and Asynchronous Data for Online Log-NCDEs

Continuous-time models are a natural choice for irregular and asynchronous data. A central design choice is how to embed discrete observations into continuous time. Interpolation- and imputation-based embeddings reconstruct a continuous…

Machine Learning · Computer Science 2026-05-29 Benjamin Walker , Alexandre Bloch , Lingyi Yang , Sam Morley , Terry Lyons

bpK#: Delegatable Pseudonyms And Their Applications to National eID Systems

Electronic identities (eIDs) are crucial in an increasingly digitalized environment. Pseudonyms, as offered by Austria's governmental sector-specific personal identifiers (bPks), can significantly improve privacy by ensuring that personal…

Cryptography and Security · Computer Science 2026-05-29 Stephan Krenn , Doryan Lesaignoux , Sebastian Ramacher

Cycle Consistency in Video Object-Centric Learning

Self-supervised video Object-Centric Learning (OCL) aims to discover distinct objects and associate them across time, whereas self-supervised Multi-Object Tracking (MOT) focuses on associating pre-defined object detections or segmentations.…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Rongzhen Zhao , Zhiyuan Li , Ruonan Wei , Juho Kannala , Joni Pajarinen

Automating Low-Risk Code Review at Meta: RADAR, Risk Calibration, and Review Efficiency

AI-assisted coding tools have altered software production. At Meta, significant lines of code per human-landed diff grew by 105.9% year over year and per-developer diff volume rose 51%, with agentic AI responsible for over 80% of that…

Software Engineering · Computer Science 2026-05-29 Chris Adams , Arjun Singh Banga , Parveen Bansal , Souvik Bhattacharya , Rujin Cao , Pedro Canahuati , Nate Cook , Brian Ellis , Prabhakar Goyal , Gurinder Grewal , Tianyu He , Matt Labunka , Alex Manners , David Molnar , Ging Cee Ng , Vishal Parekh , Jiefu Pei , Frederic Sagnes , James Saindon , Will Shackleton , Sid Sidhu , Gursharan Singh , Karthik Chengayan Sridhar , Matt Steiner , Pratibha Udmalpet , Sean Xia , Stacey Yan , Audris Mockus , Peter Rigby , Nachiappan Nagappan

Persona Conditioning of Brand Recommendations in Retrieval-Augmented Commercial Chat: A Prominence-Stratified Cross-Provider Audit

The same prompt -- "best CRM software" -- reaches AI assistants from buyers in widely different contexts: a solo founder, an enterprise VP, a UK SMB owner. We audit how strongly that contextual variation reshapes which brands the model…

Artificial Intelligence · Computer Science 2026-05-29 Will Jack , Noah Lehman , Keller Maloney , Sarah Xu

LexPath: A domain-oriented multi-path framework for legal article retrieval

Legal article retrieval is critical for building traceable and reliable legal AI systems, where conclusions must be grounded in specific legal articles. However, existing open-domain retrieval methods rely heavily on surface-level lexical…

Information Retrieval · Computer Science 2026-05-29 Weixuan Liu , Qingfeng Zhuge , Xuyang Chen

A Bayesian Approach to Membership Inference for Statistical Release

The membership inference problem for publicly released statistics from a private dataset is well-studied. When developing and formally analyzing attack strategies, however, the focus has been on attacks that model the population using only…

Cryptography and Security · Computer Science 2026-05-29 Lisa Oakley , Sam Stites , Cameron Moy , Steven Holtzen , Alina Oprea , Marco Gaboardi

A Dual-Path Architecture for Scaling Compute and Capacity in LLMs

Looped transformers apply a shared block multiple times and have emerged as a parameter-efficient route to scaling compute in language models. However, at fixed FLOPs a looped model has strictly less capacity than a baseline transformer. We…

Computation and Language · Computer Science 2026-05-29 Markus Frey , Behzad Shomali , Joachim Koehler , Mehdi Ali

HPO: Hysteretic Policy Optimization for Stable and Efficient Training under Sparse-Reward Regime

We investigate a narrow but common failure mode of GRPO-style reinforcement learning in the context of sparse verifiable rewards: early updates contain more responses with negative advantages than those with positive advantages, while…

Machine Learning · Computer Science 2026-05-29 Mohamed Sana , Nicola Piovesan , Antonio De Domenico , Fadhel Ayed , Haozhe Zhang

Double-Edged Sword or Sharp Tool? Designing and Evaluating Triadic LLM-Teacher Collaboration for K-12 Writing at Scale

The double-edged sword of integrating Large Language Models (LLMs) requires an effective triadic collaboration mechanism among LLMs, teachers and students, especially for K-12 education. By developing a triadic collaboration system to…

Artificial Intelligence · Computer Science 2026-05-29 Canran Wang , Yuwen Yang , Zhen Wang , Ming Ma , Ding Yu , Chentai Wang , Keman Huang , Xiaoyong Du

Active Continual Learning with Metaplastic Binary Bayesian Neural Networks

Always-on edge systems must keep learning as conditions change under tight compute budgets and must detect unreliable predictions. Bayesian binary neural networks are attractive in this setting, but mean-field Bernoulli posteriors can…

Machine Learning · Computer Science 2026-05-29 Kellian Cottart , Théo Ballet , Djohan Bonnet , Damien Querlioz

Mean-Field Diffuser: Scaling Offline MARL to Thousands of Agents

Diffusion-based planning has achieved strong results in single-agent offline reinforcement learning, yet scaling to many-agent systems remains intractable due to the curse of dimensionality in the joint trajectory space. We introduce…

Machine Learning · Computer Science 2026-05-29 Wenhao Li , Xiangfeng Wang , Bo Jin