Computer Science

Colored Noise Diffusion Sampling

Diffusion models achieve state-of-the-art image synthesis, with their generative trajectories fundamentally exhibiting a spectral bias, resolving low-frequency global structures early and high-frequency fine details later. Conventional…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Hadar Davidson, Noam Issachar, Sagie Benaim

Majorization precursors to supermodularity and subadditivity on the majorization lattice

We establish two structural majorization relations, which we call precursors, underlying the properties of supermodularity and subadditivity on the lattice induced by majorization. These are precursors in that they immediately imply that…

Information Theory · Computer Science 2026-05-29 Alexander Stévins, Michael G. Jabbour, Serge Deside, Nicolas J. Cerf

When, why, and how do diffusion posterior samplers fail? A finite-sample lens

Diffusion models have excellent capacity to model complex distributions of natural data, which has made them a popular and effective choice for posterior sampling in imaging inverse problems. Existing methods can incorporate any measurement…

Machine Learning · Computer Science 2026-05-29 Benjamin A. Burns, Sara Fridovich-Keil

SoundnessBench: Can Your AI Scientist Really Tell Good Research Ideas from Bad Ones?

Autonomous AI research agents aim to accelerate scientific discovery by automating the research pipeline, from hypothesis generation to peer review. However, existing benchmarks rarely test a fundamental bottleneck: whether Large Language…

Machine Learning · Computer Science 2026-05-29 Sy-Tuyen Ho, Minghui Liu, Huy Nghiem, Furong Huang

Supercharging Thermal Gaussian Splatting with Depth Estimation

Efficient and robust 3D scene representation is crucial in autonomous driving, robotics, and related fields. While RGB images provide valuable content for 3D reconstruction, other modalities like thermal or depth can enable additional…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Manoj Biswanath, Chenxin Cai, Hannah Schieber, Daniel Roth, Benjamin Busam

Reasoning with Sampling: Cutting at Decision Points

Frontier reasoning models are produced by posttraining base language models with reinforcement learning. Recent work has challenged this by showing that sampling from a sharpened version of the base model's distribution, a so-called power…

Machine Learning · Computer Science 2026-05-29 Felix Zhou, Anay Mehrotra, Quanquan C. Liu

RoboWits: Unexpected Challenges for Robotic Creative Problem Solving

The ability to reason, adapt, and creatively solve problems under unexpected challenges is essential for robots operating in real-world environments. However, current robotic benchmarks primarily emphasize skill-level execution and provide…

Robotics · Computer Science 2026-05-29 Chunru Lin, Hongxin Zhang, Fenghao Yu, Zhehuan Chen, Thomas L. Griffiths, Yejin Choi, David Held, Chuang Gan

Veda: Scalable Video Diffusion via Distilled Sparse Attention

Scaling Diffusion Transformers to generate high-resolution, long videos is constrained by the quadratic cost of self-attention, and existing sparse attention methods degrade under high sparsity. We show empirically that generation quality…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Shihao Han, Hao Yang, Xinting Hu, Xiaofeng Mei, Yi Jiang, Xiaojuan Qi

On Language Generation in the Limit with Bounded Memory

We study language generation in the limit under bounded memory. In this task, a learner observes examples from an unknown target language one at a time and must eventually output only new valid examples. Prior work assumes access to the…

Data Structures and Algorithms · Computer Science 2026-05-29 Jon Kleinberg, Anay Mehrotra, Amin Saberi, Grigoris Velegkas

In-Context Reward Adaptation for Robust Preference Modeling

Reinforcement Learning from Human Feedback (RLHF) typically relies on static reward models to align Large Language Models with human preferences. However, human values are inherently diverse and heterogeneous, and a single reward model…

Machine Learning · Computer Science 2026-05-29 Zhenyu Sun, Zheng Xu, Ermin Wei

Gram: Assessing sabotage propensities via automated alignment auditing

We introduce Gram, an automated alignment auditing framework to assess the propensity of AI agents to engage in sabotage. We evaluate Gemini models across 17 simulated agentic deployment scenarios that incentivize sabotage. We find Gemini…

Machine Learning · Computer Science 2026-05-29 David Lindner, Victoria Krakovna, Sebastian Farquhar

MonoPhysics: Estimating Geometry, Appearance, and Physical Parameters from Monocular Videos

Existing inverse physics methods recover physical parameters from multi-view videos, where geometric constraints across views resolve scale and 3D structure. In monocular settings, however, such constraints are absent, leading to severe…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Daniel Rho, Jun Myeong Choi, Matthew Thornton, Biswadip Dey, Roni Sengupta

Before the Shutter: Aesthetic and Actionable Portrait Photography Planning in 3D Scenes

Portrait photography is largely decided before the shutter opens: the subject's pose, the camera configuration, and the lighting devices must be coordinated within the surrounding 3D scene. In contrast, most existing computational methods…

Graphics · Computer Science 2026-05-29 Ruixiang Jiang, Chang Wen Chen

VPG: Visual Prefix Guidance for Autoregressive Image and Video Generation

Autoregressive image and video generators are trained with teacher-forced histories but must sample from their own generated prefixes at inference time, making them vulnerable to exposure bias and prefix drift. Existing remedies either…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Xinyao Liao, Qiyuan He, Yicong Li, Jiayin Zhu, Xiaoye Qu, Wei Wei, Angela Yao

Resolution Diagnostics for Paired LLM Evaluation

Across two public LLM leaderboards, many displayed pairwise rankings do not meet a conventional paired-test resolution target under the actual paired evaluation design: 11 of 40 Open LLM Leaderboard v1 pairwise comparisons and 4 of 9…

Computation and Language · Computer Science 2026-05-29 Anany Kotawala

SpecBench: Evaluating Specification-Level Reasoning for Software Engineering LLM Agents

Software engineering (SWE) agents are transitioning from code generation to full software development lifecycle automation. A critical phase in this lifecycle is specification design: transforming initial proposals into carefully considered…

Multiagent Systems · Computer Science 2026-05-29 Grant Hamblin, Kevin Song, Zhanda Zhu, Anand Jayarajan, Sihang Liu, Nandita Vijaykumar, Gennady Pekhimenko

A Heterogeneous Architecture for Robot RL Beyond GPU-Dominant Paradigms

Simulation-based RL for contemporary robot control is increasingly organized around GPU-resident simulation: physics, rollout collection, and learning are placed on a single GPU-centric execution path. This paradigm has greatly improved…

Robotics · Computer Science 2026-05-29 Yufei Jia, Zhanxiang Cao, Mingrui Yu, Heng Zhang, Shenyu Chen, Dixuan Jiang, Meng Li, Xiaofan Li, Yiyang Liu, Junzhe Wu, Zheng Li, XiLin Fang, Tingyu Cui, Shengcheng Fu, Haoyang Li, Anqi Wang, Zifan Wang, Dongjie Zhu, Chenyu Cao, Zhenbiao Huang, Ziang Zheng, Jie Lu, Xin Ma, Zhengyang Wei, Xiang Zhao, Tianyue Zhan, Ye He, Yuxiang Chen, Yizhou Jiang, Yue Li, Haizhou Ge, Yuhang Dong, Fan Jia, Ziheng Zhang, Meng Zhang, Xiwa Deng, Zhixing Chen, Hanyang Shao, Chenxin Dong, Yixuan Li, Yizhi Chen, Bokui Chen, Kaifeng Zhang, Hanqing Cui, Yusen Qin, Ruqi Huang, Lei Han, Tiancai Wang, Xiang Li, Yue Gao, Guyue Zhou

DP-SAPF: Saliency-Aware Parameter Fine-tuning of Public Models for Differentially Private Image Synthesis

Differentially private (DP) image synthesis generates images that preserve the statistical characteristics of a sensitive dataset, enabling sensitive data analysis and usage while providing rigorous guarantees on privacy leakage. Existing…

Cryptography and Security · Computer Science 2026-05-29 Chen Gong, Kecen Li, Zinan Lin, Tianhao Wang

Archon: A Unified Multimodal Model for Holistic Digital Human Generation

Digital humans are fundamental to immersive interaction, yet creating a unified model for holistic modalities, including text, audio, motion, and visual content, remains an open challenge. In this paper, we present Archon, a fully…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Chong Bao, Shichen Liu, Lijun Yu, David Futschik, Stylianos Moschoglou, Shefali Srivastava, Ziqian Bai, Feitong Tan, Guofeng Zhang, Zhaopeng Cui, Sean Fanello, Yinda Zhang

City-Mesh3R: Simulation-Ready City-Scale 3D Mesh Reconstruction from Multi-View Images

City-scale 3D surface reconstruction from multiview images for downstream 3D simulation poses highly challenging problems due to the scale and complexity of urban scenes. Existing city-scale 3D reconstruction methods based on NeRF,…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Sayan Paul, Sourav Ghosh, Siddharth Katageri, Soumyadip Maity, Sanjana Sinha, Brojeshwar Bhowmick