English
Related papers

Related papers: PRISM: Protocol Refinement through Intelligent Sim…

200 papers

This paper presents PRISM: an instruction-conditioned refinement method for imitation policies in robotic manipulation. This approach bridges Imitation Learning (IL) and Reinforcement Learning (RL) frameworks into a seamless pipeline, such…

Robotics · Computer Science 2026-03-09 Arnau Boix-Granell , Alberto San-Miguel-Tello , Magí Dalmau-Moreno , Néstor García

Robotic imitation learning typically requires models that capture multimodal action distributions while operating at real-time control rates and accommodating multiple sensing modalities. Although recent generative approaches such as…

Robotics · Computer Science 2026-02-03 Amisha Bhaskar , Pratap Tokekar , Stefano Di Cairano , Alexander Schperberg

Deploying large language model (LLM)-driven conversational agents in enterprise settings requires prompts that are simultaneously correct at launch and resilient to the non-deterministic behavioral drift that characterizes production LLM…

Artificial Intelligence · Computer Science 2026-05-18 Keshava Chaitanya , Jahnavi Gundakaram

Generative sequence modeling faces a fundamental tension between the expressivity of Transformers and the efficiency of linear sequence models. Existing efficient architectures are theoretically bounded by shallow, single-step linear…

Machine Learning · Computer Science 2026-02-13 Jie Jiang , Ke Cheng , Xin Xu , Mengyang Pang , Tianhao Lu , Jiaheng Li , Yue Liu , Yuan Wang , Jun Zhang , Huan Yu , Zhouchen Lin

Simulation plays a central role in scientific discovery. In many applications, the bottleneck is no longer running a simulator; it is choosing among large families of plausible simulators, each corresponding to different forward…

We introduce PRISM (Predictive Reasoning in Sequential Medicine), a transformer-based architecture designed to model the sequential progression of clinical decision-making processes. Unlike traditional approaches that rely on isolated…

Computation and Language · Computer Science 2025-06-16 Lionel Levine , John Santerre , Alex S. Young , T. Barry Levine , Francis Campion , Majid Sarrafzadeh

Prompt engineering is an effective but labor-intensive way to control text-to-image (T2I) generative models. Its time-intensive nature and complexity have spurred the development of algorithms for automated prompt generation. However, these…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Yutong He , Alexander Robey , Naoki Murata , Yiding Jiang , Joshua Nathaniel Williams , George J. Pappas , Hamed Hassani , Yuki Mitsufuji , Ruslan Salakhutdinov , J. Zico Kolter

With the rapid progress of large language models (LLMs), financial information retrieval has become a critical industrial application. Extracting task-relevant information from lengthy financial filings is essential for both operational and…

Artificial Intelligence · Computer Science 2026-04-07 Chun Chet Ng , Jia Yu Lim , Wei Zeng Low

We propose PRISM, a novel framework designed to overcome the limitations of 2D-based Preference-Based Reinforcement Learning (PBRL) by unifying 3D point cloud modeling and future-aware preference refinement. At its core, PRISM adopts a 3D…

Computation and Language · Computer Science 2025-03-20 Yirong Sun , Yanjun Chen

Large language models (LLMs) provide robots with powerful contextual reasoning abilities and a natural human interface. Yet, current LLM-enabled robots typically depend on cloud-hosted models, limiting their usability in environments with…

Large Language Models (LLMs), constrained by their auto-regressive nature, suffer from slow decoding. Speculative decoding methods have emerged as a promising solution to accelerate LLM decoding, attracting attention from both systems and…

Artificial Intelligence · Computer Science 2026-02-03 Xuliang Wang , Yuetao Chen , Maochan Zhen , Fang Liu , Xinzhou Zheng , Xingwu Liu , Hong Xu , Ming Li

Background: Increasingly, decision-making in healthcare relies on computer models, be it clinical prediction models at point of care or decision-analytic models at the policymaking level. Given the important role models play in both…

Computation · Statistics 2022-02-22 Amin Adibi , Stephanie Harvard , Mohsen Sadatsafavi

Scaling LLM-based embodied agents from text-only environments to complex multimodal settings remains a major challenge. Recent work identifies a perception-reasoning-decision gap in standalone Vision-Language Models (VLMs), which often…

Artificial Intelligence · Computer Science 2026-05-08 Mohamed Salim Aissi , Clemence Grislain , Clement Romac , Laure Soulier , Mohamed Chetouani , Olivier Sigaud , Nicolas Thome

Phone recognition (PR) serves as the atomic interface for language-agnostic modeling for cross-lingual speech processing and phonetic analysis. Despite prolonged efforts in developing PR systems, current evaluations only measure…

In this paper, we present PRISM, a Promptable and Robust Interactive Segmentation Model, aiming for precise segmentation of 3D medical images. PRISM accepts various visual inputs, including points, boxes, and scribbles as sparse prompts, as…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Hao Li , Han Liu , Dewei Hu , Jiacheng Wang , Ipek Oguz

Learning from few demonstrations to develop policies robust to variations in robot initial positions and object poses is a problem of significant practical interest in robotics. Compared to imitation learning, which often struggles to…

Robotics · Computer Science 2025-04-30 Haowen Sun , Han Wang , Chengzhong Ma , Shaolong Zhang , Jiawei Ye , Xingyu Chen , Xuguang Lan

We present PRISM, a unified framework that enables multiple image generation and editing tasks in a single foundational model. Starting from a pre-trained text-to-image diffusion model, PRISM proposes an effective fine-tuning strategy to…

Graphics · Computer Science 2025-05-15 Alara Dirik , Tuanfeng Wang , Duygu Ceylan , Stefanos Zafeiriou , Anna Frühstück

Current techniques for post-training Large Language Models (LLMs) rely either on costly human supervision or on external verifiers to boost performance on tasks such as mathematical reasoning and code generation. However, as LLMs improve…

Computation and Language · Computer Science 2026-01-21 Mukesh Ghimire , Aosong Feng , Liwen You , Youzhi Luo , Fang Liu , Xuan Zhu

DEEPTHINK methods improve reasoning by generating, refining, and aggregating populations of candidate solutions, which enables strong performance on complex mathematical and scientific tasks. However, existing frameworks often lack reliable…

Artificial Intelligence · Computer Science 2026-03-04 Rituraj Sharma , Weiyuan Chen , Noah Provenzano , Tu Vu

We introduce PRISM, a method for real-time filtering in a probabilistic generative model of agent motion and visual perception. Previous approaches either lack uncertainty estimates for the map and agent state, do not run in real-time, do…

Machine Learning · Computer Science 2022-12-07 Atanas Mirchev , Baris Kayalibay , Ahmed Agha , Patrick van der Smagt , Daniel Cremers , Justin Bayer
‹ Prev 1 2 3 10 Next ›