Ruochen Zhou — Scifaro

False Reality: Uncovering Sensor-induced Human-VR Interaction Vulnerability

Virtual Reality (VR) techniques, serving as the bridge between the real and virtual worlds, have boomed and are widely used in manufacturing, remote healthcare, gaming, etc. Specifically, VR systems offer users immersive experiences that…

Cryptography and Security · Computer Science 2026-05-26 Yancheng Jiang , Yan Jiang , Ruochen Zhou , Yi-Chao Chen , Xiaoyu Ji , Wenyuan Xu

SkillCraft: Can LLM Agents Learn to Use Tools Skillfully?

Real-world tool-using agents operate over long-horizon workflows with recurring structure and diverse demands, where effective behavior requires not only invoking atomic tools but also abstracting, and reusing higher-level tool…

Computation and Language · Computer Science 2026-03-11 Shiqi Chen , Jingze Gai , Ruochen Zhou , Jinghan Zhang , Tongyao Zhu , Junlong Li , Kangrui Wang , Zihan Wang , Zhengyu Chen , Klara Kaleb , Ning Miao , Siyang Gao , Cong Lu , Manling Li , Junxian He , Yee Whye Teh

Reinforcement Learning for Tool-Integrated Interleaved Thinking towards Cross-Domain Generalization

Recent advances in large language models (LLMs) have demonstrated remarkable capabilities in reasoning and tool utilization. However, the generalization of tool-augmented reinforcement learning (RL) across diverse domains remains a…

Machine Learning · Computer Science 2026-01-08 Zhengyu Chen , Jinluan Yang , Teng Xiao , Ruochen Zhou , Luan Zhang , Xiangyu Xi , Xiaowei Shi , Wei Wang , Jinggang Wang

Phantom Menace: Exploring and Enhancing the Robustness of VLA Models Against Physical Sensor Attacks

Vision-Language-Action (VLA) models revolutionize robotic systems by enabling end-to-end perception-to-action pipelines that integrate multiple sensory modalities, such as visual signals processed by cameras and auditory signals captured by…

Robotics · Computer Science 2025-12-22 Xuancun Lu , Jiaxiang Chen , Shilin Xiao , Zizhi Jin , Zhangrui Chen , Hanwen Yu , Bohan Qian , Ruochen Zhou , Xiaoyu Ji , Wenyuan Xu

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

Large Vision Language Models (VLMs) have long struggled with spatial reasoning tasks. Surprisingly, even simple spatial reasoning tasks, such as recognizing "under" or "behind" relationships between only two objects, pose significant…

Computation and Language · Computer Science 2025-10-14 Shiqi Chen , Tongyao Zhu , Ruochen Zhou , Jinghan Zhang , Siyang Gao , Juan Carlos Niebles , Mor Geva , Junxian He , Jiajun Wu , Manling Li

Does Learning Mathematical Problem-Solving Generalize to Broader Reasoning?

There has been a growing interest in enhancing the mathematical problem-solving (MPS) capabilities of large language models. While the majority of research efforts concentrate on creating specialized models to solve mathematical problems,…

Computation and Language · Computer Science 2025-07-08 Ruochen Zhou , Minrui Xu , Shiqi Chen , Junteng Liu , Yunqi Li , Xinxin Lin , Zhengyu Chen , Junxian He

From Mathematical Reasoning to Code: Generalization of Process Reward Models in Test-Time Scaling

Recent advancements in improving the reasoning capabilities of Large Language Models have underscored the efficacy of Process Reward Models (PRMs) in addressing intermediate errors through structured feedback mechanisms. This study analyzes…

Computation and Language · Computer Science 2025-06-03 Zhengyu Chen , Yudong Wang , Teng Xiao , Ruochen Zhou , Xuesheng Yang , Wei Wang , Zhifang Sui , Jingang Wang

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Large Reasoning Models (LRMs) have shown remarkable capabilities in solving complex problems through reinforcement learning (RL), particularly by generating long reasoning traces. However, these extended outputs often exhibit substantial…

Computation and Language · Computer Science 2025-05-22 Wei Liu , Ruochen Zhou , Yiyun Deng , Yuzhen Huang , Junteng Liu , Yuntian Deng , Yizhe Zhang , Junxian He

Image-of-Thought Prompting for Visual Reasoning Refinement in Multimodal Large Language Models

Recent advancements in Chain-of-Thought (CoT) and related rationale-based works have significantly improved the performance of Large Language Models (LLMs) in complex reasoning tasks. With the evolution of Multimodal Large Language Models…

Artificial Intelligence · Computer Science 2024-05-30 Qiji Zhou , Ruochen Zhou , Zike Hu , Panzhong Lu , Siyang Gao , Yue Zhang