Hongbin Lin — Scifaro

Matryoshka Concept Bottleneck Models

Concept Bottleneck Models (CBMs) have emerged as a prominent paradigm for interpretable deep learning, learning by grounding predictions in human-understandable concepts. However, their practical deployment is hindered by the high cost of…

Machine Learning · Computer Science 2026-05-29 Ziye Chen , Hongbin Lin , Jie Li , Lijie Hu

Benchmarking and Mitigating Sycophancy in Medical Vision Language Models

Visual language models (VLMs) have the potential to transform medical workflows. However, the deployment is limited by sycophancy. Despite this serious threat to patient safety, a systematic benchmark remains lacking. This paper addresses…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Juangui Xu , Zikun Guo , Jingwei Lv , Hongbin Lin , Shu Yang , Jun Wen , Di Wang , Lijie Hu

AR1-ZO: Topology-Aware Rank-1 Zeroth-Order Queries for High-Rank LoRA Fine-Tuning

Zeroth-order (ZO) optimization enables large-language-model fine-tuning without storing backpropagation activations, while LoRA supplies compact trainable adapters. Combining them creates a rank paradox: increasing LoRA rank improves…

Machine Learning · Computer Science 2026-05-20 Ziye Chen , Hongbin Lin , Chenyu Zhang , Xiangda Yan , Yongjie Yang , Yao Shu

NoiseRater: Meta-Learned Noise Valuation for Diffusion Model Training

Diffusion models have achieved remarkable success across a wide range of generative tasks, yet their training paradigm largely treats injected noise as uniformly informative. In this work, we challenge this assumption and introduce…

Machine Learning · Computer Science 2026-05-12 Fang Wu , Haokai Zhao , Da Xing , Hanqun Cao , Tinson Xu , Yanchao Li , Xiangru Tang , Zehong Wang , Aaron Tu , Kuan Pang , Hanchen Wang , Hongbin Lin , Zeqi Zhou , Yinxi Li , Peng Xia , Li Erran Li , Molei Tao , Jure Leskovec , Aditya Joshi , Yejin Choi

Reference-Sampled Boltzmann Projection for KL-Regularized RLVR: Target-Matched Weighted SFT, Finite One-Shot Gaps, and Policy Mirror Descent

Online reinforcement learning with verifiable rewards (RLVR) turns checkable outcomes into a scalable training signal, but it keeps rollout generation, verifier scoring, and reference-policy evaluations on the optimization path. Static…

Machine Learning · Computer Science 2026-05-05 Yao Shu , Chenxing Wei , Hongbin Lin , Shuang Qiu , Hui Xiong

X-World: Controllable Ego-Centric Multi-Camera World Models for Scalable End-to-End Driving

Scalable and reliable evaluation is increasingly critical in the end-to-end era of autonomous driving, where vision--language--action (VLA) policies directly map raw sensor streams to driving actions. Yet, current evaluation pipelines still…

Computer Vision and Pattern Recognition · Computer Science 2026-04-01 Chaoda Zheng , Sean Li , Jinhao Deng , Zhennan Wang , Shijia Chen , Liqiang Xiao , Ziheng Chi , Hongbin Lin , Kangjie Chen , Boyang Wang , Yu Zhang , Xianming Liu

Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities

Despite strong performance on existing benchmarks, it remains unclear whether large language models can reason over genuinely novel scientific information. Most evaluations score end-to-end RAG pipelines, where reasoning is confounded with…

Artificial Intelligence · Computer Science 2026-02-02 Shuangshuang Ying , Zheyu Wang , Yunjian Peng , Jin Chen , Yuhao Wu , Hongbin Lin , Dingyu He , Siyi Liu , Gengchen Yu , YinZhu Piao , Yuchen Wu , Xin Gui , Zhongyuan Peng , Xin Li , Xeron Du , Libo Qin , YiXin Cao , Ge Zhang , Stephen Huang

Controllable Concept Bottleneck Models

Concept Bottleneck Models (CBMs) have garnered much attention for their ability to elucidate the prediction process through a human-understandable concept layer. However, most previous studies focused on static scenarios where the data and…

Machine Learning · Computer Science 2026-01-05 Hongbin Lin , Chenyang Ren , Juangui Xu , Zhengyu Hu , Cheng-Long Wang , Yao Shu , Hui Xiong , Jingfeng Zhang , Di Wang , Lijie Hu

FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model

In autonomous driving, end-to-end planners learn scene representations from raw sensor data and utilize them to generate a motion plan or control actions. However, exclusive reliance on the current scene for motion planning may result in…

Computer Vision and Pattern Recognition · Computer Science 2025-12-15 Hongbin Lin , Yiming Yang , Yifan Zhang , Chaoda Zheng , Jie Feng , Sheng Wang , Zhennan Wang , Shijia Chen , Boyang Wang , Yu Zhang , Xianming Liu , Shuguang Cui , Zhen Li

DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving

In autonomous driving, vision-centric 3D object detection recognizes and localizes 3D objects from RGB images. However, due to high annotation costs and diverse outdoor scenes, training data often fails to cover all possible test scenarios,…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Hongbin Lin , Yiming Yang , Chaoda Zheng , Yifan Zhang , Shuaicheng Niu , Zilu Guo , Yafeng Li , Gui Gui , Shuguang Cui , Zhen Li

FASTopoWM: Fast-Slow Lane Segment Topology Reasoning with Latent World Models

Lane segment topology reasoning provides comprehensive bird's-eye view (BEV) road scene understanding, which can serve as a key perception module in planning-oriented end-to-end autonomous driving systems. Existing lane topology reasoning…

Computer Vision and Pattern Recognition · Computer Science 2025-11-13 Yiming Yang , Hongbin Lin , Yueru Luo , Suzhong Fu , Chao Zheng , Xinrui Yan , Shuqi Mei , Kun Tang , Shuguang Cui , Zhen Li

TopoStreamer: Temporal Lane Segment Topology Reasoning in Autonomous Driving

Lane segment topology reasoning constructs a comprehensive road network by capturing the topological relationships between lane segments and their semantic types. This enables end-to-end autonomous driving systems to perform road-dependent…

Computer Vision and Pattern Recognition · Computer Science 2025-11-13 Yiming Yang , Yueru Luo , Bingkun He , Hongbin Lin , Suzhong Fu , Chao Zheng , Zhipeng Cao , Erlong Li , Chao Yan , Shuguang Cui , Zhen Li

Multi-Group Equivariant Augmentation for Reinforcement Learning in Robot Manipulation

Sampling efficiency is critical for deploying visuomotor learning in real-world robotic manipulation. While task symmetry has emerged as a promising inductive bias to improve efficiency, most prior work is limited to isometric symmetries --…

Robotics · Computer Science 2025-08-18 Hongbin Lin , Juan Rojas , Kwok Wai Samuel Au

Visuomotor Grasping with World Models for Surgical Robots

Grasping is a fundamental task in robot-assisted surgery (RAS), and automating it can reduce surgeon workload while enhancing efficiency, safety, and consistency beyond teleoperated systems. Most prior approaches rely on explicit object…

Robotics · Computer Science 2025-08-18 Hongbin Lin , Bin Li , Kwok Wai Samuel Au

Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation

3D point cloud segmentation aims to assign semantic labels to individual points in a scene for fine-grained spatial understanding. Existing methods typically adopt data augmentation to alleviate the burden of large-scale annotation.…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Hongbin Lin , Yifan Jiang , Juangui Xu , Jesse Jiaxi Xu , Yi Lu , Zhengyu Hu , Ying-Cong Chen , Hao Wang

DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation

In autonomous driving, vision-centric 3D detection aims to identify 3D objects from images. However, high data collection costs and diverse real-world scenarios limit the scale of training data. Once distribution shifts occur between…

Computer Vision and Pattern Recognition · Computer Science 2025-03-17 Hongbin Lin , Zilu Guo , Yifan Zhang , Shuaicheng Niu , Yafeng Li , Ruimao Zhang , Shuguang Cui , Zhen Li

PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models

3D Multimodal Large Language Models (MLLMs) have recently made substantial advancements. However, their potential remains untapped, primarily due to the limited quantity and suboptimal quality of 3D datasets. Current approaches attempt to…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Zilu Guo , Hongbin Lin , Zhihao Yuan , Chaoda Zheng , Pengshuo Qiu , Dongzhi Jiang , Renrui Zhang , Chun-Mei Feng , Zhen Li

Editable Concept Bottleneck Models

Concept Bottleneck Models (CBMs) have garnered much attention for their ability to elucidate the prediction process through a humanunderstandable concept layer. However, most previous studies focused on cases where the data, including…

Machine Learning · Computer Science 2025-02-04 Lijie Hu , Chenyang Ren , Zhengyu Hu , Hongbin Lin , Cheng-Long Wang , Hui Xiong , Jingfeng Zhang , Di Wang

Towards Multi-dimensional Explanation Alignment for Medical Classification

The lack of interpretability in the field of medical image analysis has significant ethical and legal implications. Existing interpretable methods in this domain encounter several challenges, including dependency on specific models,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-30 Lijie Hu , Songning Lai , Wenshuo Chen , Hongru Xiao , Hongbin Lin , Lu Yu , Jingfeng Zhang , Di Wang

Fully Test-Time Adaptation for Monocular 3D Object Detection

Monocular 3D object detection (Mono 3Det) aims to identify 3D objects from a single RGB image. However, existing methods often assume training and test data follow the same distribution, which may not hold in real-world test scenarios. To…

Computer Vision and Pattern Recognition · Computer Science 2024-05-31 Hongbin Lin , Yifan Zhang , Shuaicheng Niu , Shuguang Cui , Zhen Li