Lin Sun — Scifaro

Harness-Bench: Measuring Harness Effects across Models in Realistic Agent Workflows

LLM agents are increasingly deployed as executable systems that use tools, modify workspaces, and produce concrete artifacts. In such workflows, performance depends not only on the base model, but also on the harness: the system layer that…

Artificial Intelligence · Computer Science 2026-05-28 Yilun Yao , Xinyu Tan , Chao-Hsuan Liu , Yaoming Li , Zhengyang Wang , Wenhan Yu , Zhewen Tan , Yuxuan Tian , Guangxiang Zhao , Lin Sun , Xiangzheng Zhang , Tong Yang

BEAR: Budgeted Evidence Allocation for Multi-Document Reasoning

We argue that multi-document reasoning is constrained not only by how much text a model can read, but also by how limited query-time evidence budget is allocated across documents and semantic granularities. Full-context inference exposes…

Computation and Language · Computer Science 2026-05-28 Lin Sun , Linglin Zhang , Jingang Huang , Change Jia , Zhengwei Cheng , Xiangzheng Zhang

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

Large language model agents increasingly rely on persistent memory to store past interactions, retrieve relevant demonstrations, and improve long-horizon task execution. However, this memory mechanism also creates a practical security…

Artificial Intelligence · Computer Science 2026-05-25 Zhewen Tan , Yilun Yao , Huiyan Jin , Wenhan Yu , Guoan Wang , Mengyuan Fan , liang lu , Feng Liu , Xiangzheng Zhang , Duohe Ma , Tong Yang , Lin Sun

When Good OCR Is Not Enough: Benchmarking OCR Robustness for Retrieval-Augmented Generation

Industrial Retrieval-Augmented Generation (RAG) systems depend on optical character recognition (OCR) to transform visual documents into text. Existing OCR benchmarks rely on character-level metrics, which inadequately measure downstream…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Lin Sun , Wang Dexian , Jingang Huang , Linglin Zhang , Change Jia , Zhengwei Cheng , Xiangzheng Zhang

Regularizations for shock and rarefaction waves in the perturbed solitons of the KP equation

Using an asymptotic perturbation method, we study the initial value problem for the KP equation with initial data consisting of parts of exact line-soliton solutions. We consider a slow modulation of the soliton parameters, described by a…

Pattern Formation and Solitons · Physics 2026-05-05 Guangfu Han , Yuji Kodama , Chuanzhong Li , Lin Sun

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

The challenge of reducing the size of Large Language Models (LLMs) while maintaining their performance has gained significant attention. However, existing methods, such as model distillation and transfer learning, often fail to achieve high…

Computation and Language · Computer Science 2026-04-30 Lin Sun , Guangxiang Zhao , Xiaoqi Jian , Yuhan Wu , Weihong Lin , Yongfu Zhu , Qilong Shi , Change Jia , Aomufei Yuan , Yuxuan Tian , Linglin Zhang , Jinzhu Wu , Junfeng Ran , Sai-er Hu , Zihan Jiang , Junting Zhou , Wenrui Liu , Xusen Xiao , Bin Cui , Tong Yang , Xiangzheng Zhang

Thinking with Reasoning Skills: Fewer Tokens, More Accuracy

Reasoning LLMs often spend substantial tokens on long intermediate reasoning traces (e.g., chain-of-thought) when solving new problems. We propose to summarize and store reusable reasoning skills distilled from extensive deliberation and…

Artificial Intelligence · Computer Science 2026-04-28 Guangxiang Zhao , Qilong Shi , Xusen Xiao , Xiangzheng Zhang , Tong Yang , Lin Sun

Petabit-per-second Random Number Generation

Physical random number generators based on chaotic microcombs, with their complex nonlinear dynamics and multi-channel parallel capability, have attracted considerable research attention. However, key technical challenges for chaotic…

Optics · Physics 2026-04-23 Lin Jiang , Jihui Sun , Qiao Zhang , Jincheng Cui , Xiaohan Wang , Yanlan Xiao , Lin Sun , Hairong Lin , Haijun He , Jiacheng Feng , Anlin Yi , Jia Ye , Xihua Zou , Wei Pan , Gangxiang Shen , Heng Zhou , Lianshan Yan

Rf spectra and pseudogap in ultracold Fermi gases across the BCS-BEC crossover from pairing fluctuation theory

The pseudogap phenomenon is a hallmark of strongly interacting Fermi systems, from high-temperature superconductors to ultracold atomic gases, yet its precise origin remains debated. Here we calculate the spectral function and rf spectra of…

Quantum Gases · Physics 2026-04-08 Chuping Li , Lin Sun , Kaichao Zhang , Junru Wu , Yuxuan Wu , Dingli Yuan , Pengyi Chen , Qijin Chen

Spectral study of the pseudogap in unitary Fermi gases

The existence of a pseudogap in unitary Fermi gases has recently been established and measured experimentally [Li et al., Nature 626, 288 (2024)]. This lends strong support for the pairing origin as the mechanism of the pseudogap in Fermi…

Quantum Gases · Physics 2026-04-08 Chuping Li , Lin Sun , Kaichao Zhang , Junru Wu , Yuxuan Wu , Dingli Yuan , Pengyi Chen , Qijin Chen

Effects of particle-hole fluctuations on the superfluid transition in two-dimensional atomic Fermi gases

Proper treatment of the many-body interactions is of paramount importance in our understanding of strongly correlated systems. Here we investigate the effects of particle-hole fluctuations on the Berezinskii-Kosterlitz-Thouless (BKT)…

Quantum Gases · Physics 2026-03-13 Junru Wu , Zongpu Wang , Lin Sun , Kaichao Zhang , Chuping Li , Yuxuan Wu , Pengyi Chen , Dingli Yuan , Qijin Chen

Beyond Parameter Arithmetic: Sparse Complementary Fusion for Distribution-Aware Model Merging

Model merging has emerged as a promising paradigm for composing the capabilities of large language models by directly operating in weight space, enabling the integration of specialized models without costly retraining. However, existing…

Artificial Intelligence · Computer Science 2026-02-13 Weihong Lin , Lin Sun , Qilong Shi , Aomufei Yuan , Yuxuan Tian , Zhengyang Wang , Guangxiang Zhao , Xiangzheng Zhang , Tong Yang

Beyond Static Alignment: Hierarchical Policy Control for LLM Safety via Risk-Aware Chain-of-Thought

Large Language Models (LLMs) face a fundamental safety-helpfulness trade-off due to static, one-size-fits-all safety policies that lack runtime controllabilityxf, making it difficult to tailor responses to diverse application needs. %As a…

Computation and Language · Computer Science 2026-02-09 Jianfeng Si , Lin Sun , Weihong Lin , Xiangzheng Zhang

TriPlay-RL: Tri-Role Self-Play Reinforcement Learning for LLM Safety Alignment

In recent years, safety risks associated with large language models have become increasingly prominent, highlighting the urgent need to mitigate the generation of toxic and harmful content. The mainstream paradigm for LLM safety alignment…

Machine Learning · Computer Science 2026-02-02 Zhewen Tan , Wenhan Yu , Jianfeng Si , Tongxin Liu , Kaiqi Guan , Huiyan Jin , Jiawen Tao , Xiaokun Yuan , Duohe Ma , Xiangzheng Zhang , Tong Yang , Lin Sun

MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation

Temporal context is essential for robotic manipulation because such tasks are inherently non-Markovian, yet mainstream VLA models typically overlook it and struggle with long-horizon, temporally dependent tasks. Cognitive science suggests…

Robotics · Computer Science 2026-02-02 Hao Shi , Bin Xie , Yingfei Liu , Lin Sun , Fengrong Liu , Tiancai Wang , Erjin Zhou , Haoqiang Fan , Xiangyu Zhang , Gao Huang

Efficient Switchable Safety Control in LLMs via Magic-Token-Guided Co-Training

Current methods for content safety in Large Language Models (LLMs), such as Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF), often rely on multi-stage training pipelines and lack fine-grained,…

Computation and Language · Computer Science 2026-01-21 Jianfeng Si , Lin Sun , Zhewen Tan , Xiangzheng Zhang

On the maximum spectral radius of planar graphs

This paper investigates the maximum spectral radius of planar graphs with concrete fixed number of vertices, providing some tight bounds on the maximum spectral radius of general planar graph resorting to its order, and confirming that…

Combinatorics · Mathematics 2025-11-04 Guanglong Yu , Lin Sun

Dexbotic: Open-Source Vision-Language-Action Toolbox

In this paper, we present Dexbotic, an open-source Vision-Language-Action (VLA) model toolbox based on PyTorch. It aims to provide a one-stop VLA research service for professionals in the field of embodied intelligence. It offers a codebase…

Robotics · Computer Science 2025-10-28 Bin Xie , Erjin Zhou , Fan Jia , Hao Shi , Haoqiang Fan , Haowei Zhang , Hebei Li , Jianjian Sun , Jie Bin , Junwen Huang , Kai Liu , Kaixin Liu , Kefan Gu , Lin Sun , Meng Zhang , Peilong Han , Ruitao Hao , Ruitao Zhang , Saike Huang , Songhan Xie , Tiancai Wang , Tianle Liu , Wenbin Tang , Wenqi Zhu , Yang Chen , Yingfei Liu , Yizhuang Zhou , Yu Liu , Yucheng Zhao , Yunchao Ma , Yunfei Wei , Yuxiang Chen , Ze Chen , Zeming Li , Zhao Wu , Ziheng Zhang , Ziming Liu , Ziwei Yan , Ziyu Zhang

Large Language Models Badly Generalize across Option Length, Problem Types, and Irrelevant Noun Replacements

In this paper, we propose a ``Generalization Stress Test" to assess Large Language Models' (LLMs) generalization ability under slight and controlled perturbations, including option length, problem types, and irrelevant noun replacements. We…

Computation and Language · Computer Science 2025-09-23 Guangxiang Zhao , Saier Hu , Xiaoqi Jian , Jinzhu Wu , Yuhan Wu , Change Jia , Lin Sun , Xiangzheng Zhang

TextlessRAG: End-to-End Visual Document RAG by Speech Without Text

Document images encapsulate a wealth of knowledge, while the portability of spoken queries enables broader and flexible application scenarios. Yet, no prior work has explored knowledge base question answering over visual document images…

Computer Vision and Pattern Recognition · Computer Science 2025-09-11 Peijin Xie , Shun Qian , Bingquan Liu , Dexin Wang , Lin Sun , Xiangzheng Zhang