Tianshi Xu — Scifaro

Adapting the Interface, Not the Model: Runtime Harness Adaptation for Deterministic LLM Agents

LLM agents are shaped not only by their language models, but also by the runtime harness that mediates observation, tool use, action execution, feedback interpretation, and trajectory control. While existing agent adaptation methods mainly…

Artificial Intelligence · Computer Science 2026-05-28 Tianshi Xu, Huifeng Wen, Meng Li

EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval

Object-goal navigation (ObjNav) tasks an agent with navigating to the location of a specific object in an unseen environment. Embodied agents equipped with large language models (LLMs) and online constructed navigation maps can perform…

Robotics · Computer Science 2026-03-18 Zebin Yang, Sunjian Zheng, Tong Xie, Tianshi Xu, Bo Yu, Fan Wang, Jie Tang, Shaoshan Liu, Meng Li

UFO: Unlocking Ultra-Efficient Quantized Private Inference with Protocol and Algorithm Co-Optimization

Private convolutional neural network (CNN) inference based on secure two-party computation (2PC) suffers from high communication and latency overhead, especially from convolution layers. In this paper, we propose UFO, a quantized 2PC…

Cryptography and Security · Computer Science 2026-02-24 Wenxuan Zeng, Chao Yang, Tianshi Xu, Bo Zhang, Changrui Ren, Jin Dong, Meng Li

Preconditioned Truncated Single-Sample Estimators for Scalable Stochastic Optimization

Many large-scale stochastic optimization algorithms involve repeated solutions of linear systems or evaluations of log-determinants. In these regimes, computing exact solutions is often unnecessary; it is more computationally efficient to…

Numerical Analysis · Mathematics 2026-02-24 Tianshi Xu, Difeng Cai, Hua Huang, Edmond Chow, Yuanzhe Xi

CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning

Agentic Reinforcement Learning (RL) has empowered Large Language Models (LLMs) to utilize tools like Python interpreters for complex problem-solving. However, for parameter-constrained models (e.g., 4B–7B), the exploration phase is often…

Machine Learning · Computer Science 2026-01-22 Tianshi Xu, Yuteng Chen, Meng Li

HiGP: A high-performance Python package for Gaussian Process

Gaussian Processes (GPs) are flexible, nonparametric Bayesian models widely used for regression and classification because of their ability to capture complex data patterns and quantify predictive uncertainty. However, the O(n^3)…

Machine Learning · Computer Science 2026-01-14 Hua Huang, Tianshi Xu, Yuanzhe Xi, Edmond Chow

Designing Preconditioners for SGD: Local Conditioning, Noise Floors, and Basin Stability

Stochastic Gradient Descent (SGD) often slows in the late stage of training due to anisotropic curvature and gradient noise. We analyze preconditioned SGD in the geometry induced by a symmetric positive definite matrix $\mathbf{M}$,…

Numerical Analysis · Mathematics 2025-11-26 Mitchell Scott, Tianshi Xu, Ziyuan Tang, Alexandra Pichette-Emmons, Qiang Ye, Yousef Saad, Yuanzhe Xi

Multiscale Neural Networks for Approximating Green's Functions

Neural networks (NNs) have been widely used to solve partial differential equations (PDEs) in the applications of physics, biology, and engineering. One effective approach for solving PDEs with a fixed differential operator is learning…

Numerical Analysis · Mathematics 2025-11-21 Wenrui Hao, Rui Peng Li, Yuanzhe Xi, Tianshi Xu, Yahong Yang

CryptoMoE: Privacy-Preserving and Scalable Mixture of Experts Inference via Balanced Expert Routing

Private large language model (LLM) inference based on cryptographic primitives offers a promising path towards privacy-preserving deep learning. However, existing frameworks only support dense LLMs like LLaMA-1 and struggle to scale to…

Cryptography and Security · Computer Science 2025-11-12 Yifan Zhou, Tianshi Xu, Jue Hong, Ye Wu, Meng Li

Neural Approximate Inverse Preconditioners

In this paper, we propose a data-driven framework for constructing efficient approximate inverse preconditioners for elliptic partial differential equations (PDEs) by learning the Green's function of the underlying operator with neural…

Numerical Analysis · Mathematics 2025-10-21 Tianshi Xu, Rui Peng Li, Yuanzhe Xi

Ironman: Accelerating Oblivious Transfer Extension for Privacy-Preserving AI with Near-Memory Processing

With the wide application of machine learning (ML), privacy concerns arise with user data as they may contain sensitive information. Privacy-preserving ML (PPML) based on cryptographic primitives has emerged as a promising solution in which…

Hardware Architecture · Computer Science 2025-10-07 Chenqi Lin, Kang Yang, Tianshi Xu, Ling Liang, Yufei Wang, Zhaohui Chen, Runsheng Wang, Mingyu Gao, Meng Li

Breaking the Layer Barrier: Remodeling Private Transformer Inference with Hybrid CKKS and MPC

This paper presents an efficient framework for private Transformer inference that combines Homomorphic Encryption (HE) and Secure Multi-party Computation (MPC) to protect data privacy. Existing methods often leverage HE for linear layers…

Cryptography and Security · Computer Science 2025-09-03 Tianshi Xu, Wen-jie Lu, Jiangrui Yu, Chen Yi, Chenqi Lin, Runsheng Wang, Meng Li

Towards Efficient Privacy-Preserving Machine Learning: A Systematic Review from Protocol, Model, and System Perspectives

Privacy-preserving machine learning (PPML) based on cryptographic protocols has emerged as a promising paradigm to protect user data privacy in cloud-based machine learning services. While it achieves formal privacy protection, PPML often…

Cryptography and Security · Computer Science 2025-07-22 Wenxuan Zeng, Tianshi Xu, Yi Chen, Yifan Zhou, Mingzhe Zhang, Jin Tan, Cheng Hong, Meng Li

S$^4$C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models

Large language models (LLMs) exhibit remarkable reasoning capabilities across diverse downstream tasks. However, their autoregressive nature leads to substantial inference latency, posing challenges for real-time applications. Speculative…

Computation and Language · Computer Science 2025-06-18 Tao He, Guang Huang, Yu Yang, Tianshi Xu, Sicheng Zhao, Guiguang Ding, Pengyang Wang, Feng Tian

Mixed Precision Orthogonalization-Free Projection Methods for Eigenvalue and Singular Value Problems

Mixed-precision arithmetic offers significant computational advantages for large-scale matrix computation tasks, yet preserving accuracy and stability in eigenvalue problems and the singular value decomposition (SVD) remains challenging.…

Numerical Analysis · Mathematics 2025-05-05 Tianshi Xu, Zechen Zhang, Jie Chen, Yousef Saad, Yuanzhe Xi

Preconditioned Additive Gaussian Processes with Fourier Acceleration

Gaussian processes (GPs) are crucial in machine learning for quantifying uncertainty in predictions. However, their associated covariance matrices, defined by kernel functions, are typically dense and large-scale, posing significant…

Machine Learning · Computer Science 2025-04-02 Theresa Wagner, Tianshi Xu, Franziska Nestler, Yuanzhe Xi, Martin Stoll

PrivCirNet: Efficient Private Inference via Block Circulant Transformation

Homomorphic encryption (HE)-based deep neural network (DNN) inference protects data and model privacy but suffers from significant computation overhead. We observe that transforming the DNN weights into circulant matrices converts general…

Cryptography and Security · Computer Science 2024-10-30 Tianshi Xu, Lemeng Wu, Runsheng Wang, Meng Li

PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization

Private deep neural network (DNN) inference based on secure two-party computation (2PC) enables secure privacy protection for both the server and the client. However, existing secure 2PC frameworks suffer from a high inference latency due…

Cryptography and Security · Computer Science 2024-10-15 Tianshi Xu, Shuzhang Zhong, Wenxuan Zeng, Runsheng Wang, Meng Li

Anderson Acceleration with Truncated Gram-Schmidt

Anderson Acceleration (AA) is a popular algorithm designed to enhance the convergence of fixed-point iterations. In this paper, we introduce a variant of AA based on a Truncated Gram-Schmidt process (AATGS) which has a few advantages over…

Numerical Analysis · Mathematics 2024-07-17 Ziyuan Tang, Tianshi Xu, Huan He, Yousef Saad, Yuanzhe Xi

FastQuery: Communication-efficient Embedding Table Query for Private LLM Inference

With the fast evolution of large language models (LLMs), privacy concerns with user queries arise as they may contain sensitive information. Private inference based on homomorphic encryption (HE) has been proposed to protect user query…

Cryptography and Security · Computer Science 2024-05-28 Chenqi Lin, Tianshi Xu, Zebin Yang, Runsheng Wang, Ru Huang, Meng Li