Defeng Sun — Scifaro

A Level Set Method with Secant Iterations for the Least-Squares Constrained Nuclear Norm Minimization

We present an efficient algorithm for least-squares constrained nuclear norm minimization, a computationally challenging problem with broad applications. Our approach combines a level set method with secant iterations and a proximal…

Optimization and Control · Mathematics 2026-03-16 Chiyu Ma, Jiaming Ma, Defeng Sun

Beyond the Prompt in Large Language Models: Comprehension, In-Context Learning, and Chain-of-Thought

Large Language Models (LLMs) have demonstrated remarkable proficiency across diverse tasks, exhibiting emergent properties such as semantic prompt comprehension, In-Context Learning (ICL), and Chain-of-Thought (CoT) reasoning. Despite their…

Computation and Language · Computer Science 2026-03-13 Yuling Jiao, Yanming Lai, Huazhen Lin, Wensen Ma, Houduo Qi, Defeng Sun

LAMBDA: A Large Model Based Data Agent

We introduce LArge Model Based Data Agent (LAMBDA), a novel open-source, code-free multi-agent data analysis system that leverages the power of large language models. LAMBDA is designed to address data analysis challenges in data-driven…

Artificial Intelligence · Computer Science 2026-03-10 Maojun Sun, Ruijian Han, Binyan Jiang, Houduo Qi, Defeng Sun, Yancheng Yuan, Jian Huang

DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

Large Language Model (LLM) agents can automate data-science workflows, but many rigorous statistical methods implemented in R remain underused because LLMs struggle with statistical knowledge and tool retrieval. Existing retrieval-augmented…

Information Retrieval · Computer Science 2026-03-06 Maojun Sun, Yue Wu, Yifei Xie, Ruijian Han, Binyan Jiang, Defeng Sun, Yancheng Yuan, Jian Huang

Standard Transformers Achieve the Minimax Rate in Nonparametric Regression with $C^{s,\lambda}$ Targets

The tremendous success of Transformer models in fields such as large language models and computer vision necessitates a rigorous theoretical investigation. To the best of our knowledge, this paper is the first work proving that standard…

Machine Learning · Statistics 2026-02-25 Yanming Lai, Defeng Sun

Complexity of normalized stochastic first-order methods with momentum under heavy-tailed noise

In this paper, we propose practical normalized stochastic first-order methods with Polyak momentum, multi-extrapolated momentum, and recursive momentum for solving unconstrained optimization problems. These methods employ dynamically…

Optimization and Control · Mathematics 2026-02-12 Chuan He, Zhaosong Lu, Defeng Sun, Zhanwang Deng

DSAEval: Evaluating Data Science Agents on a Wide Range of Real-World Data Science Problems

Recent LLM-based data agents aim to automate data science tasks ranging from data analysis to deep learning. However, the open-ended nature of real-world data science problems, which often span multiple taxonomies and lack standard answers,…

Artificial Intelligence · Computer Science 2026-01-21 Maojun Sun, Yifei Xie, Yue Wu, Ruijian Han, Binyan Jiang, Defeng Sun, Yancheng Yuan, Jian Huang

The global well-posedness for master equations of mean field games of controls

In this manuscript, we establish the global well-posedness for master equations of mean field games of controls, where the interaction is through the joint law of the state and control. Our results are proved under two different conditions:…

Probability · Mathematics 2026-01-21 Shuhui Liu, Xintian Liu, Chenchen Mou, Defeng Sun

A Survey on Large Language Model-based Agents for Statistics and Data Science

In recent years, data science agents powered by Large Language Models (LLMs), known as "data agents," have shown significant potential to transform the traditional data analysis paradigm. This survey provides an overview of the evolution,…

Artificial Intelligence · Computer Science 2025-12-01 Maojun Sun, Ruijian Han, Binyan Jiang, Houduo Qi, Defeng Sun, Yancheng Yuan, Jian Huang

dHPR: A Distributed Halpern Peaceman–Rachford Method for Non-smooth Distributed Optimization Problems

This paper introduces the distributed Halpern Peaceman–Rachford (dHPR) method, an efficient algorithm for solving distributed convex composite optimization problems with non-smooth objectives, which achieves a non-ergodic $O(1/k)$…

Optimization and Control · Mathematics 2025-11-14 Zhangcheng Feng, Defeng Sun, Yancheng Yuan, Guojun Zhang

The Aubin Property for Generalized Equations over $C^2$-cone Reducible Sets

This paper establishes the equivalence of the Aubin property and the strong regularity for generalized equations over $C^2$-cone reducible sets. This result resolves a long-standing question in variational analysis and extends the…

Optimization and Control · Mathematics 2025-10-14 Jiaming Ma, Defeng Sun

Progressive Bound Strengthening via Doubly Nonnegative Cutting Planes for Nonconvex Quadratic Programs

We introduce a cutting-plane framework for nonconvex quadratic programs (QPs) that progressively tightens convex relaxations. Our approach leverages the doubly nonnegative (DNN) relaxation to compute strong lower bounds and generate…

Optimization and Control · Mathematics 2025-10-06 Zheng Qu, Defeng Sun, Jintao Xu

On Error Bounds for Rank-Constrained Affine Matrix Sets

Rank-constrained matrix problems appear frequently across science and engineering. The convergence analysis of iterative algorithms developed for these problems often hinges on local error bounds, which correlate the distance to the…

Optimization and Control · Mathematics 2025-10-03 Ruoning Chen, Defeng Sun, Liping Zhang

On the Relationships among GPU-Accelerated First-Order Methods for Solving Linear Programming

This paper aims to understand the relationships among recently developed GPU-accelerated first-order methods (FOMs) for linear programming (LP), with particular emphasis on HPR-LP, a Halpern Peaceman–Rachford (HPR) method for LP. Our…

Optimization and Control · Mathematics 2025-10-02 Kaihuang Chen, Defeng Sun, Yancheng Yuan, Guojun Zhang, Xinyuan Zhao

Robust Gradient Descent Estimation for Tensor Models under Heavy-Tailed Distributions

Low-rank tensor models are widely used in statistics. However, most existing methods rely heavily on the assumption that data follows a sub-Gaussian distribution. To address the challenges associated with heavy-tailed distributions…

Methodology · Statistics 2025-09-16 Xiaoyu Zhang, Di Wang, Guodong Li, Defeng Sun

On the $p$-order Semismoothness of the Metric Projection onto Slices of the Positive Semidefinite Cone

The metric projection onto the positive semidefinite (PSD) cone is strongly semismooth, a property that guarantees local quadratic convergence for many powerful algorithms in semidefinite programming. In this paper, we investigate whether…

Optimization and Control · Mathematics 2025-09-05 Ruoning Chen, Jiaming Ma, Defeng Sun

HPR-QP: A dual Halpern Peaceman-Rachford method for solving large-scale convex composite quadratic programming

In this paper, we introduce HPR-QP, a dual Halpern Peaceman-Rachford (HPR) method designed for solving large-scale convex composite quadratic programming. One distinctive feature of HPR-QP is that, instead of working with the primal…

Optimization and Control · Mathematics 2025-07-04 Kaihuang Chen, Defeng Sun, Yancheng Yuan, Guojun Zhang, Xinyuan Zhao

Distribution Matching for Self-Supervised Transfer Learning

In this paper, we propose a novel self-supervised transfer learning method called Distribution Matching (DM), which drives the representation distribution toward a predefined reference distribution…

Machine Learning · Statistics 2025-07-03 Yuling Jiao, Wensen Ma, Defeng Sun, Hansheng Wang, Yang Wang

Adaptive sieving: A dimension reduction technique for sparse optimization problems

In this paper, we propose an adaptive sieving (AS) strategy for solving general sparse machine learning models by effectively exploring the intrinsic sparsity of the solutions, wherein only a sequence of reduced problems with much smaller…

Optimization and Control · Mathematics 2025-04-28 Yancheng Yuan, Meixia Lin, Defeng Sun, Kim-Chuan Toh

Approximation Bounds for Transformer Networks with Application to Regression

We explore the approximation capabilities of Transformer networks for Hölder and Sobolev functions, and apply these results to address nonparametric regression estimation with dependent observations. First, we establish novel upper bounds…

Machine Learning · Statistics 2025-04-17 Yuling Jiao, Yanming Lai, Defeng Sun, Yang Wang, Bokai Yan