Qingfeng Sun
Let $S(t,f)=\pi^{-1}\arg L(1/2+it, f)$, where $f$ is a holomorphic Hecke cusp form of weight $2$ and prime level $q$. In this paper, we establish an unconditional asymptotic formula for the moments of $S(t,f)$, providing a level aspect…
As Large Language Model (LLM) alignment evolves from simple completions to complex, highly sophisticated generation, Reward Models are increasingly shifting toward rubric-guided evaluation to mitigate surface-level biases. However, the…
Recent advancements in Generative Reward Models (GRMs) have demonstrated that scaling the length of Chain-of-Thought (CoT) reasoning considerably enhances the reliability of evaluation. However, current works predominantly rely on…
Large Reasoning Models (LRMs) like o3 and DeepSeek-R1 have achieved remarkable progress in reasoning tasks with long cot. However, they remain computationally inefficient and struggle with accuracy when solving problems requiring complex…
Deep research agents have shown remarkable potential in handling long-horizon tasks. However, state-of-the-art performance typically relies on online reinforcement learning (RL), which is financially expensive due to extensive API calls.…
Let $F$ be a Hecke--Maass cusp form for $\mathrm{SL}_3(\mathbb{Z})$ with the Langlands parameter $\mu_{F}=\big(\mu_{F,1},\mu_{F,2},\mu_{F,3}\big)$ and the associated $L$-function $L(s, F)$. Define $S_F(t)=\pi^{-1}\arg L(1/2+\mathrm{i}t,…
We find some equidistribution results connected to restriction quantum unique ergodicity problem in this paper. We shows that \begin{align*} \frac{1}{|\mathcal{B}_k|}\sum_{f\in \mathcal{B}_k} \int_{R}y^{k}|f(z)|^{2}\psi(z) d\mu_{R}(z)\to…
Zero Reinforcement Learning (Zero-RL) has proven to be an effective approach for enhancing the reasoning capabilities of large language models (LLMs) by directly applying reinforcement learning with verifiable rewards on pretrained models,…
As Large Language Models (LLMs) rapidly advance, we introduce Hunyuan-TurboS, a novel large hybrid Transformer-Mamba Mixture of Experts (MoE) model. It synergistically combines Mamba's long-sequence processing efficiency with Transformer's…
Large language models (LLMs), such as GPT-4, have shown remarkable performance in natural language processing (NLP) tasks, including challenging mathematical reasoning. However, most existing open-source models are only pre-trained on…
Let $f$ be a fixed holomorphic primitive cusp form of even weight $k$, level $r$ and trivial nebentypus $\chi_r$. Let $q$ be an odd prime with $(q,r)=1$ and let $\chi$ be a primitive Dirichlet character modulus $q$ with $\chi\neq\chi_r$. In…
Code Large Language Models (Code LLMs), such as StarCoder, have demonstrated exceptional performance in code-related tasks. However, most existing models are solely pre-trained on extensive raw code data without instruction fine-tuning. In…
Training large language models (LLMs) with open-domain instruction following data brings colossal success. However, manually creating such instruction data is very time-consuming and labor-intensive. Moreover, humans may struggle to produce…
Despite recent progress achieved by code large language models (LLMs), their remarkable abilities are largely dependent on fine-tuning on the high-quality data, posing challenges for data collection and annotation. To address this, current…
Large Language Model-based agents have garnered significant attention and are becoming increasingly popular. Furthermore, planning ability is a crucial component of an LLM-based agent, which generally entails achieving a desired goal from…
This paper explores the use of Large Language Models (LLMs) for sequential recommendation, which predicts users' future interactions based on their past behavior. We introduce a new concept, "Integrating Recommendation Systems as a New…
In this paper, we establish an asymptotic formula for the twisted second moment of $L$-functions associated with Hecke--Maass cusp forms for $\rm SL(3,\mathbb{Z})$, and further deduce a weighted zero-density estimate for these $L$-functions…
Let $S_j(t)=\frac{1}{\pi}\arg L(1/2+it, u_j)$, where $u_j$ is an even Hecke--Maass cusp form for $\rm SL_2(\mathbb{Z})$ with Laplacian eigenvalue $\lambda_j=\frac{1}{4}+t_j^2$. Without assuming the GRH, we establish an asymptotic formula…
We compute the quantum variance of holomorphic cusp forms on the vertical geodesic for smooth compactly supported test functions. As an application we show that almost all holomorphic Hecke cusp forms, whose weights are in a short interval,…
Let $\tau_k(n)$ be the $k$-th divisor function. In this paper, we derive an asymptotic formula for the sum $$ \sum_{1\leq n_1,n_2, \dots, n_{\ell}\leq X^{\frac{1}{r}} \atop 1\leq n_{\ell+1}\le X^{\frac{1}{s}}}\tau_k(n_1^r+n_2^r+\dots…