Related papers: Arithmetic functions and learning theory
Building on work of Charton, we train small transformer models to calculate the M\"{o}bius function $\mu(n)$ and the squarefree indicator function $\mu^2(n)$. The models attain nontrivial predictive power. We apply a mixture of additional…
We prove lower bounds on learning the M\"obius or Liouville function with a variety of standard learning techniques, including kernel methods, noisy gradient methods, and correlational statistical query algorithms. These results follow from…
Statistical learning theory chiefly studies restricted hypothesis classes, particularly those with finite Vapnik-Chervonenkis (VC) dimension. The fundamental quantity of interest is the sample complexity: the number of samples required to…
The theory of learning under the uniform distribution is rich and deep, with connections to cryptography, computational complexity, and the analysis of boolean functions to name a few areas. This theory however is very limited due to the…
An elementary recursive relation for M$\ddot{\mathrm{o}}$bius function $\mu (n)$ is introduced by two simple ways. With this recursive relation, $\mu (n)$ can be calculated without directly knowing the factorization of the $n$. $\mu (1)…
We study the growth rate of the summatory function of the M\"obius function in the context of an algebraic curve over a finite field. Our work shows a strong resemblance to its number field counterpart, which was proved by Ng in 2004. We…
We obtained the probabilities for the values of the M\"obius function for arbitrary numbers and found that the asymptotic densities of the squarefree integers among the odd and even numbers are $8/\pi^2$ and $4/\pi^2$, respectively. It is…
A multiplicative function $f$ is said to be resembling the M\"{o}bius function if $f$ is supported on the square-free integers, and $f(p)=\pm 1$ for each prime $p$. We prove $O$- and $\Omega$-results for the summatory function $\sum_{n\leq…
We study algorithmic randomness notions via effective versions of almost-everywhere theorems from analysis and ergodic theory. The effectivization is in terms of objects described by a computably enumerable set, such as lower semicomputable…
The paper presents some results for reducing the computation of the M\"obius functon of a M\"obius category that arises from a combinatorial inverse semigroup to that of locally finite partially ordered sets. We illustrate the computation…
We examine correlations of the M\"obius function over $\mathbb{F}_q[t]$ with linear or quadratic phases, that is, averages of the form \begin{equation} \label{eq:average} \frac{1}{q^n}\sum_{\text{deg }f<n} \mu(f)\chi(Q(f)) \end{equation}…
We study numerical integration by combining the trapezoidal rule with a M\"obius transformation that maps the unit circle onto the real line. We prove that the resulting transformed trapezoidal rule attains the optimal rate of convergence…
Reinforcement learning with outcome-based feedback faces a fundamental challenge: when rewards are only observed at trajectory endpoints, how do we assign credit to the right actions? This paper provides the first comprehensive analysis of…
We study the theoretical properties of random Fourier features classification with Lipschitz continuous loss functions such as support vector machine and logistic regression. Utilizing the regularity condition, we show for the first time…
We introduce a general result relating "short averages" of a multiplicative function to "long averages" which are well understood. This result has several consequences. First, for the M\"obius function we show that there are cancellations…
The algorithmic theory of randomness is well developed when the underlying space is the set of finite or infinite sequences and the underlying probability distribution is the uniform distribution or a computable distribution. These…
In this article we prove a general theorem which establishes the existence of limiting distributions for a wide class of error terms from prime number theory. As a corollary to our main theorem, we deduce previous results of Wintner (1935),…
The construction by Du et al. (2019) implies that even if a learner is given linear features in $\mathbb R^d$ that approximate the rewards in a bandit with a uniform error of $\epsilon$, then searching for an action that is optimal up to…
Let $\lambda(n)$ and $\mu(n)$ denote the Liouville function and the M\"obius function, respectively. In this study, relationships between the values of $\lambda(n)$ and $\lambda(n+h)$ up to $n\leq10^8$ for $1\leq h\leq1,000$ are explored.…
We calculate the mean and variance of sums of the M\"obius function and the indicator function of the squarefrees, in both short intervals and arithmetic progressions, in the context of the ring of polynomials over a finite field of $q$…