To cheer you up in difficult times 5: A New Elementary Proof of the Prime Number Theorem by Florian K. Richter

Here is a piece of news that will certainly cheer you up: Florian Richter found A new elementary proof of the prime number theorem. (I thank Tami Ziegler for telling me about the new result.)

From left to right: Atle Selberg, Florian Richter, and Paul Erdos

From the Introduction

“One of the most fundamental results in mathematics is the Prime Number Theorem, which describes the asymptotic law of the distribution of prime numbers in the integers.

Prime Number Theorem. Let π(N) denote the number of primes smaller or equal to a positive integer N. Then

\lim_{N \to \infty}   \frac {\pi (N)}{N/\log N}~=~1

The Prime Number Theorem was conjectured independently by Gauß and Legendre towards the end of the 18th century and was proved independently by Hadamard and de la Vallée Poussin in the year 1896. Their proofs are similar and rely on sophisticated analytic machinery involving complex analysis, which was developed throughout the 19th century by the combined effort of many great mathematicians of this era, including Euler, Chebyshev, and Riemann (to name a few).

Even though it was believed for a long time not to be possible, more than 50 years after the analytic proof of the Prime Number Theorem was discovered, an elementary proof was found by Erdős and Selberg [Erd49; Sel49]. In this context, elementary refers to methods that avoid using complex analysis and instead rely only on rudimentary facts from calculus and basic arithmetic identities and inequalities. Their approach was based on Selberg’s famous “fundamental formula”:

\sum_{p \le x}\log ^2(p)~+~\sum_{pq \le x}\log (p)\log (q)~=~2x\log (x)+O(x).

In this paper we provide a new elementary proof of the Prime Number Theorem, using ideas inspired by recent developments surrounding Sarnak’s and Chowla’s conjecture. With the exception of Stirling’s approximation formula, which is used in Section 3 without giving a proof, our proof is self-contained.”

The introduction also cites papers on the history of the analytic method and an abridged version of it by Newman, on several earlier elementary proofs of the PNT, and on recent dynamically inspired way by McNamara of deriving the Prime Number Theorem from Selberg’s fundamental formula.

Some thoughts and remarks

What precisely is elementary, formally?

What is the formal distinction between elementary and non-elementary proofs of the PNT? (This was explained to me several times but I don’t remember the details.) The distinctions between “non-explicit” and “explicit” proofs (or constructions) and between “effective” and “non-effective” proofs are quite clear cut. But here I am less certain.

Are there other important distinctions about proofs that you are aware of?

PNT and Boolean functions (weighted majority functions)

An equivalent formulation of the prime number theorem is the following:

(*)~~~ \sum_{k \le n}\mu (k)~=~o(n),

where \mu is the Mobius function. Consider now a Boolean function f(x_1,x_2, \dots, x_n) with n variables. Here the variables x_i and the function f has values in \{0,1\}. Given n nonnegative weights w_1\le w_2 \le \cdots \le w_n and C>0 the function $f$ is a weighted majority function if f=1 iff w_1x_1+\cdots +w_nx_n \le C. Let p(f) be the probability that f=1, and let \hat f[n] be the top Fourier coefficient of f.


(1) Find conditions for a weighted majority function f that guarantee that

(**)~~~ \hat f([n])~=~o( p(f))

Let p_k be the kth prime, w_k = \log p_k, and C=\log n. The PNT asserts that the Boolean function associated to these parameters satisfies (**). (It is very easy to see that (**) and (*) are essentially the same.)  From vague understanding of some fragments of older elementary proofs of the PNT it looks that the proofs could perhaps be described in this way, namely starting with some weaker properties on the weights (namely the primes)  perhaps the derivation of (**) apply to general weighted majority functions with these properties. (This is related to PNT for systems of Beurling primes.) Finding conditions for (**) is interesting also for other weighted majority functions where the weights are not related to the growth behavior of the primes.  In fact, the following is also interesting:

(2) Find conditions for (general) Boolean functions f that guarantee that (**) holds, namely that the top Fourier coefficients is little o of p(f). Signs of low degree polynomials form a very interesting class of  Boolean functions, and they are of special interest also here.

Remark: The first problem looks very related to PNT for systems of Beurling primes going back to 1937 paper by Arne Beurling. (Based on the inequality 1937 < 1949, I understand that Breuling’s proof was not elementary, and I don’t know if the elementary proofs for PNT gives Breuling’s theorem as well.)

PNT here on the blog and Mobius randomness

The new paper is related to conjectures by Chowla and Sarnak and to the notion of Mobius randomness that we considered in an earlier post. The basic question is for which functions f of k, {\bf (***)} \sum_{k \le n} \mu (k)f(k) = o(\sum_{k \le n}f(k)). This is correct if f is random (or random-like in a variety of meanings), and it turns out to be correct if (in some sense) if f is of low complexity. We also mentioned some analogs of the PNT in this post. Both polymath 4 and polymath5 had some connections to the PNT.

Is PNT to expanders like RH to Ramanujan graphs?

This is another connection to combinatorics, See this post in “In Theory”.

This entry was posted in Number theory, Updates and tagged , , . Bookmark the permalink.

6 Responses to To cheer you up in difficult times 5: A New Elementary Proof of the Prime Number Theorem by Florian K. Richter

  1. John Baez says:

    I would guess that an elementary proof of the prime number theorem can be formalized in Peano Arithmetic, while a non-elementary one might use a more powerful set of axioms. For example, RCA0 is a weak version of second-order arithmetic that allows you to do some analysis which goes beyond Peano arithmetic, like prove the intermediate value theorem. The link also lists some stronger systems, which together with RCA0 form “the big five”: most everyday math can be done in one of these systems.

    • Gil Kalai says:

      Dear John, many thanks for the comment and link. Certainly the axiomatic setting that allows the proof is a major distinction between various proofs. However I am not sure how crucial this distinction is to “real-life” proofs and, in particular, is it relevant to the elementary/non-elementary distinction.

  2. wdjoyner says:

    Arne Breuling is spelled Arne Beurling.

    GK: Thanks corrected.

  3. Thanks for a *very* interesting blog!

    Your question, “What precisely is an elementary proof?” reminds me of an analogous question in algorithm design: “What precisely is a combinatorial algorithm?”
    I don’t believe there is a “formal” definition of “combinatorial” algorithm, but one can tell it apart from a “non-combinatorial” one, just as easily as one can tell oil from water …

  4. segevgonen says:

    In the context of number theory, I always thought ‘elementary’ just meant ‘without using complex analysis’ but I suppose this distinction can be difficult at times?

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s