Loading [MathJax]/jax/output/HTML-CSS/fonts/TeX/fontdata.js

Dirichlet convolution, part 2: Dirichlet series and prime counting

Revision en14, by adamant, 2023-06-30 22:59:52

Hi everyone!

Recently I've published a blog about how one can multiply and divide sequences with Dirichlet convolution. In this blog, we will learn about a convenient framework to reason about them in a "coordinate-free" notation, similar to how generating functions are used to analyze sequences under the regular convolution.

We will learn how to deal with Dirichlet multiplication and division in the framework of Dirichlet series, and derive a number of well known number theoretic results from this perspective. While doing so, we will learn about Riemann zeta function and will have a glimpse into why it is so important in analyzing behavior of prime numbers.


Convolutions recap

Let a1,a2, and b1,b2, be two sequences. We define their Dirichlet convolution as

(ab)k=ij=kaibj.

Before working with it, let's recall what we did with other convolutions. Assume, more generically, that we want to analyze

ck=ij=kaibj.

What we typically do to work with such convolutions is we introduce a family of functions ρi such that

ρiρj=ρij,

With these objects in mind, we can rewrite the expression above as

kckρk=kij=kaibjρij=(iaiρi)(jbjρj).

Note that we vaguely described ρi as "functions" without going into detail about their type and/or domain. This is somewhat intentional, as specific applications may imply different specifications. Consider some standard examples (essentially taken from here).

Examples

Dirichlet series

Now, what should we use for i \circ j = ij, where ij means simple multiplication? We already noticed that multiplying two exponents with the same base yields to summation in powers as x^i x^j = x^{i+j}. But what happens if we multiply two exponents with the same power? We get multiplication in bases, that is x^s y^s = (xy)^s. Thus, we define

\rho_i(s) = i^{-s},

where s is a positive integer greater than 1. Using such functions, we define the Dirichlet series of a sequence a_1, a_2, \dots as

\boxed{A(s) = \sum\limits_{i=1}^\infty \frac{a_i}{i^s}}

This allows us to rewrite Dirichlet convolution as

\boxed{C(s) = \sum\limits_k \frac{c_k}{k^s} = \sum\limits_k \sum\limits_{ij=k} \frac{a_i b_j}{(ij)^s} = \left(\sum\limits_i \frac{a_i}{i^s}\right)\left( \sum\limits_j \frac{b_j}{j^s}\right) = A(s) B(s)}

Note: We use i^{-s} instead of \rho_i(s) = i^s by convention, and so that Dirichlet series converges for integer s > 1.

With formal power series, it is common to write [x^k] F(x) to denote the coefficient near x^k of the expansion of F(x). For notational convenience, we will also write [n^{-s}] F(s) to denote the coefficient near n^{-s} of the Dirichlet series.

Multiplicative functions

First and most natural example of where Dirichlet series pop up are multiplicative functions. Let's see why.

A multiplicative function is a function f : \mathbb Z_+ \mapsto \mathbb C such that f(ab) = f(a) f(b) for any a,b such that \gcd(a,b)=1.

From fundamental theorem of arithmetic it follows that multiplicative functions are fully defined by their values in powers of primes:

n=p_1^{d_1} \dots p_k^{d_k} \implies f(n) = f(p_1^{d_1}) \dots f(p_k^{d_k}).

Therefore, when working with multiplicative functions, it is often convenient, for each prime number p to consider the sequence

f(1), f(p), f(p^2), \dots,

and sometimes even its ordinary generating function (also known as the Bell series):

F_p(x) = \sum\limits_k f(p^k) x^k,

as the Dirichlet product over just powers of primes corresponds to the regular product of such generating functions:

(f * g)(p^k) = \sum\limits_{p^i p^j = p^k} f(p^i) g(p^j) = \sum\limits_{i+j=k} f(p^i) g(p^j) = [x^{k}] F_p(x) G_p(x)

This connection re-expresses itself even more when treated through Dirichlet series framework. Let's, similarly to generating functions above, define the p-component of the multiplicative function's Dirichlet series F_p(s) as

\boxed{F_p(s) = \sum\limits_k \frac{f(p^k)}{p^{ks}}}

Using the notation of p-components, we can then find out that

\boxed{F(s) = \sum\limits_{n} \frac{f(n)}{n^{-s}} = \sum\limits_{n} \frac{f(p_1^{d_1}) \dots f(p_k^{d_k})}{p_1^{-d_1 s} \dots p_k^{-d_k s}} = \left(\sum\limits_{p_1} \frac{f(p_1^{d_1})}{p_1^{-d_1 s}}\right) \left(\sum\limits_{p_2}\frac{f(p_2^{d_2})}{p_2^{-d_2 s}}\right) \dots = \prod\limits_{p\text{ prime}} F_p(s)}

In other words, the full Dirichlet series can be represented as an infinite product over all primes (so-called Euler product):

\boxed{\sum\limits_n \frac{f(n)}{n^s} = \prod\limits_{p\text{ prime}} F_p(s)}

From this, for example, immediately follows that the Dirichlet convolution of multiplicative functions is a multiplicative function, as

F(s) G(s) = \prod\limits_{p\text{ prime}} F_p(s) \prod\limits_{p\text{ prime}} G_p(s) = \prod\limits_{p\text{ prime}} F_p(s) G_p(s).

This result is quite important, as it allows us to analyze Dirichlet series of multiplicative functions as infinite sums or infinite products of good-looking functions, depending on what properties are of higher interest to us.

Examples and their Dirichlet series

In this section, we consider some well-known examples of multiplicative functions (taken mostly from here) and how their Dirichlet series are computed. As the main purpose of this section is to get a grasp of various perspectives used to work with Dirichlet series, we will consider several possible derivations for each function.

The constant function I(n)=1. Since I(p^k) = 1, its p-components look like

\zeta_p(s) = \sum\limits_k \frac{1}{p^{ks}} = \frac{1}{1-p^{-s}},

therefore its Dirichlet series can be represented as

\boxed{\zeta(s) = \sum\limits_n \frac{1}{n^s} = \prod\limits_{p\text{ prime}}\frac{1}{1-p^{-s}}}

This particular Dirichlet series is also known as the Riemann zeta function. From the result above it follows that multiplying the Dirichlet series of a multiplicative function f(n) with the Riemann zeta function, results into a multiplicative function g(n) such that

g(n) = \sum\limits_{d|n} f(d).

The Möbius function. In number theory, one often wants to inverse such transform, that is, to find f(n) from a known g(n). As we already noticed above, G(s) = \zeta(s) F(s), thus we can express F(s) as F(s) = \zeta^{-1}(s)G(s). If you're familiar with Möbius inversion, you already know what happens next. We need to find the function \zeta^{-1}(s) explicitly, and we can do so as

\zeta^{-1}(s) = \prod\limits_{p\text{ prime}} (1-p^{-s}).

If we fully expand it, we get

\boxed{\zeta^{-1}(s) = \sum\limits_k (-1)^k \prod\limits_{p_1, \dots,p_k\text{ prime}} \frac{1}{p_1^s \dots p_k^s} = \sum\limits_n \frac{\mu(n)}{n^s}}

In the identity above, we introduced the function \mu(n) such that

\mu(n) = \begin{cases} (-1)^k, & n = p_1 \dots p_k, \\ 0, & \text{otherwise}. \end{cases}

The function \mu(n) is also known as the Möbius function. It is a multiplicative function, that can be defined in powers of primes as \mu(p^k) = [k=0] - [k=1], where [\cdot] is the Iverson bracket, which evaluates to 1 if the expression inside is true and to 0 otherwise.

From this, we also get the Möbius inversion formula:

f(n) = \sum\limits_{d|n} \mu\left(\frac{n}{d}\right) g(d).

The power function f(n) = n^a. Since f(p^k) = p^{ka}, its p-components look like

F_p(s) = \sum\limits_k \frac{p^{ka}}{p^{ks}} = \frac{1}{1-p^{a-s}},

thus its Dirichlet series can be represented as

\boxed{F(s) = \prod\limits_{p\text{ prime}} \frac{1}{1-p^{a-s}} = \zeta(s-a)}

Of course, the same result can also be obtained directly as

F(s) = \sum\limits_n \frac{n^a}{n^s} = \sum\limits_n \frac{1}{n^{s-a}} = \zeta(s-a).

The divisor function \sigma_a(n) = \sum\limits_{d|n} d^a. Since \sigma_a(p^k)=\sum\limits_{i \leq k} p^{ia}, its p-components look like

F_p(s) = \sum\limits_k \frac{1}{p^{ks}} \sum\limits_{i \leq k} p^{ia} = \sum\limits_i p^{ia} \sum\limits_{k \geq i} \frac{1}{p^{ks}} = \sum\limits_i p^{ai} \frac{p^{-is}}{1-p^{-s}} = \frac{1}{1-p^{-s}} \frac{1}{1-p^{a-s}}.

This, in turn, means that its Dirichlet series can be represented as

\boxed{F(s) = \prod\limits_{p\text{ prime}} \frac{1}{1-p^{-s}} \frac{1}{1-p^{a-s}} = \zeta(s) \zeta(s-a)}

Another way to get the same result directly is as follows:

F(s) = \sum\limits_d d^a \sum\limits_k \frac{1}{d^s k^s} = \sum\limits_{d,k} \frac{1}{d^{s-a} k^s} = \zeta(s) \zeta(s-a).

And yet another way to get it is to express \sigma_a(n) as the Dirichlet convolution of f(n) = 1 and g(n) = n^a. The first one has Dirichlet series \zeta(s), and the second one has Dirichlet series \zeta(s-a).

The Euler's totient function \varphi(n). Since \varphi(p^k) = p^k - p^{k-1}, its p-components look like

F_p(s) = 1+(p-1)\sum\limits_{k \geq 1} \frac{p^{k-1}}{p^{ks}} = 1 + \frac{p-1}{p} \sum\limits_{k \geq 1} \frac{1}{p^{k(s-1)}} = 1 + \frac{p-1}{p} \frac{p^{1-s}}{1-p^{1-s}} = \boxed{\frac{1-p^{-s}}{1-p^{1-s}}}

Multiplying it over all p, we get

\boxed{F(s) = \prod\limits_{p\text{ prime}} \frac{1-p^{-s}}{1-p^{1-s}} = \frac{\zeta(s-1)}{\zeta(s)}}

Another way to derive it is to express g(n) = n as the Dirichlet convolution of \varphi(n) and f(n)=1:

\sum\limits_{d|n} \varphi\left(\frac{n}{d}\right) = n,

as \varphi\left(\frac{n}{d}\right) is the amount of numbers k from 1 to n such that \gcd(k, n) = d. But the Dirichlet series of g(n) is known to be \zeta(s-1) and for f(n) it's \zeta(s), so this Dirichlet convolution, indeed, rewrites in terms of Dirichlet series as \zeta(s) F(s) = \zeta(s-1).

Totally multiplicative functions

A totally multiplicative function is a function f : \mathbb Z_+ \to \mathbb C, such that f(a) f(b) = f(ab) for any a, b.

For a totally multiplicative functions, its Dirichlet series is even more restricted, as the function is fully determined by its values in primes:

n=p_1^{d_1} \dots p_k^{d_k} \implies f(n) = f(p_1)^{d_1} \dots f(p_k)^{d_k},

and its p-components must look like

F_p(s) = \sum\limits_k \frac{f(p)^k}{p^{-ks}} = \frac{1}{1-f(p)p^{-s}},

thus the Dirichlet series of a totally multiplicative function can be represented as

\boxed{F(s) = \prod\limits_{p\text{ prime}} \frac{1}{1-f(p)p^{-s}}}

Noteworthy, expanding the inverse of such function gives us

\boxed{F^{-1}(s) = \sum\limits_k (-1)^k \prod\limits_{p_1,\dots,p_k} \frac{f(p_1)\dots f(p_k)}{p_1^s \dots p_k^s} = \sum\limits_n \frac{\mu(n) f(n)}{n^{s}}}

In other words, the inverse g(n) of a totally multiplicative function is g(n) = \mu(n) f(n). In particular, for power function f(n)=n^a, which is totally multiplicative, it means that the Dirichlet series of its inverse is

\boxed{\zeta^{-1}(s-a) = \sum\limits_{n} \frac{\mu(n) n^a}{n^{-s}}}

Another example of a totally multiplicative function, which often occurs in a competitive programming context, is the Legendre symbol:

\left(\frac{a}{p}\right) = \begin{cases} 1 & \text{if } a \text{ is a quadratic residue modulo } p \text{ and } a \not\equiv 0\pmod p, \\ -1 & \text{if } a \text{ is a quadratic nonresidue modulo } p, \\ 0 & \text{if } a \equiv 0 \pmod p, \end{cases}

which can as well be defined explicitly as

\left(\frac{a}{p}\right) \equiv a^{\frac{p-1}{2}} \pmod p \quad \text{ and } \quad\left(\frac{a}{p}\right) \in \{-1,0,1\}.

It is totally multiplicative in its top argument, as

\left(\frac{a}{p}\right) \left(\frac{b}{p}\right) = \left(\frac{ab}{p}\right).

Dirichlet power

Consider the following problem from this blog by jqdai0815:

102471C - Dirichlet $k$-th root. Given g(1), \dots, g(n) and k, find f(1), \dots, f(n) such that \overbrace{(f*\dots*f)}^{k}(n) = g(n).

Solution. When talking about quickly raising functions to arbitrary powers, we generally have two options to proceed.

First option is to represent the power as F^k(s) = \exp [k \log F(s)]. Another option is to take the derivative and represent G = F^k as

G' F = k F^k F' = k G F'.

The later is then used to represent the coefficients of Q(s) as a reasonably small recurrence. For Dirichlet series,

\frac{\partial}{\partial s} \frac{1}{n^s} = \frac{\partial}{\partial s} e^{-s \ln n} = -\frac{\ln n}{n^s},

therefore the derivative of the full Dirichlet series rewrites as

F'(s) = -\sum\limits_{n} \frac{f(n) \ln n}{n^s}.

Expanding both sides of G' F = k G F' into Dirichlet product, we get

\boxed{\sum\limits_{ab=n} f(b) g(a) \ln a = k \sum\limits_{ab=n} g(a) f(b) \ln b}

Since individual terms of Dirichlet convolution involve much less summands than regular convolution (and the number of summands amortizes to O(n \log n) over all indices), we could use the identity above to find f(n) from the values of g(n) and smaller values of f(n). Except for, we need to print exact answer modulo prime number... What do we do about it?

Instead of \ln n, we need to use some function that behaves reasonably similar to \ln n modulo prime numbers. But what kind of function could it be? What properties of \ln n do we actually rely on in this recurrence? Well, first thing to notice is that we don't really need e to be the base of the logarithm, as we may multiply both sides by a constant to substitute it with some other base.

But that still won't do it. Instead, we should substitute the derivative with some other operation L(\cdot), such that

L(F(s)^k) = k F(s)^{k-1} L(F(s)),

for any positive integer number k, and L(F(s)) is something with a reasonably simple form. By induction, any function satisfying Leibniz rule

L(ab) = aL(b) + L(a)b

will do. This is, in fact, how derivatives are defined in abstract algebra (see also closely related arithmetic derivative).

Since standard derivative multiplied f(n) with -\ln n, it's reasonable to assume that the function we're looking for also applies to Dirichlet series component-wise, multiplying each f(n) with another function h(n). Then, the product rule rewrites as

h(n) \sum\limits_{ab=n} f(a) g(b) = \sum\limits_{ab=n}[h(a) + h(b)] f(a) g(b).

From this it follows that h(n) could be any function such that h(ab) = h(a) + h(b) for any positive integer a, b. This is it, this is the most important property of \ln n in our context that we were looking for! Such functions are known as totally additive functions, and one natural example of it could be the prime omega function \Omega(n), which is the number of prime divisors of n, counted with multiplicities.

\exp, \log and prime-counting

Library Judge — Counting Primes. Count the number of primes no more than n \leq 10^{11}.

This approach was originally shared by ecnerwala in this comment. Consider the logarithm of \zeta(s):

\boxed{\log \zeta(s) = \sum\limits_{p\text{ prime}}\log\frac{1}{1-p^{-s}} = \sum\limits_{p\text{ prime}} \sum\limits_{k=1}^\infty \frac{p^{-ks}}{k} = \sum\limits_{k=1}^\infty \frac{1}{k} \sum\limits_{p\text{ prime}} \frac{1}{(p^k)^s}}

In other words,

[n^{-s}] \log \zeta(s) = \begin{cases} \frac{1}{k}, & n = p^k, \\ 0, & \text{otherwise}. \end{cases}

That being said, if we compute prefix sums of a function with Dirichlet series \log \zeta(s) up to n, we can then subtract corresponding counts over p^2, p^3, \dots to get the prime-counting function \pi(n). But how do we find the prefix sums of the logarithm of \zeta(s)?

We can use the series definition of the logarithm as

\log(1+x) = -\sum\limits_{k=1}^\infty \frac{(-1)^k x^k}{k},

and essentially use \zeta(s)-1 instead of x. Note that when f(1)=0, the values of F(s)^k have a quickly vanishing prefix, as

[n^{-s}] F^k(s) = \sum\limits_{\substack{a_1\dots a_k = n \\ a_1, \dots, a_k \geq 2}} f(a_1) \dots f(a_n).

Therefore, the prefix of (\zeta(s)-1)^k consists entirely of zeros for n < 2^k. Thus, we're only interested in F^k for k \leq \log_2 n. To compute these powers, we can use the approach shared by ecnerwala in the comment to my previous blog about Dirichlet convolutions.

We can split the space of xyz \leq n into O(n^{2/3}) rectanges [x_0, x_1] \times [y_0, y_1] \times [z_0, z_1] and use such splitting to compute prefix sums over Dirichlet series H(s) = F(s) G(s) or F(s) = H(s) / G(s), assuming that the prefix sums of the series on the right-hand side are known. Doing it as is would take O(n^{2/3} \log n), but we can speed up the whole computation. Let t \approx n^{1/6}, then

\zeta_t(s) = \prod\limits_{\substack{p\text{ prime} \\ p > t}} \frac{1}{1-p^{-s}}

has an empty prefix of length t, and we will only need first \small \frac{\log n}{\log t} values of (\zeta_t(s)-1)^k instead of \log n to compute the prefix sums of \log \zeta_t(s) up to n. To find \zeta_t(s) itself, we should use the fact that multiplying with 1-p^{-s} can be done in O(\sqrt n), as after such multiplication F(\lfloor n/k \rfloor) simply gets reduced by F(\lfloor n / (kp) \rfloor).

After this, we can add the contributions of first t terms back into logarithm to recover \log \zeta(s) from \log \zeta_t(s).

Note 1: As ecnerwala mentioned, this approach could more generally be used to sum up any totally multiplicative function over all primes up to n. Indeed, the logarithm of the Dirichlet series of a totally multiplicative function is

\boxed{\log F(s) = \log \prod\limits_{p\text{ prime}} \frac{1}{1-p(s)p^{-s}} = \sum\limits_{k=1}^\infty \frac{1}{k}\sum\limits_{p\text{ prime}}\frac{f(p)^k}{(p^k)^s}}

Thus, as we see,

[n^{-s}] \log F(s) = \begin{cases} \frac{f(p)^k}{k}, & n=p^k, \\ 0, & \text{otherwise}. \end{cases}

Note 2: The approach above can also be used to compute prefix sums of logarithms and exponents of any Dirichlet series. Generally, we can represent any Dirichlet series as an infinite product over not necessarily prime numbers as

\boxed{\sum\limits_{n=1}^\infty \frac{g(n)}{n^s} = \prod\limits_{n=1}^\infty \frac{1}{1-f(n)n^{-s}}}

In his comment, ecnerwala calls g(n) the pseudo-Euler transform of f(n), in reference to Euler transform of integer sequences. Similarly to what we did with primes, we can find f(1), \dots, f(t) for t \approx n^{1/6} and divide G(s) by corresponding fractions to get

G_t(s) = \prod\limits_{n>t} \frac{1}{1-f(n) n^{-s}},

so that we can find prefix sums of G_t(s) in O(n^{2/3}), and then amend them with logarithms of fractions to get \log G(s). To find G_t(s) itself we, again, shout notice that, again, multiplying G(s) with 1-f(t) t^{-s} can be done in O(\sqrt n), as after such multiplication, the value F(\lfloor n/k \rfloor) gets reduced by f(t) F(\lfloor n / (kt) \rfloor).

See also

Tags dirichlet convolution, tutorial

History

 
 
 
 
Revisions
 
 
  Rev. Lang. By When Δ Comment
en35 English adamant 2023-07-02 16:31:41 22
en34 English adamant 2023-07-02 16:30:09 33
en33 English adamant 2023-07-02 15:54:07 1
en32 English adamant 2023-07-02 15:52:34 50
en31 English adamant 2023-07-02 15:51:24 597
en30 English adamant 2023-07-02 03:42:39 1
en29 English adamant 2023-07-01 12:43:34 1
en28 English adamant 2023-07-01 01:34:34 1
en27 English adamant 2023-07-01 01:30:14 511
en26 English adamant 2023-06-30 23:59:50 3
en25 English adamant 2023-06-30 23:57:43 329
en24 English adamant 2023-06-30 23:32:38 31
en23 English adamant 2023-06-30 23:26:19 6
en22 English adamant 2023-06-30 23:25:26 120
en21 English adamant 2023-06-30 23:14:52 20
en20 English adamant 2023-06-30 23:12:40 6874
en19 English adamant 2023-06-30 23:09:48 27
en18 English adamant 2023-06-30 23:09:21 72
en17 English adamant 2023-06-30 23:08:37 265
en16 English adamant 2023-06-30 23:02:41 140
en15 English adamant 2023-06-30 23:00:59 4
en14 English adamant 2023-06-30 22:59:52 19
en13 English adamant 2023-06-30 22:58:36 31
en12 English adamant 2023-06-30 22:39:14 40
en11 English adamant 2023-06-30 22:31:12 20
en10 English adamant 2023-06-30 19:34:51 481
en9 English adamant 2023-06-30 19:24:49 891
en8 English adamant 2023-06-30 19:09:46 23
en7 English adamant 2023-06-30 18:25:20 452
en6 English adamant 2023-06-30 17:57:44 605
en5 English adamant 2023-06-30 17:50:40 222
en4 English adamant 2023-06-30 15:41:37 82
en3 English adamant 2023-06-30 13:31:18 182
en2 English adamant 2023-06-30 13:27:32 10
en1 English adamant 2023-06-30 13:26:43 21330 Initial revision (published)