[Tutorial] GCD Convolution

#	User	Rating
1	tourist	3857
2	jiangly	3747
3	orzdevinwang	3706
4	jqdai0815	3682
5	ksun48	3591
6	gamegame	3477
7	Benq	3468
8	Radewoosh	3463
9	ecnerwala	3451
10	heuristica	3431

#	User	Contrib.
1	cry	167
2	-is-this-fft-	162
3	Qingyu	160
4	Dominater069	159
5	atcoder_official	157
6	adamant	153
6	Um_nik	153
8	djm03178	151
8	luogu_official	151
10	awoo	147

On today's POI training camp round I have learnt a nice technique that could possibly be useful in some number theory problems. I couldn't find any CF article on it, so I think it's fair enough to share it on my own.

Remark on used notation

In some sums I will use an Iverson notation

Problem: Squarefree Function

Let's define a Squarefree Function $b(x)$ for any positive integer $x$ as $x$ divided by a greatest perfect square, that divides $x$ .

For example: $b(1) = 1$ , $b(2) = 2$ , $b(4) = 1$ , $b(6) = 6$ , $b(27) = 3$ , $b(54) = 6$ , $b(800) = 2$

Given an array $a$ of $n \leq 10^5$ positive integers, where each $a_i \leq 10^5$ compute sum

$\begin{gather} \sum\limits_{1 \leq i < j \leq n}b(a_i\cdot a_j) \end{gather}$

Technique: GCD Convolution

You might probably heard about a Sum Convolution. For two arrays $b$ , and $c$ , it is defined as an array $d$ such that $\begin{gather} d_k = \sum\limits_{i,j}b_i\cdot c_j[i + j = k] \end{gather}$ If not, it's basically the same thing as a polynomial multiplication. If $B(x) = b_0 + b_1x + b_2x^2 + ... + b_nx^n$ , and $C(x) = c_0 + c_1x + c_2x^2 + ... + c_nx^n$ , then $(B \cdot C)(x) = d_0 + d_1x + d_2x^2 + ... + d_{2n}x^{2n}$

Let's define GCD Convolution by analogy

Definition

A GCD Convolution of two arrays $b$ , and $c$ , consisting of positive integers, is an array $d$ such that $\begin{gather} d_k = \sum\limits_{i, j}b_i\cdot c_j[gcd(i,j) = k] \end{gather}$

Algorithm to find GCD Convolution

Of course, we can compute it naively by iterating over all pairs of indicies. If $b$ and $c$ consists of $n$ elements then the overall complexity would be $O(n^2log(max(b) + max(c)))$ . But it turns out, that we can do better.

Let's look at the sum $d_k$ values, with indicies divisible by some integer $g$ , so that is $k = gm$ for some integer m. $\begin{gather} \sum\limits_{m=1}^{n/g}d_{gm} = \sum\limits_{m=1}^{n/g}\sum\limits_{i,j}b_i\cdot c_j[gcd(i,j) = gm] = \sum\limits_{i,j}b_i\cdot c_j\left[g \;| \;gcd(i,j)\right] \end{gather}$

From the definition of gcd, we know that $g | gcd(i,j) \Leftrightarrow g | i \wedge g | j$ $\begin{gather} \sum\limits_{i,j}b_i\cdot c_j[g \;|\; gcd(i,j)] = \sum\limits_{i,j}b_i\cdot c_j[g \;|\; i][g \;|\; j] = \end{gather}$ $\begin{gather} =\sum\limits_{i,j}\left(b_i[g \;|\; i]\right)\cdot \left(c_j[g \;|\; j]\right) = \left(\sum\limits_{g|i}b_i\right)\left(\sum\limits_{g|j}c_j\right) \end{gather}$

We can define $B_g = \sum_{m=1}^{n/g}b_{gm}$ , and $C_g = \sum_{m=1}^{n/g}c_{gm}$ , and $D_g = \sum_{m=1}^{n/g}d_{gm}$ . From above equation one could easily derive $D_g = B_g\cdot C_g$ . Knowing that $O(n + \frac{n}{2} + \frac{n}{3} + ...) = O(n\log n)$ , arrays $B$ and $C$ can be computed directly from their definitions in $O(n\log n)$ .

Recovering a $d_k$ values from D array is simple. All we need is just subtract all the summands of $D_i$ except from the smallest. So, formally, we have $\begin{gather} d_k = D_k - \sum\limits_{m=2}^{n/k}d_{km} \end{gather}$ Which can be computed using dynamic programming, starting from $k = n$ .

So, the overall complexity of computing a GCD Convolution of two arrays of size $n$ is $O(n\log n)$ .

Back to original problem

We can see, that $\begin{gather} b(a_i\cdot a_j) = \frac{b(a_i)\cdot b(a_j)}{gcd(b(a_i), b(a_j))^2} \end{gather}$

So, having an array $w_{b(a_i)} = \sum\limits_ib(a_i)$ all we need is just to compute a GCD Convolution of $w$ with itself. Let's denote that convolution by $d$ . Then, from definition $\begin{gather} \sum\limits_{i,j :\;gcd(b(a_i), b(a_j)) = k} \frac{b(a_i)\cdot b(a_j)}{gcd(b(a_i), b(a_j))^2} = \frac{d_k}{k^2} \end{gather}$

So answer to our problem is just a sum $\begin{gather} \sum\limits_{k=1}^{max(b(a_i))}\frac{d_k}{k^2} \end{gather}$

If we denote $A = max(a_i)$ and $B = max(b(a_i))$ , then overall complexity of our solution is $O(n\log A + B \log B)$

Practice problems

Actually, I don't have any. I will be glad if you share some problems in comments. All I have is just this:

GCD Convolution

Rev.	By	When	Δ	Comment
en16	szaranczuk	2023-02-05 16:37:54	33
en15	szaranczuk	2023-02-05 01:19:42	0	(published)
en14	szaranczuk	2023-02-05 01:17:32	108
en13	szaranczuk	2023-02-05 01:01:55	653
en12	szaranczuk	2023-02-05 00:10:36	15
en11	szaranczuk	2023-02-04 23:51:22	151
en10	szaranczuk	2023-02-04 23:37:19	54
en9	szaranczuk	2023-02-04 23:33:55	7
en8	szaranczuk	2023-02-04 23:31:29	118
en7	szaranczuk	2023-02-04 22:50:30	340
en6	szaranczuk	2023-02-04 22:44:31	682
en5	szaranczuk	2023-02-04 22:15:46	1037
en4	szaranczuk	2023-02-04 21:39:05	627
en3	szaranczuk	2023-02-04 21:18:33	233
en2	szaranczuk	2023-02-04 21:06:12	325
en1	szaranczuk	2023-02-04 20:59:01	1344	Initial revision (saved to drafts)