Codeforces Beta Round #17 Tutorial

#	User	Rating
1	Benq	3792
2	VivaciousAubergine	3647
3	Kevin114514	3611
4	jiangly	3583
5	strapple	3515
6	tourist	3470
7	Radewoosh	3415
8	Um_nik	3376
9	maroonrk	3361
10	XVIII	3345

#	User	Contrib.
1	Qingyu	162
2	adamant	148
3	Um_nik	146
4	Dominater069	143
5	errorgorn	141
6	cry	138
7	Proof_by_QED	136
8	YuukiS	135
9	chromate00	134
10	soullless	133

Contest discussion

Problem A. Noldbach problem

To solve this problem you were to find prime numbers in range [2..N]. The constraints were pretty small, so you could do that in any way - using the Sieve of Eratosthenes or simply looping over all possible divisors of a number.
Take every pair of neighboring prime numbers and check if their sum increased by 1 is a prime number too. Count the number of these pairs, compare it to K and output the result.

Problem B. Hierarchy

Note that if employee, except one, has exactly one supervisor, then our hierarchy will be tree-like for sure.
For each employee consider all applications in which he appears as a subordinate. If for more than one employee there are no such applications at all, it's obvious that - 1 is the answer. In other case, for each employee find such an application with minimal cost and add these costs to get the answer.
Alternatively, you could use Kruskal's algorithm finding a minimum spanning tree for a graph. But you should be careful so that you don't assign the second supervisor to some employee.
And yes, the employees' qualifications weren't actually needed to solve this problem :)

Problem C. Balance

Consider the input string A of length n. Let's perform some operations from the problem statement on this string; suppose we obtained some string B. Let compression of string X be a string X' obtained from X by replacing all consecutive equal letters with one such letter. For example, if S = "aabcccbbaa", then its compression S' = "abcba".
Now, consider compressions of strings A and B - A' and B'. It can be proven that if B can be obtained from some string A using some number of operations from the problem statement, then B' is a subsequence of A', and vice versa - if for any strings A and B of equal length B' is a subsequence of A', then string B can be obtained from string A using some number of operations.
Intuitively you can understand it in this manner: suppose we use some letter a from position i of A in order to put letter a at positions j..k of B (again, using problem statement operations). Then we can't use letters at positions 1..i - 1 of string A in order to influence positions k + 1..n of string B in any way; also, we can't use letters at positions i + 1..n of string A in order to influence positions 1..j - 1 of string B.

Now we have some basis for our solution, which will use dynamic programming. We'll form string B letter by letter, considering the fact that B' should still be a subsequence of A' (that is, we'll search for B' in A' while forming B). For this matter we'll keep some position i in string A' denoting where we stopped searching B' in A' at the moment, and three variables kA, kB, kC, denoting the frequences of characters a, b, c respectively in string B. Here are the transitions of our DP:
1) The next character of string B is a. Then we may go from the state (i, kA, kB, kC) to the state (next_i, 'a', kA + 1, kB, kC).
2) The next character of string B is b. Then we may go from the state (i, kA, kB, kC) to the state (next_i, 'b', kA, kB + 1, kC).
3) The next character of string B is c. Then we may go from the state (i, kA, kB, kC) to the state (next_i, 'c', kA, kB, kC + 1).
Where next_i, x is equal to minimal j such that j ≥ i and A'_j = x (that is, the nearest occurrence of character x in A', starting from position i). Clearly, if in some case next_i, x is undefined, then the corresponding transition is impossible.
Having calculated f(i, kA, kB, kC), it's easy to find the answer: $\text{[math]}$ for all triples kA, kB, kC for which the balance condition is fulfilled.

Such a solution exceeds time and memory limits. Note that if string B is going to be balanced, then kA, kB, kC ≤ (n + 2) / 3, so the number of states can be reduced up to 27 times. But it's also possible to decrease memory usage much storing matrix f by layers, where each layer is given by some value of kA (or kB or kC). It's possible since the value of kA is increased either by 0 or by 1 in each transition.

The overall complexity is O(N⁴) with a quite small constant.

Problem D. Notepad

The answer to the problem is b^n - 1 * (b - 1) mod c. The main thing you should be able to do in this problem - count the value of A^B mod C for some long numbers A and B and a short number C. Simple exponentiation by squaring exceeds time limit since to converse of B to binary you need about O(|B|²) operations, where |B| is the number of digits in decimal representation of number B.
The first thing to do is to transform A to A mod C. It isn't difficult to understand that this transformation doesn't change the answer.
Let's represent C as C = p₁^k₁p₂^k₂...p_m^k_m, where p_i are different prime numbers.
Calculate t_i = A^B mod p_i^k_i for all i in the following way. There are two possible cases:
1) A is not divisible by p_i: then A and p_i are coprime, and you can use Euler's theorem: A^B' = A^B (mod C), where B' = B mod φ(C) and φ(C) is Euler's totient function;
2) A is divisible by p_i: then if B ≥ k_i then t_i = A^B mod p_i^k_i = 0, and if B < k_i then you can calculate t_i in any way since B is very small.
Since all p_i^k_i are pairwise coprime you may use chinese remainder theorem to obtain the answer.

You can also use Euler's theorem directly. Note that if for some B₁, B₂ ≥ 29 you know that |B₁ - B₂| mod φ(C) = 0, then all t_i for them will be the same (29 is the maximal possible value of k_i). Using that you may reduce B so that it becomes relatively small and obtain the answer using exponentiation by squaring.

There is another solution used by many contestants. Represent B as B = d₀ + 10d₁ + 10²d₂ + ... + 10^md_m, where 0 ≤ d_i ≤ 9 (in fact d_i is the i-th digit of B counting from the last). Since a^x + y = a^x· a^y, you know that A^B = A^d₀· A^10d₁· ...· A^{10^md_m}. The values of u_i = A^10ⁱ can be obtained by successive exponentiation (that is, u_i = u_i - 1¹⁰); if you have A^10ⁱ, it's easy to get A^10ⁱd_i = u_i^d_i. The answer is the product above (modulo C, of course).

Problem E. Palisection

The first thing to do is to find all subpalindromes in the given string. For this you may use a beautiful algorithm described for example here. In short, this algorithm find the maximal length of a subpalindrome with its center either at position i of the string or between positions i and i + 1 for each possible placement of the center.
For example, suppose that the maximal subpalindrome length with center at position i is 5; it means that there are three subpalindromes with center at i, with lengths 1 (i..i), 3 (i - 1..i + 1) and 5 (i - 2..i + 2). In general, all starting and finishing positions of subpalindromes with a fixed center lie on some interval of positions, which is pretty easy to find.
Let's find for each position i the value of start_i, which is equal to the number of subpalindromes starting at position i. All these values can be found in linear time. Let's create an auxiliary array a_i. If after processing a position in the string some new pack of subpalindromes was found and they start at positions i..j, then increase the value of a_i by 1, and decrease the value of a_j + 1 by 1. Now $\text{[math]}$ . Proving this fact is left as an exercise to the reader :)
In a similar way it's possible to calculate finish_i, which is equal to the number of subpalindromes finishing at position i.
Since it's easy to count the total number of subpalindromes, let's find the number of non-intersecting pairs of subpalindromes and subtract this number from the total number of pairs to obtain the answer. Note that if two subpalindromes don't intersect then one of them lies strictly to the left of the other one in the string. Then, the number of non-intersecting pairs of subpalindromes is equal to: $\text{[math]}$
This can be calculated in linear time if the value of $\text{[math]}$ is recalculated by addition of finish_i before moving from position i to position i + 1.
The overall complexity of the algorithm is O(N).

Comments (19)

Show archived | Write comment?

hacker007

16 years ago, hide # |

Excellent analysis. Thank you.

→ Reply

e-maxx

But what can you say about my solution?

In fact, it looks like:

a %= c;

p = phi(c);

ans = (a ^ (b % p)) % c;

if (b >= p && ((a ^ p) % c) == 0)

ans = 0;

Maybe there is some subtle counter-example?

slycelote

16 years ago, hide # ^ |

What? Are you talking about submit 63341? It's not equivalent to what you wrote.

tourist

If your solution is as you wrote, then I believe it's wrong, at least for the case a = 2, b = 4, c = 12. Your answer is 1 while the correct answer is 2^4 mod 12 = 4.

Ah, of course, my bad memory :)

a %= c;

p = phi(c);

ans = (a ^ (b % p)) % c;

if (b >= p)

ans *= ((a ^ p) % c);

Yes, this is correct. It's almost the same as the second approach described in the analysis (using Euler's theorem directly).

BaturaDima

7 years ago, hide # ^ |

Can anyone explaine why does this solution work?

Tactic

+13

Thanks so much for this analysis! Really appreciated, and I hope you will repeat it in upcoming contests.

removed1

In D, simplest generalization of fast exponentiation works well: A^(10*x+y) = (A^10)^x * A^y

swcai

No luck on problem C. I used java and got "exceed memory limitation error" when allocating dp[151][52][52][52]. :|

ktuan

I use dp[kA+kB+kC][151][kA][kB], it sounds worse than your way. However, kA+kB+kC always increases by one in each step of DP, so you can use dp1[151][kA][kB] and dp2[151][kA][kB] instead. It's easy to fit in memory.

MaxBuzz

You need to allocate a ragged array, like this:

int[][][][] dp = new int[151][52][][];
for (int i = 0; i < dp.length; ++i) {
    for (int j = 0; j < 52; ++j) {
        dp[i][j] = new int[52 - j][];
        for (int k = 0; j + k < 52; ++j) {
            dp[i][j][k] = new int[52 - j - k];
        }
    }
}

This allocates only the elements actually needed by the solution, so fitting in memory limit.

C/C++ users may use "full" allocation - unused pages of memory will not be loaded, and ML will not be overcomed. But use it with care - there are some problems (not this one) using "lazy" dynamic programming where it is possible to load all or just too many pages by some tricky tests, even if the number of used cells is acceptable.

Enric

A naive 15-line python code gives "runtime error on case 24" on problem D, but I expected something like TLE. What could be the reason for it? I usually code in C++ and I don't know what can cause a runtime error on python.

http://pastie.org/private/qsj8vbefe3flrfyuqi6ffq

Thank you all!