[Gym] XXI Open Cup. Grand Prix of Suwon

→ Pay attention

Before contest
Codeforces Round 995 (Div. 3)
06:12:19
Register now »

→ Streams

Codeforces Round 995 Solution Discussion

By aryanc403

Before stream 08:32:18

View all →

→ Top rated

#	User	Rating
1	tourist	3985
2	jiangly	3814
3	jqdai0815	3682
4	Benq	3529
5	orzdevinwang	3526
6	ksun48	3517
7	Radewoosh	3410
8	hos.lyric	3399
9	ecnerwala	3392
9	Um_nik	3392

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	cry	169
2	maomao90	162
2	Um_nik	162
4	atcoder_official	161
5	djm03178	158
6	-is-this-fft-	157
7	adamant	155
8	awoo	154
8	Dominater069	154
10	luogu_official	150

View all →

→ Find user

→ Recent actions

Detailed →

ko_osaga's blog

[Gym] XXI Open Cup. Grand Prix of Suwon

By ko_osaga, history, 4 years ago, In English

Hello! I'm happy to announce XXI Open Cup. Grand Prix of Suwon. Gyeonggi Science High School is located in Suwon, Korea.

This contest was used as a Day 9 Contest from Petrozavodsk Winter 2021 Camp.

Enter contest
Contest time: 2021/02/21 Sunday, 17:00 Korea Standard Time (UTC+9). For external accounts, the contest is ready now.
Problemsetters: mo_onrabbit2, Worteltje, GyojunYoun, ko_osaga, queued_q, rdd6584
Codeforces Gym

List of relevant previous contests:

Enjoy!

Announcement of 2020-2021 Winter Petrozavodsk Camp, Day 9 Contest (XXI Open Cup, Grand Prix of Suwon)

+155

ko_osaga
4 years ago
48

Comments (47)

Show archived | Write comment?

ko_osaga

4 years ago, # |

+27

Thank you for the participation. Credits are:

The setter is anonymous if not listed.

There are no editorials, sorry! Also, please expect about 1-2 weeks on the Gym migration.

→ Reply

AmDer

4 years ago, # |

+17

How to solve J problem?

→ Reply

Coder

4 years ago, # ^ |

+14

Note that [3..7] should always be next to each other in sorted array. Try every 2 in increasing order and maintain the set of valid positions of [3..7]. Each time pick the highest valid position and then binary search for 1.

→ Reply

xuanquang1999

4 years ago, # ^ |

← Rev. 3 →

My solution use a similar idea, but with an additional observation (which make the solution easier to implement):

Suppose that we are trying to use $$$a_i$$$ as $$$p_2$$$:

If $$$a_i + a_{i+1} < a_{i+2} + a_{i+3} + a_{i+4} +a_{i+5}$$$, it's optimal to use $$$a_i, \ldots, a_{i+4}$$$ as $$$p_3, \ldots, p_7$$$ and find best $$$p_1$$$ by binary search
Otherwise, we can brute-force over all possible positions for $$$p_3, \ldots, p_7$$$ (and find the best $$$p_1$$$ by binary search)

Notice that the number of position $$$i$$$ that fall into the second case is $$$O(\log A)$$$, so this solution is $$$O(N \log N \log A)$$$, which is sufficiently fast.

→ Reply

mango_lassi

4 years ago, # ^ |

My solution: sort the values. We always pick consecutive $$$P_7, P_6, P_5, P_4, P_3$$$, so first decide $$$i$$$ and let $$$P_7 = A[i], P_6 = A[i+1], \dots, P_3 = A[i+4]$$$.

Then, we want to pick the maximum $$$j$$$ s.t. $$$A[j] < P_7 + P_6 + P_5 + P_4 - P_3$$$, and such that some $$$P_1$$$ can still be selected, which holds IFF $$$A[j+1] - A[j] < P_3$$$.

The first condition gives an upper bound on $$$j$$$, and we can find the last position satisfying the second condition before it with a range minimum segment tree.

→ Reply

ko_osaga

4 years ago, # ^ |

If you observe that 3,4,5,6,7 should be consecutive, then it's easy to come up with $$$O(n^3)$$$ solution. Then you can simply take top-500 participants and run the $$$O(n^3)$$$ solution. The testers failed to prove or disprove this — I believe it's correct.

→ Reply

dimdum

4 years ago, # |

+59

Someone please explain F :(

→ Reply

Ant_Man

4 years ago, # ^ |

+16

→ Reply

stefanbalaz2

4 years ago, # ^ |

Brief explanation: consider cycles as a xor vector space, sufficient cycles would be the ones u can get with backedges in DFS tree, because u can take a cycle independently of anything else, by traversing to a cycle, going around the cycle, and traversing back the same path, define D(u,v) as max xor path from u to v, then D(u,v) consists of any path from u to v and some set of cycles. We can take the path from u to v as the one from the DFS tree, define that path as P(u,v). Define R(u) as xor on the path from root to u, then P(u,v)=R(u)^R(v). Now we have to maximize our P(u,v) by xoring it with some cycles. Construct a base for ur xor space, reduce it to reduced row echelon form, then we can greedily starting from the highest bit, if our answer doesnt have that bit, and our base has a vector with that MSB, we can xor our answer with that vector. We can see that only bits we actually care about are the MSB of the vectors from the base. We can see that xoring our answer with a vector from the base doesnt change any other bit that we care about(becaused our base is in reduced row echelon form). So our answer only depends on the base and R(u)^R(v). Going back to the original queries, we only have to count for every bit that is a MSB of a vector from our base, parity of number of pairs of R(u)^R(v) which have that bit set, l<=u,v<=r, if the parity is odd, we add that vector from the base to our query answer, and we have to add to our answer R(u) l<=u<=r if (r-l+1) is even.

→ Reply

mango_lassi

4 years ago, # |

+24

Is there a nice clean solution to A? My solution was a 300-line monstrosity, which I couldn't debug in time :P

→ Reply

mo_onrabbit2

4 years ago, # ^ |

+41

Let's set vertex 1 as the root. Each edge's contribution to the answer is either (sum of $$$A_i$$$ in it's subtree) or (sum of all $$$A_i$$$)-(sum of $$$A_i$$$ in it's subtree): later case only happens when the edge is in the path from query vertex to the root. We can maintain the sum of $$$A_i$$$ in each edge's subtree with some path updates and subtree updates, so dfs ordering+HLD is enough.

→ Reply

ko_osaga

4 years ago, # ^ |

+13

It seems that A is a standard task with top trees. Top tree is by no means "nice clean" solution, but if you have a good book code, then things will be different.

Nobody involved with the preparation was aware of the fact. It sounds like a high time to learn new technology :) The complexity is $$$O(Q \log N)$$$ in that case.

→ Reply

dragoon

4 years ago, # ^ |

If you have top tree implementation, would you mind to share it?

→ Reply

ecnerwala

4 years ago, # ^ |

← Rev. 2 →

Here's mine. The interface isn't fully flushed out, but it works.

https://github.com/ecnerwala/cp-book/blob/master/src/top_tree.hpp

Since some people asked, here's a full solution to A using it: https://ideone.com/B03DBB

→ Reply

geniucos

4 years ago, # |

+37

How do you do L and K? For L I thought there's a particularization of some k-th shortest path method (one can easily consruct an almost line graph) as for K, I could find this which is precisely the same problem and they do provide a linear time solution, but I guess the intended solution should've been easier.

→ Reply

ko_osaga

4 years ago, # ^ |

← Rev. 4 →

Model solution of L simply implements this paper. I failed to find any ad-hoc solutions exploiting the structure of graph, and hoped it doesn't exist cause that was the point. But there are ad-hoc solutions as well and I think they are interesting.

You can see some good implementation of Epp98 here. And it is surely a fun read. (I think sufficiently strong competitors can simply come up this from scratch.) I can explain my implementation in detail.

→ Reply

Petr

4 years ago, # ^ |

Do you have any pointers about those ad-hoc solutions?

I've came up with the following one: let's build a tree of subproblems. Each subproblem is "cover a segment [L, R] such that its bounds must be included". To reduce a subproblem to smaller ones, we iterate over how the middle is covered (there are six cases: X|X, XO|X, X|OX, XOO|X,XO|OX,X|OOX), and go to two smaller segments. This way we get O(n*logn) segments, but with a pretty big constant.

Then for each segment we maintain two things: some number of shortest solutions for this segment that we have "graduated", and a priority queue of pending ones. We maintain the property that for every subsegment solution that was used to produce an output, we have graduated at least one solution after it (if there is any), and for each vertex the priority queue contains not-yet-graduated combinations of the two children that cover at least (the last child solution used to produce output + 1) graduated solutions for each child. This ensures that the next solution to graduate from the top-most segment is the next in sorted order.

Every time we output a solution, we must trace it back through the tree, and sometimes graduate one more solution from the priority queue to the list, and sometimes combine more solutions from the two children and put them to the priority queue.

I don't have a proof of the complexity, but it feels like this could be O((n+k)logn) memory and O((n+k)*(logn)**2) time. However, even on 250K ones it takes 20 seconds and consumes 3G of memory :(

→ Reply

riadwaw

4 years ago, # ^ |

← Rev. 3 →

That seems close to what we did in upsolving. Try speeding up like this:

when dividing segment (l, r) don't try to divide it near m=(l+r)/2 but near m = (r&~(1<<(31-builtin_ctz(l ^ r)) - 1)) - 1 i.e near the place where the largest bit changes. This way segments get reused much more. As far as I remember that was last optimization we did and it changed 4KK nodes to 1KK nodes

→ Reply

Petr

4 years ago, # ^ |

+11

Thanks a lot, that's an awesome optimization! It reduced the number of nodes from 4M to 1M for us as well.

Unfortunately, we're still (barely) over ML, but now it looks possible to get there with some additional small optimizations.

→ Reply

ko_osaga

4 years ago, # ^ |

ksun48 told me a solution along that line of thinking.

→ Reply

ko_osaga

4 years ago, # ^ |

For K, there is a 20-liner solution submitted by Ptz teams, but I doubt the author was aware of it.

→ Reply

Qingyu

4 years ago, # |

+27

How to solve D and H?

→ Reply

ko_osaga

4 years ago, # ^ |

D: If we can not cross over two sides, then the problem is easy. You can consider clauses for each wire, and run 2-SAT (which is actually bipartite coloring).

Now the key idea is to consider clauses for each endpoint of the wire, not the wire. If some wire is fully contained in another wire (two endpoints of the interval is contained in another interval), then the two endpoints should lie on the same side. If some wire crosses another wire (exactly one endpoint of the interval is contained in another interval) then we can find two pair of endpoints that should not lie in same side.

You can see that these two cases are sufficient, and we can run the identical 2-SAT with that problem. Time complexity is $$$O(N^2)$$$ and you can optimize this to $$$O(N \log N)$$$ with some typical DS tricks.

→ Reply

Swistakk

4 years ago, # ^ |

+10

If you have a planarity test that outputs an embedding then that's a no-brainer in $$$O(n)$$$

→ Reply

ko_osaga

4 years ago, # ^ |

← Rev. 3 →

The planarity test is why we asked for a certificate, but hmm, maybe.

→ Reply

Swistakk

4 years ago, # ^ |

← Rev. 2 →

We got AC in upsolving using planarity test from our library and there were no problems with that, it's really easy. Too bad somehow I didn't realize that during the contest :<. Planarity test tells you whether each edge outgoing from each $$$i$$$ enters it from above or from below. If we have an edge from $$$i$$$ to $$$j$$$ such that it enters both of these vertices from above and $$$i<j$$$ then you go $$$j-i$$$ steps up from $$$i$$$, then $$$j-i$$$ steps right, then $$$j-i$$$ steps down. If it enters $$$i$$$ from above and $$$j$$$ from below then from $$$i$$$ you go $$$n+i$$$ steps up,

Unable to parse markup [type=CF_MATHJAX]

steps left, $$$2n+i+i$$$ steps down, $$$2j+i$$$ steps right, $$$n+j$$$ steps up.

→ Reply

ko_osaga

4 years ago, # ^ |

Yeah, at first sight that information looked insufficient, but I now agree it is easy. The actual contest was in Dec 2020 so I forgot some properties on the problem :(

Btw, mind sharing your planarity test codes? :)

→ Reply

Swistakk

4 years ago, # ^ |

← Rev. 2 →

Btw, mind sharing your planarity test codes? :)

Tagging mnbvmar

→ Reply

rsFalse

4 years ago, # |

+24

How to solve I -- Integer Array Shuffle?

→ Reply

Coder

4 years ago, # ^ |

Hint: figure out the condition for answer to be 1, then 2 and so on.

→ Reply

laozhu1234

4 years ago, # ^ |

The locality of a sequence is always either monotonically increasing or monotonically decreasing. Our target is to make the whole sequence ascending. So the question turns into decreasing the complexity of the so called 'up-down vector'(UDV) of a sequence. UDV decribes the rise and fall of the sequence. For example:sequence:1 4 3 2 5,and its UDV is 1 0 1 if we use 1 for rise and use 0 for fall. In this case, the rise and fall of the sequence is :1->4 rise!; 4->3->2 fall!; 2->5 rise!

A vital lemma:During the shuffle as the problem decribed,we can eliminate the front and the back elements of the A's UDV (if the length of it is even) and push_back the front element into the sequence B. Since the front and the back elements of the A's UDV is surely different and we can artificially arrange the first rise and the last fall to make it a rise(amuse the front element of A's UDV is 1 here, see the following example!)

Example:The sequence is 6 5 4 5 4 7 8 whose UDV is 0101 and during the shuffle I can successively push 8 7 6 5 4 4 5 in the new sequence which change the new UDV to 01. (In fact, for 0101, I eliminate the first 1 and the last 0 to turn them into a 0,and then for the rest 10 I turn them into a 1.Finally, the new UDV become 01)

I am afraid this is too long to explain the solution. So at last, I only state a conclusion: after a shuffle ,the UDV of the sequence reduce to half and round up. Please write down some sequences, write their UDV, try to eliminate the element to half the UDV and find the law!

Several details are not memtioned above but I think if you master the UDV, it is fun to find out the law in your own! Good luck!

I am a Chinese and I have wrote a blog in Chinese about a month ago. I am afraid and sorry that I can't post the blog here. Any questions, please contact me!

→ Reply

sigma425

4 years ago, # |

+10

I'd like to know the solutions for H,K (or at least pointers to some papers?) :D

→ Reply

xiaowuc1

4 years ago, # ^ |

+10

Relevant paper for K.

→ Reply

ko_osaga

4 years ago, # ^ |

+10

Relevant paper for H.

Brief explanation: The optimal solution can be partitioned into a chain of circles where the first / last circle have a maximum possible radius and the others simply follow it. This gives an $$$n^2$$$ solution: You enumerate the one with maximum possible radius, and move left/right to find the possible radius for other circle, giving $$$O(n)$$$ candidates of radius for each.

Let $$$D_i = X_i - X_{i - 1}$$$. You can find the maximal increasing / decreasing subarray of "gaps", like $$$D_i < D_{i + 1} < D_{i + 2} ... $$$, In this case, it is not optimal for $$$i, i + 1$$$ point to have maximum radius. This reduces the number of candidates.

Also, you can't move beyond the subarray of gaps (it will make the radius at most 0), so you can just try $$$O(1)$$$ candidate of radius or each circles.

→ Reply

zhangguangxuan99

4 years ago, # |

How to solve G?

→ Reply

Andreasyan

4 years ago, # ^ |

Let $$$dp[l][r]$$$ be answer in segment $$$[l,r]$$$. For calculation we need to choose some $$$i$$$ from that segment to be maximum. For $$$i$$$ it is optimal $$$-C[i][j] + V[i][j] * q[l][r][i] + dp[l][i - 1] + dp[i + 1][r]$$$ to be maximal, where $$$q[l][r][i]$$$ is number of queries from the segment $$$[l,r]$$$ containing $$$i$$$. We can pre-calculate $$$q$$$ and optimal $$$-C[i][j] + V[i][j] * q[l][r][i]$$$ for all possible $$$l,r,i$$$ with convex hull.

→ Reply

Swistakk

4 years ago, # ^ |

+12

The most important trick in this solution is that we do not need to take care of the fact that value at point $$$i$$$ is actually bigger than all other values at intervals $$$[l, i-1]$$$ and $$$[i+1, r]$$$

→ Reply

I_love_Tanya_Romanova

4 years ago, # |

What was the model solution for C?

→ Reply

ko_osaga

4 years ago, # ^ |

Binsearch the answer and do line sweeping with lazy segtrees and std::set.

TL is 4-5x model solution, but suboptimal implementation is easier and tempting. We did not want to block it, but there were other issues when we gave more than 5s.

→ Reply

derrick20

4 years ago, # ^ |

Could you elaborate on how that solution would work?

→ Reply

ko_osaga

4 years ago, # ^ |

+13

Let $$$R$$$ be the size of the answer.

Let's move $$$[x_c, x_c + R]$$$ in the $$$x$$$ coordinate section of the length $$$R$$$ from left to right. In the process of increasing $$$x_c$$$, $$$N$$$ points are inserted into the section at the time $$$x_c = x_i-R$$$, and are removed from the section at the time $$$x_c = x_i + 1$$$. To solve the problem, as points are added and removed, you need to quickly determine if there is a section of length $$$R$$$ containing all $$$K$$$ colored points.

Let's define $$$distinct[y]$$$ as the number of different colors that can be obtained when covering the $$$[y-R, y]$$$ section. If $$$x$$$ with $$$distinct[y] = K$$$ exists, it can be determined that the answer exists. Let's see how $$$distinct[y]$$$ changes when a point is added and removed.

For each color, let's manage the $$$y$$$ coordinates of the points of each color in a data structure like std::multiset (you have $$$K$$$ sets). When adding a point whose $$$y$$$ coordinate is $$$y_j$$$, basically, it will increase $$$distinct[i]$$$ by 1 in $$$[y_j, y_j + R]$$$. However, since the point immediately before $$$y_j$$$ covers the union, there are sections that shall not be added, and sections that shall not be added because the point immediately preceding it is blocked. This section can be obtained by case processing by knowing the coordinates of the points adjacent to each side of $$$y_j$$$. Therefore, if you use the lower_bound function, you can calculate the newly contributed section in $$$O(\log N)$$$. Also in deletion, the interval at which $$$distinct[i]$$$ decreases by 1 can be calculated in a very similar way.

Now the problem becomes an interval query problem, which adds a certain number to the interval and checks whether there is an element with a value of $$$K$$$ in the entire array. Since the value of a specific element in the array is always less than $$$K$$$, it can be solved by using lazy propagation in the segment tree. Since the decision problem is solved in $$$O(N \log X)$$$, the entire problem can be solved in $$$O(N \log^2 X)$$$ to get a perfect score.

→ Reply

hackerman44

4 years ago, # |

← Rev. 2 →

-35

when will u stream next time?
& can u share your vimrc as well?

→ Reply

ko_osaga

4 years ago, # ^ |

https://github.com/koosaga/olympiad/blob/master/Library/vimrc.vim

→ Reply

ko_osaga

4 years ago, # |

+34

The contest is now in Gym. Enjoy!

→ Reply

Man

4 years ago, # |

How to solve problem E?

→ Reply

triple__a

4 years ago, # ^ |

← Rev. 3 →

+18

Let's root this random tree at vertex $$$1$$$.

Note that $$$dist(u,v)=dist(1,u)+dist(1,v)-2dist(1,lca(u,v))$$$, we could precalculate the expected distance between $$$u$$$ and root by standard dp(denoted by $$$f_u$$$). Therefore, the remaining stuff is the expected distance between the root and lca of $$$u$$$ and $$$v$$$.

Suppose we have generated a tree according to the procedure described in the statement, we could find the lca of $$$u$$$ and $$$v$$$ ($$$u<v$$$) in the following approach:

If the parent of $$$v$$$ (denoted by $$$p_v$$$) is exactly $$$u$$$, we are done. Otherwise, $$$lca(u,v)=lca(u,p_v)$$$.

Note that once given the information that $$$i=lca(u,v)$$$, the expected distance of $$$i$$$ and root is exactly $$$f_i$$$ as chosen parents for vertex $$$j \le i$$$ and vertex $$$j>i$$$ are independent. Therefore, we can rephrase procedure generating distribution of distance between $$$lca(u,v)$$$ and root in a handy way:

Pick an integer $$$x \in [1,v-1]$$$ with probability propotional to $$$a_x$$$;
If $$$x > u$$$, replace $$$v=x$$$ and goto step 1;
If $$$x = u$$$, return $$$f_u$$$ (independence needed here);
Otherwise, set $$$v=u, u=x$$$ and goto step 1;

The last thing is to observe that we could indeed replace all $$$a_x$$$ with $$$0$$$ for all $$$u < x \le v$$$ as $$$x>u$$$ case is just reroll a random number (which is to say that the expected distance is independent with $$$v$$$ and we may denote $$$g_u$$$ the expected distance).

Therefore, we have $$$g_u=\sum_{i=1}^{u-1}\frac{a_ig_i}{S_u}+\frac{a_u}{S_u}f_u$$$, where $$$S_u=\sum_{i=1}^u a_i$$$ (which is similar to previous one).

→ Reply

walk_alone

14 months ago, # |

Here's my own tutorial about problem B, C, E, F, G, I, J, K written in Chinese. https://blog.csdn.net/m0_52048145/article/details/133896252 And I've found a Chinese tutorial about problem L which I've given the link in the problem L's part.

I hope this helps.

P.S. On October 17th, there are still tutorials on problem C and E unfinished. I'll finish them as soon as possible.

→ Reply