Блог пользователя GlebsHP

Автор GlebsHP, история, 3 года назад, По-английски
Tutorial is loading...
Tutorial is loading...
Tutorial is loading...
Tutorial is loading...
Tutorial is loading...
Tutorial is loading...
Tutorial is loading...
Tutorial is loading...
Разбор задач Nebius Welcome Round (Div. 1 + Div. 2)
  • Проголосовать: нравится
  • +293
  • Проголосовать: не нравится

»
3 года назад, скрыть # |
Rev. 4  
Проголосовать: нравится 0 Проголосовать: не нравится

Very fast editorial! Thanks!

»
3 года назад, скрыть # |
 
Проголосовать: нравится +105 Проголосовать: не нравится

I was too scared that $$$O(2^n \cdot n^2)$$$ is too slow for E so I ended up figuring it out in $$$O(2^n \cdot n)$$$ with $$$O(2^n)$$$ memory.

»
3 года назад, скрыть # |
 
Проголосовать: нравится +49 Проголосовать: не нравится

Can't solve D :(

How to improve solving greed problems

»
3 года назад, скрыть # |
 
Проголосовать: нравится +13 Проголосовать: не нравится

Damn! Editorials are out even before the system test. Coolest round ever!(for me)

»
3 года назад, скрыть # |
Rev. 5  
Проголосовать: нравится -6 Проголосовать: не нравится

Why my approach to problem C is wrong?

We need for every $$$1 \le f \le p$$$ to check if:

$$$ \begin{aligned} x + \sum_{i = 1}^{f} i &\equiv 0 \mod n \\ x + \frac{f (f + 1)}2 &\equiv 0 \mod n \\ f ^ 2 + f &\equiv -2x \mod n \end{aligned} $$$

We end up checking the value of $$$f ^ 2 + f$$$ (mod $$$n$$$), which is equivalent to checking $$$m ^ 2 + m$$$ whereas $$$m \equiv f \mod n$$$.

From the previous reasoning, it is enough to check all possible values, $$$1 \le f \le \min(p, n)$$$.

The vague part is going from $$$f$$$ to $$$m$$$. Why isn't it correct?

  • »
    »
    3 года назад, скрыть # ^ |
    Rev. 2  
    Проголосовать: нравится +18 Проголосовать: не нравится

    You can't multiply by 2 so you have to go up to min(p, 2n) not min(p, n)

    • »
      »
      »
      3 года назад, скрыть # ^ |
      Rev. 2  
      Проголосовать: нравится +8 Проголосовать: не нравится

      To be accurate, if $$$n$$$ is odd, $$$2$$$ is invertible $$$\mod n$$$ so you can multiply by $$$2$$$ and check only up to $$$n$$$ when it is odd.

    • »
      »
      »
      3 года назад, скрыть # ^ |
       
      Проголосовать: нравится 0 Проголосовать: не нравится

      can you explain why exactly I can't multiply by 2?

      • »
        »
        »
        »
        3 года назад, скрыть # ^ |
        Rev. 4  
        Проголосовать: нравится +19 Проголосовать: не нравится

        This is because you want "$$$\iff$$$" (**if and only if**). Therefore, you need to be abble to multiply by $$$2$$$ (always works) but you also need to divide by $$$2$$$ (only works if $$$n$$$ is odd).

        1) Multiplication by $$$2$$$ (always works)

        Let $$$f \in \mathbb{Z}$$$ (an integer) and assume that $$$x + \dfrac{f(f + 1)}{2} = 0 \mod n$$$.

        This means that $$$x + \dfrac{f(f + 1)}{2} = \lambda n$$$ for some $$$\lambda \in \mathbb{Z}$$$.

        Therefore $$$2x + f(f + 1) = 2 \lambda n$$$.

        Since $$$2 \lambda \in \mathbb{Z}$$$, $$$2x + f(f + 1) = 0 \mod n$$$.

        2) Division by $$$2$$$ (only works if $$$n$$$ is odd)

        Now, let's assume that $$$2x + f(f + 1) = 0 \mod n$$$. Can we say that $$$x + \dfrac{f(f + 1)}{2} = 0 \mod n$$$?

        2.a) Assume $$$n$$$ is odd

        Then $$$\dfrac{n + 1}{2}$$$ is an integer.

        Thus, $$$\dfrac{n + 1}{2} (2x + f(f + 1)) = 0 \mod n$$$.

        By simplifying, and since $$$\dfrac{f(f + 1)}{2}$$$ is an integer, you do obtain $$$x + \dfrac{f(f + 1)}{2} = 0 \mod n$$$.

        2.b) Assume $$$n$$$ is even

        Then $$$\dfrac{n + 1}{2}$$$ is not an integer so the above trick does not work.

        What we would like to do is to deduce that $$$y = 0 \mod n$$$ from the fact that $$$2 y = 0 \mod n$$$.

        However, remember that $$$n$$$ is even so $$$n = 2m$$$ for some $$$m$$$ in $$$\mathbb{Z}$$$. Taking $$$y = m$$$, $$$2 y = 0 \mod n$$$ but $$$y \neq 0 \mod n$$$.

        Therefore, you cannot assume that $$$x + \dfrac{f(f + 1)}{2} = 0 \mod n$$$ if you only know that $$$2x + f(f + 1) = 0 \mod n$$$.

        EDIT: I missclicked on post instead of preview, I'm going to finish writing my message. xD EDIT 2: I finished my message.

  • »
    »
    3 года назад, скрыть # ^ |
    Rev. 2  
    Проголосовать: нравится 0 Проголосовать: не нравится

    I believe you have to check f up to min(p, 2n). I got the same equation as you, but I found out 2n is needed after trying out a few examples. I'd be happy to learn why 2n is needed. EDIT: See Beaboss's comment.

»
3 года назад, скрыть # |
 
Проголосовать: нравится +12 Проголосовать: не нравится

You're given 3 problems and two hours to solve them

»
3 года назад, скрыть # |
 
Проголосовать: нравится +8 Проголосовать: не нравится

I have trouble understanding the additional question at the end of the tutorial for problem c:

"Question, can you build the test that required Vesper to use $$$x$$$ more than $$$100k$$$?"

Can somebody explain the question please? (not the solution though, I wanna solve it ^^')

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

why it will be tle ? i only use upper_bound get AC 197098569

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

it is logically very obvious that

 (2n*(2n+1)/2)%n=0

So, logically it is obvious that if we try out values from 1 to 2*n and do not get answer then we would not get answer after that too. But can anyone prove it mathematically though? Still cannot convince myself about the 2*n thing.

»
3 года назад, скрыть # |
 
Проголосовать: нравится +10 Проголосовать: не нравится

Easiest div2 d I've ever seen, I wish I registered for the contest

»
3 года назад, скрыть # |
Rev. 2  
Проголосовать: нравится 0 Проголосовать: не нравится

I think in task F you have used about 150 cpu-days

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

Why is this giving the wrong answer for the maximum value in problem D?

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

:( Came home 1 hour late so couldn't join. Sad thing is I knew how to solve F because I did it in an approximation algorithms course before.

»
3 года назад, скрыть # |
 
Проголосовать: нравится +159 Проголосовать: не нравится

F actually highlights something that is missing from Polygon (or maybe some online judges in general): the ability to declare that "this is the correct output file" without having it be generated by the model solution. In contrast, CMS (e.g. at IOI) treats the output files as the true ones and doesn't care if you didn't generate them using the model solution.

This is a niche usage of course, but it would be nice to give approximation problems without hacks like "the last line is an encoded version of the true answer". Approximation algorithms aren't really the CP meta right now, but e.g. for one task in an ICPC-style contest, why not? Compute the correct answers using some slow brute force or a commercial ILP/SAT solver and the checker can just check the approximation factor.

  • »
    »
    3 года назад, скрыть # ^ |
     
    Проголосовать: нравится 0 Проголосовать: не нравится

    You can implement an approximation problem on Polygon using an interactor instead of exposing an encoded answer.

    • The test case file contains the actual input to the problem and also the real answer.
    • Interactor reads the test case file and only shows the actual input to the program. Then it compares the program's answer to the test case's answer.

    For this specific problem F it wouldn't allow for hacks because it's hard to implement a validator that checks we have the exact answer. However this method would allow hacks for many approximation problems such as minimum vertex cover.

    The problem could be "Given a graph, find a vertex cover that is at most twice the size of the minimum vertex cover." In this case we don't need a strict requirement that we have the optimal vertex cover in the test case, just some vertex cover. Then someone would be able to hack an incorrect program as long as they can find a vertex cover that is less than half the size of the program's answer.

»
3 года назад, скрыть # |
Rev. 4  
Проголосовать: нравится -6 Проголосовать: не нравится

Randomised solution for E:

DFS going deep for 2 random child every time. If you get loop — check if union of neighbours in it covers everything. If it does, then we found required loop in editorial. Repeat until you have spent almost 2 seconds. Print "No" otherwise. Not sure how to estimate probabilty of failure though.

»
3 года назад, скрыть # |
 
Проголосовать: нравится +32 Проголосовать: не нравится

Btw, fun fact about problem F.

There is a paper that proves that under SETH it is impossible to solve problem F in $$$O(m^{1-\varepsilon})$$$ time per query if we would need to answer queries online, there were also edge deletions, and we had to provide a $$$2-\delta$$$ approximation (here $$$f$$$-factor approximation means that we should provide an answer that is not smaller than the actual answer and at most $$$f$$$ times larger; I am not sure whether this theorem can be generalized to the notion of approximation that is used in problem F).

»
3 года назад, скрыть # |
 
Проголосовать: нравится +3 Проголосовать: не нравится

In problem D it should be $$$T2=max(0,m4−∑yi=0⌊l′i2⌋)$$$ not $$$T2=min(0,m4−∑yi=0⌊l′i2⌋)$$$

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

The system test of E is weak. My brute force solution (with special check for cut-edges) passed it.

»
3 года назад, скрыть # |
 
Проголосовать: нравится +41 Проголосовать: не нравится

By the way, there is another solution for F.

Instead of running a binary search, you can just start bfs with updated vertexes and existing distances, not from scratch. This is too slow, but we can not run bfs each time, but only when sum of immediate increases (differences between end and start of added edges -1) is half of the current answer.

This is correct because this sum is the upper bound of the answer decrease.

The more tricky question why it is fast.

  1. After distance is less than $$$O(\sqrt(n))$$$, it is $$$O(nsqrt(n))$$$ even if we run bfs each time
  2. While difference is big, each edge is decreasing distance by at least factor 3/4 for at least $$$O(\sqrt(n))$$$ vertices.

This looks like very rough bound, probably in practice it's better.

  • »
    »
    3 года назад, скрыть # ^ |
     
    Проголосовать: нравится 0 Проголосовать: не нравится

    Could you please provide more detail as to why 2. is true?

    • »
      »
      »
      3 года назад, скрыть # ^ |
      Rev. 2  
      Проголосовать: нравится 0 Проголосовать: не нравится

      Hm. I was quite sure about it during the contest, but not really sure now.

      But I think it's still subquadratic. Idea is following. If we have a big shortcut (order of answer), then at least for second half of the vertices on the old shortest past to further end of shortcut, distance decreased significantly. Now we can say, that we either processed a lot of queries, or at least one of them is big.

      It seams it can lead to $$$N^{7/4}$$$ or $$$N^{5/3}$$$ bound. I'm not sure.

  • »
    »
    3 года назад, скрыть # ^ |
     
    Проголосовать: нравится 0 Проголосовать: не нравится

    Hi, I am a bit confused by what you call "sum of immediate increases".

    For instance, let's say I start the bfs from vertex $$$A$$$ and et's say I have two vertices $$$B$$$ and $$$C$$$ such $$$dist(A,B) = 42$$$ and $$$dist(A,C) = 50$$$. If I add the edge $$$(B, C)$$$, would the "immediate increase" be $$$50 - 42 = 8$$$?

    Also, when you say "this sum is the upper bound of the answer decrease", do you mean that the sum is an upper bound of the decrease of the diameter of the graph when adding the edge $$$(B, C)$$$?

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

what is the specific test in C? (last sentence in editorial)

»
3 года назад, скрыть # |
 
Проголосовать: нравится +3 Проголосовать: не нравится

Problem C has a brilliant name. No thinking, just pull your luck ))

»
3 года назад, скрыть # |
Rev. 2  
Проголосовать: нравится 0 Проголосовать: не нравится

In Problem C, I was shocked. I write AC code but I thought it was WA. Because I thought the loop needed p time every test case. if I loop run P time then I got TLE. That's why I didn't submit my code. but after finishing the contest I submit my code and it was an AC code.


void solve() { ll n, x, p; cin >> n >> x >> p; ll rem = n - x, sum = 0; if( x == 0 ) rem = 0; vector< int > v; for( ll i = 1 ; i <= n * 2 ; i++ ) { sum += i; if( sum % n == rem ) { cout << "YES" << endl; return; } if( i == p) break; } cout << "NO" << endl; }
»
3 года назад, скрыть # |
 
Проголосовать: нравится +48 Проголосовать: не нравится

Bonus question, can you guess the total number of cpu-days we have used to compute all the answers? Post you version in the comments!

To find the diameter of a general graph, we will need $$$n$$$ BFS, and each takes $$$O(n+m)$$$ time, overall $$$O(n*(n+m))$$$ per query. $$$O(n*q*(n+m+q))$$$ per testcase.

This problem has 80 testcases, assuming worst case $$$n=m=q=10^5$$$, its $$$80*10^5*10^5*(3*10^5) = 2.4*10^{17}$$$, assuming $$$10^8$$$ operations per second, on the normal computer we need $$$2.4*10^9$$$ seconds = ~3e5 days.

I was expecting the answer for generating testcases to be < 100s, using 1e5 parallel cpus.

  • »
    »
    3 года назад, скрыть # ^ |
     
    Проголосовать: нравится +206 Проголосовать: не нравится

    Both authors retired from CF in 2017, guess that's because they have been busy generating the test cases ever since...

  • »
    »
    3 года назад, скрыть # ^ |
    Rev. 2  
    Проголосовать: нравится +10 Проголосовать: не нравится

    I think there are better algorithms / heuristics to enormously speed up the brute force. For example: If you add an edge, for the current diameter, you run BFS from one of the endpoints and if the edge addition doesn't improve the diameter, you don't need to calculate anything at this step of the algorithm anymore.

    Also, things like keeping some upperbounds of diameters ending in nodes, and not running a BFS if isn't strictly needed. Maybe not worstcase complexity improvements but they will speed up the brute force a whole lot.

    • »
      »
      »
      3 года назад, скрыть # ^ |
      Rev. 6  
      Проголосовать: нравится +36 Проголосовать: не нравится

      I have a solution that works in $$$O(nm + q(n + m + q))$$$ for all tests except special ones.

      The idea is very simple, at the very beginning we calculate the furthest distance from each vertex with $$$n$$$ BFSs and call this value upper bound.

      Now, after adding the next edge, we go through the vertices in descending order of upper bound and recalculate the furthest vertex from them. If it has not changed, this upper bound is the answer after adding edge.

      But unfortunately, it's still not fast enough and an additional hack is needed. And this hack is simple — in addition to recalculating the farthest vertex, let's update the upper bound of other vertices $$$u$$$ as "max distance from current vertex" + "distance from the current vertex to $$$u$$$".

      This is already enough to pass all the tests except for special ones, which are essentially built against such solution (at the moment, my solution can generate an answer to every test except two special in an hour).

      But it was not used as a model solution, but was rather a validator for validating the simplest distributed algorithm.

      Сode snippet for better understanding
»
3 года назад, скрыть # |
 
Проголосовать: нравится +34 Проголосовать: не нравится

Bonus question, can you guess the total number of cpu-days we have used to compute all the answers? Post you version in the comments!

7

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

How to solve A using DFS?

»
3 года назад, скрыть # |
Rev. 2  
Проголосовать: нравится 0 Проголосовать: не нравится

Is there a ")" missing here for D?

and isn't max?

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

thank you for fast editorial

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

Why my solution got TLE even if it is O(nlogn).

My Solution: https://mirror.codeforces.com/contest/1804/submission/197105822 Can someone help me, please?

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

can someone explain why we are counting "at least one dark window segment" in problem D rather than one dark window segment .

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

In D, in last line, I think it should be max not min.

»
3 года назад, скрыть # |
 
Проголосовать: нравится +42 Проголосовать: не нравится

F is a beautiful problem with a very scary statement

»
3 года назад, скрыть # |
 
Проголосовать: нравится +8 Проголосовать: не нравится

cool way to get the correct answers in cool problem F

»
3 года назад, скрыть # |
 
Проголосовать: нравится +55 Проголосовать: не нравится

Problem E: I was blown by the idea of using the first set bit to represent start of the circle. Very clever.

»
3 года назад, скрыть # |
Rev. 2  
Проголосовать: нравится 0 Проголосовать: не нравится

In problem D, my approach for finding the maximum value is to, first calculate "01" or "10", then "00" and at last "11", and assign this as two widowed rooms, and the remaining ones are single windowed. Can anyone point out my mistake? submission

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

What I do in C is, I run a loop from i = 1 to 1e5 and checking if i*n-x exist as sum of p natural number in logP overall time complexity is O(n*logP) but still it got TLE Why?

https://mirror.codeforces.com/contest/1804/submission/197293170

  • »
    »
    3 года назад, скрыть # ^ |
     
    Проголосовать: нравится 0 Проголосовать: не нравится

    Note that you only loop from 1 to 1e4,so it would goes wrong when n is large enough.

    If the answer is No,it will run the loop completely,and your function f is $$$O(\log p)$$$,so the total is $$$t\times 10^4\times \log p=3\times 10^9$$$ and it goes tle.At least you should replace 1e4 with n*2.

    BTW,I'm not sure whether it's right or not to use binary search.

»
3 года назад, скрыть # |
 
Проголосовать: нравится +5 Проголосовать: не нравится

I have received a mail that my solution of C problem in Nebius Welcome Round is coinciding with Somebody else . In mail it was written , to post about the evidence .

C Problem was not too difficult for a person with average coding skills . The only trick in C was to get a control over the loop . I have seen many tutorials of past contests and in C problem , I used that for controlling the loop as the most common approach in these types get solved in nlog(n) or n root(n) . So I did it for n root(n) and then the code was simple a Brute force logic . It took me a number of contest and practice to reach specialist . Please reply me if i need to submit any supporting proof and what else can i upload . Please help GlebsHP , MikeMirzayanov . please do reply.

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

For Problem C.. I want to understand the repetitive behavior of the function (sum_from_1_to_2n % n), Please mention a mathematical or intuitive (better :) reasoning..

»
3 года назад, скрыть # |
 
Проголосовать: нравится +5 Проголосовать: не нравится

F is really fascinating.

»
3 года назад, скрыть # |
 
Проголосовать: нравится +8 Проголосовать: не нравится

Can anyone give me any hint for problem H? Is the problem approachable with bitmask dp and meet in the middle?

  • »
    »
    3 года назад, скрыть # ^ |
    Rev. 2  
    Проголосовать: нравится 0 Проголосовать: не нравится

    H can be solved with bitmask DP, but you have to come up with some methods to sum up the costs cleverly.

    • »
      »
      »
      3 года назад, скрыть # ^ |
       
      Проголосовать: нравится 0 Проголосовать: не нравится

      The problem I am facing is, I have two option, counter-clockwise and clockwise.

      I mean if its counter clockwise i have to add (a — b) * cnt[i][j] and (b — a) * cnt[i][j] if its clockwise. If i had only clockwise/ counter clockwise, i could have split the equation and take the sum independently (a * cnt[i][j] — b * cnt[i][j] for only counter clockwise). But now, how do i know which one is clock wise and which one is counter clockwise.

      • »
        »
        »
        »
        3 года назад, скрыть # ^ |
         
        Проголосовать: нравится 0 Проголосовать: не нравится

        That's the main difficulty of the problem. An approach is to consider the problem not from the perspective of characters, but from the edges on the cycle, that is, how many times each of them is used.

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

There is a formula mistake in tutorial of problem F.

Having the correct value of $$$c_(G_i)(v)$$$ we can use the binary search ...

That should be: Having the correct value of $$$c_{G_i}(v)$$$ we can use the binary search ...

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

Auto comment: topic has been updated by GlebsHP (previous revision, new revision, compare).

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

Simple DP for D... why is the editorial so complicated?

»
3 года назад, скрыть # |
 
Проголосовать: нравится 0 Проголосовать: не нравится

Auto comment: topic has been updated by GlebsHP (previous revision, new revision, compare).

»
3 года назад, скрыть # |
Rev. 2  
Проголосовать: нравится 0 Проголосовать: не нравится

I am very curious how the data of problem F was made, LOL

It is a really interesting problem, lots of thanks to the problem provider!