1043D - Mysterious Crime requires you to input 1 million integers and ultimately process them in either O(nm) or O(nm log n) time. But the time limit was set to only 1 second. This is very unfriendly to any language that isn't C++, and it even puts C++ solutions at risk of failing.
Case in point: tourist's in-contest submission used an STL map and exceeded the time limit, causing him to drop over 100 places. Meanwhile, ksun48 and mnbvmar had almost exactly the same solution and barely squeezed under the time limit at 998 ms. tourist was also able to pass with the same code in practice mode by switching from C++11 to C++14, which shows how close it was.
I admit there are certain problems where it is interesting to distinguish O(n) and O(n log n) solutions, but this isn't one of them, as it doesn't take any novel insight to remove the log from the runtime here. Overall it would have been much better with a time limit of 2 or 3 seconds, or alternatively with m ≤ 5 instead of 10.
To be clear, I believe the problem setters most likely weren't aware of the map solution, and I don't want to blame anyone in particular. But in general I've been observing a lot of strict time limits on problems where more lenient limits would be better.
What makes this problem worse is that my map solution took only around 500 ms on the pretests :( It may be my fault for not knowing that 10^6 map operations wouldn't pass in 1s, but couldn't the pretests have been stronger?
I had an O(nm log n) solution and it behaved the same on pretests and systests (about 500 ms). It seems you just can't put a maxtest for every solution in there.
I'm kinda surprised that C++ people had problems with the time limit on D, as it was very much solvable in Python (PyPy only took 421 ms for me). Problem E, on the other hand, might be impossible in PyPy, as the I/O is simply too big relative to the time limit.
I think it has more to do with STL map<> in particular performing poorly with integer pairs than with C++ performance in general.
No, the problem was that with pairs you store O(3*n*m) integers, plus the pointers used inside the map structure, plus the extra bookkeeping the map needs (for tree rebalancing, for example), plus the allocations inside the map, plus lots of cache misses. If you remove the map and use vector + sort, it will probably run way faster.
Yeah, I didn't elaborate. set and map have a horrible constant factor because they aren't cache-friendly like vector. With all the extra overhead of integer pairs it's even worse.
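For illustration, here is a minimal sketch of the vector + sort idea mentioned above (the data and the counting task are made up, not the actual problem): instead of inserting a million pairs into map<>, sort them once and count equal runs over contiguous memory.

```cpp
#include <bits/stdc++.h>
using namespace std;

int main() {
    // Illustrative only: count occurrences of 1e6 integer pairs.
    const int N = 1000000;
    mt19937 rng(12345);
    vector<pair<int,int>> a(N);
    for (auto &p : a) p = {(int)(rng() % 100000), (int)(rng() % 100000)};

    // map<pair<int,int>,int> cnt; for (auto &p : a) ++cnt[p];
    // allocates one tree node per distinct key and jumps around memory.

    // Sort once, then scan equal runs: contiguous access, far fewer cache misses.
    sort(a.begin(), a.end());
    vector<pair<pair<int,int>,int>> freq;   // (pair, count)
    for (int i = 0; i < N; ) {
        int j = i;
        while (j < N && a[j] == a[i]) ++j;
        freq.push_back({a[i], j - i});
        i = j;
    }
    cout << freq.size() << "\n";
}
```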
Challenge accepted — 45036971
Ah, so it was possible. In my code I usually use input = sys.stdin.readline, but switching to input = iter(sys.stdin.read().splitlines()).next made it run in 1.5 s. I hadn't seen that one before; it's a nice combination of reading all the input at once while still keeping the functionality of the usual input. Using some homemade string-to-int conversion I was even able to get it down to 1.3 s. Will remember this for the future, thanks!
Indeed. I looked through your solutions; it hadn't struck me to use a custom-made str-to-int and treat the input file as an array. I think that's a great way to get better computation speed (especially since Python's built-in str-to-int is slow).
After playing around with this for a while, I've noticed that input = iter(sys.stdin.readlines()).next seems to be a bit better than iter(sys.stdin.read().splitlines()).next. Also, I've pretty much only seen improvements using this over input = sys.stdin.readline. For example, on this problem my solution went from 951 ms down to 327 ms. I will definitely use this in the future for pretty much every problem I solve.
Thanks! I have more optimizations I've tinkered with here that might interest you too.
It's also notable that both ksun48 and mnbvmar used C++17 in the contest, while (as mentioned in the blog) tourist used C++11. When he submitted in C++14 it passed much more comfortably. Especially considering that there is some fluctuation in running times on the Codeforces judge, I think that for these problems (where 2000 ms will still eliminate naive solutions) adding more TL can't really hurt the competition, and it makes the contest more about solving problems than about choosing the latest version of C++.
Why is an inefficient solution failing a bad thing? This isn't about distinguishing solutions; with one extra log, you often can't do that anyway. As a contestant, when you see 1 million as a constraint, you may want to think about a linear-time solution, or weigh your chances: think about whether your idea with a log will run in time, maybe use a custom test to check, and decide what to do based on the specific situation. A tighter limit simply means "we're fine with solutions passing if they're well-written, but we're not going to put extra effort into making sure they all pass". Note that 1 second is the default TL in Polygon.
I think having such a solution fail is sometimes OK, but if you are going to do that, you should at least let people know that it is going to fail (for example, by putting a maxtest in the pretests) instead of having it pass the pretests in about half the TL and then suddenly fail in the systests.
Did the setters even think about such a slow solution, though? Again, 1 second is the default TL; it doesn't have to have been set specifically with these solutions in mind. Also this.
Yeah, I suppose it's really hard to prevent something like this from happening if the authors don't think of the solution
I think it's fine if the TL is even tighter and all solutions with log factors fail. It's also fine if the TL is looser and allows all solutions with log factors. But something in the middle should be avoided whenever possible (in this case a constant-optimization technique can be a decisive factor).
If the TL is tighter, then constant optimisation can be a decisive factor for linear solutions (or rather, for writing a solution that isn't constant-inefficient). As a problemsetter, I would rather risk making luck (or skill at constant optimisation) a bigger factor for asymptotically suboptimal solutions than for asymptotically optimal ones.
I think if you solved it with map, you solved it wrong. I solved it in O(nm) using DSU.
DSU sounds really cool, bro. How about just using a for loop?
After the contest I resubmitted my code that got TLE on test 36 and got AC. Identical code (even the compilers are the same):
http://mirror.codeforces.com/contest/1043/submission/45016350
http://mirror.codeforces.com/contest/1043/submission/45031166
Even tmwilliamlin168's solution got AC :\
Actually, I used c++14 http://mirror.codeforces.com/contest/1043/submission/45002137 during the contest :'(
Edit: my solution also passes now with c++14 http://mirror.codeforces.com/contest/1043/submission/45031725
But, I used c++17 :(
That's because the CF judge will run solutions that are very close to the TL multiple times, I think.
No, I once got a TLE on a pretest during systesting. And also, the same code with the same compiler can get accepted after the contest if its worst-case running time is close to the time limit.
Isn't that what I am saying?
If, during a contest, your solution runs very close to the TL, CF will run it multiple times. If it goes over the TL even once, I guess it will TLE.
In contrast, when you submit it manually it is run only once, so whether or not it passes the TL can be completely a matter of luck (997 vs 1000 ms is not a huge difference).
Yes, sorry... apparently I am neither reading comments nor problem statements correctly these days.
Agreed! I think the problem-writing philosophy should be to set the loosest constraints that keep out undesired complexities, instead of setting the tightest constraints that allow good complexities. The naive here is O(N^2 M), far from workable (and even an optimized O(N^2) is unlikely to come close). In general, I'd argue for allowing the occasional hyper-optimized naive to sneak through over allowing correct solutions to fail.
Related side-note: I'm a little sad that fast I/O is becoming a requirement for many of the problems on CF. For veterans it's just a matter of adding two lines to template, but it increases the barrier to entry for newer contestants and punishes lots of solutions that deserve to pass.
“But it increases the barrier to entry for newer contestants”
Very cool seeing a nutella that has empathy for beginners. Maybe you could teach Um_nik?
(just kidding :)
So empathy for beginners is about allowing them to do what they want and telling them they are good anyway? You used slow I/O, but that's okay, we raised the TL for you. You can't read, but that's okay, you can always ask the authors to explain the problem just for you. Maybe we should also tolerate those using Python and set limits such that only 10^7 operations fit in 1 second?
Imagine your teachers giving you an A for anything. Would you learn anything? I don't think so.
No. But I would still be getting an A.
So let's give rounds consisting of one A+B problem. Everyone is a winner!
As someone who's used to squeezing in stupid sqrt-solutions for problems that are otherwise pretty hard, I should support this...
My recommendation for setting TLs:
Of course, sometimes it's impossible to satisfy the rules above. In such cases maybe we should admit that some poorly-written solutions may fail. But when the difference between good solutions and bad solutions is clear, I follow the rules above.
Why not use unordered_map instead of map here?
Mostly because unordered_map can be hacked (without the proper adjustments).
Why not just use an array instead of unordered_map? The indices only go up to 10^5 anyway.
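For what it's worth, a minimal sketch of the array-instead-of-hash-map idea (the counting task here is illustrative, not the actual solution): when keys are bounded by 10^5, a plain array gives O(1) lookups with no hashing and nothing to hack.

```cpp
#include <bits/stdc++.h>
using namespace std;

int main() {
    const int MAXV = 100000;            // keys are known to be < 10^5
    vector<int> cnt(MAXV, 0);           // plays the role of unordered_map<int,int>

    vector<int> values = {3, 7, 3, 99999, 7, 3};   // illustrative input
    for (int v : values) ++cnt[v];      // direct indexing: no hashing, no collisions

    cout << cnt[3] << " " << cnt[7] << "\n";       // prints: 3 2
}
```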
Even O(n*m) failed for me — http://mirror.codeforces.com/contest/1043/submission/45070910
BTW, MS C++ is ok in 700ms.
Yes, this is unfortunately because you need to add the following two lines to the beginning of your program for faster cin/cout (something scott_wu mentioned above):
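(The two lines appear to have been lost from the comment; presumably they are the usual fast-I/O pair, shown here inside a minimal program.)

```cpp
#include <iostream>

int main() {
    // Stop syncing C++ streams with C stdio and untie cin from cout,
    // so large cin/cout I/O is no longer the bottleneck.
    std::ios_base::sync_with_stdio(false);
    std::cin.tie(nullptr);
    // ...rest of the solution reads with cin as usual...
}
```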
Why not simply set the time limit to 2 s for all normal problems, and more than 2 s for the hardest ones? ~1 s for I/O and 1 s for the solution.
And maybe also stop playing around with 10^5, 2*10^5, 3*10^5, 4*10^5, 10^6 as the limit on N. =)
I'm not saying that CF doesn't need to loosen its time limits a bit, but just to clear things up: 1043D had an O(NM) solution that passed without any issues.
Yes, I say all this as someone who implemented the O(nm) solution during the contest: 45003278. My code uses about 1/4 of the available time, and I still think the time limit should be adjusted so that the solution passes comfortably.
What if someone used more insertions into map<> (e.g. 3x more) and complained about the time limit not allowing his solution, but allowing some with the same complexity that just insert fewer times?
At some point a solution with the intended complexity will have a high enough constant factor that you cannot distinguish it well from a hyper-optimized solution of worse complexity.
So I am not saying you can always get it perfect, but you can definitely think about where to set the cutoff and try to make it relatively lenient. My main point is that for this problem (and several others), the constraints are much higher and the time limit much stricter than necessary.
I agree with the general principle, but I don't think it was strict. My in-contest linear solution runs in 185 ms and http://mirror.codeforces.com/blog/entry/62808?#comment-467989 is a solution with log that runs in 500 ms. If you put 1 million pairs into map<> and get TLE because map is slow, that doesn't make the TL strict.