YF_YUSUF's blog

By YF_YUSUF, 8 months ago, translation, In English

A few hours ago, the hacking phase for the round (Div. 3) ended and system testing began. I wanted to check how many solutions failed the system tests, and I saw a huge number of TLEs on problem C. After looking at several of those solutions, I realized they all had one thing in common: unordered_map.
I think many people know that unordered_map (hereafter UM) can perform each operation in $$$O(1)$$$ time on average but $$$O(n)$$$ in the worst case. Because of this, some hackers crafted test cases that force UM into its $$$O(n)$$$ behavior, which makes the overall complexity $$$O(n^2)$$$.
It got me wondering why so many people use UM when a regular map passes comfortably within the time limits. I asked ChatGPT to solve this problem, and it gave me a solution using UM.

ChatGPT's code

On the one hand, it’s kind of funny that people who copied from ChatGPT (or another LLM) got their solutions rejected.
On the other hand, there are people who wrote the code themselves but decided to use UM instead of map (although I think that’s their own fault because they could have just used a regular map).


»
8 months ago

This is hilarious, and good entertainment too. That is, you go to the status page, pick problem C, and filter the TLE submissions, only to see that the contestants behind them follow... a certain pattern, you know.

»
8 months ago

bruh i was one of those who wrote their solution with UM, but i got MLE instead. can someone explain to me why i got MLE? this is my code. i also wrote a version in C++ and it failed with MLE again. after a while i used vectors instead, with $$$O(n)$$$ space and $$$O(n \log n)$$$ time complexity

this is my submission 333342133

  • »
    »
    8 months ago

    the map should have used $$$O(2n + \text{some constant factor})$$$ space, right?

    • »
      »
      »
      8 months ago

      I'm guessing it's because when you iterate with

      for i in range(k // 2 + 1):
              if i == 0 or (k % 2 == 0 and i == k // 2):
                  if array_mod_k_a[i] != array_mod_k_b[i]:
                      print("NO")
                      return
              else:
                  if array_mod_k_a[i] + array_mod_k_a[k - i] != array_mod_k_b[i] + array_mod_k_b[k - i]:
                      print("NO")
                      return
      

      and i doesn't exist in your defaultdict, you probably create an entry in it.

      Edit: I wanted to add that k can be up to 10^9+7. So you're creating up to (10^9+7)/2 entries in your dict, which needs far more than the 256 MB allowed.

      • »
        »
        »
        »
        8 months ago

        why am i creating these entries? shouldn't the dictionary only create $$$n$$$ entries?

        • »
          »
          »
          »
          »
          8 months ago

          Calling array_mod_k_a[i] will create an entry if it doesn't exist.

          From: https://docs.python.org/3/library/collections.html#defaultdict-examples

          When each key is encountered for the first time, it is not already in the mapping; so an entry is automatically created using the default_factory function which returns an empty list.

          (the example this is talking about is using a defaultdict of lists. In your case it would be 0 instead of an empty list)

          So e.g. when you hit if array_mod_k_a[i] + array_mod_k_a[k - i] != array_mod_k_b[i] + array_mod_k_b[k - i]: in your loop, if array_mod_k_a[i] does not exist, then it gets created.

          Since you're iterating over i = 0..k//2, you create an entry for every value in that range.

          This creates around 5·10^8 entries, which takes far more than 256 MB; in Python that's likely gigabytes of memory.
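          A minimal sketch of the effect (hypothetical function names, defaultdict of ints as in the submission): reading d[i] on a miss silently inserts a zero entry, while d.get(i, 0) returns the default without inserting anything.

```python
from collections import defaultdict

def probe_with_getitem(k):
    # d[i] on a missing key calls the default factory and INSERTS d[i] = 0,
    # so merely *reading* k//2 + 1 keys grows the dict to k//2 + 1 entries.
    d = defaultdict(int)
    for i in range(k // 2 + 1):
        _ = d[i]
    return len(d)

def probe_with_get(k):
    # d.get(i, 0) returns the default without inserting anything,
    # so the dict stays empty no matter how many keys we probe.
    d = defaultdict(int)
    for i in range(k // 2 + 1):
        _ = d.get(i, 0)
    return len(d)
```

          With k up to 10^9+7 the first version materializes hundreds of millions of entries, which is exactly what blows the memory limit; the second touches none.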

          • »
            »
            »
            »
            »
            »
            8 months ago

            thx very much. i'm actually stupid: all this time i thought i was iterating over n, not over k. i brainlagged too much during the contest

            • »
              »
              »
              »
              »
              »
              »
              8 months ago

              Yeah, I originally did that too; you can see it in my 7 wrong submissions for that problem.

»
8 months ago

Honestly, I was scared even with a normal map: looking at the time it took during the contest, I seriously considered resubmitting. Thank god I didn't.

»
8 months ago

I believe this will pass if you sort the array beforehand.

»
8 months ago

bro, I guess there are stupid people like me who still use UM. I know in the worst case it takes $$$O(n)$$$, but some of us are victims of our own carelessness. Though I do think there should have been a test for this during the contest itself :(

»
8 months ago

I am a beginner, so I don't know if I have a place to speak here lulw, but I use unordered_map all the time whenever I need something like a hashmap. Why is it preferred to use a normal map over unordered_map? Does the sorting provide some sort of a powerup?

  • »
    »
    8 months ago

    The reason map is preferred over unordered_map is that when you use umap[x], it applies the hash function (by default, std::hash) to find the bucket for x and returns the value if the key exists. But if there are hash collisions (several keys landing in the same bucket), a single lookup can take up to $$$O(n)$$$ to resolve. Carefully selected keys can push the overall time complexity as high as $$$O(n^{2})$$$.
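    The same weakness is easy to see in CPython, whose integer hash is just as predictable as GCC's default std::hash<int> (this is an analogy, not the C++ attack itself): every multiple of sys.hash_info.modulus hashes to the same value, so an attacker can precompute as many colliding keys as they like.

```python
import sys

# CPython's hash for non-negative ints is x mod sys.hash_info.modulus
# (2^61 - 1 on 64-bit builds), so it is fully predictable, much like
# GCC's std::hash<int>, which is simply the identity function.

def colliding_keys(n):
    # n distinct integers that all share hash value 0.
    m = sys.hash_info.modulus
    return [i * m for i in range(1, n + 1)]
```

    Feed enough such keys into a hash table with a predictable hash and every lookup degrades to a linear scan of one bucket.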

    • »
      »
      »
      8 months ago

      Ah ok, so the ordered map doesn't have this problem then? If so, shouldn't the normal ordered map be used in every situation?

      • »
        »
        »
        »
        8 months ago

        std::map takes $$$O(\log n)$$$ time per operation, where $$$n$$$ is the number of elements in the map.

        The reason people (and ChatGPT) use std::unordered_map is that it does $$$O(1)$$$ operations on average, so it is generally faster than std::map, except when hash collisions happen. Some people also don't know about hash collisions, so they don't use a custom, stronger hash, which leads to collision attacks against GCC's default std::hash in std::unordered_map.

        Read this blog on std::unordered_map for preventing most hash collisions. Basically, if you use

        struct custom_hash {
            static uint64_t splitmix64(uint64_t x) {
                // http://xorshift.di.unimi.it/splitmix64.c
                x += 0x9e3779b97f4a7c15;
                x = (x ^ (x >> 30)) * 0xbf58476d1ce4e5b9;
                x = (x ^ (x >> 27)) * 0x94d049bb133111eb;
                return x ^ (x >> 31);
            }
        
            size_t operator()(uint64_t x) const {
                static const uint64_t FIXED_RANDOM = chrono::steady_clock::now().time_since_epoch().count();
                return splitmix64(x + FIXED_RANDOM);
            }
        };
        

        and declare your map as unordered_map<int, int, custom_hash>, hash collisions will become much less likely.
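        The FIXED_RANDOM trick above is the same idea CPython uses for strings: the hash is salted with a per-process random value (controllable via PYTHONHASHSEED), so colliding keys can't be precomputed ahead of time. A small sketch of that behavior (the helper name is mine):

```python
import os
import subprocess
import sys

def str_hash_with_seed(seed):
    # hash('pwn') as seen by a fresh CPython process with a fixed hash seed.
    # Different seeds give different hashes for the same string, which is
    # the same defense as salting with FIXED_RANDOM in the C++ snippet:
    # an attacker can't precompute colliding keys without the run-time salt.
    env = dict(os.environ, PYTHONHASHSEED=str(seed))
    out = subprocess.run(
        [sys.executable, "-c", "print(hash('pwn'))"],
        env=env, capture_output=True, text=True, check=True,
    )
    return int(out.stdout)
```

        Within one process (one seed) the hash is stable, but across runs with different seeds it changes, so a test case that collides today won't collide tomorrow.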

»
8 months ago, hide # |
 
Vote: I like it +10 Vote: I do not like it

No human would ever use unordered_map.reserve in a contest

  • »
    »
    8 months ago

    Why not? If I'm using unordered_map, it means I'm already concerned about performance, and reserve might speed up the code significantly with no downsides, so there's practically no reason (at least none I'm aware of) not to use it.

  • »
    »
    8 months ago

    Why not? It prevents rebuilding the hash table as the map grows, so some contestants may well think that calling reserve improves performance.

  • »
    »
    7 months ago

    Why do you think so? That method doesn't seem to be anything unusual. With an approximate understanding of how UM works and of what unordered_map.reserve does, UM can become a powerful enough tool.

»
8 months ago

wait until GPT starts adding random time-based hash function to its code XD