Cheating has been discussed a lot lately. I'm writing a blog post about it with actual data (mainly, I want to know if there is an actual increase). However, in the many discussions that have ensued both here and on Discord servers, I noticed several recurring misconceptions. I wrote a section to debunk them and it grew kind of long, long enough to be a post in itself. So while my scripts are running, we can treat this as a kind of teaser.
Things you should know before you write a comment about cheating
Vocabulary:
- To plagiarize means to use someone else's work while passing it off as yours. It refers to the act itself, not to its detection or punishment. "jrandomcoder plagiarized all of his solutions in round 987" is a meaningful statement. "jrandomcoder cheated in round 987 but he still hasn't gotten plagiarized, I request MikeMirzayanov to look into this" is not.
- Plague is a disease. In a metaphorical sense it can mean any disease and in a doubly-metaphorical sense it could refer to cheating. However, it is not the same word as "plagiarism" and usually can't be used in its place. "jrandomcoder cheated in round 987 but he still hasn't gotten plagued" is a doubly meaningless statement. If this is a suggested punishment then I'm wrong.
Procedural:
- Codeforces has a plagiarism detector.
- The plagiarism detector is run after every round. However, this does not happen live or immediately. Presumably it takes a while and requires considerable human input (for example, if there are close calls).
- The plagiarism detector will be eventually run, likely within a few days after the contest.
- Ratings are updated soon after the round and likely before plagiarism detection happens. This is because users are impatient. Usually, if ratings are not updated 6 hours after the contest, there is already a sea of "is it rated" and requests for MikeMirzayanov to look into it.
- Ratings will eventually be updated to take disqualified people into account. This does not necessarily happen right after the plagiarism check. When it happens, you see a notification of the form "Ratings are temporarily rolled back. They will be returned soon."
- People have been banned for cheating.
Behavioral:
- When you report cheaters, don't tag random testers you found in the contest announcements. Most of them do a virtual contest, leave some feedback and that's it. They are not the organizers or administrators of the contest. They do not have any powers to punish cheaters. (I also doubt that tagging authors or maybe even coordinators is useful, but since I have been neither, I can't really comment on this).
- Definitely don't tag random "famous" or active members of the community who had nothing to do with organizing the contest. We can't do anything.
- Absolutely definitely don't write your cheater reports as replies to unrelated people's unrelated comments. What the hell?
If you are collecting data:
- If a user is caught cheating, they will usually be marked as "out of competition" and the verdict of all their submissions is set to Skipped.
- However, it appears that they are not always marked "out of competition". I don't really know why or what's the difference.
- Having some skipped submissions (and being out of competition) does not necessarily mean someone is a cheater. This also happens if you are e.g. Div 1 in a Div. 2 contest and you make a submission, pass the pretests and then make another submission to the same problem (for example, if you were only a few milliseconds below the time limit). The earlier submission will also be Skipped after system tests.
- Even having all skipped submissions (and being out of competition) does not necessarily mean someone is a cheater. See discussion here. We can ignore this here when gathering statistics because such cases are rare. But this has to be taken into account when discussing individuals.
- If you call https://mirror.codeforces.com/api/contest.status?contestId=2000&handle=jrandomcoder to check if all of someone's submissions in the contest are Skipped, you need to filter out all submissions made in practice mode and similar. They will not be skipped even if the user got caught cheating.
- When someone is caught cheating but before the ratings have been updated, the contest will appear on their profile as if they solved 0 problems, but with a rating change, possibly even a positive one.
- When someone is caught cheating and after the ratings have been updated, the contest appears on their profile as if they participated in it but it wasn't rated for them. You will see it if you choose "Only unrated" or "All" in the drop-down menu in the top-right corner in the Contests tab.
- If you want to compare the frequency of cheating (or many things, in fact), it is not enough to compare one recent contest to one older contest (and similar). Because everything varies unpredictably all the time, you will be comparing random noise to random noise. Some problems may be more suitable for detecting cheaters. The leakers may have had a good or a bad run. For all we know, even the time of day and the day of the week might affect cheating.
Keep in mind though, as I've said before, on a platform with a history as long as Codeforces has, you can't really assume "condition X happens (if and) only if a user got caught cheating". You also can't assume that various unusual situations have always been handled the same way, or even that they have been handled the same way in recent cases, just because you checked a number of recent ones and noticed a pattern. So everything here should be treated as a heuristic only.
If you say "wait, so Mike finally listened and started to ban cheaters?"
When you go visit some profiles, you might very well be greeted with this message.
This is message is a very recent addition. But I'm not sure about "finally". I don't think it's the case that Mike "finally" started banning cheaters this week or something. Because I don't think the bans are that recent. I can't prove it but I have some evidence.
Workflow:
- Pick contest from a few months to a year or so old.
- Go to the Status tab and filter for Skipped verdict.
- Click on the usernames randomly until you find someone who has been disabled.
- Go to their Submissions tab. Note the time of the last submission and the last Skipped submission.
I think we can assume that for many people, the time of the last submission is close to the time of their banning. With the delays between contests, plagiarism checks and revised rating updates, I think we can also assume that if someone is banned for cheating, it doesn't immediately happen after the contest. I've repeated this procedure for the following users:
- abhishek_1624 (last submission: 16.03.2024, last Skipped submission: 01.03.2024)
- bansala271 (last submission: 22.04.2024, last Skipped submission: 06.04.2024)
- Var57 (last submission = last Skipped submission: 29.04.2024)
- BhuvaneshwariM (last submission: 03.06.2024, last Skipped submission: 25.05.2024)
- binary_search75 (last submission = last Skipped submission: 30.06.2024)
- AlicenBob (last submission: 18.01.2024, last Skipped submission: 30.11.2023)
- manishreddy03 (last submission = last Skipped submission: 03.12.2023)
Not cherry-picking data here. In almost all of these cases, the user got caught cheating and stopped submitting some 2 weeks or so later. It doesn't prove anything but in my opinion it is strong evidence that these users did not get banned yesterday, they got banned months ago after getting caught cheating multiple times. (To be sure, we need to look at more data and compare to people who didn't get banned and so on. But it's a good starting point for a hypothesis.)
I think the reason why some people still believe that cheaters are never banned is that Codeforces, for the longest time, did not really indicate that an user is banned. Let us take rotavirus, perhaps the most famously banned user on Codeforces. For the longest time, you could tell he was banned only by the fact that the "Send message" link was missing. Compared to his new account, the links in the red rectangle were missing.
Obviously, not many people are aware of this small detail, so many people perhaps didn't realize how many people are banned (for cheating or trolling). Now it's quite easy to see if you follow my procedure above.
What this blog is not
[Added an hour after publishing based on some early discussion.]
This blog is not an attempt to say that everything is fine and stop complaining. There's lots of suggestions going around, and I think some of them are good, but I'll let other people defend them. I wrote this blog because a lot of people appear to be misinformed, especially about the bullet points in the "Procedural" section. And you can't seriously discuss any situation on false premises.
This blog is also not an excuse to write the final closing remark. It's something I realized after I typed most of this up.
One final note...
As I'm writing my actual blog, it has become clear to me just how much effort does in fact go towards combating cheaters. Things are not perfect and you can always do better, but people are working on this and people do care. And then I see comments and messages where people casually write "bro the admins don't care they just want to inflate participant counts" [the word 'bro' is not my addition]. I've tried pushing back against this once, ages ago, and I got some "bro you think this proves anything" type message.
Maybe Mike doesn't want to say this but I do: this is so entitled, inconsiderate and rude. I've been in a position where I put in a lot of work into something. And then people who don't even look into how much work this is come and tell me that this is trivial or I'm stupid or I don't even care. Or maybe they don't say it about me, but do say it about people who are working on things similar to what I'm working on. Mike doesn't deserve this kind of feedback.
I agree, especially with the last part. Mike is the whole reason this website exists. If you're mad he isnt "combatting cheaters enough," then codeforces isn't for you.
Thank you so much for this. It gets old seeing these same blog posts asking about rating rollback, plagiarism detection and so on after every single contest like clockwork. I'm sure they'll still happen, but at least we can leave a link to this blog instead of having to explain the same thing over and over again :)
Why early-morning-dreams
Finally a truly neutral and insightful blog to educate us. I have noticed that people act very rudely when anything related to cheating is being discussed whether it is somebody announcing some suspicioius submissions or somebody defending there skipped solution. A lot of these toxic messages are directed towards Codeforces, Respected Mike, and some peoples who have not yet been marked as cheater but their solutions were skipped(and they are defending their submissions). I do understand the frustration due to cheaters but a lot of this hate is misdirected, Codeforces and Mike are actively fighting against cheating, so they obviously dont deserve this. And a single skipped contest does not neccessarily mean that person is a cheater, first of all these algorithms are not 100% accurate and if everybody uses no template, optimal logic and very simple or generic variable names your chances of making it to skipped list increases drastically even if you did not cheat. I have friends who got their submissions skipped because the submission was optimal 4 line python code for a div4 A, and a friend who had a rare stroke of luck and managed to barely solve div(1+2) E, I can vouch for his innocence if that means anything but a single lucky day and generic variable names for an optimal solution was enough to earn him -28 contribution when he tried to defend himself, Conclusion:- Hate brings no benifit to anybody, please reconsider before saying such things.
Thanks for the blog. I feel like nowadays too many users are fixated on cheaters, leaked solutions and so on and they forget that the real essence of Competitive Programming is the joy of solving interesting problems, atleast in my opinion.
Things you should know before you discuss cheating: if you notice that nothing is happening and make a comment, you get muted.
Why is this being downvoted?
i doubt i am the most known banned user, we should do a poll
I can't read "procedure" and not think of Kramnik being stupid anymore.
interesting comment
Interesting
"Let's see game review, yeah of course 93.4%. To me its very clear, lets do the procedure. Report and block"
that kinda reminds me. does anyone remember WeaponizedAutist? whatever happened to him?
Adding another thing to reporting etiquette: please do not publicize the names of the cheating groups/channels you find. You may think that it is a good thing because you will potentially get some cheating group disbanded or because they deserve to be shamed, but it only leads to more cheaters flocking to that medium and thus more cheating.
Rather, if you want to report such a group, report it to Mike himself (or maybe some coordinator, I don't know) and do not mention names/identifiable details in your public report.
"Report it to Mike himself" will only lead to him receiving a ton of spam messages. He is already too busy to answer the comments (where he is mentioned) on other issues.
Judging by the comments and blogs from after yesterday's contest, trying to teach people to not post these links is a lost cause...
Can't wait to see the data compiled later, the thought/discussion of cheaters is getting everywhere but there's no real statistical data about it to see how actually concerning it is.
Why was Rotavirus banned?
He put $$$998244853$$$ as the modulo in a problem and pissed on Mike
Wasn't he banned because of racist comments that triggered an argument with an LGM?
no i didn't write such comments
Not only that. He posted constant spam and troll comments and was muted some 20 times before getting banned. That incident was just the final straw.
"However, it appears that they are not always marked "out of competition". I don't really know why or what's the difference."
I think the first time it's out of competition, and the second time, all submissions are skipped but rated, which causes a massive rating loss.
anyone tell me the procedure to report cheaters.
When someone is caught cheating but before the ratings have been updated, the contest will appear on their profile as if they solved 0 problems, but with a rating change, possibly even a positive one.
Imagine getting positive rating for not solving a single problem. This should be fixed asap.
It's not really broken, it's just unintuitive. They will lose the rating gain once the ratings are recalculated.
Informative!
great blog!
In the last 10 minutes 2 different blogs about a group of Chinese GM's allegedly selling solutions were taken down (possibly mass reported/downvoted etc). Thoughts?
Not that i am defending him but This blog is still up whereas as you said the blogs about chinese GM's were instantly deleted