DNR's blog

By DNR, 18 months ago, In English
+72

»
18 months ago, Rev. 2 | +34

Just feel that I'm so stupid that I cannot even beat ChatGPT :(

»
18 months ago | +851

Hacked :)

»
18 months ago | +1

What do you mean?

»
18 months ago | 0

CP is Chess now.

»
18 months ago | +108

I am impressed that people even attempted to use ChatGPT on problem F, a problem rated some 1400 points higher than ChatGPT. ChatGPT must feel flattered, if it could feel.

»
18 months ago | +8

Feels like this is more of an L on the authors' part for not making strong enough tests, if even brute force passes them.

  • »
    »
    18 months ago | +28

    If you think about it, this might actually be the way to go, since offering full feedback makes it much easier for people who are unskilled at CP to GPT their way through problems.

    Does it worsen the experience for other people? Yes, but I'd prefer weaker pretests over hundreds of GPT greys above me in the final ranklist.

    • »
      »
      »
      18 months ago | 0

      This actually is a great idea: the authors can feed their problem into GPT and design the testcases in such a manner that GPT's solution passes the pretests. But another problem could arise: people could easily hack these solutions and get points.

    • »
      »
      »
      18 months ago | +60

      Maybe Hacker Cup was right all along...

    • »
      »
      »
      17 months ago | +8

      This can be fixed by stress testing, though. For most problems on Codeforces, writing a brute force and a test generator only takes 5 minutes if you have the boilerplate ready. Having a 25% chance to solve a harder problem in contest is pretty powerful: say you have a 25% chance to solve 3 problems, a 50% chance to solve 4, and a 25% chance to solve 5. If you just do enough contests that your rating converges, that's some easy CM/M right there.
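The boilerplate mentioned above can be sketched as follows. The problem (maximum subarray sum) and both solutions are illustrative stand-ins, not from any actual round: `brute` is the obviously-correct reference, `fast` is the solution under test, and `gen` produces small random inputs.

```python
import random

def brute(a):
    # Obviously-correct O(n^2) reference: check every subarray.
    return max(sum(a[i:j + 1]) for i in range(len(a)) for j in range(i, len(a)))

def fast(a):
    # Solution under test: Kadane's algorithm, O(n).
    best = cur = a[0]
    for x in a[1:]:
        cur = max(x, cur + x)
        best = max(best, cur)
    return best

def gen(rng):
    # Small random tests catch most bugs, and failures stay readable.
    n = rng.randint(1, 8)
    return [rng.randint(-10, 10) for _ in range(n)]

def stress(iters=1000, seed=0):
    rng = random.Random(seed)
    for _ in range(iters):
        a = gen(rng)
        expected, got = brute(a), fast(a)
        if expected != got:
            return f"FAILED on {a}: expected {expected}, got {got}"
    return "OK"
```

Once a counterexample is printed, you shrink it by hand or by tightening `gen`; the same harness is reused from contest to contest with only the three functions swapped out.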

»
18 months ago | +36

So, is it acceptable to describe the greys and greens as "borderline retarded"? This is clearly rude, insulting and retarded at the same time.
I don't know much about the Codeforces Code of Conduct, but it can't be that one can just insult others like that.
Whether the blog author meant to refer to the particular grey and green participants mentioned in the blog or to all people of these ranks, I think it's equally unacceptable, and this blog should be edited or deleted.

»
18 months ago, Rev. 3 | +8

I'm more sad about the fact that it took me 50 minutes to carefully implement D, even though I got the idea instantly. And o1-preview solves it in less than a minute. Guess the "borderline retarded" goes all the way up to at least 1836 rating.

»
18 months ago | -19

Side note: I think Educational Rounds should be unrated.

»
18 months ago | +18

Plz don't call me retarded, I am trying to improve :(

»
18 months ago, Rev. 3 | 0

Ah, don't forget A as well. I hacked 7 submissions that run in $$$O(XY)$$$, $$$O(\min(X,Y)^2)$$$, or $$$O(X^2+Y^2)$$$, which are obviously trash under the current constraints.

Hacks:

»
18 months ago | +81

This post actually inspires a great way to combat cheating with GPT: simply make the pretests weaker, so that those brute-force solutions from GPT are allowed to pass pretests. As shown in the previous OpenAI blog on CP, the model's performance increases quite significantly as the number of allowed submissions grows; in addition, it is known that AI performs worse when the feedback it receives is not 100% accurate (i.e. pretests passed, but FST). This really seems like a plausible way to reduce AI's effectiveness while affecting genuine human solvers much less (any competent contestant submitting an O(n^2) brute force to a n=10^5 question should know they'll FST anyway).

The above can be done in multiple ways, e.g. not including a max test in the pretests, which also helps reduce pretest judging time. A downside is that people could then hack all of these brute-force solutions and collect a lot of points; maybe the hacking system needs to be redesigned in some way. I'm sure there are better methods than this, but it's a suggestion as a starting point.

That being said, if CF does want to take this path, it might be beneficial to announce it, mainly to protect newer contestants who have become accustomed to today's strong pretests, so that they are not frustrated unexpectedly.

  • »
    »
    18 months ago | +4

    There's another downside to the above example method: people could submit a brute force to a difficult problem on an alt, lock it, and copy a legitimate solution from the room on their main.

    Maybe it's time to reconsider in-contest hacking in the GPT era... But maybe someone can come up with a clever method that preserves the hacking system while still making the above cheat-combating method work.

    • »
      »
      »
      18 months ago | +28

      I don't think very many people want to preserve in-contest hacking.

    • »
      »
      »
      18 months ago | 0

      Maybe it's time to reconsider in-contest hacking

      Yes, maybe it's even possible to create a separate short phase after coding where people can challenge others' solutions and get points for that. Like, imagine being the top coder in your room just based on hacks. But I don't think a single round on Codeforces has ever used a format like that, not that I remember.

    • »
      »
      »
      18 months ago | 0

      If there is a concern that participants might hack GPT brute-force solutions, it could be possible to run system tests immediately after the contest and only open up hacking afterward.

  • »
    »
    18 months ago, Rev. 2 | +24

    any competent contestant submitting an O(n^2) brute force to a n=10^5 question should know they'll FST anyway

    MrDindows will strongly disagree with you

  • »
    »
    18 months ago | +24

    I like the idea in principle, but there have been several problems where the constant factor is a real issue, and having max tests in the pretests is our main line of defense for catching that (I don't want to have to make random max tests and try them in custom invocation for every problem).

    For a very recent example, many solutions to 2035F - Tree Operations with the right complexity got TLE in pretests, as that problem requires a low constant implementation.

  • »
    »
    18 months ago | 0

    I don't really think so. You can counter that with local testing, i.e. writing a brute-force program to check the correctness of the produced solution, and benchmarking it to see whether it runs within the time limit. I'm sure a green or a cyan is more than competent enough to do those things. The only exceptions where I think this strategy would fail are problems where generating strong tests is extremely difficult, such as graph problems, but those don't appear often enough to stop cheaters from still posting high performances. Codeforces' best bet is probably to just let them do what they want, since they are going to leave the platform after about 5 contests to get that juicy interview anyway.
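The local benchmarking described here might look like the sketch below. The solution, the input shape, and the 2-second limit are all illustrative assumptions; in practice you would feed the generated max test to the actual submission.

```python
import time

def solve(a):
    # Hypothetical solution under test: best prefix sum, O(n).
    total, best = 0, 0
    for x in a:
        total += x
        best = max(best, total)
    return best

def benchmark(n=10**6, runs=3, limit_seconds=2.0):
    # Build a max-size input locally, the way a generator for the
    # real problem would, then time the worst of several runs.
    a = list(range(n))
    worst = 0.0
    for _ in range(runs):
        start = time.perf_counter()
        solve(a)
        worst = max(worst, time.perf_counter() - start)
    return "OK" if worst < limit_seconds else f"too slow: {worst:.2f}s"
```

Taking the worst of a few runs guards against one lucky warm-cache timing; if the margin is thin on your machine, it will be thinner on the judge.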