Записи в блоге - Codeforces

→ Обратите внимание

До соревнования
CodeTON Round 9 (Div. 1 + Div. 2, Rated, Prizes!)
32:29:51
Зарегистрироваться »

*есть доп. регистрация

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	tourist	4009
2	jiangly	3823
3	Benq	3738
4	Radewoosh	3633
5	jqdai0815	3620
6	orzdevinwang	3529
7	ecnerwala	3446
8	Um_nik	3396
9	ksun48	3390
10	gamegame	3386

Страны | Города | Организации

→ Лидеры (вклад)

№	Пользователь	Вклад
1	cry	167
2	Um_nik	163
3	maomao90	162
3	atcoder_official	162
5	adamant	159
6	-is-this-fft-	158
7	awoo	157
8	TheScrasse	154
9	Dominater069	153
9	nor	153

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя cf_chat_gpt

Google AI Bard joins the CF Rating Experiment

Автор cf_chat_gpt, история, 18 месяцев назад, По-английски

По-английски

Hello Humans,

It seems that AI Bard is now able to solve coding problems, therefore, I created an additional handle to test its capabilities in the next rated rounds : cf_bard,

Example of generated code:

Bard solution: https://mirror.codeforces.com/contest/1791/submission/205637247
ChatGPT solution: https://mirror.codeforces.com/contest/1791/submission/191894854

Let the battle begin & stay tuned :)

BR,

Полный текст и комментарии »

-25

cf_chat_gpt
18 месяцев назад
4

GPT-4 Rating

Автор cf_chat_gpt, история, 19 месяцев назад, По-английски

По-английски

Hello Humans,

Find below the GPT-4 Rating (also check my profile), after 9 rated contests (first 3 contests were with the use of GPT-3/-3.5 though).

--> Maximum Rating: 797.

Contests distribution:

6 div-2 contests (where only 1 problem was solved)
1 div-3 contest (where 1 problem is solved)
2 div-4 contests (where 4 problems are solved)

Number of passed solutions: 6

Number of solutions which finally got TLE (passing the first pretests however): 9

General methodology:

the submitted code is purely the output of GPT, without any change.
no solution hints are provided to GPT.
4, 5 retries (in avg) are requested per problem. (asking explicitly to use dp, brute force, to optimize the code for speed, to code in C++ or Python, reporting back to GPT the compilation error/ wrong output and letting him fix the code).

Observations:

When asked to use brute force, GPT is almost providing a functionally correct solution, which will TLE. It means it has some interesting ability to understand the problem statement (even when there's a lot of text)..
The generated Python code was slightly better than the C++ code (i.e. passing more pretests)..
GPT, quite often, cannot accurately determine the output of his program for a specific input. It means it has no access to a compiler for correction feedback. It would be much interesting if GPT can test his code on the test samples before providing a solution, but he doesn't :(
Weak logic ability on div-2 problem A. Sometimes the generated logic is almost correct but lacking few corner cases, and GPT was never able to confirm/test its logic on examples/test samples.. that's why it was miserably failing to solve almost all div-2 problem A statements..

See You, when GPT-5 is out..

BR,

Полный текст и комментарии »

+61

cf_chat_gpt
19 месяцев назад
6

ChatGPT Rating Experiment

Автор cf_chat_gpt, история, 22 месяца назад, По-английски

По-английски

Hello Humans,

Check below the ChatGPT rating during 3 Codeforces contests,

The submitted code was purely the output of ChatGPT without any changes.

Current Rating: 737, but I doubt it would go any higher..

Полный текст и комментарии »

Теги

chatgpt

+22

cf_chat_gpt
22 месяца назад
2

Codeforces (c) Copyright 2010-2024 Михаил Мирзаянов

Соревнования по программированию 2.0

Время на сервере: 22.11.2024 09:05:10 (l2).

Десктопная версия, переключиться на мобильную.

При поддержке