Scraping Codeforces Problems

→ Pay attention

Before contest
Spectral::Cup 2026 Round 1 (Codeforces Round 1094, Div. 1 + Div. 2)
08:20:11
Register now »

*has extra registration

→ Streams

Leetcode Weekly 499 + Biweekly 181 Solution Discussion

By aryanc403

Before stream 28:15:09

View all →

→ Top rated

#	User	Rating
1	Benq	3792
2	VivaciousAubergine	3647
3	Kevin114514	3603
4	jiangly	3583
5	turmax	3559
6	tourist	3541
7	strapple	3515
8	ksun48	3461
9	dXqwq	3436
10	Otomachi_Una	3413

Countries | Cities | Organizations

View all →

→ Top contributors

#	User	Contrib.
1	Qingyu	157
2	adamant	153
3	Um_nik	146
3	Proof_by_QED	146
5	Dominater069	145
6	errorgorn	141
7	cry	139
8	YuukiS	135
9	TheScrasse	134
10	chromate00	133

View all →

→ Find user

→ Recent actions

Detailed →

TheeLooser's blog

Scraping Codeforces Problems

By TheeLooser, history, 4 years ago, In English

Is there a way to scrape problem statements automatically?

peepohey

TheeLooser
4 years ago
11

Comments (7)

Show archived | Write comment?

pawarashish564

4 years ago, hide # |

Using codeforces API — checkout the Problem section.

→ Reply

TheeLooser

4 years ago, hide # ^ |

Unfortunately, the Problem object does not come with the statement text.

→ Reply

pawarashish564

4 years ago, hide # ^ |

I am not sure what you are trying to achieve but previously I was working on a similar kind of problem I used beautifulsoup from python to read HTML and parse the content. you can do a similar for your purpose.

→ Reply

TheeLooser

4 years ago, hide # ^ |

I tried using soup but it doesn't work anymore. I think Codeforces upgraded their systems (currently uses some sort of script to get statements on demand? I know very little about this stuff). In fact, previously you could just use wget to just download a problem page, like https://mirror.codeforces.com/problemset/problem/1673/F, to get the raw HTML. This doesn't work anymore. In case I might be missing something trivial, could you please try using soup again – I mean, right now? I think when you did your parsing, a simple wget command would've worked.

→ Reply

Xellos

4 years ago, hide # ^ |

Download page using the problem id?

→ Reply

TheeLooser

4 years ago, hide # |

So, a friend of mine looked into it and found out that wget/curl https://mirror.codeforces.com/contest/contestId/problems still works, while the problem with wget/curl https://mirror.codeforces.com/problemset/problem/contestId/index is that it just gives the preload HTML. So, scraping contest psets instead of individual problems is an alternative. Thanks for your comments.

→ Reply

Avanta

4 years ago, hide # ^ |

Unfortunately, it seems like this no longer work anymore :/ Did you manage to find any other alternative?

→ Reply

The only programming contests Web 2.0 platform

Server time: Apr/25/2026 09:14:51 (g2).

Desktop version, switch to mobile version.

Supported by