Beginner's Guide to Greedy

№	Пользователь	Рейтинг
1	Benq	3792
2	VivaciousAubergine	3647
3	Kevin114514	3603
4	jiangly	3583
5	turmax	3559
6	tourist	3541
7	strapple	3515
8	ksun48	3461
9	dXqwq	3436
10	Otomachi_Una	3413

№	Пользователь	Вклад
1	Qingyu	157
2	adamant	153
3	Um_nik	147
4	Proof_by_QED	146
5	Dominater069	145
6	errorgorn	141
7	cry	139
8	YuukiS	135
9	TheScrasse	134
10	chromate00	133

Introduction

I personally find greedy problems hard to derive. After struggling with proofs for quite a while, there are a few things I realized. Greedy proofs are very dependent on the rules of the problem. Loosely speaking each optimization problem gives you observations, from those observations, you realize that if you take certain choices while avoiding all others, then it will always be optimal. That realization is termed Greedy.

There are a lot of optimization problems, but only a small subset allows you to break the problem into subproblems where solving them optimally leads to the best global answer. This property, known as Optimal Substructure, is found in both DP and Greedy algorithms. Sadly, not every problem has this, for example, I CANT FIND ANY EXAMPLES. Maybe say we have an array that constantly shuffles around between subsequent queries, and there exists and element $$$\lceil \frac{n}{3} \rceil $$$ times and $$$query(x)$$$ returns number of values $$$x$$$ in the array. And we need to find that element, maybe only good way to do it sample it and probabilistically we will get the answer with in few dozen queries.

While DP and Greedy share the optimal substructure property, DP is essentially "smart brute force", where you define the subproblems and check all transitions to find the best one. In Greedy, however, you have to rigorously prove that ignoring all other choices and taking just one specific path works every single time. This takes the difficulty one notch up from DP.

The idea behind DP is the same everywhere, gather enough observations to define your subproblems, transitions, and base cases. This same idea is explored in a lot of diverse places—there is DP on bitmasks, trees, digits, ranges and On and on.., but there are not many discussions on Greedy. To fill this gap, I would like to discuss a lot of problems and proofs to capture the essence of how to prove Greedy solutions.

Idea

To prove a greedy solution, the arguments are almost always Proof by Contradiction or Induction. The argument goes like this: let's assume a greedy solution $$$G$$$ and a magical optimal solution $$$O$$$ that doesn't follow the greedy choices. Then, given our observations, we prove that either $$$G$$$ performs no worse than $$$O$$$, or that given the constraints, the final solution produced by both strategies will look exactly the same. On the flip side, among many greedy choices, to disprove one, the easiest way is to find a counter-example.

Below, I have listed links to various problems and discussed the greedy ideas they use, along with some common greedy examples. Sometimes Greedy seems obvious and intuitive, but I encourage you to try forming a rigorous argument while proving the problems mentioned below.

Problems

Basic problems

Try to come up with formal arguments to get the hang of it. Basic doesn't always mean easy.

P1. I have $$$n$$$ items and a knapsack of capacity $$$W$$$. The $$$i$$$-th item has value $$$v_i$$$ and weight $$$w_i$$$. I am allowed to take fractions of items. Find the strategy that maximizes the total value.

Hint

Solution

P2.I am running along an infinitely long number line. I can run for at most $$$M$$$ units without refreshments. There are $$$n$$$ refreshment stalls, where the $$$i$$$-th stall is located at $$$x_i$$$. Find the strategy that minimizes the number of refreshment breaks.

Hint

Solution

P3. Given 2 integer arrays $$$a$$$ and $$$b$$$ of size $$$n$$$ filled with positive integers. Reorder arrays to maximize $$$\prod_{i=1}^{n} a_{i}^{b_i}$$$. Too easy :P

Hint

Solution

P4. Given array of $$$n$$$ activities, $$$i$$$th activity takes $$$p_i$$$ time to complete. I want to schedule activities such activities dont overlap,

Let $$$c_i$$$ be the completion time of the activities. Say $$$i$$$th activity starts at $$$x$$$th time, then its completion time is $$$c_i=x+p_i$$$. Minimize the $$$ \cdot \sum_{i=1}^{n} c_i$$$ , or divide by $$$n$$$ to call it minimum average completion time.

Hint

Solution

Given another array $$$s$$$ that tells the starting time of the activities. Find the maximum set of activities you take.

Hint

Solution

Note: Graph is denoted with $$$H$$$ , as the letter $$$G$$$ represents greedy solution.

By now, we must have gotten some idea about how we try to find properties that connect the greedy solution $$$G$$$ and the optimal solution $$$O$$$ to make them look the same. The same applies to problems involving graphs or trees, where we assume a certain optimal tree $$$T_{O}$$$, or some subgraph $$$H_{O} \subset H$$$, or some partially constructed subtree $$$T_{O}' \subset T_{O} \subset H$$$ basically anything concrete that we can play with and try to form some concrete arguments about their behavior.

P5. Given a undirected graph $$$H=(V,E)$$$ with non-negative weights.

Find its Minimum spanning tree.

Hint

Solution

Find strategy to find the 2nd best Minimum spanning tree.

Hint

Solution

Bonus

Ideas used in above

In my opinion, after struggling and failing to formulate a satisfying argument, it feels very weird to hear people in editorials and comments say, "Oh, that's intuitive... it's obvious... it's trivial." I'm left wondering, "How come?" usually, their reply is just Cpp code translated into English x_x

Rev.	Кто	Когда	Δ	Комментарий
en11	dominique38	2026-02-12 19:00:11	183	Final Draft 2: Clarified the wording in P2.
en10	dominique38	2026-02-10 15:19:29	2	(published)
en9	dominique38	2026-02-10 15:04:08	23902	Final Draft 1
en8	dominique38	2026-01-31 21:58:10	3923	Completed P1 to P5. Added little language correction.
en7	dominique38	2026-01-31 17:18:09	5005	All P1 to P5 done. Remaining are the problem links.
en6	dominique38	2026-01-29 14:50:47	2	Minor typo change
en5	dominique38	2026-01-29 14:49:28	0	No change: Added Coauthors.
en4	dominique38	2026-01-29 14:34:57	0	No change: Added Coauthors.
en3	dominique38	2026-01-29 14:32:37	6118	Addition: P5 half done. P4 remaining. CF problems remaining.
en2	dominique38	2026-01-28 22:25:05	8360	Addition: P1, P2 and P3 are done. P4 and P5 and onward are remaining.
en1	dominique38	2026-01-28 17:16:48	4669	Initial revision: Added the basic structure, and rough content. (saved to drafts)

Introduction

Idea

Problems

Basic problems

Ideas used in above

История