[Incoherent Rambling] Optimal strategy in game theory

№	Пользователь	Рейтинг
1	tourist	3985
2	jiangly	3814
3	jqdai0815	3682
4	Benq	3529
5	orzdevinwang	3526
6	ksun48	3517
7	Radewoosh	3410
8	hos.lyric	3399
9	ecnerwala	3392
9	Um_nik	3392

№	Пользователь	Вклад
1	cry	169
2	maomao90	162
2	Um_nik	162
4	atcoder_official	161
5	djm03178	158
6	-is-this-fft-	157
7	adamant	155
8	awoo	154
8	Dominater069	154
10	luogu_official	150

To whom it may concern,

Today I was reading some recent problems when I stumbled upon this div2B. 1956B - Nene and the Card Game. At first, it seems like an innocent game theory problem, until I read the following sentence.

Nene is very smart so she always selects cards optimally in order to maximize her score in the end of the game (after $$$2n$$$ rounds). If she has several optimal moves, she selects the move that minimizes your score in the end of the game.

This struck me as a bit odd, because in most game theory problems on Codeforces, it's a win/lose/draw game, or the player wants to maximize the difference between their score and their opponent's score, or they want to maximize their own score and all outcomes have the same sum of scores for the two opponents (which is equivalent). But Nene's objective is most peculiar, where she doesn't care about how many points her opponent has at all, until she cannot get any more for herself and then it is the only thing she cares about.

Now, what does the problem say is the strategy of Nene's opponent (you)?

...what is the maximum number of points you can get by taking your turns optimally?

Hmm, it seems that you do not care about Nene's score, although she cares about your score. Oh well.

At this moment I thought, "is this problem even well-defined?" What does an optimal strategy mean in this game? We seem to take it for granted that it even makes sense to talk about an "optimal strategy" at all. There are a few possible concepts of what an optimal strategy could mean. Maybe it means playing the move that gives you the best guaranteed score, assuming the worst case for your opponent's decisions. Or maybe it means playing the move that gives you the best outcome assuming that your opponent is rational and plays optimally. Yet another interpretation could involve Alice and Bob agreeing to collaborate, if they see a path that makes them both better-off than if they competed.

In most game theory problems on Codeforces, we are dealing with a two-player, zero-sum (or constant-sum), turn-based, finite, perfect-information game. In these cases, multiple interpretations of optimal play turn out to be equivalent. The one where you try to optimize your guaranteed score is called Maximin and the one where you try to optimize your score assuming a rational opponent is called Minimax. Both of these solution concepts are equivalent for this class of games.

However the idea of "maximize your score first, then minimize your opponent's score second" seems to imply that we are outside of zero-sum territory (otherwise maximizing your own score would be enough). We cannot guarantee that Minimax = Maximin in non-zero-sum games. For example, consider this simple example game.

Alice goes first.
Choice 1: Alice and Bob both get 1 point, game ends.
Choice 2: Go to Bob

If Bob has a turn:
Choice 1: Alice gets 0 points, Bob gets 1 point, game ends.
Choice 2: Alice gets 2 points, Bob gets 2 points, game ends.

Maximin: Suppose Alice wants to maximize her guaranteed score, assuming the worst case. Then she should make choice 1, because she can guarantee 1 point, while the worst case in choice 2 is 0 points.
Minimax: Suppose Alice wants to maximize her score assuming that Bob is rational and will try to maximize his own score. Then she should go with choice 2, since Bob will rationally get 2 points for each of them, which is better than 1 point.

In this example, we see how non-zero-sum games can result in different optimal strategies depending on which convention we choose. In general, the problem statement would need to be clear about which convention we are using.

So, what about this div2B problem? Is it well-defined? Technically, yes it is. After analyzing the mechanics of the game, you will notice that the sum of scores of the two players is the same in all outcomes. In fact, for each type of card, one of them will be played first and the other will be played second, resulting in $$$1$$$ point for one of the players. So the sum of scores will always be $$$n$$$ in every outcome, meaning it's a constant-sum game in disguise.

So, what lesson can we take from my incoherent rambling? Mainly, optimal play in game theory is surprisingly nuanced. In my opinion, problems on Codeforces should be immediately clear that they are well-defined, and this div2B problem requires a key observation in order to realize that it is well-defined. Maybe when we have zero-sum games in the future, we should make it clear that it is zero-sum, even if it's just a statement like "We can prove that the sum of the players' scores is the same in all outcomes", and maybe linking to a resource about the definition of optimal play in zero-sum games.

Комментарии (11)

Написать комментарий?

Errichto

8 месяцев назад, # |

+66

Agreed. This problem statement is invalid. I'm surprised that coordinators or strong testers didn't catch that.

→ Ответить

Golovanov399

8 месяцев назад, # ^ |

+18

What exactly is invalid about this problem statement?

← Rev. 4 →

Did you read the blog and it wasn't enough to convince you? So here's a more absurd example:

You play against Bob. You alternate moves. Each turn, a player says a not-yet-chosen number $$$1$$$, $$$2$$$ or $$$3$$$, and adds it to their score. If a player chooses a number equal to the length of their name, their score is additionally increased by a number that the problem author was thinking about yesterday. This value is not smaller than $$$-1.9$$$. Each player maximizes their final score.
You go first and your name is Alice. What will be your final score?

The Bob's strategy is undefined and it makes the problem invalid. Only after reading the second paragraph, we can prove that some part of the statement doesn't matter and can be ignored. You can also guess it after reading the stupid part of the first paragraph. Neither of these two arguments makes the first paragraph ok.

Red0

-14

No offense(I'm just a pupil), but isn't it the coder's job to be able to parse out what the problem statement is asking? Cause like if you can't, you're basically getting rejected by natural selection in coding. Also classic examples of your "invalid" problems happen very often in USACO.

Btw, I'm a big fan of ur videos :)

-is-this-fft-

+28

This is not really what's being discussed here. Of course it's the coder's job to parse what the problem statement is asking. The discussion is about problems where it is not clear whether the thing being asked actually makes mathematical sense. I'd be very interested in hearing what USACO problems are invalid.

+23

I have read the blog. It even says "is it well defined? Yes". The point of the blog is that whether the problem is well defined or not is unclear until you, well, basically solve the problem. I don't disagree with this.

You, on the other hand, claim that the problem statement is invalid. As far as I understand, you mean this because one needs to read the whole statement to the end in order to obtain a perfectly valid model as opposed to read only a part of the statement and expect its generalisation to be well defined. You also provide an example which is completely fine if you read it all, although it indeed sounds silly.

There are plenty of problems where a crucial detail making the problem solvable is provided after the main part of the statement, I don't see why it's such a big deal for you. I admit that this particular problem could be formulated better, but "invalid" is too strong a word for this.

+33

Well, I claim that a valid problem statement can't include incorrect or incomplete parts. You can't say "if a player chooses an even number then 2 + 2 = 5", and then just say that only odd numbers are in the game.

I see your point though.

Ah. I agree that saying something that is wrong under some conditions can be considered ill-defined even if these conditions are later prohibited. But maximizing your score first and minimizing your opponent's second is okay, isn't it? Maybe we don't really understand what the best strategy is then, but it at least exists in finite games?

Monogon

But maximizing your score first and minimizing your opponent's second is okay, isn't it?

I think it's ok as long as the problem statement clarifies what convention we are using when it says "optimal strategy".

I understand now. I want to say we always assume the worst by default, but I don't know why

mnaeraxr

Side comment: I think constant-sum games are in the category of zero-sum games.

Блог пользователя Monogon