Because I am a betting game player, I was surprised by the task given in the question, so I calculated the reward rate of the training data set and found that the reward rate is greater than 1. This is not in line with reality. In reality, the reward rate of betting companies is less than 1.