The previous chapter introduced the Nash equilibrium strategy, which was derived using an algorithm called CFR. Before discussing CFR, let's first introduce regret matching.
Taking rock-paper-scissors as an example, its payoff matrix is as follows.
The algorithm iterates as follows:
For each player, initialize regretSum[action] to 0.
Iterate T times:
****Normalize any positive regretSum values to...
User Profile
Collapse
-
justfunnychen started a topic Game Theory in Action: Pluribus and CFR in Texas Hold'em (2) CFR Counterfactual Regret Minimization Algorithmin GeneralGame Theory in Action: Pluribus and CFR in Texas Hold'em (2) CFR Counterfactual Regret Minimization Algorithm
No activity results to display
Show More