Bayesian Q-Learning applied to a continuous version of the Prisoners Dilemma game. Agents converge to the Nash equilibrium solution (mutual defection).
raklokesh/Bayesian_Q-Learning
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Bayesian Q-Learning applied to a continuous version of the Prisoners Dilemma game. Agents converge to the Nash equilibrium solution (mutual defection).