Commit

Add mean field algorithms in the algorithms documentation.
PiperOrigin-RevId: 405659590
Change-Id: Icd120c13c236c83b06ab78f4c64361997b8fb4f2
TheoCabannes authored and lanctot committed Oct 27, 2021
1 parent a6a0f01 commit 1579ab8
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions docs/algorithms.md
@@ -19,6 +19,8 @@ CFR against a best responder (CFR-BR) | Tabular | [Johanson et
Exploitability / Best response | Tabular | [Shoham & Leyton-Brown '09](http://masfoundations.org/) | ![](_static/green_circ10.png "green circle")
External sampling Monte Carlo CFR | Tabular | [Lanctot et al. '09](http://mlanctot.info/files/papers/nips09mccfr.pdf), [Lanctot '13](http://mlanctot.info/files/papers/PhD_Thesis_MarcLanctot.pdf) | ![](_static/green_circ10.png "green circle")
Fixed Strategy Iteration CFR (FSICFR) | Tabular | [Neller & Hnath '11](https://cupola.gettysburg.edu/csfac/2/) | <font color="orange"><b>~</b></font>
Mean-field Fictitious Play for MFG | Tabular | [Perrin et al. '20](https://arxiv.org/abs/2007.03458) | <font color="orange"><b>~</b></font>
Online Mirror Descent for MFG | Tabular | [Perolat et al. '21](https://arxiv.org/abs/2103.00623) | <font color="orange"><b>~</b></font>
Outcome sampling Monte Carlo CFR | Tabular | [Lanctot et al. '09](http://mlanctot.info/files/papers/nips09mccfr.pdf), [Lanctot '13](http://mlanctot.info/files/papers/PhD_Thesis_MarcLanctot.pdf) | ![](_static/green_circ10.png "green circle")
Q-learning | Tabular | [Sutton & Barto '18](http://incompleteideas.net/book/the-book-2nd.html) | ![](_static/green_circ10.png "green circle")
SARSA | Tabular | [Sutton & Barto '18](http://incompleteideas.net/book/the-book-2nd.html) | ![](_static/green_circ10.png "green circle")