Foundations Of Intelligent Learning Agents (FILA) Assignments
reinforcement-learning monte-carlo linear-programming thompson-sampling ucb bootstrapping multi-armed-bandits bellman-equation temporal-differencing-learning howards-pi sarsa-learning kl-ucb windy-gridworld intelligent-learning-agents
-
Updated
Nov 8, 2019 - Python