This repository is the implementation of SOFA, the Simulator for OFfline leArning and evaluation.
Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems. Jin Huang, Harrie Oosterhuis, Maarten de Rijke, Herke van Hoof. Recsys 2020.
$ cd examples
$ python run_dqn.py
We also provide the details of DQN-based Policy used in experiments and the related hyperparamters (See Appendix).
If you use our code, please cite our paper:
@inproceedings{huang2020keeping,
title={Keeping Dataset Biases out of the Simulation: A Debiased Simulator for Reinforcement Learning based Recommender Systems},
author={Huang, Jin and Oosterhuis, Harrie and de Rijke, Maarten and van Hoof, Herke},
booktitle={Fourteenth ACM Conference on Recommender Systems},
pages={190--199},
year={2020}
}