Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
UCB(Reinforcement Learning) (aribis369#51)
Upper Confidence Bound Learning.The Given Code is used to find which out of the 10 ads to be displayed on website for maximum Click Through Response by the user. The dataset is virtual showing what the ith user would have done if one of the 10 ads was shown to him i.e. 1 specifying he would have clicked it and 0 means he would have ignored the add
- Loading branch information