Skip to content

Instantly share code, notes, and snippets.

@kkoci
Forked from bhaktipriya/SimplePolicy.ipynb
Created February 24, 2018 03:44
Show Gist options
  • Save kkoci/373dcf45e7e03cfb5d4789a629f2345a to your computer and use it in GitHub Desktop.
Save kkoci/373dcf45e7e03cfb5d4789a629f2345a to your computer and use it in GitHub Desktop.
Policy gradient method for solving n-armed bandit problems.
Display the source blob
Display the rendered blob
Raw
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment