@bhaktipriya
Forked from awjuliani/SimplePolicy.ipynb
Created March 21, 2017 15:19
Policy gradient method for solving n-armed bandit problems.
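The notebook itself is not shown here, so the following is only a minimal sketch of what a policy gradient approach to an n-armed bandit can look like, not the gist's actual code. It assumes a softmax policy over per-arm preferences, Gaussian rewards with unit variance, and a running-average reward baseline; the names `softmax`, `train_bandit`, `arm_means`, `episodes`, and `lr` are all hypothetical and chosen for illustration.

```python
import math
import random


def softmax(prefs):
    """Turn a list of preferences into action probabilities."""
    m = max(prefs)  # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def train_bandit(arm_means, episodes=5000, lr=0.1, seed=0):
    """REINFORCE-style update on a stateless bandit (illustrative assumptions).

    arm_means: hypothetical true mean reward of each arm.
    Returns the learned action probabilities.
    """
    rng = random.Random(seed)
    prefs = [0.0] * len(arm_means)  # one learnable preference per arm
    baseline = 0.0                  # running average of observed rewards

    for t in range(1, episodes + 1):
        probs = softmax(prefs)
        # Sample an arm from the current policy.
        action = rng.choices(range(len(prefs)), weights=probs)[0]
        # Assumed reward model: Gaussian noise around the arm's mean.
        reward = rng.gauss(arm_means[action], 1.0)

        baseline += (reward - baseline) / t
        advantage = reward - baseline

        # Gradient of log pi(action) w.r.t. preference a is 1[a == action] - pi(a).
        for a in range(len(prefs)):
            grad_log_prob = (1.0 if a == action else 0.0) - probs[a]
            prefs[a] += lr * advantage * grad_log_prob

    return softmax(prefs)


if __name__ == "__main__":
    learned = train_bandit([0.1, 0.5, -0.3, 2.0])
    best_arm = max(range(len(learned)), key=lambda a: learned[a])
    print("learned policy:", [round(p, 3) for p in learned])
    print("preferred arm:", best_arm)
```

Subtracting the baseline does not change the expected gradient but typically reduces its variance, which is why the sketch includes it; a neural-network policy trained with an automatic differentiation library would follow the same update rule.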