mphielipp / pg-pong.py
Created October 7, 2016 05:16 — forked from etienne87/pg-pong.py
Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels
""" Trains an agent with (stochastic) Policy Gradients on Pong. Uses OpenAI Gym. """
import numpy as np
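try:
    import cPickle as pickle  # Python 2: C-accelerated pickle
except ImportError:
    import pickle  # Python 3: cPickle was merged into pickle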
import gym
from chainer import cuda
import cupy as cp
import time
import threading
#backend