Skip to content

Instantly share code, notes, and snippets.

  1. Install simple_dqn.
  2. Run ./train.sh Breakout-v0 --evironment gym.
  3. Check results/Breakout-v0.csv for best performing epoch (in my case it was 61).
  4. Run ./test_gym.sh snapshots/Breakout-v0_61.pkl (replace 61 with your best epoch).
  5. Optional: run ./upload_gym.sh results/Breakout-v0 --api_key <your_key> to upload the results.

The Simple DQN implementation uses network architecture and hyperparameters from DeepMind Nature paper.