Code used to obtain these results can be found at https://github.com/NervanaSystems/simple_dqn, commit 31a92a9.
This code runs with Neon commit 344372b. Training and test scripts are included in scripts.txt above.
Note that for training, the screen width and screen height must be specified as (40, 52).
Default training parameters are used, as set in src/main.py. This model was trained for 77 epochs, which takes roughly 15 hours on a Titan X GPU.
learning_rate=0.00025: Learning rate
discount_rate=0.99: Discount rate for future rewards
batch_size=32: Batch size for neural network
optimizer=rmsprop: Network optimization algorithm
decay_rate=0.95: Decay rate for RMSProp algorithm
clip_error=1: Clip error term in update
train_steps=250000: How many training steps per epoch
epochs=77: How many epochs to run
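As a rough sanity check on the settings listed above, the total number of training steps and the throughput implied by the reported runtime can be worked out directly. This is only a sketch: the `params` dictionary is a hypothetical container for the listed values, not a structure taken from src/main.py, and the throughput figure assumes the full 15 hours is spent on training steps (any per-epoch testing time is ignored).

```python
# Values copied from the parameter list; the dict itself is illustrative only.
params = {
    "learning_rate": 0.00025,
    "discount_rate": 0.99,
    "batch_size": 32,
    "optimizer": "rmsprop",
    "decay_rate": 0.95,
    "clip_error": 1,
    "train_steps": 250000,  # training steps per epoch
    "epochs": 77,
}

# Total training steps over the whole run.
total_steps = params["train_steps"] * params["epochs"]

# Reported wall-clock time on a Titan X GPU (assumed to be all training).
reported_hours = 15
steps_per_second = total_steps / (reported_hours * 3600)

print(f"total training steps: {total_steps}")  # 19250000
print(f"implied throughput: {steps_per_second:.0f} steps/s")
```

Anyone reproducing the run on different hardware could compare their own steps-per-second figure against this estimate to judge how long the 77 epochs are likely to take.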
A few notes:
The quotes in the command inside scripts.txt, that is,
./train.sh “Breakout-v0” --environment gym --screen_width 40 --screen_height 52
are curly UTF-8 quotes and will cause a crash if copy-pasted verbatim. (We are adding this common failure case on our end, though.)
The instructions pointed to for rlgym only contain explicit code for the base install, which does not include the ATARI envs. To run on ATARI envs, they must also be installed explicitly:
pip install -e .[atari]
Please also provide an approximate runtime for anyone who might wish to reproduce the results.
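For the quote problem mentioned in the first note, one possible workaround is to normalize the quotes before running the command. The sketch below is an illustration, not part of the repository: `normalize_quotes` and `SMART_QUOTES` are hypothetical names, and only the four most common typographic quote characters are handled.

```python
import shlex

# Map typographic quotes to their plain ASCII equivalents.
SMART_QUOTES = {
    "\u201c": '"',  # left double quotation mark
    "\u201d": '"',  # right double quotation mark
    "\u2018": "'",  # left single quotation mark
    "\u2019": "'",  # right single quotation mark
}


def normalize_quotes(command: str) -> str:
    """Return the command with typographic quotes replaced by ASCII quotes."""
    return command.translate(str.maketrans(SMART_QUOTES))


# The command as it appears in scripts.txt, with curly quotes.
pasted = (
    "./train.sh \u201cBreakout-v0\u201d --environment gym "
    "--screen_width 40 --screen_height 52"
)

fixed = normalize_quotes(pasted)
args = shlex.split(fixed)  # split the way a POSIX shell would

print(fixed)
print(args[1])  # environment name, now without any quote characters
```

After normalization the shell strips the quotes as usual, so the environment name is passed as plain Breakout-v0 rather than with the curly quote characters attached.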