This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Uses ptan library: https://github.com/Shmuma/rl/blob/ptan/ptan/samples/dqn_expreplay.py | |
| Config for the run: https://github.com/Shmuma/rl/blob/ptan/ptan/samples/runs/dqn_exp_cartpole.ini |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Crossentropy method with dense NN 40+40 | |
| https://github.com/Shmuma/Practical_RL/blob/master/week1/MountainCar-xentropy.ipynb |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Crossentropy method | |
| https://github.com/Shmuma/Practical_RL/blob/master/week1/taxi_crossentropy.ipynb |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Genetical algorythm with mutations probability decay. | |
| https://github.com/Shmuma/Practical_RL/blob/master/week0/frozen-8x8.ipynb |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| https://github.com/Shmuma/rl/blob/master/algos/a3c_async.py |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Slow version: https://github.com/Shmuma/rl/blob/master/test-1/a3c_atari.py |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| #!/usr/bin/env python | |
| # Quick-n-dirty implementation of Advantage Actor-Critic method from https://arxiv.org/abs/1602.01783 | |
| import argparse | |
| import logging | |
| import numpy as np | |
| from rl_lib.wrappers import HistoryWrapper | |
| logger = logging.getLogger() |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| #!/usr/bin/env python | |
| # Multi-layer perceptron inspired by this: https://gym.openai.com/evaluations/eval_P4KyYPwIQdSg6EqvHgYjiw | |
| # https://gist.githubusercontent.com/anonymous/d829ec2f8bda088ac897aa2055dcd3a8/raw/d3fcdfdcc9038bf24385589e94939dcd3c198349/crossentropy_method.py | |
| import gym | |
| import argparse | |
| from gym import wrappers | |
| import numpy as np | |
| from keras.models import Sequential | |
| from keras.layers import Dense, Activation |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| #!/usr/bin/env python | |
| # Multi-layer perceptron inspired by this: https://gym.openai.com/evaluations/eval_P4KyYPwIQdSg6EqvHgYjiw | |
| # https://gist.githubusercontent.com/anonymous/d829ec2f8bda088ac897aa2055dcd3a8/raw/d3fcdfdcc9038bf24385589e94939dcd3c198349/crossentropy_method.py | |
| import gym | |
| import argparse | |
| from gym import wrappers | |
| import numpy as np | |
| from keras.models import Sequential | |
| from keras.layers import Dense, Activation |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| #!/usr/bin/env python | |
| # Multi-layer perceptron inspired by this: https://gym.openai.com/evaluations/eval_P4KyYPwIQdSg6EqvHgYjiw | |
| # https://gist.githubusercontent.com/anonymous/d829ec2f8bda088ac897aa2055dcd3a8/raw/d3fcdfdcc9038bf24385589e94939dcd3c198349/crossentropy_method.py | |
| import gym | |
| import argparse | |
| from gym import wrappers | |
| import numpy as np | |
| from keras.models import Sequential | |
| from keras.layers import Dense, Activation |