Created
April 13, 2017 03:09
-
-
Save ricklentz/4c360f2b0b5be0530c3a9b5d289cce53 to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# Temporal Difference Learning | |
# Reinforcement Learning | |
# Deep Q-Learning | |
# Experience Replay | |
# State, Action, Reward Triples | |
# TDLambda (Temporal Difference Learning, Monte Carlo) and or Dynamic Programming | |
# mac xcode-select --install | |
# mac pip install numpy incremental | |
# mac brew install golang libjpeg-turbo swig | |
# mac sudo -H pip3 install cmake | |
# mac export SDKROOT=/Applications/Xcode.app/Contents/Developer | |
sudo -H pip3 install cmake | |
sudo apt install zlib1g-dev swig | |
sudo -H pip3 install gym[all] | |
# Hyperparameters (learning rate, discount rate, epsilon, feature extraction, nn architecture) | |
# Model the system (state, environment, actions, rewards, returns, episodes, discounting, policies, value functions) | |
# Convolutional Neural Networks, Recurrent Neural Networks, Regression, Classification, Generative Adversarial Networks, Autoencoders | |
# Markov Decision Process | |
#Set of all states, set of all actions, Set of all rewards, state transition probabilities, discount factor (gamma) | |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment