🟢 Different Levels of AWS Resources for Machine Learning Model Training and Deployment

👉 EC2 Instances: Full User Control (Least Pre-built Content)
With EC2, you have complete control over the entire setup. You need to:
- Start an EC2 instance (e.g., GPU-enabled for training deep learning models).
- Install dependencies manually (e.g., Python, ML libraries like PyTorch or TensorFlow).
- Copy or configure the training script, and handle the training data management (downloading data from S3 or other sources).
- Run the training process manually using your own code.
- Manage all aspects of the environment, scaling, and resource management.

nov05 / 20241122_AWS SageMaker JupyterLab (or any other IDE), set up GitHub username and password.md

Last active November 24, 2024 11:03

20241122_AWS SageMaker JupyterLab (or any other IDE), set up GitHub username and password

Don't use the email you registered with GitHub for commits. Instead, GitHub provides you with a proxy email for this purpose. Just go to 'Settings - Emails' in your GitHub account, and you'll find the proxy email there.
Don't use your GitHub login password for commits. Instead, go to 'Settings - Developer Settings - Personal access tokens', create a token, and use that as your password for commits. Since Fine-grained tokens are still in Preview, I'm using a classic token for now.

nov05 / 20241119_udacity-aws-mle-nanodegree-env.md

Last active December 6, 2024 01:30

Local Install Requirements

Python 3.7
MXNet 1.8
Pandas >= 1.2.4
AutoGluon 0.2.0

👉 create sagemaker base environment

nov05 / 20240322_reinforcement learning_neural network soft update.md

Last active March 22, 2024 12:22

20240322_reinforcement learning_neural network soft update

"deeprl/agent/DDPG_agent.py"

trg = trg*(1-τ) + src*τ
τ is stored in self.config.target_network_mix

    def soft_update(self, target, source):
        ## trg = trg*(1-τ) + src*τ
        ## τ is stored in self.config.target_network_mix
        for target_param, source_param in zip(target.parameters(), source.parameters()):
 target_param.detach_()

nov05 / 20240225_udacity deep reinforcement learning_py310 env setup.md

Last active November 2, 2024 03:54

👉 Udacity Deep Reinforcement Learning Python Environment Setup

⚠️ Python 3.11 has to be downgraded to Python 3.10, or Multiprocessing will cause TypeError: code() argument 13 must be str, not int in both Windows and Linux. Google Colab is currently using Python 3.10 as well.

Windows 11 (64-bit), VSCode, Powershell, Miniconda3, Python 3.10
repo: https://github.com/Nov05/udacity-deep-reinforcement-learning
working dir: D:\github\ udacity-deep-reinforcement-learning\python
package deeprl is copied and modified from https://github.com/ShangtongZhang/DeepRL/tree/master/ deep_rl into .\python.

nov05 / 20240224_You appear to be missing MuJoCo.md

Created February 24, 2024 17:07

(drlnd_p2) PS D:\github\udacity-deep-reinforcement-learning\python\mujoco-py> python examples\body_interaction.py

You appear to be missing MuJoCo.  We expected to find the file here: C:\Users\*\.mujoco\mujoco210

This package only provides python bindings, the library must be installed separately.

Please follow the instructions on the README to install MuJoCo

nov05 / 20240223_udacity_drlnd_p2_env.md

Last active November 18, 2024 21:20

Udacity Deep Reinforcement Learning - p2 & `deeprl` env setup

👉 check the drlnd_py310 env setup notes
👉 check the p1 env setup notes
👉 course curriculum
👉 Colab notebooks

nov05 / 20240222_issue gym.wrappers.Monitor.md

Last active February 22, 2024 17:03

⚠️ issue: from gym.wrappers import Monitor caused ImportError: cannot import name 'Monitor' from 'gym.wrappers'.

solution (2022'):

from gym.wrappers.record_video import RecordVideo
env = gym.make('CartPole-v1', render_mode="rgb_array")
env = RecordVideo(env, './video',  episode_trigger = lambda episode_number: True)
env.reset()

nov05 / 20240219_reinforcement learning_pong PPO training log 1200e.md

Last active February 19, 2024 10:31

20240218_pong-PPO.ipynb
👉 training log for reference
1000 episodes, T4 GPU, Wall time: 1h 38min 14s

Episode: 20, score: -15.750000
[-16. -16. -16. -16. -16. -16. -16. -14.]
Episode: 40, score: -12.625000

nov05 / 20240218_reinforcement learning_pong training log 1200e.md

Created February 19, 2024 06:00

20240218_reinforcement learning_pong training log 1200e

20240217_pong_REINFORCE.ipynb
👉 training log for reference
1200 episodes on T4 GPU, Wall time: 2h 12min 12s

Episode: 20, score: -14.500000
[-14. -15. -16. -13. -14. -16. -16. -12.]
Episode: 40, score: -14.500000

nov05

🟢 Different Levels of AWS Resources for Machine Learning Model Training and Deployment

👉 Udacity Deep Reinforcement Learning Python Environment Setup

Udacity Deep Reinforcement Learning - p2 & deeprl env setup

Udacity Deep Reinforcement Learning - p2 & `deeprl` env setup