1.1 Learn the basic DRL concepts and popular algorithms.
1.2 Focus more on DDPG+HER algorithm.
1.) mujoco-py has more active community: the number of open issues ~120, while dm_control ~10.
1.1 Learn the basic DRL concepts and popular algorithms.
1.2 Focus more on DDPG+HER algorithm.
1.) mujoco-py has more active community: the number of open issues ~120, while dm_control ~10.
| #!/usr/bin/env python | |
| import os | |
| import mujoco_py | |
| import numpy as np | |
| PATH_TO_HUMANOID_XML = os.path.expanduser('~/.mujoco/mjpro150/model/humanoid.xml') | |
| #PATH_TO_HUMANOID_XML = os.path.expanduser('/home/rfa-xw/anaconda3/envs/mujoco-py/lib/python3.6/site-packages/gym/envs/robotics/assets/fetch/reach.xml') |
| ###Mac OS ignore | |
| .DS_Store | |
| ### PyCharm ### | |
| # User-specific stuff | |
| .idea/**/workspace.xml | |
| .idea/**/tasks.xml | |
| .idea/**/usage.statistics.xml | |
| .idea/**/dictionaries |
1.)qpos: position; qvel: velocity
2.)mujoco-py use actuator to make the move
joint type: free, ball, slide, hinge
actuator type: motor, position, velocity, cylinder, muscle
sensor type: touch, accelerometer, velocimeter, gyro, force, torque, magnetometer...
3.)
| """ | |
| Cart pole swing-up: Identical version to PILCO V0.9 | |
| """ | |
| import logging | |
| import math | |
| import gym | |
| from gym import spaces | |
| from gym.utils import seeding | |
| import numpy as np |