Tight-entry orbital reinforcement learning environment built with Gymnasium + Stable Baselines3 (PPO).
This script defines MjölnirEnvV33, a custom 2D physics-based orbital environment where an RL agent attempts to sling a “hammer” into a stable orbit around a black hole.
Version 33 introduces:
- Tight orbit-entry requirements (must hit the exact white ring radius)
- Explicit reward visualization in the render panel
- Improved gravitational model
- Quadratic orbit-error penalties