DeepRLCourse2022 / bguan_ppo_lunarlander /_stable_baselines3_version
bguan's lunar lander model using PPO trained for 500K timesteps
807c5ec
1.5.0