DDQN Mountain Car Trainer v3

Height shaping (sin(3*pos) delta *100). 64-unit nets. Train/step. Epsilon decay 50k steps. Full-speed steps (no FPS cap). Solved: avg > -110.

Avg Reward (Last 10)
0.0
Best Reward
0
Global Steps
0
Epsilon (Exploration)
1.000
Episode Return
0.00
Training Log