Proximal Policy Optimization
PPO remains a standard choice for continuous action spaces in FPS and real-time strategy environments. Its clipped surrogate objective limits how far a single update can move the policy away from the one that collected the data, which guards against the drastic updates that can destabilize training or collapse the performance of complex neural agents.
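The clipped surrogate objective can be sketched in a few lines. This is a minimal illustration, not a full training loop: the function name clipped_surrogate, its parameter names, and the default clip_eps=0.2 are assumptions chosen for the example, and the inputs are plain lists of per-sample log-probabilities and advantage estimates.

```python
import math

def clipped_surrogate(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective over a batch (a quantity to maximize).

    new_logp / old_logp: log-probabilities of the taken actions under the
    current policy and under the policy that collected the data.
    clip_eps: clipping range (0.2 is an assumed, commonly used default).
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio between the current and the data-collecting policy.
        ratio = math.exp(lp_new - lp_old)
        # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

With a positive advantage, the objective stops rewarding the policy once the ratio exceeds 1 + clip_eps, so there is no incentive to push the update further. With a negative advantage the unclipped term is kept when it is worse, so moves in a harmful direction are still fully penalized.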
Implementation Note
"Ideal for agents navigating 3D volumetric space where incremental precision is prioritized over raw exploration speed."
Primary Use Case
FPS / Open World Movement
Sample Efficiency
Low to Moderate (On-Policy)