Machine learning – LO – Describe how

Describe how to optimize a policy in reinforcement learning