Guide to ML-Agents 1.0

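To summarize, we provide three training methods that can be used independently or together: Behavioral Cloning (BC), Generative Adversarial Imitation Learning (GAIL), and reinforcement learning (RL, using either PPO or SAC).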

Both BC and GAIL require recorded demonstrations, which are provided as input to the training algorithms.

Parameters: https://github.com/Unity-Technologies/ml-agents/blob/release_1/docs/Training-Configuration-File.md

PPO

Proximal Policy Optimization
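
As a rough illustration of what "proximal" means, the sketch below computes PPO's clipped surrogate objective for a single sample in plain Python. It is not ML-Agents code: the function name clipped_surrogate is hypothetical, and the epsilon argument is assumed to play the same role as the epsilon setting described in the configuration reference linked above.

```python
def clipped_surrogate(ratio, advantage, epsilon=0.2):
    """PPO clipped surrogate objective for one sample (illustrative sketch).

    ratio:     new_policy_prob / old_policy_prob for the action taken.
    advantage: estimated advantage of that action.
    epsilon:   clip range; 0.2 is a commonly used value (assumption).
    """
    # Keep the probability ratio inside [1 - epsilon, 1 + epsilon].
    clipped_ratio = max(1.0 - epsilon, min(ratio, 1.0 + epsilon))
    # Use the more pessimistic of the unclipped and clipped terms.
    return min(ratio * advantage, clipped_ratio * advantage)


# Positive advantage: the objective stops growing once ratio exceeds 1 + epsilon.
print(clipped_surrogate(ratio=1.5, advantage=2.0))
# Negative advantage: the objective stops changing once ratio drops below 1 - epsilon.
print(clipped_surrogate(ratio=0.5, advantage=-1.0))
# Ratio inside the clip range: the objective is simply ratio * advantage.
print(clipped_surrogate(ratio=1.0, advantage=3.0))
```

A smaller epsilon keeps each policy update closer to the previous policy, which tends to make training more stable but slower.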

Important Parameters