The book ‘Reinforcement Learning’ by Richard S. Sutton lead to two important conclusions.
First, a reinforcement learning AI requires with four key elements:
- Policy – the way of work. A set of rules.
- Reward function – a sign for the AI that it did something right
- Value function – determines how much an action leads to the reward function
- Goal – the desired result.
In the book, there are three key techniques that could be used to implement an AI:
- Dynamic Programming
- Monte Carlo Algorithms
- Temporal Differential Algorithms
However, since the book was published in 2009 these practices might be outdated.