Exploration-Exploitation Trade-off

Exploration is trying out random actions in order to find more information about the environment..

Exploitation is utilizing known information to maximize the reward.

../_images/RL_exploration_exploitation_tradeoff.png

Fig. 11 Broad types of ML

The most commonly implemented approach to the same is known as -greedy algorithm