How to model an RL problem: Markov Decision Processes
How to model an RL problem: Dynamic programming
Comparing activation functions