Revision
Back to Reinforcement Learning
Introduction
Value based method: Deep Q Network
Policy based methods: REINFORCE
Actor Critic