Revision

Back to Reinforcement Learning

Introduction

Value based method: Deep Q Network

Policy based methods: REINFORCE

Actor Critic