Use Monte-Carlo (MC) Methods and Temporal Difference (TD) Learning on couple of games and toy problems.