项目作者: philipshurpik

项目描述 :
Simple implementations of vanilla reinforce (policy gradient) and actor critic methods with numpy and different frameworks
高级语言: Python
项目地址: git://github.com/philipshurpik/reinforce-experiments.git
创建时间: 2018-01-23T23:21:05Z
项目社区:https://github.com/philipshurpik/reinforce-experiments

开源协议:

下载