RL_toolbox

May 12, 2017 · View on GitHub

all the algorithm is running on pycharm IDE, or the package loss error may exist.

a3c:for continous action space, use multi processes, but saving model has not been implemented.
trpo:for continous and discrete action space