RL_toolbox

May 12, 2017 ยท View on GitHub

all the algorithm is running on pycharm IDE, or the package loss error may exist.

implemented algorithm: trpo a3c

  • a3c:for continous action space, use multi processes, but saving model has not been implemented.
  • trpo:for continous and discrete action space

run

  • a3c:run a3c/a3c_continous.py in pycharm IDE
  • trpo:run experiment/trpo_continous.py in pycharm IDE

contain some useful reinforcement learning algorithm and relative tool

Contents

  1. 1all the algorithm is running on pycharm IDE, or the package loss error may exist.
  2. 2implemented algorithm: trpo a3c
  3. 3run
  4. 4contain some useful reinforcement learning algorithm and relative tool