Skip to content

Latest commit

 

History

History
31 lines (24 loc) · 1.15 KB

File metadata and controls

31 lines (24 loc) · 1.15 KB

Implementation of Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures

  • These results are from only 4 threads. So unstable to train.
  • Tensorflow Implementation
  • A3C type thread environment training method
  • PongDeterministic-v4 environment

Todo

  • Only CPU Training method
  • Use Network protocol method
  • Training on GPU, Inference on CPU

Reference