31 lines (24 loc) · 1.15 KB

Implementation of Scalable-Distributed-Deep-RL-with-Importance-Weighted-Actor-Learner-Architectures

These results are from only 4 threads. So unstable to train.
Tensorflow Implementation
A3C type thread environment training method
PongDeterministic-v4 environment

Todo

Only CPU Training method
Use Network protocol method
Training on GPU, Inference on CPU

Reference