Table of Contents

Model-based policy networks

Continuing on our reinforcement learning path, we consider here how to build a model of our environment and then train our policy network on that model instead of on the actual environment.
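To make the idea concrete, here is a minimal sketch of the first half of that loop: fitting a model of the environment from observed transitions and then stepping the model in place of the environment. Everything in it is an assumption made for illustration, not the implementation used in this article: the toy dynamics in `true_step`, the linear least-squares model, and the random action choice standing in for a policy network are all hypothetical.

```python
import numpy as np

def true_step(state, action):
    """Hypothetical 'actual environment': simple linear dynamics with a binary action."""
    A = np.array([[1.0, 0.1], [0.0, 1.0]])
    b = np.array([0.0, 0.1])
    return A @ state + b * (2.0 * action - 1.0)

def model_step(W, state, action):
    """Predict the next state with the learned model instead of the environment."""
    return np.concatenate([state, [action, 1.0]]) @ W

rng = np.random.default_rng(0)

# 1. Collect (state, action, next_state) transitions from the actual environment.
states = rng.normal(size=(200, 2))
actions = rng.integers(0, 2, size=200).astype(float)
next_states = np.array([true_step(s, a) for s, a in zip(states, actions)])

# 2. Fit the model: next_state is approximated by [state, action, 1] @ W.
inputs = np.column_stack([states, actions, np.ones(len(states))])
W, *_ = np.linalg.lstsq(inputs, next_states, rcond=None)

# 3. Roll out in the model and in the environment with the same actions.
#    A policy network would choose the actions here; we sample them at random.
model_state = np.zeros(2)
real_state = np.zeros(2)
for _ in range(10):
    action = float(rng.integers(0, 2))
    model_state = model_step(W, model_state, action)
    real_state = true_step(real_state, action)

print("model rollout:", model_state)
print("real rollout: ", real_state)
```

Because the toy dynamics are exactly linear in the state and action, the fitted model tracks the environment closely. A real environment would generally need a more expressive model, such as a neural network, and the model's prediction errors would accumulate over longer rollouts.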

References

Initial implementation

Analysis