Hey @AlexKuhnle I noticed that the Runner interface actually never calls agent.experience(...) method. Regardless if I set evaluation=True or not.
I am wondering, if I do the following
runner = Runner(
agent=agent,
environment=environment
)
runner.run(num_episodes=nb_episodes)
runner.close()
does the actual training occurs?