Using supervised training signals of observable state dynamics to speed-up and improve reinforcement learning | IEEE Conference Publication | IEEE Xplore