A simultaneous perturbation stochastic approximation-based actor-critic algorithm for Markov decision processes | IEEE Journals & Magazine | IEEE Xplore