Paper
11 March 2002 Reinforcement learning and design of nonparametric sequential decision networks
Author Affiliations +
Abstract
In this paper we discuss the design of sequential detection networks for nonparametric sequential analysis. We present a general probabilistic model for sequential detection problems where the sample size as well as the statistics of the sample can be varied. A general sequential detection network handles three decisions. First, the network decides whether to continue sampling or stop and make a final decision. Second, in the case of continued sampling the network chooses the source for the next sample. Third, once the sampling is concluded the network makes the final classification decision. We present a Q-learning method to train sequential detection networks through reinforcement learning and cross-entropy minimization on labeled data. As a special case we obtain networks that approximate the optimal parametric sequential probability ratio test. The performance of the proposed detection networks is compared to optimal tests using simulations.
© (2002) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Emre Ertin and Kevin L. Priddy "Reinforcement learning and design of nonparametric sequential decision networks", Proc. SPIE 4739, Applications and Science of Computational Intelligence V, (11 March 2002); https://doi.org/10.1117/12.458718
Lens.org Logo
CITATIONS
Cited by 2 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Sensors

Computer programming

Automatic target recognition

Neural networks

Binary data

Error analysis

Feature selection

Back to Top