Abstract
We consider a controlled diffusion process whose description depends on an unknown parameter α, and investigate the following control policy. With each value of α an optimal stationary control is associated. The parameter α is estimated recursively from the trajectory by Bayes' method, and the optimal stationary control corresponding to the current estimate is applied. We establish the consistency of the estimate and present asymptotic properties of the criterion function. These results follow from the central limit theorem, the law of large numbers, and the law of the iterated logarithm for local martingales.
