Planning and acting in partially observable stochastic domains