Solving Multi-objective Reinforcement Learning Problems by EDA-RL - Acquisition of Various Strategies

1 January 2009

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 426-431
https://doi.org/10.1109/isda.2009.92

Abstract

EDA-RL, estimation of distribution algorithms for reinforcement learning problems, have been proposed by us recently. The EDA-RL can improve policies by EDA scheme: First, select better episodes. Secondly, estimate probabilistic models, i.e., policies, and finally, interact with the environment for generating new episodes. In this paper, the EDA-RL is extended for multi-objective reinforcement learning problems, where reward is given by several criteria. By incorporating the notions in evolutionary multi-objective optimization, the proposed method is enable to acquire various strategies by a single run.

Keywords

This publication has 4 references indexed in Scilit:

EDA-RL
Published by Association for Computing Machinery (ACM) ,2009
Incorporating a Metropolis method in a Distribution Estimation using Markov Random Field Algorithm
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Using a Markov network model in a univariate EDA
Published by Association for Computing Machinery (ACM) ,2005
Estimation of Distribution Algorithms with Kikuchi Approximations
Evolutionary Computation, 2005

Cited by 10 articles