Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation

Top Cited Papers
Open Access
Abstract
The exploration/exploitation dilemma describes the choice between maintaining the current strategy, or trying new strategies, to maximize rewards. The authors show that genes controlling striatal dopamine function are associated with exploitative learning. In contrast, a gene controlling prefrontal dopamine function is predictive of exploration when the value of alternative strategies is uncertain. The basal ganglia support learning to exploit decisions that have yielded positive outcomes in the past. In contrast, limited evidence implicates the prefrontal cortex in the process of making strategic exploratory decisions when the magnitude of potential outcomes is unknown. Here we examine neurogenetic contributions to individual differences in these distinct aspects of motivated human behavior, using a temporal decision-making task and computational analysis. We show that two genes controlling striatal dopamine function, DARPP-32 (also called PPP1R1B) and DRD2, are associated with exploitative learning to adjust response times incrementally as a function of positive and negative decision outcomes. In contrast, a gene primarily controlling prefrontal dopamine function (COMT) is associated with a particular type of 'directed exploration', in which exploratory decisions are made in proportion to Bayesian uncertainty about whether other choices might produce outcomes that are better than the status quo. Quantitative model fits reveal that genetic factors modulate independent parameters of a reinforcement learning system.