A distributional code for value in dopamine-based reinforcement learning
Nature, Published online: 15 January 2020; doi:10.1038/s41586-019-1924-6
Analyses of single-cell recordings from mouse ventral tegmental area are consistent with a model of reinforcement learning in which the brain represents possible future rewards not as a single mean of stochastic outcomes, as in the canonical model, but instead as a probability distribution.
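To make the contrast with the canonical model concrete, here is a minimal sketch of one way a population of value predictors could encode a reward distribution rather than a single mean. It rests on an assumption the summary above does not state: that each predictor weights positive and negative reward prediction errors differently. The function name `learn_values`, the asymmetry parameter `tau`, the learning rate, and the reward samples are all hypothetical choices for illustration, not the analysis from the article.

```python
import random


def learn_values(rewards, taus, base_lr=0.01, steps=50000, seed=0):
    """Train one value predictor per entry in `taus` on sampled rewards.

    Assumption (not from the source): each predictor scales positive
    prediction errors by tau and negative ones by (1 - tau).
    With tau = 0.5 the update is symmetric and tracks the mean,
    which corresponds to the canonical single-value model.
    """
    rng = random.Random(seed)
    values = [0.0 for _ in taus]
    for _ in range(steps):
        r = rng.choice(rewards)  # draw one stochastic outcome
        for i, tau in enumerate(taus):
            delta = r - values[i]  # reward prediction error
            lr = base_lr * (tau if delta > 0 else 1.0 - tau)
            values[i] += lr * delta
    return values


# Hypothetical reward distribution: 10 with probability 0.25, else 0.
rewards = [0.0, 0.0, 0.0, 10.0]
taus = [0.1, 0.5, 0.9]  # pessimistic, symmetric, optimistic predictors
values = learn_values(rewards, taus)

for tau, v in zip(taus, values):
    print(f"tau={tau:.1f} -> learned value {v:.2f}")
```

Under these assumptions the symmetric predictor settles near the mean reward, while the pessimistic and optimistic predictors settle below and above it. Read together, the spread of learned values carries information about the shape of the reward distribution that a single mean estimate discards.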
via Nature https://ift.tt/2qYAXTp