Hostname: page-component-586b7cd67f-gb8f7 Total loading time: 0 Render date: 2024-11-24T20:23:15.598Z Has data issue: false hasContentIssue false

100 words on reward prediction error – 100 words

Published online by Cambridge University Press:  23 December 2019

Rights & Permissions [Opens in a new window]

Abstract

Type
Extras
Copyright
Copyright © The Author 2019 

Reward prediction error is like Marmite – you either love it or hate it. I hate it because it commits to a view of the brain that inherits from 20th-century behaviourism and reinforcement learning. When people say dopamine encodes reward prediction error, they are assuming that the brain is in the game of maximising reward. But it is not – the brain updates its beliefs and selects a preferred course of action. On this (planning as active inference) view, the available evidence suggests that dopamine encodes the precision of beliefs about policies – or, more simply, the confidence afforded (subpersonal) plans of action.

Submit a response

eLetters

No eLetters have been published for this article.