Dopamine-independent effect of rewards on choices through hidden-state inference.
Marta Blanco-PozoThomas E AkamMark E WaltonPublished in: Nature neuroscience (2024)
Dopamine is implicated in adaptive behavior through reward prediction error (RPE) signals that update value estimates. There is also accumulating evidence that animals in structured environments can use inference processes to facilitate behavioral flexibility. However, it is unclear how these two accounts of reward-guided decision-making should be integrated. Using a two-step task for mice, we show that dopamine reports RPEs using value information inferred from task structure knowledge, alongside information about reward rate and movement. Nonetheless, although rewards strongly influenced choices and dopamine activity, neither activating nor inhibiting dopamine neurons at trial outcome affected future choice. These data were recapitulated by a neural network model where cortex learned to track hidden task states by predicting observations, while basal ganglia learned values and actions via RPEs. This shows that the influence of rewards on choices can stem from dopamine-independent information they convey about the world's state, not the dopaminergic RPEs they produce.
Keyphrases
- uric acid
- prefrontal cortex
- neural network
- healthcare
- health information
- single cell
- spinal cord
- metabolic syndrome
- randomized controlled trial
- type diabetes
- study protocol
- adipose tissue
- spinal cord injury
- functional connectivity
- electronic health record
- social media
- artificial intelligence
- skeletal muscle
- insulin resistance