The goal of this observational study is to investigate whether the healthy human brain shows a diversity of optimistic and pessimistic reward signals and whether changes in this distribution in Parkinson's disease (PD) can provide mechanistic insights into the cause of symptoms. The main hypotheses it aims to test are: 1. As shown in mice, a diversity of optimistic and pessimistic dopamine reward signals exists in the human ventral tegmental area (VTA) and the ventro-rostral basal ganglia circuit. 2. Pessimistic neurons are more severely affected by neurodegeneration in PD. Researchers will compare the diversity of optimistic and pessimistic dopamine reward signals in patients with PD and healthy participants to see if there is a skewed distribution of optimistic and pessimistic reward signals in PD. Participants will play a task probing reward- and movement related brain activity in an MRI scanner. Researchers will derive functional topographic maps of optimism/pessimism in VTA, substantia nigra pars compacta (SNc), striatum and cortical areas such as the anterior cingulate cortex (ACC). In sub-study 1, participants will be tested on one study day where patients with PD are tested in the off-medication state (40 control participants, 40 patients with PD). In sub-study 2, to test whether/how dopaminergic medication affects the distribution of optimism/pessimism, participants will be tested on two study days (30 control participants, 30 patients with PD). Patients with PD are tested one day in the off-medication state, another day in the on-medication state (order counterbalanced between patients with PD). Control participants are tested on two days without medication challenge to test for test-retest effects.
The dopamine system is tasked with motivating action and driving learning and thus lies at the core of adaptive behaviour. Whereas motivation of action has consistently been found to be impaired in Parkinson's disease (PD), less is known about the potential impairments of reward-based learning in PD. In this project, investigators use reinforcement learning (RL) as a computational framework for understanding the role of dopamine in reward-based learning in health and in PD. RL can be defined as learning based on prior experiences. A core concept of RL is the reward prediction error (RPE). An RPE is the deviation between an obtained reward and an expected reward that an agent (e.g. a person) gets in a given environment. The RPE is used to update future expectations in similar situations. Thus, RPEs are critical for the agent's ability to adapt its behavior to the environment. These RPEs are signalled by dopamine neurons in the ventral tegmental area (VTA). Recent advancements in the study of RL in mice have fundamentally changed our understanding of dopamine's role in reward-based learning. Dabney et al. (2020) discovered with single-cell recordings how dopamine signals in the VTA systematically differ in their RPE signals. They proposed a distributional RL (distRL) model where neurons can have diverse firing patterns. Neurons can be so-called "pessimistic" neurons which expect low rewards and can be positively surprised (as expressed by increased firing rate) by rewards that are even below-average, whereas "optimistic" neurons expect high rewards and can be negatively surprised (expressed by a decreased firing rate) even by above-average rewards. It remains to be examined whether distRL also applies to the dopamine system in the healthy human brain. Furthermore, if distRL does apply to the human brain, this framework might provide mechanistic insights into the cause of a range of different symptoms in PD. The importance of dopamine is particularly prominent in PD, a progressive neurodegenerative disease, in which the progressive loss of dopamine neurons in the midbrain causes the classical motor symptoms (referred to as parkinsonism) and contributes to non-motor disturbances, such as apathy and cognitive slowing. PD is the second most frequent age-related neurodegenerative disorder and the global burden of PD has more than doubled over the last two decades mainly as a result of increasing numbers of older people. Dopamine neurons have been suggested to be especially vulnerable to neurotoxicity. Interactive cascades of dopamine oxidation and mitochondrial stress due to aberrant calcium signaling are thought to be main causes of neurodegeneration. In this context, distRL and its representational implementation in dopaminergic cells leads to novel hypotheses: The pessimistic dopamine neurons (which are positively "surprised" by most outcomes and increase their firing rate) are exposed to higher oxidative mitochondrial stress over their lifetime than optimistic neurons and would thus be more prone to neurodegeneration than optimistic neurons. While the prominent motor symptoms are caused by massive degeneration in the SN, neurodegeneration in the less vulnerable VTA is less extensive, but still of relevant size. Consequently, already shortly after diagnosis, reward signaling in the VTA is significantly reduced. Thus, investigators of this project hypothesize that neurodegeneration in both the SN and VTA is biased towards "pessimistic" neurons. Consequently, for most reward outcomes in an environment, there are fewer neurons responding with dopamine release to an outcome (but the same number of neurons responding with a pause in firing). This results in a reduced overall dopamine response to most events, favouring anhedonia and apathy. Investigators of this project will use functional magnetic resonance imaging (fMRI) and derive functional maps of optimism/pessimism. During the fMRI experiment, participants will be shown different stimuli ("cards") and, upon squeezing a grip-force device, they win or lose a small amount of money each time, accumulating money as they play. The different stimuli have different probabilities of leading to a reward or loss event which participants learn through observation. Some trials are forced-choice trials with only a single stimulus available, other trials are open-choice trials where participants get to choose between stimuli and can thus affect their accumulated earnings by choosing the stimuli that they have observed to be most beneficial. The investigators will test the following main hypotheses: 1. As shown in mice, a diversity of "optimistic" and "pessimistic" dopamine reward signals exists in the human VTA and the ventro-rostral basal ganglia circuit. 2. "Pessimistic" neurons are more severely affected by neurodegeneration in PD. Furthermore, investigators will explore how potential shifts in the pessimism-optimism balance might be related to cognitive and motor symptoms in PD. In a second study-arm, investigators will also explore whether/how medication state affects the distribution of optimistic and pessimistic prediction error signals. Here, patients with PD are tested one day in the off-medication state, another day in the on-medication state (order counterbalanced between patients with PD). The pragmatic off-medication state implies that patients will not have taken their morning dose of antiparkinsonian medication before arrival. Control participants are tested on two days without medication challenge to test for test-retest effects.
Study Type
OBSERVATIONAL
Enrollment
140
Danish Research Centre for Magnetic Resonance (DRCMR), Hvidovre Hospital
Hvidovre, Denmark
RECRUITINGTopographic distribution of the degree of optimism or pessimism
The main outcome measure is the topographic distribution of the degree of optimism or pessimism of each voxel's prediction error signal across the brains' cortical and sub-cortical grey matter. Investigators focus on the dopaminergic midbrain regions VTA/SNc and their main target region, the striatum. In sub-study 1, this distribution will be compared between people with Parkinson's disease and healthy controls. In sub-study 2, the change in distribution between the off- and the on-medication state in patients with PD will be compared to the test-retest change in distribution in control participants.
Time frame: Day 1 and Day 2
Movement Disorder Society - Unified Parkinson's Disease Rating Scale (MDS-UPDRS)
MDS-UPDRS total score (sum of all 4 sub-scores listed below, range 0-260; higher score = worse outcome) and all four sub-scores. Sub-scores include: Part l subscore: Non-Motor Aspects of Experiences of Daily Living (nM-EDL; range 0-52; higher score = worse outcome); Part ll sub-score: motor experiences of daily living (M-EDL; range 0-52; higher score = worse outcome); Part lll sub-score: motor examination (range 0-132; higher score = worse outcome); Part lV sub-score: motor complications (range 0-24; higher score = worse outcome).
Time frame: Day 1 and Day 2
Major Depression Inventory (MDI)
Major Depression Inventory (MDI) total score (range 0-50; higher score = worse outcome)
Time frame: Day 1 and Day 2
Montreal Cognitive Assessment (MoCA)
Montreal Cognitive Assessment (MoCA) total score (range 0-30; higher score = better outcome) and sub-scores. Sub-scores include: Visuospatial/executive (range 0-5; higher score = better outcome); Naming (range 0-3; higher score = better outcome); Attention (range 0-6; higher score = better outcome); Language (range 0-3; higher score = better outcome); Abstraction (range 0-2; higher score = better outcome); Delayed recall (range 0-5; higher score = better outcome); Orientation (range 0-6; higher score = better outcome). Additionally, Memory Index Score (range 0-15; higher score = better outcome) is also examined.
Time frame: PD patients: ON-medication (Day 1 or Day 2, counterbalanced). Healthy controls: Day 1.
Apathy Scale (AS)
Apathy Scale (AS) total score (range 0-42; higher score = worse outcome)
Time frame: Day 1 and Day 2
Lille Apathy Rating Scale (LARS)
Lille Apathy Rating Scale (LARS) total score (range -36-36; higher score = worse outcome), sub-scores and factoral sub-scores. Sub-scores include: everyday productivity (EP); interests (INT); initiative (INI); novelty seeking (NS); motivation (M); emotional responses (ER); concern (C); social life (SL); self awareness (SA). Factorial sub-scores include: intellectual curiosity (IC); emotion (E); action initiation (AI); self awareness (SA). All sub-scores and factorial sub-scores have a range -4-4 and a higher score means worse outcome.
Time frame: Day 1 and Day 2
Edinburgh Handedness Inventory (EHI)
Edinburgh Handedness Inventory (EHI) handedness category (L, R, A) and Laterality Quotient (range -100-100; less than -40 means left-handed (L), -40 to 40 means ambidextrous (A), greater than 40 means right-handed (R)).
Time frame: PD patients: ON-medication (Day 1 or Day 2, counterbalanced). Healthy controls: Day 1.
Questionnaire for Impulsive-Compulsive Disorders in Parkinson's Disease - Rating Scale (QUIP-RS)
Questionnaire for Impulsive-Compulsive Disorders in Parkinson's Disease - Rating Scale (QUIP-RS) total score (range 0-112; higher score = worse outcome), total impulsive-compulsive disorder (ICD) score (range 0-64; higher score = worse outcome) and six sub-scores. Sub-scores include: sub-score A: Gambling (range 0-16; higher score = worse outcome), sub-score B: Sex (range 0-16; higher score = worse outcome), sub-score C: Buying (range 0-16; higher score = worse outcome), sub-score D: Eating (range 0-16; higher score = worse outcome), sub-score E: Hobbyism-Punding (range 0-32; higher score = worse outcome) and sub-score F: PD-Medication Use (range 0-16; higher score = worse outcome).
Time frame: Day 1 and Day 2
Snaith-Hamilton Pleasure Scale (SHAPS)
Snaith-Hamilton Pleasure Scale (SHAPS) total score (range 0-14; higher score = worse outcome)
Time frame: Day 1 and Day 2
Temporal Experience of Pleasure Scale (TEPS)
Temporal Experience of Pleasure Scale (TEPS) total score (range 18-108; higher score = better outcome) and two sub-scores. Sub-scores include: Anticipatory (ANT; range 10-60; higher score = better outcome) and consummatory (CON; range 8-48; higher score = better outcome).
Time frame: Day 1 and Day 2
Paired Associates Learning (PAL) from Cambridge Neuropsychological Testing Automated Battery (CANTAB), Total errors adjusted
Total errors adjusted (for expected errors on attempts that were not reached).
Time frame: Day 1 and Day 2
Paired Associates Learning (PAL) from Cambridge Neuropsychological Testing Automated Battery (CANTAB), Mean errors to success
Mean errors to success (number of trials required to locate the pattern(s) correctly).
Time frame: Day 1 and Day 2
Paired Associates Learning (PAL) from Cambridge Neuropsychological Testing Automated Battery (CANTAB), First attempt memory score
First attempt memory score (number of times a pattern was correctly recalled on the first attempt).
Time frame: Day 1 and Day 2
Paired Associates Learning (PAL) from Cambridge Neuropsychological Testing Automated Battery (CANTAB), Number of patterns reached
Number of patterns reached (number of patterns on the last problem being reached).
Time frame: Day 1 and Day 2
Neural activity (BOLD response)
Beyond the primary outcome measure (topographic distribution of the degree of optimism/pessimism of prediction error signals), other neural responses to experimental events (motor responses, contrasts between neural responses to winning vs. losing money) will be explored.
Time frame: Day 1 and Day 2
Response time
Time from stimulus presentation to response during fMRI task
Time frame: Day 1 and Day 2
Grip force
Maximally achieved grip force during responses
Time frame: Day 1 and Day 2
Response vigor
Maximally achieved vigor ("yank"/slope) of each grip response
Time frame: Day 1 and Day 2
This platform is for informational purposes only and does not constitute medical advice. Always consult a qualified healthcare professional.