Basolateral amygdala (BLA) neurons are phasically responsive to reward-predictive cues
8–11, which is consistent with the idea that cue-evoked neuronal firing emerges as a consequence of cue–reward associations. The BLA is composed of multiple nuclei, including the LA, the first site of convergence for sensory inputs carrying information about conditioned and unconditioned stimuli to the amygdala
1,4,12–15. Thus, the LA is a likely initial site for the formation of cue–reward associations that endow the cue with motivational significance that impacts on reward-seeking behaviour.
To test the hypothesis that successful acquisition of cue-directed reward-seeking behaviour is dependent on neuronal plasticity in the LA, we examined LA neuronal firing in response to a reward-predictive cue during training on a sucrose self-administration task (
Supplementary Fig. 1). To control for neural activity associated with the motor output of operant responding, and to ensure that the sensory cue predicted reward delivery and not the operant response alone, beam breaks at a nose-poke response port (‘nose-poke operandum’) were reinforced with a cue and sucrose reward after about 50% of nose-pokes (). In rats that successfully acquired this task (see Methods), about half of recorded neurons (49%; 60 of 122 neurons from seven rats during the first session in which each rat met the acquisition criterion) that did not respond to the cue before acquisition developed a robust phasic response to cue onset with acquisition (, and
Supplementary Fig. 2). Cue encoding increased across sessions: the cue-evoked population response of all neurons recorded in the third session was enhanced relative to the first session (session × time interaction,
F9,1944 = 4.15, P < 0.0001), specifically within the 50 ms after cue onset (P < 0.003; ,
Supplementary Fig. 3 and Supplementary Table 1). These changes over sessions were predictive of behaviour: increasing proportions of neurons were recruited to encode the reward-predictive cue as individual rats improved reward-learning performance (). Task efficiency, a behavioural index defined as the number of rewards earned divided by the number of cues presented, and task accuracy, a behavioural index defined as the difference in the number of correct and incorrect port entries divided by the total number of port entries, were significantly correlated (P < 0.0001 and P = 0.0066, respectively;
Supplementary Table 2) with the percentage of neurons per rat that showed phasic responses to the reward-predictive cue (). Control studies confirmed that the increase in cue encoding is specific to acquisition of the cue–reward association and is not due to non-associative factors, such as sensitization (
Supplementary Figs 4 and 5). These data demonstrate that development of cue-evoked responses in the LA depends on the acquired reward-predictive nature of the cue. Further, the greater the proportion of neurons recruited to encode the reward-predictive cue, the better the rat learned the cue–reward association, and the more successful the rat was at earning rewards.
Because our
in vivo recordings showed rapidly occurring changes in cue-related firing in the LA during successful cue–reward learning, we proposed that the mechanism underlying these changes was an increase in synaptic strength of thalamic or cortical sensory afferents onto LA neurons; we tested this hypothesis with
ex vivo experimentation (
Supplementary Fig. 6). Rats were trained on a single session of the same behavioural model and classified as learners (top 50%) or non-learners (bottom 50%) as defined by our learning indices of task efficiency and task accuracy (
Supplementary Fig. 7). Any unearned sucrose was delivered in the home cage immediately after the session, ensuring that all rats received the same amount of sucrose. Brains were collected about 30 min after the end of the session for the preparation of acute slices of the LA. We stimulated the internal or external capsule to evoke excitatory postsynaptic currents (EPSCs) from thalamic or cortical afferents, respectively, and used whole-cell patch-clamp techniques within visually identified pyramidal neurons to measure EPSCs containing AMPA receptor (AMPAR)-mediated and NMDA receptor (NMDAR)-mediated currents. We found that the AMPAR/NMDAR ratio, an index of glutamatergic synaptic strength
16,17, varied with task performance and afferent (main effects of group,
F2,29 = 11.01,
P < 0.001; afferent,
F1,29 = 22.13,
P < 0.001; group × afferent interaction,
F2,29 = 7.38,
P < 0.004) such that learners had a larger AMPAR/NMDAR ratio at thalamic synapses (
P < 0.001; learners, 1.03 ± 0.04; non-learners, 0.58 ± 0.08; naives, 0.47 ± 0.05 (means ± s.e.m.)) but not cortical synapses (learners, 0.45 ± 0.08; non-learners, 0.46 ± 0.10; naives, 0.47 ± 0.04) in the LA relative to non-learners and naives, which did not differ from each other (
P = 0.84; ). We determined the correlation between each rat’s behavioural performance, as measured by either task efficiency or task accuracy, and the AMPAR/NMDAR ratio, and found a significant positive relationship at thalamic inputs (
P = 0.0003 and
P = 0.006, respectively) but not cortical inputs (
P = 0.89 and
P = 0.55, respectively; and
Supplementary Table 3). Hence, thalamo-amygdalar synaptic strength predicted the success of individual rats’ reward-learning performance.
A change in the relative contribution of AMPARs and NMDARs to compound EPSCs may reflect an increase inAMPAR currents and/or a decrease in NMDAR currents at thalamo-amygdalar synapses. To determine whether AMPAR currents were modified during reward learning, we examined AMPAR-mediated miniature EPSCs (mEPSCs), which reflect spontaneously released vesicles of glutamate
18. Typically, an increase in mEPSC amplitude indicates an increase in postsynaptic AMPAR number or function, whereas an increase in mEPSC frequency indicates an increase in the probability of transmitter release (
Pr) or in the number of synapses
18. mEPSC amplitude was related to task performance (
F2,29 = 30.75,
P < 0.001), with a greater mean amplitude from LA neurons of learners (
P < 0.001; 15.88 ± 0.89 pA) than from those of non-learners (9.98 ± 0.29 pA) or naives (10.05 ± 0.39 pA), which did not differ from each other (
P = 0.87; ). In contrast, the mean mEPSC frequency was not different (
F2,29 = 0.5,
P = 0.61) in learners (6.45 ± 1.48 Hz), non-learners (5.36 ± 1.16 Hz) and naives (4.96 ± 1.14 Hz) (). To examine further whether learning altered
Pr, we examined the paired-pulse ratio
19 (inter-stimulus interval 50 ms; ). There was no change in the paired-pulse ratio for either afferent (
F1,33 = 0.02,
P = 0.89) among naives, non-learners or learners (main effect of group,
F2,33 = 0.35,
P = 0.71; group × afferent interaction,
F2,33 = 0.40,
P = 0.67), indicating that learning does not cause an immediate change in
Pr and that the rapid increase in AMPAR/NMDAR ratio is mediated postsynaptically.
The induction of associative long-term potentiation in the LA depends on the activation of NMDARs
20,21, which can lead to increases in AMPAR currents
18. In addition, NMDAR blockade within the BLA impairs acquisition, but not performance, in two similar appetitive tasks
22,23. To test whether the learning-induced synaptic changes we observed are dependent on NMDAR activation, we locally infused the NMDAR antagonist AP5 (3 µg per side) or vehicle (artificial cerebrospinal fluid; aCSF) into the LA bilaterally before training (
Supplementary Fig. 8). To control for the possibility that synaptic changes might be secondary to, rather than causal for, reduced behavioural performance, we included a third group in which rats received unilateral intra-LA infusions of AP5 and contra-lateral infusions of aCSF to provide a within-animal control. Task efficiency was impaired by AP5 (
F2,12 = 9.03,
P < 0.005) after both bilateral (
P < 0.007) and unilateral (
P < 0.018) intra-LA pre-training infusions (); bilateral, but not unilateral, intra-LA infusions of AP5 also impaired task accuracy (
F2,12 = 7.38,
P < 0.009; aCSF versus bilateral AP5,
P < 0.009; ). The effect of AP5 was not attributable to a spread of drug into the neighbouring central nucleus of the amygdala (
Supplementary Fig. 9).
After intra-LA infusions and the training session, brains from these rats were collected for the preparation of acute slices. Rats that received bilateral intra-LA infusions of AP5 showed a lower mean amplitude of mEPSCs (P = 0.003; 10.26 ± 0.41 pA; ) than after infusions with aCSF (13.09 ± 0.68 pA; ), whereas there was no change in mEPSC frequency between groups (P = 0.66; ). The decrease in task efficiency and the decrease in mEPSC amplitude after local infusion of an NMDAR antagonist suggest that cue–reward learning and the associated increase in AMPAR number or function are dependent on NMDAR activation. By comparing mEPSCs from rats with unilateral intra-LA AP5 infusions and contralateral aCSF infusions, we were able to determine with confidence that any differences between LA neurons treated with AP5 or aCSF are due to local NMDAR blockade rather than to an AP5-induced difference in task performance. Within subjects, we found that the amplitude of LA mEPSCs recorded after AP5 infusion into the LA on one side was significantly lower (P < 0.001; ) relative to aCSF infusion on the contralateral side (), whereas there was no difference in frequency (P = 0.99; ). Local NMDAR blockade therefore attenuates the learning-dependent increase in postsynaptic AMPAR currents and impairs the acquisition of reward-directed behaviour.
These results show that, with cue–reward learning, cue-responsive neurons are rapidly recruited
in vivo, thalamo-amygdalar synapses are selectively strengthened, and LA neurons show NMDAR-dependent increases and associated potentiation of AMPAR number or function. The proportion of cells recorded
in vivo that developed a response to the reward-predictive cue is less than the proportion of cells that showed enhanced synaptic strength with learning (). This suggests that the integration of multiple inhibitory and excitatory synapses on a given cell may constrain cue-related spike firing
3,24,25, even if that cell possesses enhanced thalamic inputs. The thalamic pathway is under strong inhibitory suppression
20,24 in vivo, whereas our
ex vivo recordings were performed under γ-aminobutyric acid (GABA)
A-receptor antagonism to isolate EPSCs.
The parallel emergence of increased synaptic strength and cue-related firing in the LA neurons during reward learning suggests that this excitatory synaptic increase contributes to enhanced spike activity of LA neurons in response to the conditioned stimulus, driven by auditory and visual thalamic inputs that terminate in the LA
1,12. Consistent with our results, auditory fear conditioning, which requires an intact LA
1,2,4, increases neuronal firing in response to a shock-predictive cue and potentiates transmission at thalamo-amygdalar synapses
26 by an NMDAR-dependent mechanism, probably a result of postsynaptic AMPAR trafficking
27. Previous work on fear conditioning suggests that plasticity also occurs at cortical
28 synapses in the LA, although this enhancement was found at later time points than tested here. Single-unit recordings in the LA show that the thalamic pathway conditions more rapidly than the cortical pathway during fear conditioning
1,29. Our findings, viewed in the context of fear conditioning, prompt further experimentation to determine whether rapidly occurring reward-learning-induced plasticity at thalamo-amygdalar synapses facilitates subsequent consolidation at other sites
30.
These findings indicate that rapid synaptic changes in the LA occur during the early stages of cue–reward learning. It is likely that this plasticity permits amygdala neurons to respond selectively to meaningful environmental stimuli and transmit this information to downstream brain regions for the expedited selection of an adaptive behavioural output.