PMCCPMCCPMCC

Search tips
Search criteria 

Advanced

 
Logo of nihpaAbout Author manuscriptsSubmit a manuscriptNIH Public Access; Author Manuscript; Accepted for publication in peer reviewed journal;
 
Nat Neurosci. Author manuscript; available in PMC Mar 1, 2010.
Published in final edited form as:
PMCID: PMC2774785
NIHMSID: NIHMS145299
Dynamic reorganization of striatal circuits during the acquisition and consolidation of a skill
Henry H Yin,1,2 Shweta Prasad Mulcare,1 Monica R F Hilário,1 Emily Clouse,1 Terrell Holloway,1 Margaret I Davis,1 Anita C Hansson,3 David M Lovinger,1 and Rui M Costa1
1Laboratory for Integrative Neuroscience, National Institute on Alcohol Abuse and Alcoholism, US National Institutes of Health, 5625 Fishers Lane, MSC 9411, Bethesda, Maryland 20852-9411, USA.
2Department of Psychology and Neuroscience, Center for Cognitive Neuroscience, Duke University, 103 Research Drive, Box 91050, Durham, North Carolina 27708, USA.
3Laboratory of Clinical and Translational Studies, National Institute on Alcohol Abuse and Alcoholism, US National Institutes of Health, 10 Center Drive, Bethesda, Maryland 20892-1108, USA.
Correspondence should be addressed to R.M.C. (costarui/at/mail.nih.gov).
The learning of new skills is characterized by an initial phase of rapid improvement in performance and a phase of more gradual improvements as skills are automatized and performance asymptotes. Using in vivo striatal recordings, we observed region-specific changes in neural activity during the different phases of skill learning, with the associative or dorsomedial striatum being preferentially engaged early in training and the sensorimotor or dorsolateral striatum being engaged later in training. Ex vivo recordings from medium spiny striatal neurons in brain slices of trained mice revealed that the changes observed in vivo corresponded to regional- and training-specific changes in excitatory synaptic transmission in the striatum. Furthermore, the potentiation of glutamatergic transmission observed in dorsolateral striatum after extensive training was preferentially expressed in striatopallidal neurons, rather than striatonigral neurons. These findings demonstrate that region- and pathway-specific plasticity sculpts the circuits involved in the performance of the skill as it becomes automatized.
Learning to execute and automatize certain actions is essential for survival. The learning of new skills by trial and error, such as riding a bicycle or playing a piano, is characterized by an initial stage of rapid improvement in performance, followed by a phase of more gradual improvements as the skills are consolidated and performance asymptotes14. These two different phases of skill learning have distinct behavioral and physiological hallmarks1,57. For example, the early fast phase is susceptible to interference, whereas the later, more automatic, phase is more resistant to interference5. After consolidation, the memory of how to do things is long-lasting and it can last a lifetime for well-learned skills.
Previous studies have shown that changes occur in neural activity in the striatum, the major input nucleus of the basal ganglia, during motor and procedural learning813. Some studies also suggested that the striatal circuits and processes engaged during the early and late phases of skill learning may differ2,3,14. For example, the dorsomedial or associative striatum (DMS, roughly homologous to the caudate in primates), which receives input primarily from association cortices such as the prefrontal cortex15,16, seems to be preferentially involved in the initial stages of visuomotor learning and during the rapid acquisition of action-outcome contingencies2,3,17. On the other hand, the dorsolateral or sensorimotor striatum (DLS, roughly homologous to the putamen in primates), which receives inputs from sensorimotor cortex15,16, is critical for the more gradual acquisition of habitual and automatic behavior2,3,18.
We recorded neural activity in the associative and sensorimotor striatal regions during the different stages of skill learning in vivo and found that the task-related activity in these striatal regions differed during the acquisition and consolidation of a new skill, with the DMS being engaged during the early phase and the DLS being engaged during the late phase. We confirmed the differential involvement of these striatal regions in the different stages of skill learning using selective excitotoxic lesions of the dorsal striatum. We next investigated whether the changes in striatal neural activity observed during skill learning could be mediated by synaptic plasticity or excitability changes in medium spiny projection neurons in the dorsal striatum using an ex vivo approach and found that learning was accompanied by long-lasting changes in glutamatergic transmission. These changes evolved dynamically during the different phases of skill learning; changes in the DMS were predominant early in training, whereas changes in the DLS emerged only after extensive training. Finally, we found that these long-lasting changes in the DLS after extensive training were pathway specific and occurred predominantly in dopamine receptor 2 (D2)-expressing striatopalidal medium spiny neurons. Consistent with this observation, the performance of the skill after extended training became less dependent on the activation of D1-type dopamine receptors, which are mainly expressed in striatonigral neurons.
DMS and DLS involvement in early and late skill learning
To investigate whether the different regions of the dorsal striatum were differentially engaged during the early and late phases of skill learning, we simultaneously recorded the activity of medium spiny neurons in the dorsolateral and dorsomedial striatal regions in vivo during the different phases of skill learning (Fig. 1a,b). Motor-skill learning on the accelerating rotarod has been shown to have distinct early and late phases14,19. We bilaterally implanted six C57/Bl6J mice with micro-electrode arrays that were designed to target these two striatal regions simultaneously (Supplementary Fig. 1 online) and trained the mice with ten trials a day for 8 d on an accelerating rotarod (see Methods; Fig. 1c). We then examined the changes in neuronal activity during the first two trials of training (naive), the last two trials of the first day (early) and the last two trials on day 8 (late). We quantified the modulation of the activity of putative medium spiny neurons during running versus the intertrial period, when the animal rested at the bottom of the apparatus in a relatively immobile state, using a rate modulation index of equation M1 (Fig. 1b,d). In the DMS, the rate modulation increased during the early phase of training (F5, 458 = 3.7, post hoc P < 0.05), but returned to naive levels with further training (no difference between naive and late phase, post hoc P > 0.05). In the DLS, however, the modulation of firing rate increased gradually with training (post hoc P < 0.05). There was no difference between the naive group and the early group (P > 0.05), but the rate modulation was significantly increased late in training compared with both naive and early groups (P < 0.05; Fig. 1d). The increased firing rate modulation in DLS during the late phase of skill learning was accompanied by a decrease in baseline firing rate during the intertrial period (naive = 2.9 ± 0.4 Hz, early = 3.2 ± 0.5 Hz, late = 1.9 ± 0.3 Hz; planned comparisons: naive versus late, P = 0.07; early versus late, P < 0.05), suggesting that the signal-to-noise ratio increased in DLS after extended training.
Figure 1
Figure 1
In vivo recordings of neuronal activity in the DMS and DLS striatal regions during different phases of skill learning. (a) Example of a single-unit waveform (vertical bar 20 μV; horizontal 200 μs), interspike interval and cluster separation (more ...)
Overall, the number of task-related neurons (in which firing rate was significantly modulated during running compared to the intertrial period, P < 0.01, see Methods) remained relatively constant throughout training in DMS (naive = 70%, early = 78%, late = 78%), but increased slightly with training in DLS (naive = 74%, early = 80%, late = 89%). We also measured the proportion of neurons in each striatal region that were either positively (increased firing rate during running) or negatively (decreased firing rate during running) modulated during running (P < 0.01). In the DMS, there was a transient increase in the percentage of positively modulated neurons early in training (χ21,87 = 7.9, P < 0.05), but the percentage returned to the level of the naive condition with extended training (Fig. 1e). Conversely, more neurons became positively modulated in the DLS after extended training, whereas fewer neurons became negatively modulated (χ21,160 = 7.17, P < 0.05; Fig. 1e).
We next assessed the effect of selective excitotoxic striatal lesions on the different phases of motor skill learning. Lesions of the dorsomedial or the dorsolateral striata did not alter initial performance on the rotarod, as indicated by similar levels of performance in lesioned and sham mice during the first two trials (naive DMS: F1,19 = 1.3, P > 0.05; DLS: F1,48 = 1.3, P > 0.05; Fig. 1f and Supplementary Fig. 2 online). However, lesions of the DMS reduced performance on the rotarod specifically during the early phase of training, whereas these lesions no longer impaired performance after extended training (early: F1,19 = 4.6, P < 0.05; late: F1,19 = 1.5, P > 0.05), suggesting that the DMS is only required during the early phase. On the other hand, excitotoxic lesions of the DLS affected skill learning during the early phase and continued to impair performance substantially during the late phase (early: F1,48 = 9.3, P < 0.05; late: F1,48 = 10.5, P < 0.05), indicating that the performance of the skill in the late phase requires the DLS, but not the DMS.
Region-specific plasticity during early and late learning
We next investigated whether the region-specific changes observed in vivo were driven by glutamatergic synaptic plasticity or excitability changes in the striatum by training C57/Bl6J mice for either 1 d (early phase) or 8 d (late phase) on an accelerating rotarod (Fig. 2a) and investigating ex vivo the changes in DMS and DLS related to the different phases of skill learning (Fig. 2b). We observed that performance improved rapidly during the first day of training (early group: F9,339 = 7.75, P < 0.05, post hoc first versus last trial, P < 0.05) and asymptoted after 3 d of training (late group: F70,247 = 70, P < 0.05, post hoc; for the first trial of each day, latency increased significantly until day 3, and for the last trial of each day, latency increased until day 2). Coronal slices containing the striatum were taken from trained and naive mice at least 72 h after the last training session.
Figure 2
Figure 2
Ex vivo striatal field potential recordings in the DMS and DLS during different phases of skill learning. (a) Performance of the animals on the accelerating rotarod for the early and late groups of ex vivo electrophysiology experiments. (b) Placement (more ...)
We first examined the effect of training on the magnitude of the evoked striatal field potentials using an input-output analysis. We found that the population spike amplitude in the DMS was larger in naive animals and decreased after extended training (F2,38 = 3.22, P < 0.05, post hoc naive versus late, P < 0.05; Fig. 2c). In contrast, the average population spike amplitude in the DLS was smaller in naive animals, but increased with training (F2,25 = 9.01, P < 0.05, post hoc naive and early versus late, P < 0.05; Fig. 2c). Consistent with these findings the amplitude of the half maximal evoked response was higher in the DMS than in the DLS in naive animals (F1,27 = 10.7, P < 0.05; Fig. 2d), whereas the half maximal response amplitude was higher in the DLS after extended training (F1,27 = 7.01, P < 0.05). During the early training phase, the evoked responses were equally large in these two striatal regions (F1,12 = 0.31, P < 0.05; Fig. 2c,d). Such results indicate that there are region-specific changes in responsiveness to afferent input stimulation as a skill is acquired and consolidated, which are consistent with the changes in neural activity that are observed in vivo.
These training-dependent changes in evoked population spike amplitude could be the result of changes in synaptic strength or changes in excitability. To investigate whether the changes in the input-output function in these regions of dorsal striatum corresponded to changes in synaptic strength, we performed ‘saturation’ experiments using a well-established protocol for striatal long-term depression (LTD)20,21. If there was substantial synaptic potentiation in the DLS after extended training, then the synapses in this region would be further away from the ‘floor’ of the modification range22 and thus would be capable of undergoing further depression. We therefore predicted that it would be more difficult to saturate LTD in the DMS early in training, whereas it would take more inducing attempts to saturate LTD in the DLS after extended training. After 15 min of baseline recording, we applied tetanic stimulation four times, 30 min apart (each induction attempt consisted of two trains of 100 pulses of 10-μs duration delivered at 100 Hz, delivered 10 s apart, using a stimulation strength that produced half the maximal response). In the DMS, the magnitude of the LTD induced after four tetani was greater at the onset of training (naive and early phase), but decreased with extended training, whereas the amount of induced LTD in the DLS increased in animals that received extended training (Fig. 2e,f). In naive animals, more tetani were needed to reach LTD saturation in the DMS than in the DLS, indicating that DMS synapses were further away from saturation than DLS synapses (naive F1,29 = 4.18, P < 0.05; Fig. 2e). Conversely, more tetani were needed to reach LTD saturation in the DLS than in the DMS after extended training (F1,29 = 4.54, P < 0.05; Fig. 2e). This pattern of results suggests that extended rotarod training results in synaptic potentiation in the DLS.
Training-induced synaptic plasticity in DMS and DLS
To further investigate the nature of these changes, we carried out ex vivo whole-cell patch-clamp recordings from striatal medium spiny neurons in slices from trained and naive mice. We measured the excitatory postsynaptic potential (EPSP) slope in response to increasing afferent stimulation strengths in each striatal region (Fig. 3a). Consistent with our observations in the field recordings, the average synaptic strength was higher in the DMS earlier in training, but increased in the DLS after extended training (F4, 221 = 6.7, P < 0.05; post hoc tests comparing late training group with all other groups, P < 0.05). During the early phase, the difference between the EPSP slope elicited in the dorsomedial and dorsolateral regions increased compared with naive animals, as evidenced by a larger EPSP slope being elicited even at our minimal stimulation strength (post hoc 0.2 mA, P < 0.05; Fig. 3b), but this difference disappeared during the late training phase.
Figure 3
Figure 3
Ex vivo striatal whole cell recordings in the DMS and DLS during the different phases of skill learning. (a) Input-output function of EPSP slope versus stimulation strength. Representative traces are presented on the left. (b) The same measure is presented (more ...)
These changes in EPSP slope were specifically related to learning the skill and were not just induced by being exposed to the rotarod, as they were not observed in yoked animals that had similar handling and were placed on the rod for the same amount of time (Fig. 3b). In addition, we did not observe any changes in paired pulse ratio (which could indicate changes in the probability of presynaptic glutamate release; Supplementary Fig. 3 online), input resistance and the resting membrane potential of the neurons (Supplementary Table 1 online). Because the EPSP slope is proportional to the fast AMPA receptor–mediated component of glutamatergic transmission, these results suggest that extended training results in a long-lasting potentiation of the AMPA component specifically in the DLS. We further confirmed that the potentiation observed in the DLS was the result of an increase in the AMPA receptor component by recording spontaneous excitatory postsynaptic currents (sEPSCs) from medium spiny neurons in the DLS after extended training. We found that sEPSC amplitude (t13 =3.3, P < 0.05), but not frequency (t13 = 1.6, P > 0.05), was increased after extensive training, an indication of a postsynaptic (but not presynaptic) site of expression for the plasticity (Supplementary Fig. 4 online).
We measured the relative amplitude of synaptic currents mediated by the NMDA and AMPA glutamatergic receptors in voltage-clamp recordings. To our surprise, we found a significant increase in the NMDA/AMPA ratio in the overtrained DLS, suggesting that the NMDA-mediated current increased even more than the AMPA component in this region after extended training (main effect of group: F4, 54 = 5.36, P < 0.05; post hoc comparisons, late group higher than all other groups, P < 0.05; Fig. 3c). No such increase in NMDA/AMPA ratio was observed in the DMS (F4, 39 = 0.35, P > 0.05).
We also measured intrinsic excitability by injecting current into medium spiny neurons and recording the number of action potentials that were generated (Fig. 4). Although we observed a small reduction in excitability in the DMS after extended training, we found a significant increase in excitability in the DLS, already evident during the early phase and maintained during the late phase (naive and early versus late, planned comparisons at 600 pA of injected current, P < 0.05; Fig. 4a). This is consistent with the observation of increased population spike amplitude in DLS early in training (Fig. 2c) and with the effects of the DLS lesions early in training (Fig. 1f). However, the increase in excitability in the DLS cannot be attributed solely to the learning of the skill, as it was also observed in yoked controls (late versus late yoked, planned comparison, P > 0.05; Fig. 4b), suggesting that these changes could arise from other factors such as the stress associated with the task. In the DMS, although the excitability was significantly reduced in the late phase group compared with the early or the naive conditions (planned comparisons at 600 pA of injected current, P < 0.05), there was no significant difference between the late group and its yoked controls (P > 0.05), suggesting that this reduction in excitability may not be directly associated with learning.
Figure 4
Figure 4
Training results in region-specific changes in excitability and in threshold to spike in the DLS and the DMS. (a) Changes in excitability in DMS and DLS during the different phases of skill learning. Representative traces are shown on the left. The number (more ...)
To examine the consequences of the changes in glutamatergic transmission and excitability for the relation between afferent excitatory input and the spiking of medium spiny neurons (output), we measured the effects of training on the threshold for spiking, as measured by the intensity of afferent stimulation needed to evoke an action potential in medium spiny neurons (Fig. 4c). We found that the average afferent stimulation intensity necessary to evoke a spike was lower in the DMS than in the DLS early in training (F1,28 = 6.46, P < 0.05; Fig. 4c,e). With extended training, however, the threshold afferent stimulation necessary to evoke spiking increased in the DMS and decreased significantly in the DLS (interaction training × region: F1,60 = 9.634, P < 0.05; post hoc DMS versus DLS early, P < 0.05; DLS early versus late, P < 0.05; Fig. 4e). Because the EPSP magnitude that is necessary to evoke a spike did not change with training, this decrease in spike threshold in the DLS was probably caused by increased synaptic strength and not by increased excitability (Fig. 4d,e). These results are consistent with the firing rate modulation that was observed in these two striatal areas during skill acquisition and consolidation in vivo (Fig. 1) and suggest that these changes were mediated by long-lasting plasticity of glutamatergic synapses onto medium spiny neurons.
Circuit-specific plasticity after extended training
We observed potentiation of glutamatergic transmission in about 55% of the neurons recorded in the DLS after extended training. Medium spiny neurons projecting preferentially to the substantia nigra (striatonigral or ‘direct’ pathway) and medium spiny neurons projecting to the external globus pallidus (striatopallidal or ‘indirect’ pathway) have different dopamine receptor expression, different physiological properties and different plasticity mechanisms2326. Until recently, long-term potentiation (LTP) induction in the striatum was thought to require D1-receptor activation27, suggesting that it would occur predominantly in striatonigral neurons, which preferentially express D1 receptors, and not in striatopalidal neurons, which mainly express D2 receptors. However, recent studies have shown that LTP can occur in both types of neurons, but by different mechanisms26. We therefore examined whether the long-lasting potentiation observed in DLS after extended training occurred in both striatonigral and striatopalidal neurons by recording from medium spiny neurons in the DLS of D2–enhanced green fluorescent protein (EGFP) mice that were extensively trained on the rotarod (Fig. 5a,b). These mice express EGFP under the D2 promoter and therefore allow the visualization of neurons that express D2 receptors, which are almost exclusively striatopallidal neurons (Fig. 5a)26. We observed that rotarod training resulted in a slight, but not significant, potentiation in the non–D2-expressing neurons (putative striatonigral neurons, no interaction between training and stimulation strength, F1,20 = 0.49, P > 0.05; no effect of training, F1,20 = 2.35, P > 0.05; no significant post hoc at each stimulation strength), but in a much greater potentiation in the D2-expressing neurons (striatopallidal, interaction between training and stimulation strength, F1,26 = 4.10, P < 0.05; effect of training, F1,26 = 4.54, P < 0.05; post hoc significant at 1-mA stimulation strength) in the DLS of extensively trained animals (Fig. 5c). No training-induced differences were observed in paired pulse ratio and excitability in D2-EGFP neurons and non–D2-EGFP neurons (Supplementary Fig. 5 online).
Figure 5
Figure 5
Extensive training results in pathway-specific plasticity. (a) GFP in D2-EGFP mice was not expressed in all medium spiny neurons (left, EGFP expression in green and nuclear staining with DAPI in blue). D1 receptor expression did not colocalize with GFP (more ...)
The difference in the magnitude of excitatory transmission potentiation between the striatonigral and striatopallidal neurons suggests that the indirect pathway becomes more engaged relative to the direct pathway as a skill is automatized. Whether its increased role is in suppressing unnecessary and competing motor programs or in facilitating automatic motor programs remains unclear. In rats, D1 receptor expression is slightly more prominent in ventrolateral and ventromedial striatum than in dorsolateral striatum, whereas D2 receptor expression is more abundant in dorsolateral striatum than in dorsomedial striatum28,29. By measuring D2 receptor binding in C57Bl6/J mice, we verified that D2 receptor membrane expression was much higher in DLS than DMS (F1,11 = 122,5, P < 0.05; Fig. 5d), whereas membrane expression of D1 receptors was more similar between DMS and DLS (F1,10 = 0.15, P > 0.05; Fig. 5d). Therefore, we tested whether blocking D1 or D2 dopamine receptors, which are preferentially expressed in striatonigral and striatopallidal neurons, respectively, would differentially affect the performance of the skill early and late in training.
We verified that blockade of D1 and D2 receptors in naive animals rendered the animals completely akinetic and incapable of staying on the rod during the first two trials of learning (overall effect, F3,19 = 20.3, P < 0.05; post hoc saline versus D1 + D2, P < 0.05; Fig. 5e and Supplementary Video 1 online). Blockade of either D1 or D2 receptors alone also affected the performance of naive animals in the rotarod, although to a lesser extent than blockade of both D1 and D2 receptors (post hoc saline versus D1 or D2, P < 0.05; D1 + D2 versus D1 or D2, P < 0.05; D1 versus D2, P > 0.05). During the early stage of training, the results were similar to those observed in naive animals (F3,57 = 50.2, P < 0.05; post hoc as in naive). However, after extensive training, D1 blockade no longer affected performance of the skill (F3,32.3 = 32.3, P < 0.05; post hoc saline versus D1, P > 0.05), whereas D2 blockade still affected the performance substantially (saline versus D2, P < 0.05). This suggests that D1 receptor activation is no longer necessary for the performance of the skill after extended training (Supplementary Video 2 online). Consistently, the performance of mice with both D1 and D2 receptors blocked was similar to the performance of animals with D2 blockade alone (D1 + D2 versus D2, P > 0.05).
Taken together, these data suggest that, with extensive training, the potentiation of excitatory transmission onto medium spiny neurons of the DLS occurs predominantly in D2 receptor–expressing striatopallidal neurons at the same time that the performance of the skill becomes less dependent on the activation of D1 receptors, which are mainly expressed in striatonigral neurons.
The data presented here suggest that there is a substantial functional reorganization of the dorsal striatum during the acquisition and automatization of a motor skill. Long-term synaptic plasticity in the striatum has been observed with a variety of protocols in brain slices and in anesthetized animals20,21,30. To the best of our knowledge, the data reported here, both in vivo, in awake behaving mice, and ex vivo, in striata taken from trained mice, constitute the first evidence of region-specific and pathway-specific long-lasting synaptic plasticity in the striatum during the acquisition and consolidation of a skill. Taken together with previous studies showing that mice that lack LTP specifically in the striatum are impaired in skill learning31 and that LTP can be induced in the striatum in vivo by intracranial self-stimulation30, our studies indicate that long-lasting potentiation of glutamatergic transmission in the striatum is necessary for skill learning.
A notable observation is the pronounced difference between the direction of changes observed in dorsolateral (sensorimotor) striatum and dorsomedial (associative) striatum during the different phases of skill learning. This pattern is consistent with previous work in both rats and monkeys suggesting distinct functional roles for these regions on the basis of the origin of their main cortical afferents, that is, sensorimotor versus association cortices2,3,17,18. In the associative striatum (DMS), potentiation of synaptic strength is only observed early in training, with extended training resulting in a return of synaptic strength back to naive levels. In contrast, no substantial potentiation of synaptic strength occurs during the early learning phase in the sensorimotor striatum (DLS), but long-lasting potentiation of glutamatergic transmission develops with extended training. How this long-lasting potentiation in DLS relates to skill automatization is still not clear, especially given that it was observed in a substantial proportion of the neurons; one possibility is that it could subserve the generation of central pattern generator–type ensembles in sensorimotor cortico-thalamo-striatal circuits32. Furthermore, it remains to be determined whether this long-lasting potentiation observed in DLS is accompanied by structural plasticity (for example, by an increase in the number of spines and synapses).
A straightforward explanation for these observations is that early on in acquisition, during the fast phase of skill learning, the inputs from associative cortices into dorsomedial striatum are preferentially strengthened, whereas the inputs into sensorimotor striatum are gradually potentiated with extended training, as shown by an increase in synaptic strength, excitability, NMDA currents and firing rate modulation during running on the rotarod. These results are consistent with recent in vivo work showing that the coordination between cortical areas and striatal regions has a mediolateral gradient33. These findings provide a possible mechanism to explain previous results showing that these different striatal regions are involved in controlling goal-directed actions and automatic habitual responses, respectively, and that, with extended training, the control over behavior by the goal-directed system is replaced by the habit system2,3,17,18,3436. Our lesion results suggest a model in which fast and slow skill learning develop in parallel in DMS and DLS, respectively, and not serially. DMS lesions seem to affect performance only early in training, but have no effect later on. However, DLS lesions affect behavior early in training and continue to affect it later on. This interpretation of our skill learning results is consistent with previous studies in operant conditioning showing that lesions of the DMS accelerate acquisition of an interval schedule of reinforcement, whereas lesions of the DLS impair it18, and that inactivation of the DLS in habitual animals renders the behavior goal-directed again37.
Although these differences in medial and lateral plasticity in the dorsal striatum in vivo can be explained in part by differences in anatomical connectivity, the distribution of key receptors and differences in dopaminergic release and uptake may also be involved. For example, the CB1 endocannabinoid receptor, which is critical for habit formation, is more abundantly expressed in the dorsolateral striatum38. The role of dopamine during the different phases of skill learning is also complex, not only because its mechanisms of release and reuptake show substantial regional variation39,40, but also because it has different physiological effects on two populations of projection neurons in the striatum: D1 receptor–expressing and D2 receptor–expressing neurons26. We found that D2-expressing striatopallidal medium spiny neurons in the dorsolateral striatum showed a preferential increase in synaptic strength in comparison with striatonigral neurons after extensive training. This is, to the best of our knowledge, the first report showing an increase in synaptic strength in striatopallidal neurons as a result of learning and is consistent with recent results that reported LTP in this class of neurons26,41. These studies showed that the activation of A2A adenosine receptors, which are colocalized with D2 receptors, is critical for LTP in the striatopallidal pathway, whereas the activation of D1 receptors is critical for LTP in the striatonigral pathway26,41. In addition, a notable finding from our study is the increased NMDA and AMPA currents following extended training. A recent study has shown the importance of A2A receptors in the LTP of the NMDA component of glutamatergic transmission42; such a mechanism may well be critical for the training-induced potentiation that we observed in the DLS.
We observed that performance on the rotarod became independent of D1 receptor activation with extensive training (expressed mainly in striatonigral neurons) but still required D2 receptor activation. These findings are consistent with previous studies using a Pavlovian approach task in rats43. We found it interesting that D1 and D2 receptors seem to be differentially expressed in the medial and lateral regions of the dorsal striatum, with D2 receptor being more prominent laterally28,29 (Fig. 5). In addition, D1 receptors have lower affinity for dopamine binding than D2 receptors, suggesting that D1 receptor activation requires a phasic increase in dopamine release, whereas D2 receptor activation can be achieved with tonic dopamine release44. This suggests that during early acquisition of a new skill, when dopaminergic neurons show a phasic increase in activity in response to novel elements of the task45, D1 receptors could be activated. With extensive training and the development of automaticity, however, phasic increases in dopaminergic activity can become less frequent45 and, consequently, the dopamine that is released would mainly activate D2 receptors. Also, given the recent data indicating that D2-expressing striatopallidal neurons have more and stronger inhibitory projections to D1-expressing striatonigral neurons than the converse46, potentiation of glutamatergic transmission onto striatopallidal neurons could result in increased inhibition of striatonigral neurons and serve as a substrate for the competition between these different pathways in behavioral control.
Our findings may have implications for our understanding of the symptoms of Parkinson's disease. It is well-known that individuals with Parkinson's disease can perform automatized movements, but have difficulty initiating voluntary movements47,48. The progressive loss of dopamine in Parkinson's disease may first affect D1 receptor activation (lower affinity) and affect D2 receptor activation later on. Thus, our observation that D2-expressing striatopallidal neurons are potentiated after extensive training and that automatized movements can become independent of D1 receptor activation may provide a mechanistic explanation for the relative preservation of more automatic movements in Parkinson's’ disease (see Supplementary Videos 1 and 2).
In summary, we report evidence for extensive functional reorganization in the striatum during the acquisition and consolidation of a skill via region-specific and pathway-specific changes in synaptic strength. We observed that the neuronal activity in the dorsomedial and dorsolateral striata of awake behaving mice changed across the different phases of learning. These changes were accompanied by dynamic patterns of synaptic plasticity in the associative striatum (DMS) and the sensorimotor striatum (DLS) during the different phases of learning. Although synaptic strength in the DMS was greater early in training, extensive training induced long-lasting potentiation at excitatory synapses onto medium spiny neurons in the sensorimotor striatum (DLS). This potentiation in the DLS after extended training occurred predominantly in D2 receptor–expressing striatopalidal neurons at the same time that the performance of the skill became less dependent on the activation of D1 receptors, which are mainly expressed in striatonigral neurons. Together, these data provide a first glimpse into the dynamic reorganization in neural circuits as a skill is learned and automatized. These findings could shed light on why voluntary movements are more affected than automatized movements in individuals with Parkinson's disease.
Animals
All procedures were approved by the National Institute on Alcohol Abuse and Alcoholism Animal Care and Use Committee. Adult C57Bl6/J male mice (2–5-months-old) were used for all of the behavior and recording experiments, with the exception of the D2-EGFP mice, which were on a FVB/N;Swiss Webster genetic background. The C57Bl6/J male mice were purchased from Jackson Laboratory at the age of 6–8 weeks and were given at least 2 weeks to acclimate to the new environment.
Rotarod training
A computer-interfaced rotarod accelerating from 4–40 rotations per min over 300 s was used (ENV-575M, Med Associates). Animals were trained with ten trials per day for either 1 d or 8 d (trained every other day). This training protocol was chosen on the basis of studies determining the time course of sensitivity of this task to interference, protein synthesis blockers (data not shown) and dopamine receptor antagonists (Fig. 5d). Each trial ended when the mouse fell off the rotarod or after 300 s had elapsed and there was a resting period of approximately 300 s between trials. Yoked animals were handled and placed in the rotarod in the same manner as the trained animals, but without the rotation of the rod. During the in vivo recordings, the beginning and end of the running period were signaled to the MAP recording system (Plexon) as events. The D1 receptor antagonist SCH-23390 (0.4 mg per kg of body weight, Sigma-Aldrich) and D2 receptor antagonist raclopride (2.0 mg per kg, Sigma-Aldrich) were dissolved in phosphate-buffered saline with 1% DMSO by volume (control injection) and injected intra-peritoneally at 10 ml kg–1 (these doses completely block the effects of 3,4-dihydroxy-l-phenylalanine after dopamine depletion49, indicating complete block of D1 and D2 receptors in vivo).
Lesions
Excitotoxic lesions of the striatum were performed by injecting NMDA into the striatum using a syringe pump (Razel Scientific) under general anesthesia with isofluorane. Two cannulae were inserted per striatum (two sites per dorsolateral or dorsomedial region on each hemisphere, one more anterior and one more posterior, along a line connecting +1.18 mm anterior-posterior, +2.0 mm medial-lateral with +0.22 mm anterior-posterior, +2.6 mm medial-lateral). We injected 0.25–0.4 μl of a 10 mg ml–1 solution of NMDA per site (2.25 mm from the surface of the brain) at a rate of 5–10 μl h–1. Animals were allowed to recover for at least 2 weeks before training started. Lesions were verified post-mortem after perfusion and overnight post-fixation with 4% paraformaldehyde (by weight) using Nissl staining of 50-μm brain slices.
In vivo extracellular recording
Implanting of multi-electrode arrays and in vivo recordings of neural activity in awake behaving mice were carried out as described previously14. The multi-electrode arrays consisted of two rows of eight polyamide-coated tungsten microwires (35 μm, CD Neural). The two rows were separated by 1 mm, so that one row targeted the DLS and the other targeted the DMS; electrodes in a row were separated by ~200 μm. The placement of the center of the array was 0.5 mm anterior to Bregma, 1.6 mm lateral to Bregma and 2.2–2.4 mm below the brain surface. The placement of the electrodes was verified post-mortem after perfusion and overnight post-fixation with 4% paraformaldehyde, using Nissl staining of 50-μm brain slices. After at least 2 weeks of recovery after surgery, single-unit and multi-unit activity were recorded using the Plexon data acquisition system (Plexon). Neural activity was initially sorted using an online sorting algorithm and then re-sorted using an offline sorting algorithm on the basis of waveform, amplitude and the interspike interval histogram. Because separate analysis of single-unit activity did not yield different data from that of multi-unit activity, the two datasets were combined.
Ex vivo slice recordings
The mice were killed by decapitation under halothane anesthesia and their brains were rapidly transferred to ice-cold modified artificial cerebrospinal fluid (aCSF) containing 194 mM sucrose, 30 mM NaCl, 4.5 mM KCl, 1 mM MgCl2, 26 mM NaHCO3, 1.2 mM NaH2PO4 and 10 mM d-glucose. Modified aCSF was brought to pH 7.4 by aeration with 95% O2/5% CO2. Coronal sections (250 μm thick) were cut in ice-cold modified aCSF using a vibrotome 1000 and transferred immediately to normal aCSF containing 124 mM NaCl, 4.5 mM KCl, 2 mM CaCl2, 1 mM MgCl2, 26 mM NaHCO3, 1.2 mM NaH2PO4 and 10 mM d-glucose maintained at pH 7.4 by bubbling with 95% O2/5% CO2 19–22 °C.
Extracellular field recordings were obtained with micropipettes (2.5–5 MΩ) filled with 1 M NaCl solution. Slices were maintained at a temperature between 28 and 32 °C, stable within ± 1 °C during the experiment and stimulated via a bipolar twisted-tungsten electrode placed in the striatum. For the LTD recordings, stimulus intensity was set to evoke a population spike amplitude that was approximately half the size of the maximal evoked response. The high-frequency stimulation protocol used to induce LTD in field potential recordings consisted of two 1-s duration trains of 100 pulses (each pulse lasted 10 μs) delivered at 100 Hz, with 10 s between trains.
Whole-cell recordings were carried out as previously described50. Pipettes were pulled from borosilicate glass capillaries on a Flaming-Brown micropipette puller (Novato). The internal solution for voltage-clamp experiments contained 120 mM cesium methane sulfonate, 5 mM NaCl, 10 mM tetraethylammonium chloride, 10 mM HEPES, 4 mM lidocaine N-ethyl bromide, 1.1 mM EGTA, 4 mM magnesium ATP and 0.3 mM sodium GTP, pH adjusted to 7.2 with CsOH and osmolarity set to 298 mOsm with sucrose. The internal solution for current-clamp experiments contained 150 mM potassium gluconate, 2 mM MgCl2, 1.1 mM EGTA, 10 mM HEPES, 3 mM sodium ATP and 0.2 mM sodium GTP, with pH adjusted to 7.2 with KOH and osmolarity set to ~300 mosM with sucrose. The external solution was aCSF with osmolarity adjusted to ~315 with sucrose.
Medium spiny neurons (soma diameter 5–10 μm) were identified visually with the aid of differential interference contrast–enhanced visual guidance. For the voltage-clamp experiments, the stimulus intensity was set to the level at which the EPSC amplitude (AMPA component) was 200–400 pA. Synaptic currents were recorded with an Axopatch 1D amplifier (Axon Instruments), filtered at 5 kHz and digitized at 10 kHz. Slices were maintained at a temperature between 28 and 32 °C, stable within ± 1 °C during the experiment. Fluorescent medium spiny neurons in the D2-EGFP mice were visualized using mercury arc lamp illumination and a Chroma cube U-41001 (Olympus BX51 microscope), and classified as putative striatopallidal neurons, whereas non-fluorescent medium spiny neurons were classified as putative striatonigral neurons. Our identification was independently verified by differences in action potentials evoked by injected current.
Statistics
Analyses of the rotarod behavior were carried out using one-way ANOVA. Analyses of slice electrophysiology were performed using one-way or two-way ANOVA, as appropriate. When appropriate, post hoc tests were applied (Fisher's protected least-significant difference, PLSD). The number of attempts for each slice to reach saturation in Figure 1e was calculated using paired t tests comparing the 10 min preceding the attempt and the 10 min before the next attempt; the last attempt to produce a significant depression (P < 0.05) of the population spike amplitude was considered to be the saturation point. For the analyses of in vivo data (neural activity and lesion in Fig. 4) we considered the first two trials of the first day, the last two trials of the first day and the last two trials of the eighth day of training as naive, early and late phase, respectively14. Significant positive or negative modulation of firing rate during running for each neuron in Figure 4e was determined by using a 99% confidence interval on the basis of the distribution of spikes during the intertrial period. The data in Figure 4e were analyzed using the χ2 statistic. Statistical analyses were performed using Statview (SAS), SPSS (SPSS) and Neuroexplorer (Neuroexplorer).
Additional methods on immunofluorescence, dopamine receptor blockade, and autoradiography are available in the Supplementary Methods online.
Supplementary Material
Supplement
Video1
Video2
ACKNOWLEDGMENTS
We thank T. Gremel and X. Jin for comments on the manuscript. This research was supported by the Division of Intramural Clinical and Basic Research of the National Institute on Alcohol Abuse and Alcoholism, US National Institutes of Health.
Footnotes
AUTHOR CONTRIBUTIONS
H.H.Y. planned the experiments, conducted the behavioral experiments and in vivo and ex vivo recordings, performed data analyses and wrote the manuscript. S.P.M. performed ex vivo field recordings and data analyses. M.R.F.H. performed lesions, histology and behavioral experiments. E.C. and T.H. performed behavioral experiments; M.I.D. carried out immunohistochemistry; A.C.H. performed the dopamine binding experiments; D.M.L. contributed to experimental design and the writing of the manuscript; and R.M.C. planned and supervised experiments, contributed to behavioral experiments, histology and in vivo recordings, performed data analyses and wrote the manuscript.
Note: Supplementary information is available on the Nature Neuroscience website.
1. Karni A, et al. The acquisition of skilled motor performance: fast and slow experience-driven changes in primary motor cortex. Proc. Natl. Acad. Sci. USA. 1998;95:861–868. [PubMed]
2. Miyachi S, Hikosaka O, Lu X. Differential activation of monkey striatal neurons in the early and late stages of procedural learning. Exp. Brain Res. 2002;146:122–126. [PubMed]
3. Miyachi S, Hikosaka O, Miyashita K, Karadi Z, Rand MK. Differential roles of monkey striatum in learning of sequential hand movement. Exp. Brain Res. 1997;115:1–5. [PubMed]
4. Kargo WJ, Nitz DA. Improvements in the signal-to-noise ratio of motor cortex cells distinguish early versus late phases of motor skill learning. J. Neurosci. 2004;24:5560–5569. [PubMed]
5. Shiffrin RM, Schneider W. Controlled and automatic human information processing. II. Perceptual learning, automatic attending, and a general theory. Psychol. Rev. 1977;84:127–190.
6. Muellbacher W, et al. Early consolidation in human primary motor cortex. Nature. 2002;415:640–644. [PubMed]
7. Kleim JA, et al. Cortical synaptogenesis and motor map reorganization occur during late, but not early, phase of motor skill learning. J. Neurosci. 2004;24:628–633. [PubMed]
8. Jenkins IH, Brooks DJ, Nixon PD, Frackowiak RS, Passingham RE. Motor sequence learning: a study with positron emission tomography. J. Neurosci. 1994;14:3775–3790. [PubMed]
9. Doyon J, Owen AM, Petrides M, Sziklas V, Evans AC. Functional anatomy of visuomotor skill learning in human subjects examined with positron emission tomography. Eur. J. Neurosci. 1996;8:637–648. [PubMed]
10. Carelli RM, Wolske M, West MO. Loss of lever press–related firing of rat striatal forelimb neurons after repeated sessions in a lever pressing task. J. Neurosci. 1997;17:1804–1814. [PubMed]
11. Ungerleider LG, Doyon J, Karni A. Imaging brain plasticity during motor skill learning. Neurobiol. Learn. Mem. 2002;78:553–564. [PubMed]
12. Brasted PJ, Wise SP. Comparison of learning-related neuronal activity in the dorsal premotor cortex and striatum. Eur. J. Neurosci. 2004;19:721–740. [PubMed]
13. Barnes TD, Kubota Y, Hu D, Jin DZ, Graybiel AM. Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories. Nature. 2005;437:1158–1161. [PubMed]
14. Costa RM, Cohen D, Nicolelis MA. Differential corticostriatal plasticity during fast and slow motor skill learning in mice. Curr. Biol. 2004;14:1124–1134. [PubMed]
15. Voorn P, Vanderschuren LJ, Groenewegen HJ, Robbins TW, Pennartz CM. Putting a spin on the dorsal-ventral divide of the striatum. Trends Neurosci. 2004;27:468–474. [PubMed]
16. McGeorge AJ, Faull RL. The organization of the projection from the cerebral cortex to the striatum in the rat. Neuroscience. 1989;29:503–537. [PubMed]
17. Yin HH, Ostlund SB, Knowlton BJ, Balleine BW. The role of the dorsomedial striatum in instrumental conditioning. Eur. J. Neurosci. 2005;22:513–523. [PubMed]
18. Yin HH, Knowlton BJ, Balleine BW. Lesions of dorsolateral striatum preserve outcome expectancy, but disrupt habit formation in instrumental learning. Eur. J. Neurosci. 2004;19:181–189. [PubMed]
19. Luft AR, Buitrago MM. Stages of motor skill learning. Mol. Neurobiol. 2005;32:205–216. [PubMed]
20. Calabresi P, Maj R, Pisani A, Mercuri NB, Bernardi G. Long-term synaptic depression in the striatum: physiological and pharmacological characterization. J. Neurosci. 1992;12:4224–4233. [PubMed]
21. Lovinger DM, Tyler EC, Merritt A. Short- and long-term synaptic depression in rat neostriatum. J. Neurophysiol. 1993;70:1937–1949. [PubMed]
22. Rioult-Pedotti MS, Friedman D, Donoghue JP. Learning-induced LTP in neocortex. Science. 2000;290:533–536. [PubMed]
23. Gerfen CR, et al. D1 and D2 dopamine receptor-regulated gene expression of striatonigral and striatopallidal neurons. Science. 1990;250:1429–1432. [PubMed]
24. Kreitzer AC, Malenka RC. Endocannabinoid-mediated rescue of striatal LTD and motor deficits in Parkinson's disease models. Nature. 2007;445:643–647. [PubMed]
25. Shen W, et al. Cholinergic modulation of Kir2 channels selectively elevates dendritic excitability in striatopallidal neurons. Nat. Neurosci. 2007;10:1458–1466. [PubMed]
26. Shen W, Flajolet M, Greengard P, Surmeier DJ. Dichotomous dopaminergic control of striatal synaptic plasticity. Science. 2008;321:848–851. [PMC free article] [PubMed]
27. Kerr JN, Wickens JR. Dopamine D-1/D-5 receptor activation is required for long-term potentiation in the rat neostriatum in vitro. J. Neurophysiol. 2001;85:117–124. [PubMed]
28. Savasta M, Dubois A, Scatton B. Autoradiographic localization of D1 dopamine receptors in the rat brain with [3H]SCH 23390. Brain Res. 1986;375:291–301. [PubMed]
29. Joyce JN, Loeschen SK, Marshall JF. Dopamine D-2 receptors in rat caudate-putamen: the lateral to medial gradient does not correspond to dopaminergic innervation. Brain Res. 1985;338:209–218. [PubMed]
30. Reynolds JN, Hyland BI, Wickens JR. A cellular mechanism of reward-related learning. Nature. 2001;413:67–70. [PubMed]
31. Dang MT, et al. Disrupted motor learning and long-term synaptic plasticity in mice lacking NMDAR1 in the striatum. Proc. Natl. Acad. Sci. USA. 2006;103:15254–15259. [PubMed]
32. Carrillo-Reid L, et al. Encoding network states by striatal cell assemblies. J. Neurophysiol. 2008;99:1435–1450. [PubMed]
33. Kasanetz F, Riquelme LA, Della-Maggiore V, O'Donnell P, Murer MG. Functional integration across a gradient of corticostriatal channels controls UP state transitions in the dorsal striatum. Proc. Natl. Acad. Sci. USA. 2008;105:8124–8129. [PubMed]
34. Yin HH, Knowlton BJ. The role of the basal ganglia in habit formation. Nat. Rev. Neurosci. 2006;7:464–476. [PubMed]
35. Belin D, Everitt BJ. Cocaine seeking habits depend upon dopamine-dependent serial connectivity linking the ventral with the dorsal striatum. Neuron. 2008;57:432–441. [PubMed]
36. Takahashi Y, Roesch MR, Stalnaker TA, Schoenbaum G. Cocaine exposure shifts the balance of associative encoding from ventral to dorsolateral striatum. Front. Integr. Neurosci. 2007;1:11. [PMC free article] [PubMed]
37. Yin HH, Knowlton BJ, Balleine BW. Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning. Behav. Brain Res. 2006;166:189–196. [PubMed]
38. Hilario MRF, Clouse E, Yin HH, Costa RM. Endocannabinoid signaling is critical for habit formation. Front. Integr. Neurosci. 2007;1:6. [PMC free article] [PubMed]
39. Wickens JR, Budd CS, Hyland BI, Arbuthnott GW. Striatal contributions to reward and decision making: making sense of regional variations in a reiterated processing matrix. Ann. NY Acad. Sci. 2007;1104:192–212. [PubMed]
40. Jedynak JP, Uslaner JM, Esteban JA, Robinson TE. Methamphetamine-induced structural plasticity in the dorsal striatum. Eur. J. Neurosci. 2007;25:847–853. [PubMed]
41. Flajolet M, et al. FGF acts as a co-transmitter through adenosine A(2A) receptor to regulate synaptic plasticity. Nat. Neurosci. 2008;11:1402–1409. [PMC free article] [PubMed]
42. Rebola N, Lujan R, Cunha RA, Mulle C. Adenosine A2A receptors are essential for long-term potentiation of NMDA-EPSCs at hippocampal mossy fiber synapses. Neuron. 2008;57:121–134. [PubMed]
43. Choi WY, Balsam PD, Horvitz JC. Extended habit training reduces dopamine mediation of appetitive response expression. J. Neurosci. 2005;25:6729–6733. [PubMed]
44. Richfield EK, Penney JB, Young AB. Anatomical and affinity state comparisons between dopamine D1 and D2 receptors in the rat central nervous system. Neuroscience. 1989;30:767–777. [PubMed]
45. Ljungberg T, Apicella P, Schultz W. Responses of monkey dopamine neurons during learning of behavioral reactions. J. Neurophysiol. 1992;67:145–163. [PubMed]
46. Taverna S, Ilijic E, Surmeier DJ. Recurrent collateral connections of striatal medium spiny neurons are disrupted in models of Parkinson's disease. J. Neurosci. 2008;28:5504–5512. [PMC free article] [PubMed]
47. Martin JP. The Basal Ganglia and Posture. Pitman Medical; London: 1967.
48. Briand KA, Strallow D, Hening W, Poizner H, Sereno AB. Control of voluntary and reflexive saccades in Parkinson's disease. Exp. Brain Res. 1999;129:38–48. [PubMed]
49. Costa RM, et al. Rapid alterations in corticostriatal ensemble coordination during acute dopamine-dependent motor dysfunction. Neuron. 2006;52:359–369. [PubMed]
50. Gerdeman GL, Ronesi J, Lovinger DM. Postsynaptic endocannabinoid release is critical to long-term depression in the striatum. Nat. Neurosci. 2002;5:446–451. [PubMed]