|Home | About | Journals | Submit | Contact Us | Français|
The debate regarding the role of ventral occipito-temporal cortex (vOTC) in visual word recognition arises in part from difficulty delineating the functional contributions of vOTC as separate from other areas of the reading network. Here we investigated the feasibility of using transcranial magnetic stimulation (TMS) to interfere with vOTC processing in order to explore its specific contributions to visual word recognition. Three visual lexical decision experiments were conducted using neuro-navigated TMS. The first demonstrated that repetitive stimulation of vOTC successfully slowed word, but not non-word, responses. The second confirmed and extended these findings by demonstrating the effect was specific to vOTC and not present in the adjacent lateral occipital complex. The final experiment used paired-pulse TMS to investigate the time course of vOTC processing for words and revealed activation starting as early as 80-120msec post-stimulus onset - significantly earlier than that expected based on electrophysiological and magnetoencephalography studies. Taken together, these results clearly indicate that TMS can be successfully used to stimulate parts of vOTC previously believed to be inaccessible and provide a new tool for systematically investigating the information processing characteristics of vOTC. In addition, the findings provide strong evidence that lexical status and frequency significantly affect vOTC processing, findings difficult to reconcile with pre-lexical accounts of vOTC function.
There is considerable debate regarding the role of the left ventral occipitotemporal cortex (vOTC) during reading (Cai, Lavidor, Brysbaert, Paulignan, & Nazir, 2008; Cohen & Dehaene, 2004; Dehaene, Cohen, Sigman, & Vinckier, 2005; Hillis, Newhart, Heidler, Barker, Herskovits et al., 2005; McCandliss, Cohen, & Dehaene, 2003; Price & Devlin, 2003, 2004). In part, this stems from a fundamental difficulty in identifying functional contributions from specific regions for tasks that engage highly distributed neural systems. For example, the claim that vOTC is the site of abstract orthographic representations is based on a combination of functional magnetic resonance imaging (fMRI), event-related potential (ERP), and neuropsychological studies, all of which implicate vOTC in visual word recognition. Individually, however, no single line of evidence is particularly convincing. Consider the fMRI data showing that vOTC activation patterns for words are robust to changes in font, case and retinal location (Cohen, Dehaene, Naccache, Lehéricy, Dehaene-Lambertz et al., 2000; Cohen, Lehéricy, Chochon, Lemer, Rivaud et al., 2002). Although these are important properties of orthographic representations, the same patterns of activation are also seen in other left hemisphere areas including inferior frontal gyrus, inferior parietal lobule, and middle temporal cortex, making it difficult to attribute specific processing characteristic exclusively to vOTC (Price & Devlin, 2004). Similarly, electrophysiological studies demonstrate that visual words activate vOTC approximately 170-200msec post-stimulus onset, suggesting that this activation may reflect the earliest stage of abstract orthographic processing (Cohen et al., 2000; McCandliss et al., 2003; Nobre, Allison, & McCarthy, 1994; Salmelin, Service, Kiesila, Uutela, & Salonen, 1996). Other ERP and MEG studies, however, reveal that vOTC activation is concurrent with activation in peri-Sylvian regions (Pammer, Hansen, Kringelbach, Holliday, Barnes et al., 2004; Pulvermüller, Huss, Kherif, Moscoso del Prado Martin, Hauk et al., 2006), again calling into question the unique processing characteristics of vOTC. Finally, lesions to vOTC can result in an acquired reading disorder called “pure alexia” (Dejerine, 1892; Gaillard, Naccache, Pinel, Clemenceau, Volle et al., 2006; Leff, Crewes, Plant, Scott, Kennard et al., 2001), suggesting that this region may be the site of stored visual word forms (Cohen et al., 2002; Gaillard et al., 2006). Unfortunately for this argument, left inferior parietal lesions can also result in pure alexia (Philipose, Gottesman, Newhart, Kleinman, Herskovits et al., 2007; Warrington & Shallice, 1980). In short, it has been difficult to unambiguously delineate the specific information processing that vOTC contributes to reading because correlational measures such as PET, fMRI, ERP and MEG do not distinguish contributions from co-activated areas and because of the well known difficulties in interpreting neuropsychological studies. Consider, for example, the fact that lateral occipital complex is consistently activated during visual word recognition studies (Duncan, Pattamalidok, Kneirim, & Devlin, submitted; Price, McCrory, Noppeney, Mechelli, Moore et al., 2006; Wright, Mechelli, Noppeney, Veltman, Rombouts et al., 2008) but damage to this region does not tend to result in reading deficits (Philipose et al., 2007). This discrepancy between the neuroimaging and neuropsychological literature highlights the need for a method to temporarily, and non-invasively, perturb the information processing in these regions to investigate the causal relations between activation and reading.
Transcranial magnetic stimulation (TMS) would seem to offer such a tool (Barker & Jalinous, 1985; Pascual-Leone, Bartres-Faz, & Keenan, 1999; Sack, 2006), but it is widely believed that the location of vOTC on the ventral surface of the brain makes it inaccessible to TMS. Here, we explicitly tested this assumption in three TMS experiments using a lexical decision task to engage visual word recognition. The primary aim of the first experiment was to determine whether rTMS to vOTC would interfere with visual word recognition. A second goal was to determine whether a similar effect would be seen for LOC stimulation. The second experiment further investigated potential differences between vOTC and LOC contributions to reading using rTMS while the third experiment used paired-pulse TMS to delineate the time course of processing in left vOTC during reading.
64 subjects (35M, 29F, aged 18 to 45, mean = 27) participated in three experiments. All were right handed, native English speakers with normal or corrected to normal vision. None had any form of dyslexia, a personal history of neurological disease, or a family history of epilepsy according to self-reports. Each gave informed consent after the experimental procedures were explained. The experiments were approved by the Berkshire NHS Research Ethics Committee.
All subjects participated in a single fMRI session whose purpose was to functionally localise two areas of occipito-temporal cortex for neuro-navigated stimulation. These were a region of ventral occipito-temporal cortex (vOTC) sensitive to reading visual words that is typically located in the posterior occipito-temporal sulcus (Ben-Shachar, Dougherty, Deutsch, & Wandell, 2007; Duncan et al., submitted; Wright et al., 2008) and a lateral region sensitive to visual objects, often called lateral occipital complex (LOC) (Grill-Spector, Kushnir, Edelman, Avidan, Itzchak et al., 1999; Malach, Reppas, Benson, Kwong, Jiang et al., 1995). The functional localiser scan used a one-back task with four categories of visual stimuli: written words, pictures of common objects, scrambled pictures of the same objects, and consonant letter strings. Subjects were instructed to press a button if the stimulus was identical to the preceding stimulus and 12.5% of the stimuli were targets. A block design was used to maximize statistical sensitivity. Each block consisted of 16 trials from a single category presented one every second. A trial began with a 650msec fixation cross followed by the stimulus for 350msec. In between blocks, subjects viewed a fixation cross for 16sec. Each run entailed six blocks per category and all subjects participated in two runs with the order counter-balanced across subjects. In total there were 192 stimuli per category.
Whole-brain imaging was performed on a Siemens 1.5 Tesla MR scanner at the Birkbeck-UCL Centre for Neuroimaging (BUCNI) in London. The functional data were acquired with a gradient-echo EPI sequence (TR = 3sec; TE = 50msec, FOV=192 × 192, matrix = 64 × 64) giving a notional resolution of 3 × 3 × 3mm. In addition, a high-resolution anatomical scan was acquired (T1-weighted FLASH, TR = 12msec; TE = 5.6msec; 1mm3 resolution) for anatomically localizing activations in individuals.
Image processing was carried out using FSL 4.0 (www.fmrib.ox.ac.uk/fsl). To allow for T1 equilibrium, the initial two images of each run were discarded. The data were then realigned to remove small head movements (Jenkinson, Bannister, Brady, & Smith, 2002), smoothed with a 6mm FWHM Gaussian kernel and pre-whitened to remove temporal auto-correlation (Woolrich, Ripley, Brady, & Smith, 2001). The resulting images were entered into a general linear model with four conditions of interest corresponding to the four categories of visual stimuli. Blocks were convolved with a double gamma “canonical” hemodynamic response function (Glover, 1999) to generate the main regressors. In addition, the estimated motion parameters were entered as covariates of no interest to reduce structured noise due to minor head motion. Linear contrasts of [words > fixation] and [objects > scrambled objects] identified reading- and object-sensitive areas, respectively. First level results were registered to the MNI-152 template using a 12-DOF affine transformation (Jenkinson & Smith, 2001) and a subsequent second level, fixed-effects model combined the two first level runs into a single, subject-specific analysis. This was then transformed into the participant’s native structural space and used to target stimulation in the TMS experiment.
The aims of the first experiment were to first determine whether TMS could be successfully used to stimulate vOTC and interfere with visual word recognition, and second to investigate the contributions of vOTC and LOC to reading. 14 participants (7M, 7F, aged 19 to 38, mean = 25) performed a visual lexical decision task while rTMS was pseudorandomly delivered to one of three target sites.
The lexical decision experiment consisted of four conditions, each comprising 100 items: low frequency words (1-10 occurrences per million), high frequency words (20-650 occurrences per million), pronounceable pseudowords (e.g. “glats”) and unpronounceable consonant letter strings (e.g. “btfj”). Frequency values were obtained from the Celex database of British written English (Baayen, Piepenbrock, & Van Rijn, 1993). Low and high frequency items were treated separately because imaging studies have shown greater vOTC activation for low than high frequency words (Bruno, Zumberge, Manis, Lu, & Goldman, 2008; Kronbichler, Hutzler, Wimmer, Mair, Staffen et al., 2004; Kuo, Yeh, Lee, Wu, Chou et al., 2003), suggesting that stimulation of this region may have a larger effect on low frequency words. In addition, two non-lexical conditions were included because imaging studies have also shown that pseudowords elicit either equivalent or greater vOTC activation than words (Mechelli, Gorno-Tempini, & Price, 2003; Nobre et al., 1994; Price, Wise, & Frackowiak, 1996; Rumsey, Horwitz, Donohue, Nace, Maisog et al., 1997), and thus stimulation of this region may affect both legal words and pronounceable non-words. Consequently, consonant letter strings were included as an additional control condition where stimulation was not expected to impair performance. To avoid repeating stimuli within a testing session, five versions of the lexical decision were created (each comprising 20 items per condition) and matched for written word frequency (overall and separately for both low and high frequency items) (Baayen et al., 1993), rated familiarity (Coltheart, 1981), letter length, number of syllables and bigram frequency. The order of the versions was balanced across subjects and stimulation sites. In addition, an independent set of items was used for practice.
Repetitive TMS was pseudorandomly delivered on half of all trials. Pulses were delivered at 100, 200, 300, 400 and 500msec post-stimulus onset (i.e., 10Hz for 500msec). The intensity was set to 110% of the subject’s motor threshold as measured by a visible twitch of the hand. Typically, this measure is approximately 10% higher than motor thresholds measured with motor evoked potentials and therefore is a very conservative measure in the sense that it ensured sufficient intensity to stimulate motor cortex. This value was increased by an additional 10% in the main experiment to ensure sufficient intensity to reach vOTC cortex. Even so, this was well within established safety limits (Wassermann, Grafman, Berry, Hollnagel, Wild et al., 1996; Wassermann, 1998) and this general protocol has been widely used to temporarily interfere with processing in relatively focal cortical zones (Göbel, Walsh, & Rushworth, 2001; Gough, Nobre, & Devlin, 2005; Manenti, Cappa, Rossini, & Miniussi, 2008; Pitcher, Garrido, Walsh, & Duchaine, 2008; Rushworth, Ellison, & Walsh, 2001; Sandrini, Rossini, & Miniussi, 2008; Skarratt & Lavidor, 2006). In summary, then, the experiment used a within-subject design with Site (vOTC, vertex, LOC), Stimulus (LF words, HF words, pseudowords, consonants), and TMS (rTMS, none) as independent factors.
Each trial began with a fixation cross displayed for 500msec, followed by a visual letter string for 200msec and then a blank screen for 2300msec, giving a total duration of 3sec. Subjects indicated whether the letter string formed a real word in English or not by pressing a button using either their right or left index finger. Responses were fully counter-balanced for response hand across subjects. Accuracy and reaction times were recorded.
The three stimulation sites were tested sequentially in a single session with their order counter-balanced across subjects. A session began by measuring the participant’s motor threshold using single pulses of TMS delivered to the hand area of left primary motor cortex. The participant then performed a practice session of the lexical decision experiment without any TMS to familiarise them with the task. Next, one of the three testing sites was chosen and the participant was introduced to the sensation of rTMS at that site. For both vOTC and LOC, the coil was held behind the left ear and therefore the sound of the coil discharge was prominent even though all participants used an earplug in their left ear to attenuate this noise. In addition, stimulation affected the temporalis muscle and produced a unilateral facial twitch. When asked afterwards, participants were unable to distinguish the sensation of stimulation from the two temporal lobe sites. In contrast, stimulation of the vertex was further from the ear, did not produce any muscle twitches, and was easily distinguished. After familiarisation with the sensation, the participant performed a practice lexical decision run with rTMS to get used to performing the task with concurrent rTMS. Finally, they completed the lexical decision experiment for the given site using one of the five stimulus versions. The procedure was then repeated for the other two testing sites using different stimulus versions. At each location, including finding motor threshold, the location and orientation of the coil was recorded for later analyses.
A Magstim Rapid2 stimulator (Magstim, Whitland, UK) with a 70mm figure-of-eight coil was used to deliver the stimulation. In addition, a frameless stereotaxy system (Brainsight software, Rogue Research, Montreal, Canada) was used with a Polaris Vicra infrared camera (Northern Digital, Waterloo, Ontario, Canada) to accurately target stimulation and to measure the distance between the coil and the stimulation site. vOTC and LOC sites were defined based on the individual’s fMRI data. The hand area of motor cortex was identified anatomically according to the method of Yousry et al (1997) and verified by finding the location which produced a twitch in the right hand. The distance from the scalp to these three targets was measured for each subject to evaluate differences relating to accessibility. In each case, the distance between the target and the scalp was measured using Brainsight along the trajectory of stimulation.
The aim of the second experiment was to investigate the anatomical specificity of the interference effect and clarify an ambiguous finding from the previous experiment by determining whether stimulation of LOC interfered with visual word recognition or not. 26 subjects (13M, 13F, aged 18 to 45, mean = 26) performed a visual lexical decision experiment similar to the first. The primary differences were that i) this experiment used a between-subject design with 13 participants per stimulation site, ii) only two sites were tested (vOTC and LOC) and iii) only two stimulus conditions were included (low frequency words and pseudowords), each containing 40 items and the same stimuli were used for both sites. There were two versions whose order was counter-balanced across the stimulation sites. The stimuli were a subset of those used in the previous experiment and were matched across version for written word frequency, rated familiarity, letter length, number of syllables and bigram frequency. Finally, the TMS intensity was set to 100% of motor threshold rather than 110% as in the previous experiment. Otherwise, the procedures were the same as those above.
The aim of the final experiment was to investigate the time course of vOTC involvement in reading using paired-pulse TMS. 24 participants (15M, 9F, aged 19 to 46, mean = 29) performed a visual lexical decision experiment in which every trial had two pulses of TMS separated by 40msec. Subjectively, this feels like a single pulse but it induces a more robust interference effect (Juan & Walsh, 2003; O’Shea, Johansen-Berg, Trief, Gobel, & Rushworth, 2007; Pitcher, Walsh, Yovel, & Duchaine, 2007). This experiment also used a between-subject design, with one group of 12 participants receiving TMS over vOTC and the other over vertex. Within each group there were five different timing conditions with pulses at either 0 and 40msec, 40 and 80msec, 80 and 120msec, 120 and 160msec, or 160 and 200msec post stimulus onset and the order was pseudorandomized. All trials had TMS in one of the five timing windows. All timing conditions included 20 low frequency words and 20 pseudowords and were matched for written word frequency, rated familiarity, letter length, number of syllables and bigram frequency. The five versions of the stimuli contained the same 200 items but each set of 40 items was rotated across each of the 5 timing conditions, in addition to being matched for written word frequency, rated familiarity, letter length, number of syllables and bigram frequency. TMS intensity was set to 100% of the subject’s motor threshold and the blank screen between trials was reduced from 2300 to 1300msec, giving a total trial duration of 2sec. Otherwise, all procedures were the same as those in the previous experiments.
Reaction times (RTs) were measured from the onset of the target. Responses times shorter than 300msec or longer than 1500msec were trimmed, amounting to 0.2%, 0.0%, and 0.2% of the data in Experiments 1-3, respectively. To minimise the effect of outliers in the RT data, median RTs for correct responses per condition per subject were used in the statistical analyses (Ulrich & Miller, 1994). Analyses of variance (ANOVA) were used to test for effects of interest, and two-tailed paired t-tests were used with Bonferroni corrections for multiple comparisons unless stated otherwise.
In our experience, the 10Hz, 500msec stimulation protocol rarely shows disruption effects of more than 100msec and moreover, the effect is typically specific to particular subset of conditions (i.e. not the control stimuli, task, or site). So when inhibitory TMS effects larger than 100msec are observed and are present across conditions, these are generally attributed to non-specific effects of TMS such as: i) anxiety about the stimulation, ii) a priori “knowledge” that TMS slows responses, or iii) strong peripheral muscle stimulation that the participant finds impossible to ignore. In all three cases, the effect size and distribution across conditions is clearly different from the normal pattern and is easily identified and excluded. Data from three participants fit these criteria for physiologically implausible TMS effects. In the first experiment, one participant had RT increases for vOTC stimulation that were on average 173msec slower than no TMS trials across all conditions. Two participants in the second experiment showed TMS effects of +216 and +201msec. In both cases, these were more than double the next largest effect, over 100msec outside the range of the other participants, and present across conditions, suggesting they were primarily due to peripheral, rather than central, effects of TMS. Consequently these data were excluded from further analyses.
Because vOTC is generally regarded as inaccessible to TMS due to its depth, we began by explicitly measuring the distance from the scalp to the most highly activated voxels within vOTC and LOC used to target the stimulation. The distance was measured along the trajectory of the magnetic field and therefore reflects the distance between the coil and the stimulated region of cortex (Figure 1). In addition, the distance to the hand area of primary motor cortex was measured as a comparison. Because we did not functionally localise this using fMRI, the “omega knob” was marked independently by two of the authors (KJD & JTD) using anatomical criteria (Yousry et al., 1997) and the mid-point of these voxels was used to compute the distance from the scalp along the stimulation trajectory. The results for all subjects are shown in Table 1. On average, the depth of vOTC was 26.3mm which was not significantly different from the M1 hand area at 27.2mm (t(13)= 1.0, p = 0.684), suggesting the two are equally accessible to TMS. LOC, on the other hand, was significantly closer to the scalp than M1 with an average depth of only 20.5mm (t(13) = 5.2, p = 0.000), confirming its accessibility. It is worth noting, however, that because M1 was localized anatomically while vOTC and LOC were localized with fMRI, the depth measurements are not fully compatible. Even so, the results suggest that the depth of vOTC is roughly equal to that of M1.
In the main lexical decision experiment, overall accuracy levels were 95%, indicating that subjects had no difficulty performing the task. Accuracy scores were entered into a 3 × 4 × 2 repeated-measures ANOVA examining the effects of Site (vOTC, vertex, LOC), Condition (consonant strings, pseudowords, LF words, HF words) and TMS (rTMS, none). The only significant main effect was for Condition (F(3,39) = 14.0, p = 0.000), indicating that subjects were significantly less accurate for pseudowords than high frequency words (t(13) = 6.0, p = 0.01) and consonant strings (t(13) = 7.4, p = 0.003 ) but only numerically less for low frequency words (t(13) = 2.6, p = 0.478). There was no main effect of TMS and no significant interactions, indicating the presence of TMS did not affect accuracy.
The reaction time results are shown in Figure 2 and were analysed with the same omnibus ANOVA. There was a main effect of Condition (F(3,39) = 51.1, p = 0.000) indicating that as expected, the four types of stimuli differed in difficulty, with consonant letters strings the easiest (471msec), followed by high frequency words (517msec), low frequency words (550msec), and then pseudowords (624msec). In addition, there was a significant Condition × TMS interaction (F(3,39) = 5.5, p = 0.003) demonstrating that on average, rTMS decreased RTs across stimulation sites for pseudowords (634 vs. 613msecs) but increased RTs for low frequency words (543 vs. 558msecs). Interestingly, the facilitation effect on pseudowords was consistent for all three sites, whereas TMS interfered with low frequency words at the vOTC and LOC sites but facilitated responses at the vertex. Surprisingly, the three-way interaction was not significant, probably due to insufficient power given the complexity of the design (i.e. four stimulus conditions and three testing sites). Even so, Figure 2 shows that the response profiles to TMS were different across sites and consequently, we chose to investigate these further.
In order to investigate these difference response profiles, we conducted a 2-way repeated measures ANOVA for each site. The main effect of Condition was present for all sites. For the vOTC site, there was no main effect of TMS (F(1,13) = 0.6, p = 0.449) but a significant Condition × TMS interaction (F(3,39) = 3.7, p = 0.021). Planned comparisons revealed that TMS selectively slowed responses to low frequency words (+34msecs, t(13) = 2.8, p = 0.017) without significantly affecting the other conditions (Figure 2a). The response profile for stimulation of the vertex, on the other hand, looked very different. Here, there was a significant main effect of TMS (F(1,13) = 6.2, p = 0.027) but no significant interaction (F(3,39) = 1.8, p = 0.160). Planned comparisons showed a significant TMS-induced speed-up for both pseudowords (-34msecs, t(13) = 2.7, p = 0.018) and high frequency words (-24msecs, t(13) = 2.9, p = 0.012), although both low frequency words and consonants were numerically faster as well (Figure 2b). In other words, TMS of the vertex appeared to have a non-specific, inter-sensory facilitation effect (Terao, Ugawa, Suzuki, Sakai, Hanajima et al., 1997). Finally, stimulation at LOC produced neither a significant main effect of TMS (F(1,13) = 0.6, p = 0.458) nor an interaction (F(3,39) = 1.5, p = 0.217), although low frequency words showed a non-significant slowdown of +14msec (t(13)=1.0, p = 0.327) similar to that seen in vOTC (Figure 2c).
In summary, Experiment 1 demonstrated that stimulation of vOTC selectively slowed response times to low frequency words. This pattern was noticeably different than the non-specific facilitation effect seen for vertex stimulation but qualitatively similar to the pattern at LOC. Given that both vOTC and LOC are activated by visual words (Duncan et al., submitted; Moore & Price, 1999; Price et al., 2006) and that the two stimulation sites were only 2cm apart, it is unclear whether there was an actual difference in the effects of TMS between these sites or whether the experiment was under-powered to detect a TMS effect at LOC. Consequently, the second experiment was designed to specifically address this question. To increase statistical sensitivity, the number of stimuli per condition was doubled and lexical items only consisted of low frequency words since these produced the largest TMS effect in Experiment 1. In addition, we chose to reduce the stimulation intensity from 110% to 100% of motor threshold based on the similar depths of M1 and vOTC. This helped to reduce the discomfort that some participants experienced during stimulation without reducing the likelihood of successfully stimulating the region.
Like the first experiment, overall accuracy was high (94%). Accuracy scores were entered into a 3-way mixed ANOVA where the within-subjects factors were Condition (words, pseudowords) and TMS (rTMS, no) and the between subjects factor was Site (vOTC, LOC). The only significant main effect was for TMS (F(1,22) = 7.7, p = 0.011), reflecting a small TMS-induced decrease in accuracy (from 95% to 93%) that was present across sites and conditions. This is likely due to the peripheral effects of TMS stimulation, which was identical across the two occipito-temporal sites.
The critical results, however, concerned the TMS effect on reaction times. To investigate this, the reaction time results (shown in Figure 3) were analysed with the same omnibus ANOVA. There was a main effect of Condition (F(1,22) = 43.3, p = 0.000), once again indicating that pseudowords (646msec) were more difficult than LF words (585msec). More importantly, the three-way interaction (F(1,22) = 8.0, p = 0.010) was clearly significant. Further analyses showed that vOTC stimulation significantly slowed RTs for words (+40msec, t(12) = 3.1, p = 0.010) but had no effect on pseudowords (+8msec, t(12) = 0.7, p = 0.521). In contrast, LOC stimulation did not significantly affect RTs for words (-15msec, t(12) = 1.0, p = 0.343) or pseudowords (-1msec, t(10) = 0.070, p = 0.945).
These results confirm that stimulation of vOTC interferes with reading low frequency words and demonstrates the effect is present even at a lower stimulation intensity. Critically, the findings help to clarify the ambiguous results in the first experiment regarding the effects of LOC stimulation. Despite the greater sensitivity in Experiment 2, rTMS over LOC had no significant effect on low frequency words - in fact, if anything LOC stimulation slightly reduced RTs for words - confirming that the interference seen for vOTC stimulation is specific to that site and not a general feature of occipital-temporal stimulation. In other words, these peripheral stimulation effects cannot explain the current pattern of results because vOTC stimulation did not affect all stimuli equally; specifically it inhibited word responses without affecting pseudowords. In addition, these effects were not present in LOC where stimulation produced essentially identical peripheral effects. In summary, then, these results demonstrate that vOTC stimulation successfully and selectively disrupted visual word recognition while LOC stimulation did not. Consequently, the final experiment did not include LOC as a testing site since stimulation did not affect reading performance but nonetheless involved some discomfort due to peripheral enervation of the temporalis muscle.
Like the previous experiments, accuracy was high with an average score of 94% across conditions. Although pseudowords were included in the task to ensure that participants correctly performed the lexical decision task, they were not included in the analyses as the previous experiments showed no effect of TMS on pseudowords. The word accuracy data were entered into a two-way mixed ANOVA, where the within-subjects factor was Time (0/40, 40/80, 80/120, 120/160, 160/200msec post-stimulus onset) and the between subjects factor was Site (vOTC, vertex). There were no significant main effects of Time (F(4,88) = 1.6, p = 0.184) or Site (F(1,22) = 0.3, p = 0.564) and no interaction (F(4,88) = 1.6, p = 0.174), indicating that TMS once again did not affect accuracy.
Reaction times for words are shown in Figure 4. Here, the 0-40msec time window served as the baseline condition (per testing site) as TMS was not expected to influence reading that early. A two-way mixed ANOVA revealed a main effect of Time (F(4,88) = 4.2, p = 0.004) but no main effect of Site (F(1,22) = 0.44, p = 0.514) or Site × Time interaction (F(4,88) = 1.9, p = 0.110). Visual inspection of Figure 4 suggests that the absence of interaction may be caused by an unexpectedly long RT for vertex stimulation in the 160-200msec time window (573msec) compared to the other time windows (range: 544 to 555msec). The +18msec effect, however, did not represent a significant slowdown relative to the 0-40 window (t(11) = 1.6, p = 0.145), consistent with the results of Experiment 1 and with the functional neuroimaging literature, neither of which demonstrates vertex involvement in reading at any level. As a result, we chose to exclude the 160-200msec time window from further analyses.
When this time window was excluded, there was no main effect of Time (F(3,66) = 1.1, p = 0.369) or Site (F(1,22) = 0.5, p = 0.501) but a significant Site × Time interaction (F(3,66) = 2.9, p = 0.044). Planned comparisons used Bonferroni corrected, one-tailed t-tests because the previous experiments showed that TMS to vOTC slowed, rather than speeded, responses. These indicated no significant effect at 40-80msec (t(11) = 0.6, p = 0.8205) but significantly slowed responses for the 80-120 (+24msec, t(11) = 3.4, p = 0.009) and 120-160msec (+25msec, t(11) = 2.6,p = 0.0375) time windows. In contrast, there were no significant effects for stimulation of the vertex at any of the time windows (all t(11)<1.3, p>0.72). Figure 4, therefore, illustrates three points. First, the earliest time window where TMS interfered with reading was 80-120msec post-stimulus onset; second, this effect was also present in the subsequent time window of 120-160msec; and third, that this interference effect was specific to the vOTC site.
The initial question we set out to investigate was whether it was possible to stimulate a left ventral occipital-temporal region involved in visual word recognition. The results of three independent experiments convincingly demonstrate that TMS can be used to successfully stimulate vOTC and interfere with reading. While the subjective experience of TMS of vOTC and vertex is noticeably different, it is unlikely that the disruption in word reading was a general, peripheral effect of TMS given that the impairment was condition-specific and replicated. Moreover, this possibility is rendered even less likely if we consider the difference between vOTC and LOC stimulation (Expt 2), given that the sensation of TMS is not noticeably different between these two posterior temporal lobe sites. Specifically, stimulation of LOC led to a (non-significant) increase in RTs for low frequency words that became a (non-significant) decrease in the second. The lack of any significant modulation by TMS and the fact that RT changes were not even in a consistent direction, suggests that TMS to LOC had no effect on reading low frequency words, which contrasts with the results of vOTC stimulation. These findings are consistent with the neuropsychological literature which suggests that left hemisphere lesions to either vOTC or the inferior parietal lobule - but not LOC - can result in preferential impairments for reading (Binder & Mohr, 1992; Damasio & Damasio, 1983; Dejerine, 1892; Geschwind, 1965; Philipose et al., 2007) and demonstrate that vOTC, but not LOC, is necessary for visual word recognition. These findings have both methodological and theoretical implications.
The specific region of vOTC associated with reading is typically centred on the occipito-temporal sulcus and spreads medially onto the posterior fusiform gyrus (Ben-Shachar et al., 2007; Duncan et al., submitted; Wright et al., 2008). This area lies at a complex junction between the inferior temporal and fusiform gyri, just superior and lateral to the principle fissure of the cerebellum (see Figure 1). As a result, it is possible to orient the TMS coil such that the line of maximum stimulation runs between the cerebellum and inferior temporal gyrus and targets the occipito-temporal sulcus. Due to the smoothly varying topology of the magnetic field generated by a figure-of-eight coil, there is likely to be some stimulation of both inferior temporal gyrus and of cerebellar lobule VI, but the maximum effect targets the occipito-temporal sulcus and lateral posterior fusiform gyrus.
Perhaps surprisingly, this region is on average no deeper than the hand area of primary motor cortex, which is easily accessible with TMS (Barker & Jalinous, 1985; Pascual-Leone, Vallssole, Wassermann, & Hallett, 1994; Rothwell, 1997). Consequently, the stimulation intensity necessary to affect vOTC can be estimated based on motor thresholds. Here we used a fairly conservative measure of motor threshold (i.e. a visible muscle twitch) but lower values based on motor evoked potentials may be sufficient and would help to reduce both the auditory and somatosensory effects of stimulating this region. In addition, although we used fMRI to functional localise the precise area of vOTC to stimulate in each participant, this was not strictly necessary. Pilot work indicated that it was possible to functionally localise the optimal stimulation site using short TMS experiments before proceeding to the main experiment as previously done for other stimulation sites (e.g. Gough et al., 2005; Taylor, Nobre, & Rushworth, 2007; Walsh, Ellison, Battelli, & Cowey, 1998). In practice, however, some participants found the extra stimulation uncomfortable, consequently we chose to localize the site using fMRI instead. Both methods were effective in localising the target site and presumably significantly reduce the inter-subject variability relative to targeting based on either standard space coordinates or the 10-20 electrode system (Sack, Cohen Kadosh, Schuhmann, Moerel, Walsh et al., 2009; Sparing, Buelte, Meister, Paus, & Fink, 2008)
A final point worth mentioning here concerns the spatial resolution of TMS given the clear dissociation between stimulation of vOTC and LOC, two occipito-temporal regions separated by approximately 2cm. Although it is well established that TMS effects from single pulses can spread via anatomical connections beyond the site of stimulation (Ilmoniemi, Virtanen, Ruohonen, Karhu, Aronen et al., 1997; Paus, Jech, Thompson, Comeau, Peters et al., 1997), TMS appears to have an effective spatial resolution of approximately 1cm. That is, single pulses of stimulation delivered roughly 1cm apart on the scalp evoke very different responses in both motor (Brasil-Neto, McShane, Fuhr, Hallett, & Cohen, 1992) and premotor cortices (Schluter, Rushworth, Mills, & Passingham, 1999). Even short trains of repetitive stimulation have been shown to produce double dissociations between regions separated by as little as 2cm (Gough, Nobre, & Devlin, 2005; Pitcher, Charles, Devlin, Walsh, & Duchaine, 2009). Thus the current findings of a clear dissociation between stimulation of vOTC and LOC, despite their anatomical proximity, are consistent with the growing literature suggesting that the functional resolution of the technique is sufficient to distinguish contributions from relatively focal patches of neocortex.
It is informative to compare the current TMS findings with those from related neuroimaging studies, where there are both similarities and important differences. For instance, many imaging studies report little, if any, activation in vOTC for consonant letter strings, consistent with the finding that vOTC stimulation had essentially no effect on consonant letter strings. Similarly, several imaging studies have demonstrated that activation in vOTC is modulated by word frequency, with greater activation for low than high frequency words (Bruno et al., 2008; Kronbichler et al., 2004; Kuo et al., 2003). Here, we observed a (non-significant) slow down for high frequency words of +7msec and a larger (significant) effect on low frequency words (+34msec), consistent with the imaging findings. It’s possible that with greater statistical sensitivity, we would be able to document a reliable, graded effect of TMS on different classes of orthographic stimuli, but the current findings certainly are consistent with the imaging literature. Pseudowords, on the other hand, typically evoke equal or greater vOTC activation than words (Mechelli et al., 2003; Nobre et al., 1994; Price et al., 1996; Rumsey et al., 1997) and yet, TMS did not significantly, or even consistently, affect responses to pseudowords. It is worth noting, however, that in a lexical decision task, participants make a different response for words (“yes”) and pseudowords (“no’) which may contribute to the different TMS effects for these two types of stimuli. On the other hand, the same response confound is present in the imaging studies using lexical decision, so this alone cannot explain the apparent disconnect between the TMS and imaging results. Further studies will be necessary to determine whether TMS differentially affects words and pseudowords even when the response is matched across conditions such as in a naming task. The current findings that vOTC makes different contributions to words and non-words, however, are consistent with previous imaging studies that also found reliable lexicality effects in vOTC (Devlin, Jamison, Gonnerman, & Matthews, 2006; Fiebach, Gruber, & Supp, 2005) and have potentially important implications for understanding vOTC contributions to reading.
How do these findings speak to theories of vOTC function? Lexical frequency (i.e. LF > HF) and lexicality (i.e. words > pseudowords) effects are both difficult to explain in terms of stored, pre-lexical representations (Cohen et al., 2000; Cohen et al., 2002; Dehaene et al., 2005; McCandliss et al., 2003), particularly when the stimuli were matched on orthographic factors including bigram frequency. According to this account, neurons in vOTC have receptive fields for bigrams - that is, they are specialized for detecting particular visual stimulus characteristics - and are therefore insensitive the specific response properties of the task. As a result, any stimuli comprised of legal bigrams should be equally affected by TMS, with no difference between low and high frequency words or pseudowords, irrespective of the response. Clearly the current TMS results, as well as previous imaging findings (Bruno et al., 2008; Devlin et al., 2006; Fiebach et al., 2005; Kronbichler et al., 2004; Kuo et al., 2003), are inconsistent with this proposal. An alternate possibility is that vOTC is the site of stored lexical representations of visual words (Kronbichler et al., 2004). Although sufficient to explain the current TMS results, this account is incompatible with previous imaging evidence showing neural repetition priming effects for visually similar, but lexically distinct word pairs such as “corner-corn” (Devlin et al., 2006). A different possibility is that vOTC acts as an interface, integrating bottom-up, visual form information (that is not specific to written words) with top-down, non-visual properties of the stimulus such as its sound or meaning (Devlin et al., 2006; Hillis et al., 2005; Price & Friston, 2005). Because low frequency items place greater processing demands than high frequency items, there is greater activation in vOTC and stimulation has a larger disruption effect. By this account, however, pseudowords could be considered extremely low frequency items. In this case, the additional processing demands reflect the search, and subsequent failure, to link the visual stimulus with its non-visual properties (e.g. what is the meaning of “bocket”?) and leads to the greater vOTC activation seen for pseudowords (Mechelli et al., 2003; Nobre et al., 1994; Price et al., 1996; Rumsey et al., 1997). Stimulation of the region does not have a significant effect on behaviour, however, because the integration process fails regardless of the TMS-induced disruption. If true, then this hypothesis predicts that TMS will not affect stimuli when this integration process fails, regardless of the response demanded by the task. Clearly further work is needed to better evaluate all three of these theories.
Finally, what do the paired pulse results reveal about the temporal nature of orthographic processing within vOTC? Numerous ERP and MEG studies demonstrate that both written words and pseudowords evoke an early midline occipital positivity at approximately 80-100msec post-stimulus, followed by a left occipito-temporal negativity between 170-200msec (Bentin, Mouchetant-Rostaing, Giard, Echallier, & Pernier, 1999; McCandliss, Posner, & Givon, 1997; Salmelin et al., 1996; Tarkiainen, Helenius, Hansen, Cornelissen, & Salmelin, 1999). The initial component reflects activity in V1 and is common to all visual stimuli whereas the second component is thought to arise in vOTC and distinguishes between orthographic and non-orthographic stimuli (Bentin et al., 1999; Cohen et al., 2000; Nobre et al., 1994). Thus it is typically assumed that vOTC is activated by an excitatory feedforward volley of activity spreading ventro-laterally from V1 to V2 and V4 and then into vOTC at approximately 170-200msec post stimulus onset (Dehaene et al., 2005; McCandliss et al., 2003). Our results, however, suggest that information is present in vOTC substantially earlier - possibly as early as 80 msecs after the stimulus appears on the retina. Even if the first pulse anticipated the feedforward volley of action potentials, its disruptive effect is believed to last only 30-40msecs (Amassian, Cracco, Maccabee, Cracco, Rudell et al., 1989; Corthout, Uttl, Walsh, Hallett, & Cowey, 1999; Ilmoniemi et al., 1997), which suggests a time lag between the TMS and ERP/MEG latency values. Interestingly, the TMS results better match those from multi-unit recordings in awake monkeys. For instance, the onset latencies for action potentials in V1 are between 20-30msec (Givre, Schroeder, & Arezzo, 1994; Maunsell & Gibson, 1992; Schroeder, Mehta, & Givre, 1998) which matches TMS-induced disruption (Corthout et al., 1999) but considerably anticipates the P80-100 component. Similarly, the initial action potentials from the ascending visual pathway appear in posterior inferotemporal cortex between 60-120msec post-stimulus onset (Ashford & Fuster, 1985; Chen, Lakatos, Shah, Mehta, Givre et al., 2007; Richmond, Optican, Podell, & Spitzer, 1987; Schroeder et al., 1998), consistent with the current TMS findings but much earlier than the N170 component. In other words, the time at which TMS exerts its disruptive effect precedes the peak times reported in ERP/MEG experiments (Walsh & Cowey, 2000). Presumably this reflects the fact that these components arise from large-scale neuronal synchrony across activity in multiple structures and therefore lag behind the earliest wave of activity in any given structure (Schroeder et al., 1998; Walsh & Cowey, 2000). As a consequence, chronometric TMS studies may offer a more accurate measure of absolute regional timings than ERP or MEG, despite their greater temporal resolution.
In conclusion, we have demonstrated that TMS can be used to successfully stimulate parts of ventral occipital-temporal cortex and interfere with visual word recognition. These experiments open the door for a systematic exploration of vOTC contributions to reading and its relation to other higher order visual functions (Joseph, Cerullo, Farley, Steinmetz, & Mier, 2006; Price & Devlin, 2003; Starrfelt & Gerlach, 2007). Similarly, although we have focused entirely on the left vOTC, this same approach should be useful for investigating right vOTC including the so-called fusiform face area and its contributions to face recognition (Haxby, Horwitz, Ungerleider, Maisog, Pietrini et al., 1994; Kanwisher, McDermott, & Chun, 1997; Sergent, Ohta, & Macdonald, 1992) and other categories of visual objects (Haxby, Hoffman, & Gobbini, 2000; Haxby, Gobbini, Furey, Ishai, Schouten et al., 2001).
We would like to thank Phillip Kelley and Iris Knierim for their help with data collection and also Marty Sereno for helpful discussions. This research was supported by the BBSRC (KJD), “Rayonnement International”, Université Libre de Bruxelles (CP), and the Wellcome Trust (JTD).