|Home | About | Journals | Submit | Contact Us | Français|
Models of selective attention predict that focused attention to spatially contiguous stimuli may result in enhanced activity in areas of cortex specialized for processing task-relevant and task-irrelevant information. We examined this hypothesis by localizing color-sensitive areas (CSA) and word and letter sensitive areas of cortex and then examining modulation of these regions during performance of a modified version of the Stroop task in which target and distractors are spatially coincident. We report that only the incongruent condition with the highest cognitive demand showed increased activity in CSA relative to other conditions, indicating an attentional enhancement in target processing areas. We also found an enhancement of activity in one region sensitive to word/letter processing during the most cognitively demanding incongruent condition indicating greater processing of the distractor dimension. Correlations with performance revealed that top-down modulation during the task was critical for effective filtering of irrelevant information in conflict conditions. These results support predictions made by models of selective attention and suggest an important mechanism of top-down attentional control in spatially contiguous stimuli.
In a complex visual scene, attentional selection allows relevant items to be processed while irrelevant items are ignored. Top-down selection processes are thought to bias competition in favor of task-relevant information and against task-irrelevant information. In support of this, neuroimaging research has reported that directed attention effectively modulates activity in sensorimotor brain regions. For example, using positron emission tomography (PET), Corbetta et al.  reported that directing attention to certain features of a visual stimulus, such as color, shape, or speed, increased activity in distinct areas of visual cortex that were specialized for processing the particular feature. Directing attention to a particular location also increases activity in extrastriate regions of monkeys  and humans [20,35,38] and anticipation of a stimulus before any visual stimulation occurs can also alter activity levels in task-relevant extrastriate regions that process color  and motion [7,22,36]. Therefore, attention can modulate activity in striate and extrastriate regions of cortex for a variety of task-relevant stimulus dimensions.
Both enhancement and suppressive processes in striate and extrastriate cortices have been purported to be essential components in a push–pull mechanism of attentional selection [49,52]. For example, attentional enhancement of activity in cortical regions that process an attended stimulus location or other feature (e.g. [10,46,48]) has been related to improved behavioral performance and better task-related selection processes . In addition, attention can reduce, inhibitor suppress, visual cortex activity associated with processing to-be-ignored stimuli or task-irrelevant distractors [48,57].
The efficacy of attentional selection to filter task-irrelevant information and bias task-relevant information is dependent on the cognitive demands of the task [32,31,42,47]. Specifically, the cognitive demands can moderate the influence that task-irrelevant distractors have on performance. For example, behavioral studies have shown that under conditions of greater cognitive load, interference from distractors is increased, suggesting that (1) the ability for selective attention to ignore irrelevant distractors is dependent on the cognitive demands of the task and (2) distractor information is processed to a greater degree under conditions of high cognitive load [31,32]. Consistent with behavioral results, De Fockert et al.  demonstrated that regions of the visual cortex that were related to processing distractor information exhibited increased activity under high memory loads, supporting the claim that distractor information is more extensively processed under conditions of higher cognitive demand. Therefore, both behavioral and neuroimaging results argue that the efficacy of attentional selection in filtering task-irrelevant material is dependent upon cognitive load and that the processing of distractors increases in cognitively demanding conditions.
The processing of distractor information, however, may require spatial separation between the targets and distractors [33,40]. Lavie  suggested that when the target and distractor form part of the same stimulus, increased attention to the target might result in a concomitant increase in attention towards the distractor. This argument predicts that for spatially contiguous or coincident stimuli, cortical processors specialized for processing the task-relevant and the task-irrelevant features should show increases in activity under attentionally demanding conditions.
The Stroop task is an example of an attentionally demanding task in which target and distractor information are not just spatially contiguous, but spatially coincident. In this task, participants are instructed to respond to the ink color of a printed word while ignoring the meaning of the printed word. Therefore, both target (i.e. responding to the ink color) and distractor information (i.e. the printed word) are spatially coincident and cognitive demands are higher in conditions in which the target and distractor information are incongruent with each other (e.g. when the word RED is printed in blue ink).
Models of selective attention and top-down control , predict that in the Stroop task, regions of visual cortex that are sensitive to processing color (target) information should be selectively enhanced as attentional demands are increased. Increased activity in target-sensitive areas of cortex would be representative of increased top-down bias to the task-relevant attribute. Greater top-down bias would then lead to better task performance. Therefore, the magnitude of activity in color-sensitive areas of striate and extrastriate cortex should be positively correlated with performance (negatively correlated with reaction times) when attentional demands are highest, that is when the printed word and ink color conflict.
Models of selective attention and top-down control also predict that areas of striate and extrastriate cortex sensitive to processing word (distractor) information should be selectively enhanced under conditions of high cognitive demand since the distractor information is more extensively processed in spatially coincident stimuli. Therefore, the magnitude of activity in distractor processing regions of cortex, such as those that are involved in lexico-semantic processing and grapho-phonological processing, can be considered an index of the degree of distractor processing. Greater distractor processing should result in greater interference during the most demanding incongruent Stroop task condition and less so in less demanding conditions, such as the congruent condition. Therefore, the magnitude of activity in some of the word-sensitive distractor regions that are involved in letter and symbol recognition, semantic processing, and grapho-phonological conversion, should be negatively correlated with performance (positively correlated with reaction times) in the incongruent Stroop condition.
We tested these hypotheses by localizing areas of the visual cortex that were sensitive to the presentation of passively viewed word/letter and color stimuli. Previous studies have successfully utilized a similar passive-viewing localizer approach to assess attentional modulation of face stimuli [14,15], color-checkerboards , word-finding processes , testing localization of theory-of-mind , and testing areas responsive to number symbols and numerosities  and objects and houses .
In the present study we employed a modified version of the Stroop task to examine the way sin which cognitive demand, engendered through conflict, in spatially coincident stimuli modulates activity in target-sensitive and distractor-sensitive regions of cortex. We examined this on individually assigned regions-of-interest (ROIs), which allowed us to define regions sensitive to the manipulation on an individual-by-individual basis. We also computed correlations between these regions and Stroop task performance to test the predictions of models of selective attention and top-down control, as described above.
Fourteen participants (9 male; 5 female) between the ages of 18 and 29 (mean age = 23.46) signed an informed consent approved by the University of Illinois and met or surpassed all criteria for participating in an MRI experiment including no previous head trauma, no claustrophobia, and no metallic implants. Participants were paid $ 15 dollars an hour for volunteering.
Visual stimuli were presented with MRI-safe fiber optic goggles (Resonance Technologies Inc.) and manual responses to Stroop stimuli were obtained by the use of an MRI-safe five-button pad. Three buttons were used during the task with each button corresponding to a different ink color (see Stroop task). Participants were first run in the Stroop task followed by the localizer tasks and structural scans.
A 3T Siemens Allegra head-only MRI scanner was used for structural and functional MRI measures. For the fMRI protocol, we employed a fast echo-planar imaging (EPI) sequence with Blood Oxygenation Level Dependent (BOLD) contrast and collected a total of 150 T2* weighted volumes per participant (TR = 1.5; TE = 26; flip angle = 60) for the checkerboard tasks and a total of 220 T2* weighted volumes per participant (TR = 1.5; TE = 26, flip angle = 60) for the word tasks. For the Stroop task we collected a total of 380 T2* weighted volumes per participant (TR = 1.5; TE = 26; flip angle = 60). Twenty-eight slices (4 mm thickness with a 0% gap) were collected in an ascending and sequential fashion parallel to the anterior and posterior commissures for all three tasks.
In addition, a high-resolution T1-weighted MPRAGE (1.3 mm × 1.3 mm × 1.3 mm) anatomical image was collected for each participant. The anatomical images were skull-stripped using a brain extraction technique  and subsequently used for registration purposes.
The functional MRI data for each participant was preprocessed using FSL version 3.2 . For the localizer tasks, the images were slice-time corrected, motion-corrected using MCFLIRT , temporally filtered with a Gaussian high pass cut-off of 120 s and a low pass cut-off of 9.32 s, and spatially smoothed with a 5 mm full-width half-max 3D Gaussian kernel. The data from the Stroop task were also slice-time corrected and motion-corrected using the same protocols as those for the localizer tasks. The Stroop task data was temporally smoothed with a high-pass cut-off at 80 s and a low-pass cut-off of 1.5 s, and spatially smoothed with an 8 mm full-width half-max 3D Gaussian kernel.
In order to localize CSA we presented flashing black and white checkerboards and flashing color checkerboards at a rate of 8 Hz. Each condition was presented in two separate 30 s blocks that alternated with 30 s blocks of a crosshair baseline. Participants were instructed to keep their eyes open and attend to the screen. The luminance of the black and white checkerboards (22.40 cd/m2) was slightly higher than the luminance for color checkerboards (10.93 cd/m2) while the Michelson contrast ratio was equivalent (color, .978; black and white, .979). A slightly higher luminance of the black and white checkerboard display compared with the color-checkerboard display should be a conservative estimate of the differences in which the brain shows greater activation for color when compared to black and white images.
After preprocessing, the data were convolved with a Gamma function to model the hemodynamic response to each block for each condition. This first-level analysis resulted in voxel-wise parameter estimate maps for the entire brain for each condition and for the direct comparison between color checkerboards and black and white checkerboards. These maps were then registered into standard space (MNI space) and forwarded to a higher-level group analysis where a fixed-effects analysis was performed to find areas across participants that were sensitive to color. A fixed-effects analysis was chosen at this stage because we were only interested in the CSA specific to the participants in this sample since our ROI analysis would be performed on an individual-by-individual basis. The result from the color versus black and white checkerboard comparison was thresholded at a voxel-wise z-score of 2.33 (p<.01) and a cluster-wise threshold of p<.05. The resulting statistical map was then converted into a binary map and used as a mask to find peaks for each participant. Therefore, the peaks that we report for each participant are located within the mask from the group analysis. The statistical peak from each participant within this mask was found for each person and voxels contiguous with the peak within 5 mm (10 mm diameter) were then used as an ROI for each participant. The mean of these voxels were then extracted for the color checkerboard condition versus baseline, the black and white checkerboard condition versus baseline, as well as each condition in the word localizer task (see below) versus baseline. This was done to (a) ensure that the peak was more sensitive to color checkerboards than black and white checkerboards, words, pseudo-words, and letter strings, and thus an area we could validly call ‘CSA’, and (b) to use as ROIs in the Stroop task to examine top-down modulatory effects of conflict.
In addition to these empirically derived CSA ROIs, we extracted the activity for each condition from V4, a classically defined region sensitive to processing color . Specifically, we used Tal2MNI software (http://imaging.mrc-cbu.cam.ac.uk/imaging/MniTalairach) to convert Talairach coordinates for V4 to MNI space as reported by Zeki et al. . The left hemisphere coordinates for V4 (−26, −68, −8) were converted to −26, −69, −13 and the right hemisphere coordinates for V4 (20, −66, −4) were converted to 20, −68, −8 (see Table 1). Others have also reported similar coordinates for V4 [3,27,41]. A 10 mm diameter sphere around each of the points was used as a pre-defined ROI, and the mean activity in these regions was extracted for each condition and compared using a repeated-measures ANOVA with planned comparisons.
In order to localize word and letter sensitive areas of cortex we presented 30 s blocks of words that were matched for frequency  and length with the color words and neutral words used during the Stroop task. In addition, we presented 30 s blocks of pronounceable pseudo-words and letter strings that were also matched for length with the Stroop words. Similar to the checkerboards, each condition was presented in two separate 30 s blocks that alternated with 30 s blocks of a crosshair baseline. Participants were instructed to keep their eyes open and attend to the screen. All letter strings, pseudo-words, and words were presented in white font on a black background.
Similar to the analyses for the CSA, the blocks of trials were convolved with a gamma function. Planned comparisons between words and both pseudo-words and letter strings resulted in parameter estimate maps for each individual participant. These maps were then registered into standard space (MNI space) and forwarded to a higher-level group analysis.
Because these main comparisons did not result in any regions that were specific to printed words, we performed a few other analyses to try and isolate word/letter sensitive areas of cortex. Therefore, we first used coordinates from previously published manuscripts (see Table 1) that have reported localization of a visual word form area (VWFA) as well as other regions associated with lexico-semantic processing from a recent meta-analysis [26,67]. This resulted in three areas in the posterior left hemisphere that were subsequently used as ROIs with a 5 mm radius. The mean percent signal change for each Stroop condition from each ROI was extracted for each individual and evaluated for modulatory effects. Second, we examined areas that were commonly active for words, pseudo-words, and letter strings compared to baseline, but were not active for either color or black and white checkerboards. These areas represent regions involved in processing the symbolic representation of letters. We then used the clusters from this comparison as ROIs and extracted the peaks for each participant. Similar to the checkerboard analysis, we extracted the values from these ROIs for each participant and took an average of the immediately contiguous 124 voxels around the peak (10 mm diameter). The mean of these voxels were then extracted for each condition versus baseline as well as each condition in the checkerboard localizer task versus baseline. This was done to (a) ensure that the peak was more sensitive to words, pseudo-words, and letter strings than either color checkerboards or black and white checkerboards, and (b) to use as ROIs in the Stroop task to examine modulatory effects of conflict on regions processing task-irrelevant information. These analyses resulted in five separate regions for word area analysis (3 predefined ROIs; 2 empirically defined ROIs).
We used incongruent, neutral, and congruent stimuli in an event-related design. Congruent stimuli were words that matched the ink color (e.g. RED in red ink). Neutral stimuli were words that were matched for frequency and word length with the color words, but that were not associated with color (e.g. SHIP in red ink). Incongruent-eligible stimuli were stimuli in which the printed word matched one of the potential responses (e.g. RED in blue ink if ‘red’ is one of the potential responses). In congruent-ineligible stimuli were stimuli in which the printed word did not match the set of potential responses (e.g. PURPLE in blue ink if ‘purple’ is not one of the potential responses). We had two ink color sets that were counter balanced across participants. For one of the ink color sets the eligible colors were red, orange, and purple while the ineligible words were blue, green, and yellow. The other color set had eligible responses of blue, green, and yellow while the ineligible words were red, orange, and purple. Each condition consisted of 36 trials. For the incongruent-eligible condition, each ink color was paired with both response eligible color words 12 times during a session (six times per word). However, for the incongruent-ineligible trials, each ink color was paired with one of three ineligible words 12 times (four times per word). Each stimulus was displayed for 1 s with a 1.5 s response window and a 3 s stimulus-onset asynchrony (SOA). A crosshair (+) was presented on the screen during all interstimulus intervals. We employed an event-related stimulus design with a 40% jitter, such that the timing between trials varied, in order to optimize the stimulus sequence and timing. Stimulus sequence and jitter for each participant was generated using OptSeq2 (http://surfer.nmr.mgh.harvard.edu/optseq/). Participants were instructed to respond as quickly and accurately as possible during the task.
Whole-head data from this task was analyzed by using a double-gamma convolution with each condition as a separate explanatory variable. The errors and motion parameters were modeled separately and used as covariates of no interest in the model. All comparisons between conditions were performed on an individual basis and then forwarded to a higher-level group analysis where a mixed effect ANOVA was performed on all participants. In order to examine the modulatory effects of conflict on enhancement or suppression of activity, we extracted the average percent signal change in each of the ROIs for all participants for each of the Stroop conditions (congruent, neutral, ineligible, eligible). These values were then subjected to statistical analysis using SPSS (11.01 for Mac).
Analysis of the functional MRI data was carried out using FEAT (FMRI Expert Analysis Tool) Version 5.1, part of FSL (FMRIB’s Software Library, www.fmrib.ox.ac.uk/fsl). Higher-level analyses were carried out using FLAME . All results from the final whole-head analysis resulted in Z statistic images that were thresholded using clusters determined by Z > 3.1 and a (corrected) cluster significance threshold of p<0.01.
For the Stroop task, the ROIs from the localizer scans (see above) were applied to the individual parameter estimate maps in the participant’s original space (before registration into standard space). The mean parameter estimates from each ROI for each condition and for each participant was extracted and subsequently used in an ANOVA in SPSS version 11.02. Correlations were carried out in SPSS between subject’s response times for each condition of the Stroop task and percent signal change in each of the ROIs.
The response times were analyzed using a repeated-measures ANOVA with condition as a within-subjects factor (see Table 2). We found a significant main effect of condition (F(3,39) = 8.83; p<.001). Planned comparisons revealed a marginal difference between the congruent and neutral conditions (p<.06), supporting previous claims for as light facilitation effect of congruent word and ink color information. We also found that the response times for the neutral condition were significantly faster than the incongruent-eligible condition (p<.003) and the incongruent-ineligible condition (p<.05). Furthermore, the response times for the incongruent-eligible condition were reliably slower than the response times for the incongruent-ineligible condition (p<.032), replicating previous findings that the incongruent-eligible condition differs in its level of conflict from that of the incongruent-ineligible condition .
The accuracy rates were also analyzed by a repeated-measures ANOVA with condition as a within-subjects factor (see Table 2). We found a main effect of condition (F(3,39) = 9.48; p<.001). Planned comparisons revealed that the error rates between congruent and neutral conditions were not different (p<.32). However, the incongruent-eligible (p<.001)and incongruent-ineligible(p<.007) trials had significantly higher error rates than the neutral condition. The error rates for the incongruent-eligible and incongruent-ineligible condition were not significantly different (p<.20).
The results from the group analysis comparing color-checkerboards with black and white checkerboards revealed a large cluster of activation in visual cortex (Fig. 1). This cluster was used as an inclusive mask, and the peaks from each subject were extracted for each of the localizer conditions (see Section 2). In a repeated-measures ANOVA with localizer (color checkerboards, black and white checkerboards, words, letter strings, pseudo-words) as a within-subjects variable, we found that the overall effect of condition was significant (F (4,52) = 4.154; p<.005). Importantly, in a series of planned comparisons we found that the CSA had significantly greater activity when processing color than when processing black and white checkerboards (p<.026), words (p<.015), letter strings (p<.049), or pseudowords (p<.016) (see Fig. 1). There were no differences in this ROI between black and white checkerboards, words, letter strings, or pseudo-words. This result indicates that we successfully localized color-sensitive regions of cortex.
We also created predefined ROIs in the left and right V4 region based on previous research on color-processing in the visual cortex  (see Fig. 2). For the left hemisphere, in a repeated-measures ANOVA with localizer (color checkerboards, black and white checkerboards, words, letter strings, pseudo-words) as a within-subjects variable, we found that the overall effect of condition was significant (F (4,52) = 14.28; p<.001). In a series of planned comparisons we found that activation to color checkerboards was significantly greater than activity for black and white checkerboards (p<.05), words (p<.001), letter strings (p<.001), and pseudo-words(p<.001). For the right hemisphere, in a repeated-measures ANOVA with localizer (color checkerboards, black and white checkerboards, words, letter strings, pseudo-words) as a within-subjects variable, we found that the overall effect of condition was significant (F(4,52) = 37.88; p<.001). In a series of planned comparisons we found that there was a trend for activation to color checkerboards to be greater than activity for black and white checkerboards (p<.09). In addition, activity to color checkerboards was significantly greater than activity to words (p<.001), letter strings (p<.001), and pseudo-words (p<.001).
The results from the group analysis that compared word activity with either letter strings or pseudo-words failed to show any clusters of activity at our threshold. However, many studies that have reported VWFA have reported greater activation for both letters and words compared to non-word stimuli such as checkerboards, pictures, textures, or faces [8,18,19,25,53–56,58] and even current arguments for extrastriate specialization for reading have focused on specialization for letters rather than entire words . Since our localizer conditions all consisted of letters, we examined if there were any areas sensitive to processing letters relative to either color or black and white checkerboards. Therefore, we first found the statistical conjunction of word activation, letter string activation, and activation for pseudo-words. This resulted in two brain regions sensitive to the presence of letters: left superior parietal lobule near a location that others have reported associated with word encoding , and a left middle temporal region located more anterior to the commonly reported VWFA [9,63], but overlapping with the middle temporal region reported in studies examining the lexico-semantic route of word reading [26,67]. We then found the peak for each participant within these clusters and extracted the percent signal change from this defined ROI for all five localizer tasks for each participant. We found that for the left parietal lobule the overall effect of condition from the repeated-measures ANOVA model was significant (F(4,52) = 4.328; p<.004). Planned comparisons revealed that the activity for letter strings was significantly greater than the activity for color (p<.043) and black and white checkerboards (p<.013) and the activity for pseudo-words was significantly greater than the activity for color (p<.011) and black and white checkerboards (p<.005). However, there were no differences between words and color (p<.723) or black and white checkerboards (p<.415) (see Fig. 3). In the left temporal region we found that the overall ANOVA from the repeated-measures analysis was also significant (F(4,52) = 3.98; p<.007). Planned comparisons revealed that activity in this region marginally differed between words and color checkerboards (p<.06) and significantly differed from black and white checkerboards (p<.027). Letter strings significantly differed from color checkerboards (p<.043) and black and white checkerboards (p<.037). Finally, pseudo-words also significantly differed from color checkerboards (p<.021) and black and white checkerboards (p<.048). Therefore, although we failed to find any areas specific to processing words, we successfully found areas specialized in processing letters compared to processing either color or black and white checkerboard displays. This result replicates prior research reporting areas in the occipito-parietal regions involved in processing words as well as middle temporal engagement during lexico-semantic processing [9,67].
In all Stroop task comparisons, we found a similar pattern of activity that others have reported for the incongruent relative to neutral conditions including left and right dorsolateral prefrontal cortex, ventrolateral prefrontal cortex, anterior cingulate cortex, left and right parietal cortices, and extrastriate cortex [1,2,15,16,37,42–44,50]. Furthermore, we replicated the pattern of eligibility effects that others have reported previously [34,42,44]. These results suggest that the right ventral prefrontal regions are involved in response conflict, while bilateral dorsolateral prefrontal regions are involved in maintenance of an attentional set in working memory. Since others have spent considerable time describing and explaining effects of the Stroop task on the attentional network, in this paper we focus on the modulation of CSA and word areas instead of an in depth discussion of the Stroop results.
To address the question of attentional modulation on target (color) areas of extrastriate cortex, we analyzed the CSA regions described above during each condition of the Stroop task. To do this we conducted a repeated-measures ANOVA with condition (neutral, congruent, incongruent-ineligible, incongruent-eligible) as a within-subjects factor. We found that the main effect of condition was significant (F(3,39) = 4.892; p<.006). Consistent with predictions made by models of selective attention and top-down control, planned comparisons revealed that in the CSA, the activity associated with the incongruent-eligible condition was significantly greater than the activity for the congruent condition (p<.015), the neutral condition (p<.02), and the incongruent-ineligible condition (p<.001) suggesting that activity in the CSA was significantly enhanced under the most cognitively demanding condition, that is, the incongruent-eligible condition relative to the other three conditions (see Fig. 4).
Interestingly, the incongruent-ineligible condition failed to show any significant enhancement of activity in the CSA relative to the neutral (p<.855) or congruent conditions (p<.787) suggesting that it is not simply the presence of conflict that results in enhanced activity in CSA, but specifically the greater cognitive demands manifested within the incongruent-eligible condition.
In the predefined V4 regions of visual cortex, we found a non-significant effect of condition using a repeated-measures ANOVA with condition (neutral, congruent, incongruent-ineligible, incongruent-eligible) as a within-subjects factor for the left hemisphere (F(3,39) = .421; p<.739) and the right hemisphere (F(3,39) = 2.03; p<.126). None of the planned comparisons reached significance at p<.05 for the left hemisphere region. For the right hemisphere the incongruent-ineligible condition had significantly greater activity than the congruent condition (p<.03) and there was also a trend for the incongruent-eligible condition to have greater activity than the congruent condition (p<.07). Therefore, we failed to find any task-related modulation of activity in the predefined V4 region of the left extrastriate cortex, but at least a trend for the incongruent conditions of the Stroop task to show task-related modulation in the right hemisphere.
It is possible that the modulated activity in CSA regions were being driven by a general increase in activity throughout visual cortex for the incongruent-eligible condition. To test this we extracted the mean signal from a 10 mm diameter ROI from primary visual cortex (x = −12, y = −96, z = −12) from each of the four Stroop conditions. If the modulation of activity during the incongruent-eligible condition was due to a non-specific and a general increase of activity throughout visual cortex, then this region should also demonstrate a significant increase in activity during the incongruent-eligible condition relative to the other conditions. However, if the modulation during the incongruent-eligible condition was specific to color-sensitive regions, then there should not be modulation of activity in the non-specific ROI. We failed to find any main effect of condition (F(3,39) = 1.062; p<.376)in this region. Furthermore, all of the planned comparisons between the conditions were not significant. This result suggests that there is not a general increase in visual cortex activity in response to the most demanding condition, but rather a specific increase in category-specific regions that process the target information.
To address the question of attentional modulation on distractor (word/letter) areas that were empirically derived by the localizer conditions, we conducted a repeated-measures ANOVA with condition (neutral, congruent, incongruent-ineligible, incongruent-eligible) as a within-subjects factor for each region separately. First, we found that for the left temporal area (F(3,39) = .15; p<.929) and the left parietal area (F(3,39) = .994; p<.406) the overall effect of condition was not significant. Further-more, planned comparisons revealed that in both the temporal and parietal cortex, none of the conditions were reliably different from one another. This effect suggests that in our empirically derived ROIs, there was neither enhancement nor suppression of distractor related areas of cortex during any condition of the Stroop task.
We also analyzed the predefined ROIs based on previous studies using a repeated-measures ANOVA with condition (neutral, congruent, incongruent-ineligible, incongruent-eligible) as a within-subjects factor for each region separately. We found that in the VWFA (coordinates: −43, −54, −12) there was a non-significant trend for a main effect of condition (F(3,39) = 1.99; p<.13; see Fig. 5). Planned comparisons in this region revealed that there was greater activity in this region for the incongruent-eligible condition than the neutral condition (p<.03) and significantly greater activity for the incongruent-eligible condition than the congruent condition (p<.016). No other effects were significant in this region. Furthermore, in the inferior occipital gyrus ROI (coordinates: −42, −78, −10), we found a non-significant trend for condition (F(3,39) = 2.12; p<.11). Planned comparisons revealed that there was a trend for activity for the incongruent-eligible condition to be greater than the activity for the neutral condition (p<.07). There were no significant effects for any of the other predefined ROIs. These results argue that greater attentional demands in the incongruent-eligible conditions of the Stroop task tend to be related to increased activity in word/letter sensitive regions of cortex. However, this modulation was dependent on the region examined – only one out of the five regions examined showed significant task-related modulation during the Stroop task (VWFA), with an additional region trending towards significance (inferior occipital gyrus).
We predicted that enhancement of activity in CSA regions would be associated with enhanced performance manifested by reduced reaction times during the Stroop task. To this end, we performed a series of one-tailed correlations that examined whether the level of activity in the CSA during each condition of the Stroop task and for each subject was correlated with response times. As predicted, we found that faster response times for the in congruent-eligible condition were associated with higher activity levels in the CSA (r = −.45; p<.05) (Fig. 6). In addition, the incongruent-ineligible condition also showed a significant negative correlation between response times and activity in the CSA (r = −.47; p<.04) despite the lack of any significant modulation of activity in the CSA for this condition (Fig. 6). The response times for the neutral (r = .05; p<.433) and the congruent (r = −.03; p<.456) conditions were not correlated with activity in the CSA. These results argue that activity in color-sensitive regions of cortex only impacts performance when interference and attentional demands are highest and that an up-regulation of this region during the Stroop task was associated with enhanced task performance (Table 3).
In the predefined V4 region in the left hemisphere we failed to find any task-related modulation of activity for any Stroop task condition. In a similar vein, the correlations with activity in this region and behavioral performance were not limited to the conflict conditions. Specifically, correlations between performance and activity revealed that, similar to the empirically defined ROIs described above, there were significant negative correlations between reaction times and percent signal change for all conditions but neutral. Specifically, we found that faster response times were associated with higher activity levels in left V4 for the incongruent-eligible condition (r = −.81; p<.001), the incongruent-ineligible condition (r = −.60; p<.001), the congruent condition (r = −.50; p<.03) and a trend for the neutral condition (r = −.42; p<.06). For the right hemisphere V4 region, the incongruent-ineligible condition showed a trend for greater activity than the other Stroop task conditions. However, unlike the left hemisphere V4 region none of the correlations between reaction time and activity were significant (incongruent-eligible condition (r = −.18; p<.265), the incongruent-ineligible condition (r = −.39; p<.08), the congruent condition (r = .26; p<.18) and neutral condition (r = .12; p<.33)).
To ensure that these correlations between activity in CSA regions and behavioral performance were not being driven by a general increase in activity throughout visual cortex for the better performers, we correlated the activity in a region of visual cortex that was unresponsive to our color manipulation with performance on the incongruent conditions. We extracted the mean signal from a 10 mm diameter ROI from primary visual cortex (x = −12, y = −96, z = −12). If enhanced performance on the Stroop task was associated with a non-specific and a general increase of activity in visual cortex, then this region should also be correlated with performance, however, if enhanced performance was specific to color-sensitive areas, then there should not be a correlation between performance and activity in the non-specific ROI. Indeed, we failed to find a significant correlation between activity in this ROI and performance on the incongruent-eligible condition (r = −.04) or performance on the incongruent-ineligible condition (r = −.07) suggesting that performance-related correlations were limited to category-specific regions of cortex and not due to a general, and non-specific increase of activity throughout visual cortex.
Similar to the CSA regions, we examined whether behavioral performance for each condition was correlated with the level of activity in the temporal and parietal regions defined empirically as well as the level of activity in each of the predefined ROIs. Top-down models of selective attention predict positive correlations between activity and response times in distractor regions of cortex. That is, performance should be worse with greater processing of the distractor information.
First, despite the lack of modulation during the Stroop task, we found that response times for the incongruent-eligible (r = .87; p<.001), congruent (r = .37; p<.09), and neutral (r = .64; p<.007) conditions were positively correlated, at least marginally, with the level of activity in the left parietal lobule such that faster response times, or better performance, were associated with less activity in this region (Fig. 7). The correlation between the response times for the incongruent-ineligible condition and activity in the left parietal region was not significant (r = .09; p<.37). Therefore, the magnitude of activity in this region does not seem to be dependent on the cognitive demands or the degree of conflict in the task, but rather a general mechanism associated with performing all task conditions.
For the left middle temporal lobe region we found that the response times for the incongruent-eligible condition (r = .42; p<.06) were marginally correlated with the level of activity, but neither the incongruent-ineligible (r = −.14; p<.31), congruent (r = −.12; p<.34), nor neutral (r = −.02; p<.47) response times were significantly correlated with activity in this region. The trend for a positive correlation specific to the incongruent-eligible condition suggests that like the parietal region, faster response times were marginally associated with less activity. However, unlike the parietal region, this correlation was specific for the most demanding condition, that is, the condition with the greatest degree of conflict.
Second, for the predefined ROIs, we failed to find evidence that the activity in these regions was positively correlated with reaction times. Instead, we found the opposite pattern of correlations. Specifically, the VWFA ROI showed a negative correlation between activity for the incongruent-eligible condition and response times (r = −.46; p<.05) such that faster response times were associated with greater levels of activity (Fig. 8). In addition, for the inferior occipital gyrus (coordinates: −42, −78, −10) we found a negative correlation between activity for the incongruent-eligible condition and response times (r = −.56; p<.01), as well as for the incongruent-ineligible condition and response times (r = −.54; p<.02). Interestingly, the direction of these effects is opposite to that which we report for the middle temporal region and parietal region, and are therefore inconsistent with predictions made from models of top-down control (Table 3). However, the tendency for these regions to show negative correlations with the conditions showing the greatest degree of conflict is partially consistent with push–pull mechanisms of top-down control. This is discussed in more detail below.
In this study, we examined whether cognitive demands engendered through conflict in the Stroop task would modulate activity in brain regions sensitive to processing target and distractor information. Consistent with theories of selective attention and top-down control  we found that activity in color-sensitive regions of cortex was up-regulated during conditions of high cognitive demand and behavioral conflict and that the magnitude of this up-regulation was positively correlated with performance (negatively correlated with response times). In addition, we found limited evidence for the hypothesis that there should be enhanced activity in distractor regions of cortex under highly demanding conditions. In only one of the five word/letter regions examined did we find significant enhancement of activity during the task, and this activity was positively, rather than negatively, correlated with performance. This effect might suggest that top-down control operations in this type of selective attention task are more associated with enhancement processes in target-related visual cortical areas rather than increased or decreased modulation of distractor regions. However, the correlations with performance in distractor related processing regions suggests that the magnitude of activity in these regions is still related to task performance, despite the lack of task-related modulation.
Interestingly, although we failed to find task-related modulation in many of the word/letter areas, we found that some of these regions were correlated with performance. Specifically, activity in the left parietal region was negatively correlated with performance, whereas activity in the VWFA and inferior occipital cortex was positively correlated with performance. These effects suggest that (1) Stroop task performance is related to the degree of activity in word/letter areas, despite a non-significant degree of modulation in many of these regions during the task, and (2) greater activity in some word/letter sensitive regions and less activity in other word/letter sensitive regions is related to enhanced task performance. These effects indicate that the capabilities of selective attention to influence category-specific regions of visual cortex are directly related with task performance and that the direction of the relationship is not monotonic.
The magnitude that attention during the Stroop task modulates, or correlates with, activity levels in word/letter areas of the visual cortex might vary as a function of the role that these regions play in word/letter recognition or processing. For example, the left middle temporal gyrus and the left superior parietal gyrus reported in this study have been linked to lexico-semantic processing and grapho-phonological processing, respectively (e.g. ). Neither of these regions showed task-related modulation of activity levels, but activity in the left parietal region was positively correlated with performance in nearly all of the Stroop task conditions suggesting that grapho-phonological processing plays a critical role in Stroop task performance, but not necessarily with resolving Stroop conflict. It is likely that the attentional set during the Stroop task (“Attend to the color and ignore the word”) is applied to all conditions regardless of the presence of conflict, and the level of activity in the left parietal region is representative of control operations that reduce grapho-phonological conversion of the distractor (word), resulting in an improvement in performance. In contrast, the VWFA and inferior occipital gyrus regions defined a priori based on prior investigations of word/letter string localization are responsive to a variety of symbols and letter strings . Both the VWFA and inferior occipital regions were negatively correlated with Stroop task performance. It is possible that semantic similarity between the color-word and the word itself results in a spreading of activation in early visual regions that process word/letter information, a hypothesis consistent with one connectionist model of Stroop task performance .
The different directions of the correlations between performance and activity and their locus within the extrastriate cortices suggests that a push–pull mechanism of attentional selection (enhancement and suppression mechanisms) might not only be dependent on the cognitive demands and semantic similarity between distractors and targets, but may also be dependent on the complexity of the receptive fields and processors of the region being moderated. The correlations and results discussed in this manuscript lend some support to this hypothesis. However, an alternative hypothesis is that better performers employ a different strategy during the Stroop task then their poorer performing counterparts. A difference in the type of top-down control mechanism may account for some of the differences in the direction of the correlations across the regions examined.
Although retinotopic mapping procedures would have been a more accurate method of demarcating regions of the visual cortex, we were able to successfully localize color-sensitive areas of cortex using our localizer method. These color-sensitive areas were distributed throughout primary and extrastriate cortices replicating other findings that regions involved with processing color extend from V1 through V4 . As discussed above, we demonstrated an increase in target (color) related activity during the most demanding condition of the Stroop task, thereby supporting predictions made by a number of models of attentional control [11,15,21,33,34,44,65,66] as well as arguments that top-down attention enhances target processing under high cognitive loads . In addition, better task performance for the incongruent conditions was related to greater activity in CSA. These correlations suggest that the efficacy of top-down modulation during the Stroop task has a direct effect on the performance capabilities to selectively attend to the task-relevant color dimension.
Interestingly, although response times were fastest for the congruent condition and reliably increased with attentional demands(congruent RT < neutral RT < incongruent-ineligible RT < incongruent-eligible RT), modulation of CSA was found only for the most demanding condition. This apparent discrepancy between the behavioral and neuroimaging results suggests that there is not a one-to-one relationship between cognitive demands, as measured behaviorally, and attentional modulation of target-related fMRI activity. This may highlight a limitation of models of top-down attentional control when target and distractor information are spatially contiguous or coincident .
The results described in this study are inline with previous work showing that posterior regions of the brain are up-regulated during an object-Stroop task . Other studies of attention have also reported modulation of activity in ventral visual cortex [10,20,48]. Previous Stroop studies have suggested that color areas of cortex would be up-regulated during the incongruent condition while word and letter related areas would be down-regulated . We failed to find evidence that distractor-related regions were down-regulated during the Stroop task. However, consistent with connectionist models of the Stroop task as well as models of perceptual and cognitive load [12,33], both task-relevant and task-irrelevant areas of cortex were up-regulated during the task.
One important caveat of this study is that empirically we failed to find any area of cortex specific to processing words, even at a lower threshold (p<.05). Our comparisons revealed that there were no differences between words, letter strings, and pseudo-words, and only by performing a conjunction analysis between the word, letter-string, and pseudo-word conditions and comparing the activation within the inclusive conjunction mask to the activity from color checkerboards and black and white checkerboards did we reveal areas that were sensitive to processing letters. Although difficult to explain a null result, our result is consistent with a large literature reporting the presence of word/letter related activity in extrastriate and occipito-temporal regions when contrasting letter strings or words with non-word stimuli such as textures, symbols, checkerboards, or a rest period [4–6,13,17–19,23,56,68]. In addition, the locality of our results are also consistent with other studies reporting an area of the left middle temporal region as involved in lexico-semantic processing and occipito-parietal engagement during memory for words  and grapho-phonological processing [26,67]. It should also be noted that our word/letter localizers were passively viewed and it is unknown whether the participants engaged in any deep processing of the words and letters. This also could have contributed to the data reported here. In short, our results complement the literature on localization of VWFA and demonstrate that areas of cortex sensitive to processing words and letters could only be found by contrasts with checkerboard displays.
In sum and consistent with our hypotheses, we report that selective attention enhances activity in cortical processors that are sensitive to target information (color), and also enhances activity in some distractor (word) sensitive areas of cortex. Importantly, these results provide partial support for models of attentional control that argue that attention towards distractors may be enhanced when target and distractor dimensions are spatially contiguous or coincident . Therefore, we conclude that selective attention, when task-relevant and task-irrelevant information are spatially contiguous, can function by enhancing the processing associated with the task-relevant target and some of the regions associated with distractor processing. Both positive and negative correlations with performance argue that these top-down processes play a critical role in performance on selective attention tasks.
We would like to thank Nancy Dodge, Holly Tracy, Maritza Alvarado, and Michelle Voss for their help in data collection and to Diane Beck for her helpful comments on previous versions of the manuscript. This research was supported by grants R37 AG25667 and RO1 AG25032 from the National Institute on Aging.