Individuals with Autism Spectrum Disorders (ASD) suffer from multifaceted impairments including deficits in language and communication skills, repetitive behaviors and restricted interests and difficulties with social interactions. Both fMRI and ERP studies suggest abnormal processing of faces in the brain of people with ASDs relative to typically developing controls (
Grice et al., 2001;
Pierce et al., 2001;
Schultz et al., 2000;
Webb & Aggarwal, 1981). Behavioral evidence also suggests that faces are processed by individuals with ASDs using an abnormal strategy (
Klin et al., 2002;
Langdell, 1978;
Rutherford et al., 2007). Yet studies on this issue paint a somewhat confusing picture from which it is difficult to extract the exact nature of the behavioral deficit.
One proposal has been that individuals with ASDs process faces in a more part-based fashion than typically developing controls. The idea that faces are perceived more holistically than other objects is a theme that pervades the literature on typical adult face processing (
Diamond & Carey, 1986;
Farah et al., 1998;
Tanaka & Farah, 1993). There are several definitions of holistic processing in the literature but they commonly refer to the facilitatory effect that the context of a whole upright face has on judgments about its parts and their relations (
Farah, Wilson, Drain & Tanaka, 1998). Holistic processing was not directly measured in ASD until a study by
Joseph and Tanaka (2003). Using a task called the whole-part paradigm (
Tanaka & Farah, 1993), these authors reported that children with ASDs processed faces holistically only when recognition depended on the mouth. That is, controls always recognized face parts better in the context of the whole faces in which the parts had been studied (a
whole-part advantage) and also recognized whole faces better upright than inverted (an
inversion effect). In contrast, children with Autism showed both whole-part and inversion effects when recognition depended on the mouth, whereas these effects were not obtained when recognition depended on the eyes, in which case their performance was very poor. These findings are generally consistent with prior studies that found a reduced, non-existent, or even reversed face inversion effect (
Hobson et al., 1988;
Langdell, 1978;
Tantam et al., 1989) as well as abnormally high attention to the mouth area relative to the eyes in individuals with ASDs (
Klin et al., 2002). A recent study measuring the ability to detect small displacements of the eyes and mouth found that while a subgroup of young adults with ASDs (those with lower verbal IQ) showed poor performance with eyes, no deficit or advantage was observed in ASD in the detection of mouth changes (
Rutherford, Clements & Sekuler, 2007). Another study argued for a non-specific configural deficit in ASD and reported a correlation between performance on configural tasks with non-face objects and performance on face tasks (
Behrmann et al., 2006). However, that study did not assess configural or holistic processing with faces.
At the same time, other studies have failed to replicate some of these effects. One study suggested that once floor effects are controlled for, a normal face inversion effect is obtained in individuals with ASDs (
Teunisse & de Gelder, 2003, see also
Lahaie et al., 2006). Another study (
Lopez et al., 2004) measured the whole-part advantage under two different conditions: without cueing the participants as to the part relevant for recognition (as was done originally by
Joseph and Tanaka, 2003) or cueing them to attend to the relevant part. While cueing had little effect on controls, it affected the performance of adolescents with ASDs: uncued trials led to abnormal performance (no whole advantage) while cued trials led to a normal whole-part advantage. Besides suggesting a role for attentional mechanisms in face processing deficits in ASDs, Lopez and colleagues also failed to replicate Joseph and Tanaka’s finding that mouth processing is normal in ASD when parts are not cued. Whether participants need to distribute their attention to all face parts to solve a given task may be an important factor in interpreting the results of other studies, but that factor alone may not explain all the results.
One problem concerning the partly discrepant results of
Joseph and Tanaka (2003) and
Lopez et al. (2004) resides in the use of the whole-part paradigm. First, it has been suggested that this task does not measure an effect unique to face perception, despite what was originally reported by
Tanaka and Farah (1993). For instance, a whole-part advantage for non-face objects has been observed (
Gauthier & Tarr, 1997,
2002;
Tanaka et al., 1996). Second, in expertise training paradigms where other hallmarks of configural and holistic processing can be obtained for non-face objects following extensive practice at individuating objects, the whole-part advantage is comparable in novice and expert participants even when other measures of holistic processing increase with perceptual expertise (
Gauthier & Tarr, 1997,
2002;
Gauthier et al., 1998). Moreover,
Leder & Carbon (2005) have shown that the whole-part advantage depends on participants initially studying whole faces: when parts are studied instead, part recognition is superior to whole recognition, consistent with the general predictions of the encoding specificity principle (
Tulving & Thomson, 1973). As a whole, this suggests that the whole-part advantage may be at best a problematic index of holistic processing and may greatly depend on domain-general principles of memory.
In contrast, the composite paradigm is a different task that offers a better measure of the holistic processing strategy that is specific to faces (
Young et al., 1987). In the matching version of the composite paradigm, face halves are combined to create new composite faces, and participants are asked to selectively attend to only the top or bottom half of a study face, which they are asked to match to the equivalent part in a test composite. Specifically, in the design used in this work,
congruent trials refer to trials where the relevant (attended) and irrelevant parts are both the same, or both different, between study and test. In contrast, on
incongruent trials, the relationship between the study and test irrelevant is opposite to the relationship for the relevant part. For instance, if the relevant parts are identical, then the irrelevant parts are different from each other (see ). In this task, holistic processing is characterized as a failure of selective attention measured by a
congruency effect: performance that is better on congruent than incongruent trials reveals that participants have difficulty ignoring the irrelevant part (
Cheung et al., in press;
Farah et al., 1998;
Richler et al., 2008;
in press-b). Typically developing adults process upright faces holistically according to this measure (
Boutet et al., 2002;
Cheung et al., in press;
Gauthier et al., 2003;
Goffaux & Rossion, 2006;
Hole, 1994;
Richler et al., 2008;
in press-b;
Young et al., 1987). Holistic processing is reduced when the two parts of the faces are inverted (
Hole, 1994) or misaligned (
Cheung et al., in press;
Richler et al., 2008;
Young et al., 1987). This
alignment effect suggests that when the meaningful configuration of the face is disrupted, so is holistic processing (see also
Goffaux & Rossion, 2006;
Hole, 1994;
Hole, George & Dunsmore, 1999;
Le Grand, Mondloch, Maurer & Brent, 2004; Michel, Rossion, Han, Chung, & Caldara, 2006;
Robbins & McKone, 2007;
Young, Hellawell & Hay, 1987). Note that the alignment effect is a relatively crude measure of the sensitivity of the perception of upright faces to configural information. In other tasks, configural processing translates into exquisite sensitivity to much smaller disruptions of the metric relations between parts (e.g.,
Le Grand, Mondloch, Maurer & Brent, 2004;
Tanaka & Sengco, 1997).
In summary, the composite paradigm allows the measurement of failures of selective attention to parts that result from a holistic processing strategy, and the specificity of this holistic strategy to the normal face configuration. In a typical adult population, both phenomena (holistic processing and its specificity to the normal face configuration) are more important for faces than for other objects (
Farah et al., 1998;
Gauthier et al., 2003;
Richler et al., in press-a) and both increase with expertise (
Gauthier et al., 2003). In addition, holistic processing as measured in this task correlates with activity in the FFA (
Gauthier & Tarr, 2002), an area that is hypoactive during face perception in ASD (
Schultz et al., 2000).
Only one study to date has tested individuals with ASDs using a composite task (
Teunisse & de Gelder, 2003). Adolescents with a clinical diagnostic of Autism (mean age 19.5) were compared to typically developing children (age 9 and 10) and to typical undergraduate controls. In addition to differing in chronological age, the groups were not compared on IQ, although only high-functioning individuals with Autism were included. Participants were presented with a three quarter front view target composite face for 1 s (for undergraduate subjects) or 3 s (for the two younger groups), and after a 3 s delay, two probes were shown. Both probes used an identical (and novel) bottom half, but only one probe had the same top half as the studied face. Participants made speeded matching judgments on the top half. In different blocks, the probes were shown upright/aligned, upright/misaligned, inverted/aligned and inverted/misaligned. The undergraduate controls showed the expected pattern of results: better performance for misaligned than aligned trials, but only in the upright orientation. In contrast, the Autism group showed no effect of alignment on either upright or inverted trials. However, typical children also failed to show the expected adult pattern.
On the basis of these results, Teunisse and de Gelder concluded that adolescents with Autism are less susceptible to contextual information in a task where facial features need to be matched. Several aspects of this study limit its interpretation and the current study was designed to address them. First, there was no control group matched in either chronological age or IQ to the participants with Autism. Second, the study only assessed judgments for the top half of faces, so it is not possible to assess differences between mouths and eyes. Third, the inspection time for the adolescents with Autism was three times longer than for the adult group, making any comparison difficult. Fourth, no direct group comparison was performed, to ascertain whether the individuals with Autism really performed differently from the two control groups. Fifth, Autism diagnoses were not confirmed with gold standard research procedures, such as the Autism Diagnostic Interview (ADI,
Lord et al., 1994) or the Autism Diagnostic Observation Schedule (
Lord et al., 1999). Sixth, the irrelevant part in both probe faces was always different from that in the target face. Although this is similar to the original design used by
Young et al. (1987), other recent studies have introduced a new version of the composite task (
Boutet et al., 2002;
Farah et al., 1998;
Gauthier et al., 2003;
Richler et al., in press-a). This variant has been called the “complete composite paradigm” because it includes the conditions where the irrelevant parts of the study and test faces are identical, in addition to the standard conditions where these irrelevant parts differ from each other (
Cheung et al., in press; Richler et al., submitted;
Richler et al., in press-a). It has been argued that analysis of sensitivity in the complete composite design can measure holistic effects without a confounding contribution of the considerable response biases that are often observed in this task (
Cheung et al., in press;
Richler et al., in press-a;
Wenger & Ingvalson, 2002).
To address all of these limitations and revisit the question of whether people with ASD process faces less holistically, we used the complete composite paradigm in adolescents with ASDs and age- and IQ-matched controls.