Search tips
Search criteria 


Logo of nihpaAbout Author manuscriptsSubmit a manuscriptHHS Public Access; Author Manuscript; Accepted for publication in peer reviewed journal;
Biol Psychiatry. Author manuscript; available in PMC 2012 May 15.
Published in final edited form as:
PMCID: PMC3105006




Various malfunctions involving working memory, semantics, prediction error, and dopamine neuromodulation have been hypothesized to cause disorganized speech and delusions in schizophrenia. Computational models may provide insights into why some mechanisms are unlikely, suggest alternative mechanisms, and tie together explanations of seemingly disparate symptoms and experimental findings.


Eight corresponding illness mechanisms were simulated in DISCERN, an artificial neural network model of narrative understanding and recall. For this study, DISCERN learned sets of “autobiographical” and “impersonal” crime stories with associated emotion-coding. In addition, 20 healthy controls and 37 patients with schizophrenia or schizoaffective disorder matched for age, gender and parental education were studied using a delayed story-recall task. A goodness-of-fit analysis was performed to determine the mechanism best reproducing narrative breakdown profiles generated by healthy controls and patients with schizophrenia. Evidence of delusion-like narratives was sought in simulations best matching the narrative breakdown profile of patients.


All mechanisms were equivalent in matching the narrative breakdown profile of healthy controls. However, exaggerated prediction-error signaling during consolidation of episodic memories, termed hyperlearning, was statistically superior to other mechanisms in matching the narrative breakdown profile of patients. These simulations also systematically confused “autobiographical” agents with “impersonal” crime story agents to model fixed, self-referential delusions.


Findings suggest that exaggerated prediction-error signaling in schizophrenia intermingles and corrupts narrative memories when incorporated into long-term storage, thereby disrupting narrative language and producing fixed delusional narratives. If further validated by clinical studies, these computational patients could provide a platform for developing and testing novel treatments.

Keywords: schizophrenia, artificial neural network, derailment, delusions, prediction-error, narrative language, memory

Certain language behaviors are characteristic of schizophrenia. Spoken discourse often fails to express a cohesive message (14). Many patients express fixed delusions as spurious narrative schema repeated over time-intervals ranging from weeks to years (5,6).

Mechanisms causing these behaviors remain uncertain. Language disorganization has been associated with disrupted working memory, semantic processing, attention, and linguistic context (721), while delusions have been associated with aberrant emotion-based reasoning and associative learning, anomalous perceptions, and jumping-to-conclusions (2225). Both syndromes have been associated with disturbed executive functioning (22,26,27), theory-of-mind (22,28,29), and dopamine neuromodulation (30,31), and elevated hippocampal/parahippocampal activation (3234). In this situation, connectionist models employing artificial neural networks can be used to compare the likelihood of alternative mechanisms and tie together explanations of seemingly disparate symptoms and experimental findings.

Connectionist models have been used to simulate some cognitive impairments associated with schizophrenia (3539), but not their characteristic narrative language. Below we describe the generation of stories by an established connectionist model called DISCERN (4042). Its details, many of which are not essential in understanding this study, can be found in the literature and in the Supplement. Key aspects of DISCERN, however, can be summarized as follows: (i) DISCERN learns to recognize words and the sentences and stories incorporating them using interconnected neural network modules dedicated to these different language processing levels; (ii) modules learn by updating their internal connection strengths to minimize prediction-errors while processing sequential language; (iii) after a group of stories is learned, DISCERN can recall any single story when prompted with an initial segment. In this study, several different illness mechanisms were simulated in DISCERN, and the resulting story-recall distortions were compared with those of healthy human subjects and patients with schizophrenia during a delayed story-recall task.



DISCERN is organized as a chain of neural network modules (Figure 1). These modules communicate using neural activation patterns that represent words in semantic memory: similar word activation patterns reflect similar roles in sentences. To process a story input (Figure 1A, left), word representations are presented to the sentence parser one at a time as a sequence of activation patterns. The sentence parser builds a representation of each sentence by sequencing word representations corresponding to agent, predicate, indirect object, modifier, and direct object. At the end of each sentence, the sentence representation is passed on to the story parser. The story parser transforms sequences of sentence representations into script representations. Scripts are standardized, multi-sentence schemas whose “slots” are filled by different sets of words. A story is a sequence of scripts stored in the episodic memory module with coherent slot-fillers. To generate an output story, the process is reversed (Figure 1A, right): The story generator module accesses the episodic memory representation and translates it into a sequence of sentences. The sentence generator then produces the word sequence for each sentence.

Figure 1
A schematic representation of the architecture of DISCERN. (A) Remembering and reproducing a story in DISCERN is achieved by a chain of modules (4042). Tan-colored modules are simple recurrent networks (SRNs, ref. 43). (B) The story generator ...

In this study, DISCERN learned 28 stories, half “autobiographical,” describing a first-person character, a doctor, his relationships with boss, girlfriend, etc., and activities such as going to a wedding and getting a traffic ticket. The other half were “impersonal” crime stories describing Mafia, police characters and their activities. The lexicon was 159 words. Each story was organized as a sequence of 3–7 scripts slotted with a coherent set of words/concepts and an emotion-code ranging from very positive to very negative (++/+/+−/−/−−). Memory recall is thus biased by emotionality analogous to human memory retrieval (44). Such emotionality could heighten simulated derailments and delusional language, as it does in patients (22,45). Autobiographical and crime stories at times incorporated common scripts, thereby providing opportunities for narrative confusion. For instance, an autobiographical story incorporated the following script (underlining indicates slot-fillers):

I was a doctor

I worked in New-York

I liked my job.

I was good doctor

whereas a crime story expressed the same script with different slot-fillers:

Tony was a gangster

Tony worked in Chicago

Tony hated his job

Tony was a bad gangster

Story learning in DISCERN is based on discrepancies between observed and predicted language. Such discrepancies, or prediction-errors, are propagated back from the output to the input neuron layers within each module (Figure 1B); through the backpropagation learning algorithm, they produce gradual, highly targeted changes in network connection strengths (46); this learning process requires thousands of repetitions. Successful learning for this study required between 5,000–30,000 backpropagation learning cycles for each module. Starting from different initial random connection weights, 30 DISCERN exemplars were independently developed in this fashion.

Story-recall by each DISCERN exemplar was prompted by first script of each story as input. DISCERN parsed this script, retrieved a story memory, and produced a story output.

Eight candidate illness mechanisms were applied to each of the 30 DISCERN exemplars after story learning was completed (Figure 1) based on prior studies of speech disorganization, delusions and schizophrenia:

  1. Working memory (WM) disconnection was prompted by neuroimaging studies suggesting cortical disconnection, especially involving WM networks, in schizophrenia (4750). Disconnection was simulated by pruning excitatory and inhibitory WM connections in the story generator if their absolute connection strength fell below a specified threshold (51). An extended version of disconnection also pruned connections between the hidden→output layer of the story generator.
  2. Noise added to WM networks was prompted by reports indicating excessive cortical noise, reduced signal-to-noise ratio, and inefficiency in frontal WM cortical systems in persons with schizophrenia (5254). These conditions were simulated by adding Gaussian noise to story generator WM neuron outputs.
  3. WM network gain reductions were prompted by a connectionist model of hypodopaminergic cortical neuromodulation in patients with schizophrenia expressed as reduced neural response (36), and neuroimaging studies showing reduced activation in WM circuits during task performance in patients with disorganization symptoms (55,56). This alteration was modeled as reduced gain (i.e., slope) of the response curve of neurons in the hidden and recurrent layers of the story generator module (36).
  4. Response bias shifts. Elevated arousal could produce overactivation at neuronal level was simulated as lateral shifts in the response curve of WM layer story generator neurons (57). This manipulation in theory could also simulate a failure to deactivate the superior temporal gyrus when performing a WM task detected in early-phase patients with schizophrenia (58).
  5. Semantic network distortion demonstrated by lexical categorization, priming and fluency tasks have been statistically linked to language disorganization and schizophrenia (1113). These abnormalities were simulated by adding noise to word representations in the semantic memory.
  6. Excessive activation in semantic networks. Increased temporal and prefrontal activity during semantic associations, and increased activation of the cingulate cortex during object naming has been reported in schizophrenia (14,59). These disturbances were simulated by increasing output activation of neurons in the semantic network.
  7. Heightened semantic priming. Studies have suggested heightened spread of activation in semantic networks in patients with schizophrenia based on word association data, especially among patients with language disorganization (10,1517). This disturbance was simulated by blurring semantic network outputs so that words semantically linked to a target word were coactivated.
  8. Exaggerated prediction-error signaling (“hyperlearning”). Elevated brain response to prediction-error during learning has been linked statistically to delusion-formation (25). Moreover, prediction-error coding and other salience-driven aspects of learning appear to be mediated, at least in part, by dopaminergic pathways (6062); elevated dopamine release is associated with schizophrenia (31). Backpropagation learning in DISCERN is driven by prediction-error. Exaggerated prediction-error signaling was consequently represented as amplified backpropagation learning rates, termed hyperlearning, applied for 500 backpropagation learning cycles to the memory encoder after DISCERN was trained. An extended version of hyperlearning also applied this mechanism to the story-generator module (Figure 1A).

Based on evidence of an editor function during human speech production (63), an output sentence filter was incorporated into the story generator that estimates the distortion of a sentence as the average computational distance between each component word of a sentence representation and its nearest lexical template in semantic memory. If the distortion exceeds a certain threshold, the sentence is discarded. The filter reduces word selection errors and disorganized language at the cost of reducing successful recall.

Human story-recall

Story-recall performance data from 20 normal subjects and 37 outpatients with schizophrenia or schizoaffective disorder were compared. All subjects provided written informed consent to participate in the study. DSM-IV diagnoses and symptoms were determined using the Comprehensive Assessment of Symptoms and History (CASH, 64). Patients were prospectively divided into two subgroups: those who definitely demonstrated evidence of fixed delusions expressing a plot-like, narrative scheme, and those where evidence for these delusions was questionable or absent.

The experimental task consisted of three stories presented binaurally on headphones. Two of these stories, which are reproduced below, share references and themes:

“The Gift” (65)

In one seat of the bus a wispy old man sat holding a bunch of fresh flowers. Across the aisle was a young girl whose eyes came back again and again to the man’s flowers. The time came for the man to get off. He thrust the flowers into the girl’s lap. “I can see you love flowers,” he explained, “and I think my wife would like you to have them. I’ll tell her I gave them to you.” The girl accepted the flowers and watched the man get off the bus and walk through the gate of an old cemetery.

“Hitchhiker” (written for this study)

I hitched into town. A wispy old man driving a pick-up truck with his frail wife gave me a ride. I sat in the back and watched the tires kick up dust. We stopped and waited for a traffic light. I turned around and peered into the rear window. I hadn’t eaten all day and my eyes came back again and again to a bag of Fritos on the dashboard. The man got out of the truck and walked around to the back. “My wife noticed that you kept looking at the Fritos,” he explained, “and she wanted you to have them.”

The third was the “Anna Thompson” story taken from the Logical Memory subtest of the Wechsler Memory Scale-III (66). Immediate recall, 45-minute recall, and 7-day recall were tape-recorded and transcribed for analysis blind to group and subject identification. 7-day recall data were used for this study.

Comparing human and DISCERN story-recall

Four story-recall variables could be scored comparably for both human and DISCERN story recall while demonstrating sufficient non-zero base-rates, and thus were used for assessing goodness-of-fit of DISCERN narrative breakdowns relative to human story-recall:

Recall success

For humans, recall success was the total number of story propositions successfully paraphrased (scored as 1) or partially paraphrased (scored as 0.5) divided by the number of propositions in the target stories (=36). Sentences in the stories were mapped into propositions, with full and partial recall for each proposition defined a priori (Table 1). For DISCERN, the total number of propositions fully reproduced was tallied and divided by the total number of propositions in the target stories (=550). No partial scores were given because DISCERN propositions were less complex than human propositions.

Table 1
Illustration of method for assessing story recall for first two sentences of, “The Gift”

Agent-slotting errors were word substitution errors involving story agents or characters. An example by a patient (recalling “The Gift”):

The girl gave the old man the flowers as a gift

reversing subject and indirect object. An example from DISCERN is

The cop arrested me for speeding

where the direct object, “me” is substituted for the mafia character, “Vince.” Agent-slotting errors that are systematic across contexts provided a model of fixed-delusions.

Lexical misfires are word or phrase substitutions not involving agents possessing sentence case-roles paralleling target words or phrases that significantly change meaning. For human recall, an example of this type of recall error is:

“wispy old man” → “whispering man” (Table 1)

For DISCERN an example of a lexical misfire was:

Kate was interested in guns (substituting for books, in DISCERN stories, both terms referred to objects of interest)

Derailed clauses

For human recall, a derailed clause was a text insertion comprised of an entire clause, not just a phrase or word, expressing extraneous meaning not inferable from the target story. An example generated by a control speaker when recalling “The Gift” (underlined scored as derailed clause) was:

A girl was sitting on the bus near him and he noticed her looking at his eyes.

An example while paraphrasing the same story produced by a patient is:

I remember the generosity of the flowers. I remember that he gave her flowers and she gave, she put the flowers on the tombstone. And I remember there was a truck involved, I believe it was a flatbed.

Here, three clauses were generated clearly departing from the meaning of the target story, with the last two borrowing from the “Hitchhiker” story.

For DISCERN, derailed clauses were those produced subsequent to a jump from the target story to another story in the corpus. For instance, DISCERN, after hyperlearning, initiated a story recounting a description of Joe, the boss of the first-person character, but then switched into another, related story describing meeting Mary, Joe’s fiancée, at a bar:

Joe was my boss.

I hated Joe.

Joe was in his 30s.

Joe had a beard.

Joe liked baseball.

Mary was the fiancee of Joe.

I liked talking to Mary.

I gave a kiss good-bye to Mary.

Interrater reliabilities for scoring these human performance variables were in the acceptable range (RI’s between .75–.98). DISCERN variables were scored algorithmically.

When comparing human and DISCERN story recall performance, agent-slotting errors, lexical misfires and derailed clauses were re-calibrated as penetrance scores, where totals for each type were divided by recall scores (prior to correction by total number of propositions in target texts). This strategy adjusted for: (i) much greater number of DISCERN stories compared to human stories, amplifying opportunities for error for the former; (ii) effects of editing/filtering language outputs, which reduced errors and successful recall in parallel.

Goodness-of-fit (GOF) analysis

To assess GOF of alternative illness mechanisms, we used the following strategy: First, a 4-variable story-recall profile was averaged, first for human controls, then for patients. For each of the 30 DISCERN exemplars and 8 illness mechanisms, parameters were adjusted to best reproduce these two story-recall group-profiles. This yielded 30 GOF measures for each mechanism and group-profile. These were then used as the dependant variable in subsequent analyses of variance to identify which mechanisms best matched group-profiles. Each illness mechanism was studied initially by adjusting two parameters: the mechanism parameter itself (level of noise, pruning, etc) and the filter threshold. GOF utilized a mean square deviation metric (67). A mixed model was used with GOF as the dependent variable, the 30 DISCERN exemplars as independent subjects, and type of mechanism (8 levels) and human group (healthy controls vs schizophrenia patients) as factors. Pairwise comparisons were performed to interpret differences between mean GOF of the 8 illness mechanisms relative to the story-recall profile of healthy controls and patients. The two mechanisms best-fitting patient data (hyperlearning and WM disconnection) in two dimensions were extended by adding an additional mechanism parameter to each (Figure 1) followed by a second analysis of variance.

In order to model fixed delusions, DISCERN’s agent-slotting errors need to be systematic, i.e. the same confusion of an autobiographical and a crime-story agent needs to recur in different output stories. Systematicity of these errors was assessed using a randomization test.

Detailed descriptions of DISCERN simulations, GOF metric, and randomization test are provided in the Supplement.


Table 2 profiles the two subject groups; WAIS vocabulary scores were reduced for patients compared to controls. Table 3 shows narrative-recall breakdown profiles for the two human subject groups. Patients were more impaired than controls for recall success, derailment penetrance, and agent-slotting error penetrance, but not for lexical misfire penetrance. Pooling data across subjects groups and repeating analyses within groups, there was no significant correlation between any of these variables and age, parental education level, or WAIS-scaled vocabulary score.

Table 2
Comparison of two groups of subjects for individuals completing Day 7 delayed recall of stories
Table 3
Comparison of two subject groups on seven-day story recall

Illness mechanisms were initially simulated in terms of a single, corresponding mechanistic parameter combined with a variable output filter for each of the 30 DISCERN exemplars. These exemplars, by parameter adjustment, were optimized to match profiles of narrative breakdown distortions of the healthy control group and the patient group, respectively. With optimized GOF as the dependent variable, a mixed model revealed a significant group × mechanism interaction (F(7, 203)=36.7, p<0.0001). No mechanism had a significant advantage matching the narrative breakdown profile of controls (F(7, 203)=.91, p=0.07; Figure 2A). In contrast, the eight mechanisms differed significantly in how well they matched the patients’ narrative breakdown profile (F(7, 203)=50.5, p<0.0001): WM disconnection and hyperlearning were robustly superior to the other six models in terms of GOF to patient performance (p<0.0005 on paired t-tests) but were not different from each other (Figure 2B, Table 4).

Figure 2
Scatterplots of 30 independently generated simulations showing each of the eight mechanisms mapped using a mean square deviation metric of GOF to the language profile of controls (A) and patients (B). GOF was log converted to normalize distributions; ...
Table 4
Pairwise comparisons of optimized GOF for 2D hyperlearning and WM disconnection relative to the other six 2D models based on mixed model analysis13

These two best-fit mechanisms were further studied by adding another parameter to each: hyperlearning was extended to the story generator module (Figure 1A), and disconnection was extended to hidden→output layer projections in the story generator module (Figure 1B). GOF to the patient narrative breakdown profile improved overall (F(1,29)=37.3, p<0:0001; Figure 2C) with a significant mechanism × parameter interaction (F(1,29)=10.3, p=0.003): 3-parameter hyperlearning producing a significantly better fit to the patient narrative breakdown profile than three-parameter disconnection (t(29)=4.2, p=0.0002).

A content analysis was undertaken examining outputs of best-fit 3-parameter hyperlearning simulations to the patient group. For derailments, jumps from one story to another by these simulations occurred in a highly nonrandom fashion. The same emotional valence was retained from the pre- to the post-derailment story in 90.1% of instances averaged across all 30 best-fit simulations. Moreover, autobiographical stories tended to derail to other autobiographical stories, and impersonal mafia/police stories tended to derail to other mafia/police stories, with only 15.1% of derailments crossing the personal/impersonal context.

Regarding delusion-like narratives, best-fit three-parameter hyperlearning exemplars produced cross-context agent-slotting errors in a highly organized fashion: the same two characters, one from an autobiographical story and the other from a crime story, were interchanged on average 2.4 occasions per exemplar. For example, one exemplar generated the following when recalling Story 2:

Vito (substituting for Joe) was in his 30’s.

Joe was a doctor.

Joe worked in New York

Joe was my boss.

I hated Joe.

Vito was the boss of the mafia gang in the crime stories, while Joe was the boss of the first-person character in the autobiographical stories. Later, for Story 10, this same simulation produced:

Vito was a famous gangster.

Vito was the boss of Tony

Tony hated Vito.

Tony feared Joe (substituting for Vito).

This confusion occurred again in recalling Story 27:

Vince went to Starbucks.

Vince sat at a table.

Vince liked Vito.

Vince feared Joe (substituting for Vito).

In DISCERN crime stories, Vince and Tony were gangsters working under Vito. A tendency to confuse Joe and Vito in human terms could lead to the emergence of a delusional belief, where a person comes to believe that his boss in the hospital is really a mafia boss. Non-randomness of recurrent cross-context agent-slotting conflations assessed via a randomization test was pronounced (p<0.00001).

Table S1 in the Supplement shows comparable baseline characteristics of patients with and without fixed narrative delusions (FND). To test an agent-confusion model of these delusions, the number of agent-slotting errors on story-recall was compared for patients with and without these delusions and healthy controls As predicted, FND+ patients made significantly more agent-slotting errors compared to both other subject groups (F(2,54)=4.5, p=.015, Duncan post hoc comparisons, α=0.05, effect size contrasting FND+ patients versus controls = 0.69; effect size contrasting FND+ patients versus FND− patients = 0.79). The contrast between FND+ patients and healthy controls utilizing an analysis of covariance controlling for WAIS vocabulary score remained statistically significant (F(1,44)=7.0, p=0.011).

Correlations between global thought disorder ratings, medication level, number of hospitalizations and the four story-recall variables (Table 3) computed for patients were non-significant.


Whereas all eight illness mechanisms were equivalent in matching the healthy control narrative breakdown profile, hyperlearning was significantly better than the others in matching the narrative breakdown profile of patients. The differential advantage of hyperlearning in matching patient story-recall data suggests that exaggerated prediction-error signaling during memory consolidation captures pathophysiology underlying schizophrenia specifically rather than nonspecific story-recall distortions demonstrated by human subjects overall.

A majority of three-parameter best-fit hyperlearning simulations also recurrently confused specific agents in personal stories (including the self-representation) with specific agents in crime stories (and vice-versa) in a highly nonrandom fashion. Noteworthy was the high frequency of agent-slotting exchanges between the hospital boss, Joe, and the mafia boss, Vito, and parallel confusions between the “I” self-reference and underling mafia members suggesting generalization of boss/underling relationships. Insofar as story scripts provide templates for assigning intentions to agents (68), a consequence of recurrent agent-slotting confusions could be assignment of intentions and roles to autobiographical characters (possibly including the self) that borrow from impersonal stories derived from culture or the media. Confusion between agent representations in autobiographical stories and those in culturally determined narratives could account for the bizarreness of fixed, self-referential delusions, e.g., a patient insisting that her father-in-law is Saddam Hussein or that she herself is the Virgin Mary. This hypothesis is supported by data showing that the number of agent-slotting errors was greater in patients reporting delusions with plot-like narrative organization compared to patients without these delusions. These findings suggest that fixed delusions are story memories contaminated by misappropriated agents.

The hyperlearning model extends prediction-error abnormalities in schizophrenia during associative learning (25,69,70) to learning narrative sequences stored as episodic memories. The model is consistent with a mechanistic role of excessive dopaminergic release in schizophrenia (31,71,72) insofar as dopamine release appears to enhance memory consolidation and prediction-error signaling (61,73,74). In humans, hyperlearning would likely require greater activation in hippocampal structures central to consolidating episodic memories (7577). This formulation accounts for elevated baseline activation in hippocampal regions demonstrated in patients with schizophrenia (33,34), which appears to be reduced by antipsychotic medication (78); these medications may therefore achieve antipsychotic effects by curtailing hyperlearning, possibly by dopamine antagonistic effects (79).

Insofar as stories provide templates for understanding intentions of others (68), hyperlearning provides a model for disrupted theory-of-mind detected in prior studies of schizophrenia (22,28,29).

Our study has multiple limitations:

First, some candidate disturbances suggested by prior studies of schizophrenia, such as dysfunctional executive control (22,26,27) and disrupted sentence-level linguistic context (20,21), were not modeled. Moreover, some semantic disturbances associated with speech disorganization (11,12) could arise from semantic network disconnection rather than noise. These disturbances will be addressed in a future iteration of DISCERN.

Second, DISCERN learns stories by backpropagating prediction-error signals (40,41,46), which are exaggerated in hyperlearning. Backpropagation learning, which require thousands of repetitions, is unlikely to be replicated precisely in the human brain. However, long-term memory consolidation in humans also appears to be gradual and incremental, occurring over days to weeks (77). During this process, memories are replayed repeatedly (80), as they are in backpropagation (81). Therefore, hyperlearning in DISCERN may have a parallel in human narrative memory consolidation. Relatively few cycles of amplified backpropagation learning (500 epochs) were needed to match the schizophrenia narrative-breakdown profile. This finding suggests that limited bursts of hyperlearning, perhaps lasting only a few days in human terms, could produce enduring schizophrenic psychosis.

Third, thought disorder scores for patients were not significantly correlated with dependent variables developed for this study. This could be due to the fact that these were stable outpatients demonstrating minimal evidence of impairment: mean SAPS global thought disorder score fell between “questionable” and “mild” (Table S1 in the Supplement). Including a more symptomatic, actively psychotic group may reveal correlations between story-recall distortions and thought disorder scores.

Fourth, DISCERN should be further validated by assessing its capacity to replicate patterns of story-recall failure within a healthy subject group.

These limitations notwithstanding, it is noteworthy that this is the first computational study of narrative disorganization and fixed delusional narratives characteristic of schizophrenia. Hyperlearning, hypothesized as a unitary mechanism producing both disturbances, is potentially detectable via functional neuroimaging as accelerated shifts from hippocampal to cortical representations during memory consolidation (82). If this prediction is confirmed in patients with early-phase schizophrenia, computational patients could be used to test novel somatic and psychological treatments.

Supplementary Material



This work was supported by National Institute of Mental Health Grant R01MH066228, National Institute of Health Grant R21-DC009446, National Science Foundation grant EIA-0303609, a Dana Foundation grant, a National Alliance for Research on Schizophrenia and Depression Independent Investigator Award, Peterson 50th Anniversary Research Partner, and the Department of Mental Health and Addiction Services of the State of Connecticut through its support of the Abraham Ribicoff Research Center at the Connecticut Mental Health Center.


Financial Conflict

All authors report no biomedical financial interests or potential conflicts of interest.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.


1. Bleuler E. Dementia praecox or the group of schizophrenias. New York: International Universities Press; 1911/1950.
2. Andreasen NC. Thought, language, and communication disorders. II. Diagnostic significance. Arch Gen Psychiatry. 1979;26:1325–1330. [PubMed]
3. Covington MA, He C, Brown C, Naci L, McClain JT, Fjordbak BS, et al. Schizophrenia and the structure of language: The linguist’s view. Schizophr Res. 2005;77:85–98. [PubMed]
4. Docherty NM, DeRosa M, Andreasen NC. Communication disturbances in schizophrenia and mania. Arch Gen Psychiatry. 1996;53:358–364. [PubMed]
5. Vinogradov S, King RJ, Huberman BA. An associationist model of the paranoid process: Application of phase transitions in spreading activation networks. Psychiatry. 1992;55:79–94. [PubMed]
6. Harrow M, Jobe TH. How frequent is chronic multiyear delusional activity and recovery in schizophrenia: a 20-year multi-follow-up. Schizophr Bull. 2010;36(1):192–204. [PMC free article] [PubMed]
7. Berenbaum H, Kerns JG, Vernon LL, Gomez JJ. Cognitive correlates of schizophrenia signs and symptoms: I. Verbal communication disturbances. Psychiatry Res. 2008;59(1–2):147–56. [PMC free article] [PubMed]
8. Kerns JG. Verbal communication impairments and cognitive control components in people with schizophrenia. J Abn Psychology. 2007;116(2):279–89. [PubMed]
9. Kravariti E, Dixon T, Frith C, Murray R, McGuire P. Association of symptoms and executive function in schizophrenia and bipolar disorder. Schizophr Res. 2005;74(2–3):221–31. [PubMed]
10. Kreher DA, Holcomb PJ, Goff D, Kuperberg GR. Neural evidence for faster and further automatic spreading activation in schizophrenic thought disorder. Schizophr Bull. 2008;34(3):473–82. [PMC free article] [PubMed]
11. Goldberg TE, Aloia MS, Gourovitch ML, Missar D, Pickar D, Weinberger DR. Cognitive substrates of thought disorder, I: the semantic system. Am J Psychiatry. 1998;155(12):1671–6. [PubMed]
12. Aloia MS, Gourovitch ML, Missar D, Pickar D, Weinberger DR, Goldberg TE. Cognitive substrates of thought disorder, II: specifying a candidate cognitive mechanism. Am J Psychiatry. 1998;55(12):1677–84. [PubMed]
13. Tallent KA, Weinberger DR, Goldberg TE. Associating semantic space abnormalities with formal thought disorder in schizophrenia: use of triadic comparisons. J Clin Exp Neuropsychol. 2001;23(3):285–96. [PubMed]
14. Kuperberg GR, Deckersbach T, Holt DJ, Goff D, West WC. Increased temporal and prefrontal activity in response to semantic associations in schizophrenia. Arch Gen Psychiatry. 2007;64(2):138–51. [PubMed]
15. Spitzer M. A cognitive neuroscience view of schizophrenic thought disorder. Schizophr Bull. 1997;23(1):29–50. [PubMed]
16. Wentura D, Moritz S, Frings C. Further evidence for “hyper-priming” in thought-disordered schizophrenic patients using repeated masked category priming. Schizophr Res. 2008;102(1–3):69–75. [PubMed]
17. Moritz S, Woodward TS, Kuppers D, Lausen A, Schickel M. Increased automatic spreading of activation in thought-disordered schizophrenic patients. Schizophr Res. 2003;59(2–3):181–6. [PubMed]
18. Moskowitz J, Davidson M, Harvey PD. Effect of concurrent distraction on communication failures in schizophrenic patients. II. Medication status correlations. Schizophr Res. 1991;5(2):153–9. [PubMed]
19. Docherty NM, Gordinier SW. Immediate memory, attention and communication disturbances in schizophrenia patients and their relatives. Psychological Med. 1999;29(1):189–97. [PubMed]
20. Kuperberg GR, McGuire PK, David AS. Reduced sensitivity to linguistic context in schizophrenic thought disorder: evidence from on-line monitoring for words in linguistically anomalous sentences. J Abn Psychology. 1998;107(3):423–34. [PubMed]
21. Kuperberg GR, Kreher DA, Goff D, McGuire PK, David AS. Building up linguistic context in schizophrenia: evidence from self-paced reading. Neuropsychology. 2006;20(4):442–52. [PubMed]
22. Bentall RP, Rowse G, Shryane N, Kinderman P, Howard R, Blackwood N, Moore R. The cognitive and affective structure of paranoid delusions: a transdiagnostic investigation of patients with schizophrenia spectrum disorders and depression. Arch Gen Psychiatry. 2009;66(3):236–47. [PubMed]
23. Gilleen J, David AS. The cognitive neuropsychiatry of delusions: from psychopathology to neuropsychology and back again. Psychological Med. 2005;35(1):5–12. [PubMed]
24. Bell V, Halligan PW, Ellis HD. Explaining delusions: a cognitive perspective. Tr Cognitive Sci. 2006;10(5):219–26. [PubMed]
25. Corlett PR, Honey GD, Aitken MR, Dickinson A, Shanks DR, Absalom AR, et al. Frontal responses during learning predict vulnerability to the psychotogenic effects of ketamine: linking cognition, brain activity, and psychosis. Arch Gen Psychiatry. 2006;63(6):611–21. [PubMed]
26. Stirling J, Hellewell J, Blakey A, Deakin W. Thought disorder in schizophrenia is associated with both executive dysfunction and circumscribed impairments in semantic function. Psychological Med. 2006;36(4):475–84. [PubMed]
27. Barrera A, McKenna PJ, Berrios GE. Formal thought disorder in schizophrenia: an executive or a semantic deficit? Psychological Med. 2005;35(1):121–32. [PubMed]
28. Langdon R, Coltheart M, Ward PB, Catts SV. Disturbed communication in schizophrenia: the role of poor pragmatics and poor mind-reading. Psychological Med. 2002;32(7):1273–84. [PubMed]
29. Frith CD, Corcoran R. Exploring ‘theory of mind’ in people with schizophrenia. Psychological Med. 1996;26(3):521–30. [PubMed]
30. Levy DL, Smith M, Robinson D, Jody D, Lerner G, Alvir J, et al. Methylphenidate increases thought disorder in recent onset schizophrenics, but not in normal controls. Biol Psychiatry. 1993;34(8):507–14. [PubMed]
31. van OsJ, Kapur S. Schizophrenia. Lancet. 2009;374(9690):635–45. [PubMed]
32. McGuire PK, Quested DJ, Spence SA, Murray RM, Frith CD, Liddle PF. Pathophysiology of ‘positive’ thought disorder in schizophrenia. Br J Psychiatry. 1998;173(2):231–5. [PubMed]
33. Heckers S, Rauch SL, Goff D, Savage CR, Schachter DL, Fischman AJ, Alpert NM. Impaired recruitment of the hippocampus during conscious recollection in schizophrenia. Nature Neurosci. 1998;1:318–323. [PubMed]
34. Schobel SA, Lewandowski NM, Corcoran CM, Moore H, Brown T, Malaspina D, Small SA. Differential targeting of the CA1 subfield of the hippocampal formation by schizophrenia and related psychotic disorders. Arch Gen Psychiatry. 2009;66(9):938–46. [PMC free article] [PubMed]
35. Braver TS, Barch DM, Cohen JD. Cognition and control in schizophrenia: A computational model of dopamine and prefrontal function. Biol Psychiatry. 1999;46(3):312–28. [PubMed]
36. Cohen JD, Servan-Schreiber D. Context, cortex, and dopamine: a connectionist approach to behavior and biology in schizophrenia. Psychol Rev. 1992;99(1):45–77. [PubMed]
37. Monchi O, Taylor TJ, Dagher A. A neural model of working memory processes in normal subjects, Parkinson’s disease and schizophrenia for fMRI design and predictions. Neural Networks. 2000;13:953–73. [PubMed]
38. Siekmeier PJ, Hoffman RE. Enhanced semantic priming in schizophrenia: a computer model based on excessive pruning of local connections in association cortex. Br J Psychiatry. 2002;180:345–50. [PubMed]
39. Carter JR, Neufeld RW. Cognitive processing of facial affect: Connectionist model of deviations in schizophrenia. J Abn Psychology. 2007;116(2):290–305. [PubMed]
40. Miikkulainen R. Subsymbolic natural language processing. Cambridge, MA: MIT Press; 1993.
41. Miikkulainen R, Dyer MG. Natural language processing with modular PDP networks and distributed lexicon. Cognitive Sci. 1991;15(3):343–399.
42. Grasemann U, Miikkulainen R, Hoffman R. Proceedings of Cognitive Science – 2007 (Nashville, TN) Hillsdale, NJ: Erlbaum; 2007. A Subsymbolic Model of Language Pathology in Schizophrenia.
43. Elman JL. Finding structure in time. Cog Sci. 1990;14:179–211.
44. Bower GH. Mood and memory. Am Psychologist. 1981;36(2):129–48. [PubMed]
45. Docherty NM, Hall MJ, Gordinier SW. Affective reactivity of speech in schizophrenic patients and their nonschizophrenic relatives. J Abn Psychology. 1998;107:461–467. [PubMed]
46. Rumelhart D, Hinton G, Williams R. Learning internal representations by error propagation. In: Rumelhart D, McClelland J, editors. Parallel Distributed Processing: Explorations in the Microstructure of Cognition. Vol. 1. Foundations, Cambridge, MA: MIT Press; 1986. pp. 318–362.
47. Meyer-Lindenberg A, Poline JB, Kohn PD, Holt JL, Egan MF, Weinberger DR, Berman KF. Evidence for abnormal cortical functional connectivity during working memory in schizophrenia. Am J Psychiatry. 2001;158 (11):1809–17. [PubMed]
48. Karlsgodt KH, van Erp TG, Poldrack RA, Bearden CE, Nuechterlein KH, Cannon TD. Diffusion tensor imaging of the superior longitudinal fasciculus and working memory in recent-onset schizophrenia. Biol Psychiatry. 2008;63(5):512–518. [PubMed]
49. Kim JJ, Kwon JS, Park HJ, Youn T, Kang DH, Kim MS, et al. Functional disconnection between the prefrontal and parietal cortices during working memory processing in schizophrenia: a [15(O)]H2O PET study. Am J Psychiatry. 2003;160 (5):919–23. [PubMed]
50. Yoon JH, Minzenberg MJ, Ursu S, Ryan Walter BS, Wendelken C, Ragland JD, Carter CS. Association of dorsolateral prefrontal cortex dysfunction with disrupted coordinated brain activity in schizophrenia: relationship with impaired cognition, behavioral disorganization, and global function. Am J Psychiatry. 2008;165(8):1006–14. [PMC free article] [PubMed]
51. Hoffman RE, McGlashan TH. Synaptic elimination, neurodevelopment, and the mechanism of hallucinated “voices” in schizophrenia. Am J Psychiatry. 1997;154(12):1683–9. [PubMed]
52. Potkin SG, Turner JA, Brown GG, McCarthy G, Greve DN, Glover GH, et al. Working memory and DLPFC inefficiency in schizophrenia: the FBIRN study. Schizophrenia Bull. 2009;35(1):19–31. [PMC free article] [PubMed]
53. Tan HY, Callicott JH, Weinberger DR. Dysfunctional and compensatory prefrontal cortical systems, genes and the pathogenesis of schizophrenia. Cereb Cortex. 2007;17 (Suppl 1):i171–81. [PubMed]
54. Winterer G, Weinberger DR. Genes, dopamine and cortical signal-to-noise ratio in schizophrenia. Tr Neurosci. 2004;27(11):683–90. [PubMed]
55. Perlstein WM, Carter CS, Noll DC, Cohen JD. Relation of prefrontal cortex dysfunction to working memory and symptoms in schizophrenia. Am J Psychiatry. 2001;158(7):1105–13. [PubMed]
56. MacDonald AW, 3rd, Carter CS, Kerns JG, Ursu S, Barch DM, Holmes AJ, et al. Specificity of prefrontal dysfunction and context processing deficits to schizophrenia in never-medicated patients with first-episode psychosis. Am J Psychiatry. 2005;162(3):475–84. [PubMed]
57. Grossberg S, Pepe J. Schizophrenia: possible dependence of associational span, bowing and primary versus recency on spiking threshold. Behav Sci. 1970;15(4):359–362. [PubMed]
58. Crossley NA, Mechelli A, Fusar-Poli P, Broome MR, Matthiasson P, Johns LC, et al. Superior temporal lobe dysfunction and frontotemporal dysconnectivity in subjects at risk of psychosis and in first-episode psychosis. Hum Brain Mapp. 2009;30(12):4129–37. [PubMed]
59. Assaf M, Rivkin PR, Kuzu CH, Calhoun VD, Kraut MA, Groth KM, Yassa MA, et al. Abnormal object recall and anterior cingulate overactivation correlate with formal thought disorder in schizophrenia. Biol Psychiatry. 2006;59(5):452–9. [PubMed]
60. Gibbs AA, Naudts KH, Spencer EP, David AS. The role of dopamine in attentional and memory biases for emotional information. Am J Psychiatry. 2007;164(10):1603–9. [PubMed]
61. Iordanova MD. Dopaminergic modulation of appetitive and aversive predictive learning. Rev Neurosci. 2009;20(5–6):383–404. [PubMed]
62. Niv Y, Schoenbaum G. Dialogues on prediction errors. Tr Cognitive Sci. 2008;12(7):265–72. [PubMed]
63. Fox Tree JE. Coordinating spontaneous talk. In: Wheeldon L, editor. Aspects of language production. New York, NY: Psychology Press; 2000. pp. 375–406.
64. Andreasen NC. Comprehensive assessment of symptoms and history. Iowa City: University of Iowa; 1987.
65. Cerf B. In: Chicken Soup for the Soul. Carfield J, Hansen MV, editors. Health Communications, Inc; Deerfield Beach FL: 1993.
66. Wechsler D. Manual for the Wechsler memory scale – revised. San Antonio: The Psychological Corporation; 1987.
67. Marchiori D, Warglien M. Predicting human interactive learning by regret-driven neural networks. Science. 2008;319(5866):1111–3. [PubMed]
68. Bower GH, Morrow DG. Mental models in narrative comprehension. Science. 1990;247(4938):44–8. [PubMed]
69. Corlett PR, Murray GK, Honey GD, Aitken MR, Shanks DR, Robbins TW, et al. Disrupted prediction-error signal in psychosis: evidence for an associative account of delusions. Brain. 2007;130(Pt 9):2387–400. [PMC free article] [PubMed]
70. Murray GK, Corlett PR, Clark L, Pessiglione M, Blackwell AD, Honey G, Jones PB, Bullmore ET, Robbins TW, Fletcher PC. Substantia nigra/ventral tegmental reward prediction error disruption in psychosis. Mol Psychiatry. 2008;13(3):239, 267–76. [PMC free article] [PubMed]
71. Abi-Dargham A, van de Giessen E, Slifstein M, Kegeles LS, Laruelle M. Baseline and amphetamine-stimulated dopamine activity are related in drug-naive schizophrenic subjects. Biol Psychiatry. 2009;65(12):1091–3. [PubMed]
72. Howes OD, Montgomery AJ, Asselin MC, Murray RM, Valli I, Tabraham P, et al. Elevated striatal dopamine function linked to prodromal signs of schizophrenia. Arch Gen Psychiatry. 2009;66(1):13–20. [PubMed]
73. Wittmann BC, Schott BH, Guderian S, Frey JU, Heinze HJ, Duzel E. Reward-related FMRI activation of dopaminergic midbrain is associated with enhanced hippocampus-dependent long-term memory formation. Neuron. 2005;45(3):459–67. [PubMed]
74. Schott BH, Seidenbecher CI, Fenker DB, Lauer CJ, Bunzeck N, Bernstein HG, et al. The dopaminergic midbrain participates in human episodic memory formation: evidence from genetic imaging. J Neuroscience. 2006;26(5):1407–17. [PubMed]
75. Piefke M, Weiss PH, Zilles K, Markowitsch HJ, Fink GR. Differential remoteness and emotional tone modulate the neural correlates of autobiographical memory. Brain. 2003;126(Pt 3):650–68. [PubMed]
76. Chiu CY, Schmithorst VJ, Brown RD, Holland SK, Dunn S. Making memories: a cross-sectional investigation of episodic memory encoding in childhood using FMRI. Dev Neuropsychol. 2006;29(2):321–40. [PubMed]
77. McGaugh JL. Memory: a century of consolidation. Science. 2000;287(5451):248–51. [PubMed]
78. Liddle PF, Lane CJ, Ngan ET. Immediate effects of risperidone on cortico-striato-thalamic loops and the hippocampus. Br J Psychiatry. 2000;177:402–7. [PubMed]
79. Howes OD, Egerton A, Allan V, McGuire P, Stokes P, Kapur S. Mechanisms underlying psychosis and antipsychotic treatment response in schizophrenia: insights from PET and SPECT imaging. Curr Pharmaceutical Des. 2009;15(22):2550–9. [PMC free article] [PubMed]
80. Euston DR, Tatsuno M, McNaughton BL. Fast-forward playback of recent memory sequences in prefrontal cortex during sleep. Science. 2007;318:5853. [PubMed]
81. McClelland JL, McNaughton BL, O’Reilly RC. Why there are complementary learning systems in the hippocampus and neocortex: Insights from the successes and failures of connectionist models of learning and memory. Psychol Rev. 1995;102:419–457. [PubMed]
82. Takashima A, Nieuwenhuis IL, Jensen O, Talamini LM, Rijpkema M, et al. Shift from hippocampal to neocortical centered retrieval network with consolidation. J Neurosci. 2009;29(32):10087–93. [PubMed]