Introduction

Functional brain imaging and MRI lesion-deficit correlation studies over the past several decades have dramatically reawakened interest in the neurobiology of language and aphasia while raising questions about many aspects of classical aphasia models. Several recent reviews focused on the concept of “Wernicke’s area” and how it might be interpreted in light of this new information [1, 2, 3•, 4•]. Here, I attempt to reconcile and synthesize some of these views and to present a clinically useful and straightforward model of Wernicke aphasia and related posterior aphasia syndromes.

Unpacking “Speech Comprehension”

One point garnering universal consensus recently is that there appears to be nothing like a localized “speech comprehension center” in the brain. One reason for this conclusion is that speech comprehension is itself not a single process. Imagine a task in which a patient is asked to decide if two meaningless speech sounds, say “kah” and “gah,” are identical, a task called phoneme discrimination. To perform the task, a complex cortical auditory network must identify the phonemes (consonant and vowel sounds) in each stimulus and hold these identities briefly in memory, but word meanings play no role in the task. Now imagine a task in which a patient is asked to decide which of two pictures matches the spoken word “frog.” In this case, the phonemes of frog must be identified, but there is an additional, qualitatively different process of retrieving a meaning or concept associated with the particular string of phonemes. Both of these processes, the first dealing with the auditory information in the stimulus and the second with retrieving an associated word concept, are necessary for speech comprehension. As might be expected, these distinct processes also have very different localization in the brain; thus, it serves no useful purpose to think of speech comprehension as a single, localized process.

Wernicke understood this two-stage model of comprehension well (Fig. 1), as did many subsequent authors who distinguished between a “sensory speech area” that processes the sounds of speech and a more broadly distributed “concept field” that stores word meanings [5,6,7]. Unfortunately, this critical distinction seems not to have survived translation into the clinical arena and was lost altogether from late twentieth century introductory neurology textbooks. As discussed below, extensive evidence from functional imaging and lesion-deficit correlation studies showing distinct phoneme perception and semantic memory networks in the brain can only be understood in terms of this two-stage model, and any discussion of what Wernicke’s area might mean must begin with an acceptance of this distinction.

Fig. 1
figure 1

Wernicke’s 1874 drawings (slightly modified from the original for clarity) illustrating centers and pathways supporting language. Figure 3 shows the word sound center (a 1 ) in the STG, which analyzes incoming speech sounds and guides the motor speech center (b) during speech output. Figure 4 shows, in addition, pathways linking the word sound center to the centers for tactile (c) and optical (d) memory images, which Wernicke believed were necessary for retrieval of the word concept. (“The concept is nothing more than the path cd.” [5])

A second aspect of Wernicke’s original model also needs clarification. In Wernicke’s view, the “word sound images” activated during speech perception are also activated during speech production. Their role in speech production, according to this view, is to provide a kind of template for the sounds to be produced, thereby “guiding” the motor system during articulation [5]. Damage to the sound images was therefore believed to not only impair speech perception but also result in errors (paraphasia) during speech production. As discussed below, the weight of the evidence suggests instead that there are separate neural systems in the superior temporal region for phoneme perception and for guiding motor output during speech production. A debate has ensued regarding which of these systems ought properly to be labeled Wernicke’s area [1].

Anatomical Definitions of Wernicke’s Area

Accompanying the confusion about speech comprehension processes was a gradual tendency by neurologists to attribute these processes to the posterior superior temporal gyrus (pSTG) and to label this structure Wernicke’s area. Wernicke originally localized his “auditory images” to the entire STG, based on what was at the time a well-educated guess that subcortical auditory pathways probably projected to the STG [5]. Subsequent generations of neurologists refined the localization to pSTG, or to pSTG and adjacent supramarginal gyrus [8,9,10,11]. Many authors over the years have disputed this localization, variously including parts of the middle temporal gyrus, angular gyrus, and even inferior temporal gyrus in their definition of Wernicke’s area [12,13,14,15,16], though in most of these cases, it is unclear whether the claims referred to Wernicke’s phoneme perception process or to a nonspecific speech comprehension function.

Much of modern thinking on this topic is influenced by the work of Geschwind, who followed earlier authors in emphasizing the role of the pSTG in Wernicke aphasia [17]. Geschwind was heavily influenced by findings of left-right asymmetry in the size of the planum temporale, which is the cortex posterior to Heschl’s gyrus on the dorsal STG. This structure is larger on the left side in approximately two thirds of right-handed people [18,19,20]. Geschwind and others interpreted this asymmetry as confirming a central role for the pSTG in language [21,22,23] and as the site where damage causes Wernicke aphasia. Many late twentieth century textbooks and review articles thus equate the pSTG with Wernicke’s area [17, 24,25,26]. Today, the label is reinforced on numerous internet sites, including Wikipedia (http://en.wikipedia.org/wiki/Wernicke's_area).

What has remained unclear as this evolution of terminology has proceeded is what actual process or processes are carried out by the pSTG and surrounding areas. The following sections review some more recent evidence on this critical issue.

Posterior Superior Temporal Gyrus and Conduction Aphasia

Autopsy reports from the pre-CT era are replete with cases of conduction aphasia resulting from focal damage to the pSTG [11, 27, 28], often accompanied by ad hoc explanations of why speech comprehension remained intact. CT and MRI studies have since confirmed that conduction aphasia is a typical result of focal lesions of the pSTG and SMG, sometimes including the posterior insula [29,30,31,32, 33•]. The constellation of phonemic paraphasia and intact comprehension that characterizes conduction aphasia is also elicited by cortical electrical stimulation of the pSTG and adjacent cortex [34,35,36,37].

According to the classical Wernicke-Lichtheim aphasia model, intact comprehension is not possible if Wernicke’s sensory speech area is damaged; thus, the combination of paraphasic speech and intact comprehension was explained as a disconnection between the intact sensory speech area and the intact motor speech area. Damage to this connection, later identified as the arcuate fasciculus, prevented the word sound images in Wernicke’s area from properly guiding the motor area, resulting in paraphasia.

An alternative view, strongly favored by evidence accumulating over several decades, is that conduction aphasia results from damage to a cortical region separate from both sensory speech perception and speech articulation systems. The function of this region, which includes the pSTG and adjacent cortex in the superior temporal sulcus and supramarginal gyrus, is to store and mentally activate phonological (speech sound) forms [38]. To demonstrate this function, consider that in reading this sentence, it is possible without much effort to recognize that the word snow rhymes with the word blow but not with plow. This “hearing in the mind” occurs without the need for any external speech input or overt speech production, indicating that there are ensembles of neurons in the brain that, when activated, produce a “mental image” of speech. A modern term for this process is phonological representation, where “representation” refers to both the mental image activated and the process of activation, and “phonological” specifies that it is the spoken form (as opposed to the written form or the meaning) of the word that is activated. Alternative terms in the literature include phonological encoding, phonological access, phonological retrieval, and others.

In this alternative account, phonological paraphasia occurs when brain damage disrupts the phonological representation itself, causing deletion, duplication, reordering, and other errors in the phoneme sequence that makes up the representation [39,40,41]. Phonological representation is a necessary stage prior to all speech output tasks, including spontaneous speech, naming, reading aloud, and repetition; thus, impairment causes paraphasia in all of these tasks. Importantly, the same phonological representations are also used to maintain speech sounds in short-term memory, such as when holding a phone number or other meaningless list in mind for a brief period [33•, 42•].

Four types of evidence support this new view of conduction aphasia and the pSTG. First are the previously mentioned studies of phonological paraphasia elicited by electrical cortical stimulation [34,35,36,37], which demonstrate that cortical disruption in this region (without white matter damage) can produce the paraphasic deficit, and do so without disrupting speech comprehension. Second are a large number of functional imaging studies that link neural activation in this region specifically with activation of phonological representations or maintenance of phonological representations during short-term memory tasks [33•, 42•, 43,44,45,46,47,48,49,50,51,52] (Fig. 2). Third is a recent lesion-deficit correlation study that identified sites where damage from stroke impairs performance on a silent rhyme matching task illustrated by the snow-blow-plow example above [53•]. This task requires no motor speech output, so failure cannot be ascribed to a disconnection between auditory and motor representations. Impairment on this task was correlated with damage to the left pSTG and supramarginal gyrus, indicating that patients with lesions in this region are unable to retrieve an internal mental image of the phonemes represented by the written words. Notably, lesions in this region did not cause impairment on a control task in which patients had to match written words on similarity of meaning, indicating that the deficit is specifically one of retrieving a phonological representation. Finally, the deficits observed with cortical degeneration in this brain region are consistent with an impairment of phonological representation. This syndrome, known as logopenic variant primary progressive aphasia (lvPPA), is characterized by phonological paraphasia, impaired verbal short-term memory, and intact word comprehension [54,55,56]. Other components of the lvPPA syndrome include anomia and varying degrees of sentence comprehension impairment. These components can also be explained by phonological representation failure, if one assumes that a severe deficit could cause complete failure of phonological retrieval (i.e., anomia) as well as inability to maintain a string of words in short-term memory during sentence comprehension.

Fig. 2
figure 2

A summary of 14 functional neuroimaging studies, showing peak locations where brain activation was correlated with activation or maintenance of phonological representations (adapted with permission from [2]). See appendix e-1 in [2] for a description of the studies

As this discussion suggests, the concept of phonological representation is central to understanding many language tasks, and impairments in this process account for a variety of related deficits. The broad evidence linking this function to pSTG and adjacent inferior parietal regions suggests that the anatomical zone traditionally designated Wernicke’s area plays a much larger role in speech production and verbal short-term memory tasks than in speech comprehension.

Phoneme Perception and Phoneme Deafness

Wernicke and other early aphasiologists believed that speech comprehension deficits from temporal lobe damage were the result of inability to recognize speech sounds. Damage to the sensory speech area was thought to prevent activation of the auditory images of spoken words. Modern evidence, however, suggests that sensory perception of phonemes (i.e., ability to distinguish between different speech sounds) is intact in most aphasia patients with comprehension impairment and that the deficit in these patients arises instead from inability to assign meaning to the speech sounds [57, 58]. Sparing of phoneme perception, even in patients with large temporal lobe lesions, is due to the fact that phoneme perception occurs bilaterally and can be supported independently by either hemisphere in isolation [59, 60]. Although patients with true phoneme perception impairment have been reported, this “pure word deafness” syndrome is rare because it generally requires bilateral temporal lobe lesions [7, 61, 62].

Functional neuroimaging studies have consistently localized phoneme perception to the lateral STG and adjacent superior temporal sulcus [63, 64, 65•]. The most consistent activation occurs in the mid-portion of the gyrus, anterior to the classical Wernicke area (Fig. 3). Consistent with the lesion evidence, activation in this region is bilateral, though typically stronger on the left side. Given its location, some authors have advocated “moving” the Wernicke area designation from pSTG to this more anterior location [65•, 66]. Two considerations weigh against this proposal. First is the evidence, mentioned above, that phoneme perception is intact in most patients with comprehension impairment. Thus, damage to this anterior phoneme perception region, even when it occurs, is not the cause of comprehension impairment in Wernicke aphasia. Second, Wernicke believed that one major function of his sensory speech center was to “guide” speech production. Although he was mistaken about the details of this auditory-motor transformation process, it turns out that damage to the pSTG reliably causes the phonological paraphasia he was attempting to explain. From a theoretical point of view, therefore, the pSTG has as much or more claim to the title of Wernicke’s area than the more anterior phoneme perception zone.

Fig. 3
figure 3

Meta-analysis of 22 fMRI studies comparing brain activation to speech vs. nonspeech control sounds, highlighting a mid-STG region (larger on the left) where responses are reliably stronger for speech sounds (adapted with permission from DeWitt I, Rauschecker JP. Phoneme and word recognition in the auditory ventral stream. Proc Natl Acad Sci U S A. 2012;109:E505–14). The more posterior region usually called Wernicke’s area is indicated with a dashed line

Semantic Memory and Comprehension Impairment

Given the considerable evidence, reviewed above, that lesions of the STG are not responsible for comprehension impairments in Wernicke aphasia, the question naturally arises: Where are the lesions responsible for this impairment? Examination of the extensive pre-CT era literature on this topic shows that lesions associated with Wernicke aphasia (including Wernicke’s two original cases; Fig. 4) are relatively large, typically including portions of the middle temporal gyrus (MTG) and angular gyrus in addition to STG and supramarginal gyrus [67], suggesting that these regions surrounding the STG may be critical for comprehension. More recently, several large, quantitative lesion-deficit mapping studies demonstrated an association between comprehension impairment and damage to MTG, angular gyrus, anterior STG, and several areas in left prefrontal cortex, but no association with damage to pSTG or SMG [68,69,70,71,72]. A smaller literature reports on lesions associated with transcortical sensory aphasia, a syndrome in which language comprehension is impaired but phonological processing is intact, as exemplified by the patients’ preserved ability to repeat aloud words and sentences they cannot understand. This syndrome is associated with large lesions affecting the left inferolateral temporal lobe (MTG, inferior temporal gyrus, fusiform gyrus) and angular gyrus, sparing the STG and supramarginal gyrus [73,74,75,76]. Notably, left frontal lobe lesions can also produce this syndrome [77,78,79].

Fig. 4
figure 4

A depiction of the lesions in Wernicke’s two original cases, inferred from the text description, with the principal areas cited as damaged listed below each drawing. In retrospect, Wernicke’s main conclusion (“We can thus assert with certainty that the softening of the first temporal convolution in the case of Rother was the only brain lesion which could have caused the localized symptom of aphasia.”[5]) appears somewhat premature

Comprehension impairment is a defining feature of the semantic variant of primary progressive aphasia (svPPA), also known as semantic dementia [80]. Similar to transcortical sensory aphasia, patients with svPPA are unable to comprehend words and sentences normally, but show normal phonological skills in speech repetition and other tasks. Cortical degeneration in this condition is focused in the anterior half of the temporal lobe, well outside the standard Wernicke area [3•, 81, 82]. The comprehension impairment in svPPA has been extensively studied and shown to be due to loss of basic concept knowledge, or semantic memory [80, 83,84,85]. Patients lose their knowledge of basic facts, such as the color, shape, size, and function of common objects.

Considered together, the data from stroke patients and patients with svPPA suggest that language comprehension depends on a large, distributed network including much of the inferolateral and anterior temporal lobe, angular gyrus, and prefrontal cortex. Extensive evidence from functional neuroimaging studies supports this conclusion. A meta-analysis of 120 such studies, focusing on retrieval of semantic information during word and sentence comprehension tasks, identified a widespread network of brain regions involved in this function, including angular gyrus, MTG, ventral temporal lobe, medial parietal cortex, medial prefrontal cortex, and inferior lateral prefrontal regions [86]. These results closely mirror the previously mentioned lesion data. Together, the data converge on the conclusion that many cortical regions support language comprehension, whereas the classical Wernicke area is one of the few brain regions that does not.

Middle Temporal Gyrus: a Language Comprehension Hub?

Much recent research on language comprehension networks in the brain has turned to the problem of understanding the roles of various nodes in this large system. At a broad level, frontal lobe components of the system, such as the pars orbitalis of the inferior frontal gyrus (Brodmann area 47) and the superior frontal gyrus, appear to be involved in initiating and controlling the activation of semantic information, rather than in long-term storage of the information per se [87,88,89,90]. Conversely, the temporal and inferior parietal components of the system are believed to function as the actual repositories of concept knowledge. Within this large posterior subnetwork, there appear to be many subregions defined by different types of knowledge content and varying degrees of intramodal and multimodal integration [91,92,93,94,95,96]. Defining these knowledge subnetworks is an ongoing area of active research.

The MTG, particularly its posterior half, occupies a unique position in the larger comprehension network, in that it is situated just below the pSTG/supramarginal phonological system, and may provide an interface or “hub” linking the inferior and anterior temporal lobe semantic memory system with the phonological system. In a study exploring the functional and structural connectivity of five regions in the comprehension network, Turken and Dronkers [97••] demonstrated that the posterior MTG has the most extensive connectivity of these regions, linking with angular gyrus, anterior and inferior temporal lobe, inferior and superior frontal gyri, and homologous nodes in the right hemisphere (Fig. 5a). This broad connectivity is due in large part to the fact that posterior MTG lies at a crossroad of several large white matter tracts linking these regions, including the inferior longitudinal, middle longitudinal, arcuate, and inferior fronto-occipital fasculi.

Fig. 5
figure 5

a Resting-state fMRI functional connectivity of five regions implicated in a large voxel-based lesion study of spoken language comprehension impairment [71]. The region covering the posterior half of the middle temporal gyrus (yellow) has the most widespread connectivity (adapted with permission from [97••]). b Results of another voxel-based lesion study, showing left hemisphere regions where damage produces greater impairment of spoken language comprehension than impairment of picture naming (adapted with permission from [98])

One likely role for such a hub is to facilitate comprehension of sentences, a complex task that requires analyzing the meanings of multiple words and the syntactic role of each word, while combining this information gradually into an overall meaning. Because these processes must unfold over several seconds as the sentence is presented, the phonologic, semantic, and syntactic information extracted must be maintained in short-term memory for a period of time. The function of the posterior MTG might therefore be to integrate word meanings activated throughout the semantic memory network, while maintaining the word forms and meanings in short-term memory through interactions with the phonological system and frontal lobe control systems. In a recent voxel-based lesion study, Pillay et al. tested this hypothesis by identifying lesions associated with phrase and sentence comprehension deficits [98]. Performance on a picture naming task was included as a control for nonspecific phonological and executive dysfunction. The posterior half of the left MTG was the only location where damage caused greater phrase and sentence comprehension impairment than naming impairment (Fig. 5b). Thus, the posterior MTG appears to be particularly critical for sentence comprehension, probably due to its widespread connectivity with semantic memory, phonological, and frontal control networks.

A Model of Posterior Language Networks and Related Aphasias

Figure 6 presents a visual summary of key temporal and parietal lobe language components and their participation in some common tasks used to assess aphasia. These tasks include

  1. 1.

    Speech repetition, which requires phoneme perception (yellow region in Fig. 6), followed by transformation of the auditory input into a phonological representation (Wernicke’s area, blue), which is then transformed into a motor sequence. Unilateral lesions in the phoneme perception area typically cause no impairment because this function is represented in both hemispheres. Lesions to the phonological representation system result in phonological paraphasia.

  2. 2.

    Single-word comprehension, which for spoken words require phoneme perception, followed by activation of an associated concept in the semantic memory network (red and green regions in Fig. 6). Damage to the semantic memory component is typically extensive in patients with single-word comprehension impairment, the most severe deficits occurring with bilateral damage to the anterior temporal component [3].

  3. 3.

    Sentence comprehension, which in addition to the processes engaged by word comprehension, requires integration of multiple word meanings and maintenance of word sounds and meanings in short-term memory. This integration process is particularly vulnerable to lesions in the mid-to-posterior MTG.

  4. 4.

    Sentence (spontaneous speech) production, which requires integration of multiple word concepts, followed by activation of corresponding phonological representations, followed by transformation into a motor sequence. Lesions confined to Wernicke’s area (phonological representation) result in phonological paraphasia during spontaneous speech, whereas lesions in the semantic system sparing Wernicke’s area result in verbal paraphasia (incorrect word selection), anomia, and “empty” speech lacking content.

Fig. 6
figure 6

A schematic model of posterior language zones and their involvement in several language tasks. The region often called Wernicke’s area (blue) supports activation of phonological representations, which play a central role in all speech output tasks (including reading, not shown here) and a supporting role in holding words in phonological short-term memory during sentence comprehension. The posterior half of the MTG (green) is proposed to serve as a comprehension “hub” that integrates the meaning of multiword utterances during sentence comprehension and production. Lesions in this latter region likely account for most of the language comprehension impairments observed in patients with Wernicke aphasia

From this model, it is easy to understand the Wernicke aphasia syndrome as a commonly occurring combination of damage to phonological representations in Wernicke’s area (resulting in the paraphasic speech characteristic of the syndrome) together with varying amounts of damage to the surrounding semantic memory network, resulting in varying degrees of comprehension impairment. When the surrounding damage is confined to the MTG, the comprehension impairment is likely to affect mainly sentence comprehension and spare single-word recognition (for example, ability to match words with pictures).

It should be emphasized that all of these processes are under varying degrees of “control” by frontal systems. Posterior inferior frontal regions (precentral gyrus and pars opercularis) interact closely with Wernicke’s area to maintain phonological representations in short-term memory, and more anterior inferior frontal regions (pars orbitalis and triangularis) likely control activation of particular concepts in semantic memory depending on context and task demands [87,88,89,90].

Conclusion: Is Wernicke’s Area a Useful Concept?

Current controversies regarding the meaning of Wernicke’s area are understandable given the incompatibilities between classical notions of this region and the modern empirical evidence just reviewed. To summarize,

  • The anatomical region now almost universally called Wernicke’s area (pSTG and adjacent supramarginal gyrus) likely plays no role in speech comprehension. The main function of this region is phonological representation prior to speech production.

  • The classical concept of a single phonological representation supporting both speech comprehension and speech production is not supported by the evidence. Phonological representation and phoneme perception show different localization in fMRI studies. Distinct lesions produce phonological paraphasia and phoneme deafness.

  • The classical notion that comprehension impairment in Wernicke aphasia results from damage to a sensory speech perception process is mistaken. The comprehension deficit in this syndrome is unrelated to phoneme perception ability and instead reflects an inability to activate word and sentence meaning.

Given these significant incompatibilities, does the term Wernicke’s area retain any clinical utility? As discussed above, proposals to move Wernicke’s area to the more anterior phoneme perception zone seem ill considered on both theoretical and historical grounds. As discussed elsewhere, the notion of redefining Wernicke’s area to include all the regions where damage causes comprehension impairment would render the term anatomically meaningless [2]. Tremblay and Dick [4•] recently proposed that the terms Wernicke’s area and “Broca’s area” both be abandoned, though the argument in the case of Wernicke’s area was based on small variations in how the term is applied anatomically, rather than on the more substantial theoretical problems outlined above.

In my view, the use of the term Wernicke’s area is problematic and confusing because of its continued strong association with speech comprehension. As discussed above, speech comprehension is not a unitary process, and the anatomical region typically labeled Wernicke’s area plays little or no role in this complex function. Given these considerations, the term should never be used to mean “speech comprehension area.” It may retain some usefulness as an anatomical label for the cortex surrounding the lower bank of the posterior Sylvian fissure (pSTG and posterior supramarginal gyrus) and as shorthand for “cortex involved in phonological representation” or “cortex where a focal lesion causes conduction aphasia.”