Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (51)
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Salmelin, R.
Right arrow Articles by Freund, H.-J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Salmelin, R.
Right arrow Articles by Freund, H.-J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Brain, Vol. 123, No. 6, 1184-1202, June 2000
© 2000 Oxford University Press

Single word reading in developmental stutterers and fluent speakers

R. Salmelin1, A. Schnitzler2, F. Schmitz2 and H.-J. Freund2

1 Brain Research Unit, Low Temperature Laboratory, Helsinki University of Technology, Espoo, Finland and 2 Department of Neurology, Heinrich-Heine-University, Düsseldorf, Germany

Correspondence to: Riitta Salmelin, Brain Research Unit, Low Temperature Laboratory, Helsinki University of Technology, PO Box 2200, FIN-02015 HUT, Finland E-mail: riitta{at}neuro.hut.fi


    Abstract
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Ten fluent speakers and nine developmental stutterers read isolated nouns aloud in a delayed reading paradigm. Cortical activation sequences were mapped with a whole-head magnetoencephalography system. The stutterers were mostly fluent in this task. Although the overt performance was essentially identical in the two groups, the cortical activation patterns showed clear differences, both in the evoked responses, time-locked to word presentation and mouth movement onset, and in task-related suppression of 20-Hz oscillations. Within the first 400 ms after seeing the word, processing in fluent speakers advanced from the left inferior frontal cortex (articulatory programming) to the left lateral central sulcus and dorsal premotor cortex (motor preparation). This sequence was reversed in the stutterers, who showed an early left motor cortex activation followed by a delayed left inferior frontal signal. Stutterers thus appeared to initiate motor programmes before preparation of the articulatory code. During speech production, the right motor/premotor cortex generated consistent evoked activation in fluent speakers but was silent in stutterers. On the other hand, suppression of motor cortical 20-Hz rhythm, reflecting task-related neuronal processing, occurred bilaterally in both groups. Moreover, the suppression was right-hemisphere dominant in stutterers, as opposed to left-hemisphere dominant in fluent speakers. Accordingly, the right frontal cortex of stutterers was highly active during speech production but did not generate synchronous time-locked responses. The speech-related 20-Hz suppression concentrated in the mouth area in fluent speakers, but was evident in both the hand and mouth areas in stutterers. These findings may reflect imprecise functional connectivity within the right frontal cortex and incomplete segregation between the adjacent hand and mouth motor representations in stutterers during speech production. A network including the left inferior frontal cortex and the right motor/premotor cortex, likely to be relevant in merging linguistic and affective prosody with articulation during fluent speech, thus appears to be partly dysfunctional in developmental stutterers.

human; magnetoencephalography; language disorders; speech production; reading

BA = Brodmann area; fMRI = functional MRI; MEG = magnetoencephalography; ROI = region of interest; SPECT = single photon emission computerized tomography; TOI = time window of interest; TSE = temporal spectral evolution


    Introduction
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Developmental stuttering is a sporadic disorder of speech production, which typically emerges at the age of 2–4 years. Stuttered speech is characterized by repetitions and prolongations of phonemes or syllables. For some individuals certain phonemes, especially consonants, may be particularly problematic. Dysfluency occurs most often at the beginning of a sentence or, more generally, when a new complete idea has to be expressed. Isolated words are stuttered less often. Stuttering occurs most frequently in self-initiated, self-paced discourse, but it is also evident when reading aloud. Increased emotional content in the discourse increases the frequency of stuttered events. The original incidence of stuttering is about 4%, but a vast majority of affected children show spontaneous recovery. About 1% of the population continues to suffer from severely stuttered speech even in adulthood, with a male-to-female ratio of 3 : 1 (cf. Starkweather, 1987; Bloodstein, 1995).

Theories of stuttering range from an isolated disorder of the speech motor system or auditory feedback to disrupted interaction in the multiple sensorimotor systems, and further to stuttering as a manifestation of an unsuccessful alignment of subsequent sentence plans. Stuttering could be an outcome of periodic irregularities in the timing of muscle movements within the speech system (Zimmermann, 1980Go). When the background tension is high, as is often the case in stutterers (Freeman and Ushijima, 1978Go), the high-precision adjustments needed during speech become difficult to perform and the movements are jerky. Stutterers display poor coordination of antagonistic laryngeal muscles (Freeman and Ushijima, 1978Go), and they are systematically slower in initiating phonation than non-impaired speakers (Bloodstein, 1995Go). It has been suggested that specific neural correlates of the dysfunction of the motor system of stutterers can be found, e.g. in the coordination of speech movements by the supplementary motor area (Caruso, 1991Go). Stutterers also show abnormalities in rapid finger movements (Jäncke et al., 1995Go).

Malfunction of the auditory system during self-monitoring of speech has been suggested to underlie stuttering (Fairbanks, 1954Go). Normal speakers become dysfluent when exposed to delayed auditory feedback (Lee, 1951Go). In dichotic presentation of meaningful linguistic stimuli, a large proportion of stutterers fail to show the normally observed right-ear advantage (Curry and Gregory, 1969Go; Hall and Jerger, 1978Go). In fluent speakers, the left auditory cortex is more sensitive to the side of stimulation (right versus left ear), whereas the right auditory cortex is more sensitive in stutterers (Salmelin et al., 1998Go). Stutterers have also been reported to have difficulties in sound localization (Rousey et al., 1959Go). On the other hand, a possible central auditory processing deficit is likely to play only a minor role at the sentence-initial positions where stuttering most often occurs.

The small differences in motor or auditory function between fluent speakers and stutterers may reflect a more general interference between speech production and language formulation in stutterers (Starkweather, 1987Go). Karniol has recently proposed a framework for understanding stuttering, based on the idea that we produce complete sentences rather than single words when speaking or reading aloud (Karniol, 1995Go). Stuttering typically appears in children at the point where they work their way into more and more complex sentence structures (Starkweather, 1987Go; Bloodstein, 1995Go). Words are produced differently, in a shortened and modulated fashion, in a sentence than in isolation, and the way the word is produced depends on the sentence in which it is embedded. Sentences have suprasegmental features, including rhythm, melody and stress, that are largely determined prior to initiation of the utterance (see Karniol, 1995 for a review). The theory suggests that stuttering is caused by a misalignment at the border between subsequent suprasegmental plans. The more complex the utterance, the more difficult it is to superimpose a fundamental frequency contour, or prosody, on it.

Stuttering can be made to essentially disappear by reading in chorus with another person, or even by pacing the speech with a metronome (Johnson and Rosen, 1937Go). Both of these procedures reduce the need to build a prosodic contour for the expression, but the effect may also be due to the overall slowing down of speech production (Starkweather, 1987Go). Delayed auditory feedback (Lee, 1951Go; Soderberg, 1968Go) and auditory masking (Cherry and Sayers, 1956Go) can relieve stuttering. Again, the beneficial effect may arise from suppression of a defective auditory feedback system or from an overall slower speech rhythm resulting from the interference (Starkweather, 1987Go; Bloodstein, 1995Go). Stutterers can usually sing fluently, possibly because songs have no self-formulated propositional content (Starkweather, 1987Go).

Recently, the neural basis of developmental stuttering has been assessed using single photon emission computerized tomography (SPECT; Pool et al., 1991) and PET (Wu et al., 1995Go; Fox et al., 1996Go; Braun et al., 1997Go). Pool and colleagues reported global absolute blood flow reductions in stutterers as compared with fluent speakers in a resting condition (Pool et al., 1991Go). Braun and colleagues found significant differences between the stutterer and control groups when all subjects were speaking fluently (Braun et al., 1997Go). The cerebral function of fluent speakers and stutterers thus shows fundamental variance even in the absence of overt stuttering. To evoke dysfluent versus fluent speech, Wu et al. and Fox et al. asked the subjects to read aloud both self-paced and in chorus, while Braun et al. employed metronome pacing and recital of a familiar song in the fluent condition, and spontaneous narrative speech and sentence construction in the dysfluent condition (Wu et al., 1995Go; Fox et al., 1996Go; Braun et al., 1997Go). In these studies, the activation patterns differed between the subject groups during fluent speech, and between the fluent and dysfluent conditions in the stutterers, extending over a large number of cortical and subcortical areas. Results of previous neuroimaging studies seem to indicate that stuttering is associated with reduced activation of the left-hemisphere frontotemporal language areas (Wu et al., 1995Go; Fox et al., 1996Go; Braun et al., 1997Go). At the same time, the right hemipheric regions, including the motor and premotor cortices, show exceptionally strong blood flow in stutterers (Fox et al., 1996Go; Braun et al., 1997Go). Both PET (Fox et al., 1996Go) and magnetoencephalographic (MEG; Salmelin et al., 1998) studies have implied altered auditory cortical function in stutterers, particularly in the left hemisphere. Although the picture still remains rather unspecific with regard to the possible causes of stuttering, there is clear evidence for extensive functional differences in the brains of stutterers and fluent speakers.

We employed whole-head MEG to investigate the timing of cortical activation sequences in developmental stutterers and fluent speakers. As MEG combines excellent temporal resolution with good accuracy of localization of active cortical areas, it is a useful tool for characterizing the activation sequence from visual perception to oral output (cf. Salmelin et al., 1994) and for identifying cortical correlates of disorders in these processes. The subjects read aloud single words. The vocalization was delayed by half a second to highlight perceptual and motor production aspects of the process, and to reduce the effect of mouth movement artefact on the early cortical activation patterns. This task is both behaviourally and experimentally straightforward and convenient. It was not expected that it would evoke much stuttering, except in the most severely affected individuals. The paradigm allows comparison of brain function in fluent speakers and stutterers during essentially identical overt performance. Earlier imaging results have shown differences in cerebral blood flow between stutterers and fluent speakers at rest and in fluency-evoking reading conditions (Pool et al., 1991Go; Braun et al., 1997Go). It is thus possible that the seeds for stuttering are present constantly but the threshold for overt dysfluency is exceeded only periodically (Bloodstein, 1995Go). Developmental stuttering is a particularly intriguing disorder as the deficit is only functional, without an obvious structural correlate. Distinct cortical activation patterns in stutterers and fluent speakers, associated with identical overt behaviour, could elucidate not only the neuronal basis of stuttering but also the steps necessary for normal speech production.


    Methods
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Subjects
Nine developmentally stuttering subjects (S1–S9; 22–53 years, mean 36 years, 7 males) and 10 fluent speakers (C1–C10; 25–52 years, mean 34 years, 8 males) gave their informed consent to participate in the study, which was approved by the ethical committee of The Medical Faculty of the Heinrich-Heine-University. All subjects were German-speaking and right handed as assessed by a handedness questionnaire (Annett, 1967Go) and a standardized hand performance test (Jäncke et al., 1995Go). Stuttering was very mild in one subject, mild in two, moderate in two, severe in two and very severe in two (Stuttering Severity Instrument; Riley, 1972). All our stutterers had tried at least one type of therapy for their speech problems at some point in their life, including conventional speech therapy (6 subjects), metronome-paced speaking (2 subjects), psychotherapy (3 subjects), relaxation techniques (1 subject) and fluency therapy (1 subject). No therapeutical interventions were administered while the subjects participated in this study.

All subjects were tested on the Wechsler adult intelligence scale (WAIS; Wechsler, 1991). The performance of both fluent speakers (115 ± 4.5, mean ± standard deviation) and stutterers (110 ± 7.2) was within normal range. There were no significant group differences.

Task
The stimuli were common German nouns, composed of 7–8 letters (48% concrete nouns, 42% abstract nouns, 10% with both a concrete and an abstract meaning). Each word was presented for 300 ms. After a blank interval of 500 ms, a question mark appeared for 2000 ms, prompting the subject to read the word aloud. The question mark was followed by a blank period of 2000 ms. The whole sequence (word–blank–question mark–blank) was thus repeated every 4.8 s. Altogether the stimulus set contained 250 different words.

To obtain functional landmarks in the auditory cortex and sensorimotor hand area, in separate runs, the subjects received 1 kHz, 50 ms tones every 1 s, alternately to the left and right ear, and performed self-paced index finger lifts approximately every 3 s, alternately with the left and right hand. Spontaneous brain activity during resting was recorded for 1 min when the subjects had their eyes closed and for 1 min when their eyes were open. The results of the auditory experiment have been reported separately (Salmelin et al., 1998Go).

MEG
Neuromagnetic signals reflect synchronous postsynaptic potentials in tens of thousands of pyramidal cells within a cortical patch on the order of a square centimetre. Because of the closely spherical symmetry of the human brain and skull, the detected MEG signals are mainly associated with electric current flowing parallel to the skull, i.e. with activation in the fissural cortex. A detailed description of the MEG method is given by Hämäläinen and colleagues (Hämäläinen et al., 1993Go).

The Neuromag-122TM whole-head MEG system (Neuromag, Helsinki, Finland) contains 122 sensors arranged on a helmet-shaped surface. Each sensor is composed of a pick-up coil, which collects the magnetic field associated with neuronal current flow, and a superconducting quantum interference device (SQUID). The SQUID transforms the magnetic field to voltage which can be measured with high accuracy (Ahonen et al., 1993Go). The planar gradiometers used in Neuromag-122TM detect the maximum signal immediately above an active cortical area.

Measurement procedure
The measurements were performed in a magnetically shielded room. The subject was seated on a chair, with the head supported against the helmet-shaped bottom part of the MEG apparatus. The words, white letters on a dark grey background, subtended a 4° visual angle on a back-projection screen placed at 1 m from the subject.

The MEG signals were recorded with a 0.03–130 Hz filter and digitized at 0.4 kHz. Both vertical and horizontal EOG (electro-oculogram) were recorded simultaneously. In addition, lip movements were monitored with EMG across the opposite corners of the mouth (orbicularis oris muscle); in two stutterers and in one control subject, technical problems prevented the collection of the lip EMG signals. The subject's speech was registered with a microphone and stored on audiotape. The continuous MEG, EOG, EMG and microphone records were stored on magneto-optical disk for off-line analysis.

MEG signals were averaged on-line from –200 to +1500 ms with respect to stimulus onset. Epochs contaminated by eye or eyelid movements were rejected from the average. A minimum of 90–100 artefact-free epochs were collected for all subjects. The stimuli were presented in blocks of about 60 words, each lasting for 5 min. In fluent subjects, two blocks usually provided enough repetitions. In the two most severe stutterers, all four blocks had to be run to obtain the minimum number of non-stuttered and artefact-free epochs.

Data analysis
Behavioural measures
The verbal responses of the stutterers were evaluated after the measurement from the audiotape recording by two independent workers. A response was accepted as fluent when both referees unequivocally agreed on this classification. Stuttered responses had a clear repetition of a phoneme or syllable. If there was a hint of dysfluency but not full-blown stuttering, or if the two referees disagreed on the classification, the response was considered ambiguous. The ambiguous category never exceeded 20% of the trials. The subjects were also interviewed for their personal impression after the MEG recording.

Mouth movement and speech onset latencies were determined from burst onsets in the lip EMG and microphone records in all subjects. In each individual, the EMG and microphone signals were rectified and averaged with respect to word onset to obtain the overall shape of mouth muscle activity and vocalization. For calculating the mean shape across subjects, the individual signals were normalized by setting the maximum equal to 1.

Time-locked evoked responses
The MEG signals of the stutterers were re-averaged off-line for fluent, stuttered and ambiguous responses, from 200 ms before to 1800 ms after word presentation. The MEG signals were also averaged off-line with respect to lip movement (–1000 to +1000 ms) and microphone signal onsets (–1000 to +1000 ms) in all subjects. In the following text, these three different reference points will be referred to as word onset, mouth movement onset and speech onset. Before source analysis, the MEG data were further low-pass filtered at 40 Hz.

All subjects' data were analysed individually. A muscle/tongue artefact coincided with microphone onset in seven of the 10 controls and in all stutterers. The conspicuous bilateral artefact pattern, which was at a maximum towards the rim of the helmet, was highly similar in all subjects. The artefact distribution was identified at the time point where there was the least evidence of simultaneous cortical responses, and the field pattern was removed from the MEG signals using the signal-space-projection method (Uusitalo and Ilmoniemi, 1997Go). Field patterns at other time intervals before and after the peak disturbance were visually inspected to verify that the actual cortical activation patterns were not affected by the signal-space-projection procedure. In two control subjects, the artefact could not be removed because of simultaneous strong cortical activity, but the cortical activation patterns could still be analysed satisfactorily. When source modelling was complete, the data were checked both in the original form and with the artefact removed.

The active cortical areas were modelled as current dipoles (Hämäläinen et al., 1993Go). The dipole's location, orientation and amplitude represent the centre of gravity of the active cortical patch and the direction and mean strength of the current flow therein. The process of source modelling consists of continuous interplay between visual inspection of coherent local signal variations in the original responses, a search for clear dipolar field patterns in the analysis programme, and evaluation of how well the source model accounts for the measured signals (goodness-of-fit). The current dipoles were identified one by one, at time points where each specific field pattern was clearest. The sources were then brought into a multi-dipole model where the source locations and orientations were kept fixed while their amplitudes were allowed to vary as a function of time to best account for the signals measured by all the 122 sensors. The resulting time courses of activation in the cortical source areas are referred to as source waveforms, to separate them from the original 122 MEG sensor waveforms. The complete model included between nine and 13 sources in each individual, when responses were averaged with respect to stimulus onset, and between four and nine sources when the responses were averaged with respect to lip EMG onset. The goodness-of-fit varied between 70 and 90% across subjects and analysis intervals. Each source accounted for 18 ± 13% (mean ± standard deviation) in the goodness-of-fit value during the time interval when the source was most active, and a minimum of 5% within one hemisphere. The accuracy of source localization was on average 6 mm (95% confidence limit).

The individual source models were compared across subject groups. Based on clustering of the sources, the brain was divided into 12 regions of interest (ROIs): midline occipital cortex, left and right occipito-temporal, inferior frontal, superior temporal, inferior parietal and rolandic cortex, and vertex (see detailed description in Results). The source waveforms were averaged across subjects within each ROI. When a subject had several sources in one region, the waveforms were added together. If a subject did not have a source within a certain ROI, the source waveform was set equal to zero, as the signal was apparently so small that no distinct source area could be identified. The waveforms in a specific ROI were included in further analysis only if at least half of either controls or stutterers, i.e. a minimum of five subjects, showed a response there. This criterion excluded the right superior temporal cortex from further consideration, leaving 11 ROIs for detailed analysis.

Within each ROI, time windows of interest (TOIs) were chosen as those time intervals where the averaged waveforms in fluent speakers and stutterers differed by at least their standard errors of mean. The source waveforms were analysed in 25 ms bins, corresponding to the smoothing effect of the 40-Hz low-pass filter. The time-locked responses are typically quite sharp (duration up to 100 ms) within the first 400 ms after stimulus onset and become temporally more wide-spread or sustained at longer latencies. Therefore, for an interval to qualify as a TOI, the mean source waveforms of the two groups were required to differ continuously for at least 50 ms (two adjacent 25 ms bins) at latencies 0–400 ms after stimulus onset. For latencies longer than 400 ms, the waveforms of the two groups were required to show a difference lasting for at least 100 ms. The requirement of a minimum difference of 100 ms was also applied for source waveforms averaged with respect to lip EMG onset. For each candidate ROI/TOI, the mean signal strength was calculated in each subject. Group differences in the mean source strengths were tested using one-way ANOVA (analysis of variance).

For estimating whether source strength differed from zero within a certain ROI/TOI, we used the base level of each waveform, i.e. the standard deviation within the prestimulus interval (–200 to 0 ms). The activation was taken to differ significantly from zero at P < 0.05, P < 0.01 and P < 0.001, when the amplitude exceeded 1.96, 2.58 and 3.29 SD, respectively. Again, to be accepted as a true response, a peak was required to be non-zero at least for 50 ms within the first 400 ms and for at least 100 ms at longer latencies.

Cortical rhythmic activity
In addition to the time-locked evoked responses, we analysed event-related modulation of cortical rhythmic activity. First, amplitude spectra were calculated for each subject in all measurement conditions (word reading, auditory stimulation, finger movements, resting) by advancing a 2.6 s window in 1.3 s steps through the entire non-averaged data set and averaging the resulting spectra. In all subjects, it was possible to identify four distinct spectral ranges (passbands): (i) 8–11 Hz (low 10 Hz); (ii) 11–15 Hz (high 10 Hz); (iii) 15–21 Hz (low 20 Hz); and (iv) 21–28 Hz (high 20 Hz), with the borders varying by 1–3 Hz across individuals. The event-related modulation of the cortical rhythms was analysed with the Temporal Spectral Evolution (TSE) approach (Salmelin and Hari, 1994bGo). The MEG signals were filtered through the four individually determined passbands described above, rectified (absolute value), and averaged with respect to stimulus onset. For quantification of the modulation, the TSEs were also calculated in the fixed passbands of 8–14 Hz (`10 Hz') and 16–28 Hz (`20 Hz') for all subjects.

Sources of the rhythmic activity were searched every 10 ms from 150 s of non-averaged signals (2 x 30 s during word reading and finger movements and 30 s while resting with eyes closed) filtered through the individually selected passbands, using subsets of sensors over the left and right hemispheres (covering the temporal lobe and the central sulcus), the posterior areas (posterior parietal and occipital cortex), and the vertex (Salmelin and Hari, 1994aGo). Rhythmic activity was concentrated close to the hand and mouth areas, and around the parieto-occipital sulcus and calcarine fissure. Additional clusters (1–2) in the posterior parietal cortex were seen in some subjects. During strong bursts of rhythmic activity, in particular, it was possible to model the generators as current dipoles. The hand area rhythms could be identified in any frequency range whereas in the mouth area the bursts concentrated in the 20-Hz range. Each distinct cluster (5–7 per subject) was represented by a single dipole. By forming a multidipole model, TSE curves of the cortical sources were obtained (Salmelin et al., 1995Go). Visual comparison of the whole-head TSE curves resulting from this dipole model with the original TSE curves showed that the selected generators explained the measured modulation of rhythmic activity in all subjects.

Statistical tests were performed on the source TSE waveforms in the hand and mouth areas (instead of original sensor outputs), using a mixed-model ANOVA (hemisphere x area x subject group).

MEG and MRI
Anatomical MRIs were available for all subjects. For presenting the functional MEG results on the MRIs, the two coordinate systems were aligned with the help of three small coils placed on the subject's head prior to the measurement. Using a 3D digitizer (Isotrak 3S1002, Polhemus Navigation Sciences, Colchester, Vt., USA), the positions of these coils were determined with respect to three landmarks on the head, i.e. nasion and points just anterior to the ear canals, which are readily identified on the MRIs. The locations of the coils with respect to the MEG helmet were determined by briefly energizing the coils and calculating their locations from the magnetic field patterns.

The MRIs of all subjects were available both as slices and as surface renditions. Sources of the auditory and finger sensorimotor activations were superimposed on the subject's MRIs. Their locations in the suprasylvian auditory cortex around Heschl's gyrus and in the hand knob along the central sulcus (cf. Yousry et al., 1997) verified the correct alignment of the MEG and MRI coordinate systems. The sources of the word reading task were then superimposed on the individual MRIs. The sources were located at a depth of 12 ± 4 mm (mean ± standard deviation). For surface renditions, the sources were projected along the head radius, and adjusted to the correct sulcus with the help of the 3D MRI slices.

For comparison of active areas in the two subject groups, the sources were further transferred on a single subject's brain. Care was taken to ensure that the source locations remained correct in relation to the sulcal structure and functional landmarks, with the help of both surface MRIs and 3D MRI slices.


    Results
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Mouth movement and speech
Distribution of onset latencies and mean time behaviour of mouth muscle activity and speech signal are displayed in Fig. 1Go across all fluent speakers (Fig. 1AGo) and for non-stuttered words in developmental stutterers (Fig. 1BGo). Mouth movement started on average 960 ms after word onset, i.e. 160 ms after the question mark prompt for vocalization, and overt speech began on average 290 ms later. Mouth movements lasted for ~1050 ms and speech for ~650 ms. Fluent speakers and fluently reading stutterers did not differ in any of these measures. Their overt performances were thus indistinguishable in this task.



View larger version (28K):
[in this window]
[in a new window]
 
Fig. 1 Distribution of mouth movement onset latencies (upper panels) and speech onset latencies (lower panels), displayed as bar graphs, and the mean shape of mouth EMG (upper panels, black curve) and microphone signal (lower panels, grey curve) in (A) fluent speakers and (B) stutterers when they were fluent. The proportion per bar is given on the left vertical axis (left arrow) and the mean normalized signal strength (maximum set equal to 1 in each individual) on the right vertical axis (right arrow). The word was shown at 0 s (left vertical dashed line) and the question mark at 0.8 s (right vertical dashed line). The width of each bar is 0.1 s.

 
Although muscle activity occasionally started before the question mark onset, both the maximum muscle activity and the microphone signal followed the vocalization prompt, verifying the correct performance of the task. Interestingly, while the speech onset latency was the same in fluent speakers and stutterers, the mouth movement onsets tended to spread into earlier latencies in stutterers than fluent speakers (bars in the upper row). Severity of stuttering did not correlate with mouth movement or speech onset or duration, or delay from mouth movement to speech onset.

As expected, the stutterers were mainly fluent in this task. Five stutterers were dysfluent at least occasionally (range 9–108 stuttered words), but only one of them had a large enough number of stuttered trials to provide an acceptable signal-to-noise ratio in the MEG signals. The ensuing analysis compares cortical activity of stutterers and control subjects when both were fluent.

Activity time-locked to word presentation
Figure 2Go displays the whole-head MEG signals and the corresponding source waveforms in one fluent subject (C1) from 200 ms before to 1400 ms after word onset. The signals are strongest immediately above the active cortical area. Even from the whole-head data, one can readily recognize the early occipital and right fronto-parietal activations, and a persistent response over the left temporo-parietal cortex. A detailed source analysis (see Methods) revealed 11 reliable source areas which accounted for at least 80% of the MEG signal variance during most of the studied interval when all the 122 sensors were included. The medial and lateral occipital areas (sources 1, 2 and 3) were active first, followed by the right rolandic source 4, each for less than 100 ms. The right (source 5) and left (source 7) inferior parietal and left superior temporal cortex (6) then started to participate, remaining active for 200–400 ms. A brief signal from the left inferior frontal cortex (source 8) was then followed by activation of the left posterior temporoparietal cortex (source 9). Activation of sources 10 and 11, reflecting involvement of the left posterior temporal cortex and the left dorsal premotor cortex, respectively, continued when the question mark appeared at 800 ms, prompting the subject to read the word aloud.



View larger version (37K):
[in this window]
[in a new window]
 
Fig. 2 Left: MEG responses of fluent subject C1, averaged with respect to word onset (at 0 ms). The measurement helmet is viewed from above, flattened onto a plane, with the nose pointing upwards. Time is given in the horizontal axis (from 200 ms before to 1400 ms after word onset) and the variation of magnetic field in the vertical axis. Neuromag-122TM records maximum signal immediately above an active cortical area. The 122 sensors are arranged in 61 locations along the helmet. In each location, there are two overlapping sensors (see schematic heads in the upper right corner). The topmost one is most sensitive to `vertical' currents flowing towards or from the vertex, and the lower one to `horizontal' currents flowing around the perimeter. Right: Source analysis of the MEG signal distribution over the same time interval as on the left. Source strengths (in nanoamperemetres, nAm) are plotted as a function of time. The word onset at 0 ms and the appearance of the question mark prompt at 800 ms are indicated with solid vertical lines. The locations (dots) and orientations (tails) of the dipolar sources are shown on schematic drawings of the subject's brain.

 
The original 122 MEG signals were resolved into the time behaviour of distinct cortical areas in all individuals. Figure 3Go combines the source areas of all fluent subjects on the left and all stutterers (when they were fluent) on the right. The similarity of the sources was judged by the location and orientation of current flow. The distinct clusters of sources are indicated with different shapes and colouring of the markers.



View larger version (95K):
[in this window]
[in a new window]
 
Fig. 3 Source areas in fluent subjects (left) and stutterers (right), when the MEG signals were averaged with respect to word onset. The different shapes and colours of the symbols (white and black circles, triangles, squares and arrows) depict the grouping of sources into distinct ROIs (see text for details). The grey circles denote sources which do not belong to any well-defined cluster. The black diamond indicates the location of the hand sensorimotor cortex. The size of the symbols equals the mean accuracy of localization (6 mm, see Methods).

 
The ROIs were identical in the left and right hemispheres. The `inferior frontal' subregion (Fig. 3Go, white squares) included sources clustering predominantly in the frontal operculum and insula, with posterior-to-anterior current flow (cf. Fig. 2Go, source 8, for single-subject data). This ROI was delimited from the inferior end by the superior temporal sulcus. The `superior temporal' subregion (black triangles) included sources with current flow directed away from the vertex (cf. Fig. 2Go, source 6). The `frontoparietal' subregion (Fig. 3Go, white circles) covered the primary somatosensory and motor cortices and the premotor and dorsal prefrontal cortex, delimited by the postcentral sulcus, precentral sulcus, and the inferior frontal sulcus. In the `frontoparietal' ROI the current was mainly orientated along the posterior–anterior axis (cf. Fig. 2Go, sources 4 and 11). The `inferior parietal' subregion (Fig. 3Go, white arrowheads) was defined by the postcentral sulcus, sylvian fissure, and the posterior ascent of the superior temporal sulcus, thus covering the angular gyrus and supramarginal gyrus. The current was typically orientated along the anterior-to-posterior axis (cf. Fig. 2Go, source 7). In the `occipitotemporal' ROI (black circles) the current usually flowed towards the vertex (cf. Fig. 2Go, sources 2 and 3), and the area was confined by the ascending ramus of the superior temporal sulcus, the anterior occipital sulcus, and the lunate sulcus. The `occipital' subregion (Fig. 3Go, black squares) included all parieto-occipital sources clustering close to the posterior midline. In the `vertex' ROI (Fig. 3Go, white inverted triangles), the current flow was in the posterior–anterior direction and the sources concentrated frontally to the central sulcus. In addition, scattered sources were found in the left middle temporal cortex (Fig. 3Go, grey circles) but they did not form clear clusters, based on either location or orientation, or on the percentage of subjects showing the same type of response. Accordingly, further analysis was conducted in 11 ROIs: left and right inferior frontal, frontoparietal, inferior parietal, and occipitotemporal cortices, left superior temporal cortex, the occipital subregion, and over the vertex.

Figure 4Go gives the mean ± standard error of the mean source waveforms in each of the 11 ROIs, averaged over fluent subjects and stutterers. The responses can be divided roughly into three stages. In the first stage, within 200 ms after word onset, the occipital (area 1) and left and right occipitotemporal cortices (areas 2 and 3) showed strong, transient signals, and also weaker responses after the question mark. In the second stage, at 200–600 ms, the persistent occipital activation was accompanied by responses in the left and right inferior frontal (areas 4 and 5), left superior temporal (area 6), and left and right inferior parietal subregions (areas 7 and 8). In the third stage, the left and right frontoparietal cortices (areas 9 and 10) and the vertex (area 11) became involved and remained active throughout the vocalization prompt (question mark at 800 ms) and mouth movement and speech onsets (at about 960 ms and 1240 ms).



View larger version (46K):
[in this window]
[in a new window]
 
Fig. 4 Mean ± standard error of the mean (black curve and shading) source strengths as a function of time (cf. Fig. 2Go) in fluent subjects (left) and stutterers (right). The word and question mark onsets are indicated with solid vertical lines and the mouth movement (M) and speech (S) onsets with dashed lines. The black arrowheads denote the ROIs and TOIs (see text) where the responses of stutterers and fluent speakers differed significantly from each other. The studied ROIs are illustrated on the schematic drawings of brains on the left.

 
The mean source waveforms of fluent speakers and stutterers differed from each other (see Methods) in the ROIs and TOIs, which are listed in Table 1Go. Out of these, activations in four areas and time ranges showed statistically significant differences. The stutterers had a stronger mean activation than control subjects in the left frontoparietal cortex at 100–200 ms after word onset [F(1,17) = 6.0, P < 0.03, one-way ANOVA]. On the other hand, fluent speakers but not stutterers activated the left inferior frontal cortex 200–275 ms after word onset [F(1,17) = 6.9, P < 0.02]. During speech production, 1025–1400 ms after word onset, the right frontoparietal response was significantly stronger in fluent speakers than in stutterers [F(1,17) = 10.5, P < 0.005]. Finally, the groups differed at 1100–1400 ms in the left occipitotemporal cortex [F(1,17) = 4.7, P < 0.05]; as this difference arose mainly from an apparent negative signal in two fluent speakers rather than an actual consistent activation in most stutterers it will not be discussed further.


View this table:
[in this window]
[in a new window]
 
Table 1 ROIs and TOIs where mean ± standard error of the mean source strengths differed between stutterers (S) and fluent speakers (F)
 
The individual sources contributing to the significant group differences are displayed in Fig. 5Go. The early left frontoparietal response (Fig. 5AGo) was evident in five stutterers (at least two sources in four stutterers, 10 sources in total) and in five controls (one source per subject). Note the involvement of the lateral end of the central sulcus, overlapping the typical tongue representation, in four stutterers. In fluent speakers, all frontoparietal sources were identified closer to the hand area (cf. Fig. 3Go). The left inferior frontal ROI/TOI (Fig. 5BGo) was active in seven of 10 control subjects but only in three of nine stutterers (not significant, n.s.). The right frontoparietal cortex (Fig. 5CGo), with the centre of activity on average 1 cm inferior and anterior to the hand area, was involved in seven control subjects (two sources in four subjects) but only in one stutterer (P < 0.02, Fisher's exact test).



View larger version (107K):
[in this window]
[in a new window]
 
Fig. 5 Source areas of individual subjects showing a positive non-zero signal (P < 0.001) within those ROIs and TOIs (AC, see text) where activation strengths differed significantly between fluent speakers (left) and stutterers (right). The symbols are filled with grey in that subject group which showed on average less activation in each ROI/TOI.

 
The sequence of activation in the left hemisphere was observed directly in the five fluent speakers and four stutterers who showed activation both in the inferior frontal and frontoparietal ROIs. Comparison of the latencies of the earliest peak activations in the two areas implied that the response sequence was from motor cortex to inferior frontal cortex in all four stutterers but from inferior frontal to motor cortex in four of the five fluent speakers (P < 0.05, Fisher's exact test). In the nine fluent speakers who showed activation in at least one of these ROIs, the peak latencies (mean ± standard error of the mean) of the left inferior frontal and frontoparietal responses were 230 ± 27 ms and 348 ± 58 ms and in the eight stutterers 377 ± 44 ms and 197 ± 32 ms, respectively. A 2 x 2 mixed-model ANOVA resulted in a significant area-by-group interaction [F(1,7) = 10.4, P < 0.02], indicating a reversed order of activation in the two groups.

Activity time-locked to mouth movement onset
The MEG signals were also averaged with respect to mouth movement onset, which was expected to emphasize activation patterns directly related to motor output. As is evident in Fig. 6Go, this procedure gave the most prominent source clusters in the bilateral frontoparietal subregions (white circles), with some sources also in other areas which had been identified from the responses averaged with respect to word onset (Fig. 3Go). In the left hemisphere, the sources extended to the lateral end of the central sulcus whereas in the right hemisphere the active areas concentrated closer to the hand area.



View larger version (105K):
[in this window]
[in a new window]
 
Fig. 6 Source areas identified from the responses averaged with respect to mouth movement onset in fluent speakers (left) and stutterers (right). The conspicuous clusters in the frontoparietal subregions are plotted as white circles. The other sources are indicated with grey triangles.

 
Figure 7Go depicts the mean ± standard error of the mean source waveforms, averaged with respect to mouth movement onset, in the left and right frontoparietal ROIs in fluent subjects and stutterers. The activation increased slowly prior to movement onset bilaterally in the fluent speakers but was left-hemisphere dominant in the stutterers. The activation reached maximum at 69 ± 15 ms and 45 ± 9 ms in the left and right hemispheres of the fluent speakers and at 43 ± 12 ms and 65 ± 12 ms, respectively, in the stutterers (n.s. by hemisphere and group). When the difference between the right- and left-hemisphere waveforms was calculated in each individual, the stutterers showed significant left-hemisphere dominance, as compared with fluent speakers, at –550 to –450 ms [F(1,14) = 5.5, P < 0.04] and –250 to +25 ms [F(1,14) = 4.8, P < 0.05] with respect to mouth movement onset.



View larger version (41K):
[in this window]
[in a new window]
 
Fig. 7 Mean ± SEM (black curves and shading) source waveforms in the left and right frontoparietal subregions in fluent speakers (left) and stutterers (right) when the MEG signals were averaged with respect to mouth movement onset. Each division in the horizontal axis corresponds to 200 ms.

 
Responses averaged both with respect to word onset and mouth movement onset thus demonstrated a reduced time-locked activation in the right frontoparietal ROI of stutterers.

Modulation of cortical rhythms
Figure 8Go illustrates task-related modulation of 20-Hz activity in one fluent speaker (C7) during a 15 s interval. Rhythmic activity over the central sulcus was suppressed after word onset and remained at a low level throughout the utterance. The 20-Hz TSE curves, depicting the mean amplitude of the oscillations with respect to word onset, illustrate that the suppression lasted for about 2 s and was concentrated over the central sulcus, with left-hemisphere dominance in this subject. Both the 10- and 20-Hz oscillations were suppressed by word onset and speech production in most subjects.



View larger version (33K):
[in this window]
[in a new window]
 
Fig. 8 (A) A 15-s interval of non-averaged MEG signals of subject C7, recorded approximately over the mouth and hand areas along the left central sulcus (solid and dashed rectangles, respectively), with the simultaneous mouth EMG and microphone records. The MEG signals were filtered through 22–28 Hz. Word onsets are indicated with dashed vertical lines. (B) The corresponding whole-head 20-Hz TSE curves, plotted from 1 s before to 5 s after word onset, and illustrating the local suppression of cortical rhythms over the central sulcus.

 
Generators of cortical rhythmic activity concentrated in two distinct loci along the central sulcus bilaterally, in the hand area and about 2 cm inferior to it along the central sulcus, approximately in the mouth area. As usual, cortex around the parieto-occipital sulcus produced rhythmic activity as well, with the dominant component around 10 Hz (Salmelin and Hari, 1994aGo; Hari and Salmelin, 1997Go). These five sources were employed to model the non-averaged MEG activity (see Methods). It was thus possible to compare the task-related modulation of cortical activity for specific cortical areas, rather than MEG sensor outputs (cf. Fig. 8Go).

Figure 9Go illustrates modulation of 20-Hz activity in the left and right mouth and hand areas and around the parieto-occipital sulcus over a 6 s interval (including a 1 s prestimulus baseline), averaged over all fluent speakers and stutterers. Rhythmic activity in the left and right hand and mouth areas was modulated in nine controls and seven stutterers (P < 0.05, exceeding 2 SD of the baseline variation); in the other subjects, one of the four areas did not show significant 20-Hz modulation. Around the parieto-occipital sulcus, 20-Hz variation was evident in six controls and eight stutterers.



View larger version (43K):
[in this window]
[in a new window]
 
Fig. 9 TSE curves of sources in the bilateral mouth and hand areas and around the parieto-occipital sulcus (see the schematic drawings of brains) across fluent speakers (black curves) and stutterers (grey curves), from 1 s before to 5 s after word presentation. Source strengths are given in nanoamperemetres (nAm). The question mark onset is indicated with a solid vertical line at 0.8 s. Mouth movement (M) and speech (S) onsets are marked with dashed vertical lines.

 
The time behaviour of 20-Hz suppression in both subject groups is collected in Table 2Go. The suppression onset, peak, and end latencies were compared across hemispheres (left and right) and areas (hand and mouth; 2 x 2 x 2 mixed-model ANOVA with two subject groups). The parieto-occipital 20-Hz activity, tested separately, did not show group differences.


View this table:
[in this window]
[in a new window]
 
Table 2 Timing of the 20-Hz suppression in fluent speakers and stutterers in the hand and mouth areas of the left and right hemispheres
 
In the mouth area, suppression of 20-Hz activity began 100–300 ms after word onset, i.e. ~700 ms before mouth EMG onset (Table 2Go). Onset latencies in the left and right mouth areas correlated with each other (fluent speakers r = 0.69, P < 0.03; stutterers r = 0.70, P = 0.051). The onset was earlier in the left than the right hemisphere in the fluent speakers but earlier in the right than the left hemisphere in the stutterers, in both the hand and mouth areas [hemisphere-by-group interaction F(1,15) = 4.8, P < 0.05]. Furthermore, the suppression started significantly earlier [F(1,15) = 14.0, P < 0.002] in the mouth than hand area in both hemispheres and subject groups. The onset latencies in the hand areas did not correlate with those in the mouth areas. The 20-Hz suppression onset in the hand area tended to coincide with the appearance of the question mark serving as vocalization prompt (at 800 ms after word onset). The 20-Hz suppression onset or peak latencies did not correlate with mouth movement onset in either group, nor with the severity of stuttering.

For amplitude comparison, the TSE curves were integrated between –1000 ms and –200 ms to estimate the base level, between 200 and 2200 ms after word presentation to quantify the suppression, and between 2200 and 3200 ms to cover the post-suppression rebound above the base level which was detected in the left mouth area of some stutterers (Fig. 9Go). The base levels, suppressions, and rebounds were compared across hemispheres (left and right) and areas (hand and mouth; 2 x 2 x 2 mixed-model ANOVA with two subject groups).

The base levels did not vary between groups. However, the interhemispheric balance of suppression was significantly different in stutterers and controls [hemisphere-by-group interaction F(1,17) = 4.5, P < 0.05]. Comparison of the difference between right- and left-hemisphere suppressions in the hand and mouth areas (2 x 2 mixed-model ANOVA for area x group) indicated that the interaction was due to suppression in the mouth area, which was stronger in the right than left hemisphere in stutterers but stronger in the left than right hemisphere in the fluent speakers [F(1,17) = 7.0, P < 0.02, planned contrast]. The hemispheric imbalance of suppression tended to co-vary with severity of stuttering (right minus left; {rho} = –0.59, P = 0.09, Spearman rank correlation, corrected for ties). A comparison of the differences between right- and left-hemisphere post-suppression amplitudes indicated a stronger rebound of 20-Hz activity in the left than right hemisphere mouth area in stutterers but not in fluent speakers [F(1,17) = 7.1, P < 0.02, planned contrast]. The hand area rhythms did not show significant group differences.

The opposite interhemispheric balance of 20-Hz suppression in the mouth area in fluent speakers and stutterers is also evident in Fig. 10AGo which illustrates the reductions as percent of the base level. The 20-Hz suppression was significantly stronger in the mouth than in the hand area [F(1,17) = 12.1, P < 0.003]. In controls, the difference in attenuation was pronounced both in the left and right hemisphere (P < 0.02, paired two-tailed t-test). However, in the stutterers the hand and mouth area suppressions did not differ significantly.



View larger version (40K):
[in this window]
[in a new window]
 
Fig. 10 (A) Suppression of 20-Hz activity in the left- and right-hemisphere mouth (M, black bars in fluent speakers, grey bars in stutterers) and hand areas (H, white bars), plotted as percent of the prestimulus baselevel. (B) Mean source locations (black dots in fluent speakers, grey dots in stutterers, dimensions denote the SEMs) and orientations of current flow (tails) of the dipolar sources representing clusters of rhythmic activity in the hand and mouth areas. The grey curve indicates the approximate course of the central sulcus. The origin is set to the hand area as determined from the finger movement task.

 
The locations of the dipolar sources representing the generators of the cortical rhythms in the hand and mouth areas are plotted in Fig. 10BGo. The hand area rhythms originated within 4 mm of the hand sensorimotor cortex, as identified in the finger movement task, similarly in both hemispheres; the hand area coordinates did not differ between groups. The mouth area rhythms originated about 20 mm inferior and 10 mm anterior to the hand area, along the central sulcus, in agreement with lip representation determined in previous intracranial and MEG recordings (Penfield and Rasmussen, 1957Go; Woolsey et al., 1979Go; Hari et al., 1993Go; Nakamura et al., 1998Go). In the left hemisphere, the mouth area sources were located on average 6 mm more inferiorly in stutterers than controls [F(1,17) = 4.8, P < 0.05, planned contrast, 2 x 2 mixed-model ANOVA for hemisphere x group] and, accordingly, the distance between the hand and mouth 20-Hz generators was significantly longer in stutterers than controls [30 versus 22 mm; F(1,17) = 5.5, P < 0.04, planned contrast].

The 10-Hz modulation was highly variable across subjects. The suppression was most salient in the parieto-occipital cortex. Modulation depths in both the hand and mouth areas were small (0.8–1.5 nAm or 6–12% of base level) and did not differ from each other. Like in the 20-Hz range, the 10-Hz suppression started earlier in the mouth than in the hand area [F(1,14) = 6.9, P < 0.02].

In both groups, reading words aloud was thus associated with pronounced modulation of the 20-Hz activity, particularly in the bilateral mouth areas. The suppression started well before mouth movement, and was correlated across hemispheres. Hand area suppression coincided with the vocalization prompt. The attenuation of rhythmic activity was stronger and earlier in the left hemisphere of fluent speakers but in the right hemisphere of stutterers. In fluent speakers, the modulation concentrated in the mouth area whereas in stutterers both hand and mouth areas were markedly engaged in speech production.


    Discussion
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Neuroimaging studies of reading
Reading familiar real words aloud is supposed to proceed via visual feature analysis, pre-lexical letter detection, and word-level visual processing to activation of the semantic and phonological representations, and finally to activation of the phoneme and articulation system to produce speech (Coltheart et al., 1993Go). Interaction between the adjacent processing stages is believed to take place at all levels. Whether these processing stages and their apparently modular structure, derived largely from studies on subjects with acquired language impairments, have actual counterparts in the organization of language in a healthy human brain is not clear. There is a general consensus of the involvement of at least the left-hemisphere perisylvian areas in reading, including the classical inferior frontal Broca's and posterior superior temporal Wernicke's areas, both from lesion and functional neuroimaging studies but the suggested correspondence between cortical areas and different subprocesses vary considerably according to the paradigm used (see, for example Poeppel, 1996).

Fiez and Petersen recently reviewed nine PET and fMRI data sets of reading aloud single words and reported consistent activations in the lower part of the motor cortex bilaterally [Brodmann area (BA) 4], supplementary motor area (BA 6), the inferior frontal cortex (Broca's area, BA 44/45/insula), the superior temporal cortex bilaterally (BA 21/22/24, including Wernicke's area on the left), left inferior occipitotemporal border (BA 37/19), left lateral occipital cortex (BA 18/19) and cerebellum (Fiez and Petersen, 1998Go). The possible functions were suggested to be visual analysis specific to word-like stimuli in the left inferior occipitotemporal region, semantic analysis near the border of the superior and middle temporal gyri, acoustically based phonological analysis in the left posterior temporal regions, articulatory based phonological analysis in the left inferior frontal cortex and insula, motoric aspects of language function in the motor cortex and supplementary motor area, and monitoring of one's own voice in the bilateral auditory cortices. These interpretations were supported by both haemodynamic and lesion data.

On the other hand, Price, in her review of recent PET and fMRI studies on word comprehension and production, again paralleled by lesion data, concluded that semantic processing involves the left inferior frontal cortex (BA 47, anterior to Broca's area), the left temporal pole (BA 20/28/38), and the posterior temporoparietal (BA 39, angular gyrus) cortex (Price, 1998Go). Phonological retrieval was associated with activation of the left basal occipitotemporal border (BA 37/19) and the left frontal operculum (BA 44/45/insula). The left supramarginal gyrus (BA 40) was proposed to have a specialized role in converting the orthographic form of the word to the corresponding phonological representation. The two interpretations thus seem to vary most in the functional neuroanatomy of semantic processing (superior middle versus inferior anterior temporal lobe) and in the role of the left basal occipitotemporal cortex (visual analysis of word-like stimuli versus phonological processing).

The combined timing and localization obtained from MEG and intracranial recordings may shed some light on the possible roles of the different cortical areas. Letter-string specific neuronal responses, maximum at about 200 ms after stimulus presentation, were identified in the fusiform gyrus during intracranial recordings (Nobre et al., 1994Go). In MEG studies of word recognition, similar responses peaking 150–200 ms after word onset have been identified in the left inferior occipitotemporal cortex in fluent readers (Salmelin et al., 1996Go; Kuriki et al., 1998Go; Tarkiainen et al., 1999Go) but not in developmentally dyslexic subjects (Salmelin et al., 1996Go; Helenius et al., 1999Go). This early response may reflect an interface process which detects letter strings and conveys them from visual to language domain (Tarkiainen et al., 1999Go). Therefore, the role of this region, apparently corresponding to BA37/19 above, might be interpreted differently depending on the paradigm and the image subtractions used in the PET and fMRI studies.

By showing semantically constrained sentences, word by word, and varying the congruency of the final word in the sentence context, one can extract a response the strength of which increases for increasingly inappropriate sentence-ending words (N400 paradigm; Kutas and Hillyard, 1984). MEG studies have indicated that, based on this approach, the middle superior temporal cortex is involved in word and sentence comprehension 200–600 ms after word onset, with clear left-hemisphere dominance (Simos et al., 1997Go; Helenius et al., 1998Go). fMRI studies of sentence reading have also emphasized the role of the superior temporal cortex in reading comprehension (Just et al., 1996Go; Bavelier et al., 1997Go). According to intracranial recordings, a semantic N400 response is also generated in the medial temporal structures and the temporal pole (Halgren et al., 1994Go; McCarthy et al., 1995Go; Nobre and McCarthy, 1995Go). Thus, MEG data would agree with the interpretation of Fiez and Petersen (Fiez and Petersen, 1998Go), whereas intracranial recordings in surgical patients would also support that of Price (Price, 1998Go).

Although mouth muscle and tongue movements tend to cause strong electric disturbances, MEG has been successful in imaging cortical activations associated with speech production. Both preparatory motor activity around the mouth and tongue areas (Sasaki et al., 1995Go; Kuriki et al., 1999Go) and activation of supplementary motor area and Broca's area in a picture naming task (Salmelin et al., 1994Go) have been reported. To our knowledge there are no previous MEG studies of reading words aloud.

Neuroanatomy of reading in the present data set
In this study, we employed a delayed response paradigm which was successful in postponing the speech-related artefact, thus allowing us to image the spatiotemporal dynamics of word perception and preparation for overt output in a slightly extended time window. On the other hand, this somewhat artificial procedure could enhance a short-term memory component and promote verbal rehearsal of the word during the 800 ms delay from word onset to the vocalization prompt. The delay is, however, so brief that the subjects did not report any need to actively memorize the word during the process.

The overall spatiotemporal activation patterns in fluent speakers and stutterers agreed reasonably well with those in previous PET, fMRI and MEG studies reviewed above: (i) occipital and parieto-occipital responses starting at 100–150 ms after word onset (and continuing until the vocalization prompt), presumably involved in visual analysis; (ii) left and right inferior occipitotemporal clusters at 150–200 ms, with the left-sided sources probably reflecting letter-string specific analysis; (iii) left inferior frontal cortex (200–600 ms) and some activation in the homologous right-hemisphere locus, apparently reflecting articulatory aspects of phonological processing; (iv) left middle superior temporal cortex at 200–600 ms, as a signature of semantic activation; (v) left and right posterior parietal cortices (200–800 ms), possibly associated with phonological aspects of linguistic processing (left) or attentional aspects of visual perception (right; Mesulam, 1981Go; Nobre et al., 1997Go; Corbetta, 1998Go); and (vi) left and right motor and premotor cortices and supplementary motor area from 200 ms onwards, involved in motor preparation for oral output and actual vocalization.

The behavioural measures, i.e. mouth movement or speech onset latencies, did not differ between the groups. However, the cortical responses differed significantly in the left inferior frontal cortex and in the motor and dorsal premotor cortex bilaterally. Within 400 ms after seeing the word, the activation proceeded from left inferior frontal to motor cortex in the fluent speakers. The sequence was reversed in the stutterers who showed an abnormally early left motor/premotor response, followed by a delayed left inferior frontal activation. The exceptionally early motor cortical activation in stutterers may be reflected in the tendency to earlier initiation of mouth muscle activity in stutterers than in fluent speakers. During vocalization, 20 Hz activity was bilaterally suppressed in the mouth areas, with slight left-hemisphere dominance in fluent speakers and right-hemisphere dominance in stutterers. In stutterers, 20-Hz activity was also strongly suppressed in the hand areas. Moreover, stutterers failed to show a pronounced time-locked response in the right frontal cortex which was evident in the fluent speakers throughout vocalization. Below, we will discuss these differences in detail and relate the present findings with functional imaging and lesion data on fluent and dysfluent reading.

Differences in time-locked activity after seeing the word
Activation of the left inferior frontal cortex, ventral portion of BA44/45 and extending to the insula, has been reported in PET and fMRI studies of vocalized word reading (see Fiez and Petersen, 1998), verbal working memory (Paulesu et al., 1993Go; Fiez et al., 1996Go), memory for pitch (Zatorre et al., 1994Go), auditory and phonological processing (Fiez et al., 1995Go; Fiez, 1997Go; Gandour et al., 1998Go), and verbal fluency (Paulesu et al., 1997Go), and interpreted to reflect high-level articulatory encoding and involvement of a subvocal rehearsal system. Pugh and colleagues and Fiez and Petersen have proposed that the left frontal operculum contributes to the process of orthographic-to-phonological transformation from word to sound (Pugh et al., 1996Go; Fiez and Petersen, 1998Go). The timing in the present study is in line with these interpretations, as the articulatory activation in the left inferior frontal cortex started approximately at the same time as the semantic activation in the left superior temporal cortex, and both were preceded by the left posterior letter-string specific response. Naturally, our delayed reading paradigm could have evoked the subvocal rehearsal system as well. The processes listed above are all mutually intertwined and, for the present, we associate the left inferior frontal activation with articulatory encoding.

A severe disorder of articulation, without language disabilities, has been reported from focal lesions in Broca's area (Mohr et al., 1978Go; Schiff et al., 1983Go) and in the lower part of the left precentral gyrus (Tonkonogy and Goodglass, 1981Go; Schiff et al., 1983Go; Mori et al., 1989Go). More recently, Dronkers identified a discrete region in the left precentral gyrus of the insula, damage to which was consistently associated with articulatory planning deficits (Dronkers, 1996Go). Broca's aphasics may also show defective laryngeal control (Blumstein, 1995Go), a disturbance implicated in developmental stutterers as well (Freeman and Ushijima, 1978Go). Moreover, acquired stuttering, although in many ways behaviourally different from developmental stuttering (Koller, 1983Go), has been observed after lesions to Broca's area and to the lower third of the premotor cortex (Tonkonogy and Goodglass, 1981Go; Freedman et al., 1984Go).

The cortical dynamics in our fluent speakers thus