Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jan 1;26(7):952-981.
doi: 10.1080/01690960903498424.

The DIVA model: A neural theory of speech acquisition and production

Affiliations

The DIVA model: A neural theory of speech acquisition and production

Jason A Tourville et al. Lang Cogn Process. .

Abstract

The DIVA model of speech production provides a computationally and neuroanatomically explicit account of the network of brain regions involved in speech acquisition and production. An overview of the model is provided along with descriptions of the computations performed in the different brain regions represented in the model. The latest version of the model, which contains a new right-lateralized feedback control map in ventral premotor cortex, will be described, and experimental results that motivated this new model component will be discussed. Application of the model to the study and treatment of communication disorders will also be briefly described.

PubMed Disclaimer

Figures

Figure 1
Figure 1
The DIVA model of speech acquisition and production. Recently added modules and connections are highlighted by black outlines. Model components associated with hypothesized neuroanatomical substrates. Abbreviations: GP = globus pallidus; HG = Heschl's gyrus; pIFg = posterior inferior frontal gyrus; pSTg = posterior superior temporal gyrus; Put = putamen; slCB = superior lateral cerebellum; smCB = superior medial cerebellum; SMA = supplementary motor area; SMG = supramarginal gyrus; VA = ventral anterior nucleus of the cerebellum; VL = ventral lateral nucleus of the thalamus; vMC = ventral motor cortex; vPMC = ventral premotor cortex; vSC = ventral somatosensory cortex.
Figure 2
Figure 2
Neuroanatomical mapping of the DIVA model. The location of DIVA model component sites (red dots) are plotted on renderings of the left (top) and right (bottom) lateral surfaces of the SPM2 canonical brain. Sites immediately anterior to the central sulcus (dotted line) represent cells of the model’s articulator velocity () and position (M) maps. Sites located immediately posterior to the central sulcus represent cells of the somatosensory state map (S). Subcortical sites (basal ganglia, thalamus, paravermal cerebellum, deep cerebellar nuclei), are not shown. Additional abbreviations: Au = auditory state map; ΔAu = auditory error map; FB = feedback control map; IM = initiation map; Lax.int, Lax.ext = intrinsic and extrinsic larynx, Lat Cbm = lateral cerebellum; Resp: respiratory motor cells; ΔS = somatosensory error map; SSM = speech sound map; TAu = auditory target map; TS = somatosensory target map.
Figure 3
Figure 3
Effective connectivity within the auditory feedback control network. Structural equation modeling demonstrated significant modulation of interregional interactions within the schematized network when auditory feedback was perturbed during speech production. Pair-wise comparisons of path coefficients in the normal and perturbed feedback conditions revealed significant increases in the positive weights from left posterior superior temporal gyrus (pSTg) to right pSTg (the path labeled a in the diagram above), from left pSTg to right ventral premotor cortex (PMC; path b), and from right pSTg to right inferior frontal gyrus, pars triangularis (path c) when auditory feedback was perturbed during speech production. Additional abbreviation: MC = motor cortex.
Figure 4
Figure 4
Learning in the DIVA model. Simplified DIVA model block diagrams indicate the mappings that are tuned during the two learning phases (heavy black outlines). Left: Early babbling learning phase. Pseudo-random motor commands to the articulators are associated with auditory and somatosensory feedback. The paired motor and sensory signals are used to tune synaptic projections from sensory error maps to the feedback control map. The tuned projections are then able to transform sensory error inputs into feedback-based motor commands. Right: Imitation learning phase. Auditory speech sound targets (encoded in projections from the speech sound map to the auditory target map) are initially tuned based on sample speech sounds from other speakers. These targets, somatosensory targets, and projections in the feedforward control system are tuned during attempts to imitate a learned speech sound target.

Similar articles

Cited by

References

    1. Ackermann H, Vogel M, Petersen D, Poremba M. Speech deficits in ischaemic cerebellar lesions. Journal of Neurology. 1992;239(4):223–227. - PubMed
    1. Adams RD. Principles of Neurology. New York: McGraw-Hill; 1989.
    1. Adolphs R, Damasio H, Tranel D. Neural systems for recognition of emotional prosody: a 3-D lesion study. Emotion. 2002;2(1):23–51. - PubMed
    1. Alario FX, Chainay H, Lehericy S, Cohen L. The role of the supplementary motor area (SMA) in word production. Brain Research. 2006;1076(1):129–143. - PubMed
    1. Albin RL, Young AB, Penney JB. The functional anatomy of disorders of the basal ganglia. Trends in Neurosciences. 1995;18(2):63–64. - PubMed

LinkOut - more resources