Abstract
Exposure to an acute stressor can lead to unreliable remembrance of intrinsically neutral information, as exemplified by low reliability of eyewitness memories, which stands in contrast with enhanced memory for the stressful incident itself. Stress-sensitive neuromodulators (e.g., catecholamines) are believed to cause this low reliability by altering neurocognitive processes underlying memory formation. Using event-related functional magnetic resonance imaging, we investigated neural activity during memory formation in 44 young, healthy human participants while incidentally encoding emotionally neutral, complex scenes embedded in either a stressful or neutral context. We recorded event-related pupil dilation responses as an indirect index of phasic noradrenergic activity. Autonomic, endocrine, and psychological measures were acquired to validate stress manipulation. Acute stress during encoding led to a more liberal response bias (more hits and false alarms) when testing memory for the scenes 24 h later. The strength of this bias correlated negatively with pupil dilation responses and positively with stress-induced heart rate increases at encoding. Acute stress, moreover, reduced subsequent memory effects (SMEs; items later remembered vs forgotten) in hippocampus and midbrain, and in pupil dilation responses. The diminished SMEs indicate reduced selectivity and specificity in mnemonic processing during memory formation. This is in line with a model in which stress-induced catecholaminergic hyperactivation alters phasic neuromodulatory signaling in memory-related circuits, resulting in generalized (gist-based) processing at the cost of specificity. Thus, one may speculate that loss of specificity may yield less discrete memory representations at time of encoding, thereby causing a more liberal response bias when probing these memories.
Introduction
Stress has a powerful impact on learning and memory. Whereas exposure to an acute stressor can enhance emotional aspects of episodic memories, memory for intrinsically neutral information encountered within the same context becomes unreliable (Payne et al., 2006). For instance, eyewitness testimony often proves untrustworthy (Loftus, 1979), and studies have shown increased false positives under stress (Payne et al., 2002). Psychological views hold that these phenomena result from a stress-induced shift toward generalized processing, such as extracting central thematic information (gist) at the cost of specificity (Payne et al., 2002). Such a shift has been related to stress-induced alterations in neurocognitive processes underlying memory (Christianson, 1992) through hyperactivation of stress-sensitive neuromodulatory (e.g., catecholaminergic) systems (Aston-Jones and Cohen, 2005). However, it remains open how acute stress affects the neural basis underlying memory formation, particularly for neutral information encountered in a stressful context.
The formation of new memories, supported by the hippocampus, among other regions, is modulated by noradrenergic and dopaminergic systems in the midbrain (Lisman and Grace, 2005; Sara, 2009). In an attentive state, these systems facilitate hippocampal functioning through increased stimulus-related phasic firing patterns, thereby promoting the formation of clean, discrete memory representations. In agreement with this notion, neuroimaging studies have found robust subsequent memory effects (SMEs; stronger phasic neural responses for later remembered than forgotten items) in hippocampus and in midbrain regions where catecholaminergic nuclei are located (Schott et al., 2006; Shohamy and Wagner, 2008).
Exposure to an acute stressor leads not only to activation of the sympathetic nervous system (SNS) and the hypothalamic-pituitary-adrenal (HPA) axis, but also to hyperactivation of central catecholaminergic systems (de Kloet et al., 2005). For instance, the locus ceruleus-centered norepinephrine (LC-NE) system has been shown to shift into a mode with tonically elevated background activity, but diminished stimulus-related phasic firing (Valentino and Van Bockstaele, 2008). Neural responses to sensory stimuli consequently become less selective and more generalized, creating a hypervigilant state (Aston-Jones and Cohen, 2005). In humans, it has been shown that stimulus-related phasic pupil responses, which closely parallel LC-NE activity (Bradley et al., 2008; Gilzenrat et al., 2010), are diminished under acute stress (Henckens et al., 2009). Thus, one may conjecture that stress-induced catecholaminergic activity shifts neuronal processing into a mode with high tonic but reduced phasic responses, resulting in altered neural activity and pupil responses related to memory formation.
To investigate this, 44 young healthy participants underwent event-related functional magnetic resonance imaging (fMRI) and continuous pupillometry during incidental encoding of complex scenes depicting meaningful, emotionally neutral activities. Acute stress was experimentally induced in half of the participants by showing aversive movie clips with self-referencing instructions, in conjunction with continuous threat of electrical shock. Twenty-four hours later, a surprise memory test was performed using short, written descriptions of studied scenes. Subsequent memory analyses for fMRI and pupil data were used to localize and quantify SMEs reflecting phasic neural and pupil responses associated with memory formation. We predicted that acute stress would alter SMEs in memory-related neural circuits and in pupil dilation responses.
Materials and Methods
Participants
Forty-four young, healthy, male volunteers (aged 19–36 years) with normal or corrected-to-normal vision participated in this study. Participants reported no history of neurological, psychiatric, or endocrine disease, no current use of any psychoactive drugs or corticosteroids, and no habit of watching violent movies or playing violent video games. None of them had experienced severe physical or emotional trauma. To avoid gender differences (Kudielka and Kirschbaum, 2005) and menstrual cycle effects (Ossewaarde et al., 2010) in stress responsiveness, only men were included. Ethical approval was obtained from the local institutional review board (CMO Region Arnhem-Nijmegen, The Netherlands) and all participants gave written informed consent before the experiment.
Participants were randomly assigned to either the stress induction (n = 22; aged 21.65 ± 3.73 years) or the control group (n = 22; aged 22.71 ± 4.01 years). There was no difference in age between the two groups (t < 1). Data from four participants were excluded from further analyses (two in each group) due to either poor memory performance or excessive head movement during scanning.
General procedure
The experiment was performed between 2:00 and 8:00 P.M. to ensure relatively stable and low levels of endogenous cortisol. To reduce anticipation of stress induction for participants in the neutral group, they were told which of the two experimental groups they were assigned to 1 d before the experiment. Participants were restricted from food or drink intake at least 2 h before the experiment. After arrival, 1.5 h before entering the MR scanner, they completed personality questionnaires, trained in the memory encoding task, and prepared for heart rate (HR) measurement. The actual fMRI experiment consisted of three runs of an incidental encoding task that were fully embedded in either a continuously stressful or an emotionally neutral control context. The three encoding runs were separated by four short movie clips to boost stress induction. The experiment ended with a structural scan (Fig. 1).
Stress induction
Acute psychological stress was induced by showing strongly aversive movie clips with a self-referencing instruction in the scanner (Qin et al., 2009; van Marle et al., 2009), combined with continuous threat of (mild) electrical shock (Hermans et al., 2006). Stress-induction movie clips contained scenes with extreme male-to-male aggressive behavior and violence in front of a crowd, selected from a commercial movie (Irreversible, 2002, by Gaspar Noé). After short introductory texts, participants were asked to constantly and attentively view the clips and project themselves into the scene from an eye-witness perspective, thus attempting to involve them maximally in the experience. Participants were told that they would receive a random number of shocks during the encoding session, and that subsequent shocks would increase in strength and latency. To visualize the increase in the strength of shocks, an analog scale indicating a gradually increasing intensity of electrical shocks was always presented at the bottom of the screen. In reality, only two mild electrical shocks were given at fixed time points: one during the last three encoding trials in the first encoding block, and one during the third movie clip. The electrical shocks, generated by a 9 V battery-operated device (Tens Elpha 2000; Danmeter), were delivered transcutaneously over the volunteers' left index and middle fingertips using Ag/AgCl electrodes.
In the control group, participants watched equally long movie clips selected from another movie (Comment j'ai tué mon père, 2001, by Anne Fontaine), and no shocks were given. Matching for audiovisual characteristics was performed in a separate pilot study by selecting aversive and neutral movie clips out of a set of candidate clips. Two types of movie clips best matched on the following measures: presence of faces in the foreground, presence of background actors, amount of distinct camera movements, and percentage of time the camera was moving (Hermans et al., 2011). At the end of the experiment, participants were debriefed about the stress induction procedure.
It should be noted that the present stress induction method closely corresponds with the determinants of the human stress response described by Mason (1968), that is, unpredictability, novelty, and uncontrollability.
Stimuli
Initially, 200 color photographs of complex scenes, depicting distinct meaningful activities, were carefully selected from a commercially available image database. For each picture, we created a unique, one-sentence description of 5–10 words describing the central meaning (gist) of the scene based on the following two criteria: the sentence should be sufficient to identify the photograph and to distinguish it from the other scenes (Adolphs et al., 2001, 2005). Another set of sentences (based on different photographs not used in this study) were created as lures for the memory test. The clarity and distinctness of these short one-sentence written descriptions of pictures was assured in a separate pilot study, in which 10 additional participants judged whether each description sufficiently described the corresponding picture, and unclear descriptions were excluded. The final stimulus set used for the fMRI experiment consisted of 150 scenes (plus 10 extra ones for the training set) with unique, one-sentence descriptions. Luminance of all selected pictures was equalized to ensure reliable pupil dilation measurements.
Encoding task and memory test
The experiment consisted of a study phase on Day 1 (Fig. 1) and a surprise memory test on the consecutive day (Day 2). During the study phase, participants were scanned while performing an incidental encoding task on 150 sequentially and centrally presented photographs (presentation time: 6 s; mean inter trial interval: 6 s, randomly varying from 3 to 9 s). Participants were instructed to imagine themselves being in the scene as vividly as possible, and to make a judgment of how much they would like to be present in that scene on a four-point scale (i.e., from “would like it very much” to “would not like it at all”). To ensure that participants did not expect any subsequent memory test, they were told that they were participating in a study on mental imagery with brain imaging and pupillometry. The study phase was divided into three runs of 50 trials each. Within each run, 10 null events of 6 s duration were intermixed to minimize potential effects of expectation and to optimize contrast-to-noise ratio of event-related fMRI signal. Each run lasted ∼12 min. To familiarize participants with the procedure beforehand, they were trained twice using 10 trials that were not used in the actual experiment.
Approximately at the same time on the consecutive day (∼24 h later, when stress had subsided), participants came back and performed a surprise memory test. Before the start of the memory test, participants were asked whether they had expected such a test while being scanned the day before. None had expected a memory test. After the test, the experimenter debriefed the participants by explaining the rationale of the surprise memory test. The memory test consisted of 150 written descriptions of studied scenes that were randomly intermixed with 150 descriptions of scenes that were not studied before. Participants were asked to judge whether each description was associated with a scene studied before or not (i.e., “Yes” or “No”) and give a confidence rating on a visual analog scale ranging from 0% to 100% by moving the cursor via a mouse movement to the appropriate position (Qin et al., 2011). This task was self-paced with a trial duration limited to a range between 2 and 8 s.
It should be noted that we used a memory test for identifying written descriptions of studied scenes rather than a conventional recognition memory test for the following three reasons. First, this paradigm has been shown to assess memory for the gist of an episode (Adolphs et al., 2001, 2005; Qin et al., 2011). Thus, it allowed us to more closely examine the hypothesis of how acute stress leads to a shift of mnemonic processing toward extracting central thematic information, or gist. Second, unlike conventional recognition paradigms, which are dominated by familiarity-based recognition, identification of written descriptions of studied complex scenes without actual perceptual support is most likely more based on recollection. Finally, our paradigm more closely resembles real-life phenomena of eye-witness reports, which typically involve recollection rather than recognition, and therefore optimizes ecological validity.
To accommodate variability of their distribution, confidence ratings of each participant were grouped into four bins with increasing ratings. These four bins corresponded with equal lengths on the visual analog scale in terms of each participant's own distribution of responses. To probe neural correlates truly related to successful memory formation, the fMRI data analysis only focused on trials within the highest of the four confidence level categories compared with trials that were forgotten (i.e., all misses; Table 1). Trials remembered with lower levels of confidence were modeled using a regressor of no interest and were excluded from further analyses. We opted for this method to minimize heterogeneity of memory strength that occurs when trials remembered with different levels of confidence are grouped together. Such an approach has been shown to improve statistical power for identifying neural correlates of successful memory formation (Wagner et al., 1998; Paller and Wagner, 2002; Ranganath et al., 2004; Shrager et al., 2008).
Physiological and psychological measurements of stress
To monitor the HPA axis response, saliva samples were collected using salivette collection devices (Sarstedt). Salivary sampling consisted of six measurements on Day 1 at −60, −45, +15, +30, +50, and +65 min relative to the start of the stress induction procedure. Two saliva samples were collected on Day 2 around memory testing and at time points approximately corresponding with the saliva samples taken at time points +15 and +65. All samples were stored at −20°C until analysis. Samples were prepared for biochemical analysis by centrifuging at 3000 rpm for 5 min (in the Department of Biopsychology, Trier University Dresden, Germany), which resulted in a clear supernatant of low viscosity. Salivary-free cortisol concentrations were determined using a chemiluminescence assay (CLIA; IBL) with high sensitivity of 0.16 ng/ml. Cortisol data from one participant in the control group were excluded from further analyses because of unexpected residues in the first two samples which precluded reliable cortisol assays.
To increase statistical power for analyses of stress responses, cortisol levels on Day 1 were corrected for baseline differences. We did not use saliva samples taken before stress induction on Day 1 for this baseline correction, because participants were informed in advance to which group they were assigned and thus group differences due to anticipation may have been present in these samples. Instead, after carefully verifying that no remaining group differences were present at this time, we used (averaged) saliva samples from Day 2 for the baseline correction. To incorporate stress-induced changes in cortisol levels measured over time, area under the curve (AUC) with respect to the baseline (obtained from Day 2) was calculated for cortisol levels at all time points after stress induction (i.e., +15, +30, +50, and +65 min).
To assess activation of the autonomic nervous system, HR and pupil dilation responses were recorded continuously throughout MRI scanning. HR was recorded by using an MR-compatible BrainAmp MR plus ECG recording system (Brain Products). Eye tracking was performed using an MR-compatible eye-tracking device (MEye Track-LR camera unit; SensoMotoric Instruments) to assess pupil dilation responses. Eye-tracking was also used to confirm attentive viewing of movie clips and pictures. Offline analysis for HR included artifact correction and peak detection using Brain Vision Analyzer (Version 1.05) and calculation of HR frequency in Matlab 7.5. Eye pupil data were also analyzed using in-house software implemented in Matlab 7.5, which was based on methods described previously (Siegle et al., 2003; Henckens et al., 2009). Eyeblink artifacts were identified as eye pupil changes occurring too rapidly to represent actual dilation. Blinks were removed from the signal using linear interpolation. The time course of pupil diameter changes (reflecting phasic pupil dilation response) for each trial was normalized to the average 1 s prestimulus onset baseline. The averaged baseline-corrected pupil diameter within a 2–4 s window during picture presentation was used as response measure. Data from four subjects (two from each group) were excluded from further analyses due to excessive signal artifacts.
In addition, subjective state was assessed on the first day using the positive and negative affect scale (PANAS) questionnaire (Watson et al., 1988) at time points coinciding with collection of saliva samples. Physiological data including HR, and pupil response, as well as psychological data including PANAS and memory performance, were analyzed in SPSS (16.0; SPSS) using repeated-measures ANOVAs and appropriate follow-up t tests. Alpha was set at 0.05 throughout.
fMRI data acquisition.
Whole brain T2*-weighted gradient echo EPI images with BOLD contrast were acquired with a Siemens Tim Trio 3.0T MR scanner using an ascending slice acquisition sequence (37 axial slices; TR, 2.1 s; TE, 25 ms; flip angle, 90°; slice matrix size, 64 × 64; slice thickness, 3.0 mm; slice gap, 0.3 mm; FOV, 212 × 212 mm). There were three runs of 12 min each. High-resolution structural images (1 × 1 × 1 mm) were acquired using a T1-weighted three-dimensional magnetization-prepared rapid gradient-echo sequence (TR, 2.3 s; TE, 2.96 ms, flip angle, 8°, FOV, 256 × 256 mm).
fMRI data analysis.
Image preprocessing and statistical analysis were performed using SPM5. The first five EPI volumes were discarded to allow for T1 equilibration. Remaining functional images were rigid-body motion corrected, coregistered to the corresponding T1-weighted image, corrected for slice acquisition timing, spatially normalized into a common stereotactic space, resampled into 2 mm isotropic voxels, and smoothed by convolving with an isotropic 3D Gaussian kernel (8 mm). The data were statistically analyzed using general linear models and statistical parametric mapping.
To assess transient neural activity related to memory formation, two separate regressors of interest (i.e., encoding trials later remembered with high confidence and those later forgotten; Table 1) were created based on subsequent memory performance, and then convolved with the canonical hemodynamic response function. To avoid potential confounds of group difference in memory-related response bias, the number of trials of each regressor was equalized between the two groups. Remaining trials were modeled as a single regressor of no interest. Additionally, realignment parameters were included to account for movement-related variability. The analyses included high-pass filtering using a cutoff of 1/128 Hz, global intensity normalization, and serial correlations correction using a first-order autoregressive model.
Relevant contrast parameter estimate images were initially generated at the single-subject level, and then submitted to a 2 (group) by 2 (memory) ANOVA for the second-level group analysis treating participants as a random variable. In the whole-brain exploratory analysis, results from the group analysis were initially thresholded at p < 0.001, uncorrected, and cluster size statistics were used as the test statistic. Unless otherwise specified, only clusters significant at p < 0.05, corrected (Worsley et al., 1996), are reported. A spherical search region with an 8 mm radius, centered at the peak voxel of the main effect of memory in the hippocampus, was used for small volume corrections to detect effects of stress on SMEs. To examine regional overlap between the main effect of memory and stress-by-memory interaction, an additional conjunction analysis was performed using the minimum statistic compared with the conjunction-null method in SPM5 (Nichols et al., 2005).
To further characterize patterns of hippocampal and midbrain activity related to memory formation in the two groups, we conducted regions of interest (ROI) analyses on extracted data for these two regions. The ROI in hippocampus was defined by the combination of a hippocampal anatomical mask and an orthogonal contrast (with respect to the interaction effect) associated with the main effect of memory across two groups, while the midbrain mask was defined anatomically. The hippocampal and midbrain masks were defined using the WFU PickAtlas toolbox (Maldjian et al., 2003). Beta values corresponding to conditions of interest were extracted from those ROIs by using MarsBar (Brett et al., 2002) and then subjected to further statistical tests in SPSS.
Results
Endocrine, autonomic, and psychological measurements of stress
Baseline-corrected salivary cortisol data are shown in Figure 2A. A 2-by-4 ANOVA, with group (stress vs control) as between-subject factor and time (four time points) as within-subject factor revealed main effects of stress induction (F(1,37) = 4.72, p < 0.05, with higher cortisol levels in the stress group) and time (F(3,50.22) = 4.40, p < 0.05), but no interaction (F < 1). The observed downward trend of cortisol levels over time is most likely due to diurnal rhythm and stress anticipation.
To check whether differences in anticipation of stress induction had led to group differences in cortisol before the start of the experiment, we tested for group differences in the cortisol samples obtained before the start of the experiment on Day 1. Although cortisol levels were numerically higher in the stress group (i.e., average cortisol level and SEM for the two samples before the experiment: 7.92 ± 0.79 and 7.32 ± 1.11 in the stress group, 6.93 ± 0.68 and 5.99 ± 0.81 in the control group), this difference did not reach significance (F(1,37) = 1.13 p = 0.30). Moreover, no group difference whatsoever was found in cortisol levels on Day 2 (i.e., average cortisol level and SEM for the two baseline samples: 4.63 ± 0.70 and 3.17 ± 0.50 in the stress group, 4.82 ± 0.43 and 3.02 ± 0.29 in the control group; difference: F < 1). Thus, the two experimental groups did not differ in cortisol levels at the time of retrieval.
Baseline-corrected HR (Fig. 2B) was averaged separately for three encoding runs and four movie clips. A 2 (group) by 7 (time: four movie clips, three encoding runs) ANOVA was conducted for HR data. A main effect of stress induction was found (F(1,38) = 15.97, p < 0.001), with increased HR in the stress compared with the control group. The two groups did not differ in HR (t(38) = −1.13, n.s.) at baseline.
Subjective ratings of negative affect were submitted to a 2 (group) by 4 (time) ANOVA, which revealed a main effect of stress induction (F(1,38) = 5.82, p < 0.02), with higher negative affect in the stress induction group.
Altogether, these results confirm that stress induction resulted in activation of the HPA axis and the autonomic nervous system, and led to increased negative affect.
Stress-induced changes in pupil responses related to memory formation
Averaged pupil dilation responses time-locked to scene picture onsets are shown in Figure 3 as a function of trials later remembered versus forgotten, and separately for the two groups. A 2 (group) by 2 (memory) ANOVA revealed a significant main effect of group (F(1,36) = 4.58, p < 0.05) and an interaction effect (F(1,36) = 4.23, p < 0.05) within the 2–4 s time window. The main effect of group indicates that stress induction reduced pupil dilation responses to pictures during encoding in general. Subsequent paired t tests revealed a significant effect of memory in the control (t(18) = 2.67, p < 0.01) but not in the stress induction group (t(18) < 1, n.s.), indicating that participants had larger pupil dilation responses to pictures later remembered than to those forgotten in the control condition, but this effect was diminished after stress induction.
Stress-induced changes in memory performance
Averaged memory performance for the stress and the control groups is listed separately in Table 2 and plotted in Figure 4. Accuracy in both groups was clearly above chance level (t(19) > 25.67, p < 0.001), indicating successful memory formation. Further comparison of the two groups revealed no difference in accuracy (t(38) < 1, n.s.), but did show that there was a more liberal response bias after acute stress (t(38) = 2.76, p < 0.01). In other words, acute stress at encoding increased the tendency of participants to endorse items (both studied and unstudied lure items) as having previously been studied.
Moreover, this response bias in the stress group showed a statistical trend for a positive correlation with cortisol elevation (r(38) = 0.39, p = 0.09), a positive correlation with averaged HR increases in response to the stressor (r(38) = 0.48, p < 0.05), and a negative correlation with pupil dilation responses to those scenes later remembered (r(38) = −0.67, p < 0.01). This pattern of results indicates that stress-induced activation of the HPA axis, the SNS, and the LC-NE system predicts how liberal the response bias became.
Brain activity showing main effects of stress, memory formation
First, when comparing neural activity related to the encoding of complex scenes across two groups (main effect of stress), we identified differences of activation in bilateral posterior visual cortices and several other regions, which are thought to comprise a salience processing network (Seeley et al., 2007), in limbic and paralimbic systems including insula, dorsal-anterior and middle cingulate cortex, and striatum. Interestingly, we found that cortisol elevation (i.e., AUC) correlated positively with the magnitude of activation in posterior visual cortices, middle cingulate cortex, bilateral thalamus, and insula extending into striatum in the stress group (Table 3). In line with the notion of hypervigilant processing under stress, these results indicate that acute stress led to increased activity in a posterior visual network and salience processing network during encoding of scenes. Next, we contrasted encoding trials later remembered with later forgotten (i.e., SME) across the two groups (main effect of memory) and revealed activation of an extended frontal-medial temporal lobe network (Table 4) typically engaged in successful memory formation. This network included clusters in the left inferior and medial PFC and clusters extending from the bilateral hippocampus into parahippocampal regions (Fig. 5A).
Stress-induced modulations on brain activation related to memory formation
To examine modulatory effects of stress on neural activity related to memory formation, we conducted a contrast reflecting the interaction between stress and memory by comparing trials later remembered with those later forgotten in the control group versus the stress group. This contrast revealed a large cluster in the midbrain (Fig. 6E,F), likely covering several neuromodulatory nuclei that are known to be involved in regulation of arousal, homeostatic balance, and vegetative functions (Bandler et al., 2000; Parvizi and Damasio, 2001; Joëls and Baram, 2009), and other clusters in a salience processing network (listed in Table 5). In the medial temporal lobe, we found an interaction in the left hippocampus (likely subiculum) and the right hippocampus. As shown in Figure 5, A and B, the hippocampus showed a robust SME in the control group but no reliable effect in the stress group. To ensure actual spatial overlap between this interaction and the main effect of memory, we conducted a conjunction analysis over the two contrasts, which indicated that the right hippocampus and the left subiculum indeed exhibited both the main effect of memory and the interaction effect (Fig. 5C; Table 5).
Moreover, to further characterize the pattern of stress-induced modulations of event-related or phasic hippocampal and midbrain activity, data were extracted and averaged for these two ROIs. Separate 2 (group) by 2 (memory) ANOVAs confirmed the interaction effect in the right hippocampus (F(1,38) = 4.26, p < 0.05; Fig. 6A,B) and midbrain (F(1,38) = 8.31, p < 0.01; Fig. 6C,D). Separate paired t tests revealed that there were significant SMEs in the right hippocampus (t(19) = 3.68, p < 0.01) and midbrain (t(19) = 2.57, p < 0.02) in the control group but not in the stress-induction group (both t(19) < 1.80, n.s.). Together, these results indicate that stress induction reduced SMEs in the hippocampus and midbrain, reflecting less discrimination between items later remembered and forgotten.
Discussion
The present study investigated how acute psychological stress affects memory formation for neutral information. We observed that acute stress induced a more liberal response bias (i.e., a tendency to endorse items as old) in a memory test involving identification of short written descriptions of scenes encountered 24 h earlier during stress induction. The strength of this bias correlated with stress-induced changes in cortisol, HR, and pupil dilation. Moreover, acute stress reduced SMEs during memory formation in hippocampus, midbrain, and in pupil dilation. As indicated by stress-induced elevation of cortisol and HR, our stress induction procedure resulted in a shift toward a prolonged state of increased HPA axis and SNS activity. Such states are known to be accompanied by tonic elevation of stress-sensitive catecholamines, regulated by the LC-NE system and midbrain dopaminergic nuclei, and glucocorticoids (de Kloet et al., 2005; Ulrich-Lai and Herman, 2009). We therefore discuss how elevation of these neuromodulators may account for our observed effects of stress on memory formation.
Acute stress had no effect on overall memory accuracy, but did induce a more liberal response bias. Moreover, individual measures of the amount of stress (i.e., changes in cortisol, HR, and pupil dilation responses) predicted the strength of this bias. These findings are reminiscent of earlier findings of stress-induced elevations of false memory (Payne et al., 2002, 2006, 2007). In these studies, stress induction at the time of encoding led to an increase in the number of false positives for related but novel items during memory testing. Our findings are also consistent with a large amount of literature on the limited reliability of eyewitness reports (Loftus, 1979) and observations of strong subjective remembrance in the absence of increased accuracy for emotional memory (Sharot et al., 2004; Phelps and Sharot, 2008). Notably, effects of stress on retrieval cannot readily explain our findings, because we tested memory after stress had subsided and cortisol levels did not differ anymore. Thus, the more liberal response bias observed after stress likely results from alterations in neurocognitive processing underlying memory formation.
According to influential accounts, emotional arousal or acute stress biases mnemonic operations underlying memory formation toward rapid extraction of central thematic information, or gist (Payne et al., 2002, 2006; Adolphs et al., 2005). Such a shift would have clear survival benefit over encoding details that are not of immediate importance. This alteration can be thought of as increasing the reconstructive nature of such memories: to reduce the amount of detail and focus on a central theme, one would likely need to reconstruct such episodes by reactivating and integrating prior representations of more general categories, i.e., an enhancement of gist-based encoding at the cost of specificity. As a consequence, memory representations encoded under stress may exhibit stronger overlap and thus become more generalized and less discrete, which in turn may explain the observed response bias when probing these memories.
Turning to functional neuroimaging and pupil data, our findings are in line with mechanistic accounts of stress-induced alterations in catecholaminergic neuromodulation of memory-related neural circuits. According to these models, catecholaminergic activity exhibits an inverted U-shaped relationship with efficacy of neurocognitive functioning (Aston-Jones and Cohen, 2005). Intermediate levels of catecholamines are thought to represent an optimal state characterized by selective phasic firing patterns in response to novel and/or salient stimuli. Through such phasic signaling, the hippocampus forms a functional loop with neuromodulatory midbrain nuclei that determines selective processing of novel information and entry into long-term memory (Lisman and Grace, 2005; Harley, 2007). Specifically, novelty signals generated in the hippocampus (Fernández and Tendolkar, 2006) may trigger phasic catecholaminergic activity in neuromodulatory midbrain nuclei, which in turn enhances hippocampal neuroplasticity (Lisman and Grace, 2005; Sara, 2009). Such theoretical models also predict, however, that acute stress would alter the phasic functional coupling of the hippocampus with neuromodulatory nuclei by shifting catecholaminergic systems into a state with reduced phasic, but high tonic background, activity at the right side of the inverted U-shaped curve (Aston-Jones and Cohen, 2005; Arnsten, 2009).
Four findings from the present study lend credence to these notions. First, multiple physiological measures of stress suggest that stress induction lead to tonic elevation of stress-sensitive catecholamines, and the correlation of these stress measures with response bias strength supports an association between activation of stress-sensitive neuromodulatory systems at time of encoding and alterations in memory. Second, the stress-induced enhancement of neural activity during encoding in early visual regions and several regions, which are thought to comprise a salience processing network (Seeley et al., 2007) found in this study and in our previous work (Henckens et al., 2009; Hermans et al., 2011), indicates a state of sensory hypersensitivity under stress. Third, the reduced SMEs in hippocampal, midbrain regions indicate that mnemonic activity in these regions becomes noisier and less selective. Fourth, we found similar reduced effects during memory formation in the event-related pupil dilation response (Fig. 3), a peripheral index of phasic LC-NE system activity (Gilzenrat et al., 2010). Note that the observed reduction of SMEs would also be in line with an alternative interpretation that assumes a drowsy state at the left side of the inverted U-shaped curve with low phasic and low tonic activity (Aston-Jones and Cohen, 2005). However, our multiple stress response measurements speak strongly against such an interpretation. Therefore, our behavioral, physiological, and neuroimaging findings converge to support the notion that acute stress prompts a hypervigilant state characterized by unselective hypersensitivity to sensory stimuli and noisier signaling during memory formation.
An important question is how such a putative state of hypervigilant processing and noisier signaling can lead to a more positive response bias in the absence of a loss of accuracy. According to recent findings in humans, memory generalization does not rely solely on inferential processes that take place during retrieval, but also relies on integrative and constructive processes already at the time of encoding (Shohamy and Wagner, 2008). Such processes have been associated with functional coupling of the hippocampus and midbrain: phasic dopaminergic firing in response to novel stimuli is thought to trigger reactivations of related or overlapping representations of prior memories and thus result in updating through integration of new information into existing memory (Kumaran and Duzel, 2008; Shohamy and Wagner, 2008; Shohamy et al., 2010). It is therefore possible that the overgeneralization of memory representations that results in a positive response bias is caused by stress-induced tonic activation of this circuit. Yet, memory accuracy for such generalized memory traces could still be preserved under these circumstances because elevated levels of catecholamines and glucocorticoids may generally enhance neural plasticity during encoding (Lisman and Grace, 2005; Harley, 2007) and postencoding consolidation (Joëls et al., 2006; Sara, 2009). Although future research using, for instance, pharmacological manipulations will be necessary to provide more definite answers, our findings suggest that changes in catecholaminergic signaling may play a key role in altering memory formation under stress.
Some caution is warranted when interpreting activations observed in the midbrain. Anatomically, the clusters we found correspond with midbrain sections that are known to contribute to regulating arousal and homeostatic balance in fear and acute stress (Mobbs et al., 2007; Hermans et al., 2011), which parallels our observed vegetative effects such as stress-induced changes in heart rate and pupil responses. These regions are also known as the main sources of various stress-sensitive neuromodulators that are involved in modulating hippocampal functioning underlying learning and memory (Lisman and Grace, 2005; Joëls et al., 2006; Sara, 2009). The involvement of these midbrain nuclei and the hippocampus in memory formation has been reported in many previous neuroimaging studies (Schott et al., 2004, 2006; Wittmann et al., 2005; Sterpenich et al., 2006; Shohamy and Wagner, 2008; Düzel et al., 2009). The observed reduction of SMEs in the midbrain and the hippocampus concur with the proposition that activation of stress-sensitive neuromodulatory systems alter the neural processes underlying memory formation. However, given the limitations of conventional fMRI in terms of spatial resolution and the small size of these midbrain nuclei, the precise location of these individual regions is difficult to determine with certainty. Nonetheless, it is conceivable that the observed reduction of SMEs during memory formation results from stress-induced alterations in catecholaminergic signaling, because we found related effects in heart rate and pupil responses. Future research, however, will be necessary to establish this link more directly.
In conclusion, the present study shows that acute stress during encoding results in a more liberal response bias in a memory test for remembrance of neutral information 24 h later. Moreover, acute stress diminishes SMEs in the hippocampus and midbrain, and in pupil dilation responses (reflecting noradrenergic activity) during memory formation. The reduction of SMEs during encoding and the more liberal response bias at test may be related to a chain of changes in neurocognitive processes underlying memory formation, most likely resulting from stress-induced alterations in catecholaminergic signaling in hippocampus and midbrain circuits. A similar mechanism may account for the low reliability of eyewitness memories for intrinsically neutral information encountered in a stressful context.
Footnotes
This work was supported by the Netherlands Organization for Scientific Research Grants 918.66.613, 451.07.019, and 446.10.010.
- Correspondence should be addressed to Dr. Shaozheng Qin, Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, 1070 Arastradero Road, Ste 220, Palo Alto, CA 94304. szqin{at}stanford.edu