Introduction

Working memory (WM) and attention are typically considered distinct cognitive constructs, yet they frequently overlap (Awh & Jonides, 2001; Gazzaley & Nobre, 2012). For example, the active maintenance of items in visual WM can bias attention toward matching items in the environment, even at the expense of the current task goals (Soto, Hodsoll, Rotshtein, & Humphreys, 2008). However, this bias is not always observed (e.g., Woodman & Luck, 2007), generating debate about whether the coupling between WM and attention is involuntary or under strategic control. While differences in memory-based attentional guidance could be attributed to a complex set of task parameters, it is also possible that the various task paradigms that engender conflicting interpretations are actually tapping into different cognitive processes.

Studies of memory-based attentional guidance typically involve a visual search task while simultaneously maintaining a visual object in WM. The critical manipulation is whether the search target, distractors, or neither matches the stimuli in WM. In one paradigm (e.g., Soto, Heinke, Humphreys, & Blanco, 2005), participants view a colored shape WM item (e.g., a blue circle) before performing a visual search for a slanted-line target among straight-line distractors (Fig. 1A). The WM item can reappear in the search display surrounding a target (valid), distractor (invalid), or fail to reappear at all (neutral). The visual search stimuli are thus distinct from, but embedded within, the class of stimuli that could match WM. The canonical finding is that search is speeded when the memory item validly cues the target and slowed when it invalidly cues a distractor (relative to the neutral condition; see Soto et al., 2008). Even when valid trials are eliminated from this task—de-incentivizing any strategic orienting toward the WM-match—search is still slowed on invalid trials (e.g., Kiyonaga, Egner, & Soto, 2012; Olivers, 2009; Soto et al., 2005), suggesting the link between WM and visual attention is automatic (Soto et al., 2008; see also Dowd & Mitroff, 2013; Hollingworth, Matsukura, & Luck, 2013).

Fig. 1
figure 1

Trial sequences for the four behavioral tasks. (A) Binary-stimulus dual-task paradigm that combines a memory task with an intervening search task (see Soto et al., 2005); participants searched for the tilted line target among vertical or horizontal line distractors. (B) Unitary-stimulus dual-task paradigm (see Woodman & Luck, 2007); participants searched for the top or bottom gap target among right or left gap distractors. In both dual-task paradigms, the memory task tested whether the memory probe was same or different from the memory cue. (C) Visual working memory capacity task (see Luck & Vogel, 1997); participants reported whether the memory probe color was same or different from the item at that location in the sample array. (D) Automated operation span task (Unsworth et al., 2005); participants alternated between remembering letters and solving arithmetic problems

In contrast, other seemingly similar memory-based guidance tasks have produced different results. In one prominent task, participants view a colored square to hold in WM before searching among an array of colored Landolt C-like stimuli for a C whose gap is on the top or bottom (Woodman & Luck, 2007; Fig. 1B). The search target or one of the distractors could match the WM color. In some studies using this task, visual attention was not captured by the memory-matching items (e.g., Han & Kim, 2009; Woodman & Luck, 2007). In fact, when there was no possibility of WM content matching the search target (i.e., no valid trials), there was a tendency for participants to be faster when a distractor matched WM, suggesting that participants were strategically avoiding the memory-matching distractors (Arita, Carlisle, & Woodman, 2012; Woodman & Luck, 2007, Experiment 3). These experiments suggest that there is no automatic visual selection bias toward WM-matching items, and that WM content can be strategically avoided (see Carlisle & Woodman, 2011a).

Several studies have attempted to address why these superficially similar paradigms produce divergent results by focusing on specific task parameters that might underlie differences in memory-based guidance (e.g., Dalvit & Eimer, 2011; Dombrowe, Olivers, & Donk, 2010; Han & Kim, 2009; Olivers, 2009; Soto & Humphreys, 2008; Tsvetanov, Arvanitis, & Humphreys, 2012; Woodman, Luck, & Schall, 2007; Zhang et al., 2010, 2011). These investigations, however, have not coalesced into a unified picture of whether and how these factors impact memory-guided attention. Moreover, contradictory behavioral indices of memory-guided attention have been joined by contradictory electrophysiologic findings (i.e., differential modulation of the N2pc event-related potential component) from these two types of paradigms (Carlisle & Woodman, 2011b, 2013; Kumar, Soto, & Humphreys, 2009). Taken together, these findings challenge the assumption in this field that both types of paradigms probe the same underlying processes of how WM contents guide attention. Thus, one possible explanation for why two very similar paradigms engender different interpretations is that they are not measuring the same cognitive construct in the first place. This possibility has not been addressed, so the current study used an individual-differences approach to examine the relationship between performance on the two tasks. As a secondary focus, we also compare memory-based capture in each task relative to independently measured constructs of attention and WM.

Methods

Participants

Seventy-four individuals from Duke University (24 men, ages 18–27 years) participated for $10/hour or course credit. All signed informed consent in accordance with institutional guidelines. Data were excluded from six additional participants who did not complete all four experimental tasks.

Behavioral testing

Participants completed four cognitive tasks (Fig. 1): two dual-task paradigms that combined WM and visual search (see Soto et al., 2005; Woodman & Luck, 2007), a visual WM capacity task (see Luck & Vogel, 1997), and a complex WM span task (automated operation span task; Unsworth, Heitz, Schrock, & Engle, 2005). Visual WM capacity was included to reflect the strength of a WM representation (e.g., Bays & Husain, 2008), which in turn could affect how memory-matching contents in the visual field capture attention. Complex span was included as a measure of WM maintenance in the face of interference, which resembles the dual-task setup of the attentional guidance paradigms in question here. Tasks were counterbalanced across participants, but the two dual-task paradigms were never completed consecutively.

Dual-task paradigms

The two dual-task paradigms were based on designs by Soto and colleagues (Soto et al. 2005, 2008) and by Woodman & Luck (2007; also see Carlisle & Woodman, 2011a, 2011b; Han & Kim, 2009). We refer to these tasks as using “binary stimuli” (when the memory-related circles and search-related lines have separate forms; see Soto et al., 2005) and “unitary stimuli” (when the memory-related color and the search-related gap exist within the same form; see Woodman & Luck, 2007), respectively. We adapted the paradigms to have identical instructions, timing, display sizes, and stimulus colors; the contributions of these elements to memory effects on attention have been addressed by previous studies (e.g., Dombrowe et al., 2010; Olivers, 2009), and thus the current study focuses on the differences in stimulus arrangement. Importantly, the memory cue never matched the location of the search target, removing incentive to strategically look for the memory-matching search item (see Soto et al., 2005, Exp. 4).

Binary-stimulus paradigm

In the binary-stimulus dual-task paradigm (Fig. 1A), each trial began with a 500-ms fixation display, followed by a 500-ms memory cue that was a colored circle (1.3°×1.3°) centered at fixation. Participants were to remember the color of the cue for a potential future test. After a 1000-ms blank delay, a search array was presented until response (maximum presentation 2,000 ms). The search display consisted of four black lines (stroke width=0.2°, length=1.0°), each surrounded by a colored circle. Three were horizontal or vertical, whereas the target line was tilted left or right at 45°. Each search item was placed with a slight spatial jitter within randomly selected cells of an invisible 5×4 grid subtending 10.3°×8°, with no items overlapping. Each target location and orientation occurred equally often and in randomized order. Participants searched for the tilted line and reported the direction of the tilt via speeded keypress.

To prevent participants from strategically using the search array to refresh their memory of the memory item during the search task (see Woodman & Luck, 2007), memory probes were given on 20 % of trials instead of a search display. The memory probe was a colored circle (1.3°×1.3°) presented at the center of the screen beneath a question mark for up to 2,000 ms. Participants indicated whether the probe color was identical to the memory cue color via keypress. The memory cue, search circles, and memory probe were drawn randomly from five equidistant and equiluminant colors. In the search array, each color could only appear once.

Half of the search trials were neutral (memory item did not reappear) and half were invalid (memory item reappeared surrounding a distractor line). Memory probe trials were either a match (50 %) or a non-match (50 %) to the WM cue. After ten practice trials, participants completed 200 experimental trials.

Unitary-stimulus paradigm

The unitary-stimulus dual-task paradigm was identical to the binary-stimulus task in timing and structure, but employed different stimuli (Fig. 1B). Memory cues and probes were colored squares (0.65°×0.65°), and the search stimuli were outlined squares (stroke width=0.1°) with a 0.25° gap on the top, bottom, left, or right. Three of the squares had a gap on the left or right, while the target square had a gap on the top or bottom. Participants searched for the top or bottom gap and reported the orientation of the gap via speeded keypress.

Visual working memory capacity

The visual WM task (Fig. 1C) was adapted from Luck & Vogel (1997). After a 500-ms fixation display, a sample array of two, four, six, or eight colored squares (1.0°×1.0°), jittered across a grid of 15.4°×10.7°, was presented for 1,000 ms. After a 1,000-ms blank delay, a test array was presented, consisting of a single colored square in the same location as one of the sample array items. Participants reported whether the test item was identical to the sample item that had previously been shown in that location. Colors were pulled pseudorandomly from a circular spectrum of equiluminant colors. If the test item was different from the sample item, the difference was 180° in color space.

Operation span

The operation span task was administered with an automated version packaged by Unsworth et al. (2005). Participants were to remember information (sequences of unrelated letters, set sizes: 3–7) while concurrently engaging in online processing (solving arithmetic problems). On each trial participants alternated between solving math problems and memorizing single letters until a recall phase wherein they reported the letters in order (Fig. 1D). Each participant completed three sets of each set size (i.e., 15 sets). Full details of task structure and timing can be found in Unsworth et al. (2005).

Individual-differences assessments

Behavioral task data were collected along with a large battery of self-report assessments (see Appelbaum, Cain, Schroeder, Darling, & Mitroff, 2012). The current study examined two attention-related surveys: the Attention Deficit-Hyperactivity Disorder (ADHD) questionnaire (Jasper & Goldberg, 1993) and the Autism-Spectrum Quotient (ASQ; Baron-Cohen, Wheelwright, Skinner, Martin, & Clubley, 2001) with a focus on the two attention-related sub-categories of attention-switching and attention to detail. Both ADHD and autism have been linked to broader deficits in executive functions, including attentional control and WM (see Gargaro, Rinehart, Bradshaw, Tonge, & Sheppard, 2011; Pennington & Ozonoff, 1996), which could have important implications for memory-guided attention. No other questionnaire data were analyzed.Footnote 1

Results

In the dual-task paradigms, our primary measure of interest was response time (RT) in the visual search task. Specifically, our key comparison was the time to find the target in neutral versus invalid conditions. To represent a single measure of attentional capture by WM, a normalized RT difference score was calculated for each individual: (invalid RT – neutral RT) / (neutral RT), reflecting the absolute RT difference as a proportion of neutral RT. Positive values indicate high attentional capture, while negative values indicate reverse attentional capture.

Binary-stimulus paradigm

Performance on memory catch trials was nearly perfect (M=99.1 %, SD=1.7 %). Search trials were excluded for response errors (0.7 % of all search trials) and for exceeding the time limit (0.4 % of trials). RTs for correct search responses were slower for invalid trials (M=1,045 ms, SD=138 ms) than for neutral trials (M=996 ms, SD=131 ms), t(73)=10.57, p<0.001 (Fig. 2A). There was high variability across participants, with normalized RT difference scores (M=0.05) ranging from +0.16 to –0.03 (for similar reports of individual variability, see Soto, Rotshtein, & Kanai, 2014).

Fig. 2
figure 2

(A) In both the binary-stimulus and unitary-stimulus paradigms, search times were slower for invalid trials compared to neutral trials. Error bars represent 95 % confidence intervals. (B) Individual differences in normalized difference scores (i.e., the amount of attentional capture by memory-matching distractors) did not correlate between the binary-stimulus and unitary-stimulus paradigms

Unitary-stimulus paradigm

Performance on memory catch trials was nearly perfect (M=98.6 %, SD=2.7 %). Search trials were excluded for response errors (0.4 %) and for exceeding the time limit (1.0 %). Search RTs for correct responses were slower for invalid trials (M=1172 ms, SD=135 ms) than neutral trials (M=1128 ms, SD=136 ms), t(73)=7.08, p<0.001 (Fig. 2A). There was high variability in normalized RT difference scores (M=0.04) ranging from +0.15 to –0.09. This particular paradigm has revealed variable capture effects (Carlisle & Woodman, 2011b; Han & Kim, 2009), and the data here demonstrated significant positive capture, in contrast to the reverse attentional capture effect from Woodman & Luck (2007, Experiment 3).Footnote 2 Importantly though, the presence or direction of a mean capture effect in this protocol is not a key concern of the purpose of the present study, which is to assess the relationship of individual differences in performance across tasks.

Visual working memory capacity

Visual WM capacity was calculated using Cowan’s K (Pashler, 1988; Cowan, 2001): K=S(HF), where K is memory capacity, S is set size, H is observed hit rate, and F is false alarm rate. Final visual WM capacity was calculated by averaging K values for set sizes 4, 6, and 8 (Fukuda & Vogel, 2009). As in previous studies (e.g., Vogel & Machizawa, 2004), mean WM capacity for our population was 3.20 (SD=1.04), ranging from 1.05 to 5.39.

Operation span

Operation span score was calculated by summing all perfectly recalled letter sets (see Unsworth et al., 2005). The mean operation span score was 49.2 (SD=13.4), ranging from 20 to 75.

Regression analyses

Our analysis strategy was to use regression models to assess whether performance on the two dual-task paradigms was related. In a simple linear regression model, normalized difference scores on the binary-stimulus task did not significantly predict normalized difference scores on the unitary-stimulus task, adj. R 2<0.001, F(1,72)=0.02, p=0.885, 95 % CI [−0.20, 0.17] (Fig. 2B). A Bayesian linear regression (Kruschke, 2011) revealed a posterior distribution of standardized slopes centered on M=–0.09 [95 % HDI –0.33, 0.16], supporting the conclusion that performance on binary-stimulus and unitary-stimulus tasks were not significantly related. To show that this lack of correlation was not due to unreliability of either task, we used attenuation correction to adjust the correlation according to the reliability of the two measures being correlated (Schmidt & Hunter, 1996). With difference score reliabilities of ρbinary=0.45 and ρunitary=0.41, the corrected correlation between attentional capture effects in binary and unitary paradigms was still not significant, r(74)=–0.04, p = 0.740.Footnote 3 To confirm that this lack of correlation was not due to validity effect differences between the tasks, search response times were subjected to a 2×2 repeated-measured ANOVA across validity and task; the interaction was non-significant, F(1,73)=0.51, p =0.479.

The primary analysis of interest was the above correlation between attentional capture effects in the binary-stimulus and unitary-stimulus paradigms. Having found that they are not related to one another, we conducted exploratory regression analyses to see whether performance was differentially predicted by WM capacity or self-report measures of attention. A stepwise linear regression model was applied to both dual-task paradigms to explore what factors best predicted attentional capture for each. Five factors were included in an initial model for each paradigm: visual WM capacity, complex span, self-report scores of ADHD, self-report scores on the attention-switching subcategory of ASQ, and self-report scores on the attention-to-detail subcategory of ASQ.

In predicting performance for the binary-stimulus task, four factors (operation span, ADHD score, and two ASQ sub-category scores) did not significantly contribute to the model (full model, adj. R 2=0.01). Only visual WM capacity significantly predicted attentional capture, adj. R 2=0.040, F(1,72)=3.99, p=0.049 (Fig. 3A). Outliers were assessed based upon Cook’s D≥1 (Cook & Weisberg, 1982), and no data were trimmed.

Fig. 3
figure 3

Fitted linear regression models predicting memory-based capture in each dual-task paradigm by selected, significant variables of working memory and attention. (A) Normalized difference scores (i.e., the amount of attentional capture by memory-matching distractors) in the binary-stimulus paradigm were significantly predicted by individual differences in visual working memory capacity. (B) Normalized difference scores in the unitary-stimulus paradigm were significantly predicted by self-report assessments of attention deficit-hyperactivity disorder and two sub-categories of the autism spectrum quotient. Lines depict the 95 % confidence intervals around mean standardized regression coefficients (black dots) in predicting normalized difference scores in a multiple regression model

In predicting performance for the unitary-stimulus task, two factors (visual WM capacity, operation span) did not significantly contribute to the model (full model, adj. R 2=0.10). Self-report scores of ADHD symptoms and the two ASQ sub-categories significantly predicted attentional capture, adj. R 2=0.119, F(3,70)=4.26, p =0.008 (Fig. 3B). Again, no data points were trimmed for Cook’s D≥1. No factor included in the model had a variance inflation factor above 1.12, indicating that collinearity was not an issue.

General discussion

A great number of studies have explored the extent to which WM content can capture visual attention, but debate remains open over whether attentional guidance by WM is automatic (e.g., Soto et al., 2008) or strategic (e.g., Woodman & Luck, 2007). This debate has been fueled by conflicting findings from two seemingly similar dual-task paradigms, which putatively test the same aspect of attentional guidance. The current study employed an individual-differences approach to examine whether performance in one paradigm related to performance in the other. Attentional capture by WM in the binary-stimulus (see Soto et al., 2005) and unitary-stimulus (see Woodman & Luck, 2007) paradigms did not correlate, suggesting a fundamental difference in how each task taps into WM biasing, even within the same individuals.

Much of the evidence that has led to debate over memory-guided attention has been drawn from one or the other of these dual-task paradigms. On the surface, these paradigms seem very similar, generating the assumption that the two are assessing the same underlying phenomenon. In fact, at the group level, the current study observed capture effects of roughly the same magnitude across the two paradigms. Thus, it is all the more striking that the magnitude of attentional capture did not correlate at an individual level. The current behavioral results, along with previous contradictory electrophysiological findings between the two paradigms (see Carlisle & Woodman, 2013), suggest the possibility that these two paradigms—which differ only in terms of stimulus arrangement—are actually tapping into distinct aspects of how memory and attention interact. The current results thus offer a new perspective on reconciling the disparate literature about memory-guided attention.

What might these distinct aspects of memory-based guidance entail? Using the other individual-differences measures, subsequent exploratory analyses hinted that attentional capture in the two paradigms was differentially predicted by independently measured constructs of WM and attention—specifically, by visual WM capacity for the binary-stimulus task, and by self-report assessments of attention for the unitary-stimulus task. It is important to emphasize that the amount of variance explained by these factors was relatively small; furthermore, the current individual-differences approach did not allow for empirical manipulation of WM- and attention-related factors. Thus, these results only serve to generate initial hypotheses as to why the two paradigms produce different patterns of behavior.

The exploratory analyses revealed a dissociation between predictive factors for the binary-stimulus versus the unitary-stimulus paradigm. For the binary-stimulus paradigm, greater visual WM capacity predicted greater attentional capture. Previous research has suggested that visual search processing taps into WM resources (e.g., Anderson, Vogel, & Awh, 2013). We speculate that visual WM capacity might be more relevant for the binary-stimulus paradigm due to the physical distinction between binary and unitary stimuli. In binary-stimulus search arrays, there are two spatially distinct forms, the colored circle and the search line, which could theoretically be processed as separate “objects” (Duncan, 1984); in contrast, unitary stimuli consist of one form only. Thus, although only one “object” is encoded during the memory cue display, twice as many “objects” are processed in binary versus unitary search displays, which may tax WM resources to a greater degree.

Memory-based guidance in the unitary-stimulus paradigm was predicted by self-report assessments of ADHD and autism, which have been linked to difficulties with attentional control and flexibility (Geurts, Verté, Oosterlaan, Roeyers, & Sergeant, 2004; Manly et al., 2001). These findings point to greater attentional capture in individuals who might be less able to flexibly deprioritize WM items (i.e., relegate them to accessory status; Olivers, Peters, Houtkamp, & Roelfsema, 2011) during visual search. We speculate that these factors might be specifically predictive for performance in the unitary-stimulus paradigm due to the spatial concentration of task-relevant features, which may require more attentional control to dissociate the search-relevant feature (gap) from the memory-relevant feature (color). This issue goes hand-in-hand with search difficulty, in the sense that distinguishing between memory-relevant and task-relevant features in unitary stimuli is more perceptually demanding. However, general search difficulty in and of itself has not been found to influence memory-based attentional capture (Olivers, 2009). Overall, these exploratory analyses present a preliminary window into how the spatial differences between binary and unitary stimuli might differentially recruit object-encoding and attentional control processes, which may underlie the differences in attentional capture between the binary-stimulus and unitary-stimulus paradigms.

In much of the memory-guided attention literature, there has been an assumption that two superficially very similar and highly influential tasks are testing the same relationship between WM and attention, thus driving a debate about why results differ. By exploring these differences within the same individuals, the present study offers an alternative explanation: these two tasks produce non-correlating effects because they might be tapping into distinct aspects of memory-based attentional guidance. These findings underscore the importance of stimulus attributes and task design in determining the recruitment of particular attention and memory processes, and reveal that a multitude of cognitive factors can contribute to and influence the interaction between memory and attention.