Cognitive load effects on early visual perceptual processing

Liu, Ping; Forte, Jason; Sewell, David; Carter, Olivia

doi:10.3758/s13414-017-1464-9

Cognitive load effects on early visual perceptual processing

Published: 23 January 2018

Volume 80, pages 929–950, (2018)
Cite this article

Download PDF

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Cognitive load effects on early visual perceptual processing

Download PDF

Ping Liu¹,
Jason Forte¹,
David Sewell² &
…
Olivia Carter¹

4291 Accesses
9 Citations
5 Altmetric
Explore all metrics

Abstract

Contrast-based early visual processing has largely been considered to involve autonomous processes that do not need the support of cognitive resources. However, as spatial attention is known to modulate early visual perceptual processing, we explored whether cognitive load could similarly impact contrast-based perception. We used a dual-task paradigm to assess the impact of a concurrent working memory task on the performance of three different early visual tasks. The results from Experiment 1 suggest that cognitive load can modulate early visual processing. No effects of cognitive load were seen in Experiments 2 or 3. Together, the findings provide evidence that under some circumstances cognitive load effects can penetrate the early stages of visual processing and that higher cognitive function and early perceptual processing may not be as independent as was once thought.

No one knows what attention is

Article Open access 05 September 2019

Bernhard Hommel, Craig S. Chapman, … Timothy N. Welsh

Guided Search 6.0: An updated model of visual search

Article 05 February 2021

Jeremy M. Wolfe

The effect of emotional arousal on visual attentional performance: a systematic review

Article Open access 07 July 2023

Andras N. Zsidó

Introduction

At any given moment, our brain is overwhelmed by incoming information from our sensory environment. At the same time, our behavioral goals and the execution of actions need to be maintained. The ability of the brain to coordinate concurrent perceptual processing and higher cognitive functions is crucial for us to behave in a coherent and efficient manner in daily life.

The influence of cognitive processes on concurrent visual perceptual processing has been mainly explored in two seemingly related but independent literature streams. One stream focuses on how working memory content can bias concurrent visual processing when there is a content overlap between the two (Kosslyn et al., 1999; Scocchia et al., 2013; Serences et al., 2009). This line of research has provided evidence for strong links between cognitive processes and low-level visual perceptual mechanisms. The second stream focuses on understanding to what extent cognitive load may affect early visual processing when there is no content overlap between the two (de Fockert et al., 2001; Lavie, 2005, 2010). The current study falls into the second category.

Top-down attention mechanisms supporting perceptual information processing have been the subject of countless studies (Carrasco, 2011; Chen et al., 2014; Crist et al., 2001; Gilbert & Li, 2013; Li et al., 2004, 2006). Visual attention can be selectively directed to different visual properties such as location, color, etc. The majority of such studies have looked at how visual spatial attention facilitates the processing of attended information and suppresses unattended information (Carrasco, 2011; Desimone & Duncan, 1995)

Whether spatial attention modulates early visual processing was difficult to prove for more than two decades due to the variety of visual tasks and methodologies employed in spatial attention studies (Carrasco, 2011; Zhaoping, 2014). The flow of visual perceptual processing is believed to follow an approximately hierarchic feedforward path, i.e., from early to high level vision. Each stage is associated with its specific category of tasks that have been developed to rigorously assess the relevant level of visual processing (Marr, 1982; Zhaoping, 2014). Contrast sensitivity tasks are generally considered to assess early processing stages. Demonstrating spatial attention effects on early visual processing with only behavioral measures has required rigorous control of stimulus configuration and experimental methodology (Dosher & Lu, 2000; Herrmann et al., 2010; Lu & Dosher, 1998; Pestilli et al., 2011). For example, the target has to be presented alone free from any distractors and external noise (Dosher & Lu, 2000; Lu & Dosher, 1998; Pestilli et al., 2011) and the stimulus size of the target needs to be carefully controlled in relation to the spatial attention distribution (Herrmann et al., 2010).

The majority of cognitive load studies, however, have not made a clear distinction between the visual tasks used to assess cognitive load on early versus high level visual processing and this has led to some discrepancies in the interpretations of results obtained. For example, findings from a class of studies employing flanker tasks have been interpreted as suggesting that cognitive load doesn’t modulate early visual processing (de Fockert et al., 2001; Lavie, 2005). Flanker tasks represent an experimental paradigm known to be more closely associated with a higher-level visual mechanism, i.e., visual crowding (Dayan & Solomon, 2010; Levi, 2008; Levi et al., 2002; Strasburger, 2005). While these studies are interesting and informative they cannot be used to rule out an impact of cognitive load on early visual processing.

Recently a few studies using early visual tasks have provided some initial indication that such tasks may be sensitive to cognitive load. Cocchi et al., (2011) reported an unexpected finding that visual spatial working memory loads facilitated the performance of a concurrent but independent visual grouping-by-proximity task. Similarly, de Fockert and Leiser (2014) showed that high cognitive load enhanced collinear facilitation, which is an established early visual perceptual mechanism. The “facilitative” effects reported in Cocchi et al., (2011) and de Fockert and Leiser (2014) are at odds with the existing research (i.e., cognitive load and other dual-task studies) that suggests cognitive load has no impact on concurrent early visual processing (Pashler, 1994; de Fockert et al., 2001; Lavie, 2005). However, the grouping-by-proximity task in Cocchi et al., (2011) and the collinear facilitation task in de Fockert and Leiser (2014) differ considerably from the flanker tasks employed in cognitive load studies. Firstly, both the grouping and the collinear facilitation tasks are generally considered early visual tasks whereas flanker tasks are considered a high-level vision task. Secondly, there is literature suggesting that the grouping and the collinear facilitation tasks are facilitated by a more distributed visual spatial attention field (Ben-Av et al., 1992; Casco et al., 2005; Freeman et al., 2001, 2003; Han et al., 2005a, 2005b; Ito & Gilbert, 1999; Mack et al., 1992). In contrast, a focused spatial attention field has been shown to improve performance on flanker tasks (Chen et al., 2014; Fang & He, 2008; Harrison et al., 2013; He et al., 1996; Motter, 1993; Petrov & Meleshkevich, 2011; Scolari et al., 2007; Strasburger, 2005; Van der Lubbe & Keuss, 2001). The finding and design differences in the research raise the crucial question as to whether cognitive load can indeed modulate early visual processing.

The center-surround antagonistic organization of the receptive field of early visual neurons is thought to be fundamental to optimal contrast-based visual information processing. The center excitatory drive to the classical receptive field (CRF) establishes a neuron’s basic stimulus selectivity, which can be strongly modulated by the surround inhibition from the extra-classical receptive field (eCRF) in many neurons along the visual pathway (Adelson & Bergen, 1991; Fujita et al., 1992; Hubel & Wiesel, 1962, 1965). This center-surround interaction has been proposed to be one of the most fundamental underlying mechanisms supporting the efficient encoding of raw visual inputs (Heeger, 1992; Marr, 1976; Zhaoping, 2014).

Neurophysiological findings of top-down modulation effects on center excitation and surround inhibition suggest that variations in top-down modulation strength lead to differential effects on the final output of neural responses in early visual cortical neurons (Hupe et al., 1998, 2001; Nassi et al., 2013; Sandell & Schiller, 1982; Wang et al., 2010). Specifically, inactivation of feedback to V1 neurons has been found to reduce responses in some neurons to low-contrast stimuli confined to the CRF, suggesting that cortico-cortical feedback provides a weak, predominantly excitatory influence on the CRF (Hupe et al., 1998, 2001; Sandell & Schiller, 1982; Wang et al., 2010). In contrast, when assessed using stimuli that engage both the CRF and eCRF, eliminating feedback results in strong and consistent response facilitation, effectively reducing the strength of surround inhibition on center excitation in V1 neurons (Angelucci et al., 2002; Angelucci & Bullier, 2003; Nassi et al., 2013). Thus, theoretically in the presence of both center excitation and surround inhibition, the final outputs reflect the balance between these two forces in the absence of spatial attention.

Spatial attention has been argued to shift the balance between center excitation and surround inhibition, which in turn alters the neural response to visual stimulation. The modulation effects have been characterized by many computational models (Cutzu & Tsotsos, 2003; Pestilli et al., 2011; Reynolds & Heeger, 2009). For example, according to the normalization model of spatial attention (Reynolds & Heeger, 2009), the size of the attentional field determines how much surround inhibition enters into the normalization process and, consequently, the final response intensity of a given neuron (Reynolds & Heeger, 2009).

The aim of the current study is to explore cognitive load effects on early visual processing. Given center excitation and surround inhibition are the fundamental contrast-based early visual processing mechanisms, the current study explored cognitive load effects on center excitation and surround inhibition separately with established early vision tasks in three experiments. Spatial attention effects were taken into account in the design of the experiments and interpretation of results because of its possible modulation effects on the interaction between the two forces.

Experiment 1 - Cognitive load effects on center excitation

Converging evidence from psychophysical, neurophysiological, and imaging studies suggests that top-down modulation enhances neural response to visual stimulation when the dominant driver of the neural response reflects the center excitation mechanism. Findings from psychophysical studies of spatial attention suggest that the effects of spatial attention are equivalent to increasing the contrast of weak stimuli when the target stimulus is small relative to the spatial attention field (Herrmann et al., 2010; Ling & Carrasco, 2006; Pestilli et al., 2009). Neurophysiological and imaging studies of spatial attention have also found spatial attention effects are equivalent to increasing stimulus contrast for small stimuli (Li et al., 2008; Reynolds & Chelazzi, 2004; Reynolds et al., 2000). Together with the neurophysiological findings that cortico-cortical feedback provides a weak excitatory influence on the CRF for low-contrast stimuli (Hupe et al., 1998, 2001; Sandell & Schiller, 1982; Wang et al., 2010), these results suggest top-down modulation can enhance center excitation.

In Experiment 1, cognitive load effects on center excitation were assessed. As cognitive load is assumed to tax limited cognitive resources, it was hypothesized that it may reduce the brain’s ability to provide top-down modulation for concurrent early perceptual processing. In other words, the capacity for top-down enhancement of center excitation should be diminished causing contrast sensitivity to be lower under high-load conditions.

An orthogonal orientation discrimination task was employed as a proxy for a typical peripheral contrast detection task. This task was adapted from previous spatial attention studies and was carefully chosen based on three major considerations. Firstly, the performance on this task is generally believed to reflect the contrast responses of orientation selective early cortical visual neurons (Skottun et al., 1987). Secondly, the orthogonal discrimination and yes-no detection tasks produce equivalent contrast thresholds (Thomas & Gille, 1979). Thirdly, by asking participants to judge the orientation contingent dimension of interest (contrast) rather than contrast itself, the task minimizes response bias usually associated with yes-no contrast detection tasks (Smith & Wolfgang, 2007). Similar methodology has been adopted in multiple spatial attention studies (Carrasco et al., 2000; Liu et al., 2009; Skottun et al., 1987; Smith & Wolfgang, 2004). While it is acknowledged that orientation discrimination is generally regarded as requiring high-level visual processing (Zhaoping, 2014), the processing demand on orientation discrimination in this task is, however, minimal. A peripheral contrast detection task was employed because it is thought to recruit distributed spatial attention. Cognitive load has been shown to defocus spatial attention when focused spatial attention is required (Caparos & Linnell, 2010; Linnell & Caparos, 2011). By using an early visual task that requires a distributed spatial attention, the design minimized the chance that cognitive load effects on center excitation could be confounded by its effects on altering spatial attention distribution.

Cognitive load was manipulated with a general alphanumeric working memory task similar to that used previously (de Fockert et al., 2001), in which observers held zero, one or five alphanumeric characters in working memory. If cognitive load reduces top-down modulation on early visual processing, our high working memory load condition should result in a relative elevation of contrast detection thresholds.

Methods

Participants

Four graduate students (three females and one male aged 21 to 35 years) from the University of Melbourne participated in the experiment. Three were experienced psychophysical participants and one had no previous experience observing psychophysics experiments. All had normal or corrected-to-normal vision. Participants were screened and consented in accordance with approval from the human research ethics board of the University of Melbourne.

Apparatus

Stimuli were created on a MacPro computer using MATLAB (version 7.8) and the Psychophysics Toolbox 3.0 (Brainard, 1997; Pelli, 1997) and displayed on a gamma-corrected 17-inch CRT monitor, 1024-by-768-pixel at 85 Hz in a dimly lit room. The background was a uniform gray with the luminance set to the middle of the monitor’s range, about 55 cd/m². The stimuli were viewed binocularly at 80 cm with participant’s head position stabilized with a chin rest.

Stimuli

The contrast detection task

The target stimuli for the contrast detection task were Gabor patches (sinusoidal gratings embedded in a Gaussian window) subtending 1^∘ of visual angle presented at 4^∘ eccentricity from the fixation. The Gabor stimuli had a center spatial frequency of 3.6 cycles per degree (cpd). On each trial, a Gabor patch was presented with equal probability at one of the four corners of an imaginary square, centered on a fixation square (0.2^∘× 0.2^∘ of visual angle), which was present at the center of the screen throughout the perceptual task. Half of the trials contained a vertical Gabor and the other a horizontal Gabor.

The luminance profile L (x, y) of a static vertical Gabor patch as a function of spatial coordinates along the horizontal (x) and vertical (y) axes was

$$\begin{array}{@{}rcl@{}} L\left( x,y\right) &=& L_{0}+L_{0}\cdot m\cdot{exp\left[-\frac{(x-x_{0})^{2}}{2\sigma^{2}}\right]}\\ &&\cdot\, exp\left[-\frac{(y-y_{0})^{2}}{2\sigma^{2}}\right]\cdot cos(2\pi f(x\,-\,x_{0})\,+\,\theta) \end{array} $$

(1)

where L₀ is the mean luminance of the display, m is the amplitude (contrast) of the Gabor function, x₀ and y₀ are its horizontal and vertical center positions respectively, σ is the standard deviation of the Gaussian envelope, f is the frequency of the sinusoid, and 𝜃 is the phase of the sinusoid with respect to the center of the Gaussian window. All Gabors were in cosine phase with 𝜃 set at 0.

To signal the target location and terminate visual perceptual processing, a square mask consisting of a high contrast checkerboard pattern (subtending 1.1^∘ of visual angle) was presented for 200 ms at the same location of the target immediately after the offset of the Gabor patch.

The method of constant stimuli was used. All participants performed the contrast detection task prior to formal testing to establish the contrast levels required to measure the full extent of the psychometric function (five or six levels of contrast linearly spaced on a log scale from chance to asymptote performance level).

The working memory task

The working memory set was displayed in a 3×3 grid at the center of the monitor in font Arial size 18. The grid was made of English consonants randomly selected from the available 20 without replacement. The remainder of the grid was filled with tilde symbols (∼). The entire grid measured approximately 2.5^∘ squared, with each letter within the grid subtending approximately 0.6^∘. The combination of the letter and tilde symbols within the grid varied as a function of load (no load, low load, and high load). In the no-load condition, the grid consisted purely of tilde symbols. In the low-load condition, one letter was presented in the central location of the grid. In the high-load condition, five letters were presented with one at each corner and one at the center. The memory grid was presented in dark red to indicate the encoding phase of the working memory task.

In the low- and high-load conditions, the memory of the letter set was later probed by a single letter presented in one of the locations previously occupied by a letter, with all remaining locations filled by the tilde. In half of the trials, the probe letter was identical to the one previously presented at the exact location and different in the other half. In no-load conditions, a grid of tilde symbols was presented to occupy the time. The probe array was presented in dark green to indicate that this was the probe phase of the working memory task. The luminance of the red letters of the memory array and the green letters within the probe array were matched.

Procedure

As depicted in Fig. 1, each trial started with a light grey fixation square, appearing for 1000 ms, indicating the start of a new trial. The fixation square was then replaced by the memory grid presented for 1500 ms (individually adjusted for one of the participants to be 2000 ms). This was then followed by the presentation of a fixation square for 1200 ms before the Gabor patch was presented. The Gabor patch was presented for 50 ms. A square mask consisting of a high contrast checkerboard pattern (subtending 1.1^∘ of visual angle) was presented for 200 ms at the same location of the target immediately after the offset of Gabor patches. Each trial ended with a probe grid that was presented for 800 ms. The working memory load, location of the Gabor stimulus and contrast level were all randomized within sessions.

Participants were instructed to fixate on the central square throughout the trial except for reading the memory grid and the probe grid. With respect to the memory and detection tasks, participants were told to remember and maintain the memory set online for the full length of each trial and to report the target orientation as accurately as possible immediately following the presentation of the Gabor. Their response of orientation (vertical vs. horizontal) of the Gabor was indicated by pressing the arrow (left vs. right) key on the computer keyboard using a finger (index vs. middle) of their right hand respectively. Feedback for an incorrect response was given by a high-frequency tone to encourage stability of decision criteria (Sperling & Dosher, 1986). A response window of 1500 ms was provided for the contrast detection task. Participants then responded to the memory task indicating whether the probe letter was the same vs. different to the one in the memory set (in the exact location) by pressing the arrow (left vs. right) key on the computer keyboard using a finger (index vs. middle) of their right hand. A response window of 2000 ms was provided for the working memory task. No feedback was provided for the working memory task. Responses made outside the response time window for each task were not recorded.

Each session of the dual-task paradigm was about 1 h long with multiple breaks. The inexperienced participant was trained on the contrast detection task for ten sessions with a total of 100 trials per contrast level. The three experienced participants were given one practice session each with a total of ten trials per contrast level. All participants performed a total of 20 1-h testing sessions. The trial randomization process ensured that at least 180 valid trials were completed per contrast level per working memory load. Trials with missing responses and responses with reaction time less than 200 ms for either the visual perceptual task or working memory task (high- and low-load conditions) were excluded. Depending on the number of excluded trials, participants typically completed between 180 and 200 trials per condition. See the results section below for specific details regarding the percentage of trials excluded for respective participants.

Analysis and results

All data were analyzed in R (R Development Core Team, 2011). The psychometric function fitting and associated model comparisons were analyzed using the psychy 0.1-7 package (Knoblauch & Maloney, 2012). All figures were plotted using ggplot2 package (Wickham, 2009). Analyses of performance for both tasks were only made on trials with legitimate responses. The percentages of trials that were excluded from the final analyses of individual participants were 3.18, 3.29, 5.87, and 0.54%, respectively.

The working memory task

Accuracy and reaction time of the working memory task for the four participants are reported in Table 1. The working memory task performance in the current experiment was comparable with the 92 and 98% performance and mean reaction times of 953 and 1394 ms for the low- and high-load conditions, respectively, reported in previous cognitive load studies (de Fockert et al., 2001), suggesting that our cognitive load manipulation was successful.

Table 1 Working memory task performance for each participant in Experiment 1

Full size table

The contrast detection task

To assess whether cognitive load modulates the contrast detection task, a modified cumulative Gaussian function was fitted to the data from each participant, where x is the stimulus contrast, α,β,λ, and γ are the fitted model parameters which determine the shape of the psychometric function,

$$ F(x,\alpha,\beta,\gamma,\lambda)=\gamma+(1-\gamma-\lambda)F(x,\alpha,\beta) $$

(2)

and F is the cumulative Gaussian function:

$$ F(x;\alpha,\beta)=\frac{\beta}{\sqrt{2\pi}}{\int}_{-\infty}^{x}exp\left( -\frac{\beta^{2}\left( x-\alpha\right)^{2}}{2}\right) $$

(3)

with α ∈ (−∞, + ∞), β ∈ (−∞, + ∞). The contrast threshold (α) and the slope (β) of the psychometric functions were left to vary freely and estimated separately for the no-, low-, and high-load conditions. The range of the asymptote (λ) was constrained to be within 1∼5%, and additionally, was forced to be equal across all levels of working memory load due to the limits of computational capacity of the psychy 0.1-7 package (Knoblauch & Maloney, 2012). Gamma (γ) represented the chance performance and was set at 0.5 (Wichmann & Hill, 2001a, 2001b).

Fits were performed using maximum-likelihood estimation. To determine whether there was a change in threshold (α) and a change in slope (β) of the psychometric functions under different working memory load, three models were compared using a nested hypothesis test (Mood et al., 1974). In the one-function model, a single psychometric function was fit to all the data; the threshold (α) and slope (β) of the psychometric functions for the three working memory loads were constrained to be the same. In the threshold model, three psychometric functions were fit to the three working memory load conditions with the threshold (α) being varied freely but the slope (β) being constrained to be the same. Finally, in the threshold-slope model, both the threshold (α) and slope (β) were estimated for each working memory condition. Goodness-of-fit was assessed with deviance scores, which were calculated as the log-likelihood ratio between nested models (Wichmann & Hill, 2001a, 2001b). The deviance scores of the one-function model and the threshold model were compared to assess whether thresholds were different across working memory load conditions, and the deviance scores of the threshold model and threshold-slope model were compared to evaluate whether slope differed across working memory load conditions. The results of these fits are summarized in Table 2.

Table 2 GLM model fits and model comparisons for each participant in Experiment 1

Full size table

Figure 2 shows the psychometric functions for the three working memory load conditions for each of the four participants with the fits of the threshold model. As expected, performance increased as a function of target contrast under all working memory loads. The psychometric function for the high-load condition shifted to the right compared to the no- and low-load conditions. Although two participants showed slope (β) changes, the slope effects were not consistent across participants, and potentially reflected individual differences.

Reaction time was evaluated as a secondary measure of the contrast detection task performance. The mean RT for each participant was fitted with the two parameter Piéron’s law function (Piéron, 1920; Smith et al., 2004):

$$ F(c)=\alpha c^{-\beta} $$

(4)

Piéron’s law is a power function that describes the decrease in mean RT with increasing stimulus contrast, c (Smith et al., 2004). It describes an empirical rather than a theoretical relationship, which is known to characterize the dependency of RT on stimuli intensity in a variety of tasks (Teichner & Krebs, 1972, 1974). As with accuracy data, the cognitive load effects were quantified by comparing the fits of a one-function model in which the scale (α) and exponent (β) were constrained to be the same and a multi-function model in which the scale (α) and exponent (β) varied with working memory load condition. Goodness-of-fit was assessed with deviance scores, which were calculated as the log-likelihood ratio between nested models (Mood et al., 1974). The deviance scores of the single-function and multi-function models were compared to assess whether RT changes differed across working memory load conditions. The model fits are given in Table 3. The mean RT data were better described for all participants by the multi-function model. Plots of mean RT for each participant are shown in Fig. 3. These results show that participants almost always responded faster as contrast increased and mean RTs were generally longer under the high-load condition (vs. the no- and low-load conditions).

Table 3 Piéron’s law model fits for reaction time data for each participant in Experiment 1

Full size table

Discussion

The aim of Experiment 1 was to evaluate whether cognitive load modulates the strength of the center excitation mechanism. We assessed the effects of an unrelated but concurrent working memory task on the contrast detection thresholds of small Gabors. Under the high working memory load, the contrast detection thresholds were found to be higher in three participants. The same pattern was seen in the fourth participant although the model comparison did not reach significance for this participant. Reaction time data suggest that participants were generally slower on the contrast detection task under the high working memory (vs. no- and low-loads), suggesting there was no speed accuracy tradeoff on the contrast detection task. The model comparisons showed no significant difference between the low- and no-load conditions in either contrast thresholds or reaction time.

To the best of the authors’ knowledge, this experiment represents one of the first demonstrations of cognitive load effects on early visual processing with an established early visual perceptual task. We believe that the behavioral effects found in Experiment 1 are consistent with a slight reduction in top-down enhancement to center excitation under high cognitive load based on several factors. Firstly, the performance on the orientation discrimination task is generally accepted to be dependent on orientation selective neurons in early visual cortical areas (e.g., V1) (Hubel & Wiesel 1962, 1968; Skottun et al., 1987). Secondly, because the single target Gabor was presented against a blank background and in the absence of flankers, the current experiment maximally reduced the processing demand at later cognitive levels (Dosher & Lu, 2000; Pelli, 1985; Pestilli et al., 2011). Thirdly, placing a backward mask at only the target location helped minimize spatial uncertainty (Smith, 2000) and associated performance decrements due to increased decisional noise believed to be related to target selection from multiple spatial channels (Dosher & Lu, 2000; Pelli, 1985). Any performance difference seen therefore can be more confidently attributed to reductions in the quality of perceptual representation due to diminished strength of top-down modulation associated with increased working memory load. While our use of backward masking had some clear advantages, one interesting question that arises is whether cognitive load effects on early visual processing are only evident when stimulus presentation time is limited. The spatial attention literature suggests that behavioral measures of top-down modulation effects on early visual processing may be dependent on the use of backward masking (Cameron et al., 2002; Carrasco et al., 2000; Smith, 2000). However, neurophysiological measures demonstrate top-down modulation effects in early visual area when stimulus presentation time is less strictly controlled (Buracas & Boynton, 2007; Ito & Gilbert, 1999; O’connor et al., 2002; Silver et al., 2007; Tootell et al., 1998; Roberts et al., 2007). This issue is discussed further in the general discussion.

Experiment 2 cognitive load effects on surround inhibition

Under natural viewing conditions, the center excitation and surround inhibition mechanisms are believed to function in a coordinated fashion to best process contrast variations in visual scenes (Bonds, 1989; Mach, 1866; Petrov & McKee, 2006; Tadin et al., 2003). While the visual task in Experiment 1 was designed to optimally measure cognitive load effects on center excitation, the aim of Experiment 2 was to determine whether cognitive load could also be shown to impact surround inhibition in early vision.

It is, however, not straight forward to psychophysically separate out top-down modulation effects on surround inhibition from its effects on center excitation. Surround inhibition by definition is modulatory in nature—behaviorally assessing surround inhibition effects usually involves measuring the contrast sensitivity to a central target with versus without the presence of a high-contrast surround mask. At the behavioral level, measuring contrast sensitivity of the target recruits spatial attention as participants are usually explicitly instructed to focus on the central target and ignore the high contrast mask. This creates a confound as spatial attention has been shown to alter the interaction between center excitation and surround inhibition (Herrmann et al., 2010; Reynolds & Heeger, 2009).

To measure cognitive load effects on surround inhibition strength relatively independent of spatial attention effects, here we used a motion discrimination task that is thought to represent a perceptual correlate of surround inhibition (Tadin et al., 2003). One key aspect of this motion task is that only one large size stimulus is used as the target for the perceptual task so that there is no distinction between a target and its surrounding. This has the benefit that the task does not require spatial attention to play the typical dual role of focusing on a target while ignoring its surroundings.

Tadin et al., (2003) showed that when a high contrast drifting stimulus was presented very briefly, motion direction discrimination deteriorated with increasing stimulus size. The results were interpreted as suggesting that the high contrast large motion stimulus induces strong surround inhibition. This, in turn, reduces the motion direction signal rendering the motion direction more difficult to perceive (Tadin et al., 2003). The counterintuitive psychophysical observation for the motion task is believed to result from the neuronal surround inhibition in the middle temporal area (MT or V5) (Tadin et al., 2011). MT neurons are known to be highly selective for motion direction, and roughly half of them exhibit inhibitory center-surround interactions at high contrasts but show weak or nonexistent surround inhibition at low contrasts (Born, 2000; Born & Bradley, 2005; Hunter & Born, 2011; Jones et al., 2001; Tsui & Pack, 2011). This surround inhibition is direction-specific and strongest for large, slow-drifting stimuli (Pack et al., 2005). It has also been shown that MT neurons with surround inhibition integrate motion signals relatively quickly compared to MT neurons without surround inhibition (Churan et al., 2008, 2009). This finding suggests that brief motion stimuli preferentially probe MT neurons that have strong center-surround configurations (Churan et al., 2008, 2009).

Consistent with neurological findings that top-down modulation through feedback connections enhances surround inhibition, a recent study provided novel evidence that higher cognitive capacity might provide more efficient top-down modulation, which in turn, might result in stronger surround inhibition. Melnick et al., (2013) found that individual variability in surround inhibition reflected in the motion task negatively correlated with IQ (r = -0.71), a measure thought to reflect mainly higher cognitive functions. Thus high-IQ individuals exhibited disproportionately large impairments in the performance of this motion task when the stimulus was large and of high contrast. The finding suggests that higher cognitive capacity may be associated with stronger surround inhibition.

Taken together, the current literature suggests that strong top-down modulation may increase surround inhibition in early visual processing. Since the results of Experiment 1 were consistent with cognitive load reducing top-down enhancement of center excitation, Experiment 2 aimed to identify evidence consistent with effects of high cognitive load on surround inhibition. With respect to the motion discrimination task used here, any reduction in surround inhibition should result in better performance on the task (i.e., shorter exposure duration thresholds) under high cognitive load (vs. no and low loads).