Keywords

General Introduction

Imagine two hypothetical individuals, Tyler and Olivia. Tyler is an undergraduate at Duke University. He is taking an Introductory Psychology course that requires him to participate in three experimental psychology studies. For one of the required experiments, Tyler selects a “visual search” study in a cognitive psychology laboratory and signs up at a time immediately after his morning Linear Algebra class . Tyler arrives at the lab at 11 a.m. and fills out the necessary consent forms before being led into a dimly lit testing room where he is instructed to search for 'T'-shaped targets on a computer screen among 'L'-shaped distractor items and to indicate whether a 'T' is present or absent on each trial. Tyler is bored by the time the practice segment is complete and begins to muddle through the task, exerting the minimal effort required. While he is sure some graduate student really cares about how he performs, he is not too concerned. After checking his email on his phone for the 25th time, he finally nears the end of this exercise. He speeds up as the end of the hour approaches; his stomach is beginning to rumble, and he’s getting hungry. He knows that the faster he can get through the remainder of the trials, the sooner he’ll be able to eat lunch. Tyler completes the task and heads to a university café while the experimenters in the lab examine his data, pooled with data from other participants just like Tyler, with the intentions of drawing conclusions about the nature of human visual search processes.

Olivia is an X-ray operator at the Raleigh-Durham Airport in North Carolina. She has worked with the Transportation Security Administration for 6 years and is currently a full-time employee who works 5 days a week. She starts her normal shift at 5 a.m., and during each shift she works several 30 min stints at the X-ray machine , searching for contraband that may be hidden in passengers’ luggage. Olivia has completed an initial training on optimal search strategies and numerous refresher courses that are designed to make her a better searcher. As her supervisors monitor her performance, she is cognizant of maintaining a certain level of accuracy in order to keep her job. Additionally, Olivia is keenly aware of the consequences of letting a bag with a bomb slip through the cracks. Very few, if any, of the bags Olivia searches contain any actual lethal items, but her job is to remain vigilant and conduct thorough searches on every bag, regardless of the improbability of finding a bomb, a gun, or a knife.

Clearly, our hypothetical individuals, Tyler and Olivia, are conducting very different visual searches, with very different motives, in very different environments. Moreover, the differences highlighted above only scratch the surface of the variability between the nature of inexperienced and expert searchers. Realistically, how can search performance between undergraduates and career searchers be compared, given the drastic differences between these scenarios? Can we conclude anything from Tyler’s performance about how Olivia should be performing her job? Does understanding Olivia’s performance inform cognitive theories of visual search?

An eventual goal of basic research is translating findings from the lab to the field Footnote 1; however, researchers often struggle to overcome the inherent differences between the sterile, controlled environment of a research lab and the complex, messy environment of the real world. The study of visual search—finding a target amongst distractors—is an excellent paradigm to illustrate this relationship. On the one hand, visual search is a powerful research method for psychologists , as it encompasses several aspects of cognition (e.g., memory, perception , attention). As such, researchers have extensively studied and theorized about the nature of visual search (see Nakayama and Martini 2010 ; Eckstein 2011 , for recent reviews). On the other hand, visual searches are regularly conducted in everyday tasks outside of a laboratory setting.

The goal of this chapter is to discuss several of the hurdles encountered when moving between the lab and the field and how they might be overcome. We will begin with a general overview of visual search followed by a brief review of the research history and theories. We will then introduce some relevant applied visual search findings before detailing four primary hurdles that stand in the way of translating search findings between the lab and the field :

  1. 1.

    Target prevalence : Is search performance affected by the relative likelihood of a target being present (e.g., do searchers perform worse if targets are rarely present)?

  2. 2.

    Number of targets and target categories: Does search performance decline if a searcher is required to search for more than one target in the same image (e.g., multiple fractures in a medical X-ray) or more than one possible target type (e.g., a gun or a bomb in a luggage X-ray)?

  3. 3.

    Motivation and anxiety: How does the context within which a search is conducted affect performance? Is search performance helped or hindered by added motivation or anxiety?

  4. 4.

    Level of experience: Is search performance altered by expertise? How might years of experience alter visual search strategies or abilities?

Overview of Visual Search

Visual search is the process of finding specific target items within an environment based on particular visual features or semantic information. In its simplest form, visual search could operate via basic pattern matching; for example, detecting a red vertical line in a field of green horizontal lines would solely require invoking a red and/or vertical pattern template. However, even this easy visual search depends on many attentional and perceptual factors, and more complex searches move well beyond basic pattern matching. More common visual searches, both in the lab (e.g., finding a target ‘T’ amongst distractor ‘L’s, finding a particular shape amongst variable distractors) and in the field (e.g., finding a tumor in a radiograph, finding keys in a purse), involve an array of cognitive processes. Search involves perception (i.e., processing and interpreting visual features), attention (i.e., allocating resources to the relevant areas of a visual area), and memory (i.e., storing a representation of the target item or items). Thus, search represents a fruitful and exciting area of research.

In the lab, visual search has been used extensively to learn about cognition. For example, search studies have informed theories of basic perception (e.g., Wolfe et al. 2005), the structure of visual short-term memory (e.g., Alvarez and Cavanagh 2004 ), and attentional capture (e.g., Yantis and Jonides 1996 ; Franconeri et al. 2005 ), to name just a few. Beyond using visual search as a powerful tool for understanding cognitive processing, researchers have also focused on search as an experimental paradigm with the goal of understanding how searches are conducted. Over the past several decades, psychological research has made tremendous headway in understanding the processes responsible for performing visual search tasks and the mechanisms that allow for the successful identification of target items. The findings from visual search research have been extensive, and, in turn, the contributions to the scientific community have been invaluable.

Given the relevance of visual search to real-world environments as well, ideally, what is learned from studying search processes in the lab can be applied to searches in the field. Beyond the vital function of search in navigating our everyday lives, the search performance of radiologists , X-ray operators, and many others can be life-or-death critical. As recent technological advancements have allowed for the improvement of screening techniques, additional key advancements lie in understanding the cognitive processes of the searchers themselves, identifying common search errors, and improving the manner in which searches are conducted.

A Brief History of Visual Search Research

Early Evidence from Non-Human Visual Searchers

While the current era of visual search research is largely laboratory-based, the first investigations of search were focused on its primary goal in the world—survival. Animals engage in survival activities that require visual search, such as finding food, avoiding predators, detecting a potential mate’s signs, and locating appropriate shelter. Search was perhaps first scientifically investigated in 1890 by Edward Poulton , a zoologist who was interested in how animals elude predators. Poulton noted that a single species tends to evolve many different appearances, making it more difficult to be detected by predators, a phenomenon known as cryptic pattern polymorphism . For instance, a single species of forest moth appears with many different wing patterns, and Poulton noticed that it is more difficult for a bird to search for a multiple kinds of targets simultaneously than to search for a single type. The added difficulty that accompanies a search for multiple kinds of targets is now a well-documented idea in cognitive psychology (e.g., Menneer et al. 2007 ) and is especially relevant to current-day X-ray baggage screening at airports, as security officers must search for a number of potentially hazardous items .

Expanding on Poulton’s observations, Tinbergen (1960 ) found that insectivorous birds maximized their rate of detection by confining their searches to only a few prey types at a given time and by focusing on either the most common prey available or those that had been seen most recently. In effect, this research demonstrated that non-human animals are sensitive to the statistics of their environments and are able to quickly adapt to maximize search efficiency, and contemporary work with human searchers has found similar results (e.g., Cain et al., in press ). Pigeon studies have illustrated that search is specialized for ecologically relevant tasks, as pigeons demonstrate a fantastic ability to find food (e.g., Bond 1983 ) and effectively optimize their rate of food discovery. These early studies of search with non-human species have served to both establish the evolutionary basis of search processes and demonstrate the practical nature of visual search. For the remainder of this chapter, we focus on human visual search research that has built upon, and complements, these and other non-human search findings.

Early Evidence from Human Visual Searchers

Speculations about the nature of human visual search—also from an applied angle—began with Bernard Koopman in the 1950s, when he explored theories of search in the context of radar operators locating enemy ships (Koopman 1956a, b ). Tasked by the US Navy to systematically determine the location of enemy ships and lost personnel, he revealed many basic theoretical properties of visual search, such as the distribution of attention and the criteria for termination (Koopman 1957), that remain fundamentally important for current theories of search (e.g., Chun and Wolfe 1996 ).

Cognitive psychologists entered the visual search research arena in the 1960s and 1970s and have played a primary role ever since. Early work (e.g., Neisser 1963; Schneider and Shiffrin 1977 ) laid the groundwork for two influential theories (see Palmer et al. 2000, for a review ): the feature integration theory (Treisman and Gelade 1980 ) and the guided search model (Wolfe 1998 ) .

While seeking to isolate the fundamental elements of vision, Treisman developed the feature-integration theory (FIT; Treisman and Gelade 1980), which served as a driving force of the surge of research in visual search that was soon to follow. The basic idea behind FIT is rooted in Neisser’s (1967) original division of visual processing into two distinct stages, but Treisman expanded on the meaning of these stages dramatically. According to the theory, the basic features of items (color, shape, orientation, etc.) are first processed effortlessly and automatically in the early stages of vision, in separate, spatially organized maps. Next, directed attention is required during the “attention” stage in order to successfully bind the separate features into integrated object percepts (Treisman 1998 ). Finally, a subset of these items is selected for further processing.

FIT allows for the dissociation of two types of searches, often categorized as “parallel” and “serial.” Parallel search occurs when all items in a search array are assessed simultaneously, as the target item is different from all distractor items on at least one dimension, so it simply “pops out” at the observer (e.g., a red vertical line amongst green horizontal lines). In parallel searches, increasing the set size (number of objects in the search display) has little effect on response time because individual processing of each item is not required. Serial search occurs when the individual items within an array need to be searched one-by-one (or small group by small group) because the target item does not immediately pop out at the observer. Serial searching is needed when the target is only separable from the distractors by a conjunction of multiple features, such that it shares some features with the distractors (e.g., a target red vertical line amongst red horizontal lines and green vertical lines). In these cases, response time increases as the number of items in an array increases because more items need to be searched successively.

This strictly dichotomous view of serial vs. parallel search is no longer considered an accurate characterization of search processes (e.g., Townsend 1990 ; Wolfe 1998 ), but it continues to offer a useful framework for understanding the variation in processing between simple and complex searches. Because parallel and serial searches are thought to rely on different cognitive processes, it is typically necessary to differentiate between the two when examining an effect, as many conditions may only modulate performance for one of these two types of search.

Feature-integration theory is useful in understanding a simple two-stage concept of the preattentive and focused stages of search, but preattentive processing is more complex than Treisman’s original model captures (Wolfe 1998 ; Wolfe and Horowitz 2004). Wolfe’s “Guided Search ” theory (2007) has a similar, but less linear, model of the stages involved in search. In Guided Search, the basic features serve as guiding attributes to direct the deployment of attention . Both basic sensory processes and selective attention are used in tandem, as basic perception identifies relevant features and guides the observer’s attention appropriately . The many versions of the Guided Search model (Wolfe et al. 1989; Wolfe 1994; Wolfe and Gancarz 1996; Wolfe 2007) offer a more comprehensive understanding of visual search. These theories of the basic mechanisms of search are important for understanding the underlying processes of visual cognition and allow for analysis of how more complex searches occur.

Bridging the Gap Between the Lab and the Field

Historical studies and key cognitive theories of search have built a solid framework for further exploration of exactly what guides visual search performance. A recent trend has been to build upon this framework to approximate critical differences between lab and field searches. In doing so, researchers have purposefully deviated from standard parameters employed in typical lab-based search tasks to introduce factors usually found in field-based searches. For instance, in a typical visual search task in the lab, only one target is possible on any given trial, and half of the trials have a target present. Such standards offer ideal experimental control and statistical power; however, they are not necessarily representative of the nature of field searches where more than one target may be present, and targets may be infrequent. Efforts to better approximate conditions in the field have begun including multiple targets and target categories and decreasing the relative frequency of the targets (e.g., in radiology , Berbaum et al. 1998 ; Samuel et al. 1995 ; Franken et al. 1994 ; in cognitive psychology , Menneer et al. 2007; Wolfe et al. 2005 ; Fleck and Mitroff 2007 ; Fleck et al. 2010).

In addition to modifying lab-based search tasks to more directly approximate field conditions, considerable effort has also been dedicated towards advancing technological aids for field searches. Consider, for example, the nature of airport baggage screening; searching X-rays in airports is particularly difficult because of the wide range of potential targets , variability of distractor items, clutter, and potential for purposefully hidden or obscured objects in the search array. When presented with this difficult, but critical, search scenario, it is important to pursue all available means by which to improve performance in both the technology and in the searchers themselves. Key insights have already been offered in terms of how technological advances may help or hurt the human operator by examining interactions between human factors and technology changes (e.g., Bolfing et al. 2008 ; Schwaninger 2006a, b, c ; Schwaninger and Hofer 2004; Schwaninger and Wales 2009 ; von Bastian et al. 2008 ; Wiegmann et al. 2006 ).

Technological advances can improve field searches along several fronts, but search accuracy still relies on the performance of individual X-ray operators. As such, it is important to study the searchers themselves to find additional ways to increase accuracy. Several research projects have brought the lab and the field together to address this by assessing factors that may both positively and negatively affect search performance (e.g., McCarley and Steelman 2006 ; Mitroff and Hariri 2010 ; Neider et al. 2010 ; Schwaninger 2003a, b ; Schwaninger et al. 2005). Contextual and situational factors potentially present during field searches, such as motivation and anxiety, can impact search processes and performance. Some recent work has examined the effects of motivated and anxious conditions on a variety of cognitive processes (e.g., declarative memory, Murty et al. 2011 ), but few studies have investigated the interplay of these factors with visual search specifically. Given the numerous cognitive mechanisms underlying successful search and the complexity of many searches in the field, career searchers may be significantly influenced by situational factors that may induce anxiety or increase motivation. Thus, research has begun to explore the impact of context on performance (e.g., Cain et al. 2011 ) and has determined that such factors can significantly influence search accuracy.

Finally, because of the differences in experience between undergraduates in the lab and career searchers, another technique that has been instrumental in bridging the lab and the field is to test search experts (e.g., radiologists and airport security officers) in a laboratory setting. By controlling for many of the other differences between lab and field, directly comparing the performance between inexperienced searchers and experts on the same task in the same environment allows for the assessment of the effects of expertise on search (e.g., Clark et al. 2011a ; Mitroff et al. 2012 ). Preliminary results of such studies demonstrate an increase in accuracy with expertise and suggest that the experts employ different strategies.

Many open questions stand in the way of directly translating findings from the lab to the field, and vice versa, but there are good reasons to be hopeful. The current state of visual search research suggests that it is possible to successfully bring a result from one realm to the other, and the current goal is to make this process more and more robust.In the ‘Target Prevalence’ section, we discuss four hurdles that present potential problems and strategies for how to overcome them. Specifically, we discuss target prevalence , target number and target category, motivation and anxiety, and level of experience.

Target Prevalence

When Tyler, our hypothetical undergraduate, begrudgingly sat through a visual search experiment in a dark room , as described in the beginning of this chapter, a target was present on half of the trials he viewed. He did not find every target, but the frequency of targets kept him alert. He was not exerting a significant amount of effort, but the fact that he was able to find a target so frequently may have helped to keep him on task. If a much smaller percentage of the trials had contained targets, perhaps Tyler would have been more likely to miss those targets.

Olivia, our hypothetical X-ray operator, has rarely encountered actual harmful items in the bags she inspects . Threatening items are, in fact, so rare that the Transportation Security Administration (TSA) has devised a method of inserting images of harmful items onto the viewing screens, superimposed over real luggage. These images, called Threat Image Projections (TIP) (Schwaninger 2006a, b, c ), are designed to appear as real, dangerous contraband. When the X-ray operator sees a threatening item, he or she pushes the appropriate button on the console. If the item was a TIP image, rather than a legitimately harmful item, the screener is provided with immediate feedback (to avoid detaining an innocent passenger). TIP images are used to keep X-ray operators like Olivia alert, to maintain an index of performance, and to counteract potentially negative effects of the actual low prevalence of targets.

The difference in target prevalence between Tyler’s and Olivia’s searches highlights a potentially major hurdle: lab-based searches typically have a target present on half the trials, and field-based searches rarely have a target present so often. Lab-based searches use 50 % target prevalence levels to maximize statistical power. However, as Olivia experiences, many visual searches conducted in the field do not have this nice balance of 50 % target-present and 50 % target-absent displays. A termite inspector seldom finds pests on routine inspection; a lifeguard, thankfully, rarely deals with a drowning swimmer; border patrol agents do not routinely see individuals trying to illegally cross a border; and pilots almost never detect a physical defect on their routine pre-flight structural inspections.

The same case follows for searches in radiology and airport security screening ; there is not an abnormality in half of the X-ray images viewed by a radiologist , and there is not a dangerous item in half of the bags viewed by a TSA X-ray operator. The numbers are difficult to calculate for airport security screening, but the rate of truly hazardous items is well below 1 % (e.g., Rubenstein 2001 ). The prevalence is a bit easier to determine in radiological screening; it is estimated that only around 0.3 % of routine mammograms contain an abnormality (Gür et al. 2004) . These values deviate substantially from the typical 50 % used in the lab, and a critical question is whether the factor of target prevalence actually has a functional role in visual search performance. While laboratory search performance is usually quite good, an estimated 30 % of malignancies are missed in radiological exams (e.g., Berlin 1994 ; Kundel 1989 ; Renfrew et al. 1992 ). Might target prevalence factors account for some of this disturbingly high miss rate? This question is critically important, as failure to identify targets in rare-target search could be potentially disastrous.

Vigilance tasks bear a strong relationship to rare-target search, as they typically consist of a monitoring task in which events occur at rare and unknown intervals, in contrast to a visual search study in which each trial demands a separate response of absent or present. Early studies with vigilance tasks found that performance declines over time while performing a monotonous task (e.g., Mackworth 1950 ; Parasuraman and Davies 1976 ; Davies et al. 1983 ). Because rare-target visual searches resemble vigilance tasks in the monotonous response of “no target,” it is a reasonable assumption that visual search accuracy could decline over the course of time; for example, as screeners repeatedly determine that X-rays do not contain tumors .

Radiological examinations of target prevalence effects have found conflicting results . One study varied target prevalence from 20 to 60 % and found a much higher accuracy rate in higher prevalence conditions (Egglin and Feinstein 1996 ). However, another study found no difference in performance related to prevalence rates varying from 2 to 20 % (Gur et al. 2003 ); this lower prevalence rate better maps onto the actual rates of screenings and routine examinations. An additional study by the same group (Gur et al. 2007) demonstrated an influential effect of prevalence expectations on confidence ratings following target identification, in which decreasing prevalence tended to increase confidence ratings; yet again, the data indicated no detrimental effect on accuracy.

Given the complexities of the radiological environment, it not easy to directly assess the role of prevalence with radiologists as the participants and radiographs as the search arrays (e.g., Gur et al. 2003). Likewise, it is not easy to test such questions with X-ray operators and luggage X-rays . Prevalence is, however, possible to address in the lab with inexperienced searchers. Using simplified displays and untrained participants, Wolfe et al. (2005) found a robust prevalence effect. Participants searched arrays of line-drawn objects and were to find “tools” amongst distractor shapes drawn from other categories. Each participant completed searches in which a target “tool” could appear on 50 % of the trials (high prevalence), 10 % of the trials (low prevalence) and 1 % of the trials (very low prevalence). Visual search accuracy significantly declined as the target prevalence decreased, suggesting that target prevalence , per se, may have affected performance (Wolfe et al. 2005 ). In the 1 % target prevalence condition (where in participants searched 2,000 individual trials with only 20 actually containing a tool), participants missed 30 % of the targets. While it may just be coincidental, it is nonetheless striking that this number mirrors the probable miss rate from radiology (e.g., Gür et al. 2004). For additional discussion of this point, see Wolfe (2012).

In a typical vigilance task, participants slow down over the course of the experiment (e.g., Buck 1966 ). However, the Wolfe et al. (2005) participants were found to speed up over the course of the 2,000 rare target trials. It is proposed that as participants repeatedly and continuously correctly reject most target-absent trials , the time taken to reject decreases dramatically. In effect, participants may become so accustomed to saying that no target is present that they stop performing a sufficient search to actually find a target, thus causing a high miss rate on the few target-present trials.

The Wolfe et al. (2005 ) finding of a target-prevalence effect with simple displays and inexperienced searchers has the potential to be highly relevant to visual searches in the field. If this effect has been properly modeled in the lab, then manipulations can be tested that might improve accuracy (e.g., motivation ; Navalakkam et al. 2009 ) and more precise methods can be used to better assess why misses occur (e.g., eye tracking ; Rich et al. 2008 ). However, before this lab-to-field link can directly inform visual searches conducted in the field, it is critical to ensure that the link is valid. Does the underlying mechanism of the prevalence effect found in the lab match those of possible prevalence effects in the field?

Follow-up studies have raised concerns about whether the initial prevalence effect found in the lab sufficiently matches prevalence effects in the field (Fleck and Mitroff 2007 ; Li et al. 2011; Madden et al., in press ). Fleck and Mitroff (2007) and Li et al. (2011) suggest that the prevalence effect found in the lab may be an error of response execution rather than a perceptual or identification error. When Fleck and Mitroff (2007) offered participants an option to “correct” their responses on a previous trial, this alone removed a previously found prevalence effect. At least in these studies, participants were able to correct such errors, indicating that they were not actually “missing” the targets perceptually; they were simply responding quickly out of habit. In effect, participants fell victim to a classic “oops” problem—they were quickly responding “no” trial after trial, until suddenly they hit the “no” key when, in fact, they had not intended to do so. Such a physical perseveration or inhibition problem is not likely to underlie a prevalence effect in the field. Fast-paced responding is not a common aspect of radiology or baggage screening, and such searches offer the option to correct mistakes.

The results of Fleck and Mitroff (2007) suggest that lab-based visual searches with simple stimuli may not be able to adequately translate to the complex searches conducted in the field , given that the purported mechanism (a response-based error) is not a part of most field searches. An additional study, however, offered a different conclusion. Support for a prevalence effect was found in a study that employed realistic X-ray luggage images (Van Wert et al. 2009 ), even when participants were offered the option to correct their responses. This suggests that a prevalence effect can be observed in the lab, with the option to correct, as long as the stimuli are sufficiently complex. Moreover, this suggests that prevalence effects are not driven entirely by response-based errors.

A potential problem remains, however, with extrapolating prevalence effects from the lab to the field. While Van Wert et al. (2009 ) clearly involved more complexity than Fleck and Mitroff (2007) and Wolfe et al. (2005), the locus of the complexity is not as clear. Fleck and Mitroff (2007) and Wolfe et al. (2005) used a set of six possible target “tools” and showed pictures of the targets to the participants before the experiment. Van Wert et al. (2009) used 100 images of knives and 100 images of guns and only showed a small subset to the participants prior to the experiment. While this added desired complexity to the stimuli, it also, unfortunately, added complexity to the participant’s task. On 94 % of the occasions when participants used the “correction” option in Fleck and Mitroff (2007) to report that they had pressed the wrong response key by accident, they changed misses (responding “no” when a target was present) into hits (responding “yes” when a target is present). In contrast, when the participants in Van Wert et al. (2009) used the correction option in the low prevalence condition, they primarily (81 % of uses) changed correct rejections (responding “no” when no target was present) into false alarms (responding “yes” when no target was present). This suggests that the participants in Van Wert et al. (2009 ) did not understand what was and was not a target and did not have a sufficient grasp of their task. Ultimately, it is not clear what this means for relating prevalence effects from the lab to the field.

While the effects of a correction option on rare-target search performance remain debatable, additional studies have suggested alternative mechanistic accounts of the prevalence effect that suggest viable connections between lab findings and the field. Further work by Wolfe and Van Wert (2010) demonstrated that not only did searchers’ decision criteria shift toward increasing misses at low prevalence, the reverse criterion shift also occurred with very high target prevalence leading to an increase in false alarms. They also found that target prevalence not only influences the criterion shift, but also the decision of when to stop searching in target-absent trials . Another nuanced study examined the prevalence effect in older adults, who typically exert greater top-down attentional control and more cautious approaches when completing search tasks (e.g., Madden 2007 ). The older adults not only exhibited less severe prevalence effects but also benefited even more greatly from the ability to correct responses (Madden et al., in press). The prevalence effect was also found to vary with the number of response alternatives, as the effect was eliminated in a four-alternative forced-choice task but remained intact in the standard two-alternative forced-choice task (Rich et al. 2008 ). Finally, Lau and Huang (2010) varied instructions given to participants regarding whether there were a high or low number of targets and found that this sort of instruction did not affect performance but that the prevalence effect was driven by the actual distribution of the targets encountered. Furthermore, participants showed the prevalence effect in conditions with a consistent prevalence level, but the miss rate did not increase when the prevalence level varied throughout a block of trials.

These conflicting results highlight the complexities of interpreting visual search performance data. Participants fall victim to the prevalence effect in some cases but not in others. The fact that the prevalence effect differs between younger and older adults (Madden et al., in press ) demonstrates that inherent differences between participants affects search performance and provides a note of caution when trying to translate results from undergraduate searchers in the lab to older, professional searchers in the field. When these findings are taken together, it becomes apparent why interpreting visual search data is almost never straightforward. Both the characteristics of the experiment and the characteristics of the participants can dramatically alter performance results, such that isolating an effect to one specific cause is often impossible. Using a rare-target search task is a far more comparable means by which to consider search performance in the field, but it must be done in an informed way while taking all nuances discussed here into consideration.

Number of Targets and Number of Target Categories

Tyler, our hypothetical undergraduate, is aware there is never going to be more than one target-'T' shape within any display he views. His task is rather simple—he searches for a single target of a single category, and once he finds the target, his search is complete. Tyler does not need to concern himself with additional targets nor additional target types. He is not required to maintain two (or more) separate templates in memory while searching, and once he finds a target, he knows no further searching is required.

Olivia, our hypothetical X-ray operator, is tasked with searching for multiple kinds of items at all times . Not only does she need to identify guns, knives, and bombs, but she also needs to search for other items such as laptops, shoes, and liquids within each bag she examines. Furthermore, when she finds one target in a bag, she cannot terminate her search after the identification of this single target because there is no limit to the number of harmful items potentially present.

The possibility of multiple targets highlights another critical question: does it matter if someone is searching for more than one thing at the same time? The majority of lab-based visual search tasks present participants with well-defined stimuli and ask them to search arrays that contain either zero or one target. However, searches in the field can often contain more than one target type (e.g., either a tumor or a broken bone in a single radiograph) and/or more than one target (e.g., a tumor and a broken bone in a single radiograph). These types of searches, which we will refer to as multiple-category and multiple-target search, respectively, are rarely employed in the lab but are frequently present in the field. Is it possible to generalize from lab-based single-target research to multiple-category and multiple-target field-based searches? What is the cognitive cost of having to maintain in memory more than one target type? Is search performance worse if there may be multiple targets in the same display? We discuss these questions in this section and explore how they may present hurdles for translating research between the lab and the field.

Multiple-Category Visual Search

An X-ray operator is tasked with finding dangerous items and must simultaneously search for guns, knives, bombs, water bottles, and several other potentially dangerous items . Multiple-category visual search has a long history in cognitive psychology (e.g., Kaplan and Carvellas 1965 ; Krueger and Shapiro 1980 ; Menneer et al. 2004 , 2007, 2008, 2009; Neisser et al. 1963 ; Vreven and Blough 1998 ), and several conclusions have emerged. It is clear that there is a negative impact of having to hold more than one potential target in memory (e.g., Gould and Carn 1973 ). When varying the number of possible target categories, larger numbers of target categories led to steeper search slopes (Kaplan and Cavellas 1965) and slower searches overall (Metlay et al. 1970 ).

Kyle Cave and his colleagues have convincingly shown that multiple-category search has a detrimental effect in terms of both visual search speed and accuracy (e.g., Menneer et al. 2007; Menneer et al. 2009 ; Godwin et al. 2010 ). In one study, different groups of participants searched for either one or two colors, one or two shapes, or one or two line orientations (Menneer et al. 2007). Search times were slower, and miss rates were drastically higher in the dual-category trials. In a study that was directly inspired by airport baggage screening, participants had to search X-ray images for either weapons or bombs in separate searches or weapons and bombs in the same search (Godwin et al. 2010 ). They found that there were dual-category search costs in both accuracy and response time and that low-prevalence targets were missed more often than high-prevalence targets, but these factors appear to be additive. This suggests that searchers in the field who are searching for rare targets in many categories may be subject to many sources of miss errors.

The aforementioned work highlights two important points for the current discussion. First, multiple-category search is considerably slower and more error-prone than single-target visual search. Second, lab-based research can inform—and be informed by—visual searches in the field. Experiments using both simplified stimuli (e.g., Menneer et al. 2007 ) and X-ray baggage images (e.g., Godwin et al. 2010 ), have revealed ways in which multiple-category search demands impact performance and have offered suggestions for how to improve real-world searches. For example, Menneer et al. (2007) suggests that because simultaneously searching for multiple types of targets (e.g., guns, knives, and bombs) produces costs for both search speed and accuracy, it may be more effective to have multiple, specialized searchers that are focused on a single target type (e.g., only guns or only bombs).

Multiple-Target Visual Search Findings from the Lab

Multiple-category visual search requires holding more than one item in memory but does not necessarily involve identifying more than one target within the same array. Once a target has been found in a single-target search, the search can immediately be terminated, but what happens when there are potentially more targets? Does the successful detection of one target make a searcher more likely to notice additional targets? Or, does it act as a distractor and impair further search? These are critically important questions, as many visual searches in the field—where misses can be disastrous—have an unconstrained number of targets.

The nature of multiple-target search has been directly examined in the lab in a few studies (e.g., Cain et al. 2011, in press ; Chan and Courtney 1995; Fleck et al. 2010 ; Schneider and Shiffrin 1977 ; Wolfe et al. 2005). One series of studies examined multiple-target visual search accuracy using an array of measures : useful field of view (Chan and Courtney 1995), target discriminability (Chan et al. 2002 ), and the time course of search (Chan and Chan 2000 ). In the first of these experiments (Chan and Courtney 1995), participants were briefly (250 ms) shown a horizontal line of shapes and were to report whether any ‘o’ shapes were present. The majority of the shapes were ‘x’s, but there were a variable number of target ‘o’s. When there were two targets present, participants were more likely to report the target that was presented closer to center and less likely to report the target presented in the periphery than when targets in those same locations were presented as the only target on a trial (Chan and Courtney 1995). This result implies that multiple-target search effectively reduced searchers’ useful field of view, compared to single-target searches. In a version of the task designed to look at the time course of multiple target search, participants again scanned an array of ‘x’s for ‘<’ and ‘>’ but with both targets present on all trials. The time taken to find a second target was much more variable than that needed for the first target (Chan and Chan 2000 ), suggesting that modeling a dual-target search as two, serial single-target searches would not properly reflect actual search behavior. In another accuracy-focused study, participants searched for either two hard-to-spot shapes (‘<’ and ‘>’) or an easy-to-spot and a hard-to-spot shape (‘o’ and ‘v’) among ‘x’s. The presence of a hard-to-spot target impaired detection of an easy-to-spot target more so than the presence of an easy-to-spot target impaired detection of a hard-to-spot target (Chan et al. 2002 ), reinforcing similar findings from radiology (Berbaum et al. 2001 ).

The above studies suggest that several factors can impact multiple-target search accuracy , and an additional study has suggested that the top-down knowledge of a multiple-target search can affect search even before the first target is located . Körner and Gilchrist (2008) compared eye movements between a condition in which there were 0 or 1 targets present and a condition in which there were either 1 or 2 targets present, with participants informed of the conditions. Even before a target was found, participants made more distractor re-fixations in the 1 vs. 2 condition on the trials with just 1 target present than on physically identical 1-target stimuli in the 0 vs. 1 condition. This difference was argued to arise from participants “setting aside” memory for a possible second target before the search began, thus limiting the available memory for which locations have been searched (Körner and Gilchrist 2008 ). This finding suggests that, not only do physical aspects of the search array affect performance, but that searchers’ expectations about the likely number of targets may also affect the efficiency of their search .

Multiple-Target Search in Radiology: “Satisfaction of Search”

The problems accompanying the presence of more than one target and different types of targets (e.g. pulmonary nodules and fractures) have been well documented in radiology (e.g., Berbaum et al. 1998 ; Samuel et al. 1995 ; Franken et al. 1994 ). A classic pitfall, known as “satisfaction of search” (SOS), occurs when the identification of a second target is less successful after the identification of a first target in the same display . SOS has been a topic of radiological research since the 1960s (Tuddenham 1962 ), but radiologists still fall victim to SOS . In fact, 28 % of radiological misses have been attributed to SOS errors, which makes this a critically important problem to solve.

Radiography studies have delineated three possible types of errors contributing to SOS (Nodine and Kundel 1987 ): scanning errors (the search path never encounters the target area, Berbaum 1996, 2005 ; Samuel et al. 1995 ), recognition errors (scanning in the region of a possible target but failing to dwell on the correct area for further inspection, Berbaum 2000 ), and decision-making errors (fixating and dwelling on a possible target but ultimately failing to identify it as a target, Franken 1994 ). To date, evidence has suggested that all three likely contribute to SOS and the latter two explanations differ primarily in the amount of time spent analyzing a potential target. The time required to examine a target stems, in part, from the relatively low spatial frequency of radiographs and radiological targets (e.g., pulmonary nodules), which may require extra analysis to visually parse targets from background noise. By categorizing errors as scanning or decision-making/recognition errors, radiologists have attempted to understand whether SOS arises primarily as a function of a basic perceptual failure to properly scan an image or more of a cognitive failure in determining whether a particular item is indeed a target (Kundel et al. 1978 ).

Several suggestions have been proposed for the causes of multiple-target search errors , and some possibilities include a truncated search (finding one target leads to a non-exhaustive search) and a perceptual set (e.g., once a tumor is detected, the searcher engages a “tumor set” where additional tumors are likely to be spotted but other abnormalities, for example a fracture, are less likely to be spotted, Berbaum et al. 2000 ). However, no clear mechanism has been identified as responsible for the SOS effect. Eye-tracking data within radiological research has indicated that search is not actually terminated early (Samuel et al. 1995 ) and that participants continue to search after the successful identification of a first target; participants may even fixate on a second target but fail to identify it as such. This is confirmed and extended by a recent eye-tracking study done with undergraduate searchers and simplified search displays (Cain et al. 2012a ). The eye-tracking data suggest that SOS errors are likely due to a combination of scanning, recognition, and decision-making errors .

The stimuli used in radiological search studies, however, are actual radiographs, which are highly complex and greatly variable. Furthermore, participants are radiologists who have extensive training and experience with the experimental tasks . In contrast, many lab-based visual search tasks do not involve a heavy decision-making component because the targets and distractors are easily distinguishable from one another (in part so that untrained participants can perform reasonably). Recent work has begun to explore the SOS effect outside of radiology using simplified stimuli and undergraduate participants rather than radiographs and radiologists (Fleck et al. 2010 ) in order to generalize SOS to outside of the medical world .

In seeking to establish the scope of SOS errors in nonmedical searches , Fleck et al. (2010) aimed to understand the cognitive processes broadly involved in multiple-target search. In a series of experiments, basic ‘T’ and ‘L’ shapes were presented at varying degrees of visibility against a cloudy background. The cloudy background and the more–and less-salient targets and distractors served to approximate the noise typically present in radiographs. See Fig. 1. Each trial contained 0, 1, or 2 targets, and participants were to make a localization mouseclick on each target found and then clicked a ‘DONE’ button at the bottom of the screen to terminate their searches.

Fig. 1
figure 1

Sample trial: find the “perfect” T shapes

In previous studies, the SOS effect was typically observed when radiologists were less likely to identify a low-salience target when it was in the presence of a high-salience target than when the same low-salience target was the only target present in the array . Thus, the dual-target trials in this paradigm contained both a low–and high-salience target, and the SOS effect was calculated as the difference between a participant’s accuracy in identifying low-salience targets in single-target trials and his or her accuracy in identifying low-salience targets in dual-target trials, provided the high-salience target had been successfully identified in the same trial. The SOS effect was found to be sensitive to both target prevalence and time pressure, as it was exacerbated when high-salience targets were three times as likely as low-salience targets and when participants had a time limit of 15 seconds per trial (Fleck et al. 2010) .

These findings demonstrate generalized SOS errors in nonmedical searches ; the inclusion of the possibility of multiple targets allows this paradigm to be applied to the field, as it is clear that search processes become infinitely more complex when multiple targets may be present. In order to gain a full understanding of search processes employed in the field, multiple-target searches should be used when exploring other issues related to lab-field differences.

Motivation and Anxiety

Tyler, our hypothetical undergraduate student, is simply not very concerned with his performance on his computer-based visual search experiment . He has little reason to care if he finds every target; regardless of how he performs, he will receive the participation credit for his Introductory Psychology class. There are also no consequences for poor accuracy. In addition to a lack of motivation, he likely has little to no anxiety about how well he does on this task since this really is just some “meaningless” experiment to him.

On the other hand, Olivia, our hypothetical X-ray operator, is strongly motivated to perform well since her job security is, at least partially, based on good performance and accurate searching. She takes great pride in her search abilities, as she has been with the TSA for quite some time and is a seasoned X-ray operator. More importantly, she is well aware of the consequences that might accompany her failure to identify harmful items in her search, and this keeps her motivated. Unfortunately, Olivia finds that she is often quite anxious while at work since passengers are always around her, and they are usually visibly (and often verbally) annoyed and hurried.

Difference in Context: Motivation

In the above scenarios, it is clear that Tyler and Olivia are faced with wildly different motivational contexts. Tyler’s performance has no impact on his life, and there is little reason, beyond personal pride, to perform well. Olivia’s performance can impact her livelihood (e.g., whether she has a job in the future) and others’ lives (e.g., whether they are boarding a plane along with a bomb). Searches in the field are often linked to high-stakes outcomes; a radiologist or X-ray operator could save lives by identifying harmful targets in X-rays . Does performing a life-critical search cause individuals to be more motivated than when completing a lab-based task with no tangible consequences? The primary issue addressed in this section is whether differing levels of motivation affect visual search performance, and if so, how. If higher levels of motivation result in higher levels of performance, then how comparable are unmotivated, inexperienced searchers to highly motivated career searchers? This is a third fundamental hurdle for translating findings from the lab to the field.

Visual search tasks conducted in the lab often reveal a great deal of variability in performance in undergraduate participants. While some of this variability may be tied to differences in underlying search ability, some variability may also result from differences in motivation : Some participants may be intrinsically motivated to perform well regardless of a tangible outcome while others may not be motivated at all. For instance, more conscientious participants are likely to exert greater care and effort when performing the task, even though their levels of performance have no external consequences for them.

Unfortunately, it is not feasible to directly motivate laboratory participants in the same way career searchers are motivated. Participants cannot possibly believe that people’s lives are in their hands nor that their careers depend on their performance in a computer-based experimental task. One reasonable approximation of motivation, however, is performance-based monetary reward. The prospect of receiving money for good performance provides an effective global incentive that, for most people, will increase their interest and effort (e.g., Camerer and Hogarth 1999 ).

In typical lab-based visual search experiments, monetary reward has been used to examine the impacts of motivation on attentional selection (e.g., Libera and Chelazzi 2006 ; Kiss et al. 2009 ), priming (e.g., Hickey and Theeuwes 2008 ; Kristjánsson et al. 2010 ) and attentional capture (e.g., Anderson et al. 2011 ). These studies have presented clear evidence that monetary rewards can improve performance; however, they have primarily focused on changes in the speed of attentional deployment . While this is a critical component of visual search performance, the majority of field-based searches place a larger emphasis on accuracy than on speed. Two recent experiments have employed monetary incentives with a focus on visual search accuracy: one with rare-target visual search (Navalakkam et al. 2009 ) and one with multiple-target visual search (Clark et al. 2011b ).

Navalakkam et al. (2009) investigated whether the prevalence effect (that targets are missed more when they occur rarely than when they occur frequently) could be overcome when participants were sufficiently motivated. Participants searched for a target object in a cluttered scene, with the target prevalence (2, 10, and 50 %) varied across blocks. A typical pattern emerged, with impairments in accuracy at low target-prevalence (e.g., Wolfe et al. 2005). However, when participants were motivated with a monetary incentive, the prevalence effect decreased significantly, restoring detection rates to near optimal levels. It was argued that fatigue, carelessness, and lack of vigilance were not responsible for the prevalence effect, but instead, the prevalence effect was caused by a shifted decision criterion, which could be modified through proper reward (Navalakkam et al. 2009 ).

In the ‘Number of Targets and Number of Target Categories’ section, we described the pitfalls of multiple-target visual search: searchers are less likely to find a target if they have already found another target in the same display (a phenomenon termed “satisfaction of search,” SOS ). This is a potentially dangerous problem that has been consistently observed in both lab-based and field-based visual searches (e.g., see Berbaum et al. 2010 ; Fleck et al. 2010 ). A recent series of experiments (Clark et al. 2011b ) has explored whether SOS errors can be alleviated with the incentive of monetary reward . Can certain motivational frameworks lead to performance differences, and do such differences provide information on how to better structure work conditions for career searchers?

Clark et al. (2011b) employed a multiple-target search tasks that mirrored a paradigm that has previously found robust SOS errors (Fleck et al. 2010, Experiment 3, described in more detail in the ‘Number of Targets and Number of Target Categories’ section) but manipulated the participants motivation by including a monetary incentive. Participants competed against nine other participants, and the “best” performer was awarded an additional $ 50. By simply adding this motivation of a performance-based reward, accuracy improved and the SOS effect was effectively eliminated.

These results raise a curious concern for the translatability of research considering that motivated, inexperienced participants show a decreased SOS effect, yet career searchers such as radiologists , presumably operating while motivated, still exhibit the SOS effect . Clearly, the incentive for the inexperienced participants differs greatly from the incentives for career searchers, but one could argue that a relatively small monetary incentive is qualitatively less than the incentive for career searchers—the chance at winning an extra $ 50 at some point in the next week or two is seemingly less motivating than keeping a job and preventing fatalities. How can this be reconciled? Perhaps despite the immense focus on accuracy for career searchers, the monotony of their daily routines interferes with their motivation . It is possible that the undergraduate searchers could actually be more motivated than the career searchers because they are completing a task for only an hour-long period, over which it is relatively easy to maintain a high level of motivation. Career searchers may not be not equally motivated at every hour throughout their workdays (or weeks, or months, etc.), and the SOS effect is observed may result from an inability to maintain consistently high levels of motivation.

Differences in Context: Anxiety

The dire consequences of missing a target in field searches could be potentially motivational but could also induce anxiety. Anxiety—the displeasurable psychological experience of worry or concern—is difficult to replicate in the lab, but it may be an element in many field searches. Beyond the general anxiety of knowing that missed targets could have life-threatening consequences, there is also more acute anxiety that can occur when searchers anticipate tangible stressors , such as a visit from a supervisor or a large workload. These states of heightened anxiety can be detrimental to accuracy, and anxiety has been linked to a decline in cognitive performance across species (e.g., in mice, Ohl et al. 2003 ; in humans, Eysenck et al. 2007 ).

As discussed earlier in this section, the motivation to earn rewards can significantly improve performance (e.g., Callan and Schweighofer 2008 ; Murayama and Kuhbandner 2011 ), but motivation to avoid punishments can increase anxiety and substantially diminish performance (Davis and Whalen 2001 ; Lang and Bradley 2009 ). Recent work has examined the differing effects of approach and avoidance motivation (earning rewards and avoiding punishments, respectively) on declarative memory (Murty et al. 2011 ). While approach motivation enhanced memory performance, avoidance motivation hindered performance, and this effect was especially amplified in participants who showed high levels of arousal. In the lab, the experience of anticipatory anxiety can be induced using a “threat of shock” paradigm , in which electrical shocks are administered at unpredictable intervals, unrelated to performance (e.g., Grillon et al. 2004 ; Rhudy and Meagher 2000 ). A recent study has found this type of anticipatory anxiety to be specifically detrimental to multiple-target visual search performance (Cain et al. 2011 ).

Cain et al. (2011) used a variant of a standard multiple-target search paradigm (Fleck et al. 2010, Experiment 5) in which an SOS effect was not expected. When participants were anticipating a neutral event (an innocuous tone), they did not show SOS (which replicates the previous instantiation of these particular experimental parameters). However, when those same participants were anticipating a negative event (an electrical shock) they produced SOS errors. Interestingly, the participants did not show a difference on single-target performance between the non-anxious and anxious blocks of trials—the SOS effect was due solely to poorer second-target identification (Cain et al. 2011 ). Moreover, this effect was modulated by the level of anxiety that participants were experiencing at the start of the experiment . Less anxious participants showed high levels of SOS when anticipating a shock but no SOS in the control condition, while more anxious participants showed mild SOS throughout the entire experiment, regardless of condition. These results suggest that both acute and generalized anxiety could negatively affect search performance in the field by inducing SOS errors. Thus, efforts should be made to shield professional searchers in the field from anticipatory anxiety in order to improve target identification in multiple-target displays. This sort of anxiety potentially poses an extra risk of misses in searchers with post-traumatic stress disorders or clinical anxiety disorders given that these individuals have been shown to be more likely to generalize specific causes of anxiety to the environment itself (e.g., Fanselow 1980 ; Grillon et al. 1998) .

Just as the prevalence effect can be overcome with the proper motivation , multiple-target search appears to be influenced by contextual conditions. Performance on single-target searches in both motivated (Clark et al., in press) and anxious (Cain et al. 2011 ) conditions were unaffected by context, and influences were seen only on multiple-target conditions. The complex mechanisms responsible for the SOS effect may simply be more sensitive to contextual influences, and the motivation and anxiety inherent in career searches may work both for and against performance.

Though SOS can be eliminated in the laboratory via monetary incentive, SOS remains a problem in the field. This could be attributable to the monotony of the daily grind detracting from the value of motivation , but the anxiety associated with career searching could also contribute negatively. While motivation appears to positively affect performance, anxiety may serve as a hindrance, and it is important to take all of these factors into account when evaluating differences in searching between the lab and the field. Furthermore, enhancing motivation while decreasing anxious circumstances may be the best combination of contexts for optimal search performance.

Level of Experience

Tyler, our hypothetical undergraduate, searches for items in his everyday life—the books for his classes, the keys to his dorm room, and his cell phone. He rarely, if ever, dedicates any sort of mental effort toward improving the efficiency of these searches since they are mundane and generally completed successfully. Tyler has also never been trained on how to conduct visual searches to increase accuracy, nor does he regularly spend hours at a time conducting visual searches (e.g., he usually finds his keys within a minute or two at the most).

Olivia, our hypothetical X-ray operator, has undergone extensive training in order to improve her search skills. Additionally, she spends hours every workday actively conducting visual searches as she scans bag after bag. She has years of experience in search that have allowed her the opportunity to increase her ability to scan X-ray images for harmful items, allocate her visual attention more effectively, and utilize any superior strategies she may have developed.

The above vignettes about Tyler and Olivia highlight the last major hurdle we discuss in this chapter—experience. Many career searchers have years of training and experience on specific search tasks, and it is important to understand how this might influence their abilities. It is not clear exactly how career searchers’ levels of experience may affect their performance, both on their typical job-related searches and on search tasks more generally. How might search expertise on the job translate to search performance on standard lab-based search tasks? Which conclusions drawn from inexperienced undergraduates, without extensive training, are applicable to the field?

Trained professionals are often better at visual searches related to their jobs than are novices (e.g., farmers improve their ability to sort chickens by sex with experience, Biederman and Shiffrar 1987 ; wine connoisseurs learn to discriminate between fine wines, Bende and Nordin 1997 ; bank tellers are better than the general public at detecting counterfeit currency, Klein et al. 2004 ; and chess players are better able to see patterns of moves on a chessboard, Chase and Simon 1973 ). This apparent benefit of experience leads to two key questions: What are the bases for these expertise differences ? And how can the differences be accounted for when assessing the performance of inexperienced searchers in an attempt to translate from the lab to the field? Observing how expertise may alter both trained task performance specifically, and visual/cognitive abilities more generally, has the potential to inform questions about visual search as well as the general malleability of cognitive abilities.

Perceptual Training in the Lab

In most cognitive psychology studies , a participant (like our hypothetical undergraduate, Tyler) arrives in the lab, runs through a minute or so of practice, and then completes an hour-long study. They are then dismissed and may never think about the task again. The experimental results provide a useful assessment of performance but do not allow for an investigation of learning. One class of experiments, however, is focused primarily on learning effects. In perceptual learning experiments , a research participant may make several visits to the lab and undergo thousands of trials of the same specific task so that they ultimately receive extensive training .

Research in perceptual learning has shown that it is possible for very basic visual abilities to change with experience . If a participant is asked to make a difficult visual discrimination over many trials—often spread out over days—his or her threshold for discrimination will decrease dramatically (Westheimer and McKee 1978 ); these changes are often attributed to plasticity in primary visual cortex (V1). There has been evidence for a host of sensory and perceptual improvements in which basic feature discrimination improves with extensive practice of a task. Participants improve in discriminating the orientation of a line (Ramachandran and Braddick 1973 ; Fiorentini and Berardi 1981 ; Matthews and Welch 1997 ), identifying the direction of motion (Ball and Sekuler 1982, 1987 ), and show increased vernier acuity (Westheimer and McKee 1978; Saarinen and Levi 1995 ; Beard et al. 1995 ). In all of these cases and in others (e.g., Vogels and Orban 1985 ; Karni and Sagi 1991 , 1993; Poggio et al. 1992 ; Fahle and Edelman 1993), learning is specific to the stimulus on which the participant was trained. In fact, a hallmark aspect of perceptual learning is that the training effects appear to be quite specific . Because the learning is believed to take place at such a basic perceptual level, improvement is only seen when examining performance on the exact trained stimulus. If, after training, participants showed an overall improvement in a task, beyond that of the trained stimulus (e.g., he/she was trained to identify rightward motion, but also improved in identification of leftward motion), more generalized training would be said to have occurred.

Generalized Training

Perceptual learning studies in the lab have produced highly specific training effects, but this does not directly inform generalized learning effects . Given the uncontrolled and variable nature of field-based searches, expertise gained through career searching likely produces more generalized benefits . In airport security screening , for example, X-ray operators never search two entirely identical suitcases, so they cannot rely on simple sensory-level template matching to successfully identify targets . The experience gained through their daily training relies on improvements that can transfer from bag to bag. Furthermore, visual search, even its most simplistic, laboratory form relies on the integration of both sensory perception and strategic attentional allocation.

One of the few perceptual learning studies to demonstrate generalized learning used a visual search task (Sireteanu and Rettenbach 1995 ) . Training accumulated over the course of the experiment resulted in improved search efficiency, even on untrained stimulus sets. Perhaps because of the complex attentional processes required for effective visual search, the learning occurred in a less specialized manner ; visual search involves cognitive processes more complex than basic sensory discrimination, so the improvement likely occurred at a level that can generalize beyond the perception of one specific stimulus . For example, search efficiency may improve via changes in strategies—participants may learn to better distribute their attention, disregard irrelevant cues, or react quickly to relevant ones.

Because search relies on strategies and attentional processes, it is impossible to isolate learning for visual search tasks to the sorts of low-level feature discrimination improvements seen in classic perceptual learning tasks . As such, “perceptual learning” has recently undergone a redefinition, which encompasses even strategic and attentional improvements. Using visual search paradigms to study trained improvement allows for a more comprehensive investigation of the many elements of attention that can be improved with training.

Generalized Learning via Action Video Game Playing

Basic sensory perceptual learning cannot allow for improvement in skills in the field , where search arrays consistently vary. Certain experiences, however, have been found to elicit improvement in a wide variety of skills and are far more generalized than basic perceptual learning processes. Extensive experience with specific activities can influence perceptual and attentional abilities that generalize beyond those activities, and a host of studies have shown that those who regularly play action video games (usually an average of 6 or more hours per week for at least 6 months) show improved performance on a variety of tasks. Specifically, when compared to those who did not regularly play action video games, avid action video game players respond more rapidly (Castel et al. 2005 ; Dye et al. 2009 ; Orosy-Filders and Allan 1989; Yuji 1996 ), have improved spatial abilities (Okagaki and Frensch 1994 ; Quaiser-Pohl et al. 2006 ; Terlecki and Newcombe 2005 ), have enhanced temporal abilities (Donohue et al. 2010 ; Green and Bavelier 2003, 2006b, 2007 ; West et al. 2008 ), can enumerate briefly displayed items more quickly (Green and Bavelier 2006b), can switch between tasks faster (Cain et al. 2012b ; Karle et al. 2010 ), and have enhanced eye–hand coordination (Griffith et al. 1983 ).

Studies exploring the causal role of video game playing have trained non-gamers on action video games and shown improved performance (e.g., De Lisi and Cammarano 1996 ; De Lisi and Wolford 2002; Dorval and Pepin 1986 ; Green and Bavelier 2003, 2006a, b, 2007 ; however, see Boot Kramer et al. 2008 for lack of training effects; and Nelson and Strachan 2009 for more nuanced training effects ). The issue of causality explores an important mechanistic explanation of gamers’ benefits, but regardless of the causal nature of such benefits, differences between gamers and non-gamers have been reliably demonstrated.

However, there is a mechanistic question regarding these differences; two feasible accounts have both received support and are not mutually exclusive. The basic-sensory hypothesis suggests that action video game exposure trains better “vision” and “attention,” honing basic abilities (e.g., Dye et al. 2009 ; Green and Bavelier, 2006a, 2007 ; Li et al. 2009 ; West et al. 2008 ; Caplovitz and Kastner 2009 ). According to this hypothesis, gamers may have an increased capacity to process visual information compared to non-gamers. Alternatively, the improved-strategy hypothesis suggests that video game playing leads to the development of enhanced higher-level abilities such as attentional control (Cain et al. 2012b ; Chisholm et al. 2010 ; Hubert-Wallander et al. 2010b ), shifts in attentional allocation, and improved strategy (Clark et al. 2011c ) for generalized use across a variety of visually demanding tasks. In line with this account, gamers need not necessarily have an increased information-processing capacity but rather could be better able to use what resources they have to process perceptual information (e.g., Colzato et al. 2010 ).

Generalized Learning via Stroboscopic Training

In addition to video-game learning , stroboscopic training has been shown to improve visual cognition abilities (Appelbaum et al. 2011 ; Appelbaum et al., in press). Stroboscopic, or intermittent, vision is the process of presenting an individual with snapshots of the visual environment rather than a continuous visual experience. Training in such a visual environment can alter perceptual-motor abilities (e.g., Bennett et al. 2004 ; Mitroff et al., in press; Smith and Mitroff, in press ), and recent work suggests it can influence visual attention and memory as well. For example, in Appelbaum et al. (2011) , participants trained on sports activities (e.g., playing catch) while either wearing transparent eyewear or stroboscopic eyewear that occluded vision at regular intervals. Before and after training , participants completed computer-based tasks without the eyewear. In one task, participants viewed patches of moving dots presented either centrally or peripherally and reported which of two sequentially-presented patched had coherent motion. Those participants who wore stroboscopic eyewear during training showed greater test-retest improvements on motion coherence sensitivity for centrally presented patches than participants who wore transparent eyewear, but no effects were seen for peripherally presented motion (Appelbaum et al. 2011). In another task, a useful field of view experiment, participants were briefly (~ 90 ms) shown a central letter and a dot in one of 24 peripheral locations. After a masked delay, they were asked to report the location of the dot and whether the central letter was upper or lower case. While the central task was primarily intended as a fixation control, the participants who trained with stroboscopic eyewear showed significant test-retest improvement at accurately reporting the case while the control group did not. No differences were found in peripheral performance for either group. Taken together, these results suggest that stroboscopic training may lead to generalized perceptual improvements, particularly in the center of the visual field (Appelbaum et al. 2011) .

Career Training and Visual Abilities

Another form potentially generalized training occurs on the job; radiologists , for instance, spend years learning how to properly scan radiographs. Medical searches are among the most commonly studied visual searches in the field and have provided evidence for both specialized and generalized learning . Trained orthodontists are better able to detect subtle facial asymmetries than general dentists or lay people (Kokich et al. 1999 ), suggesting a specific enhancement in their trained skill set. Similarly, radiologists and cytologists are better able to detect abnormalities in medical images than inexperienced searchers, but they do not have enhanced memory for these abnormalities nor are they better at simple scene or object detection, suggesting specific training benefits. However, surgeons who regularly engage in video game playing were found to perform better at assessments of laparoscopic surgery (Rosser et al. 2007 ), suggesting a generalized benefit.

Radiologists typically have years of experience searching medical radiographs for abnormalities, but research in radiology shows they still fall victim to many of the same types of errors as inexperienced searchers . The studies of radiological visual search described in the ‘Number of Targets and Number of Target Categories’ section focused on the satisfaction-of-search (SOS) effect using radiologists as participants and real radiographs as test stimuli. Using actual radiologists and the stimuli they normally view to address research questions is entirely sensible, but it limits the ability to compare performance across different real-world expert populations . Showing an inexperienced searcher a radiograph or an X-ray of a bag may not be the best way to assess abilities since there would be a baseline difficulty that could mask their performance. One way to more directly compare abilities from the lab to the field is to use simplified displays for all participants, and a few recent studies have done so. One study used simple visual search arrays to study SOS in both undergraduate participants and training radiologists (Clark et al. 2011a ), and another used simple visual search arrays to compare and contrast undergraduates and working airport baggage screeners (Mitroff et al. 2012 ).

To compare performance between radiologists and inexperienced searchers, Clark et al. (2011) administered a simplified multiple-target search task (e.g., Fleck et al. 2010 ) to both trained radiologists and to undergraduate students. A broad analysis of the data indicated that, perhaps surprisingly, radiologists and inexperienced searchers did not differ in overall search accuracy; the percentage of trials that they completed correctly (no misses, no false alarms) was not significantly different. However, the radiologists spent significantly longer per trial than did the undergraduates, and they frequently exceeded a trial time limit (15 s), while the undergraduates searchers hardly ever did so. The inexperienced searchers were actively deciding they had finished searching and electing to terminate their searches, while the expert searchers may have just run out of time while attempting to complete a more thorough search. When only assessing performance on trials in which participants indicated they had completed the search before the time limit, radiologists were more accurately able to detect the presence of a second target in a display, showing a reduced satisfaction of search effect relative to the inexperienced searchers .

Returning to the key hurdle discussed in the ‘Motivation and Anxiety’ section, different levels of motivation between searchers in the lab and in the field, it is worth considering whether the above differences between radiologists and inexperienced searchers might stem from radiologists simply caring more about their performance. Perhaps they took longer to respond and were more accurate because they were more motivated to perform well. This is a general concern for any such comparison, and one way to address this is to have the participants perform an additional, orthogonal task that does not tap into the specific skill in question. This was done in this study, with all participants also completing a control task, on which they made judgments about the temporal order of appearing squares. No differences between groups were found on this control task, which helps dampen the motivational concerns.

Experience appears to dramatically impact performance on cognitive tasks, but not in a simple, straightforward manner. Because of the perceptual variability from X-ray to X-ray, it is unlikely that any improvement would stem from enhanced basic sensory abilities. Instead, it appears that improvement in strategy or better attentional allocation may contribute to the improvement that comes with experience. Expert searchers are going about their searching in very different ways from inexperienced searchers and are likely more effective as a result. However, the mechanisms responsible for these differences remain largely unclear.

Discussion and Conclusions

The goal of this chapter was to explore ways in which visual search findings can translate between the lab and the field. On one side, a tremendous number of visual search experiments have been conducted in the lab, and the data have served as the basis for intricate and powerful theories of search. On the other side, career searchers conduct visual searches daily and are constantly looking for ways to improve performance. The critical question is whether each side can inform the other. Can cognitive theories and data be used in the field to guide and inform search practice? Likewise, can the nature of field-based searches be analyzed to further refine cognitive theories?

At first blush, it would be easy to say that searches from the lab and the field are not compatible given the vast differences between the manner in which search research is typically conducted in the lab and how search is performed in the field. However, such a conclusion would be both pragmatically unfortunate and empirically premature. Four significant hurdles were discussed in this chapter, and while each raises a critical concern when attempting to use lab-based findings to improve searches in the field, all show that with proper consideration, they can be overcome.

Target Prevalence

The overwhelming majority of published cognitive psychology studies on visual search have employed paradigms in which targets appear on a substantial percentage of the trials . Yet, many field-based searches rarely have a target present (e.g., there is not a gun in the majority of baggage X-rays). While this difference initially presented itself as a critical hurdle for translating between the lab and the field, a number of studies have now explicitly focused on the impact of target prevalence (both in cognitive psychology searches and in radiological searches). The hope is that, with careful experimentation, the effects of target prevalence will be isolated so that researchers can explore this topic for its own sake, but can also explore field-related visual search questions without prevalence serving as a confound. Several recent studies have made significant advances along this front, and target prevalence may no longer serve as a critical hurdle.

Number of Targets and Target Categories

As for target prevalence , most cognitive psychology studies have employed search arrays that only have one possible target at any given time . Likewise, most lab-based search experiments inform the searcher of their exact target prior to the start of the search. In contrast, most searches in the field can have an unconstrained number of targets and targets from multiple different categories. This difference is potentially devastating given that the possibility of multiple targets within a search array could have broad influences on search strategies and accuracy. However, recent efforts have explicitly married lab–and field-based searches (e.g., Fleck et al. 2010 ), finding numerous commonalities with multiple-target visual searches. This not only suggests that field-based visual search can learn from searches in the lab, but, that even more so, career searchers can partner with cognitive psychologists to take advantage of the benefits afforded by testing in the lab. It is not easy to experiment with working radiologists or X-ray operators, so anything that can be tested out in the lab, such that it will translate, can be profoundly helpful. With the knowledge that multiple-target search in the lab can translate to the field, researchers are now equipped to use these search paradigms to explore the additional differences between the lab and the field.

Motivation and Anxiety

The contexts in which searches are performed in the lab and the field are extraordinarily different, given both added motivation and added anxiety when conducting life-critical searches. In examining the effects of contextual motivation and anxiety in the lab, it becomes especially apparent why the deviations from standard search paradigms must be employed in order to properly investigate the effects of these factors in the field. In a multiple-target search paradigm—in which some trials only have one target, but other trials have more than one target— there were no differences in performance on single-target trials in motivated vs. non-motivated conditions (Clark et al. 2011b ) and in anxious vs. non-anxious conditions (Cain et al. 2011 ). In both cases, the differences were only apparent in dual-target trials. As is known from the investigation of multiple-target search in general, complex mechanisms may be interacting to cause performance differences that basic single-target searches are simply not sensitive enough to show. Searches in the field can contain more than one target and can be conducted in motivated and/or anxious contexts; by examining the effects of these factors in tandem, meaningful conclusions can be made about how these contexts may affect performance on field searches.

Levels of Experience

\Finally, experts have demonstrated vastly different performance on a variety of tasks when compared with inexperienced searchers. Even when the results may appear similar between the two groups, more sensitive analyses reveal that expert searchers could be approaching the tasks very differently. In these cases, one must be careful not to jump to conclusions about expert searchers from what is observed with undergraduate, inexperienced searchers. Other tasks, however, may demonstrate similar processes between the two groups. Early work used real radiographs and actual radiologists to explore performance in the lab, but since inexperienced searchers do not have the experience to identify targets in such stimuli, researchers must use simplified displays accessible to both groups. By using these tasks, researchers can test both inexperienced and expert searches, and if in some tasks, performance is similar between the groups, conduct follow-up experiments using inexperienced searchers and potentially draw conclusions about experts from these data.

Despite the tremendous differences between lab and field searches, these hurdles are not insurmountable. Experimental search research is invaluable to the applied world, but only with an acknowledgement of the differences and shortcomings. By modifying the parameters of search tasks in the lab to account for the differences in target distributions in the field, by adding contextual factors present in the field such as motivation and anxiety to tasks in the lab, and by exploring the differences in performance between inexperienced and expert searchers, researchers are able to appropriately examine visual search processes as they exist in the applied world. While these are lofty requirements, ideally all examined simultaneously, with careful experimentation, we can understand the contributions of the individual factors and how they may interact. By accounting for all of these differences, we have the ability to use the performance of Tyler, the undergraduate, to inform and improve work conditions for Olivia, the X-ray operator.