1 Introduction

International nuclear safeguards are measures that provide assurance to the global community that nations are using nuclear technologies for peaceful purposes. The International Atomic Energy Agency (IAEA), which operates under the auspices of the United Nations, is the agency tasked with verifying States’ safeguards agreements. A State declares nuclear materials and facilities, and the IAEA periodically verifies those declarations to ensure that nuclear materials are not being diverted from known (safeguarded) facilities and that safeguarded facilities are not being misused for undeclared nuclear purposes. The IAEA also works to detect any undeclared nuclear activities within a State.

The basic verification method used by the IAEA is nuclear material accountancy (NMA), which is achieved through nuclear materials measurements and examination of records and reports. The IAEA also inspects nuclear facilities to determine operational status, design, and production capacity. Containment and surveillance technologies, such as seals and cameras, are applied to maintain continuity of knowledge for nuclear materials between inspection intervals.

During a facility inspection, IAEA inspectors complete tasks such as verifying that seals have not been tampered with, verifying the inventory of nuclear material by checking seal numbers against the IAEA and facility records, comparing the facility’s records to the State’s declarations to the IAEA, taking material measurements, and looking for any anomalies in a facility that may indicate misuse. Safeguards inspections are physically and cognitively demanding [1]. The inspectors work under time pressure in an industrial environment that may be loud, hot, and/or cramped, in addition to containing radiological hazards. For many aspects of the inspections they must wear protective gear, which makes it more difficult to manipulate the tools they need to take samples and record observations. They may not share a common language with the facility operators and are often dealing with jet lag on top of the demands of the working environment. While working in this challenging environment, the inspectors must take care to record accurate data and notes, while also being on the alert for any subtle discrepancies or indications of unusual activity in the facility.

Despite the importance of an IAEA inspector’s role and the cognitive demands of the job, there has been very little application of cognitive science research to this domain [2]. To address this gap, our research team conducted an evaluation of key safeguards inspection tasks to identify where cognitive science research methods could be applied to support the inspectors’ cognitive processing in the field [3]. One of the areas that we identified was visual inspection. IAEA inspectors must complete several types of visual inspection tasks, including paper-based tasks, such as comparing the facility’s inventory and shipment records to State declarations and records from prior inspections, and object-based tasks, such as finding seal numbers on containers of nuclear materials and checking them against IAEA records.

There has been a great deal of research on visual search in general [4], as well as visual search of lists [5, 6] and visual inspection in industrial contexts [7,8,9,10,11,12]. These studies have shown that factors such as the work environment [9, 12], task structure [9, 11], feedback [7], list formatting [5] and other types of job aids [8] impact visual search and inspection performance. Although safeguards inspectors do not have control over many of these factors, it may be feasible to format their own records and inspection-related materials in ways that make their visual inspection tasks faster and easier. We conducted a series of experiments to test the impact of changing the formatting of the inspector’s materials on the speed and accuracy of their inspection performance.

In our first study in this area [13], we developed a computer-based mock inspection task. Participants saw two lists of seal and container numbers displayed side-by-side on the screen. One of these lists was designated the “inspector’s list” and one was the “facility’s list.” Participants were tasked with checking all the items on the inspector’s list against the facility’s list and marking which seals were present, which were missing, and which (if any) were anomalous in other ways. While the facility’s list was always presented in a random order, we altered the presentation of the inspector’s list by changing its order and color coding, two factors long known to impact visual search performance [5, 14,15,16,17]. The experiment found that participants had equally high accuracy across all of the list presentation conditions, but very different response times. Participants were fastest when the order of the seals on the inspector’s list matched the order of the seals on the facility’s list. When the order did not match, participants benefited from having color coding to narrow their search of the facility’s list, which significantly improved their response times.

Given the results of this study, we expanded this line of research to other types of inspection tasks. In [13], participants were tasked with verifying all of the seals on the facility’s list. However, in real-world IAEA inspections, it is common for inspectors to check a randomly determined, statistically representative subset of the seals in a facility during each inspection. Checking a subset of the full list may change how the inspectors use the lists, which may in turn impact which list presentation conditions lead to the biggest benefits to the inspectors’ performance. The work presented here addresses this scenario.

Experiment 1, like our prior study, involved a list-to-list comparison activity, except that participants were only looking for a subset of the seals in the facility’s list. Experiments 2 and 3 represented inspection activities in which the inspectors must walk through a facility to check a list against physical items, such as sealed containers. Although these were computer-based tasks, they were designed to mimic list-to-item inspection activities. In Experiment 2, participants had a list of seals to verify but could only view one sealed container at a time, mirroring the process of checking a subset of the sealed containers in one room of a facility. In Experiment 3, participants used a map to navigate between different “rooms” in a facility. Across all three experiments, the information provided to the participants was manipulated to determine the impact of list order and different types of information about a seal’s likely location on the speed and accuracy of the inspection.

2 Experiment 1

In Experiment 1, participants were given a mock safeguards inspection task in which they were asked to compare two lists to ensure that the information matched. The “inspector’s list” contained a subset of half of the items on the “facility’s list,” mimicking a paper-based inspection task in which inspectors verify a representative subset of a facility’s records. The order and color coding of the inspector’s list was manipulated across six conditions and participants were assessed in terms of their accuracy and response times for each condition. Eye tracking data were collected to identify any differences in inspection strategy across the list presentation conditions.

2.1 Method

Participants.

Nineteen participants were recruited from the employee population of Sandia National Laboratories and were compensated for their time. Four participants were later excluded from the analysis due to dropped or noisy eye tracking data. The remaining 15 participants (10 female) had an average age of 32 years. Four of the participants held a high school degree, three held a bachelor’s degree, four held a master’s degree, and three held a PhD.

Materials.

As in our prior study [13], the experimental materials consisted of six sets of lists containing seal numbers and container numbers. The inspector’s list, presented on the left side of the computer screen, contained 18 pairs of seal and container numbers arranged in two columns. The facility’s list, presented on the right side of the screen, contained 36 pairs of seal and container numbers arranged in four columns. See Fig. 1 for an example.

Fig. 1. Example of the screen layout used in Experiment 1. This example shows the color-coded facility order condition. (Color figure online)

The seal numbers were six-digit numerical strings. Within each condition, the first digit of the seal number was always the same and the five final digits were pseudorandomly generated such that every digit (0–9) appeared approximately the same number of times in each position. This was done to avoid any patterns within the seal numbers that could have made some numbers more memorable than others. The container numbers consisted of two letters and two numbers, separated by a hyphen, such as “AB-37.” Each container number was unique, although the same letter pairs appeared in multiple container numbers.
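
To make the balancing scheme concrete, here is a minimal sketch of one way to generate such seal numbers. The per-position shuffling approach and all names are our illustration; the paper does not describe its actual generator.

```python
import random

def generate_seal_numbers(n, first_digit="4", length=6, seed=0):
    """Generate n seal numbers whose trailing digits are balanced by
    position: for each variable position, a pool with every digit 0-9
    repeated ~n/10 times is shuffled, so each digit appears roughly
    equally often in each position across the set."""
    rng = random.Random(seed)
    columns = []
    for _ in range(length - 1):
        pool = [str(d) for d in range(10)] * (n // 10 + 1)
        rng.shuffle(pool)
        columns.append(pool[:n])
    seals = {first_digit + "".join(col[i] for col in columns)
             for i in range(n)}
    while len(seals) < n:  # top up in the unlikely event of duplicates
        seals.add(first_digit +
                  "".join(rng.choice("0123456789") for _ in range(length - 1)))
    return sorted(seals)

print(generate_seal_numbers(36))
```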

On the facility’s list, there were 20 filler items that did not appear on the inspector’s list. The remaining items corresponded to the conditions outlined in Table 1. Some of the conditions contained transposed digits, which were intended to make the inspection more difficult. If participants did not pay close attention, they could mismatch or mis-categorize these items, which could lead to confusion later in the inspection process. The types of transpositions were the same as in [13].

Table 1. Conditions and example items for Experiment 1.

The experiment consisted of six inspection tasks. The seal-container pairs in the facility’s list were always presented in a random order, but the presentation of the information on the inspector’s list was manipulated across the six blocks. The items on the inspector’s list appeared in one of three orders: random order (fixed so that it was the same for all participants), numerical order, or facility order (in which the seals were presented in the same order as those in the facility’s list). There were also two color-coding conditions. In half of the blocks, all the list items were presented in black font. In the other half, each column of the facility’s list was assigned a color and the items on the inspector’s list were color-coded according to which column of the facility’s list contained the corresponding seal-container pair. The ordering and color-coding conditions were fully crossed, creating a 3 × 2 within-subjects design.
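
For concreteness, the design can be enumerated directly. The following sketch is illustrative only: the names are ours, and the rotation-based counterbalancing stands in for the paper’s unspecified pseudorandom block ordering.

```python
from itertools import product

ORDERS = ("random", "numerical", "facility")
COLOR_CODING = (False, True)

# Fully crossed 3 x 2 within-subjects design: one block per cell.
CONDITIONS = list(product(ORDERS, COLOR_CODING))  # six blocks

def block_sequence(participant_id: int):
    """One simple counterbalancing scheme: rotate the six cells by
    participant number. The paper's exact pseudorandom ordering is not
    specified, so this is only illustrative."""
    k = participant_id % len(CONDITIONS)
    return CONDITIONS[k:] + CONDITIONS[:k]

print(block_sequence(2))
```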

During each block, the background color of the screen changed two to three times. The possible colors were purple, blue, and teal (examples of the colors are shown in Figs. 1, 4 and 6). The changes were linked to specific seals, such that after a participant clicked on that seal, the background color would change on the next trial. The seals that triggered the color changes were different for each block. The color change detection task was included to encourage participants to maintain their situational awareness by attending to a secondary task while completing their primary inspection task. This mimics the work of safeguards inspectors, who must maintain their overall situational awareness in addition to completing their inspection tasks. There were relatively few color changes per block, so the participants’ accuracy on the color change detection task was not analyzed separately for the different inspection conditions.

Procedure.

After giving their informed consent, participants were seated in a dimly lit, sound-attenuating booth so that their eyes were 80 cm from the computer monitor. Participants completed a practice session that explained the task and allowed them to complete a shortened version of the inspection. After the practice block, the eye tracker was calibrated. Eye tracking data were collected with a Fovio eye tracker and recorded and analyzed with EyeWorks software. The participants completed a five-point calibration sequence; the accuracy of the calibration was then assessed by the experimenter, and the sequence was repeated if necessary. The calibration process was repeated prior to each block.

The participants completed the six blocks of the experiment in a counterbalanced, pseudorandom order. Each block began with a description of how the inspector’s list would be organized. The participants were instructed to check off each item on the inspector’s list. When they clicked on an item in the inspector’s list, four response choices appeared in the center of the screen. The choices were “Seal present, correct container,” “Seal present, incorrect container,” “Seal missing” and “Other issue.” Participants clicked on one of the four choices to indicate their response for that seal. After a response was recorded for a seal, that seal was grayed out on the inspector’s list to indicate that it had been checked off. Following each response, a fixation cross was presented in the center of the screen for 1.5 s, initiating the next trial.

Participants were instructed to click on a button labeled “Color Change” as soon as they noticed a change in the background color. Clicking on the “Color Change” button also initiated a new trial. Once participants had checked off all the seals on the inspector’s list, they clicked the “Inspection Complete” button. Then they were asked how many times the background color changed during the inspection task. Their choices ranged from zero to four.
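
The trial flow can be summarized schematically. The sketch below is a simplification, not the actual experiment code: the participant’s clicks are simulated by a callback, items are visited in list order although real participants could click them in any order, and the fixation cross and color-change button are reduced to comments.

```python
import random
import time
from dataclasses import dataclass

RESPONSES = ["Seal present, correct container",
             "Seal present, incorrect container",
             "Seal missing", "Other issue"]

@dataclass
class ListItem:
    seal_id: str
    checked: bool = False

def run_block(items, color_change_seals, respond):
    """Schematic trial loop for one inspection block. `respond` stands in
    for the participant, returning one of the four response choices."""
    log = []
    change_pending = False
    for item in items:
        # (a fixation cross would be shown for 1.5 s here)
        if change_pending:
            change_pending = False   # background color changes on this trial
        t0 = time.monotonic()
        choice = respond(item)       # participant clicks one of the choices
        item.checked = True          # the item grays out on the list
        log.append((item.seal_id, choice, time.monotonic() - t0))
        if item.seal_id in color_change_seals:
            change_pending = True    # color change takes effect next trial
    return log

items = [ListItem(f"4{n:05d}") for n in random.sample(range(100_000), 18)]
log = run_block(items, {items[5].seal_id},
                lambda item: random.choice(RESPONSES))
print(log[:2])
```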

Finally, participants were asked to describe their search strategy. The instructions stated: “Please give a brief description (2–3 sentences) of the strategy that you used for this inspection task. For example, which list did you start from? What visual cues were you looking for? Did you ever switch to a different strategy during the task, and if so, what was it?” The participants typed their answer in a text box on the screen. Upon finishing each inspection task, participants were given a short break. In total, the experiment lasted 1–1.5 h, depending on how fast the participants completed each task.

2.2 Results

Accuracy.

Across all blocks, the participants detected an average of 61% (SD = 41%) of the background color changes in real time and 79% (SD = 21%) of the color changes when asked at the end of the inspection to report the total number of changes that had occurred. The participants were not penalized for overestimating the number of color changes. These results indicate that the participants generally maintained their awareness of the secondary task.

For the primary task, the seal checking task, participants performed near ceiling on all inspection conditions, correctly identifying which seals were present, missing, or paired with the wrong container number. For items in the Transpose condition, the responses “Seal Missing” and “Other Issue” were both counted as correct. The average percentage of seals categorized correctly ranged from 95% to 97%. A 3 × 2 repeated measures ANOVA showed that there were no significant main effects or interactions for the different list presentation conditions (all Fs < 1).

Response Times.

In contrast with the accuracy results, the participants had very different response times across the six inspection conditions. For each trial, the participants’ response time was calculated as the time from trial onset to the time the participant clicked on one of the seals on the inspector’s list. The average response times across all trials for each condition are shown in Fig. 2. A 3 × 2 repeated measures ANOVA showed that there was a significant main effect of list order (F(2,70) = 25.42, p < 0.001), a significant main effect of color coding (F(1,70) = 41.25, p < 0.001), and a significant interaction between the two (F(2,70) = 11.67, p < 0.001).
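
The paper does not state the analysis software. As a sketch, an equivalent 3 × 2 repeated measures ANOVA could be run with statsmodels; the table layout, column names, and numbers below are synthetic stand-ins, not the real data.

```python
import numpy as np
import pandas as pd
from statsmodels.stats.anova import AnovaRM

# Long-format table: one mean response time per participant per design
# cell (15 participants x 3 orders x 2 color-coding levels).
rng = np.random.default_rng(1)
rows = [(s, order, color, rng.normal(10.0, 2.0))
        for s in range(15)
        for order in ("random", "numerical", "facility")
        for color in ("none", "color")]
df = pd.DataFrame(rows, columns=["subject", "order", "color_coding", "rt"])

# Two fully crossed within-subject factors: the fit reports F and p for
# both main effects and the order x color-coding interaction.
print(AnovaRM(df, depvar="rt", subject="subject",
              within=["order", "color_coding"]).fit())
```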

Fig. 2. Average response time for each trial for each of the six conditions in Experiment 1. Error bars represent the standard error of the mean in all figures.

Post-hoc paired t-tests showed that participants responded significantly faster when there was color coding in the random (t(14) = 7.24, p < 0.001) and numerical order conditions (t(14) = 3.55, p < 0.01). Color coding did not have a significant effect in the facility order condition (t(14) = 0.21). Paired t-tests were also used to compare across the list order conditions. When the lists did not have color coding, participants were significantly faster for the facility order condition than for the numerical order (t(14) = 6.58, p < 0.001) and random order conditions (t(14) = 7.48, p < 0.001). The numerical and random order conditions did not differ significantly from one another (t(14) = 0.53). For the conditions with color coding, there were no significant differences in response times across the three list order conditions (all ts < 1.69, all ps > 0.11).
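
Continuing the synthetic table from the ANOVA sketch above, a corresponding post-hoc comparison pairs each participant’s two cell means; the column names remain the same assumptions as before.

```python
from scipy.stats import ttest_rel

# Effect of color coding within the random-order condition: pivot to one
# row per participant, then pair each participant's two cell means.
cells = df[df["order"] == "random"].pivot(index="subject",
                                          columns="color_coding",
                                          values="rt")
t, p = ttest_rel(cells["none"], cells["color"])
print(f"t({len(cells) - 1}) = {t:.2f}, p = {p:.4f}")
```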

Eye Tracking Data.

The eye tracking data showed that the length of the participants’ visual search process was the driving factor behind the differences in response times across the six inspection conditions. The average number of fixations per trial is shown in Fig. 3. A 3 × 2 repeated measures ANOVA showed that there was a significant main effect of list order (F(2,70) = 11.09, p < 0.001), a significant main effect of color coding (F(1,70) = 23.00, p < 0.001), and a significant interaction between the two (F(2,70) = 3.23, p < 0.05).

Fig. 3. Average number of fixations (left) and average number of ROIs containing gaze data (right) per trial for each of the six conditions in Experiment 1.

The gaze data were used to determine how many items the participants scanned as they were searching for each seal number. The seal-container pairs on both lists were labeled as regions of interest (ROIs) and we calculated the average number of ROIs containing gaze data points on each trial. These data are also shown in Fig. 3. Once again, there was a significant main effect of list order (F(2,70) = 14.25, p < 0.001), a significant main effect of color coding (F(1,70) = 30.98, p < 0.001), and a significant interaction (F(2,70) = 8.41, p < 0.01). The patterns seen in both eye tracking analyses mirrored the pattern observed in the response time data.
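
As an illustration of this ROI analysis, here is a minimal sketch of counting how many regions received at least one gaze sample in a trial. The rectangle geometry, names, and sample points are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ROI:
    name: str
    x0: float
    y0: float
    x1: float
    y1: float

    def contains(self, x, y):
        return self.x0 <= x <= self.x1 and self.y0 <= y <= self.y1

def rois_visited(gaze_points, rois):
    """Number of regions of interest that received at least one gaze
    sample during a trial; gaze_points is a list of (x, y) samples."""
    return sum(any(r.contains(x, y) for x, y in gaze_points) for r in rois)

# Hypothetical layout: 18 seal-container ROIs arranged in two columns.
rois = [ROI(f"pair_{i}",
            60 + 180 * (i % 2), 40 + 30 * (i // 2),
            220 + 180 * (i % 2), 64 + 30 * (i // 2)) for i in range(18)]
print(rois_visited([(100, 50), (105, 52), (300, 200)], rois))
```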

Search Strategy.

After completing each inspection task, participants were asked to describe the search strategy that they had used during the task. Across all inspection conditions, all of the participants reported that they started from the inspector’s list and searched for the items in the facility’s list. When there was color coding available, all 15 participants reported using the colors to constrain their search to the appropriate column on the facility’s list. For example, one participant wrote “I looked at the color on my list and then found the right column, then scanned for the first two numbers and then if I found the first two numbers matching I looked to see if the rest matched.”

When the inspector’s list was in the same order as the facility’s list, but with no color coding, 13 of the 15 participants reported that they used the matching order to constrain their search. For example, one participant reported “Since the lists were in the same order, I simply looked at the number on the left and looked to see if there was a match on the right side, below the previous match.” The other two participants did not specify whether they used this information.

When the inspector’s list was in numerical order, participants could have used the order information to constrain their search. If they started from the facility’s list, they could have used the numerical ordering of the inspector’s list to quickly match or eliminate seals. However, none of the participants reported using this strategy, nor did the behavioral or eye tracking results indicate that any participants used this strategy. Only one participant mentioned the numerical ordering, saying “It was really not helpful for me to have my list in number order, having the facility list in number order would have been better.”

2.3 Discussion

The results of the partial-to-full list comparison in Experiment 1 were generally consistent with those for the complete list-to-list comparison used in our prior work [13]. Participants were equally accurate across all of the list presentation conditions, but they had faster response times when their list was ordered to match the order of the items on the facility’s list. The use of color coding to narrow the participants’ search space to specific columns in the facility’s list also had a significant impact on response times. All the participants took advantage of this cue, which allowed them to search more efficiently. This was reflected in faster response times, fewer fixations per trial, and fewer items scanned per trial for the conditions that used color coding.

Interestingly, none of the participants used the numerical ordering to narrow their search space. In our prior work, we found that few participants took advantage of the numerical ordering, but those who used it were able to complete the inspections faster [13]. In the present study, all of the participants searched by choosing an item on the inspector’s list and searching for it in the facility’s list, regardless of list presentation condition. This is most likely due to the imbalance in the lengths of the two lists in the present study. If participants started from the facility’s list, they would need to spend time eliminating the seals that were not on their checklist. Taking advantage of the numerical ordering would have made the search process more efficient per item but would have increased the number of items that participants needed to search for.

3 Experiment 2

In Experiment 2, we built on the findings of Experiment 1 by extending the experimental paradigm to another aspect of safeguards inspections. In addition to comparing inventory lists to one another, safeguards inspectors must also physically check the seals on containers to verify their presence within a facility and to ensure that the seals have not been tampered with. In this case, inspectors must navigate through a facility to find and check the seals on their list. Unlike the list-to-list comparisons, where all of the information can be placed side-by-side, in this list-to-seal comparison, inspectors are only able to look at one seal at a time.

This scenario changes the dynamics of the search process, which may also change which types of list presentation conditions are most helpful. Thus, in Experiment 2 the inspector’s lists were the same as in Experiment 1, but instead of checking the inspector’s list against a second list, the participants checked the list against images of sealed containers which could only be viewed one at a time. In this scenario, we hypothesized that participants would start by looking at the image of the sealed container, then search for the seal or container number in their list. Searching in this manner should lead more participants to take advantage of non-color cues on the inspector’s list, such as numerical ordering.

3.1 Method

Participants.

Twelve participants were recruited from the employee population of Sandia National Laboratories and were compensated for their time. The participants (4 female) had an average age of 41 years. One of the participants held a high school degree, one held an associate degree, five held bachelor’s degrees, and five held master’s degrees. One participant reported a diagnosis of colorblindness.

Materials and Procedure.

The seal-container pairs were the same as those used in Experiment 1 and included all of the same seal conditions (Match, Wrong Container, Missing, Transpose, and Transpose Match). As before, the inspector’s list, containing 18 items, was presented in two columns on the left side of the computer screen. Participants checked the list against 36 sealed containers in the “facility.” The right side of the screen showed an olive green square representing a container and a yellow circle representing a seal. Buttons labeled “Previous” and “Next” allowed participants to reveal the sealed containers in sequence. Below the container, a text box tracked which seal-container pair was being shown (e.g., “Seal 9/36”). When participants clicked on the buttons to move to another container, there was a short delay (500 ms) before the new container was displayed. This delay simulated physically moving to see the next seal. An example of the screen layout is shown in Fig. 4.
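
A minimal sketch of this one-at-a-time viewer is shown below, with a sleep standing in for the 500 ms walking delay; the class and method names are our assumptions, not the actual experiment code.

```python
import time

class SealViewer:
    """One-at-a-time container display: 'Previous'/'Next' step through
    the sealed containers, with a 500 ms pause standing in for walking
    to the next container."""

    def __init__(self, pairs, delay_s=0.5):
        self.pairs = pairs      # list of (seal_number, container_number)
        self.index = 0
        self.delay_s = delay_s

    def _move(self, step):
        time.sleep(self.delay_s)          # simulated walking time
        self.index = max(0, min(len(self.pairs) - 1, self.index + step))
        seal, container = self.pairs[self.index]
        print(f"Seal {self.index + 1}/{len(self.pairs)}: {seal} on {container}")

    def next(self):
        self._move(+1)

    def previous(self):
        self._move(-1)

viewer = SealViewer([(f"4{i:05d}", f"AB-{i:02d}") for i in range(36)])
viewer.next()
viewer.previous()
```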

Fig. 4. Example of the screen layout used in Experiment 2. This example shows the color-coded numerical order condition. The box and circle on the right side of the screen represent a container and seal. (Color figure online)

The experiment consisted of six blocks and used the same six variants of the inspector’s list that were used in Experiment 1. In this case, the color coding corresponded to the order in which the seals appeared in the facility. Participants were instructed that seals 1–9 in the facility would be presented in red text on the inspector’s list, seals 10–18 in teal, seals 19–27 in blue, and seals 28–36 in purple.

The procedure was the same as in Experiment 1, except that eye tracking was not used for this experiment.

3.2 Results

Accuracy.

Across all blocks, the participants detected an average of 73% (SD = 33%) of the background color changes in real time and reported an average of 75% (SD = 33%) of the changes when asked for the total number that had occurred during each inspection. There was one participant who did not report seeing any color changes.

For the primary task, the participants performed near ceiling on all inspection conditions. The average percentage of seals categorized correctly ranged from 93% to 97%. A 3 × 2 within-subjects ANOVA showed that there were no significant main effects (all Fs < 1), nor was there a significant interaction (F(2,55) = 2.94, p = 0.06).

Response Times.

The participants’ response times were calculated as the time from trial onset to the time the participant clicked on one of the seals on the inspector’s list. The average response times across all trials for each list presentation condition are shown in Fig. 5. A 3 × 2 repeated measures ANOVA showed that there was a significant main effect of list order (F(2,55) = 7.63, p < 0.01), but there was not a significant main effect of color coding (F(1,75) = 0.52), or a significant interaction (F(2,55) = 2.33, p = 0.11).

Fig. 5. Average response time for each trial for each of the six conditions in Experiment 2.

Post-hoc paired t-tests were used to compare the list order conditions, collapsed across color coding conditions. The average response time per trial of 21.9 s (SD = 5.7) for the random order conditions was significantly longer than the average response time of 19.1 s (SD = 7.4) for the numerical order conditions (t(11) = 2.20, p < 0.05) and 16.7 s (SD = 6.6) for the facility order conditions (t(11) = 4.10, p < 0.01). The difference between the numerical order and facility order conditions was not significant (t(11) = 1.94, p = 0.08).

Search Strategy.

Eleven of the twelve participants stated that they searched by looking at the seal or container number on the right side of the screen, then scanning the inspector’s list for that item. The twelfth participant did not specify how s/he searched.

In contrast to Experiment 1, nine participants reported that they used the numerical ordering to constrain their search in one or both numerical order conditions. Five of the participants reported using the color coding to constrain their search, but two other participants complained that it was distracting and difficult to remember. Six participants reported that they used the ordering of the inspector’s list to constrain their search in the facility order condition. Interestingly, in the condition with numerical ordering and color coding, five participants mentioned using the order to help them, while only one participant mentioned the color coding. The rest of the participants did not specify which constraint they used in that condition, if any.

3.3 Discussion

In Experiment 2, we confirmed our hypothesis that a list-to-item comparison would lead to different search strategies than a list-to-list comparison. When participants could only view one seal at a time, they started with the seal and compared it to their list, rather than starting from their list as they did in Experiment 1. Both experiments had twice as many items in the “facility” as on the inspector’s list. In Experiment 1, this discouraged the participants from starting from the facility’s list, even when doing so would have been more efficient, as in the numerical order conditions. In Experiment 2, the extra effort required to move between seals flipped the direction of the comparison.

The change in the direction of the participants’ search led them to use the cues provided by the inspector’s list in different ways. In contrast to Experiment 1, the participants used the numerical ordering to their advantage, leading to performance that was quite similar across the numerical order and facility order conditions. On the other hand, the color coding was not as helpful for this type of search. The analysis showed no main effect of color coding, and on average the participants were numerically slower in the numerical order and facility order conditions when there was color coding. This indicates that the color coding was neutral at best and distracting at worst.

4 Experiment 3

Experiments 1 and 2 indicated that participants benefit from list organization conditions that support their visual search process. In Experiment 3, we tested two additional types of information that could support the search process. First, we replaced the color coding with room numbers. In both of the prior experiments, the color coding provided information about the expected location of each seal. In Experiment 1, the color coding told participants which column of a list would contain the seal (if present), and participants successfully used that information to constrain their searches. In Experiment 2, the color coding told participants which group of seals (1–9, 10–18, etc.) would contain the items from their list. In this case, the participants were beginning their search from the seals rather than from their list and did not derive much benefit from the color coding. They may also have found the color coding more difficult to interpret or to remember. In Experiment 3, we used room numbers rather than color coding to provide location information. The 36 seal-container pairs in the “facility” were divided into four different “rooms.” In some of the inspection conditions, the inspector’s list included room numbers to indicate which room should contain each seal-container pair. Unlike the color coding used in the other experiments, which was always accurate for seals that were present in the facility, the room number list was sometimes incorrect. Our goal was to assess the impact of a different method for providing location information as well as the impact of occasional inaccuracies in that information.

In addition to the presence or absence of room numbers, we also manipulated the way in which the participants moved from one container to another. To move between the four “rooms” in the facility, the participants clicked on a room map that depicted each of the rooms. Then they clicked on a seal map which represented each of the nine seals in that room. Clicking the icons on the seal map revealed the seal-container pair at that location. This mimicked the process of an inspector walking to different rooms in a facility and then finding the correct seals within each room. In half of the inspection conditions, the seal map updated throughout the inspection, indicating which seals had already been checked off of the inspector’s list. Our goal was to test the impact of dynamic updating on the participants’ ability to track their progress and find the remaining seals. With new and emerging technologies, it may be possible to provide safeguards inspectors with progress tracking or other dynamic updates while they are in the field, but it is not yet known if this will benefit their performance. Experiment 3 represents an initial test in this area.

4.1 Method

Participants.

Eighteen participants were recruited from the employee population of Sandia National Laboratories and were compensated for their time. Two participants were later excluded from the analysis due to dropped eye tracking data for one of the experimental blocks. The remaining 16 participants (four female) had an average age of 34 years. Two of the participants held a high school degree, one held an associate degree, five held a bachelor’s degree, seven held a master’s degree, and one held a PhD.

Materials and Procedure.

The seal-container pairs were the same as the items used in Experiments 1 and 2. As before, the inspector’s list was presented in two columns on the left side of the computer screen. The right side of the screen had a representation of a container and a seal similar to the one in Experiment 2. However, instead of using “Previous” and “Next” buttons to move between seal/container pairs, the participants had to navigate between virtual “rooms” to view the seals in each one. Below the representation of the container, there was a grid of nine seals which we referred to as the seal map. Below the seal map was the room map, which consisted of a row of four light gray squares labeled Room A, Room B, Room C, and Room D. See Fig. 6 for an example.

Fig. 6. Example of the screen layout used in Experiment 3. This example shows the room number condition, where the expected room was listed next to the seal/container pair in the inspector’s list. In the dynamic map conditions, the yellow seals in the 3 × 3 “seal map” turned gray as participants checked off the corresponding seals from their list. (Color figure online)

When participants clicked on one of the rooms in the room map, a message saying “Walking to Room X” appeared on the screen for two seconds and the room that they had clicked on turned dark gray, indicating that they were “in” that room. Participants could then click on the seals in the seal map, which made the corresponding seal and container numbers appear on the screen.

This experiment also had six conditions, but the manipulations were different from the other two experiments. There were three variants of the list presentation. The first variant corresponded to the facility order condition in the prior experiments. The seal-container pairs were listed in the order in which participants would encounter them if they moved through the rooms and seals in the facility in order (row by row on the seal map) from Room A to Room D. The room in which the seal should be located was listed next to the seal-container pair in the inspector’s list. In the second variant of the lists, the seal-container pairs were listed in random order, but the room was listed with them. This was referred to as the room number condition. In the third variant, the seal-container pairs were listed in random order and no room information was provided. This was referred to as the random condition. In addition, there were two variants of the seal map, which could be static or dynamic. In the static condition, the seal map stayed the same throughout the inspection task. In the dynamic condition, the seal map updated based on which seals had been checked off of the inspector’s list. When seals were checked off, they were grayed out both on the list and on the map, so that participants could see at a glance which seals in each room had already been checked off. The list and map conditions were fully crossed, creating a 3 × 2 within-subjects design.
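
The static/dynamic manipulation reduces to whether checking a seal off the list updates the map state. A small sketch of that logic follows; the class name and color-string representation are assumptions for illustration.

```python
class SealMap:
    """3 x 3 seal map for one room. In the dynamic condition, checking a
    seal off the inspector's list grays out its icon; in the static
    condition the map never changes."""

    def __init__(self, seal_ids, dynamic):
        self.icon = {sid: "yellow" for sid in seal_ids}  # yellow = unchecked
        self.dynamic = dynamic

    def check_off(self, seal_id):
        if self.dynamic and seal_id in self.icon:
            self.icon[seal_id] = "gray"   # icon updates only on dynamic maps

    def unchecked_icons(self):
        return [s for s, color in self.icon.items() if color == "yellow"]

room_a = SealMap([f"4{i:05d}" for i in range(9)], dynamic=True)
room_a.check_off("400003")
print(len(room_a.unchecked_icons()))  # 8 seals still shown as unchecked
```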

As in Experiments 1 and 2, there were eight items in the inspector’s list that were in the Match condition, two in the Wrong Container condition, two in the Missing condition, two in the Transpose condition, and two in the Transpose Match condition. These items were pseudorandomly assigned to the rooms such that there were never two of the same type of error in the same room. In addition, for the list conditions in which the room information was presented in the inspector’s list, there were two Match items listed with the wrong room number. For example, a seal-container pair might be listed as being in Room A when it was really in Room C.

When participants made a decision about an item in the inspector’s list, they clicked on it and six response choices appeared on the screen. The choices were the following: “Seal present, correct container, correct room,” “Seal present, correct container, incorrect room,” “Seal present, incorrect container, correct room,” “Seal present, incorrect container, incorrect room,” “Seal Missing,” and “Other issue.” Note that the fourth option, “Seal present, incorrect container, incorrect room” was not the correct response for any of the stimuli, but it was included to complete the set of possible response combinations.

Eye tracking data were collected for this experiment using the same procedure as Experiment 1.

4.2 Results

Accuracy.

Across all blocks, the participants detected an average of 64% (SD = 29%) of the background color changes in real time and 61% (SD = 27%) of the color changes when asked to report the total number of changes that occurred during each inspection task. There was one participant who failed to report any of the color changes.

As in the other experiments, the participants performed near ceiling on all inspection conditions. The average percentage of seals categorized correctly ranged from 94% to 96%. A 3 × 2 within-subjects ANOVA showed that there were no significant main effects or interactions (all Fs < 1).

Response Times.

For each trial, the participants’ response times were calculated as the time from trial onset to the time the participant clicked on one of the seals on the inspector’s list. The average response times across all trials for each list presentation condition are shown in Fig. 7. A 3 × 2 repeated measures ANOVA showed that there was a significant main effect of list order (F(2,75) = 9.00, p < 0.001), and a significant main effect of map type (F(1,75) = 9.08, p < 0.001), but there was not a significant interaction between the two (F(2,70) = 0.79).

Fig. 7. Average response time for each trial for each of the six conditions in Experiment 3.

Post-hoc paired t-tests were used to assess the main effects. When participants had a dynamic map rather than a static map (collapsed across list order conditions), they had significantly faster response times (t(15) = 3.18, p < 0.01). Comparisons of the three list order conditions (collapsed across map type) showed that participants had significantly faster response times in the facility order condition than in the room number (t(15) = 3.73, p < 0.01) or random (t(15) = 2.52, p < 0.03) conditions. The participants had slightly faster response times in the random condition than in the room number condition, and the difference between the two was marginally significant (t(15) = 2.08, p = 0.05).

Search Strategy.

All of the participants indicated that they searched by starting from one of the rooms and then searching the list for all of the seals in that room before moving on to the next room. An analysis of the number of times participants switched rooms indicated that there was not much variability across conditions, with the average number of switches per condition ranging from 6.1 to 7.4.

Seven of the participants mentioned the dynamic maps (or lack thereof) when describing their search strategies. Two participants wrote that they used the mouse to keep track of their place in the room when the maps did not update. Five of the participants mentioned that they double checked all of the seals in each room when the map did not update. For the dynamic maps, three participants wrote that they did a second pass through each room, checking only the seals that were not already greyed out. An analysis of the number of clicks on the seal map in each condition indicated that participants made many fewer clicks in the conditions with dynamic maps. These data are shown in Fig. 8. A 3 × 2 repeated measures ANOVA showed that there was a significant main effect of map type (F(1,75) = 50.10, p < 0.001), but not a significant main effect of list order (F(2,75) = 0.51), nor was there a significant interaction (F(2,75) = 0.89).

Fig. 8. Average number of seal views, as indicated by clicks on the seal map, for each of the six conditions in Experiment 3.

Eye Tracking Results.

The ROIs in Experiment 3 consisted of each seal-container pair on the inspector’s list, each room number on the inspector’s list (when present), the container number and seal number in the “facility,” the seal map, and the room map. The average proportion of fixations to each ROI was calculated for each participant in each of the inspection conditions. This analysis revealed that the participants rarely fixated on the room numbers in their list, even though this information would have helped them to narrow their visual search of the list. The average percentage of fixations to the room numbers ranged from a low of 3% (SD = 3%) to a high of 4% (SD = 2%). The proportion of fixations to the seal map was also quite similar across all conditions, ranging from a low of 19% (SD = 7%) to a high of 20% (SD = 7%).

The average number of fixations per trial (defined for the eye tracking analysis as the time between participants’ clicks on the screen, whether those clicks were on the list or on the maps) is shown in Fig. 9. A 3 × 2 repeated measures ANOVA showed that there was a significant main effect of list order (F(2,75) = 11.25, p < 0.001) and a significant main effect of map type (F(1,75) = 7.90, p < 0.01), but not a significant interaction between the two (F(1,75) = 0.72). Post-hoc paired t-tests showed that the dynamic map conditions had more fixations per trial than the static map conditions (t(15) = 2.95, p < 0.01), while the facility order condition had fewer fixations than the other order conditions (both ts > 2.87, ps < 0.02). The room number and random conditions did not differ significantly from one another (t(15) = 1.89, p = 0.08).
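
Because trials were defined by the intervals between successive clicks, per-trial fixation counts can be recovered by binning fixation onsets into those intervals. A small sketch of that binning, with hypothetical timestamps:

```python
import bisect

def fixations_per_trial(click_times, fixation_times):
    """Bin fixation onsets into 'trials' defined as the intervals between
    successive clicks (list or map clicks alike). Both inputs are sorted
    timestamps in seconds."""
    counts = [0] * (len(click_times) - 1)
    for t in fixation_times:
        i = bisect.bisect_right(click_times, t) - 1
        if 0 <= i < len(counts):
            counts[i] += 1
    return counts

clicks = [0.0, 4.2, 9.8, 12.5]
fixations = [0.3, 0.9, 2.1, 5.0, 5.5, 10.1]
print(fixations_per_trial(clicks, fixations))  # [3, 2, 1]
```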

Fig. 9. Average number of fixations per trial for each of the six conditions in Experiment 3.

An analysis of which regions were fixated on each trial revealed a notable difference between the conditions with static and dynamic maps, as shown in Fig. 10. For the conditions with static maps, participants devoted approximately the same proportion of fixations to the item (the seal and container number) in the “facility” as to the inspector’s list. In contrast, for the conditions with dynamic maps, the participants devoted a higher proportion of fixations to the inspector’s list than to the item.

Fig. 10. Average percentage of fixations in each region per trial in Experiment 3.

Paired t-tests showed that when there was a static map, there were not significant differences between the proportion of fixations to the item and the proportion of fixations to the list for the random or room number conditions (both ts < 1), although the difference approached significance for the facility order condition (t(16) = 2.02, p = 0.06). When participants had a dynamic map, there was a significantly higher proportion of fixations to the list for all list order conditions (all ts > 5.51, all ps < 0.001).

4.3 Discussion

The results of Experiment 3 mirrored those of Experiment 2 in the sense that the list-to-item comparison led participants to search by looking at the item and then searching for a match in the list. Even though half of the items did not appear in the list at all, the participants universally used this search strategy rather than selecting an entry on the list and searching for it in the “rooms.” The participants generally did not take advantage of the room number information, which could have constrained their visual search of the list (or their search of the rooms, had any of them chosen to start their search from the list). This also mirrored the results of Experiment 2, where participants did not benefit from the presence of color coding in the list as an indicator of a seal’s location. We had predicted that participants might find room numbers easier to use than an arbitrary color code, but the participants did not seem to use this constraint. It is possible that checking the room numbers as they scanned the list for a specific seal or container number could have disrupted their ability to maintain the seal or container number in working memory. In addition, the room numbers were inaccurate for two items on every list, so the inaccuracies may have led the participants to mistrust the room numbers in general.

In the end, the room number condition had somewhat slower response times than the random order condition. The room numbers did not provide any benefit to the participants, yet they had to take the additional step of checking to make sure that the room numbers were correct. In the random condition, where no room numbers were listed, participants had fewer pieces of information to track for each item, leading to somewhat faster performance.

Although the list order conditions did not have a large impact on the participants’ search process, the map conditions did. Participants navigated from one seal to another by clicking on a seal map to reveal each item in a “room.” When participants had a map that updated to track which seals they had already checked off of their list, they made many fewer clicks on the seal map, made more fixations between clicks, and devoted a higher proportion of their fixations to the inspector’s list. At the same time, they were significantly faster to check the seals off of their list. This indicates that their visual search process was more efficient. They did not need to spend time re-checking seals that they had already inspected. Instead, they could narrow in on the seals that had not been checked off and spend their time looking for those seals in the list.

Overall, the condition with the shortest response times was the facility order condition with a dynamic map. This indicates that both list ordering and progress tracking can provide benefits and that they can complement one another. Although the time savings for this condition is relatively small at the trial level (5–10 s, relative to the other conditions), it has an important cumulative effect. In a search task with 18 items, this per-trial time savings allowed participants to complete the entire inspection 1–2 min faster, on average, than the other conditions. In a real-world inspection, where inspectors may need to check dozens or hundreds of seals, the time savings provided by these kinds of supports could be substantial.

5 General Discussion

The experiments described in this paper demonstrate that list presentation can impact speed and efficiency for list-based visual inspection tasks. When participants are comparing two lists, as in Experiment 1, they tend to work from the shorter list and can take advantage of visual cues, such as color coding, that constrain which portions of the longer list they need to search. When participants are comparing a list to a set of items that must be viewed sequentially, as in Experiments 2 and 3, they tend to begin from the items and then search for them in the list, even if this means extra search time devoted to items that are not included in the list. In this situation, participants are better able to take advantage of list organization cues such as numerical ordering, whereas encodings of spatial information are less useful. When searching for an item in the list, participants have already located that item, and using that location information to narrow their search of the list would place an additional burden on their working memory. As a result, the participants generally ignored the encodings of spatial information, whether color coding or room numbers, and searched all of the remaining entries in the list. In all of these experiments, the inspector’s list was fairly short, containing only 18 items. It is possible that inspectors would be more likely to take advantage of spatial cues if the list were longer, making the visual search more onerous.

In Experiment 3 we provided an additional support to the participants by providing them with dynamic information about which seals they had already inspected. This progress tracking made participants more efficient in their search and led to a considerable reduction in the number of times they viewed the seals, in addition to a reduction in their overall inspection time relative to other conditions.

Both sets of findings have important implications for the international nuclear safeguards domain, as well as other domains that involve list-based inspections. Although the IAEA inspectors do not have control over the materials provided to them by facility operators, they could change the formatting of their own materials to provide better support for their inspection process. The inspectors conduct both list-to-list and list-to-item checks, and the results of these experiments indicate that they are likely to use different search processes for these two types of tasks. Formatting their materials accordingly and using new technologies to allow them to track their progress, when possible, will enable faster, more efficient inspections.