The Tobii Pro Spectrum: A useful tool for studying microsaccades?

Nyström, Marcus; Niehorster, Diederick C.; Andersson, Richard; Hooge, Ignace

doi:10.3758/s13428-020-01430-3

The Tobii Pro Spectrum: A useful tool for studying microsaccades?

Open access
Published: 23 July 2020

Volume 53, pages 335–353, (2021)
Cite this article

Download PDF

You have full access to this open access article

Behavior Research Methods Aims and scope Submit manuscript

The Tobii Pro Spectrum: A useful tool for studying microsaccades?

Download PDF

Marcus Nyström¹,
Diederick C. Niehorster²,
Richard Andersson³ &
…
Ignace Hooge⁴

6339 Accesses
8 Citations
8 Altmetric
1 Mention
Explore all metrics

Abstract

Due to its reported high sampling frequency and precision, the Tobii Pro Spectrum is of potential interest to researchers who want to study small eye movements during fixation. We test how suitable the Tobii Pro Spectrum is for research on microsaccades by computing data-quality measures and common properties of microsaccades and comparing these to the currently most used system in this field: the EyeLink 1000 Plus. Results show that the EyeLink data provide higher RMS precision and microsaccade rates compared with data acquired with the Tobii Pro Spectrum. However, both systems provide microsaccades with similar directions and shapes, as well as rates consistent with previous literature. Data acquired at 1200 Hz with the Tobii Pro Spectrum provide results that are more similar to the EyeLink, compared to data acquired at 600 Hz. We conclude that the Tobii Pro Spectrum is a useful tool for researchers investigating microsaccades.

The effect of sampling rate and lowpass filters on saccades – A modeling approach

Article 27 January 2017

David J. Mack, Sandro Belfanti & Urs Schwarz

Eye Tracking in Visual Search Experiments

PyTrack: An end-to-end analysis toolkit for eye tracking

Article Open access 04 June 2020

Upamanyu Ghose, Arvind A. Srinivasan, … Eng Siong Chng

Introduction

Fixational eye movements—i.e., those that humans and some animals produce when they have the task to fixate on an object—consist of small (micro) saccades, slow drift, and high-frequency, low-amplitude tremor (Martinez-Conde et al., 2004; Rolfs, 2009; Rucci & Victor, 2015). Despite early work dismissing fixational eye movements as oculomotor noise or even evolutionary mistakes (Kowler & Steinman, 1980), there has over the past two decades been a surge in work on fixational eye movements, which have been linked to various perceptual and cognitive functions, and connected to neurological underpinnings (Martinez-Conde et al., 2013). For example, the occurrence of microsaccades has been linked with the allocation of covert attention (Engbert & Kliegl, 2003), onset of visual (Scholes et al., 2015) and neural (Martinez-Conde et al., 2000) transients, anticipation (Fried et al., 2014), mental fatigue (Di Stasi et al., 2013), and cognitive workload (Siegenthaler et al., 2014). It is a longstanding fact that microsaccades occur in most participants at a rate of about 1–2 Hz. However, other properties of microsaccades are still debated (Collewijn & Kowler, 2008; (Nyström et al., 2016), 2017). In the early studies in the 1950s and 1960s, for example, the agreed-upon upper limit of microsaccade amplitudes was 12 min arc (Collewijn & Kowler, 2008), whereas today 60 min arc (1 deg) is a common cut-off when distinguishing micro- from larger saccades (Martinez-Conde et al., 2013). Nyström et al. (2016) conclude that while we still do not have a definite answer to why such discrepancies in amplitudes exist between the old and the new literature, the technology to record fixational eye movements, which has changed fundamentally since the 1950s, is an important factor (for a comprehensive review, see Poletti and Rucci 2016).

Over the past decades, the EyeLink family of eye trackers is unarguably the most used eye tracker in microsaccade research. For instance, in their review of microsaccade research between 2004 and 2009, Martinez-Conde et al., (2009) list 37 studies (with 43 experiments in total) in their Table 1 of which 30 use an EyeLink. From 2010 to 2018, a Google Scholar search ends up with more hits when entering EyeLink together with microsaccade and eye tracker compared to combining the last two words with any of the other competing companies or techniques. Importantly, similar microsaccade rates were found in EyeLink 1000 data when compared to co-recorded data acquired with scleral search coils (McCamy et al., 2015), considered the ‘gold standard’ in oculomotor research (Collewijn, 1999) Today, the EyeLink 1000 Plus is the most commonly used eye tracker for microsaccade research. It is based on the traditional pupil and corneal reflection (CR) principle, where gaze locations on a calibration plane (usually a screen) are inferred from pupil-CR vectors through polynomial mapping (SR Research, 2017).

Table 1 Output of the linear mixed effects models predicting RMS precision, SD precision, microsaccade amplitude (Amp), and microsaccade rate (Rate). The two latter variables are generated from the algorithm by Engbert and Kliegl (2003) using a fixed λ = 6. The intercept corresponds to the EyeLink F setup. Values in parenthesis represent standard errors

Full size table

While the EyeLink family of eye trackers has long been the clear choice for microsaccade researchers due to its high sampling frequency and precision, other eye trackers are now approaching similar specifications. Two examples using stereo cameras and more than one source of illumination in combination with physical 3D models of the eye are the open-source eye tracker by Barsingerhorn et al., (2018) and the commercially available Tobii Pro Spectrum. These eye trackers may therefore be interesting for researchers studying fixational eye movements. However, since history has taught us that introducing a new way of measuring the small fixational eye movements also may change their measured properties, a validation of its capabilities against current instruments is crucial (Collewijn & Kowler, 2008; Nyström et al., 2016).

What is important for an eye tracker that will be used to record fixational eye movements? Since it is often not critical to know the exact position where participants are looking, small inaccuracies (systematic errors) in the eye-tracker signal are usually not problematic in the majority of research on fixational eye movements (Poletti et al., 2013, but see, for instance). Since microsaccades can be very small (Poletti and Rucci, 2016, use a lower bound of 3’) it is however critical to record data with high precision, i.e., data with a low variable error. Articles reporting data collected by an EyeLink often point the reader to the EyeLink manual, which lists a ‘spatial resolution’ of 0.01 deg (0.6’) (SR Research, 2017, p. 9), which refers to a measurement with a static artificial eye. This value is computed as the root mean square (RMS) of distances between consecutive samples.^{Footnote 1} Reporting values from a manual is unfortunate since precision differs for real eyes and artificial eyes (Holmqvist et al., 2011, p. 35), and across studies depending on factors such as the eye physiology of the participants, the recording environment, the setup and settings of the eye tracker, and the method to compute precision (Nyström et al., 2013a).

A few researchers report the actual precision in the data, how to calculate it, and the state of the filters during recording. For instance, Nyström et al., (2017) reported both filtered and unfiltered standard deviation (SD) and root mean square (RMS) of intersample distances. Unfiltered data gave precision values between 0.03 to 0.06 deg, while filtered data (with a Savitzky–Golay filter of length 21 ms) provided an order of magnitude higher precision (0.003 to 0.006 deg).

Another important property is sampling frequency. Since the smallest microsaccades also have very short durations (a few milliseconds), it is critical to have a high enough sampling frequency to be able to detect them, but also to quantify more detailed properties like velocity and shape. Since frequencies above 100 Hz carry little information about fixational eye movements (Findlay, 1971), it would according to the Nyquist–Shannon theorem (Nyquist, 1928; Shannon, 1949) be sufficient to record at twice that frequency (200 Hz) to be able to capture all fixational eye movements. In practice, however, studies investigating microsaccades typically use eye trackers recording at 250 Hz and above (Martinez-Conde et al., 2009). The lower bound of 250 Hz likely reflects the fact that the SMI EyeLink I, which was the state-of-the-art video-based eye tracker in the early 2000s, had a maximum sample rate of 250 Hz.

The goal of this paper is to test whether a remote stereo-camera-based eye tracker, the Tobii Pro Spectrum, is suitable for studying microsaccades in a typical experimental paradigm where participants fixate centrally located targets on a computer screen. Suitability is defined in terms of properties of both the eye tracker signal (precision, power spectral density) and the detected microsaccades (rate, amplitude, displacement, direction, and shape). Results will be compared against one of the currently most used eye trackers in microsaccade research, the EyeLink 1000 Plus in desktop mount, across participants who perform the same task. Since recordings are performed within participants across eye trackers, we assume that any differences we find are attributed to differences between the eye trackers.

In Experiment I, four experienced participants are recorded in a fixation task where blink-free data on fixational eye movements are acquired and analyzed in detail on an individual level. To test whether the results generalize to a wider population with non-expert participants, eight naive participants are recorded in Experiment II using a less demanding fixation task. Finally, statistical analyses using all participants from both experiments are used to compare the data quality and properties of the detected microsaccades across the eye trackers.

Experiment I

Methods

Participants and apparatus

Binocular eye movements from four male participants (P1, P2, P3, P4) were recorded on the EyeLink 1000 Plus in Desktop Mode (Host version 5.12) and the Tobii Pro Spectrum (firmware version 1.7.6). To reduce the variance across setups and recordings, all of the participants were authors trained to sit very still and to avoid blinking. Each of the participants has over 10 years of experience working with eye trackers in various contexts. Two of the participants (P3, P4) wore glasses. Informed consent was obtained from each participant.

The EyeLink was set to record binocular eye movements at 1000 Hz in pupil centroid mode, and was set up according to the recommendations of the manufacturer (SR Research, 2017).

Stimuli were presented on the native Tobii Pro Spectrum screen (EIZO FlexScan EV2451) with a resolution of 1920 × 1080 pixels (52.8 × 29.7 cm). Participants sat at a distance of 63 cm from the screen and positioned themselves such that the average position of the eyes was in the center of the headbox of the Tobii Pro Spectrum. Participants’ heads were supported with the EyeLink chin-, and forehead rest. The setup can be seen in Fig. 1.

Stimuli were presented on the screen with PsychoPy 1.85.0 Peirce (2007, 2008).

Procedure

Participants completed a calibration followed by a validation. For the Tobii Pro Spectrum, the default calibration in Titta (Niehorster et al., 2019) was used, where five calibration points were followed by four points for validation. The EyeLink was calibrated with the standard nine-point calibration followed by a nine-point validation. Although accuracy is not the main variable of interest in this study, re-calibrations were performed if visual inspection of the validation data revealed that there were large deviations in one or more validation points.

Prior to the onset of calibration, eye images were inspected to ensure that relevant eye features (pupil and corneal reflection(s)) were clearly visible to the operator. Prior to calibrating the EyeLink, the focus of the camera was adjusted such that the sizes of the CRs were minimized in the eye image, followed by auto-adjustments of the pupil-, and CR thresholds.

Following a calibration, each participant was recorded in four setups:

A.
Tobii Pro Spectrum at 1200 Hz.
B.
Tobii Pro Spectrum at 600 Hz.
C.
EyeLink with both heuristic filters switched on.
D.
EyeLink with both heuristic filters switched off.

Consequently, each eye tracker was tested in two setups. For the Tobii Pro Spectrum, the two highest sampling frequencies were used to investigate whether there is a sampling frequency/noise trade-off leading to a noisier signal in 1200-Hz recordings compared to 600-Hz recordings. This is to be expected due to the shorter exposure time, and hence worse image quality, in the 1200-Hz case (see Appendix A). Conversely, the higher sampling frequency may provide more information about the microsaccades, aiding their detection. Some microsaccade researchers using the EyeLink record data with the proprietary heuristic filters turned on, or do not report the state of the filter at all. Therefore we collected filtered and unfiltered data. Using unfiltered EyeLink data also makes comparisons with Tobii Pro Spectrum data (which are unfiltered) more relevant from an evaluation perspective.

In the remainder of the text, the setups will be referred to as Spectrum 1200 (A), Spectrum 600 (B), EyeLink F (C), and EyeLink U (D). For each setup, participants were asked to fixate a stimulus dot (inner diameter 0.1 deg, outer diameter 0.6 deg, denoted ABC in Thaler et al., 2013) on a mid-gray background (RGB: (128, 128, 128), luminance 37.6 cd/m²) in the center of the screen for five trials of 20 s each.

If the reported pupil size in either of the eyes was ‘not a number’, nan (Tobii), or less than 100 pupil area units (EyeLink), the trial was interrupted and a new trial was added to the experiment. This typically happens when a participant blinks, and was added to ensure that five 20-s trials without data loss were recorded for each participant and setup.

Each participant was recorded twice for each setup in the order ABCDABCD for two of the participants and CDABCD AB for the other two. All data for each participant were collected within one hour with only short breaks in-between.

No attempts were done during the recordings to change, e.g., in the EyeLink case, pupil thresholds in case the signal for some reason was lost.

Data analysis

Accuracy values from the EyeLink were taken directly from the output files generated by the edf-converter tool provided by SR Research. For the Spectrum, accuracy values for each validation point were computed by extracting data in a 500-ms interval starting 500 ms after each validation point onset. The average distance between each validation point and the median value of the corresponding gaze data was taken as the accuracy value.

Prior to being fed in the microsaccade detection algorithms, data were converted to degrees and lowpass filtered with a Bartlett window filter of size of 20 ms (cf. Appendix C).

Microsaccades were detected with a standard algorithm in the field (Engbert and Kliegl, 2003), using a minimum microsaccade duration of 5 ms and λ = 6. To prevent counting overshoots as additional microsaccades, an additional requirement was that a minimum duration of 10 ms had to separate two consecutive microsaccades. Microsaccade rate was computed per trial by dividing the number of microsaccades with the total duration of valid samples in the trial; this to prevent that, for instance, trials with few microsaccades and much data loss would be taken as a low microsaccade rate of a participant.

To investigate how the choice of detection algorithm influences the results (cf. Appendix D), microsaccades were also detected with the algorithm by Otero-Millan et al., (2014) and, following Engbert and Mergenthaler (2006), by using surrogate data to find optimal λ values for the Engbert & Kliegl-algorithm.

Results

Data quality

Average accuracy over all validation points, participants, and eyes was (in degrees): EyeLink F (M = 0.44, SD = 0.28), EyeLink U (M = 0.56, SD = 1.02), Spectrum 1200 (M = 0.58, SD = 0.38, Spectrum 600 (M = 0.58, SD = 0.37). Figure 2 shows two seconds of binocular horizontal and vertical gaze signals recorded from one participant in the four recording setups. Since microsaccades occur at rates of about 1–2 Hz and are more prevalent in the horizontal direction (Rolfs, 2009), we expect to see at least a few examples of microsaccades in the horizontal component of the signals. Indeed, microsaccades can clearly be seen with the naked eye in all four setups in Fig. 2a. In contrast, microsaccades are virtually absent in the vertical data (Fig. 2b). From visual inspection, the signal-to-noise ratio appears to be similar in the first, second, and fourth panels. Unsurprisingly, EyeLink data are less noisy when switching on the heuristic filters (third panel).

To characterize the noise, the power spectral density (PSD) of the signals was computed using Python 2.7 and the psd function in matplotlib (v. 2.1.2, default settings). In short, the PSD is a measure of the strength (power) of the variations in a signal as a function of frequency. It is computed over 256 sample long windows by calculating the average of the Fourier transform of each window. Figure 3 illustrates how the power of signals from the four setups is distributed across different frequencies for a representative trial.

There are a few observations one can make from this figure. First, the EyeLink data recorded with the heuristic filters turned on have a very different distribution of power at frequencies above 100 Hz. Instead of a flat power spectrum, indicating white noise, the power of the filtered data reaches a local minimum between 300 and 400 Hz, after which it starts to increase again. Second, since noise from a completely still artificial eye typically is white and thus flat (Coey et al., 2012; Wang et al., 2017), it is likely that eye movements, which are characterized by pink (1/f) noise (Coey et al., 2012), contribute with power only at frequencies lower than 100 Hz (cf. Findlay, 1971). Since the power at a certain frequency is proportional to the squared amplitude of the signal, it can from the higher frequency range (> 100 Hz) in this plot also be predicted that the precision of the gaze signals from the recording setups will be ordered in the same way as in Fig. 3, i.e., highest to lowest: EyeLink F, EyeLink U, Spectrum 1200, Spectrum 600.

To quantify precision, the root-mean-square (RMS) of sample-to-sample distances and the standard deviation (SD) of the signals are computed for all participants across each trial (Fig. 4). SD was included to capture slower variations in the signal indicative of, e.g., drift. Overall, the above observations were confirmed, and the horizontal and vertical RMS of the unfiltered EyeLink data (EyeLink U) had marginally lower values than that of the Spectrum recordings.

Horizontal and vertical SD is on average higher in the EyeLink data compared to the Spectrum data, and only small differences exist between filtered and unfiltered EyeLink data.

Detailed summary statistics of precision values for each participant and measurement is provided in Table 4, which shows that precision appears to be stable within participants and across recordings.

Microsaccades

While the calculated precision of data can be improved by lowpass filters internal or external to the eye tracker, high precision per se does not mean that microsaccades can be measured and represented more accurately. Therefore, a detailed analysis of detected microsaccades and their properties follows below.

Since experienced participants were recorded within a limited time period while performing the same task, we assume a similar microsaccade production within participants and across setups.

Microsaccade rates for each participant and setup are shown in Fig. 5. The data show some variation across participants and setups, but it does not seem like a particular setup systematically shows lower or higher microsaccade rates.

Other frequently reported properties of microsaccades are their amplitudes and directions. Figure 6 shows both the (a) amplitude (maximum excursion during the microsaccadic interval) and (b) displacement (distance between onset and offset) of microsaccades and Fig. 7 illustrates microsaccade directions.

Again, there are only small differences across setups in both amplitude, displacement, and direction of the detected microsaccades. Note that the ranking of participants on amplitudes and displacements in Fig. 6 is stable across setups.

Finally, we align and plot the average waveform of a microsaccade for each recording setup and participant (Fig. 8). The waveforms were generated by extracting all microsaccades with amplitudes (A) in the range A = [0.14,0.26] deg, and then normalizing each waveform by its maximal value. As can be seen, the waveforms are virtually indistinguishable across different recording setups for all participants.

Discussion

Eye-tracker data recorded from four experienced participants in a fixation task generated similar data quality and microsaccade rate across the tested setups. Besides the obviously higher precision in the EyeLink F setup, data from the four setups seem comparable.

Experiment II

In early work, fixational eye movements were often recorded from experienced participants (Ditchburn & Foley-Fisher, 1967), sometimes the authors themselves, as was also the case in Experiment I of this paper. However, it is becoming increasingly more common to study microsaccades in naive or clinical populations (e.g., Alexander et al. 2018). It is therefore necessary to see if the results generalize to a wider, and less expert, population, which is the aim of Experiment II.