Setting things straight: A comparison of measures of saccade trajectory deviation

Tudge, Luke; McSorley, Eugene; Brandt, Stephan A.; Schubert, Torsten

doi:10.3758/s13428-016-0846-6

Setting things straight: A comparison of measures of saccade trajectory deviation

Published: 11 January 2017

Volume 49, pages 2127–2145, (2017)
Cite this article

Download PDF

Behavior Research Methods Aims and scope Submit manuscript

Setting things straight: A comparison of measures of saccade trajectory deviation

Download PDF

Luke Tudge^1,2,
Eugene McSorley³,
Stephan A. Brandt⁴ &
…
Torsten Schubert^1,5

2392 Accesses
9 Citations
1 Altmetric
Explore all metrics

Abstract

In eye movements, saccade trajectory deviation has often been used as a physiological operationalization of visual attention, distraction, or the visual system’s prioritization of different sources of information. However, there are many ways to measure saccade trajectories and to quantify their deviation. This may lead to noncomparable results and poses the problem of choosing a method that will maximize statistical power. Using data from existing studies and from our own experiments, we used principal components analysis to carry out a systematic quantification of the relationships among eight different measures of saccade trajectory deviation and their power to detect the effects of experimental manipulations, as measured by standardized effect size. We concluded that (1) the saccade deviation measure is a good default measure of saccade trajectory deviation, because it is somewhat correlated with all other measures and shows relatively high effect sizes for two well-known experimental effects; (2) more generally, measures made relative to the position of the saccade target are more powerful; and (3) measures of deviation based on the early part of the saccade are made more stable when they are based on data from an eyetracker with a high sampling rate. Our recommendations may be of use to future eye movement researchers seeking to optimize the designs of their studies.

The effect of sampling rate and lowpass filters on saccades – A modeling approach

Article 27 January 2017

Speed-accuracy tradeoffs influence the main sequence of saccadic eye movements

Article Open access 28 March 2022

Widely applicable MATLAB routines for automated analysis of saccadic reaction times

Article Open access 02 May 2014

When a new object appears in our field of view, we may make a quick eye movement (a saccade) to bring our gaze to that object. During these saccades, the path that our gaze follows across our field of view is rarely a straight line from our current point of regard to the location of the new object. Instead, saccades describe a curved path, and do not always land exactly on target (Erkelens & Sloot, 1995; Viviani, Berthoz, & Tracey, 1977). This deviation is systematically influenced by the presence of other objects that we have not chosen to look at, termed distractors (for reviews, see Van der Stigchel, 2010; Walker & McSorley, 2008). This phenomenon may be termed the saccade trajectory deviation.

A widely accepted explanation of saccade trajectory deviation is that it occurs because the visual system prepares eye movements to both the target and the distractor, and the resulting eye movement is an average or combination of the two different planned movements at the moment when the saccade is initiated (McPeek, Han, & Keller, 2003; McPeek & Keller, 2001; Port & Wurtz, 2003; Tipper, Howard, & Paul, 2001; White, Theeuwes, & Munoz, 2012). To the extent that the planned eye movement to the distractor has not been fully suppressed by the time the saccade is executed, the trajectory of the saccade will deviate toward the distractor. Conversely, deviation away from the distractor may reflect an “overinhibition” of the planned eye movement to the distractor (McSorley et al., 2006).

Saccade trajectory deviation provides a convenient quantification of the allocation of attention to the distractor. By varying the content of the distractor or of the target, and by varying the conditions under which participants view the two objects, we may learn what priorities and strategies the visual system employs. Saccade trajectory deviation has been widely used in this way as an operationalization of attention and cognitive control in investigations of diverse phenomena, such as phobias (McSorley & Morriss, 2015), the processing of word meaning (Weaver, Lauwereyns, & Theeuwes, 2011), emotion (McSorley & van Reekum, 2013), social behavior (Laidlaw, Badiudeen, Zhu, & Kingstone, 2015), cognitive decline in the elderly (Campbell, Al-Aidroos, Pratt, & Hasher, 2009), and participants’ preparedness for the task (Tudge & Schubert, 2016).

When studying saccade trajectory deviations, it is necessary to quantify the extent of a saccade’s deviation. No single, agreed-upon method for doing so exists. Rather, different studies have quantified deviation in different ways (for an overview, see Van der Stigchel, Meeter, & Theeuwes, 2006). If these different measures reflect slightly different aspects of saccade planning, or if some measures are better suited than others to detect the effects of experimental manipulations, then studies using different measures may not be easily comparable, or may in fact be drawing conclusions about different underlying phenomena. Our aim in the present study is to systematically compare different measures of saccade trajectory deviation, to find out which of them are likely to reflect the same underlying phenomenon, and which are most sensitive to certain experimental manipulations. We hope that this information will help future researchers in choosing an optimal measure for a planned study, and help to better compare the findings of studies that use different measures.

Several different features of a saccade trajectory might reflect its apparent deviation from a straight path. A widely cited review of research with saccade trajectory deviations lists eight methods of measuring deviation (Van der Stigchel et al., 2006). In the present study, we compared these eight measures. It is therefore important to describe them briefly before continuing. The measures are also summarized in Table 1, and illustrated in Fig. 1.

Table 1 Summary of saccade measures

Full size table

Overall direction is the angle between a straight line from saccade start to target position and a straight line from saccade start to saccade end. It measures the extent to which a saccade lands to one side of its target, and does not take into account any part of the saccade apart from its landing point.
Saccade deviation is the mean of all the angles formed between a straight line from saccade start to target position and straight lines from saccade start to each sample within the saccade. Like overall direction, it measures the extent to which the saccade deviates to one side of its target, but averaged over the entire trajectory.
Overall initial direction is the angle between a straight line from saccade start to target position and a straight line from saccade start to a point 10 ms after saccade start (i.e., early in the saccade). Again, it measures deviation relative to the target, but does so only for the earliest part of the saccade.
Maximum curvature is the maximum perpendicular distance of the saccade trajectory from a straight line from saccade start to saccade end. It measures the curved shape of the trajectory. Some previous studies have standardized maximum curvature by dividing it by saccade amplitude (Doyle & Walker, 2001). This is intended to correct for the fact that longer saccades have more space within which to describe a larger curve. We also followed this standardization procedure in our analyses.
Area curvature is an estimate of the area between the trajectory of the saccade and a straight line from saccade start to saccade end. Different studies have estimated this area in slightly different ways. In all methods, rectangles drawn along the straight line from saccade start to saccade end and located between saccade samples are used to approximate the area of the curve. These rectangles may extend either to each sample (see, e.g., Fig. 1 in Ludwig & Gilchrist, 2002) or to a point halfway between each sample and the previous sample (Walker et al., 2006). We used the latter procedure in our analyses (see Fig. 1, right panel). Like maximum curvature, this measure is often standardized to saccade amplitude (Walker et al., 2006), and we followed this standardization procedure in our analyses.
Initial direction is similar to the overall initial direction, in that it measures an angle to a saccade sample 10 ms into the saccade. The difference is that this angle is measured relative to a straight line from saccade start to saccade end, and not to the target position.
Initial average curvature is similar to the maximum curvature. It measures the perpendicular distance of saccade samples from a straight line from saccade start to saccade end, but instead of the maximum such distance, it is the mean of distances to samples within the first 10 ms of the saccade. This measure is a variant of a measure that has been termed simply the initial average. In the literature on saccade trajectory deviations, there has been some confusion of terms regarding the initial average. To our knowledge, the first occurrence of a measure with this name is in the work of Sheliga and colleagues (e.g., Sheliga, Riggio, Craighero, & Rizzolatti, 1995). The authors described a measure that averages the perpendicular distances from a straight line from saccade start in an absolute direction (up, down, left, or right, depending on where the target is located). Later, Ludwig and Gilchrist (2002) described a measure called initial direction, and referenced the description by Sheliga et al., but in fact described a slightly different process of calculation, using perpendicular distances from a straight line from saccade start to saccade end. In the present study, we followed the method from Ludwig and Gilchrist, but use the novel term initial average curvature to avoid confusion with the slightly different method described as the initial average in Sheliga et al. (1995). To avoid further confusion, it is also important to note here that the term initial average also appears in Van der Stigchel et al. (2006), with yet another very slightly different method of calculation. The authors there described the initial average as the average of angles between the saccade trajectory and a straight line from saccade start to saccade end. We did not use this method of calculation in the present study.
Quadratic curvature is calculated by fitting a quadratic polynomial to the saccade samples after normalizing the amplitude of the saccade onto a scale from –1 to 1. The quadratic coefficient of the fitted curve is the quadratic curvature, and measures the curved shape of the trajectory (Ludwig & Gilchrist, 2002).

To give some structure to this list of measures, we classified them according to three features. The first is the choice of ideal straight line to which the saccade trajectory is compared. Overall direction, saccade deviation, and overall initial direction are calculated relative to a straight line from the start of the saccade to the correct target position. We term these target-based measures. The other measures are calculated relative to a straight line from the start of the saccade to the end of the saccade. We term these endpoint-based measures. These two categories have sometimes been termed deviation and curvature, respectively. We have not followed this convention here, since the term deviation is also commonly used to refer to the overall notion of distortions of saccade trajectory, both target-based and endpoint-based (e.g., in McSorley et al., 2006), and it is in this more general sense that we also use the term deviation in this article.

Target-based measures quantify the extent to which the saccade misses its target, whereas endpoint-based measures quantify the curved shape of the saccade trajectory, irrespective of whether it is on target or not. It is in principle possible that these two types of measure be independent of one another; a saccade may be on target but have reached the target via a very curved trajectory, or conversely a saccade may be a long way off target but have an entirely straight trajectory. However, some evidence suggests that this independence is not realized in practice. McSorley, Haggard, and Walker (2004) found that overall direction, a target-based measure, is positively correlated with area curvature, an endpoint-based measure, though only for saccades that are directed upward,, not downward (see Fig. 6 in McSorley et al., 2004). Similarly, Van der Stigchel, Meeter, and Theeuwes (2007) found that overall direction and initial direction are strongly positively correlated.

The second feature concerns the amount of information that the measure makes use of. An eyetracking device samples gaze position at many different points along the trajectory of the saccade. Saccade deviation, area curvature, and quadratic curvature make use of all these samples, by averaging or integration. We term such measures full-sample measures. The other measures make use of only one sample or a subset of samples that are deemed to be of particular importance, for example the first few samples after saccade start, the endpoint of the saccade, or the point at which deviation reaches a maximum. We term these subsample measures.

It has been argued that full-sample measures are preferable, because combining multiple samples may help to average out measurement error in the eyetracking system (Ludwig & Gilchrist, 2002). Although plausible on theoretical grounds, to our knowledge this assertion has not been tested. If it is the case that different features of a saccade reflect different underlying phenomena, then it may nonetheless be preferable to focus only on a subset of samples, if these are the samples most likely to reflect the phenomenon of interest. In addition, it is not necessarily the case that measurement error is of the same magnitude throughout a saccade. For example, gaze might be measured more noisily while the eye is in motion than when it has stopped moving, which could make the overall direction less noisy than full-sample measures, despite being based on only one sample.

The third distinction is between “early” and “late” measures of saccade trajectory deviation. An early measure of deviation is a type of subsample measure that takes its subsample from the beginning of the saccade. These measures therefore reflect the state of the saccade shortly after initiation, before any corrective processes have brought the trajectory closer in line with the target (Van der Stigchel et al., 2006). Overall initial direction, initial direction, and initial average curvature are early measures, since they use only samples within the first 10 ms of the saccade. The use of 10 ms as a cutoff for the early part of a saccade is an arbitrary choice, and its appropriateness will depend on the expected duration of the saccades in a given experiment. Some previous studies have used 8 ms (e.g., Ludwig & Gilchrist, 2002), 10 ms (e.g., Sheliga, Riggio, Craighero, & Rizzolatti, 1995), 12 ms (e.g., Van der Stigchel & Theeuwes, 2005), or 20 ms (e.g., Van der Stigchel & Theeuwes, 2006) as the cutoff.

Conversely, late measures take their subsample from the end of the saccade. Only one measure, overall direction, is explicitly based on a subsample taken from the end of the saccade, and as such is the only strictly late measure. Many measures are neither early nor late, either because they are full-sample measures or because they are based on a subsample that may occur anywhere during the saccade, for example the maximum curvature.

The fact that so many different measures are in use to quantify saccade trajectory deviation raises two potential problems. The first is the issue of comparability. If different studies on similar topics make use of different dependent measures, it remains unclear to what extent their findings are comparable. Studies of saccade trajectory deviation may in fact be investigating different phenomena if they employ different methods of measurement. Saccade trajectory deviations may be the outcome of a process with several different components, such as selecting the target, inhibiting the distractor, deciding when to execute the saccade, and correcting the saccade trajectory “online”—that is, while it is underway (Quaia, Lefèvre, & Optican, 1999). Different features of a saccade trajectory may be measuring some of these components, but not others. For example, early measures are made before much online correction has taken place, and may therefore reflect more closely the initial amount of attention allocated to the distractor, whereas late measures may additionally reflect the success or failure of online correction.

If the different measures were found to be strongly correlated with one another, then we could be more confident that they all reflect broadly the same phenomenon. One previous study reported the correlations of some measures, and found these to be generally high (between .70 and .98; Ludwig & Gilchrist, 2002). However, this study only investigated endpoint-based measures, and correlation does not of itself guarantee that the measures will respond identically to experimental manipulations.

To more systematically address the problem of comparability, we employed principal components analysis (PCA) with all eight measures. PCA reduces a set of correlated variables to a smaller number of underlying components that describe most of the variance in the data (Hotelling, 1933). If it can be established that particular subsets of measures are likely to reflect the same underlying phenomenon, then we may be more confident in comparing the results of studies using different measures from within one subset. Conversely, where discrepant findings arise, we may be able to explain these as a consequence of having employed two different measures of deviation that may reflect different underlying phenomena.

The second problem is the issue of selecting a measure that maximizes statistical power. All else being equal, we wish to use a measure that gives us the best chance of detecting the effects of our experimental manipulation. The power of a particular measure to detect a particular effect depends on the magnitude of the effect on that measure, relative to the measure’s variance. To quantify the power of each measure, we used the standardized effect size generalized eta-squared (η ² _G), as a metric that is comparable across different study designs (Olejnik & Algina, 2003). If it can be established that a certain measure reflects more clearly the effects of experimental manipulations, then that should be the preferred measure for future studies.

Saccade trajectory deviations have been used as the dependent measure for a wide variety of experimental manipulations. Since it is not feasible to investigate effect sizes for all of these manipulations, we instead restricted the investigation to two well-established experimental paradigms. The first was arguably the simplest target–distractor paradigm possible, one in which a target and a distractor are presented simultaneously. The participant’s task is to make a saccade to the target as quickly as possible. The target and the distractor are distinguishable only by virtue of their shapes (e.g., one is a cross and the other a circle, as in McSorley et al., 2006). In this paradigm, the effect of interest is the negative relationship of saccade trajectory deviation to saccade latency. Saccades that occur very soon after the stimuli appear tend to deviate more toward the distractor, whereas saccades that occur later show less deviation toward the distractor, and may even deviate away from it (McSorley et al., 2006).

The negative relationship between deviation and latency is typically explained as the result of competition between target and distractor, as described above. When target and distractor appear, the oculomotor system generates planned eye movements to both of them. If a saccade is initiated while both of these eye movement plans are still active, the resulting eye movement trajectory will be something of an average between the two plans, and will therefore deviate toward the distractor. Only after some time is knowledge of the task brought to bear, with the result that the plan for an eye movement to the distractor is gradually inhibited. So, the later the saccade is executed, the less it will deviate toward the distractor (McSorley et al., 2006; Van der Stigchel, 2010).

It is particularly important to establish which measure is most sensitive to this basic effect of saccade latency. This is because latency is often investigated as a modulating factor in studies involving additional variables of interest, and in many studies the principal finding is an interaction of saccade latency with this additional variable. For example, elderly people show a more shallow slope relating deviation and latency than do younger people (Campbell et al., 2009), and some manipulations, such as the physical salience of the distractor, are only apparent at short saccade latencies (van Zoest, Donk, & Van der Stigchel, 2012), whereas others, such as the social relevance of the distractor, are only apparent at longer saccade latencies (Laidlaw et al., 2015).

The second paradigm in which we measured effect sizes was one that is designed to investigate the effect of distractor salience on saccade trajectory deviation. In this paradigm, the target appears within an array of vertical lines. One line is oriented slightly differently from the others, and this line serves as the distractor. By varying the extent to which the orientation of the distractor differs from that of the surrounding vertical lines, how this contrast, or “salience,” affects the trajectory of the saccade can be investigated. As we noted above, this paradigm reveals that more-salient distractors (i.e., those whose orientation contrasts more starkly with that of the surrounding lines) elicit greater deviations toward them, but only for short-latency saccades (van Zoest et al., 2012). This finding has been explained as the result of more-salient distractors eliciting more oculomotor activity during the planning of the saccade (White et al., 2012). However, this activity is transient, which results in salience effects on saccade trajectories disappearing at longer latencies (Donk & van Zoest, 2008). Similar findings have been made for other sources of salience, such as the luminance of the distractor (Jonikaitis & Belopolsky, 2014).

We considered it important to investigate the effect sizes for the effect of a basic feature of the distractor because the measures most sensitive to the basic effect of saccade latency may not be the same measures that are most sensitive to changes in the distractor. In view of the fact that many studies have varied the type of distractor (e.g., Jonikaitis & Belopolsky, 2014; Laidlaw et al., 2015; McSorley & Morriss, 2015; McSorley & van Reekum, 2013; van Zoest et al., 2012; Weaver et al., 2011), we wished to be able to recommend optimal measures specifically for this type of study.

Study 1: McSorley et al. (2006)

In Study 1, to investigate measures of saccade trajectory deviation in one of the simplest situations possible, we analyzed data from the basic target–distractor paradigm described above, in which the target and the distractor are two shapes that appear simultaneously at random locations and are not varied in any way. We extracted the eight measures described in the introduction above and used PCA to identify clusters of related measures. We also calculated the effect sizes for the basic effect of saccade latency on trajectory deviation, to identify the measures that have the most power to detect this effect.

Method

Data

The data were taken from a previously published eye movement study (McSorley et al., 2006) with the authors’ permission. Readers are referred to the original article for a detailed description of the methods. Briefly, seven participants completed 420 trials each of a saccade task in which the goal was to make an eye movement to a target shape that could appear randomly in one of four possible locations, while ignoring a simultaneously appearing distractor shape, which appeared nearby. Eye movements were recorded using an EyeLink with a sampling rate of 250 Hz. Figure 2 gives a schematic of the stimulus display.

Data processing

All gaze samples falling outside the dimensions of the stimulus monitor were discarded. Gaze samples that did fall within the dimensions of the monitor were smoothed, in order to average out small-scale sampling noise. This was achieved by replacing the x- and y-coordinates of each sample with the mean of coordinates from all samples within 2.5 ms of the current sample (i.e., smoothing with a “rectangular sliding window”).

For each trial, gaze samples were recentered on the fixation spot to correct for drift in the eyetracking system. This was accomplished by assuming that the participant was fixating the fixation spot as instructed during the 60 ms prior to the onset of the task display. The median gaze position during this time window was then assumed to be the center of the screen, and all samples for the trial were re-centered on this point by rigid body translation.

To extract the first saccade from the processed samples, we used a “velocity peak method” (e.g., Smeets & Hooge, 2003). This method avoids erroneously categorizing small fluctuations in gaze velocity as saccades, as may occur with a fixed saccade velocity criterion (Nyström & Holmqvist, 2010). The first velocity peak was identified as the first set of contiguous samples with a velocity greater than 100°/s. The start- and endpoints of the saccade were identified by searching from this peak backward and forward in time, respectively, until finding a sample with a velocity below 35°/s and an acceleration below 0°/s².

The eight measures of saccade trajectory deviation described above were calculated for each extracted saccade. Each measure was calculated in a clockwise direction. An implementation of all saccade trajectory calculations for the MATLAB programming environment is available from the corresponding author’s website.^{Footnote 1} A baseline measure of deviation was calculated as the mean deviation in trials with no distractor, separately for each target position that appeared in the experiment. This was subtracted from the deviations in distractor trials to correct for any tendency to make slightly leftward or rightward saccades even in the absence of a distractor (Walker & McSorley, 2008). If on a given trial the distractor was located anticlockwise of the target, the sign of the measures was reversed, so that positive values indicate deviation toward the distractor and negative values deviation away. In addition to the eight measures of saccade trajectory, saccade latency was also calculated. Latency is defined as the duration in milliseconds of the period between the onset of the target and the participant’s initiation of a saccade.

Trials were excluded from further analysis if saccade latency was less than 80 ms (suggesting an anticipatory saccade) or greater than 600 ms (suggesting a saccade that was not an immediate reaction to the onset of the stimuli), if saccade landing point was more than 30 angular degrees either side of the target, or if the participant was not fixating the screen within 2 deg of visual angle of the fixation point at the time the saccade was initiated.

This data analysis procedure is slightly different from the published data processing procedure applied in the original study (McSorley et al., 2006). These differences were undertaken to ensure compatibility with the analysis of the data from our own experiment. To check that this harmonization of data processing procedures did not alter the conclusions drawn, we repeated all analyses described below but after processing the raw data according to the procedures described in the original article rather than the procedure described above. This version of the analysis entailed no qualitative differences in any of the conclusions drawn.

To identify groups of measures that may reflect the same underlying phenomenon, a principal components analysis (PCA) was conducted. For each principal component, the loadings of each measure onto that component were calculated. Groups of measures that may reflect the same underlying phenomenon will load maximally onto the same component. To prepare data for PCA, data were combined across all participants by standardizing values within each participant. For each measure, each participant’s mean was subtracted from their values, then values were divided by their standard deviation. Using all standardized values together, eight principal components were extracted. Results are reported for PCA using only those components with eigenvalues greater than 1, indicating that they accounted for more variance than did the measures themselves on average (Kaiser, 1960). The component loadings were calculated using the oblimin rotation so as to allow for correlations among the components themselves.

It is possible that some relevant between-participant differences remain after the standardization procedure, and that the results of the PCA reflect these differences and not a structure of relationships among the eight measures that is common to all participants. To check for this possibility, PCA was therefore also carried out separately for each participant using only their data.

For the analysis of effect sizes, the standardized effect size (η ² _G) for the effect of saccade latency was calculated for each measure. To prepare data for analysis of effect sizes, four “latency bins” were created for each participant. This was achieved by grouping each participant’s trials into four quarters, from lowest to highest latency, and then calculating the mean latency and mean saccade trajectory deviation within that latency bin for each of the eight measures of deviation. For each measure, the participant means were then entered into a one-way analysis of variance, with latency bin as a four-level factor. Effect sizes were based on the main effect of the latency bin factor. In the original study (McSorley et al., 2006), eight latency bins were used, and not four. However, we used four so as to preserve comparability with other studies that also used four (e.g., Tudge & Schubert, 2016; van Zoest et al., 2012).

Results

Principal components analysis

Three principal components had eigenvalues greater than 1, and were therefore included in the final analysis. Area curvature, maximum curvature, and quadratic curvature all loaded maximally onto the first component. These are all measures that are neither early nor late, but measure the curved shape of the saccade trajectory, so we term this the mid-saccade component. Initial direction, overall initial direction, and initial average curvature all loaded maximally onto the second principal component. Since these are all early measures, we term this the early component. Finally, the two remaining measures, saccade deviation and overall direction, loaded maximally onto the third principal component. The interpretation of this third component is somewhat less clear (see the Discussion, below), but since it includes the only measure of late deviation, we term this the late component. Table 2 gives the loadings of the eight measures onto the three components.

Table 2 Loadings for the different measures on the first three components for all four data sets (excluding the down-sampled data from our replication of McSorley et al., 2006)

Full size table

The three components were also positively correlated with each other. The early and mid components were most strongly correlated (r = .44). The late component was somewhat less strongly correlated with the early (r = .23) and mid (r = .21) components. Figure 3 shows the correlations among the individual measures themselves.

Effect sizes: saccade latency

The effect sizes for the main effect of saccade latency were greatest for overall direction (.77) and saccade deviation (.75), the two measures that loaded maximally onto the late component. For the three measures that loaded maximally onto the mid-saccade component, the effect sizes were somewhat smaller (between .30 and .35). For the remaining measures, which loaded maximally onto the early component, the effect sizes were variable, ranging from .07 for initial average curvature to .52 for overall initial direction. All effect sizes are listed numerically in Table 3. Figure 4 gives a visual comparison of the effect sizes. Overall direction and saccade deviation yielded the largest effect sizes, and initial direction and initial average curvature yielded the smallest.

Table 3 Effect sizes (η ² _G) and p values for the main effect of saccade latency for all eight measures for all three data sets, based on the target–distractor paradigm in McSorley et al. (2006)

Full size table

Figure 5 gives an alternative visualization of the differences between a measure with a large effect size, overall direction, and a measure with a small effect size, initial direction. For each measure, the mean saccade latency and deviation are plotted for the four latency quartiles. The established negative association of latency and deviation (McSorley et al., 2006) is clearly visible for overall direction and is large relative to the variance in the measure, whereas the same trend is not clearly discernible for initial direction, and to the extent that the trend exists, it is slight relative to the variance in the measure.

The results of the analysis of variance also illustrate the advantage of a measure with a large effect size over a measure with a small effect size. Analysis of variance compares differences among groups, in this case latency quartiles, to differences within groups, which in this case are a reflection of the variance in the measure being used. As Fig. 5 shows, for initial direction the differences in deviation between latency quartiles are small relative to the variance in the measure, whereas for overall direction the opposite is the case. Initial direction should therefore have less power to detect the effect of saccade latency. The hypothesis test for the analysis of variance confirmed this conclusion. We found a significant main effect of saccade latency quartile on overall direction, F(3, 18) = 33.92, p < .001, but not on initial direction, F(3, 18) = 1.23, p = .33.

Comparison of effects across saccade trajectory

As we noted above, it appears to be the case that the overall direction measure affords a particularly clear reflection of the effect of saccade latency. This provides some initial support for the conclusion that gaze samples from later in the saccade are more informative. A reviewer suggested that we follow up on this conjecture by analyzing in more detail the change in effect size as the saccade progresses from the start- to the endpoint.

To do this, we calculated separate measures of saccade trajectory deviation for different parts of the saccade. To create a set of comparable points along the trajectories of many different saccades of different amplitudes and durations, ten “virtual” gaze samples were created for each saccade, evenly spaced along the path of the saccade. The coordinates of each of these virtual gaze samples were estimated by linear interpolation between the two closest real samples in the saccade (see van Zoest et al., 2012, for a similar use of linear interpolation to create evenly spaced gaze samples).^{Footnote 2} For each of these ten gaze samples, the angle between a straight line from saccade start to the gaze sample and a straight line from saccade start to the target was calculated, as for the saccade deviation measure. The first interpolated sample occurred at one tenth of the distance along the saccade, the second at one twentieth the distance, and so on; the final one occurred at saccade endpoint, and was therefore equivalent to the overall direction measure.

In the results of this additional analysis, the effect size for the main effect of saccade latency on the angular deviation of the saccade was greatest at the end of the saccade (i.e., for overall direction, .77), and lowest at the beginning of the saccade (.68), with a monotonic increase in-between. Figure 6 illustrates this increase in the effect sizes from saccade start to saccade end.

Discussion

On the basis of the results from Study 1, three clusters of measures appear to reflect three different underlying components of a saccade: its early deviation, its curved trajectory, and its later deviation. These components are themselves moderately positively correlated with each other. The later measures, saccade deviation and overall direction, appear to have the greatest power to measure the effect of saccade latency. This conclusion is further supported by the finding that, within the saccade, effect sizes increase for measures based on later gaze samples.

With the exception of the overall initial direction, the early measures seem particularly poorly suited to measuring the effect of saccade latency, since they have low effect sizes relative to the other measures. However, this may be due in part to the fact that McSorley et al. (2006) used an eyetracker with a fairly low sampling rate of 250 Hz. Generally, the effect of a higher sampling rate is to help average out random variance in the eyetracker’s estimates of gaze position, particularly if spatial smoothing of the gaze samples is applied. With a low sampling rate, there may be a large amount of variance in the gaze samples, which probably leads to more variance in the measures themselves, which in turn means smaller effect sizes, all else being equal.

To see why spatial noise might disproportionately affect the early measures of saccade trajectory deviation, it helps to consider Fig. 1. The gaze samples on which the early measures are based are located close to the start of the saccade, near the corner at which the angle of deviation is calculated. This means that these samples have high leverage on that angle: Small movements of these samples can lead to big changes in the angle. Movements of the same magnitude for later samples lead to much smaller changes in the angle of deviation.

Study 2: replication of McSorley et al. (2006)

To check the generalizability of the results from Study 1 to a new group of participants and to different eyetracking system, we conducted our own experiment with the same paradigm, and repeated all the analyses described above. In addition, to check whether the sampling rate of the eyetracker is relevant for effect sizes, we conducted the experiment using an eyetracker with a high sampling rate (1250 Hz), and conducted the analysis once using all samples, and a second time after down-sampling the data to 250 Hz.