Pointing to Visible and Invisible Targets

Flack, Zoe M.; Naylor, Martha; Leavens, David A.

doi:10.1007/s10919-017-0270-3

Pointing to Visible and Invisible Targets

Original Paper
Open access
Published: 11 January 2018

Volume 42, pages 221–236, (2018)
Cite this article

Download PDF

You have full access to this open access article

Journal of Nonverbal Behavior Aims and scope Submit manuscript

Pointing to Visible and Invisible Targets

Download PDF

Zoe M. Flack¹,
Martha Naylor¹ &
David A. Leavens¹

4997 Accesses
9 Citations
7 Altmetric
Explore all metrics

Abstract

We investigated how the visibility of targets influenced the type of point used to provide directions. In Study 1, we asked 605 passersby in three localities for directions to well-known local landmarks. When that landmark was in plain view behind the requester, most respondents pointed with their index fingers, and few respondents pointed more than once. In contrast, when the landmark was not in view, respondents pointed initially with their index fingers, but often elaborated with a whole-hand point. In Study 2, we covertly filmed the responses from 157 passersby we approached for directions, capturing both verbal and gestural responses. As in Study 1, few respondents produced more than one gesture when the target was in plain view and initial points were most likely to be index finger points. Thus, in a Western geographical context in which pointing with the index finger is the dominant form of pointing, a slight change in circumstances elicited a preference for pointing with the whole hand when it was the second or third manual gesture in a sequence.

How to point and to interpret pointing gestures? Instructions can reduce pointer–observer misunderstandings

Article 10 November 2016

Perspective determines the production and interpretation of pointing gestures

Article Open access 15 October 2020

The effects of distance on pointing comprehension in shelter dogs

Article 10 February 2021

Humans use a diverse array of deictic gestures, from pointing with the lips (Enfield 2001), to index-finger pointing (Eibl-Eibesfeldt 1989), to pointing with the whole hand (e.g. Wilkins 2003). Even within a culture, people display remarkable variety in their pointing hand shapes (Kendon and Versante 2003), with different shapes used for different functions. Kendon and Versante (2003) analyzed the pointing gestures of Neapolitans, finding that different hand shapes signified different dialectical functions. Similarly, Wilkins (2003) noted that the Arrernte people of Australia frequently give directions with whole-hand pointing gestures. Enfield et al. (2007) analyzed the variety of pragmatic functions of different pointing types among Lao speakers in Laos, finding that “big” points signified a primary location marker, whereas another, “small” form of pointing was used to more subtly support location information provided in speech. Hence, research over the last couple of decades has revealed pointing with the index finger to be—far from the “canonical” human pointing gesture—merely one of a large number of gestural devices for deixis that humans use, often as paralinguistic adjuncts to ongoing discourse, and variable in both form and semantic function within and between cultures (e.g. Enfield 2001; McNeill 2003).

Variations in gesture frequency, form, and size can reflect multiple communicative demands in a conversational interaction, although there is evidence that humans continue to display some gestures, even when not visible by the recipient (Alibali et al. 2001), the speaker’s awareness of the recipient’s knowledge is important. Cleret de Langavant et al. (2011) found differences in pointing and underlying neural activity when participants pointed for a recipient rather than pointed for no one in particular, demonstrating significant cerebral blood flow changes in the right hemisphere during communicative pointing, compared to non-communicative pointing. Elsewhere, use of less elaborated gestures when participants had a shared understanding of a communicative topic suggests that speakers use different communicative tactics depending on the different task demands presented by varying degrees of shared knowledge between sender and receiver (Gerwing and Bavelas 2004; Holler and Stevens 2007).

In experimental studies, the number of digits extended by the pointing hand in humans is also subject to dynamic contextual influences. For example, Iverson and Goldin-Meadow (1997) reported that when they blindfolded otherwise sighted participants in their study, these people displayed a dramatic shift away from pointing with the index finger and towards pointing with the whole hand, when gesturing during a Piagetian conservation task. In this respect, they resembled congenitally blind children, who also tended to point with their whole hands. This finding was later replicated by Iverson and Goldin-Meadow (2001), who commented, “the fact that the blindfolded children also use [index-finger] pointing gestures infrequently, suggests that even temporary loss of vision affects the ability to establish a line of regard” (p. 420). Thus, simply blocking participants’ visual access has a dramatic effect on the number of fingers with which people pointed, in some contexts. Iverson and Goldin-Meadow (2001) interpreted these patterns to suggest that there were two contrasting cognitive tactics at play: where a target could be encompassed by a line of regard, then pointing with the index finger served to augment the visual perception of the referent, but where the referent could not be seen—e.g. when participants were either congenitally blind or sighted, but blindfolded—then participants used a communicative tactic focussed on path segments, as a series of waypoints to the referent. These findings led us to hypothesize that blocking visual access to a referent might also alter the shape of the pointing hand in more naturalistic, less controlled circumstances (i.e. to explore the ecological validity of the previous, laboratory-based findings). Here, we wanted to find out whether this change in the number of fingers extended in a pointing hand was merely an artifact of laboratory testing or a more general phenomenon. If we find that, for example, people in an outdoors, naturalistic setting also pointed more with the whole hand when direct sight of a referent was blocked, then this would be consistent with the interpretation of Iverson and Goldin-Meadow (2001) that establishing a line of regard is important to hand shapes while pointing. In contrast, if we fail to find this influence of target visibility in a more naturalistic context, then this might implicate other aspects of their experimental or laboratory environment than line of sight.

In a task eliciting directions to local landmarks, Iverson (1999) reported that “information about direction and location tended to be conveyed primarily in gesture” (p. 1140). Displacement is a defining feature of language (e.g. Fitch 2010), and with the present studies we sought to impose a problem of displaced reference. We expected to elicit substantial amounts of pointing behavior, providing a window into how visible and invisible targets influence the morphology of nonverbal referential signaling. Relative to situations in which the referent is clearly visible, we thought an invisible displacement condition would require greater gestural elaboration, and sought to directly test this assumption, by measuring gestural sequence lengths.

In Study 1, we adapted a procedure by Kita (2003), and administered two experimental conditions: in the In-view condition, a researcher asked passersby for directions to a local landmark that was fully in view behind the researcher. In the Out-of-view condition, the same researcher asked passersby for directions to the same local landmark that was located at a similar or identical distance, but completely blocked from view by buildings. In each case, we recorded the palmar orientation and number of extended digits of the pointing hand, if any, for each pointing gesture displayed in this study. We administered this protocol in three different locations: on an English university campus, in a large English city, and in a small English town. We expected to see longer gesture sequences in the Out-of-view condition than the In-view condition because of the need to impart more information in the absence of a visible target; that is, we expected that the invisible target would create a more demanding communicative task, as evidenced by gesture sequence length. We also expected there would be more whole-handed pointing in the Out-of-view condition than in the In-view condition, based on the findings of Iverson and Goldin-Meadow (1997, 2001).

In Study 2, we used a similar procedure to that of Study 1, but used concealed recording equipment to capture the naturally occurring speech and gesture of our participants. We covertly collected audio and video recordings and obtained consent “after the fact”. With this study, we aimed to (a) confirm the findings of Study 1, (b) examine speech-gesture relationships, and (c) examine palm orientations while pointing (Kendon and Versante 2003).

Study 1

Method

Participants

Data were collected from 605 participants; 200 in the city of Brighton (100 in the In-view condition of which 48 were females, and 100 in the Out-of-view condition of which 56 were females), 205 in the town of Devizes (100 in the In-view condition of which 63 were females and 105 in the Out-of-view condition of which 52 were females), and finally, 200 on a university campus in the south of England (100 in the In-view condition of which 53 were females and 100 in the Out-of-view condition of which 54 were females). Subjects were adults who were approached in one of the three locations and assigned to a condition based on their proximity to the target location. There were no exclusion criteria for selection of participants, and the ethnic composition of each sample was apparently representative of each locale, although there was no systematic collection of data on ethnicity.

Locations and Targets

In Brighton, the target location was the Royal Pavilion. The location of the researcher was equidistant from the target location in both the Out-of-view and In-view conditions, a distance of 198 m. In Devizes, the target location was a local public library; here, the distance of the researcher from the target location was 116 m in the Out-of-view condition and 100 m in the In-view condition. At the university campus, the target was the main library building and the researcher was equidistant from this landmark in the two conditions, at a distance of 152 m.

Procedure

Participants were approached in close proximity to the target in one of the three locations. There were two conditions for each location; one where the target was in view of the participant (the In-view condition) and another where the target was not in view of the participant (the Out-of-view condition). In the In-view condition the target was directly in front of the participant (i.e. directly behind the researcher), and in the Out-of-view condition the target was not in direct view, due to intervening buildings, although a simple right-angle path could be described to the target. Using a standardized script, participants were asked for directions to the target location. Participants were always approached when they were already facing towards the target landmarks, so that the researcher had her back to the target location. Their pointing gestures were recorded on a paper sheet by the researcher who was both observer and interlocutor for every interaction, and observation ended when the participants withdrew from the interaction.

Behavioral Measures

Five types of pointing gestures were initially recorded, categorized by the position of the forearm, hand, and fingers, following Kendon and Versante (2003). In this coding scheme, there were two kinds of index-finger points: (a) index palm down (ID) where the forearm was pronated, palm facing downwards and index-finger extended and (b) index palm vertical (IV) where the forearm was extended in a neutral position, the palm of the hand in a vertical position and index-finger extended. There were three types of open-hand points: (c) whole-hand palm up (OU) where the hand was fully open with palm supine, (d) whole-hand oblique (OB) had the palm at an oblique angle and (e) whole-hand palm vertical (OV). As reported, below, however, interobserver reliability for this five-category coding scheme was poor, therefore, categories were collapsed into two categories for analysis, here: index-finger points and whole-hand points.

Reliability

Reliability was assessed by observer and by order of gesture (i.e. first, second, and third gestures). In each of the three locations, 30 interactions (90 in total) were independently coded by two observers, and assessed for interobserver reliability (15% of observations). In both Brighton and the university campus, the same two observers were used, so reliability was assessed on the 60 cases coded by these two individuals, and reliability is reported separately for the Devizes location, for an additional 30 cases. In the reliability samples, we examined (a) the agreement between two observers that a first, second and third gesture occurred, and (b) given that two observers agreed that a pointing gesture occurred, the agreement on the type of pointing gesture.

Our initial coding of gestures yielded Cohen’s kappa values ranging from .44 to .73. When we collapsed the data into two types: index-finger and whole-hand pointing, reliability estimates significantly improved, and therefore we focus on these two types of gesture in our analyses. In all three locations, there was 100% agreement on whether a point occurred as the first gesture, 100 or 93% (Cohen’s kappa = .86) agreement on whether a second gesture occurred, and 90% (Cohen’s kappa = .45) or 97% agreement on whether a third gesture occurred (in Devizes, both observers agreed in 29 out of 30 cases that no third gesture occurred, and there was one disagreement about the presence of a third gesture, hence kappa is not appropriate). For the first gesture, there was 100% agreement on which type of gesture, agreement on type of gesture was between 87% (Cohen’s kappa = .72) and 100%. Because there were only 3 cases in which both observers agreed that a third gesture occurred, we did not include third gestures in analyses related to gesture type or the effects of visibility on gesture type.

Results

Initial Analyses

There were no effects of location (i.e. whether the data were collected in Brighton, the university campus, or Devizes) or gender of participant on either gesture sequence lengths or gesture types, therefore neither location nor gender will be further considered. Of the 605 participants, one did not display a manual pointing gesture (.17%), hence the total sample size in the following analyses is 604.

Sequence Length

Sequences ranged in length from 1 to 3 gestures (no participant displayed more than 3 pointing gestures). Unsurprisingly, gesture sequence length was significantly longer in the Out-of-view condition (Mdn = 2 gestures) than in the In-view condition (Mdn = 1 gesture); U(1) = 3608, Z = − 21.77, p < .001. As depicted in Fig. 1a, 97% (296/304) of participants approached in the Out-of-view condition went on to display a second gesture, whereas only 6% (17/300) of participants in the In-view condition did so (χ²(1, N = 604) = 505.57, p < .001). Nineteen percent (57/304) of the participants in the Out-of-view condition displayed a third point, whereas none of the 300 participants in the In-view conditions did so (χ²(1, N = 604) = 62.11, p < .001). Typically, participants in the Out-of-view condition used subsequent points, after their first, to outline a route to the landmark in question.

Effects of Target Visibility on Gesture Type

There were significant effects of target visibility on gesture type for both the first gesture (χ²(1, N = 604) = 25.98, p < .001) and the second gesture (χ²(1, N = 313) = 4.85, p = .028); see Fig. 2a. There were substantially more whole-handed points displayed in the Out-of-view condition, compared to when the targets were in full view (the In-view condition).

Use of Whole-Hand Pointing in the Out-of-View Condition

No participant in the In-view condition displayed more than two manual points. In the Out-of-view condition, 57 participants displayed three consecutive points. With increasing ordinal number of the pointing gesture, there was an increased probability that a point with the whole hand would be displayed (Cochran’s Q(2) = 43.45, p < .001).

Indexicality Indices

Given the large number of points with the whole hand recorded during this study, here we categorized people in terms of the degree to which they displayed index-finger or whole-hand pointing, as a function of the length of their gestural sequences. We depict these data with an “indexicality index”, defined in Leavens and Hopkins (1999) as:

$$\frac{I - W}{I + W}$$

where I means the frequency of index-finger points and W means the frequency of whole-hand points; this renders a scale ranging from − 1.0 to + 1.0, with positive numbers for samples in which index-finger points outnumber whole-hand points and negative numbers for the opposite result. In the case that I = W, zero is assigned as the quotient. As is evident in Fig. 3a, there is an immense swing away from a preference for pointing with the index finger to pointing with the whole hand, with an increase in subjects’ gestural sequence lengths. Thus, pointing with the whole hand became more prominent in these samples’ gestural repertoires as their apparent need to elaborate increased.

Sequences

We characterized two-gesture sequences for both conditions as I–I, I–W, W–I, or W–W where W refers to whole hand point, and I refers to an index finger point. There was a significant difference in the distributions of these two-gesture sequences across the In-view (n = 17) and Out-of-View (n = 239) communicative contexts (χ²(3, N = 256) = 11.81, p = .008). Overall, 70 of 256 people (27.3%) who displayed two-gesture sequences displayed two successive index-finger points, avoiding use of the whole hand, but the majority of people displaying two-gesture sequences incorporated at least one whole-hand point into their sequences (186/256 or 72.7%). This, despite the fact that the number of two-gesture sequences beginning with an index-finger point (213/256 or 83.2%) was significantly larger than the number of two-gesture sequences beginning with a whole-hand point (43/256 or 16.8%; binomial test, Z(255) = 10.56, p < .001). Hence, although pointing with the index finger was the preferred initial gesture, most people who felt the need to display two gestures in this observational context incorporated a whole-hand point into the sequence.

Discussion

We found evidence that context influences gesture production. Specifically, adults produced fewer gestures when the target location was visible than when it was not, consistent with our expectation that a single point to a visible target requires less elaboration. We also found fewer whole hand points when the target location was visible than when it was not. Index finger points accounted for more first gestures than other point types, but where further gestures were needed, these were more likely to be whole hand points.

Although we found evidence of contextual influences on gesturing we did not record speech. A reviewer of our initial submission recommended filming some additional trials because there is evidence that gestural responses vary as part of a wider communicative interaction (e.g. Enfield et al. 2007; McNeill 2003). Therefore, we were keen to examine the speech types accompanying these gestures, to address this possibility. Moreover, we achieved poor interobserver agreement in Study 1 on palm orientations, and it was expected that video records would foster better reliability on this measure, permitting comparison with Kendon and Versante (2003). Finally, as also noted by an anonymous reviewer, a second study with video records would permit a direct verification of the results of Study 1.

Study 2

We obtained ethical approval for covert collection of audio and video recordings of the interactions in question with a proviso that consent be obtained “after the fact”. In Study 2 we used a similar procedure to that of Study 1, but used concealed recording equipment to capture the naturally occurring speech and gesture of our participants. The amount of time involved in setting up the covert filming apparatus and, especially, in obtaining post hoc informed consent, resulted in a reduced sample size, relative to Study 1. We aimed to confirm the patterns we found in Study 1, to examine the relationship between the content of speech and pointing types, and to increase interobserver reliability for the palm orientations during pointing.