(1) Field site and study subjects
We conducted this study from December 2022 to March 2023 during the dry season in Mole National Park, Ghana (09°12′–10°06′ N, 01°25′–02°17′ W). This park is located in the Guinea savannah zone (Lawson et al. 1968), which mainly comprises the open savannah woodland, dominated by a layer of grass and riverine forest (Schmitt and Adu-Nsiah 1993). We studied a one-male group of patas monkeys (named the Motel group) comprising 17 individuals, including one adult male, five adult females, eight juveniles and three infants. Individuals in the group were identified and almost completely habituated by 2021.
(2) Spatial data collection
We fitted GPS collars (GLT-02, Circuit Design, Inc.) to the sole adult male (MC) and four adult females (Sa, Sk, Kr and Sh), except for one adult female that was not fully habituated to observers (supplementary material, Fig. S1). Veterinarians (YM or RS) immobilized the animals with zoletil (5 mg/kg) and medetomidine (0.1 mg/kg) using a blowpipe and later reversed the drugs with atipamezole (0.5 mg/kg). The GPS collar (less than 250 g) was less than 5% of the body mass (6.2–10.8 kg (n = 5); YM unpublished data), as suggested by the American Society of Mammalogists (1998). The GPS collars recorded data on location (latitude and longitude), time and positional dilution of precision (PDOP, an index of position accuracy) every 10 min from 7 a.m. to 5 p.m. After data collection, we successfully removed the collars from all the monkeys using remote electronic drop-off systems. We calculated the three-dimensional positioning success rate (the number of successful positioning attempts divided by the total number of positioning attempts), which was 100% for all data. We also confirmed a PDOP of 6 or less (D’Eon and Delparte 2005) in all data to ensure that only high-precision GPS fixes were used for analyses. Consequently, 28,849 GPS fix points were collected and distributed almost evenly across all dates and individuals for the analysis (Table S1).
(3) Behavioral data collection
One of five GPS-collared individuals was observed for 1-h sessions at a time. If we lost the focal individual but found it within 20 min, we resumed recording until the total following time reached 1 h. Consequently, the individuals were observed for a total of 708 h (Table S1). During focal animal observation, we recorded the individual’s activities (foraging, feeding, travelling, resting, drinking and others) every 10 min, corresponding to the timing of the GPS fixes. Foraging was defined as searching for food such as moving rocks, digging up the ground or looking at the ground and surrounding trees, regardless of whether moving forward or not and without having food in the mouth or with hands; feeding was defined as having food in the mouth or manipulating food with the hands, which was recorded exclusively with foraging; travelling was defined as continuing to move without looking at the ground or surrounding trees; resting was defined as an inactive state occurring either on the ground or tree.
We recorded the visibility of the location (clear or unclear) where the focal animal was observed at 10-min intervals, in accordance with the recording interval of the GPS collars. This is because habitat visibility generally affects the efficacy of visual information (Byrne 2000; Davidson et al. 2021). Clear locations included open lands with almost no undergrowth and unclear locations included rocky areas, grasslands, and forests. Unclear locations were typically covered with grass taller than the eye height of the patas monkeys (ca. 30–50 cm). We also recorded the number of non-collard individuals (eight juveniles and one adult female) within 20 m of the focal animal using instantaneous scan sampling at 10-min intervals. A distance of 20 m around the focal individual was the reliable range for observing other individuals. We excluded infants because they are usually in contact with their mothers (Chism 1986).
The time of visual monitoring for the focal animal was recorded using one-zero sampling at one-minute intervals. Patas monkeys perform the behavior known as ‘arboreal scan’ or ‘bipedal scan’ (Enstam and Isbell 2002; Rowell and Olson 1983): the former is defined as gazing into the distance with the head moving from side to side for more than 3 s on a tree and the latter is defined as gazing while standing on hind legs on the ground. We also frequently observed the patas monkeys gazing into the distance on the rocks and fallen trees; thus, we regarded these situations and two of the scanning behaviors as visual monitoring. When tracking females, the time of emitting contact calls was recorded using one-zero sampling at one-minute intervals. We did not record the contact calls of the male because the sole male hardly emitted contact calls (Nakagawa 1992).
(4) Statistical analyses
(4-a) Handling of location data collected by GPS collars and behavioral records
We handled the behavioral records of focal animals that lasted for more than 30 consecutive minutes. These included feeding, foraging, travelling or pausing (i.e., less than 10 minutes resting). Regarding visual monitoring, patas monkeys exhibit this behavior to observe group members and vigilance against predators and other groups (Enstam and Isbell 2002). Therefore, we omitted the visual monitoring data when we visually or auditorily detected potential predators such as olive baboon (Papio anubis) and African rock python (Python sabae), other groups of conspecifics or people while tracking the group to eliminate vigilance outside of the group. Consequently, we used data from 543 visual monitoring for analyses (MC: n = 120, Sa: n = 87, Sk: n = 140, Kr: n = 128, Sh: n = 68). Data on contact calls were excluded during encounters with other groups of conspecifics or other animals (Nakagawa 1992) to eliminate unrelated contexts and focus on analyzing the role of contact calls in coordinating group cohesion and synchrony. Consequently, we used 142 contact calls for analyses (Sa, n = 37; Sk, n = 65; Kr, n = 16; Sh, n = 24).
Behavioral records and GPS fixes were combined as follows: GPS fixes were collected every 10 min and the location of visual monitoring or contact calls was assigned to the GPS fix recorded immediately before the occurrence of these behaviors. We used only the behavioral records that occurred until 5 min later from GPS fixes to avoid a larger gap between the exact location at the time of these behaviors and GPS fixes. We calculated several indicators from GPS fixes using ‘adehabitatLT’ package (Calenge 2006) and ‘sf’ package (Pebesma 2018) in R as follows: (1) IID between individuals and (2) individual travel velocity (m/min). Generally, the spatial cohesion of groups was estimated using (1) and the number of members in close proximity to the focal animals (Cowlishaw 1998; Byrne 2000; King and Sueur 2011), while temporal synchrony was measured using (2) (Bousquet et al. 2011). We created time blocks of 10 min in accordance with the record interval of GPS collars, for which we established the distance covered by individuals as travel velocity (i.e., distance covered per 10-min time block; Fig. 1). As for the number of adults within 20 m of the focal animal, we combined the number of collared individuals calculated from GPS fixes with the presence of the non-collared female (1 or 0) collected by direct observation.
(4-b) Which indicators are used for each sex to monitor group cohesion and synchrony? (Model 1)
Our modelling approach was twofold when running the generalised linear mixed models (GLMMs). First, we examined which indicators of group cohesion and synchrony were related to the occurrence of monitoring behavior of each sex in the same 10-min time blocks (Model 1, Fig. 1). Second, with predictor variables that were significantly related to the occurrence of the monitoring behavior, we assessed changes in these predictor variables in the next time block, that is, the effect of monitoring behavior on future group cohesion and synchrony (Model 2). We initially aimed to construct separate models for conditions in the environment in which the group was in either clear or unclear visibility. However, owing to the limited number of samples under unclear conditions for both visual monitoring and contact calls, we restricted our analyses to data collected under clear visibility conditions.
In Model 1, we first investigated the occurrence of visual monitoring in females (Model 1a). The occurrence of visual monitoring (1 or 0) was included as a response variable. The following indicators were added as predictor variables: (i) nearest IID between the focal female and other females; (ii) IID between the focal female and the male; (iii) mean IID between other collared individuals; (ⅳ) the number of adults and (v) juveniles within 20 m from the focal female; (vi) travel velocity of focal female; (vii) whether the focal female was faster than the nearest collared individual (1 or 0), (viii) absolute value of the difference between the travel velocity of focal female and mean travel velocity of others.
Second, we analyzed the male as well (Model 1b). We included the following variables as predictor variables, which were largely the same as in the female model, to examine their differences: (i) nearest IID between the male and the collared female; (ⅱ) mean IID between collared females; (ⅲ) the number of females and (iv) juveniles within 20 m from the male; (v) travel velocity of the male; (vi) whether the male was faster than the nearest collared female (1 or 0) and (vii) absolute value of the difference between the travel velocity of the male and mean travel velocity of collared females.
Third, we examined the occurrence of female contact calls (Model 1c). We included the occurrence of contact calls (1 or 0) as the response variable and added the same predictor variables as in Model 1a to investigate the differences in indicators affecting visual monitoring. Variables (i)–(v) in Model 1a and (i)–(iv) in Model 1b were treated as indicators of spatial cohesion, whereas variables (vi)–(viii) in Model 1a and (v)–(vii) in Model 1b were used as indicators of temporal synchrony within the group. For the analysis, we included session ID as a random intercept in all models to account for non-independence among observation sessions and Focal ID as a random intercept in models for females to control for repeated measures from the same individual.
(4-c) Do visual monitoring and calling affect group coordination? (Model 2)
The next step was to examine whether visual monitoring and calling would coordinate group cohesion and temporal synchrony in the future. Hence, three different 10-min time blocks were used: one containing visual monitoring and calling, another representing the 10 min before the behaviors and the third covering the 10 min after the behaviors (hereafter: current time periods, previous time periods and future time periods; Fig. 1). Using predictor variables significantly related to the occurrence of visual monitoring and calling in Model 1, changes in these predictors were examined by comparing the pre- and post-visual monitoring and calling times.
First, we assessed the function of female visual monitoring as a group coordination (Model 2a). Because the number of juveniles within 20 m was significantly related to the occurrence of visual monitoring in Model 1a, the effect of visual monitoring on coordinating spatial cohesion with juveniles was investigated. Therefore, we incorporated changes in the number of juveniles within 20 m at the future time as the response variable (i.e., the value obtained by subtracting the number of juveniles at the current time from that at the future time). We added (ⅰ) whether visual monitoring occurred at the current time (1 or 0), (ⅱ) whether the number of juveniles within 20 m at the current time decreased from the previous time (1 or 0) and (ⅲ) their interaction as predictor variables. This interaction was added based on the prediction that a decrease in the number of juveniles would lead to increased visual monitoring, which in turn would help increase the number in the future.
Second, we assessed the function of visual monitoring in the male. The travel velocity difference between himself and females were significantly related to the occurrence of visual monitoring in Model 1b. Thus, we investigated whether visual monitoring of the male is effective in maintaining a small velocity difference with females. We incorporated the change in velocity difference from females between the current time period and the future time period as the response variable (Model 2b). We added (i) whether visual monitoring occurred at the current time (1 or 0), (ⅱ) whether the velocity difference with females at the current time period was smaller than the previous time period (1 or 0) and (ⅲ) their interaction as predictor variables. We incorporated the interaction because we predicted that when the velocity difference with females was small, visual monitoring would help to keep the small difference in the future.
Third, we examined the function of female contact calls. The travel velocity difference between herself and others were significantly related to the occurrence of contact calls in Model 1c. Hence, we first examined whether contact calls were effective in maintaining a small difference in travel velocity with others. We included the change in velocity difference from others between the current time period and the future time period as the response variable (Model 2c-1), the same as Model 2b. We added (i) whether contact calls occurred at the current time (1 or 0), (ⅱ) whether the velocity difference with others at the current time period was smaller than the previous time period (1 or 0) and (ⅲ) their interaction as predictor variables.
To further investigate the potential influence of contact calls on group coordination, we tested an additional model to examine whether contact calls affected changes in the travel velocity of other individuals. Unlike visual monitoring, vocalizations are inherently directed toward recipients and may elicit behavioral responses in receivers. We incorporated the change in the mean velocity of others from the previous time period (i.e., before calling) to future time periods (i.e., after calling) as a response variable (Model 2c-2). We added (ⅰ) whether contact calls occurred at the current time (1 or 0), (ⅱ) whether the velocity difference with others at the current time period was smaller than the previous time period (1 or 0), (ⅲ) their interaction and (ⅳ) focal velocity at the current time period. We added (ⅳ) to test whether the increase in other’s velocity was due to the contact calls rather than simply a consequence of the focal individual moving faster.
We included random effects in Model 2, following the same structure as in Model 1.
Model implementation
All analyses were conducted in R version 4.2.1 (R Core Team 2022). GLMMs were fitted using the ‘glmmTMB’ package (Brooks et al. 2023). Model 1 was fitted with a binomial distribution and a logit link function. Model 2 used a Gaussian distribution with an identity link function following confirmation of residual normality. We detected no problems with the GLMMs when checking the model assumptions, including under/overdispersion and zero-inflation, using the ‘simulateResiduals’ function in the DHARMa package (Hartig and Lohse 2022). Variance inflation factors (VIFs) were calculated for all test predictors using the ‘check_collinearity’ function in the performance package (Lüdecke et al. 2021), indicating low collinearity (VIF < 10; Roberts and Roberts 2015). If the models included interaction terms (Model 2), we performed post hoc pairwise comparisons to compare the estimated marginal means among parameters. We used the ‘estimate_contrasts’ function in the model-based package (Makowski et al. 2020) to conduct this analysis and p-values were adjusted using Holm’s method. The alpha level was set at 0.05 for all analyses.