Introduction

The ability to vocalize is an important aspect of animal behavior, allowing individuals to convey information to others concerning their social relationships, status, age, sex, the location of resources, and the presence of potential threats (e.g., Strier 2007). This information, in turn, permits conspecifics and heterospecifics to respond appropriately (e.g., fleeing, heading toward a food source) (Bradbury and Vehrencamp 1998). Phylogenetic relationships play a role in the development of primate vocalizations. For example, the usage and structural properties of alarm calls in Cercobeus torquatus atys is similar to that of the closely related Cecrocebus galeritus (Range and Fischer 2004; Waser 1982). In addition, vocal repertoire sizes of Gorilla gorilla and Gorilla beringei beringei are similar (Salmi et al., 2013). A correlation also exists between the structural similarities within the loud calls of surilis, Presbytis spp., and data on their genetic distance (Meyer et al. 2012).

A positive association between vocal and social complexity exists in a variety of primate species (e.g., Bouchet et al. 2013; McComb and Semple 2005). This association support the social complexity hypothesis, which proposes that with increasingly well-developed social structure (e.g., larger group size, distinct social roles, stable social network) there is selection for increasingly well-developed vocal behavior (e.g., larger vocal repertoire, individual distinctiveness, diverse use across contexts) (Bradbury and Verhencamp 1998; Freeberg et al. 2012). Such vocal behavior adaptations provide flexibility that allows individuals to cope with the challenges of more complicated social arrangements, by allowing more detailed communication. For example, much larger repertoires are present in Cebus capuchinus (27 calls, group size 16–21) and Saimiri sciureus (52 calls, group size 15–75) than in Aotus spp. (4–6 calls, group size 2–6) (Andrew 1963; Campbell et al. 2011; Gros-Louis et al. 2008; Kantha et al. 2009; Moynihan 1964; Schott 1975).

A variety of external and internal factors can alter the spectrotemporal features of vocalizations, including age, context, and sex. For example, juveniles have specific calls in multiple animal groups such as primates, bats, and rodents (Moss et al. 1997; Sokoloff and Blumberg 2004; Zimmerman 1995), which they typically use to request help from caregivers (Redican 1975), although some use calls to coordinate play behavior (Burke et al. 2018). Infant calls often have higher frequencies than adult ones, which is likely due to their smaller body size (e.g., Fischer et al. 2002; Inoue 1988) and is an effective indicator of age. In addition, juvenile calls may be shorter (e.g., Hradec et al. 2017). Owing to differences in anatomical features such as lung capacity, vocal tract length, and vocal fold size and thickness, smaller animals should produce calls that are shorter and with a higher fundamental frequency (Ey et al. 2007).

Context can also have a major influence on the structure and use of vocalizations. For example, the motivational-structural rule proposes that calls that have a hostile or friendly intention differ in form, with hostile calls being harsh and low frequency to suggest size to an opponent, while friendly or appeasement calls are more tonal and higher in pitch (Morton 1977). This is hypothesized to have evolved to avoid physical conflict, thus saving energy and reducing the risk of physical harm (Morton 1977). This relationship between call structure and context may also reflect the caller’s emotional state. In times of high emotional arousal, respiration, salivation, and muscular tension change, resulting in changes to fundamental frequencies, durations, and rates of vocalizations (Pollermann and Archinard 2002; Titze 1994).

Sex differences in vocalizations occur in many species. For example, sex differences exist in the lowest frequency, peak frequency, and end frequency of phee calls in Callithrix kuhlii (Smith et al. 2009). Similarly, male alarm barks in Papio ursinus are of a lower frequency and longer than those made by females (Fischer et al. 2002), while female juvenile Macaca mulatta emit more complex vocalizations and longer calls than male juveniles (Tomaszycki et al. 2001). Similar to the structural differences expected from adults and juveniles due to size differences, we can predict that species showing sexual dimorphism should display differences in call structure. Evidence to support this prediction exists across various primate species (Ey et al. 2007). However, an exception to this trend exists in Leontopithecus rosalia, which show no external sexual dimorphism but display structural differences in calls between sexes (Benz et al. 1990). These differences may result from differences in the medial ventral laryngeal sac, which is larger in males than in females (Hershkovitz 1977).

We have limited information on the ecology and behavior of Pithecia species, likely due to the difficulties involved in conducting studies of wild populations, such as their often cryptic behavior and coloration, resistance to habituation, and low population densities (Ayres 1986; Mittermeier and Roosmalen 1981; Norconk 2006; Pinto et al. 2013; Vie et al. 2001). Pithecia occupy home ranges of 10.3200 ha and live in groups of 212 individuals (Izawa and Yonada 1981; Lehman et al. 2001; Norconk and Setz, 2013). These groups may include multiple breeding males and females or consist only of a breeding pair and young (e.g., Lehman et al. 2001; Setz and Gaspar 1997). They are territorial, with some studies suggesting they use long calls to defend their home ranges (DiFiore et al. 2007; Norconk 2006, 2011). The IUCN currently consider Pithecia spp. as Least Concern (Mittermeier et al. 2015). However, all Pithecia species are likely impacted by hunting, habitat fragmentation, and anthropogenic disturbance (Marsh 2014; Norconk and Setz, 2013). The recent revision of the taxonomy of the genus Pithecia describes 16 full species where there were previously 5, creating a need for further research on these species, including their vocalizations (Bezerra et al. 2017; Marsh 2014). Previous studies of vocal behavior in Pithecia have described 718 call types in Pithecia pithecia and Pithecia aequatorialis (Buchanan 1978; Henline 2007; Keiran 2012).

Although vocal repertoires exist for Pithecia species, there are no published studies examining vocal communication in this group. Accordingly, here, we aimed to add to the current understanding of Neotropical primate communication by investigating the behavior of Pithecia chrysocephala (golden-faced sakis) living in a forest fragment within the city of Manaus, Amazonian Brazil. Pithecia species can survive in isolated forest fragments and such areas are useful because it is easier to find and follow groups (Boyle 2016; Ferrari et al. 2003; Pinto et al. 2013), although the primates may show behaviors that differ from those of their counterparts in continuous forests (Schwitzer et al. 2011).

Our goals were to determine the vocal repertoire of Pithecia chrysocephala, investigate the behavioral contexts in which they use calls, and examine the association between behavioral context and call structure. We tested predictions derived from five hypotheses:

  1. 1)

    If congeneric species have similar vocal repertoires (see Henline 2007 for Pithecia), then the vocal repertoire of P. chrysocephala will consist of 7–18 calls.

  2. 2)

    If the P. chrysocephala vocal repertoire fits the Social Complexity Hypothesis (Freeberg et al. 2012), then vocal repertoire size will be comparable to that of other New World species with similar group sizes and family structure, such as Aotus spp. and Callicebinae.

  3. 3)

    If age affects call structure (e.g., Zimmermann 1995), then P. chrysocephala will have juvenile-specific calls that are structurally different from those of adult males and females.

  4. 4)

    If call structure reflects motivation (Morton 1977), then calls used in hostile contexts will have lower frequencies than those used in nonhostile contexts.

  5. 5)

    If calls reflect the physical size of an animal (Ey et al. (2007), then males will produce calls with lower fundamental frequencies and longer durations than females.

Methods

Study Site and Groups

We conducted the study in a natural forest fragment located on the Instituto Nacional de Pesquisas da Amazonas (INPA) Campus-II in the central-western area of Manaus, Amazonian Brazil (3′′05′S, 59′′59′W) (Fig. 1). The 20-ha fragment is composed of secondary terra firme forest, bordered by tarmacked roads. Part of the area (13 ha) is open to the public, with a main pathway laid through the fragment, while several INPA offices occupy the remainder. This tropical region experiences a dry season (June–November) and a wet season (December–May), with a mean annual rainfall of 2420 mm and mean annual temperature of 26.7°C (Alvares et al. 2013).

Fig. 1
figure 1

Map of the study site, a forest fragment in INPA Campus II, Manaus, Amazonian Brazil.

Two groups of Pithecia chrysocephala lived at the study site: group A with eight individuals (two adult females, two adult males, three subadults, and one female juvenile ca. 1 yr old) and group B with 10 (two adult females, three adult males, and five subadults). We defined adults as individuals >6 yr old, subadults as 4–5 yr old, juveniles as 1–3 yr old, and infants as 0–1 yr old (M. Take pers. comm.). Group A occupies a home range of 7.68 ha and group B 8.6 ha (Take 2017). Both groups consist of a breeding pair and their offspring. They are habituated to human presence as a result of general human visitation and previous primatological work (Take 2017).

Data Collection

JM collected data during the early dry season, from June 15 to August 1, 2018, using daily follows 06.00–11.00 h and 11.00–16.00 h, alternating each day. On 06.00–11.00 h shifts, we first attempted to locate the sleeping tree of the group studied on the 11.00–16.00 h shift the previous day. Once we located the group, we conducted observations in 10-min blocks, with 5-min breaks between blocks. For 11.00–16.00 h shifts we followed the groups until they had settled at a sleeping site and remained there for 30 min at rest. We conducted 18 shifts (ca. 90 h), equally distributed between morning and afternoon shifts. We noted the location of the sleeping site and used this as a starting point the next day. We alternated the groups followed each week (group A for 34.8 h, group B for 31.2 h). During observation blocks we recorded vocalizations continuously in WAV format using a ZOOM H4n recorder (16-bit res, sampling rate of 44 Hz) with a Yoga HT-81 microphone for unidirectional recording (frequency response 1–16 kHz) (Martin and Bateson 2007). During recordings we directed the microphone toward where most (>50%) of the group were located. We maintained a distance of 2–10 m from the subjects to avoid overloading the microphone or disturbing them.

At the beginning and end of each observation block, we took a scan sample of the behavior of all visible members of the group to determine the group’s main activity (i.e., the behavior that 50% or more of the group were performing; Altmann 1974). We used the categories of general (rest, locomotion, feeding/foraging, alert), social (affiliative, play, grooming), and conflict (food conflict and intragroup conflict) based on an ethogram in Buchanan (1978). We did not take scan samples if fewer than half the group were visible. We also noted any calls that could be attributed to a specific individual in the group, detailing their age group (adult/juvenile), sex, and the behavioral context in which the call was performed ad libitum during the blocks. Additional categories used to describe juvenile behavior included parent–offspring interactions, which included nursing and requesting (in which the juvenile called repeatedly until a member of the group attended to her, often by feeding or carrying). It was not always possible to associate a call with an individual or note its age and sex.

We did not conduct tests of intraobserver reliability, but JM collected all data within a short period.

Potential Limitations

Our results must be viewed with caution for four reasons. First, the adult vocalization data are pseudo-replicates, because we could not always distinguish individuals when recording data and we do not know how much each individual contributed to the sample. Second, as only one juvenile was present, this individual was the sole source for juvenile vocal data and so results for this aspect are preliminary. Third, we could not distinguish subadults from adults owing to similarities in body size and pelage color, so we describe them as adult/subadult. Finally, the study area is in an urban fragment, so we may not have observed full behavioral or vocal repertoires (e.g., owing to lack of predation or threats; Barnett et al. 2017).

Data Manipulation

Using the spectrograms, we chose only calls of high quality for measurement after removing those with excessive background interference. We also excluded overlapping calls. When we used sequences of calls, we considered each element as an individual call. We took measurements of duration, frequency of maximum energy (FME), lowest frequency, start frequency, and end frequency for each call type using Praat (V 6.0.5) (Boersma 2001) (Fig. 2). We used the fast Fourier transform method to generate spectrograms with a Gaussian window shape and a sampling frequency of 44,100 Hz. We used a view range of 7520,000 Hz. We tested these variables for normality using a Shapiro–Wilk test, and found that they had nonparametric distributions. We calculated the median and interquartile range (Q1–Q3) for each measurement across each call type and for each behavioral context in which a call occurred (e.g., median call duration during locomotion).

Fig. 2
figure 2

Measurements used to describe Pithecia chrysocephala vocalizations in Manaus, Brazil, between June and August 2018.

We grouped contexts into friendly (e.g., play, feeding, locomotion) and hostile (e.g., intragroup conflict, food conflict) intention for statistical comparison.

Statistical Analyses

We recorded 70 h of vocal data and used 1500 calls for analysis. We could distinguish sex for 70 calls (34 female calls, 36 male).

JM performed all statistical analyses using SPSS v.25. We grouped calls according to their structural differences in spectrograms, their duration, frequency of maximum energy, lowest frequency, start frequency, end frequency, and by ear. When naming calls, we used Henline’s (2007) categories for Pithecia pithecia calls where appropriate; otherwise we chose new names onomatopoetically. We used a cross-validated discriminant function analysis (DFA) to assess the reliability of the recorded vocal repertoires.

To determine the effect of behavioral context on call structure, we compared chuck call parameters across feeding/conflict contexts using a cross-validated DFA. We chose this call because it was the most evenly spread between contexts. We did not test juvenile calls for contextual differences, as sample sizes between contexts were not large enough for valid analysis. We used Mann–Whitney U tests to determine the effect of sex on call structures of whistles, chucks, and trills. We chose these calls because they were the only calls in which we identified both male and female callers.

Ethical Note

We obtained ethical clearance from Oxford Brookes University before beginning data collection. We also obtained additional permission from INPA to study Pithecia chrysocephala in the area, and all research complied with Brazilian law. While conducting this study we took care to adhere to the guidelines for the treatment of animals in research outlined by the Association for the Study of Animal Behaviour (Buchanan et al. 2012). We did not trap or handle animals, and as such the potential for zoonotic disease transmission was minimal. During daily follows we maintained minimum distance of 2 m from individuals to minimize stress for the animals, and we retreated if they showed excessive stress (e.g., emitting alarm calls while looking at the researcher). The authors declare that they have no conflict of interest.

Data Availability

The data used for this study are available from the authors upon request.

Results

Adult/Subadult Vocal Repertoire

We identified five structurally different adult/subadult call types (Fig. 3, Table I).

Fig. 3
figure 3

Spectrograms of calls by adult/subadult Pithecia chrysocephala in Manaus, Brazil, June–August 2018, including whistle(a), chuck(b), trill(c), soft growl(d), and multiple throat rattle(e) calls. We rarely heard soft growl calls and could not obtain clearer spectrograms.

Table I Acoustic measurements (medians and interquartile range) for five call types made by adult/subadult Pithecia chrysocephala in Manaus, Brazil, June–August 2018

Whistles were simple, unmodulated sounds.

Chucks were very short, sharp, and unmodulated calls.

Trills were long and often slightly modulated.

We heard the soft growl only four times over the data collection period and only in males. Soft growls were the least variable and lowest frequency calls in the adult/subadult repertoire.

The throat rattle was a low-frequency call, and the longest in the adult/subadult repertoire.

Calls were correctly classified 85% of the time by DFA with leave-one-out cross-validation (N = 5 call types and N = 1343 calls). Each call type was correctly classified >80% of the time (Table II, Fig. 4). All call parameters contributed significantly to the DFA (Duration: Wilk’s λ = 0.332, P < 0.001; FME: Wilk’s λ = 0.933, P < 0.001, lowest frequency: Wilk’s λ = 0.528, P < 0.001; start frequency: Wilk’s λ = 0.506, P <0.001; end frequency: Wilk’s λ = 0.674, P < 0.001).

Table II Cross-validated DFA results for call types in adult/subadult Pithecia chrysocephala in Manaus, Brazil, between June and August 2018
Fig. 4
figure 4

Distribution of discriminant scores for adult/subadult Pithecia chrysocephala call types along two canonical discriminant functions in Manaus, Brazil, June–August 2018.

Juvenile Vocal Repertoire

We identified six calls emitted by the juvenile (Fig. 5).

Fig. 5
figure 5

Spectrogram of calls by one juvenile Pithecia chrysocephala in Manaus, Brazil, June–August 2018, including whistle(a), chuck(b), trill(c), trew(d), peep(e), and cry(f) calls. We rarely heard trew calls, and so could not obtain clearer spectrograms.

Juvenile whistles and chucks were similar in structure to those of adults/subadults but always at a higher frequency (Table III).

Table III Acoustic measurements (median and interquartile range) of the five call types made by juvenile Pithecia chrysocephala in Manaus, Brazil, June–August 2018

The trill was similar to a lower frequency adult/subadult trill.

The trew was a short and low-frequency call.

Cries and peeps were both modulated high-frequency calls, with peeps increasing in frequency and cries increasing and curving back down on spectrograms.

Calls were correctly classified 54.8% of the time by DFA with leave-one-out cross-validation (N = 6 call types and N = 126 calls). Each call type was correctly classified >37% of the time, with the exception of trills, which were 0% correct (Table IV, Fig. 6). All call parameters contributed significantly to the DFA (Duration: Wilk’s λ = 0.754, P < 0.001; FME: Wilk’s λ = 0.401, P < 0.001, lowest frequency: Wilk’s λ = 0.449, P < 0.001; start frequency: Wilk’s λ = 0.489, P < 0.001; end frequency: Wilk’s λ = 0.394, P < 0.001).We identified six calls emitted by the juvenile (Fig. 5).

Table IV Cross-validated DFA results for call types in one juvenile Pithecia chrysocephala in Manaus, Brazil, June–August 2018
Fig. 6
figure 6

Distribution of the discriminant scores for call types in one juvenile Pithecia chrysocephala along two canonical discriminant functions in Manaus, Brazil, June–August 2018.

Context and Relationship to Call Structure

Both adult and juvenile calls varied in their context (Tables V and VI). Adult/subadult whistle calls were present in the widest variety of contexts of all adult call types and were the only calls used during grooming. Individuals used these calls alone but also in quick succession with other whistles in play or locomotion contexts, and with chucks during feeding. Adult/subadult chucks varied in their dominant frequency and context, with situations such as territorial conflicts with the other group eliciting higher dominant frequency and longer calls than in less distressing situations such as feeding. Individuals often used chuck calls in succession during play or as an alert (e.g., individuals frequently used them when researchers approached at the start of data collection). The juvenile used whistle and chuck calls in a similar way to adults/subadults, with the addition of when requesting help from adults/subadults (e.g., food or for adults to carry them) (Table V). The use of trills was much less variable across contexts when used by the juvenile compared to adults/subadults, appearing in only one context compared with five in adults/subadults. We typically heard soft growls before conflicts between groups, usually when the other group was nearby but out of the sakis’ sight. We visually confirmed the presence of the other group. Pithecia chrysocephala typically performed throat rattle calls several times in succession, decreasing in dominant frequency over time and often synchronized with other group members in combination with piloerection and branch-shaking displays. We observed soft growls and throat rattles exclusively during conflict with the other saki group. We observed juvenile trew, cry, and peep calls only in social contexts and in succession until the behavior observed ended. While peeps and cries were both used in request contexts, cries were emitted in more apparently distressing situations such as after falling from a tree or when the juvenile could not cross a gap between two trees unaided, whereas peeps were present for less apparently stressful requests, such as for food or to for adults to carry them.

Table V Call types and the contexts in which they were observed in adult/subadult Pithecia chrysocephala in Manaus, Brazil, June–August 2018
Table VI Call types and the contexts in which they were observed in juvenile Pithecia chrysocephala in Manaus, Brazil, June–August 2018

The structure of feeding and conflict chuck calls varied with context (Table VII). Call context was correctly classified 66.3% of the time by DFA with a leave-one-out cross-validation (N = 2 contexts and N = 163 calls). Each call type was correctly classified >62% of the time (Table VIII, Fig. 7). Only duration contributed significantly to the DFA (Wilk’s λ = 0.791, P < 0.001).

Table VII Acoustic measurements (mean and interquartile range) of chuck calls by adult/subadult Pithecia chrysocephala across contexts in Manaus, Brazil, June–August 2018
Table VIII Cross-validated DFA results for call contexts in adult/subadult Pithecia chrysocephala chuck calls in Manaus, Brazil, June–August 2018
Fig. 7
figure 7

Distribution of the discriminant scores for adult/subadult Pithecia chrysocephala chuck calls along the discriminant function (duration) in Manaus, Brazil, June–August 2018. Bars show the behavioral context, boxes show the interquartile range of call durations, and whiskers show the minimum and maximum durations.

The Relationship Between Call Structure and Sex

Call structure differed significantly for chucks in terms of duration (Mann–Whitney U test: U = 11.5, P = 0.02, N = 17) but not FME (U = 18, P = 0.09, N = 17), lowest frequency (U = 20.5, P = 0.156, N = 17), start frequency (U = 16.5, P = 0.07, N = 17), or end frequency (U = 22.5, P = 0.221, N = 17) (see Table IX for acoustic measurements). We found no significant sex differences in duration (U = 39, P = 0.487, N = 20), FME (U = 37, P = 0.396, N = 20), lowest frequency (U = 42.5, P = 0.67, N = 20), start frequency (U = 42.5, P = 0.671, N = 20), and end frequency (U = 43, P = 0.699, N = 20) in trills. Similarly, we found no significant sex differences in duration (U = 75, P = 0.643, N = 26), FME (U = 68.5, P = 0.425, N = 26), lowest frequency (U = 62, P = 0.257, N = 26), start frequency (U = 63.5, P = 0.291, N = 26), and end frequency (U = 67, P = 0.403, N = 26) in whistles.

Table IX Acoustic measurements (medians and interquartile ranges) for chuck, trill, and whistle calls for male and female adult/subadult P. chrysocephala in Manaus, Brazil, June–August 2018

Discussion

Adult/Subadult Vocal Repertoire

We identified five calls for adult/subadult Pithecia chrysocephala, which differs from the predicted 7–18 calls based on the work of Buchanan (1978) and Henline (2007) on P. pithecia, and Keiran (2012) on P. aequatorialis. This smaller repertoire is potentially due to the amount of gradation seen in whistle, chuck, and trill calls, which we could not objectively categorise into separate calls. In Henline (2007), chucks are split into chucks and churks. It is possible that Henline (2007) overlooked gradation between calls because of their smaller sample sizes, meaning that calls appearing as several groups rather than variations in one group. We also noted soft growls and throat rattles, two other calls recorded by Henline (2007) in the current study. Only one call from Henline (2007) was not present; the Z-trill, a low-frequency call used when an individual became separated from the group. This may not have been present during this study owing to masking from background noise or simply because individuals did not stray far from the group.

Comparing spectrograms, only chuck calls were visually similar in Pithecia pithecia and P. chrysocephala. Throat rattle and scream are similar in P. chrysocephala and P. aequatorialis, as are the juvenile cry and yip. The calls of P. aequatorialis are less visually similar to P. chrysocephala than P. pithecia, potentially reflecting the geographical and evolutionary distance between these species, with the range of P. aequatorialis being further from P. chrysocephala than that of P. pithecia (IUCN 2019).

Outside Pithecia species, similar repertoire sizes exist in Aotus species (4–6 calls) and species from the Callicebinae, (6–13), two groups with group sizes and structure similar to those of Pithecia (Andrew 1963; Cäser et al. 2012; Kantha et al. 2009; Kinzey et al. 1977; Moynihan 1964, 1966; Robinson 1979; ). This provides some support for the social complexity hypothesis. However, other groups with more complex social structures and larger groups also show similar repertoire sizes, such as uacaris, Cacajao spp. (9–12 calls) and brown woolly monkeys, Lagothrix lagothricha (6), suggesting that other factors may have influenced the evolution of call repertoire sizes in New World monkeys (Bezerra et al. 2010; Casamitjana 2002; Fontaine 1981). Another possibility is that these repertoires are similarly complex, but in different ways. For example, P. chrysocephala have smaller but more stable groups in terms of size and composition, while the larger Cacajao spp. groups show high fission–fusion dynamics and spend most time with a smaller subunit (7 individuals in Cacajao ouakary: Barnett et al. 2018). Increased vocal complexity may have evolved in P. chrysocephala to deal with the social complexity of interacting with the same individuals consistently over time, which could be comparable to the level of complexity Cacajao spp. face living in subunits (Freeberg et al. 2012).

Of the calls we recorded, throat rattles were the longest, while chucks were the shortest. In terms of frequencies, trills were highest and soft growls lowest. The structure of these calls and the contexts in which Pithecia chrysocephala uses them is consistent with the hypothesis that high-frequency calls are typically friendly, and low-frequency calls are hostile (Morton 1977). The long durations of throat rattle calls also agree with evidence that longer calls are used in high-stress situations (Briefer 2012). Individuals did not use soft growls when in direct visual contact with other groups, and so this call may function to maintain the spacing between the two groups at a distance, similar to the loud calls of Callicebinae spp. (Robinson 1981).

Among the calls, whistles, chucks, and trills were present in a wide number of contexts, while throat rattles and soft growls were used specifically in intragroup conflicts. whistles, chucks, and trills were also the only calls to show gradation, which may be associated with their more variable use, allowing communication of more detail across differing contexts (Green 1975).

Juvenile Vocal Repertoire

The juvenile repertoire of Pithecia chrysocephala comprised six calls. No juvenile calls are described by Buchanan (1978), Henline (2007), or Keiran (2012), so we did not base juvenile call names and groupings on previous works. Similar to adult P. chrysocephala, juveniles performed graded chucks and whistles across a variety of contexts. Juvenile trills are split into two calls: trills resemble the adult version, and trews are used exclusively during nursing. The juvenile trill did not appear in several contexts, unlike in the adult version, possibly as the establishment of this call in their repertoire had not advanced as much as whistles and chucks by the stage of development the juvenile had reached when we made the recordings (e.g., Snowdon and Elowson 2001). Juvenile equivalents of adult whistles were longer and higher in frequency than adult calls, while chucks were shorter and higher and trills shorter and lower than adult calls. The higher frequencies are likely due to the smaller body size of juveniles (e.g., Hauser 1993). The lower frequency of trills compared to those in adults is not as predicted, but could be due to the trill calls becoming more distinct from structurally similar trews, which are low-frequency calls.

The only juvenile exclusive calls we recorded were the trew, peep, and cry vocalizations. Trew calls were low and repetitive, similar to a purr and used only while nursing. Cacajao calvus, Leontopithecus rosalia, and mantled howler monkeys (Alouatta palliata) display similar calls (Carpenter 1934; Fontaine 1981; Kleiman et al. 1988). In these cases, the purr-like vocalizations are also present in a parental care context, possibly functioning to solicit nursing from the mother or communicate contentment (Bradshaw and Cameron-Beaumont 2000; Leyhausen 1979). Peeps were the highest frequency calls in the juvenile repertoire and are present in resting, social, and request contexts. Similar to adult trills, the highest frequency call is present only in friendly contexts (Morton 1977). The structure of the juvenile cry is in line with the finding that infant distress calls are often a “chevron” shape on spectrograms, which is common across multiple mammal species (Lingle et al. 2012). However, cry calls are not as high frequency as expected for a distress call, potentially because the call must be low enough to travel when from separated infants to the group.

The Relationship Between Context and Call Structure

There was a structural difference in chuck calls between feeding and hostile contexts. However, this occurred only in terms of call duration. This finding did not support our prediction that more hostile calls would be lower in frequency than friendly ones (Morton 1977). The higher stress level of the situation could explain the higher than expected call frequency during hostile contexts (e.g., Linhart et al., 2015). The chuck call in Pithecia chrysocephala is similar in use to the tchó call in golden-backed uacaris, Cacajao ouakary, which also occurred at higher than expected frequencies in hostile contexts (Bezerra et al. 2010). The authors proposed that this finding was due to the call being used to communicate appeasement rather than hostility in conflict situations. If this is the case for chuck calls, then they support the motivational–structural rule.

The Relationship Between Call Structure and Sex

Call structures for whistle and trill calls did not differ across sex, contra our prediction that males would produce longer calls with lower fundamental frequencies than females do. We found partial support for our predictions for chuck calls, which were significantly different in duration between sexes, but not across other parameters. This similarity in calls between sexes may be due to the sex difference in body mass being only mild to moderate (Norconk 2011). However, other primate species with little size dimorphism display differences in call structure, such as Leontopithecus rosalia, and Callithrix jacchus (Benz et al.1990; Norcross et al. 1999). One explanation for this could be that these species show internal differences which alter call structure, as is the case for L. rosalia (Hershkovtiz 1977). Another possibility is that structural differences do exist for Pithecia chrysocephala, but in parameters we did not measure, such as frequency range and highest frequency.

Potential Applications

Our findings have several potential applications. Monitoring the use of chuck calls in captive groups could act as a means of assessing stress levels, as this is primarily an alert/conflict call. Similar to those in captive Sapajus apella, higher frequency chucks and more frequent use could indicate higher stress levels (Jacobsen et al. 2010). Vocalizations are also applicable as enrichment, as carried out with Callithrix jacchus (Watson 2014), to encourage more positive behaviors. In this case, playing a group’s whistle and trill calls back to them could support behaviors such as grooming and play. However, excessive grooming is undesirable, so playback should be brief. Additional research on the use of whistles and trills across contexts is necessary, as using the wrong calls could cause distress. Soft growl calls from another group could also be used as enrichment to encourage natural behaviors such as territorial defense (e.g., Farmer 2011). While these calls are less ambiguous than whistles and trills, their use is for intergroup encounters and could increase stress levels.

The IUCN recommend a new red list assessment for Pithecia chrysocephala since its classification as a full species (Mittermeier and Roosmalen 2015). The population trends of this species are currently unknown, and while currently considered Least Concern they occupy an area experiencing rapid deforestation (INPE 2016). Knowledge of vocalizations could be used to conduct population censuses of this species without relying on visual contact. This could be highly beneficial, as sakis can be difficult to follow visually in the wild (Pinto et al. 2013). Censuses could also use playbacks of calls, but, as this species is territorial, hearing another group’s calls could cause distress.

Future Studies and Conclusion

Our study provides information previously unknown on the vocalization of Pithecia chrysocephala. Our findings offer a foundation for future studies into the vocalizations and behavior of P. chrysocephala, such as investigations into vocal development and further examination of the association between context and call structure, as well as more data for comparisons of vocal behavior across the Pitheciidae. There is also potential for using our results in captive welfare and field research on P. chrysocephala, potentially to monitor stress levels, enrich the captive environment, or make censuses of these cryptic animals easier.