Introduction

Several fish species are known to emit sounds in different social contexts, in particular during agonistic interactions and courtship (Ladich and Myrberg 2006; Myrberg and Lugli 2006). These sounds demonstrate a large inter- and intra-specific diversity, and one particular interest has been to study the potential function of these acoustic signals in communication (Amorim 2006). In cichlids, inter-specific differences found among closely related species emphasize the important role acoustic signals may have played in the spectacular rate of speciation witnessed in this group (Verzijden et al. 2010). Amorim et al. (2004, 2008) showed that courtship calls of five closely related species of cichlids differed in some temporal features, that is, number of pulses, pulse period and duration, and in peak frequency. Thus, acoustic signals could play a role in species recognition.

Previous studies also suggest that sounds of fish include individual characteristics that can enable sender recognition and assessment at the intra-specific level (Amorim et al. 2010). One of the most widespread differences is an inverse relationship between the frequency of sound and the size of the emitter (e.g. Ladich et al. 1992; Crawford et al. 1997; De Jong et al. 2007; Amorim et al. 2008; Colleye et al. 2009; Bertucci et al. 2012). Since bigger individuals are usually dominant and territorial, sound characteristics may be used by a receiver to assess the size of a potential rival. For example, beside the fact that sounds increased the chance of winning a fight in croaking gouramis (Trichopsis vittata) (Ladich et al. 1992), some size-related acoustic features may predict the outcome of a fight, that is, winners produce sounds with a lower dominant frequency and a higher sound pressure level (Ladich 1998). Acoustic signals could therefore encode social status and/or motivation. Moreover, in some species, sounds also show individual characteristics. Most examples come from Batrachoidids in which several studies have reported individuality of male courtship calls (e.g. Edds-Walton et al. 2002; Amorim and Vasconcelos 2008) and in male aggressive sounds (e.g. Thorson and Fine 2002).

If sounds seem to carry a wide range of information from the specific to the individual level, we can then ask whether sounds have context-specific characteristics. In birds and mammals, sounds of aggressive interactions are usually low in frequency and short, whereas courtship or affiliative sounds are purer and of higher frequency (Morton 1977). These differences seem to exist in fish. In the midshipman (Porichthys notatus), male advertisement calls directed to females are long and tonal while aggressive sounds are short and broadband (Bass and McKibben 2003). More recently, Parmentier et al. (2010) described the acoustic repertoire of a damselfish (Dacyllus flavicaudus). Sounds could be classified in three categories, that is, fighting sounds, mating sounds and chasing sounds. Differences between signals were found in the number of pulses, the pulse period and their relative intensities. For example, sounds associated with aggressive behaviours had a smaller number of pulses than courtship sounds. In cichlids, several studies have reported context-related differences. In Oreochromis mossambicus, pulse period and number of pulses of the male courtship sounds are significantly different depending on the behaviours they are associated with (Amorim et al. 2003). The structure of these sounds is also affected by recent social experience of males as winners produced more sounds with longer pulses and lower peak frequencies than losers (Amorim and Almada 2005).

One way of studying the function of sounds and the relevance of different sound characteristics in signal recognition is to use playback experiments. Whereas playbacks have been widely used in many vertebrates (McGregor 1992), only a small number of such experiments have been performed in vocal fish. One possible limitation to the use of this technique would be the ability of current loudspeakers to accurately broadcast sounds with low frequencies (Ladich and Myrberg 2006). However, early experiments showed that the responsiveness to conspecific sounds was higher than to heterospecific sounds (e.g. Delco 1960; Myrberg and Spires 1972). In damselfishes, in the oyster toadfish and in gobies, playback experiments have revealed that male sounds attract females and may provoke courtship displays in neighbours (Winn 1967, 1972; Gerald 1971; Ibara et al. 1983; Lugli et al. 1996; Rollo and Higgs 2008). In damselfish, a series of experiments have shown that the higher response to conspecific sounds depends on the number of pulses and pulse rate of ‘chirps’ (Myrberg et al. 1978; Spanier 1979). Moreover, playbacks of large male are more attractive than playbacks of small males (Myrberg et al. 1986; McKibben and Bass 1998) and may play a role in male–male interaction as a ‘nesting male present’ signal (McKibben and Bass 1998). In a subsequent study, McKibben and Bass (2001) were interested in the temporal pattern of sounds emitted by male P. notatus and used playbacks in choice test experiments in which sounds were altered in terms of pulse duration, the gap between pulses and beat of sounds. Playbacks to gravid females revealed that longer pulses and greater pulse-to-gap ratios resulted in more attractive signals and that pure tones were more attractive than beats. Overall, these playback experiments show that temporal pattern of sounds seems to be important in fish acoustic communication.

In cichlids, Verzijden et al. (2010) showed that female Pundamilia nyererei prefer conspecific males associated with playbacks of courtship sounds, and sounds emitted during aggressive interactions reduce the level of aggressiveness witnessed during a fight between males of the Malawian cichlid Metriaclima zebra (Bertucci et al. 2010). However, Bertucci et al. (2010) found no behavioural response to playbacks of sounds when they were not associated with other (visual) stimuli. The same result was found in male gobies exposed to aggressive sounds (Lugli 1997). Given this tight interaction of acoustic cues with other modalities in fish (Lugli 1997; Lugli et al. 2004; Ladich and Myrberg 2006), the results of previous experiments may be confusing. Furthermore, playback experiments focusing on the role of sounds at the intra-specific level remain rare (Stout 1963; Schwarz 1974; Rigley and Muir 1979). Hence, no study comparable to McKibben and Bass (2001) has been performed to investigate the role of, for example, temporal structure of sounds in cichlids.

Metriaclima (formerly Pseudotropheus) zebra is a rock-dwelling cichlid from Lake Malawi. Males produce sounds while defending territories against other males and while courting females (Amorim et al. 2004; Simões et al. 2008). During agonistic postures like quivering or lateral displays, males produce low-frequency sounds consisting in a train of short pulses. Simões et al. (2008) highlighted differences between sounds produced by males and females and differences in characteristics of males’ sounds emitted during agonistic interactions and courtship, as agonistic sounds were longer in duration and pulse period than courtship sounds.

In this study, we attempt to set-up an apparatus allowing us to elicit a behavioural response to acoustic playback, with no other associated stimuli, in male cichlid M. zebra. In order to test whether the observed behavioural response of males could be modulated, we then tested the relevance of acoustic cues by modifying two temporal characteristics that could be good candidates for encoding information, that is, the number of pulses and the sound duration, and broadcasting these modified stimuli to territorial males. We aimed to determine the possible function of temporal variability of sounds in coding aggressiveness in this species.

Materials and methods

Fish

Metriaclima zebra were purchased from N’Guyen International (Kingersheim, France) and stored for 2 months on a 12-h light/dark cycle, in mixed sex groups, in holding tanks (60 × 120 × 50 cm) containing 8–10 individuals with a male/female sex ratio of 1:2. They were 4-years old and sexually mature at the time of the experiment. Each tank was equipped with an external filter (Rena Filstar xP3, Rena France, Annecy, France), an aeration device, sand substrate, terracotta pots and bricks as shelters. The temperature was maintained at 25 ± 2 °C by an internal heater (RenaCal 200, Rena France, Annecy, France) and the pH remained at 8.0. Fish were fed daily with commercial cichlid food (JBL NovoRift, JBL GmbH & Co. KG, Neuhofen, Germany). Once a week, this diet was complemented with a frozen mixture of mussels, shrimps and spinach.

Twelve male individuals were involved in the experiments (mean ± SE total length—from the tip of the head to the tip of the caudal fin—of 137.08 ± 4.47 mm, standard length—from the tip of the head to the caudal peduncle—of 112.17 ± 5.48 mm, and a weight of 34.30 ± 1.66 g). The fish were identified by the number and the position of egg-spots on their anal fin, any marks on their body, in combination with VIE tags (Visible Implant Elastomer, Northwest Marine Technology, Shaw Island, WA, USA) implanted under the skin.

Experimental set-up

The experiment was conducted in an aquarium (60 × 30 × 30 cm) placed on a vibration-insulated shelf and located in an acoustically insulated room to reduce background noise. Additionally, three walls of the aquarium were covered with bubble wrap in order to reduce sound reverberation within the aquarium. The experimental tank contained a filter, an aeration device, an internal heater, a sand substrate and a terracotta pot in the middle in order to provide a shelter for the fish. During the experimental sessions, the filter and the aeration device were switched off to allow recordings of fish sound. On one side of the aquarium, we placed an underwater loudspeaker (University sound, Electrovoice, UW–30) (Fig. 1) connected to an amplifier (Denon PMA–100 M) and a Marantz PMD670 solid-state recorder.

Fig. 1
figure 1

Schematic representation of the experimental set-up (not to scale). H hydrophone, Ls loudspeaker, PB playback, S shelter. Grey dashed line represents the middle of the aquarium

A video camera (BUL520, Active Media Concept, Vallauris, France) was placed in front of the apparatus in order to record behaviour during trials. To record acoustic signals, a hydrophone (Aquarian Audio Products H2a-XLR, sensitivity: −180 dB re 1 V/μPa, flat frequency response: ±4 dB in the range 20–4.5 kHz) was placed in the centre of the aquarium, that is, above the shelter. The hydrophone was connected to a preamplifier (Yamaha MLA8, Yamaha Music France, Marne-la-Vallée, France) and a video capture card (Osprey-450e) of a PC, which synchronized audio and video signals.

In order to broadcast sounds at a ‘natural’ level, we placed the loudspeaker at a distance from the hydrophone corresponding to the distance at which the sounds were originally recorded. We then adjusted the intensity of playbacks to get the same level of recording. We confirmed that this apparatus did not alter played back sounds in a previous experiment (Bertucci et al. 2010) by comparing two features of the sound, that is, mean frequency and pulse period: these sound features did not differ before and after a playback (see Online Resource 1 for results of the comparison tests and a figure showing oscillograms of a recorded sound and of the same sound after transmission through the broadcasting apparatus).

Sounds to be used as acoustic stimuli were obtained from recordings made in our laboratory. Two unfamiliar size-matched males were introduced into adjacent tanks, separated by opaque partitions and were given 24 h to acclimatize and become territorial. A hydrophone was placed in each tank, between the shelter and the walls separating the aquaria, that is, where individuals would spend most of their time displaying; details of the hydrophone and its connections were as same as for the recording of the stimulus sounds. At the start of a recording session, the opaque partition was removed, allowing the two fish to interact visually for 20 min before the partition was replaced. We performed a maximum of three recording sessions per male to collect a sufficient number of sounds. We only considered sounds consisting of more than two pulses and recorded within a distance of 1–2 body lengths from hydrophones. Sounds recorded from four individuals with a mean ± SE total length of 115.00 ± 7.99 mm, a standard length of 98.00 ± 6.38 mm and a weight of 25.20 ± 4.25 g were used and digitized at 44.1 kHz (16-bit resolution). None of the recorded fish were involved in the present experiment, and none of the fish tested had experience with any of the individuals from which the sounds were recorded.

Analysis of behavioural responses

Behavioural data were collected from the videos recorded during the trials. For each period in the two sessions (intruder and resident), we quantified the number of aggressive behaviours performed, that is, lateral display, quiver, sound production and bite attempts, using the behavioural transcription software EthoLog 2.2.5 (Ottoni 1995–1999). We also quantified behaviours defining space occupancy, that is, time spent swimming (s) in the zone of the loudspeaker, which was one half of the aquarium, number of territory maintenance behaviours (when the fish was moving–digging—sand within the aquarium while becoming territorial, for example Oliveira and Almada 1996; Simões et al. 2008), number of times that the tested fish visited the shelter and number of times it went close (less than one body length) to the loudspeaker.

We analysed the number of lateral displays, quivers, sounds produced, bite attempts, the number of territory maintenance acts, visits to the shelter, number of positions close to the loudspeaker and time spent in the zone of the loudspeaker by means of either a repeated measures analysis of variance when the data were normally distributed or a Friedman analysis of variance if this was not the case. The analysis was performed using STATISTICA 6.0 (Statsoft Inc 2004).

Control experiment: how do male M. zebra respond to playback of conspecific sounds?

The first step of our study was to set-up an efficient playback apparatus that would allow us to study the behavioural response of M. zebra to played back signals. As we were also interested in the effect of social status on the behavioural response to playback, each fish was tested in the experimental tank in two successive playback sessions, separated by 24 h of rest. This apparatus allows testing fish in two different social conditions. During the first session, the fish was considered as an intruder, whereas it was considered as resident during the second session.

The test fish was introduced to the experimental set-up and was given 10–15 min to acclimatize. The first session was then divided into three periods of 10 min. During the first 10 min, no sounds were played back (pre-playback control). At the end of this first period, we started to play back aggressive sounds to the subject every time it entered in the half of the aquarium containing the loudspeaker (playback period, see Fig. 1). The fish then received no playback for 10 min, until the end of the first session (post-playback period). We then switched the electric devices back on—filter and aeration—and left the fish in the tank for 24 h, so that they could become resident. During the second session, we repeated the procedure with the 10 min pre-playback, 10 min playback and 10 min post-playback periods playing back the same sounds as the day before. At the end of the trial, the fish was returned to its home tank and 2/3 of the water in the test aquarium were renewed and allowed to stand for 24 h before the introduction of a new fish.

Experiment with modified signals: do modifications of sound temporal pattern alter the behavioural response to playbacks?

In a second step, we wanted to test whether modifications of a sound’s temporal pattern could elicit differential behavioural responses in the tested fish. Sounds of M. zebra consist of trains of a variable number of short and low-frequency pulses. Male sounds show a range of variation in the period of pulses from 40 to 180 ms and sound duration varies from 250 ms to 1 s with 5–15 pulses per sound (Simões et al. 2008; Bertucci et al. 2012). We altered temporal features of these sounds using the PRAAT software version 5.0.35 (Boersma and Weenink 1992–2008) (Fig. 2). To test the significance of the pulse period, we built sounds for which we retained the number of pulses of the original sounds (Fig. 2a), but modified pulse period corresponding either to twice the maximum peak-to-peak distance observed for each individual (PP Max, Fig. 2b) or to half the minimum peak-to-peak distance (PP Min, Fig. 2c). These two modifications, respectively, resulted in a series of slow sounds and fast sounds. To test the significance of the number of pulses, we built sounds with no alteration of the pulse period but we either deleted the second half of the original sounds (half pulses, Fig. 2d) or pasted a copy of the original sound at its end (double pulses, Fig. 2e). For the latter modification, the two parts were separated by the mean peak-to-peak interval of the original sounds. These two modifications, respectively, resulted in a series of short sounds and long sounds.

Fig. 2
figure 2

Temporal modifications of male M. zebra sounds recorded during aggressive interactions. a Oscillogram of an unmodified control sound. b Oscillogram of the same sound with an increased pulse period (PP), resulting in a slow sound (PP Max). c Oscillogram of a sound with a decreased pulse period, resulting in a fast sound (PP Min). d Oscillogram of a sound with only the first half of pulses resulting in a short sound (half pulses). e Oscillogram of a sound with a double number of pulses, resulting in a long sound (double pulses). Oscillograms are shown to scale

Prior to the experiment, fish were put into the aquarium for 24 h to let them become resident. Resident fish were then tested with playbacks of modified sounds following the same procedure as previously described for the control experiment. We used a repeated measures design with five different treatments corresponding to the five categories of sounds that we created, that is, normal sounds, PP Max, PP Min, Double pulses and Half pulses. We tested two fish a day for six consecutive days over a period of 5 weeks. Each individual thus received a new treatment every 7 days in a balanced order to avoid order effects. Since we broadcast sounds when a fish was located in the half part of the aquarium containing the loudspeaker, the number of playbacks received by each individual depended on its motivation to approach the loudspeaker, which explains why the number of stimulations varied between individuals and treatments (Table 1).

Table 1 Number and mean values of playbacks received by each individual in the five different treatments

Results

How do male M. zebra respond to playback of conspecific sounds?

Numbers of aggressive displays (lateral displays, quivers and sounds) were too low to be analysed.

We found significant differences in the number of territory maintenance behaviours (Friedman ANOVA, n = 12, F 4,44 = 39.59, P < 10−3) with no difference in intruders and a significant increase of this number in residents during the playback and after the playback as revealed by a Fisher’s least significant difference (LSD) post hoc test (Fig. 3a). Residents visited significantly less the shelter than intruders after the playback (Friedman ANOVA, n = 12, F 4,44 = 16.31, P = 0.006) (Fig. 3b). Residents came close to the loudspeaker significantly more often after the playback than intruders during the same period (Friedman ANOVA, n = 12, F 4,44 = 16.26, P = 0.006, followed by a Fisher’s LSD post hoc test) (Fig. 3c). We also found that residents tended to spend more time in the zone of the loudspeaker than intruders after the playback (Friedman ANOVA, n = 12, F 4,44 = 10.58, P = 0.06).

Fig. 3
figure 3

Behavioural responses to playback of intruders and residents, during the three successive observation periods. Pre-PB:10 min before playback, PB: playback period (duration: 10 min; sound played back each time the fish approached the loudspeaker, Post-PB: 10 min after playback. a Number of territory maintenance behaviours, b number of visits in the shelter, c number of positions close to the loudspeaker. The boxes represent the first and third quartiles, points (filled circle) are the median (second quartile) and whiskers correspond to the range (min–max) of the distributions. Different letters indicate significant differences (P < 0.05, Fisher post hoc test)

In summary, playback of conspecific sounds does not alter the behaviour of fish that have just entered the experimental set-up (intruders), whereas it elicits a territorial response by fish that have been present for 24 h (residents). This territorial response is well characterized by territory maintenance behaviour.

Do modifications of the temporal pattern of sounds alter the response?

For the territory maintenance activity, we found a significant effect of temporal modifications in the post-playback period (Repeated measures ANOVA, n = 11, F 4,40 = 3.15, P = 0.02), with a higher number of territory maintenance acts in response to sounds with a high pulse period (PP Max) compared to other modifications as revealed by a Fisher’ LSD post hoc test. However, none of the treatments differed from the control playback (Table 2; Fig. 4).

Table 2 Mean ± SE values of the different variables analysed for each period (Pre-PB, PB and post-PB) and for each sound modification
Fig. 4
figure 4

Behavioural response (number of territory maintenance behaviours) elicited by sounds with modified time pattern. Fish are tested when resident (i.e. after 24 h in the aquarium, see text for details). Pre-PB: 10 min before playback, PB: playback period (duration: 10 min; sound played back each time the fish approached the loudspeaker, Post-PB: 10 min after playback. Different letters indicate significant differences (P < 0.05, Fisher post hoc test)

We found no differences between modifications for the number of visits in the shelter (repeated measures ANOVA, n = 11, F 4,40 = 0.96, P = 0.44), the number of positions close to the loudspeaker (repeated measures ANOVA, n = 11, F 4,40 = 0.79, P = 0.54) or the time spent in the zone of the loudspeaker (s) (Repeated measures ANOVA, n = 11, F 4,40 = 1.12, P = 0.36) (Table 2).

Discussion

This study used playbacks of sounds produced by males during agonistic interactions in order to investigate the role of these signals in M. zebra. Playbacks have been successfully applied when investigating the biological function of advertisement sounds of insects, frogs, birds or mammals (McGregor 1992). In comparison, however, most of the studies using playback experiments in fishes have been unsuccessful, leading to confusing results. Such studies therefore remain rare. One explanation comes from the close-range feature of fish sounds and the requirement of additional stimuli to elicit a behavioural response (Stout 1963; Schwarz 1974; Rigley and Muir 1979; Ladich 1997; Bertucci et al. 2010).

Control experiment: only resident males respond to acoustic playbacks

We showed in the first part of this experiment that acoustic playbacks of sounds could trigger a behavioural response in residents, with an increased number of maintenance acts (or digging) during and after the playback periods. Substrate plays an important social role in cichlids especially during the breeding season, when males build nests, that is, a depression in the substrate, representing their territory in which females spawn (Oliveira and Almada 1996; Galhardo et al. 2008, 2009). Hence, our results suggest that aggressive sounds are relevant to a resident (territorial) male and lead him to defend or at least advertise his territory against a potential competitor. While the increased number of territory maintenance behaviours may also be due to the fact that fish became habituated to the experimental tank and start exhibit some behaviours, the occurrence of this response during and after the playbacks clearly indicates that sounds are the source of this reaction. Aggressive sounds might then play the role of a ‘territorial male present’ signal in resident individuals, similar to the ‘nesting male present’ signal proposed by McKibben and Bass (1998) in the midshipman. At the same time, males reduced their visits in their shelter and came more often close to the loudspeaker. Such phonotaxic responding is comparable to that found in previous experiments conducted in fishes (e.g. Myrberg and Spires 1972; Winn 1972; Ibara et al. 1983). Thus, besides attracting mates or being used during fights (Simões et al. 2008), acoustic signals of cichlids may be used by territorial males to detect potential rivals interacting with conspecifics in their vicinity.

This response is robust against modifications of the temporal pattern of sounds

In contrast to previous findings that the number of pulses and pulse rate were important sound characteristics in attracting a mate or neighbour’s attention and for species discrimination in damselfishes (Myrberg et al. 1978; Ibara et al. 1983; Amorim 2006), most of the modifications of temporal features here had little or no effect on the behaviour of resident male M. zebra. Only a massive increase in the pulse period (PP Max signal) resulted in a stronger behavioural response implying a bigger threat in the receiver. A longer pulse period is generally associated with agonistic interactions in M. zebra (Simões et al. 2008) and with larger males (Bertucci et al. 2012) and might thus promote the observed reaction.

If the total sound duration was responsible for the observed response to playbacks, we would have expected the double pulse signal to show the same or a greater effect as PPmax sounds on the number of territory maintenance behaviours. This was not the case, which suggests that the increased pulse period, not the total sound duration, may be the cause of the increased territorial activity induced by the PP Max signal. Increasing the sound duration by playing the same sound twice successively not only creates a twofold longer sound, but also alters the sound envelope. Indeed, most cichlid sounds start with pulses of low amplitude; pulse amplitude then increases and declines at the end (e.g. Lobel 2001; Amorim et al. 2008). Repetition of the same sound twice may affect the dynamic pattern of pulses. Pulse amplitude would therefore be a good candidate parameter to be modified in further playback experiments.

Apart from the PPMax modification, and in relation to the previous point, playbacks of only the first half of sounds provided the same results as fast sounds (PPMin) and long sounds (Double pulses), with a decreased number of territory maintenance behaviours. This decrease was not significantly different from the control, but it suggests that during the first half of a sound (which usually corresponds to an increase in pulse amplitude), a fast sound or a long sound do not provide the same amount or quality of information regarding territorial behaviour as unmodified sounds. The question whether information is carried by the entire sound or in other (decreasing) parts of the sound thus deserves more investigation.

Only two temporal acoustic features were considered here, and we cannot exclude the possibility that the key characteristics of our stimuli were preserved despite our drastic modifications and that other cues might also be involved in the encoding process affecting, in particular, the phonotaxic response. Sound pressure level and dominant frequency of sounds may play an especially important role in conspecifics’ assessment (Myrberg et al. 1986; Ladich 1998; McKibben and Bass 1998) and deserve further investigations. On present data, the system seems to be able to sustain and tolerate a wide range of variations in term of pulse period and sound duration before the behavioural response of receivers is altered.

The association of the territorial and the phonotaxic response found in the control experiment validates the efficiency and reliability of our results and represents one of the first convincing playback experiments conducted in a cichlid fish. We also provide good evidence of the important role of acoustic communication at the intra-specific level in M. zebra.

There is a major gap in the fish literature compared to insects and anurans in particular. While our drastic modifications may have resulted in some supernormal stimuli, that is, very slow signals or unnatural stimuli, that is, with a modified amplitude envelope, the results of the second part of our study have to be considered as a first attempt at deciphering the encoding mechanisms of this fish. Numerous studies have shown that fish auditory system is well suited for temporal processing (e.g. Suzuki et al. 2002; Wysocki and Ladich 2003; Vasconcelos et al. 2011). However, to our knowledge, no neurophysiological studies have been performed to know how pulse interval and other temporal features are represented in the auditory system of this species. Further playback experiments focusing on other acoustic cues like frequency, amplitude of pulses or spectral properties of sounds, in association with neurophysiological experiments, will be necessary to understand more thoroughly information processing in fish acoustic communication.