Introduction

The cognitive neuroscience of aging is a growing disciplinary area that studies the factors and mechanisms that affect aging at neuroanatomic and cognitive levels (Diaz and Pereiro 2018). Cognitive changes are a consequence of aging, a period of life when several cognitive functions are affected. Normal age-related declines have been reported in different cognitive functions, such as attention, processing speed, working memory, spatial skills, and reasoning (Erel and Levy 2016; Grandi and Tirapu-Ustárroz 2017; Hedden and Gabrieli 2004; Salthouse 2010), among others. A cognitive function that received special interest is memory, defined as the capacity that allows the storage of new information for later restitution or evocation with a specific goal (Alberini and LeDoux 2013; McGaugh 2002). Short-term memory (Rhodes and Katz 2017), working memory (Rieckmann et al. 2017), and long-term episodic memory (Fjell et al. 2016) are the memory systems most affected by aging.

However, the aging process and its consequent cognitive decline show inter-individual differences (Fauvel et al. 2014). Hence, the interest in brain plasticity during adulthood and the environmental factors that could affect aging has increased, with special emphasis on the factors that impact on cognitive reserve, i.e., the compensatory neurocognitive mechanisms that attempt to maximize the performance against brain damage or deterioration through the use of structures or non-compromised brain networks (Stern 2009; Stern et al. 2018). Several investigations have identified specific protective factors that benefit cognition in old age, such as physical activity (Erickson et al. 2011), educational level (Evans et al. 1993), type and level of occupation (Adam et al. 2013), intellectual skills, social interactions, and lifestyle activities (Seinfeld et al. 2013; Wang et al. 2012). Some of these protective factors have been associated with a lower risk of dementia (Hall et al. 2009; Verghese et al. 2003; Wilson et al. 2002).

Music has also been identified as a protective factor (Kraus and Chandrasekaran 2010; Skoe and Kraus 2012). Studies suggest that long-term music learning and some musical activities promote healthy cognitive development throughout lifetime (Hanna-Pladdy and Gajewski 2012; Strong and Mast 2018; Thaut and Hoemberg 2014) and that they could mitigate the cognitive deterioration that emerges in the aging process (Wang and Schlaug 2010), influencing the neuroendocrine and autonomic nervous system (Chanda and Levitin 2013). In addition, music activates the brain extensively, engaging temporal, frontal, parietal, and limbic regions (Benz et al. 2016; Justel and Diaz 2012; Zatorre et al. 2007; Zhao et al. 2017; Zuk et al. 2014). Nevertheless, music is a complex phenomenon and the proposals that emerge from it are innumerable (Peretz 2006). For this reason, recent research has specialized in identifying the specific effect of music-based interventions by isolating its musical components (Särkämö 2018; Särkämö et al. 2016). In this framework, those interventions that involve active (e.g., music playing, singing, dancing; Schneider et al. 2018) and receptive proposals (e.g., music listening; Särkämö et al. 2008) have been studied. One benefit of these interventions is that they are flexible enough to be employed regardless of the participants’ formal knowledge of music (Diaz Abrahan et al. 2019).

By linking music and memory, different lines of research have investigated the efficacy of acute or focal music-based interventions, focusing either on perceptive or receptive activities (Ferreri et al. 2013; Ferreri and Rodriguez-Fornells 2017; Justel and Rubinstein 2013; Justel et al. 2016; Ratovohery et al. 2018). For example, Judde and Rickard (2010) had adults listen to music for 3 min (with positive or negative valence, experimental groups) or white noise (control group) after the acquisition of verbal information. Their results indicated that both experimental groups had better deferred memory performance than the control group. Similar results were found by Justel et al. (2015). On the other hand, studies on music-based interventions that involve music-making are scarce.

Divergent thinking, the ability to generate ideas, was significantly associated with episodic memory (Addis et al. 2016; Madore et al. 2019). This relationship is justified by the associative cognitive mechanisms involved in the divergent thinking task. In the music field, music improvisation, the ability to generate music, also showed this type of association (Limb and Braun 2008). Indeed, creative musical tasks involve decision-making processes and activates several cortical and subcortical areas, such as the premotor cortex, supplementary motor areas, language areas, insula, cingulate cortex, medial prefrontal cortex, and temporal areas (Beaty 2015; Engel and Keller 2011; Loui 2018; McPherson et al. 2014).

The cognitive associative demands involved in the creation of music engage memory and other cognitive functions. Our research team has focused on this line of research by investigating the effect of a creative musical activity. We employed musical improvisation, from a music-therapy perspective, as a cognitive modulator (Diaz Abrahan et al. 2019). In this study, older adults watched neutral and emotional images and were then randomly exposed to one of three different interventions: a free musical improvisation, the imitation of a rhythmic pattern, or a period when the participants remained silent. The findings indicated that the participants in the improvisation group remembered more neutral and emotional information than the other groups. Similar results were found with young adults (Diaz Abrahan et al. 2018, 2019). However, not all types of memory were evaluated under this approach, and how musical interventions would impact verbal learning is still unknown.

Finally, the samples are frequently divided depending on their formal knowledge of music, since it is known that music learning and practice generate a brain adaptation that affects the cognitive performance (Justel and Diaz 2012). Long-term musical practice seems to prevent age-related decline in higher-order cognitive functions such as non-verbal memory, naming, and executive processes (Hanna-Pladdy and MacKay 2011). However, the results are controversial since some studies found positive associations and correlations between music and cognition (Franklin et al. 2008; Hanna-Pladdy and MacKay 2011), but others did not (Fauvel et al. 2014).

The current study

Following the methodology used in previous studies (Diaz Abrahan et al. 2018, 2019), the goal of our work was twofold: (1) to investigate the effect of free musical improvisation, as a focal music-based intervention, on the verbal memory of older adults and (2) to determine whether musical expertise protects the aging brain from deterioration of verbal memory. Therefore, we present two studies in which the effect of musical improvisation was compared with two control activities (active and passive) in a neutral (Study 1) and an emotional (Study 2) verbal memory task. In both studies, formal musical experience was taken as a between-subjects factor. We predicted that musical improvisation would improve neutral and emotional verbal memory and expected musicians to have better mnemonic performance than non-musicians.

Study 1: neutral verbal memory

Participants

A priori power analysis revealed that to detect a small- to medium-sized effect (η2 = .06) with 90% power, the study required total sample size of 78 (software G*power, Faul et al. 2007), based on a previous study (Diaz Abrahan et al. 2019). Eighty-four volunteers aged between 60 and 90 (M = 72.28; SD = .81) participated in this study (81% women). Twenty-four were musicians (M) with more than 5 years of formal and/or informal music training (schools, institutes, music conservatories, private practice), and 64 were considered non-musicians (NM). They were recruited from different adult day programs and music conservatories through online announcements. The participant exclusion criteria included visual or hearing impairment, amusia, or any music-related pathology, cognitive impairment, and depression. Musicians had an average of 19 years of music training, and 26% of them were not musically active, while the remaining 74% were. In the musicians’ sample, 44.8% played string instruments (piano and violin mostly), 10.03% wind instruments (flute and harmonica), 13.79% percussion instruments, and 31.03% were singers. Non-musicians had less than 6 months of musical experience, and only two volunteers reported playing guitar. Each participant signed a written informed consent form according to current ethical principles for research involving human subjects (World Medical Association Declaration of Helsinki, 2013) and completed a questionnaire where sociodemographic and musical expertise information was requested. The procedure was approved by the University of Buenos Aires Ethics Committee.

Measures

General cognitive state evaluation

Standardized psychometric tests were used to evaluate the general cognitive state (administered individually at the beginning of the procedure). The Mini Mental State Examination (MMSE, Folstein et al. 1975) was employed as a screening test to evaluate cognitive impairment. Scores between 9 and 11 are considered to be within the dementia range, scores between 12 and 24 signal cognitive impairment, and scores between 24 and 26 suggest sensitivity to dementia. For schooled participants under 75 years of age, 27 points were the cut score; when the schooled participants were over 75 years of age, 26 was the score selected to exclude participants (Butman et al. 2001). The Yesavage Geriatric Depression Scale (GDS, Martinez de la Iglesia et al. 2002; Sheikh and Yesavage 1986) was administered to measure depression specifically in older adults by assessing anhedonia, sadness, loss of interest, etc. Scores between 0 and 10 are considered to be within the normal range, scores 11–14 show sensitivity to depression, and scores over 14 signal depression. In this work, participants with a score of 11 or higher were excluded.

Verbal memory evaluation

The Rey Auditory Verbal Learning Test was used to evaluate neutral verbal memory (RAVLT, Dalmás 1993; Rey 1941, 1964; Ryan and Geisser 1986). The dependent variables of memory performance were the number of words recalled and recognized, immediately and deferred. The list was reproduced in wma format, with a Flip 3 JBL speaker (volume intensity according to the hearing of older adults), with an inter-word interval of 9 s for the acquisition task, and 7 s for the recognition task.

Interventions

Music improvisation (MI)

A music-therapy perspective of improvisation was used. During these real-time musical experiences, different sounds, melodies, and rhythms are created and combined spontaneously with the available resources, according to the possibilities of the subject in musical interaction with other people (Bruscia 1998, 1999; Wigram 2004). For this reason, musical improvisation is considered a musical experience that is flexible enough to be adapted to people with and without musical skills (Diaz Abrahan et al. 2018). Although non-musicians are not trained to play pieces of music, they are able to improvise and create melodies and rhythms (Sági and Vitányi 1988). In this study, the researcher performed a rhythmic pattern of two 4/4 bars played as a loop (ostinato), with a percussion instrument at a medium volume, during 3 min as a base, encouraging participants to join the performance (Diaz Abrahan et al. 2018, 2019). For this musical intervention, the participants were allowed to choose either percussion instruments (e.g., drums, maracas, bells, woodblocks, shakers, and tambourine) or melodic/harmonic instruments (e.g., guitar, melodica, xylophone, flutes). These instruments were included because they were easy to play. After choosing a musical instrument, the participants combined and produced musical patterns with the instruments or their voices or bodies, spontaneously creating some musical feature fitting the context provided by the base pattern.

Rhythmic imitation (RI)

The researcher performed a rhythmic pattern as a loop, the one used in MI, repeatedly during 3 min as a model to be imitated by the participants with their instruments, voices, or bodies (the participants could choose from the same array of musical instruments as that used in the improvisation intervention). This pattern was performed with a percussion instrument at a medium volume. The instructions included imitating the pattern heard as faithfully as possible, avoiding variations or new musical materials. This intervention was designed as an active control condition, and it was meant to control for possible effects of movements, music perception, and musical instruments, among others, that could explain the results.

Rest (REST)

The participants remained seated and silent for 3 min. They were asked not to write in their notebooks and not to use cell phones or have conversations with each other. This intervention was designed as a passive control condition, without sound or movement.

General procedure

A schematic design of the general procedure is depicted in Fig. 1. The study was divided into two sessions separated by a 1-week inter-session interval.

Fig. 1
figure 1

Scheme of the general procedure used in Studies 1 and 2

In the first session, four immediately consecutive phases were performed. In the first phase (information phase, 15 min), the participants signed the informed consent form and completed the sociodemographic and musical background questionnaire. Then, the MMSE and GDS tests were individually administered. In the second phase (acquisition, 9 min), the participants listened to the 15 words from RAVLT once. Simultaneously, they were asked to rate on a 0–10 scale “how emotional” or “activating” they felt each word was. This evaluation aimed to keep the attention of the participants during the task and to use the procedure implemented in Study 2. In the third phase (treatment phase, 3 min), the participants were exposed to the interventions (music improvisation, rhythmic imitation, or rest). They were randomly and blindly assigned to the different groups and always tested in groups, with a minimum of four and a maximum of 10 participants, to control the involvement of each participant in the music performance. In the musical improvisation intervention (Diaz Abrahan et al. 2018, 2019), the participants were instructed as follows: “We will listen to a rhythmic base, from which you have to create something musical as a group. This rhythmic base will help you to start the improvisation at any time you want. You can use instruments, your voice or your body. It is important to listen not only to the base but also to your group”. In the rhythmic imitation intervention, the participants were instructed as follows: “We will listen to a rhythmic base and, anytime you want, you can start to imitate me. You can use instruments, your voice or your body”. The rest condition was instructed as follows: “We ask you to remain silent and please do not do anything for a few minutes”. Before starting, the researcher corroborated that all the participants had understood the instructions. Then, the participants freely chose the musical instrument that they wished to play (MI and RI interventions), and they performed the improvisation or rhythmic imitation task in groups for 3 min, while the REST group remained seated together and silent for 3 min. Soon afterward, the fourth phase (test phase, 11 min), a two-task memory test, was run. Participants were given paper and pencil to write down as many words as they could remember from the RAVLT (Immediate Free Recall task), and then the 15 words of the RAVLT were mixed with 15 new words and the participants were asked to indicate whether they had listened to the item before or not (Immediate Recognition task).

A week later, the second session (11 min) was held, and the two-task memory test was run again (Deferred Free-Recall Recognition task).

Musical experience analysis

To ensure that each group complied with the instructions given, three external evaluators were recruited to watch the videotapes and assess the following parameters using a 5-point scale: (1) the level of detachment from the rhythmic base, (2) the presence and variation of melodic patterns, (3) the degree of participation of each participant, and (4) the degree of creativity in the music production. In order to estimate the inter-rate reliability, the correlation between evaluators was analyzed. The Pearson correlation results showed a high level of agreement r(9) = .915, p < .01.

Data analysis

Age, years of Academic Education, and years of Musical Education were analyzed independently via univariate analysis of variance (ANOVA), where Intervention (Music Improvisation vs Rhythmic Imitation vs Rest) and Training (Musicians vs Non-musicians) were the between factors, while Sex was analyzed via χ2.

Free recall and recognition (immediate and deferred) were analyzed independently with an analysis of covariance (ANCOVA). The between factors were Intervention (Music Improvisation vs Rhythmic Imitation vs Rest) and Training (musicians vs non-musicians), while Sex, Age, and Academic Education were the co-variables used to ensure that the factors were isolated.

Post hoc least-significant difference (LSD) pairwise comparisons were conducted to analyze significant main effects and significant interactions. The partial Eta square (η2p) was utilized to estimate effect size. The alpha value was set at .05, and the SPSS software package was used to compute descriptive and inferential statistics.

Results

Sample characteristics

The final sample was composed of 78 participants because six evaluations were discarded due to cognitive deficit and/or depression. The final number of participants per group was as follows: M/MI = 10; M/RI = 10; M/REST = 12; NM/MI = 13; NM/RI = 19; and NM/REST = 14.

Regarding sociodemographic information, the groups were balanced in terms of Age and Academic Education, but significant differences emerged for Sex, regarding the Training factor, χ2 = 8.01, p = .005. Regarding Music Education, the ANOVA indicated significant differences in Training F(1, 71) = 70.88, p < .0001, η2p = .500, where musicians had a higher music education than non-musicians, as expected, and the ANOVA also indicated a main effect of Intervention F(2, 71) = 3.86, p = .026, η2p = .098 and a significant effect in the double interaction Training x Intervention F(2, 71) = 4.30, p = .017, η2p = .108, which indicated that musicians who participated in the musical improvisation intervention had fewer years of musical experience than the musicians who participated in the other two interventions (Table 1).

Table 1 Sociodemographic and general cognitive state information from Study 1

Memory performance

Immediate measures

After the interventions (improvisation, rhythmic imitation, or rest), the participants were asked to recall as many words as possible. The ANCOVA yielded no significant differences for free recall (p > .05).

After the free recall, the participants listened to the 15 original RAVLT words randomly, intermixed with 15 new ones. They had to discriminate the new words from the original ones (recognition task). False recognitions were subtracted from the total recognition score. The ANCOVA yielded no significant differences for free recall (p > .05).

Deferred measures

The two memory tests were administered again a week after the first session. Figure 2a illustrates the results of the deferred free-recall task. The ANCOVA indicated a main effect of Training F(1, 58) = 5.118, p = .027, η2p = .081 and Intervention F(2, 58) = 19.653, p < .0001, η2p = .404. The post hoc analysis showed that the musicians remembered more words than the non-musicians (p = .027) and that the musical improvisation groups remembered more words than the other two interventions (rhythmic imitation and rest; p < .001).

Fig. 2
figure 2

Deferred free recall and recognition. Mean number of words that the groups could remember (a) and recognize (b) a week after the first session. M/MI: musicians’ improvisation group; M/RI: musicians’ rhythmic imitation group; M/REST: musicians’ rest group; NM/MI: non-musicians’ improvisation group; NM/RI: non-musicians’ rhythmic imitation group; NM/REST: non-musicians’ rest group. *Significant differences in Intervention, #Significant differences in Training. Vertical lines represent standard errors of the mean

The second memory task was recognition, illustrated in Fig. 2b. The ANCOVA indicated a main effect of Intervention F(2, 58) = 5.69, p = .006, η2p = .164, and the post hoc analysis showed that the musical improvisation groups recognized more words than did the rhythmic imitation (p = .013) and rest (p = .002) groups.

Study 2: emotional verbal memory

Participants

Ninety-one new volunteers participated in this study (81% women), and they were aged between 60 and 90 (M = 71.91; SD = .85). Twenty-four were musicians (M) with more than 5 years of formal and/or informal music training (schools, institutes, music conservatories, private practice). Fifty-seven were considered non-musicians (NM). Musicians had an average number of 17 years of musical experience, 63% of the musicians were not musically active, while 37% were. In the musicians’ sample, 56.66% performed string instruments (piano and violin mostly), 2.66% wind instruments (flute and harmonica), and 36.66% were singers. Non-musicians had formal music training of less than 3 months and only two volunteers reported playing guitar and piano. The exclusion criteria, recruitment criteria, and ethical aspects were those used in Study 1.

Measures

General cognitive state evaluation

See Study 1.

Emotional memory evaluation

Thirty-six words were selected from the Spanish version of the Affective Norms for English Words (ANEW, Bradley and Lang 1999; Redondo et al. 2007). Twenty-four words were emotionally arousing (12 with positive valence and 12 with negative valence), and 12 were non-arousing, neutral words. Following guidelines from previous works (Bradley and Lang 1999), we selected the words that covered a wide range of arousal (from 3.99 to 7.79) and valence (from 1.11 to 8.50). The dependent variables of memory were the number of words recalled and recognized, immediately and deferred.

Interventions

The interventions in Study 2 were the same as those in Study 1.

General procedure

This study was also performed in two sessions with a 1-week intersession interval (Fig. 1). The first session consisted of four immediately consecutive phases. The first phase was identical to that of Study 1. In the second phase (acquisition phase, 7 min), the participants listened to the 36 selected words for 7 s each. The words were presented in random order except for the first and last locations in the series, which were neutral words (Cahill et al. 2003). To validate the selection of the ANEW words for this research context, while listening to the words, the participants were asked to rate on a 0–10 scale “how emotional” or “activating” they felt each word was (from 0 = not arousing at all to 10 = highly arousing). The third phase (intervention) was identical to the one employed in Study 1. Soon afterward, in the fourth phase (test phase, 11 min), a two-task test was run. The participants were asked to write the words they could remember (Immediate Free-Recall task). Next, they listened to the 36 original words mixed with 36 new words in random order, and they were asked to mark on a sheet of paper if they had listened to the words before or not (Immediate Recognition task).

A week later, in the second session (11 min), the two-task memory test was run again (Deferred Free-Recall and Recognition tasks).

Data analysis

Age, years of Academic Education, and years of Musical Education were analyzed independently via univariate analysis of variance (ANOVA), where Intervention (Music Improvisation vs Rhythmic Imitation vs Rest) and Training (Musicians vs Non-musicians) were the between factors, while Sex was analyzed via χ2.

Free recall and recognition (immediate and deferred) were analyzed independently with repeated-measures ANCOVA. The between factors were Intervention (Music Improvisation vs Rhythmic Imitation vs Rest) and Training (musicians vs non-musicians), while Word (neutral, positive, and negative) was the RM, and Sex, Age, and Academic Education were the co-variables, aimed at ensuring the isolation of the factors.

Post hoc least-significant difference (LSD) pairwise comparisons were made to analyze significant main effects or interactions. The partial Eta square (η2p) was utilized to estimate effect size. The alpha value was set at .05, and the SPSS software package was used to compute descriptive and inferential statistics.

Results

Sample characteristics

The final sample consisted of 86 participants because four evaluations were discarded due to cognitive deficit and/or depression, and the final number of participants per group was as follows: M/MI = 13; M/RI = 10; M/REST = 10; NM/MI = 16; NM/RI = 21; and NM/REST = 16.

Regarding sociodemographic information, Sex yielded no significant differences, but Age did F(1, 80) = 7.57, p = .007, η2p = .087, with the non-musicians being older than the musicians. Also, Academic Education yielded significant differences F(1, 75) = 19.46, p < .0001, η2p = .206, with the musicians having a higher academic educational level than the non-musicians. Regarding Music Education, the ANOVA indicated significant differences in Training F(1, 80) = 66.74, p < .0001, η2p = .455, where the musicians had a higher music educational level than the non-musicians, as expected. The ANOVA also indicated a main effect of Intervention F(2, 80) = 4.11, p = .020, η2p = .093 and a significant effect for the double interaction Training x Intervention F(2, 80) = 4.21, p = .018, η2p = .095, which showed that the musicians who participated in the rhythmic imitation intervention had more years of musical experience than the musicians who participated in the other two interventions. Accordingly, Age and Academic Education were employed as co-variables (Table 2).

Table 2 Sociodemographic and general cognitive state data from study 2

Memory

Immediate measures

The ANCOVA indicated a main effect of Word F(2, 144) = 3.233, p = .042, η2p = .043. The post hoc analysis showed that emotional words were better remembered than neutral words in the free-recall test (p < .001), and there were no differences between positive and negative words. No other significant differences were found in Intervention or Training factors in the free-recall and recognition tests (p > .05).

Deferred measures

The free-recall and recognition tasks were conducted again a week after the first session. Figure 3a shows the results of the free-recall task. The ANCOVA indicated a main effect of Training F(1, 66) = 12.89, p = .001, η2p = .166 The post hoc analysis showed that the musicians remembered more words than did the non-musicians (p = .001).

Fig. 3
figure 3

Deferred free recall and recognition. Mean number of neutral, positive, and negative words that the groups could remember (a) and recognize (b) a week after the first session. M/MI: musicians’ improvisation group; M/RI: musicians’ rhythmic imitation group; M/REST: musicians’ rest group; NM/MI: non-musicians’ improvisation group; NM/RI: non-musicians’ rhythmic imitation group; NM/REST: non-musicians’ rest group. #Significant differences in Training. Vertical lines represent standard errors of the mean

Figure 3b shows the results of the recognition task. The ANCOVA indicated a main effect of Training F(1, 65) = 10.70, p = .002, η2p = .141, and the post hoc analysis showed that the musicians recognized more words than did the non-musicians.

Discussion

The main goal of this investigation was to evaluate the effect of musical improvisation, as a focal music-based intervention, on neutral and emotional verbal memory in older adults. According to our previous studies, the first prediction was that musical improvisation would enhance both types of verbal memories. This prediction was confirmed to some extent. Overall, the results showed an effect of the type of intervention since the group involved in musical improvisation performed better than the other groups, though only in neutral verbal memory.

A novel approach to musical improvisation was implemented. Improvisation in music is generally associated with jazz, which requires high musical knowledge to for musicians to perform it (Limb and Braun 2008; Kleinmintz et al. 2014; McPherson and Limb 2013; McPherson et al. 2016). Here we opted for a music-therapy approach, where the technique has such flexibility that it can be applied to different populations, with or without musical knowledge (Abrahan and Justel 2015; MacDonald and Wilson 2014). In this perspective, the improvisation intervention was applied after learning of verbal information (neutral and emotional), and the performance was compared with two control groups.

Research on the effect of non-pharmacological or music-based interventions on the health of older adults has had methodological difficulties. For example, the studies that investigated groups of adults had no adequate control groups, which suggests that the positive effect of the interventions could be due to social variables (Belleville 2008; Reijnders et al. 2013). Therefore, we employed two different control groups, one active (rhythmic imitation) and one passive (rest) to control those variables that could explain the results. Musical improvisation, in comparison with rhythmic imitation and/or rest, enhanced recall and/or recognition of neutral verbal memory. As regards the improvisation and imitation groups, we intended to compare two interventions involving the same sensory and motor components, in order to isolate creative and interactive music behavior. In both interventions, the musical activities involved sound, and the participants listened to the same rhythmic pattern, played instruments in groups, and had the opportunity to use their bodies and/or voices. However, only music improvisation modulated memory, and therefore, the findings could be attributed to the characteristics of each musical experience. In the following paragraphs, we discuss the differences between these distinct music tasks.

As regards rhythmic imitation, the pattern employed in this study was an easy-to-reproduce rhythmic base. However, from a neurocognitive perspective, imitation tasks are far from simple (Engel et al. 2001; Raichle 2010). It has been shown that, in the general population, rhythm processing activates specific pathways in several brain areas, such as auditory cortices, as well as frontal, parietal, and motor regions, including the supplementary motor area, the basal ganglia, and the cerebellum (Grahn 2012; Grahn and Brett 2007; Merchant et al. 2015). The necessary sensory-motor coupling for rhythmic synchronization requires a sophisticated mechanism of temporal prediction and feedback (Miendlarzewska et al. 2013; Schroeder et al. 2010), which in turn makes high cognitive demands, especially attention, inhibitory control, and working memory (Benz et al. 2016). In this sense, imitation could impose restrictions due to its great cognitive demand (especially attention), which would “consume” the necessary resources for memory consolidation and result in an interference effect.

The research on the neurocognitive mechanisms of musical improvisation is focused on musicians (Limb and Braun 2008; Kleinmintz et al. 2014; McPherson and Limb 2013; McPherson et al. 2016), whereas studies on improvisations performed by non-musicians and those from a music-therapy perspective are scarce. However, a study conducted by Berkowitz and Ansari (2010) compared the brain activation during a musical improvisation (melodic and rhythmic) of musicians and non-musicians and found only one difference between the groups: the deactivation of the right temporoparietal junction (Fink et al. 2009). This research allows using the background that exists on musical improvisation to justify the results of the modulatory effect on memory.

Multiple studies reported that musical improvisation involved creative decision-making processes and activated the premotor cortex, supplementary motor areas, language areas, insula, cingulate cortex, medial prefrontal cortex, temporal areas (Loui 2018; McPherson et al. 2016), and amygdala (Beaty 2015; Engel and Keller 2011). This brain activation and its behavioral outcome with its consequent emotional state could play a decisive role in the modulation of memory performance by older adults who participated in musical improvisation, considering that emotion was identified as a modulator of acquisition and consolidation of memory (Shields et al. 2017; Tambini et al. 2017).

From music psychology and music-therapy perspectives, musical improvisation is a complex phenomenon with unique psychological features (MacDonald and Wilson 2014). In addition to the creative features of the process, another characteristic is that this kind of musical improvisation is social and it involves the contributions and communication of two or more individuals (in our case four to ten people), each of them creating and musically responding to the other(s) and their playing (Walton et al. 2018). It is spontaneous since music is formed as it is played through real-time responses to an immediate musical context. It is creative since improvising music produces novel musical items every time the participant plays, which may be similar to, but different from, any previous performance (MacDonald and Wilson 2014). This type of social characteristic present in improvisation could induce a positive emotional state that could modulate memory, but research in this line, with empirical studies, is scarce.

Study 2 was aimed at evaluating verbal emotional memory by means of a focal music-based intervention, which had no effects on memory measures. This unexpected result differed from our previous research on visual emotional memory, where musical improvisation had robust effects (Diaz Abrahan et al. 2019). A possible explanation is that the emotional induction of the material presented in the acquisition phase (emotional words) was combined with the emotional induction of the musical experience (Rickard et al. 2012), resulting in too much emotion, which may have saturated the system. Besides, in this relationship, both stimuli (coding information and modulating intervention) shared the same auditory sensory modality. This double interaction could have resulted in the overload of the system, thus interfering with consolidation (Duncan et al. 1997).

The second goal of this work was to identify the influence of music training on older adult’s memory. A systematic review showed that musicians have better memory performance than non-musicians (Talamini et al. 2017) and two studies found that older adults who were instrumental musicians performed significantly better than non-musicians on tests of visual memory, executive functioning, and language (Hanna-Pladdy and MacKay 2011; Strong and Mast 2018). For this reason, we expected an effect of music training on both types of verbal memory. This prediction was partially confirmed because the effects were observed on deferred memory measures.

Neutral verbal memory has been widely investigated in studies contrasting older musicians and non-musicians. Some studies indicated that music training improved performance in long-term verbal memory (Franklin et al. 2008; Hanna-Pladdy and MacKay 2011) and short-term memory (Fauvel et al. 2014), while others found no effects (Strong and Mast 2018). Our results are in line with the first group of evidence. A plausible explanation for the better performance of musicians is that there are cerebral adaptations as a product of music training, which in turn enhance cognitive functions (Benz et al. 2016; Delogu et al. 2019; Justel and Diaz 2012; Zatorre et al. 2007; Zhao et al. 2017; Zuk et al. 2014). Neuroimaging studies showed that musical learning generated greater activation of the temporal planum and the left dorsolateral prefrontal cortex, areas involved in verbal processing (Taylor and Dewhurst 2017; Schlaug et al. 2005). Along the same lines, Patel (2003) suggested that although these two types of representations (words vs music) are stored in different neuronal substrates, the processing could share neural networks, and in this sense music training could induce changes in the neuronal circuits related to verbal memory.

However, the change reported for music training may not be intense enough or may not extend to the circuits of short-term memory or immediate evaluation, highly affected during aging (Rhodes and Katz 2017; Rieckmann et al. 2017), which could explain the lack of effect of music training on every immediate measure, compared to deferred evaluation. More studies are necessary to further investigate this result.

In recent years, research has focused on delimiting the relationship between music and cognitive performance. Several behavioral (Jakobson et al. 2008; Tamminen et al. 2015) and electrophysiological studies (Dittinger et al. 2016) indicate that the advantage of musicians over non-musicians in verbal and visual memory could be due to the employment of more effective encoding strategies by musicians when learning new material, and not due to better storage mechanisms. This type of evidence opens up new topics for discussion about the specific effects of music learning.

Limitations and future directions

The limitations identified in this study are associated with the characteristics of the samples. First, music and academic educational level have both been identified as protective factors of cognitive decline in aging. In this study, the academic level was evaluated in the statistical analysis of memory performance as a co-variable. In future studies, groups of musicians and groups of professional non-musicians with a high educational level should be compared. In addition, the groups of musicians were unbalanced according to their musical experiences. Although a group of elderly musicians is a difficult population to recruit, having homogeneous samples in terms of music training is a point to be considered in future studies.

Regarding the effect of music training, some studies investigated whether the protective effect is due to early experiences or whether it can be constructed in later stages of life (Herholz et al. 2013). We did not include this item in our musical background questionnaire, which could have yielded differences in memory performance. This would be another interesting point to investigate in the future.

Another activity identified as a protective factor during aging is the learning of a second language (Abutalei et al. 2015). Our sociodemographic questionnaire did not include questions about bilingualism, an aspect that should have been considered as an exclusion criterion due to the relationship between the type of learning and the type of memory that was evaluated in this study.

The research conducted in our laboratory was focused on different types of episodic memory, i.e., visual and verbal modalities (with or without emotional content). However, the effect of music training or participation in music-based interventions in implicit procedural memories was not investigated. In this line, future research could consider this topic.

Finally, to isolate the effect of musical intervention based on a creative task, we included music reproduction as a control condition because it does not include the creative component. The musical interaction involved in improvisation, but not in reproduction, was not controlled for, so future studies could compare music improvisation with another task that is musical and collaborative but not creative.

Conclusions

Our ability to perceive, produce, and enjoy music is considered a universal trait. Humans have the creative capacity to generate and modify patterns of sound in response to each other, from early infant–mother interactions onward to older age (Trevarthen 2002). Since all humans are musical improvisers to some extent, everybody can engage in improvisation at some level, unlike many other musical activities where specific technical skills or knowledge are required for people to participate. This approach was taken in the present work, and we found that musical improvisation modulates neutral verbal memory. The current results have implications for regular or pathological decline during aging. This accessibility of music and musical activities shows that it has the potential for a wide application as an intervention for cognitive stimulation and rehabilitation of older adults.

We also found that long-term musical practice also seems to prevent age-related decline in verbal memory cognition. Learning music may provide protective factors against age-related cognitive decline and affected cognitive reserve.

Experiencing a musically enriched environment, by learning music or participating in music-based intervention programs, affects our cognitive functions, such as the neutral and emotional verbal memory studied in this research, thus promoting healthy aging.