Introduction

Sound waves with a frequency greater than the upper limit of the human hearing range are known as ultrasound (US). Typically, ultrasound has a frequency in the range from 20 kHz to 10 MHz, which can be subdivided into three main regions: low-frequency, high-power ultrasound (20–100 kHz); intermediate-frequency, medium-power ultrasound (100 kHz–1 MHz); and high-frequency, low-power ultrasound (1–10 MHz) [7]. The frequency range selected for food processing depends on the requirements of the process in question.

Low-frequency, high-power ultrasound uses intensities higher than 1 W/cm2, which induce strong cavitation effects that influence the physical, mechanical or chemical/biochemical properties of foods [75, 77]. Examples of suitable applications include emulsification and homogenisation. In contrast, high-frequency low-power ultrasound uses intensities below 1 W/cm2 where the physical effects are comparatively gentle and as such can be utilised for processes such as non-invasive analysis and monitoring food materials, and non-destructive separations of multi-component mixtures [68]. The intermediate frequency range is characterised by peak sonochemical effects and as such can be selected to initiate chemical modifications in food systems.

There are quite a number of review articles published focussing on ultrasonic processing of different food applications in recent years [11, 13, 2527, 34, 68, 91, 93, 104, 111, 115]. Although many reviews focussed on different food systems, only one review concentrated on US processing of dairy products [11]. Recent advances in understanding ultrasound processing, particularly in dairy systems, have been reported in the last 4 years. In this review, we highlight some of these recent studies with particular attention to the benefits and/or undesirable properties obtained through the use of ultrasound compared with conventional processing. Examples of such applications include emulsification, inactivation of bacteria, crystallisation, functionality improvements and fat separation.

High-Intensity Ultrasound

When ultrasound passes through a liquid medium, the interaction between the ultrasonic waves, liquid and dissolved gas leads to a phenomenon known as acoustic cavitation [25]. Several review articles and book chapters describe acoustic cavitation in detail [7, 12, 74] and hence will not be described in significant detail here.

In summary, dissolved gas bubbles within a liquid medium oscillate under the influence of the acoustic field. Over time, these bubbles will grow due to two main growth mechanisms. An individual bubble within a system experiences a fluctuating pressure during bubble oscillation. This can cause dissolved gas and/or solvent vapour to diffuse in and out of the oscillating bubble. If the driving pressure is above a certain threshold, the amount of gas/vapour that diffuses into the bubble during the expansion phase will be greater than the amount that diffuses out of the bubble during the compression phase of the bubble oscillation. This leads to a net growth over time. Such a process is called rectified diffusion. In systems that contain many bubble nuclei (i.e., food and dairy systems), two or more bubbles can combine by bubble–bubble coalescence. This is another possible growth pathway.

Both rectified diffusion and bubble–bubble coalescence lead to the growth of bubbles towards what is known as a resonance size range. This size range varies with the applied frequency of ultrasound and is in general inversely proportional to the ultrasonic frequency. When bubbles reach the resonance size range, they may grow to a maximum size within one or several acoustic cycles and violently collapse. This is known as transient cavitation. In some cases, bubbles can reach the resonance size and continue to oscillate at or near this size for some time. This is known as stable cavitation.

One of the physical phenomena resultant from strong bubble collapse is the generation of temperatures of thousands of degrees in the local vicinity of the bubble [12]. It was found that collapsed bubbles have hot plasma cores with temperatures up to 10,000 K. The temperatures near the surfaces of the bubbles are only few hundreds of degrees [106]. The importance of the medium towards the temperature within bubbles is discussed thoroughly by Ashok kumar [12]. In addition to generating chemical reactions, acoustic cavitation also generates violent physical forces that include micro-jets, shear forces, shock waves and turbulence [7]. Pressures up to several hundred atmospheres can be produced. Highly reactive radicals can be generated within the cavitation bubbles. In food processing, the normal solvent is water, and hence, the extreme conditions generated on collapse of the cavitation bubbles are sufficient to cause rupture of the O–H bonds leading to the formation of H and OH radicals and other recombined products such as hydrogen peroxide [76]. These radicals can participate in a range of oxidation reactions in food media.

High-Intensity Ultrasound Applications in Dairy Processing

The application of ultrasound to liquid-type products in dairy processing leads to the generation of physical forces that include cavitation, acoustic streaming, acoustic radiation, shear, micro-jetting and shockwaves. These physical forces can be used in the generation of emulsions, destruction of bacteria, crystallisation of lactose and separation of fat in dairy systems. In addition, the functional properties of dairy components can be significantly affected by both the physical and chemical effects resultant from acoustic cavitation and other forces of ultrasound.

Formation of Metal Particulates

One of the biggest concerns of using high-intensity ultrasound on food systems is the cavitation-related release of metallic particles into food systems from sonication with ultrasonic transducer probes that come in direct contact with food. Mawson et al. [78] and his CSIRO team investigated the formation of metallic particulates by erosion of a series of transducers in water systems at various frequencies ranging from 18 kHz to 2 MHz. Their study showed that metal leach did occur, but values remained below the accepted drinking water limits, even after long exposure times (7.5-h horn transducers, 7 days for plate transducers). Furthermore, metallic nanoparticles were not detected within the size range of concern to health (i.e., below 80 nm), therefore suggesting no evidence of health hazards from the use of sonotrodes and plate transducers in direct contact with food materials at a range of ultrasonic frequencies.

These long exposure times, as investigated by Mawson et al., [78] are not typically required in food processing applications. It has been shown that ultrasound-induced benefits for various applications in dairy processing can be achieved within the first few seconds of ultrasound application [9, 10]. With the availability of continuous flow-through systems (Prosonix, CA & Hielscher Ultrasonics, Germany), the contact time of product with the acoustic field and sonotrode surfaces can be minimised. Hence, metal particulate generation due to ultrasound application can be considered negligible and not a significant safety concern for ultrasonic dairy processing.

The influence of radicals formed towards the sonochemical production of metal nanoparticles [86] is of another concern in using high-intensity ultrasound on food systems. However, Mawson et al. [78] suggested that the buffering capacity of food materials may prevent the acceleration of production of nanoparticles with the use of radicals.

Furthermore, Ashok kumar et al. [8] showed that unwanted reactions between ultrasonically generated radicals and food ingredients can be minimised by selecting 20 kHz conditions. The addition of radical scavenging compounds like ascorbate can also be used to further reduce the effect radicals. At present, the use of high-intensity ultrasound (especially 20 kHz) on dairy applications can be deemed acceptable for use without any major concerns.

Emulsification

Ultrasonic emulsification offers several benefits over conventional emulsification methods used in dairy systems such as mechanical shaking, high- or ultra-high-pressure homogenising and microfluidising. Improved energy efficiency, higher emulsion stability, lowered requirement of surfactants and controllable size distributions have been reported [1, 55].

Ultrasonic emulsification is primarily driven by cavitation, wherein bubble collapse at or near the oil–water interface causes disruption and mixing of the two phases, resulting in the formation of very fine emulsions [107]. Ultrasound-assisted emulsification is influenced by many variables such as irradiation power [2, 43], position of the ultrasonic source with liquid–liquid interface [102], tip diameter [55], vessel size [55], viscosity of the continuous phase, surfactants concentration [1], hydrostatic pressure and the presence of dissolved gases [14].

The intensity of bubble collapse during cavitation events is the controlling mechanism of power dissipation in an acoustic field. Two important parameters are known to affect this: the hydrostatic pressure of the system and the gas content in the liquid phase.

The hydrostatic pressure may have two effects on the cavitation. Firstly, an increase to the ambient pressure raises the cavitation threshold pressure. This suppresses the likelihood of cavitation by decreasing the parameter space in which cavitation will occur (see Fig. 1). However, it is also known that the intensity of the shock waves generated by collapsing bubbles increases with ambient pressure [101].

Fig. 1
figure 1

Onset cavitation due to local pressure oscillations in a sound field: effect of a shift in ambient hydrostatic pressure (adapted from [14])

The concentration of gas in the liquid phase provides the basis for bubble formation and growth. But at the same time, increasing the amount of gas in the system tends to increase the gas/vapour pressure ratio inside the bubbles, which can cushion the bubble collapse and hence reduce the shock wave intensity. Therefore, overly ‘gassy’ liquids may be less efficiently emulsified.

Behreud and Schubert [14] showed that the application of higher ambient pressure and increased concentration of gas neither influences the magnitude of the cavitational effects per unit volume nor the intensity of the cavitation collapse, although the energy dissipation per unit volume was found to control the particle size distribution of a system. In contrast, Leong et al. [67] showed that pressures up to 400 kPa improved the efficiency of ultrasonic emulsification of food oils, whereas operating with ambient pressures over 450 kPa suppressed the cavitation activity and no emulsification could be achieved.

Shanmugam and Ashokkumar [100] demonstrated the possibility of incorporating novel food oils into milk systems by ultrasonic emulsification. A 20-kHz ultrasound horn was used to emulsify flaxseed oil in a volume of skim milk. No addition of surfactants was required to stabilise the emulsion, as it was found that a small amount of partially denatured whey proteins surrounded the emulsified oil droplets, providing sufficient stability for a minimum of 9 days. The advantage of US is further highlighted since no stable emulsions could be produced when emulsification was instead performed with high-energy mechanical mixing (using matched specific energies).

Another emulsification technique is micro-fluidisation. However, Jafari [51] showed that increasing the micro-fluidisation energy input beyond moderate pressures (40–60 kPa) led to over-processing of emulsion droplets due to re-coalescence and found that decrease in emulsion droplet size <0.5 µm by micro-fluidisation was not possible. In contrast, increased energy input helped to reduce emulsion droplet size with minimum re-coalescence of new droplets with US emulsification [51]. Different reasons for re-coalescence were found including low adsorption rate of the surface active agent, low residence time of the emulsion in the emulsification zone, high rate coalesce frequency and extreme amount of energy density [52].

Homogenisation

High-pressure homogenisation is the most frequently used technique within the dairy industry for homogenisation of milk. Certain dairy products such as milk, yoghurt and ice-cream are homogenised to improve the product stability against creaming during storage. In recent years, the use of ultrasound has attracted much interest for the homogenisation of milk [5, 20, 42, 114] with benefits being similar to those already described in ultrasonic emulsification. However, there remains some uncertainty as to which technique is more efficient for homogenisation and the primary mechanism for its effectiveness.

Koh et al. [61] used a 20-kHz ultrasonic horn and a high-pressure homogeniser to investigate the particle size reduction of whey protein concentrate (WPC) systems in the presence/absence of cavitational conditions. The particle size reduction in WPC systems under high-pressure conditions showed comparable results when subject to ultrasound under similar energy conditions. Notably, cavitation-derived shear forces were found to be absent in high-pressure homogenisation. Hence, the study highlighted the fact that the shear forces generated were mainly responsible for the observed particle size reductions even in the absence of cavitational effects [61]. In the case of ultrasound application, the shear forces are resultant from strong bubble collapse.

The frequency, amplitude and diameter of the ultrasonic probe have a significant influence on the physical properties and variance of milk that is homogenised. Bermúdez-Aguirre et al. [14] showed that ultrasonic homogenisation (400 W, 24 kHz, using a 22-mm probe) of milk at 63 °C for 30 min caused disintegration of the milk fat globule membrane. The diameter of the milk fat globules was reduced to <1 μm compared with the native fat globule dimensions of 4.3 μm and displayed a granular surface morphology due to their interaction with casein micelles. Villamiel and de Jong [110] reported a milk fat globule size reduction of up to 82 % during continuous flow, high-intensity (150 W, 20 kHz, using a Branson sonifier with 18.76 mL cavity) ultrasonication of milk. At either 70 or 75 °C, a monomodal particle size distribution was obtained, whereas a bimodal distribution was observed at lower temperatures. Bosiljkov et al. [20] showed that an increase of the amplitude (20, 60 and 100 %) and time (2–15 min) of ultrasound (30 kHz, using 7 and 10 mm probes) significantly influenced the degree of homogenisation of milk.

Wu et al. [114] reported that high-amplitude ultrasound (90, 225 and 450 W, 20 kHz) not only effectively homogenised milk, but also significantly improved the viscosity and water holding capacity and reduced syneresis of yoghurt produced from ultrasonicated milk. These effects are directly related to the yoghurt structure, which is based on strings or clusters of casein micelles interacting physically with each other and with denatured serum proteins entrapping serum and fat globules. Furthermore, ultrasound could cause some qualitative changes in the fat globule membrane, which would modify the ability of fat globules to interact with themselves and/or casein micelles, improving the gelling properties (unpublished data).

Sonocrystallisation

Sonocrystallisation is the use of power ultrasound to control the crystallisation process commonly used during the nucleation phase of crystallisation. Rapid sonocrystallisation for lactose recovery in the food industry relies on different aspects of ultrasound [26]. The crystallisation of biological soft materials with the use of ultrasound plays a key role in controlling the crystal structure, shape and rate of crystallisation. US is a promising tool for rapid crystallisation of food materials, which will increase the efficiency of some traditional processes, leading to cost-effectiveness [39]. Some of the interesting dairy applications for sonocrystallisation will be highlighted here.

Lactose Crystallisation

In the dairy industry, liquid whey is spray-dried to manufacture WPC as a means of utilising whey waste streams. Whey powders have significant value when incorporated into products such as infant formula and body building powders. In liquid whey, lactose is the most abundant component. However, its presence restricts the spray-drying abilities. The removal of lactose by crystallisation is hence an important process that is required prior to spray-drying liquid whey.

Conventional methods for lactose crystallisation are, however, laden with issues such as long induction times and slow crystallisation rates. A rapid recovery of lactose by ultrasound-assisted crystallisation has been reported recently [21, 22, 89]. In the study by Bund and Pandit [21, 22], the whole process of lactose sonocrystallisation was rapidly completed from reconstituted lactose solutions in the presence of ethanol as an anti-solvent at 30 ± 2 °C. The lactose recovery was much higher (91 %) in the case of sonicated samples when compared to non-sonicated (14 %) samples at the end of 5 min. As ultrasound travels throughout the crystallisation vessel, it mixes the anti-solvent uniformly in the lactose solution. Cavitation events are also likely to be increased due to the lowering of the vapour pressure of the solution as a result of the addition of ethanol. This causes a sharp decrease in the local solubility of lactose. The solution reaches super saturation, causing nucleation and rapid lactose recovery.

Similarly, Patel and Murthy [88, 90] used sonocrystallisation for lactose recovery from whey waste streams. They pointed out that recent increases to worldwide demands for dairy products have led to increased amounts of whey waste, driving an increasing necessity to make use of such waste streams. The crystallisation was reportedly completed with yields in the range of 80–92 % within 4 min of sonication. A recent pilot-scale study by Zisu et al. [120] focussed on crystallisation of lactose in whey systems using a non-contact approach at flow rates of up to 12 L/min, where lactose was concentrated to 32 % prior to sonication at 30 °C. The applied energy densities were varied from 3 to 16 J/mL at a frequency of 20 kHz. It was found that sonication of whey initiated the rapid formation of a large number of lactose crystals in response to acoustic cavitation, which increased the rate of crystallisation. The rate of sonocrystallisation was faster compared with mechanical stirring for processing durations of approximately 120 min. However, once the metastable limit was reached between 120 and 180 min, the rate slowed. A second treatment with ultrasound for 120 min delivering an applied energy density of 4 J/mL stimulated further nuclei formation, and the rate of crystallisation was maintained for >300 min. Yield on the other hand was limited by the solubility of lactose and could not be improved. The crystal size distribution was also narrower than that with stirring, and the overall crystal size was smaller. The study highlighted the importance of the conditions used for promoting crystallisation.

A dramatic shortening of the induction times (along with an increase in the corresponding nucleation rates) observed in the sonicated systems has been attributed to two major factors: ultrasound-stimulated nucleation in the high-pressure vicinity of the collapsing bubbles due to the melting point shift and the consequent increase in supercooling [21, 22]. The cavitation hot spots where bubbles collapse are thought to be privileged nucleation centres, where the critical energy for crystal formation is decreased. The shock wave and micro-jet from local high pressure of the cavitation collapse can also accelerate the motion of the liquid molecules and increase molecular impacts so as to prompt nucleation. In addition, acoustic-induced nucleation is a well-defined initial point for crystallisation, which permits better modulation of crystal growth.

Furthermore, the rapid collapse of bubbles reduces the crystallisation temperature and thereby increases super saturation [60]. Even small changes in super saturation were found to significantly reduce the nucleation rate. In addition, the micro-turbulence governs the growth of the nuclei [83]. The metastable zone width (MZW) provides information for developing a controlled crystallisation process (Fig. 2). The application of ultrasound can reduce the MZW, and this can have many significant benefits such as improved control over crystal size and habit.

Fig. 2
figure 2

Schematic representation of reduction in the metastable zone of lactose with ultrasound [39]

In comparison with mechanical agitation, ultrasound can provide more uniform mixing that can avoid unwanted zones of excessive super saturation in the vessel. Ultrasound can be used to induce nucleation where spontaneous primary nucleation cannot occur, and hence, ultrasound can potentially replace seeding in lactose solution [21, 22].

The effect of cavitation to speed up the nucleation rate is a complex phenomenon and is not yet completely understood. It could be hypothesised that every cavity takes up some part of the solvent, causing instantaneous local super saturation. It could also be speculated that due to hot spot formation as a result of cavity collapse, local solvent evaporation takes place within the cavitation nuclei. On collapse of a cavity, these nuclei are introduced into the crystallisation slurry, causing further cavitation opportunities due to the presence of these solids [45].

Although research on sonocrystallisation of lactose has been reported mostly with the use of the anti-solvents, two recent studies by Zamanipooor et al. [116] and Dincer et al. [41] investigated the effectiveness of ultrasound in enhancing the crystallisation process of lactose in aqueous (i.e., non-solvent) systems. These studies aimed to model the effects of ultrasonic variables on nucleation rate, growth rate and size distribution of lactose crystals. In the study by Zamanipooor et al. [116], ultrasound with a frequency of 20 kHz was applied using an ultrasonic probe to the lactose in water solutions maintained at a constant temperature of 22 ± 1 °C. The variables that significantly (P < 0.05) affected the responses were found to be the lactose concentration and amplitude, with concentration being the most significant. The sonicated sample at the optimal sonication conditions led to nucleation rates 10.6 times higher and yields 5.6 times higher compared with the control. The results of this study by Zamanipooor et al. [116] emphasised the potential effects of ultrasonic treatment in promoting the nucleation rate and yield in the manufacture of lactose without use of solvents. Furthermore, the fact that sonication time had no significant effect (P > 0.05) on the responses implied that sonication duration can be kept to a minimum to save on energy and costs [116].

Furthermore, Dincer et al. [41] found that ultrasound had a significant effect in reducing induction times and narrowing the MZW but had no effect on individual crystal growth rate or morphology. A rapid decrease in induction time was observed up to 0.46 W g−1 power density. Sonication up to 3 min decreased the induction time, but no further reduction was observed beyond 3 min. It was not possible to generate the nucleation rates achieved by sonication using agitation alone. 1-min sonication at 0.46 W g−1 power density followed by continuous stirring was found to be the optimum under the experimental conditions tested [41].

Fat Crystallisation

There has been considerable interest in the application of ultrasound with regards to crystallisation of fats within the food industry [47, 73, 92, 103]. The crystal size and shape of fats within a product play a significant role to the texture and mouth feel. Within the dairy industry, controlling fat crystallisation is a key factor governing the structure and texture of secondary dairy products. Examinations of pure triacylglycerides (TAG’s), confectionary fats, vegetable fats and milk fats indicate that ultrasound affects the rates of polymorph-dependant crystallisation, crystal size and morphology [47, 73, 92, 103]. Martini et al. [73] studied the use of ultrasound as an additional processing condition to alter the crystallisation behaviour of anhydrous milk fat. It was shown that ultrasound decreases induction time of crystallisation, generates smaller crystals and higher viscosities. However, the degree of super cooling and ultrasound settings influenced the degree of ultrasound effect on the crystallisation to a great extent. The studies reviewed above indicate that ultrasound affects the crystallisation behaviour of fats in many ways.

The effectiveness of this technology depends entirely on the processing conditions. However, like sonocrystallisation of lactose, it remains unclear as to the exact mechanisms responsible for these effects. More importantly, this procedure may suggest a rational way of controlling the polymorphism and nucleation rate of various fats/lipids by sonication in dairy-based systems. A primary effect of sonocrystallisation may be due to the high pressure generated when a sonication-induced cavity collapses. The different polymorphic forms have different stabilities that form specifically as a function of the supercooling temperature, mechanism and lifetime of the collapsing bubbles.

Crystallisation of fats is of considerable interest in chocolate manufacture. The crystallisation process can be controlled by means of the amplitude and frequency of the ultrasonic wave along with the exposure time, thus controlling crystal size distribution as well as the point at which crystallisation starts [95]. Cocoa butter exists in a number of polymorphic forms depending on the triglyceride composition. The polymorphic form V is considered the most desirable crystal form in well-tempered chocolate [4]. In order to avoid fat bloom on chocolates, the unstable polymorphic forms of cocoa butter are crystallised during the first cooling process of traditional tempering. In the reheating process, a melt-mediated transformation leads to formation of the stable polymorph form V. This traditional method of tempering chocolates is time-consuming and needs to be precise with respect to the temperature conditions used for obtaining the proper polymorph form and thereby obtain the correct texture and appearance of chocolates. However, in the case of sonocrystallisation (20 kHz, 100–300 W for 3 s) for cocoa butter, it has been found that the stable form is directly crystallised without the formation of subsequent unstable forms [49]. This points to the fact that ultrasound irradiation is an effective tool for controlling polymorphic crystallisation of fats and reducing induction times. Furthermore, a range of crystal structures can be controlled. This means it is possible to tune the desired texture conditions for a particular dairy product. However, the ultrasound-induced cavitation that produces free radicals appears to have restricted the use of sonocrystallisation for systems containing fats and oils, which are susceptible to oxidation by free radicals [97]. The formation of off-flavours prevents using ultrasound as a viable choice for fat crystallisation in dairy systems. On the contrary, Patrick et al. [92] reported that sonication at 66 kHz did not cause any off-flavour production due to oxidative changes to palm oil. Their results indicate that the optimum conditions for obtaining small crystals in the shortest time period are just below the cavitation intensity threshold.

Ice Crystallisation

Ice-creams are designed to be consumed in a frozen stage. The ice crystal size directly influences the texture and taste. For a desired creamy mouth feeling within the frozen product, the ice crystals are required to be as small as possible. When US is applied during the crystal growth phase, fragmentation of large crystals under acoustic stress will occur and lead to crystal size reduction. It has been shown that during freezing of ice-lollipops, the ice crystals that were formed with the application of US were significantly smaller and distributed evenly across the product. The resulting product also adhered more strongly to the supporting stick. But at the same time, smaller crystals made the product harder and difficult to bite [113].

Ice-cream contains up to 50 % by volume of entrapped air. Ultrasonic degassing can occur during the application of ultrasound, and this process can result in undesirable modifications to the ice-cream texture. Acton and Morris [3] overcame this issue by increasing the initial gas content so that the proportion of air lost due to US can be compensated. But the question that remained unanswered was how much extra air needs to be added to obtain the desired texture. Therefore, unique approaches and conditions are needed for each and every product for obtaining desired textures.

Further, high-intensity ultrasound may lead to fat oxidation. Despite the favourable effects of sonication on the crystal structure, the aforementioned effects (degassing and fat crystallisation and oxidation) can lead to off-flavours and improper textures in ice-creams. However, by keeping the cooling regime constant, it has been found that the structure of the crystallised product can be adjusted by varying the ultrasonic intensity [35]. Mortazavi and Tabatabai [81] found that application of 20-min pulsed ultrasound resulted in the best sensory flavour, texture and mouth feel evaluations of ice-creams. Flavour and texture of samples prepared with 5- and 20-min pulse time also had better mouth feel than the control. Hence, the conditions need to be carefully monitored in using ultrasound on ice-cream applications.

Inactivation of Microbes

The dairy industry has typically relied on heat to inactivate microorganisms and enzymes as a means of preservation and ensuring food safety. Thermal treatment typically inactivates enzymes, kills vegetative microorganisms, and destroys some spores but causes significant loss of the nutritional and organoleptic properties of the products. In milk systems, pasteurisation and ultra-high temperature sterilisation often result in changes to the composition and surface properties of the colloidal particles present and alters the physical properties of the milk. While some changes such as those used to improve the texture of products like yoghurt are desirable, others such as gel formation during the manufacture of Ultra-High Temperature milk are highly undesirable [23]. Hence, novel preservation technologies that maintain the nutritional value and organoleptic characteristics of milk unchanged have attracted some interest within the dairy industry. This trend towards the development of ultrasound as an alternative non-thermal technique is further driven by other advantages including reduced energy consumption, ability to target specific organisms and no requirement for the introduction of preservatives [26].

The main mechanism responsible for the ultrasonic deactivation effect is the physical forces generated by acoustic cavitation. The asymmetric collapse of a cavitation bubble leads to a liquid jet rushing through the centre of the collapsing bubble. The speed of this micro-jet can be up to a few hundred metres per second. Microorganisms that have hydrophobic surfaces will promote the collapse of cavitation bubbles on the surface and lead to severe damage of the cell wall. Similarly, micro-streaming effects can lead to the erosion of cell walls, again resulting in inactivation of the microorganisms.

In addition to the cavitation effects discussed above, the bactericidal effect of ultrasound in liquid foods is also attributed to intracellular cavitation [50], which disrupts the structural and functional components up to the point of cell lysis. The effects of localised heating, free radical production causing DNA damage and micro-streaming which causes thinning of cell membranes are crucial in the inactivation [19]. It should be noted that the resistance of different microorganisms to ultrasound varies widely, and many process treatment parameters and ultrasound variables have to be carefully selected to compensate.

When applied alone, intense ultrasound power and long contact times are required to inactivate microorganisms at ambient conditions for microbial inactivation in real food systems. Recent improvements to the use of ultrasonics for microbial inactivation recognise that the combination of more than one established technique in combination with ultrasound provides enhanced benefits. The most effective sonication approaches to inactivate microbes for industrial purposes are the combination of ultrasound with heat thermo sonication (TS), pressure manosonication (MS), or heat and pressure mano-thermo sonication (MTS). The effectiveness of microbial inactivation by these methods is dependent on the amplitude of the ultrasonic waves, exposure/contact time, volume of food being processed, the composition of the food and the treatment conditions.

Several studies have shown additive or even synergistic effects of TS, MS and MTS compared with the individual treatments alone [62, 94]. Some microorganisms, in particular bacterial spores, are more resistant under certain conditions than others, and achieving inactivation can be relatively difficult. Bacillus and Clostridium spores were found to be more resistant to heat and similarly resistant to ultrasound [38]. Raso et al. [96] investigated the inactivation of Bacillus subtilis spores by ultrasonic treatments under static pressure and a combined pressure and heat treatment conditions. They showed that MS treatment at 500 kPa and 117 μm of amplitude for 12 min inactivated ~99 % of the B. subtilis spore population.

The influence of ultrasound amplitude was also reported to be very important to the microbial inactivation. When higher amplitudes were used, higher inactivation rate was observed, which could be due to an increase in the number of bubbles undergoing cavitation per unit time [106] or to an increase in the volume of liquid in which cavitation is liable to occur [106]. While MS treatment (20 kHz, 300 kPa, 70 °C, 12 min) at 90-μm amplitude inactivated 75 % of the B. subtilis spore population, the same treatment at 150-μm amplitude inactivated 99.9 % of this population. The MS treatments at temperatures higher than 70 °C (MTS) also led to more spore inactivation. In the range 70–90 °C, the combination of heat with a MS treatment (20 kHz, 300 kPa, 117 mm, 6 min) had a synergistic effect on spore inactivation.

D’Amico et al. [37] showed that ultrasound treatment combined with mild heat (57 °C) for 18 min resulted in a 5-log reduction of L. monocytogenes in milk and a 5-log reduction in the total aerobic bacteria in raw milk. Juraga et al. [59] investigated the inactivation of Enterobacteriae in raw milk with the use of high-intensity ultrasound. Temperature (20, 40 and 60 °C), amplitude (120, 90 and 60 μm) and time (6, 9 and 12 min) were varied in the study. They found that inactivation of microorganisms using ultrasound depended on the amplitude of the ultrasonic waves, the exposure/contact time with the microorganisms and the temperature of treatment. The results indicated significant inactivation of microorganisms under longer period of treatment with ultrasonic probe particularly in combination with higher temperature and amplitude.

Table 1 highlights some literature that has investigated the influence of different parameters on microbial inactivation for different food systems. TS is considered to increase the lethality values of conventional pasteurisation and sterilisation treatments of liquid foods. Furthermore, the reduction in temperature and/or process time is expected to improve the retention of the nutritional and organoleptic quality of food [87]. Despite its benefits, the effectiveness of TS decreases with increasing temperature. The decrease in effectiveness of TS at increasing temperature is due to a reduction in the cavitation effect as a result of the increase in vapour pressure and the decrease in liquid surface tension [110]. However, this effect can be overcome by MS or MTS. In recent studies, TS has been combined with other emerging technologies for improved microbial inactivation. Some of the highlights of these studies are also presented in Table 1.

Table 1 Some literature data on microbial inactivation in milk systems with regards to different parameters and combination of techniques

Functionality Changes of Proteins in Milk

The use of ultrasound in dairy systems is increasingly being developed for a range of suitable applications, not simply due to the improved processing effectiveness as mentioned in the previous sections, but also for its ability to manufacture products with ‘tailored’ functionality, the ability to preserve food and modulate enzyme activity and the capability of improving the microstructure through component interactions [62]. Manipulating the functional properties of dairy ingredients is interesting, particularly from the point of protein functionality, although the actual mechanisms responsible for the observed effects are still unclear and continue to be investigated.

Recently, Chandrapala et al. [33] investigated the effect of shear on the solubilisation of a range of dairy powders containing casein micelles. In their study, the rate of solubilisation of less soluble milk protein concentrate and micellar casein powders was examined during ultrasonication, high-pressure homogenisation and high-shear rotor–stator mixing and was compared with low-shear overhead stirring. The high-shear techniques were able to greatly accelerate the solubilisation of these powders by physically breaking apart the powder agglomerates and accelerating the release of individual casein micelles into solution. This was achieved without affecting the structure of the solubilised proteins. Furthermore, the effect of high shear on the re-establishment of the mineral balance between the casein micelles and the serum was examined by monitoring the pH of the reconstituted skim milk powder after prior exposure to ultrasonication. Only minor differences in the re-equilibration of the pH were observed after sonication for up to 3 min, suggesting that the localised high-shear forces exerted by sonication did not significantly affect the mass transfer of minerals from within the casein micelles.

Ultrasound has also been used for many years for estimating changes in protein conformation [24, 28, 46, 53, 54]. The protein conformations may be related to functional properties of proteins in foods such as solubility, foaming capacity and flexibility. Jambrak et al. [53] showed that ultrasound with a high-intensity (20 kHz) probe had a major effect on whey protein’s functional properties such as solubility and foam ability. Their results also showed that ultrasound of 40 kHz frequency had less effect on whey protein than a 20 kHz probe. It was explained by the ultrasound mode which was applied to the milk. The 20-kHz ultrasound was applied using a horn inserted directly in solution which favours contact between the tip and the sample. On contrary, the 40-kHz ultrasound was applied using an ultrasound bath with the milk contained in flasks such that there was no direct contact with the irradiating surface.

Treatment of a protein sample with different frequencies has also shown differences in the type of functionality changes. A 40-kHz bath treatment decreased the conductivity of the protein sample, but increased the solubility and foaming ability of the proteins. The larger increases of foaming ability might be due to the homogenisation effect of ultrasound according to the authors. The homogenisation effect of ultrasound usually disperses the protein and fat particles more evenly, which may improve the foaming property. During ultrasound treatment, proteins can become partially unfolded in structure due to physical effects such as shear and temperature, which increase the foaming ability [53, 54]. In contrast, higher-frequency ultrasound of 500 kHz frequency did not impact the foaming ability of whey protein, but it affected solubility and conductivity, due to reduced physical effects at higher frequency.

Whey Proteins

Whey protein powders are commercially produced through membrane filtration, evaporation and spray-drying for application in a range of processed foods. During this process, the aqueous whey protein solutions containing significantly high levels (4–15 % by weight) of protein are subjected to heat treatment. Exposing whey proteins to temperatures ≥70 °C causes denaturation, which in turn leads to protein aggregation through both hydrophobic interactions and the formation of intermolecular disulphide bonds [112]. This can result in excessive thickening or gelling during processing of the dairy product [80]. During the manufacture of secondary dairy products, heat treatment results in further viscosity increases. Ashokkumar et al. [9, 10] and Zisu et al. [117] outlined a novel approach to overcome this problem. The application of ultrasound for a very short duration after such a heating step breaks down these aggregates and prevents their reformation on subsequent heating, thereby reducing the viscosity increase that is usually associated with this process (Fig. 3).

Fig. 3
figure 3

The effect on solutions viscosity for 6.4 % protein solution reconstituted from whey protein concentrate and sonicated with 20 kHz with a calorimetric power of 31 W: control (C), preheating (PreH), sonication (US) and post-heating (PostH) (adapted from [9, 10])

Initially, it was argued that these observed viscosity changes may have been caused by a combination of both the physical and chemical effects of acoustic cavitation. However, it was found that whey solutions sonicated at 20 kHz showed the highest viscosity reductions [9, 10] even though 20 kHz ultrasound formed the least amounts of radicals [40]. The authors therefore attributed the observed viscosity reduction primarily to the physical forces generated during acoustic cavitation. Conversely, whey protein isolate (WPI) solutions were relatively unaffected by sonication, possibly reflecting the absence of larger aggregates in the initial solution, or differences in composition. Adjustment of pH prior to ultrasound treatment did not result in significant differences compared with samples that were sonicated at neutral pH. This suggested that the mechanism for gel promotion is different from effects induced by pH changes [118]. Similarly, Kresic et al. [63] also investigated the rheological and thermophysical properties of WPC and WPI solutions subjected to sonication. According to these results, the use of ultrasound changed the flow behaviour and this was attributed to altered protein structure, namely that the hydrophilic parts of amino acids are opened towards the surrounding aqueous phase, leading to an increased binding of water molecules. There has been some concern that the sonication of proteins in solution can lead to the formation of amyloid-type fragments [105]. This formation was observed when excessively high specific energy was delivered to a small volume. However, a recent study by Chandrapala et al. [24] showed that no significant protein structural changes were observed up to 60 min of sonication, although these minor changes cannot be completely omitted from the observed functional property changes.

A more thorough understanding of the mechanism was investigated by Chandrapala et al. [29]. The three main types of main interactions within protein solutions were investigated: surface charge of the protein aggregates (indicative of electrostatic interactions), reactive thiol groups (indicative of thiol-disulphide interactions) and surface hydrophobicity (indicative of hydrophobic interactions). Interestingly, it was found that surface charge and reactive thiol groups remained unchanged with the sonication step applied in between the preheating and post-heating steps (Fig. 4a, b). However, the surface hydrophobicity of these aggregates was altered markedly (Fig. 4c). Preheating denatures the whey proteins and thereby exposes the hydrophobic groups. This was indicated by an increase in the surface hydrophobicity of these preheated aggregates. This increase in surface hydrophobicity was greatly reduced with the introduction of a sonication step. It was speculated that sonication broke down the protein aggregate networks through physical shear caused by acoustic cavitation, leading to the formation of smaller aggregates with lower surface hydrophobicity. These smaller aggregates are resistant towards further aggregation during post-heating, thereby improving heat stability.

Fig. 4
figure 4

a Surface charge, b reactive thiol content and c surface hydrophobicity of reconstituted WPC solutions after various heat and sonication treatments

Casein Proteins

Although casein micelles are considered relatively stable particles, their composition and size respond to alterations in pH, temperature and milk protein concentration [72]. It is possible that the localised high temperatures and shear forces created by sonication can physically alter the casein micelles or their interactions with other milk components. Madadlou et al. [71] found that the average size of re-assembled casein micelles could be reduced by exposure to ultrasound (35 kHz frequency) for 6 h provided the pH was above 8. However, it is unclear how this relates to native casein micelles since the re-assembled casein particles were considerably larger (275 nm) and structurally and functionally different [82]. In another study involving true casein micelles in milk, a decrease in particle size resulting from sonication was observed. This particle size decrease was attributed to a reduction in casein micelle size by the authors, although this was not substantiated [84].

A recent study by Chandrapala et al. [31] showed that sonication did not appear to affect casein micelle size or composition or permanently affect the mineral balance in fresh skim milk. Sonication did, however, reduces the size of the already small fat globules remaining in skim milk, appeared to break up whey protein aggregates, and assist in breaking apart casein micelle, non-micellar casein and whey protein aggregates present in reconstituted casein powder systems. The results showed that controlled application of ultrasonic energy can help break up large casein and whey protein aggregates, thereby influencing macroscopic properties such as viscosity, without inducing changes to the casein micelles or mineral balance. Similar results were obtained by Shanmugam et al. [99] when using up to 30 min of sonication, but prolonged sonication resulted in the partial disruption of some whey proteins from the whey–whey aggregates. In contrast, Liu et al. [65] found that casein micelles are disrupted at high pH values with sonication. In their study, reconstituted skim milk at pHs 6.7–8 was sonicated at a specific energy input of 286 kJ/kg using 20 kHz. According to this study, ultrasound caused greater disruption of casein micelles, causing release of proteins from the micellar to the serum phase at high pHs. Furthermore, they showed that the released proteins re-associated to form aggregates of smaller size but with surface charge similar to that of casein micelles in the original milk. However, in other studies, the dissociation of κ-casein was found to be dominant with increase in pH, which can result in an increase to the total protein content in the supernatants [6]. They hypothesised that it is due to a pH-dependent conversion of the native colloidal calcium phosphate (CCP) to an alternative form of calcium phosphate, which is less capable of maintaining the micellar integrity, particularly at higher pH values where the charges of the proteins are greater. Hence, it is arguable as to whether the effects observed by Liu et al. [65] under high pH conditions are just an ultrasound effect or a pH effect and/or a combination of both.

In another study by the same group [66] observed the renneting properties of the aforementioned systems. It was shown that gelation attributes were significantly modified (i.e., faster gelation) in rennet gels made from milk sonicated at pH 8.0 and re-adjusted back to pH 6.7 compared with those made from milk sonicated at pH 6.7. The renneting properties were also modified (i.e., firmer gels) in milk sonicated at pH 6.7 compared with those of non-sonicated control milk. The modified renneting behaviour was attributed to ultrasound-induced changes to the proteins in milk [66].

Chandrapala et al. [30, 32] looked at the phosphate-induced micellar casein gelation and the influence of sonication (20 kHz) to this process. Gels were formed by the addition of 7.6 mM tetra sodium pyro phosphate (TSPP) to 5 wt% micellar casein (MC) solutions. It was shown that sonication at 20 kHz and 31 W for up to 30 min changed the surface hydrophobicity of the proteins, whereas surface charge was unaltered. Sonication before the addition of TSPP formed a firm gel with a fine protein network and low syneresis. Conversely, sonication after TSPP addition led to an inconsistent weak-gel-like structure with high syneresis. Gel strength in both cases increased significantly after short sonication times, while the viscoelastic properties were less affected. Overall, the results showed that sonication can have a significant effect on gelation of micellar casein systems, but the state of the casein micelle prior sonication is a dominant factor and should be carefully controlled. Hence, the effects observed in the study by Liu et al. [65] can be confirmed as a combined effect of pH and ultrasound rather than just simply ultrasound being responsible.

High-power, low-frequency ultrasound (20 kHz) has potential industrial application for lowering the viscosity of concentrated milk prior to spray-drying. A recent study by Zisu et al. [119] investigated the high-intensity low-frequency ultrasound on concentrated skim milk to lower viscosity through the process of acoustic cavitation. Batch sonication for 1 min at 40–80 W and continuous treatment delivering an applied energy density of 4–7 J mL−1 reduced the viscosity of medium-heat skim milk concentrates containing 50–60 % solids. Viscosity was reduced by approximately 10 %, but this has improved to 17 % in highly viscous age-thickened material. Sonication also showed changes in the shear thinning behaviour at shear rates below 150 s−1. Although ultrasound lowered the viscosity of skim milk concentrated to 50 % solids, the treatment could only delay the rate of thickening once the ageing process was established. It was only when ultrasound was activated during concentration that sonication prevented the viscosity of skim milk concentrates from increasing rapidly.

Of the work to date on casein containing systems, a considerable focus has been given to understanding the effects of ultrasound on bulk physical and functional properties. The effect of sonication on milk gels has been reported [30, 32, 84, 97, 98, 109]. Acid gel firmness (G′) was found to be altered when skim milk was ultrasonically treated prior to acidification, although the effect was attributed largely to denaturation of whey protein caused simply by temperature increases (up to about 95 °C) resulting from sonication performed without temperature control [84]. Wu et al. [114] reported that high-intensity ultrasound (90, 225 and 450 W, 20 kHz) not only effectively homogenised milk, but also significantly improved the viscosity and water holding capacity and reduced syneresis of yoghurt produced from sonicated milk. These effects are directly related to the yoghurt structure, which is based on strings or clusters of casein micelles interacting physically with each other and with denatured serum proteins entrapping serum and fat globules. Furthermore, ultrasound could cause some qualitative changes in the fat globule membrane, which would modify the ability of fat globules to interact with themselves and/or casein micelles, thereby improving the gelling properties. However, direct effects of ultrasound on casein micelles or individual caseins cannot be ruled out completely, although negligible effects were found on the individual caseins/casein micelles [31]. This may be because big multimeric protein complexes are more sensitive to the shearing forces created by micro-streaming and bubble implosion than single dissolved monomeric proteins.

Yoghurt

In yoghurt, a key aspect of product quality is associated with the physical properties of the gel. Optimal gel structures will yield yoghurt with a smooth textural character in mouth during consumption and display low serum separation during storage (i.e., syneresis). Vercet et al. [109] studied the use of MTS to obtain tailored functional properties of yoghurt products. The application of ultrasound allowed yoghurts with superior rheological properties such as flow curves, apparent viscosity, yield stress and viscoelastic properties to those of control yoghurts prepared with non-sonicated milk. The authors further showed that MTS yoghurts had more rigid structures, which resulted in higher values of almost all of the many relevant rheological parameters. They suggested that ultrasound effects are mainly related to the cavitation phenomenon. As a result of the cavitation conditions, water molecules can be homolyzed, generating highly reactive free radicals that can react with and modify several molecules. Mechanical stress generated either by shock waves derived from bubble implosion or from micro-streaming derived from bubble oscillations is also able to disrupt large macromolecules or particles. However, the cause of this behaviour was not ascertained. The authors suggested that alterations to the size of the fat globules were not responsible, although a proper comparison of the fat globules was not performed.

Reiner et al. [97] also found that compared with conventional yoghurts, yoghurts from (TS) milk had higher gelation pH values, greater viscosities and higher water holding capacities. The authors further stated that the gel network structure was different; it showed a honeycomb-like network and exhibited a more porous nature. Similarly, Reiner et al. [98] found superior rheological properties of yoghurts prepared from ultrasonicated milk than the yoghurts prepared from conventionally heated milk. Further, Bermudez-Aguirre et al. [18] found only minor changes to the nutritional properties of milk after ultrasound, with the advantage of extending the shelf-life of the product for more than 16 days at 4 °C without the use of intensive heat treatments. These tailored functional properties are desirable in the end products. With the added advantage of minimal nutritional loss compared with thermal treatments but still retaining long stable shelf-lives, the use of ultrasound technology in dairy streams shows promise.

Ultrasonic Separation of Fat

Milk fat separation is essential to the production of dairy products such as butter, cheese, yoghurt and skim milk. Typically performed in large-scale manufacture by centrifugation, high-frequency ultrasound (>400 kHz) has recently been reported as an alternative separation technology that can enhance the rate of milk creaming by ‘natural’ gravitational sedimentation [69, 70].

When an ultrasonic standing wave is set up in a container, the fat globules distributed in the milk experience what are known as acoustic radiation forces that cause them to migrate specifically to the pressure anti-nodes such that ‘banding’ of the fat globules can be observed. This was observed by Miles et al. [79] in a small cuvette container. The milk fat globules have an increased probability to collide together to form collections with a larger effective diameter when disposed into these regions. These larger entities then rise more rapidly to the surface of the container according to Stokes’ Law [64], resulting in faster creaming. The acoustic and buoyancy forces influencing the separation are dependent on the size of the fat globules [68]. The acoustic forces can be manipulated by adjusting the applied frequency and energy density.

Ultrasonically enhanced fat separation in milk has been demonstrated in batch systems ranging in scale from millilitre to litre. Juliano et al. [56] demonstrated the concept of ultrasonic separation for natural whole milk in small test tubes of 7-mL volume and was later successful in scaling this process up to 6 L using a recombined milk emulsion [57]. Leong et al. [69] established suitable ultrasonic parameters that enable successful separation of natural whole milk in large scale, demonstrating the importance of energy density and effectiveness of high-frequency ultrasound (1 and 2 MHz) to separate the small fat globules distributed in milk (~4 µm diameter). Further studies by the same group have demonstrated that operation at moderate temperatures between 25 and 40 °C are more optimal to the fat separation process due to the influence of temperature on the physical properties of the fat globules such as density, viscosity and liquid/solid ratio [70].

One possible concern when using high-frequency ultrasound for the separation of milk fat is the potential for oxidation of fat to occur (i.e., lipolysis). Efforts should be made to limit this as excessive oxidation of fats can lead to rancid off-flavours in the final product. It is noted that peak sonochemical effects have been reported to occur in the frequency range between 400 and 800 kHz, which incidentally is in the range of frequencies reportedly suitable for ultrasonic fat separation. However, Juliano et al. [58] has shown that when sonicating milk within this frequency range, oxidative volatiles derived from sonication was detectable above human sensory threshold limits only when very high specific energies were delivered to the milk. In contrast, Torkamani et al. [108] showed no significant oxidation of fat with sonication of cheddar cheese whey at similar frequencies and energy densities.

Conclusion

Ultrasound is a promising technology suitable for a range of different applications in the dairy industry. In liquid media, the extreme physical forces generated by low-frequency, high-intensity ultrasound induce acoustic streaming, cavitation, shear, micro-jet and shockwaves. These physical forces have been used successfully for the generation of dairy emulsions, functionality improvements of dairy systems, inactivation of microbes, crystallisation of lactose and fat in dairy systems, among several other applications. High-frequency ultrasound on the other hand has been used to initiate rapid creaming of fat from milk. Ultrasound processing has advantages of minimising flavour loss, increasing homogeneity, reducing energy requirements, reducing processing times, enhancing end-product quality, reducing chemical and physical hazards and lowering the environmental impact, when compared to conventional dairy processes. Synergies with pressure and/or temperature have been reported, but caution is advised to minimise nutritional losses and adverse flavour modifications if very high specific energies are to be delivered to the process. Although there are a range of advantages from adopting ultrasonic technology for the dairy industry, more research is needed to improve designs to enable efficient large-scale operations.