1 Introduction

In the month of August 2018, Kerala, a southern state in the west coast of India, witnessed extremely heavy rainfall. This event led to 42% above normal rainfall for the duration of 1–19 August (i.e., 2346.6 mm rainfall received against the normal of 1649.5 mm), and the maximum rainfall was experienced from 13 to 17 August over Kerala (India Meteorological Department (IMD) Report 2018). This Heavy Rainfall Event (HRE) resulted in massive floods, displacing more than a million people from their homes, damaging roads and bridges, causing about 480 deaths, and leading to an estimated loss worth approximately $3 billion USD. The spatial distribution of district-wise rainfall shows that the highest excess rainfall occurred over Idukki (92% above normal) followed by Palakkad (72% above normal). These types of extreme weather events producing heavy rainfall and floods are the major natural hazards in many parts of the world, including India (Ahern et al. 2005). During the active monsoon phase, the movement of synoptic-scale systems many times causes HREs over the east and west coast of India and contributes substantially to Indian seasonal rainfall (Dodla and Ratna 2010; Pattanaik and Rajeevan 2009). Usually, orographic convection plays an important role in the occurrence of the HREs in the hilly areas of the west coast and parts of north-eastern India (Saha 1974; Vellore et al. 2014, 2016, 2019), leading to flash floods (Kumar et al. 2012). Accurate and quantitative forecasts of HRE (daily rainfall amounts ≥ 120 mm as per IMD) with adequate lead time over areas of a few thousand square kilometers are needed by disaster managers and administrators to minimize the loss of lives and properties (Hally et al. 2014).

1.1 Retrospective studies on HREs over the Indian region

In general, exclusively with reference to HRE, only a few studies have been carried out over the Indian region. In a recent study, Baisya and Pattnaik (2019) inferred that the HRE over Kerala in 2018 was caused by a continuous supply of moisture to the Western Ghats by an anomalous moisture channel along with the advection of moisture towards the southern peninsula by a monsoon depression. They also identified the presence of a positive quasi bi-weekly oscillation, along with an intra-seasonal oscillation over the region, to play an important role in the shedding of the towers of moisture flux convergence (MFC) by the depression. Another study by Viswanadhapalli et al. (2019) identified the strong westerly jet along with the formation of an offshore vortex, the transport of mid‐tropospheric moisture under the presence of conducive vertical shear of horizontal wind, and the transport of mid‐tropospheric moisture from the BOB to be the major factors responsible for the HRE of 2018 in Kerala. The sensitivity of CMP schemes for the HRE over Chennai in 2015 was investigated by Reshmi Mohan et al. (2018). Their results suggest that Thompson et al. (2008) and Morrison et al. (2005) schemes were able to identify features such as the location of maximum rainfall, its spatial distribution, and time of occurrence in close agreement with the observations by realistically predicting the high magnitude of vertical velocity associated with high instability, strong mid-level convergence and upper air divergence associated with strong cyclonic vorticity. Another study by Srinivas et al. (2018), identified that the veering wind with height associated with strong wind shear in the layer 800–400 hPa and the dry air advection are the main initiators of the development of instability and initiation of convection. Based on the results obtained from the simulation of the 2013 HRE in Uttarakhand with High-Resolution Land Data Assimilation (HRLDAS) modelling system, Rajesh et al. (2017) concluded that better representation of different land surface fields such as soil moisture, soil temperature etc. consequently leads to improved prediction of the anomalous extreme weather events over the Indian region. In another study, Agnihotri and Dimri (2015), using Weather Research and Forecasting (WRF) model simulations, found that the presence of a north–south trough, an easterly trough and low pressure areas over the surrounding Indian seas during the pre-monsoon season are the major precursors for the HREs caused by widespread thunderstorm activities over the southern peninsular India.

1.2 Studies of HREs in the global context

A study on the Sierra Nevada spillover precipitation of 1997 and 2006 by Kaplan et al. (2012) identified a mesoscale circulation with rising motion in the leeward side to be the major factor for the HREs and concluded that the investigation of dynamical interactions on both synoptic and sub-synoptic scales is essential for better understanding of this kind of HREs. Numerical experiments carried out over the Red Sea by Dasari et al. (2014) indicated that the model-produced rainfall is strongly modulated by the choice of microphysical schemes, and it was noted that Lin and Thompson scheme produces the best result in that scenario. In another study, Ault et al. (2011) suggested that Asian dust may have an impact on atmospheric rivers and speculated that Thompson Aerosol Aware (TAA) scheme might simulate more accurate precipitation for simulations for HREs over the west coast of the United States of America. However, Tan (1997) found that TAA had less accuracy in the rainfall prediction than the Thompson scheme in the simulation of the California New Year’s flood event in 1997.

1.3 Impacts of cloud microphysical parameterization

With the evolution of Numerical Weather Prediction (NWP) and Climate models, there has been immense progress in the understanding of cloud microphysical processes, and subsequently, many cloud microphysical parameterization (CMP) schemes have been developed. Water vapour, cloud droplets, cloud ice crystals, snow, rimed ice, graupel, and hail are the primary microphysical species whose budgets depend on the atmospheric dynamical and thermodynamical conditions which determine the partitioning of the hydrometeors themselves (Huang and Wang 2017). The NWP models predict rainfall using convective and microphysics parameterization for the representation of the clouds and precipitation processes (Powers et al. 2017). These parameterizations play an important role in prediction of the precipitation, hydrometeor distributions, and other associated variables. At higher resolution (1–3 km), precipitation can be explicitly resolved with better representation of mesoscale features through a cloud microphysics scheme without cumulus parameterization (Ghosh et al. 2016). In a CMP sensitivity study using the MM5 model, Grubišić et al. (2005) showed that the Resiner2 mixed-phase scheme simulated the event with the highest accuracy among other schemes, and also the quantitative precipitation forecast skill was higher on the windward side than the leeward side of the mountain. Rajeevan et al. (2010) suggested that with an enhanced horizontal resolution of the models, the cloud microphysical processes play an important role by directly influencing the cold pool strength by modulating evaporation of rainfall and condensational heating in HREs over Southeast India. According to McCumber et al. (2010), the inclusion of mixed-ice phase cloud microphysics in the cloud model significantly contributes to the betterment of the output in convective simulations.

1.4 Importance of moisture flux convergence

In addition to cloud microphysical processes/parameters, synoptic scale features such as Moisture Flux Convergence (MFC) plays an important role in the evolution, sustenance of HREs. The prominent relationship between surface horizontal mass convergence and convective storms is well established (Breiland 1958; Matsumoto 1967; Peslen 1980) and many studies have quantified the role of surface horizontal mass convergence to storm initiation through real-time forecasting experiments and field projects (Wilson and Schreiber 1986; Wilson and Mueller 1993; Wilson and Megenhardt 1997).

1.5 Objective

This primary objective of this study is to investigate the effects of CMPs in the WRF model on the HRE of Kerala in August 2018. This study validates the model-simulated results, especially rainfall, with in-situ observations for better understanding of location-specific impacts of the HRE and also, quantifies the contributions of the different components of the Mean Squared Error (MSE), i.e., pattern, displacement, and volumetric error, over the study region by means of a spatial verification method, such as Contiguous Rain Area (CRA) (Ebert and McBride 2000). However, in the present study, not only the relative sensitivities of various CMPs are critically analyzed for the HRE of Kerala (2018) using the WRF model, but also the contribution and interactions of large scale factors such as MFC and orographic influence are investigated. The paper is organized in four sections. The experiment design is given in Sect. 2, followed by the discussion of results and conclusions in Sects. 3 and 4, respectively.

2 Model and data description

2.1 Experiment design

The simulations of the HRE of Kerala have been executed using the state-of-the-art Advanced Research Weather Research and Forecasting (WRF-ARW) Model version 3.9 (Skamarock et al. 2008) developed by the collaborative efforts of the National Center for Atmospheric Research (NCAR), National Oceanic and Atmospheric Administration (NOAA) [represented by National Center for Environmental Prediction (NCEP) and the (then) Forecast System Laboratory (FSL)], University of Oklahoma and other US Governmental agencies. It is a fully compressible, non-hydrostatic (with a run-time hydrostatic option) and a next-generation mesoscale numerical weather prediction model used for both research and operational forecasting purposes. The model was configured with two interactive nested domains with 35 vertical levels with the model top at 50 hPa. To incorporate the effects of boundary layer processes properly, high resolution (interval of 25 hPa) of vertical levels is employed between the surface (1000 hPa) to 850 hPa pressure level. The outer domain was configured with 9 kmresolution, and the inner domain had a horizontal resolution of 3 km (Fig. 1). As the objective is to investigate the rainfall characteristics of 13–15 August, the simulations were initialized at 00 UTC 12 August 2018, and the model was integrated up to 96 h, i.e., till 00 UTC 16 August 2018. The initial and boundary conditions for the simulations were obtained from NCEP Global Data Assimilation System (GDAS)/Final (FNL) 0.25° × 0.25° Global Tropospheric Analyses (NCEP 2015), and the boundary conditions were updated every 6 h with the GDAS forecasts. Also, Real-Time Global (RTG) (www.polar.ncep.noaa.gov) data have been used to update the sea surface temperature (SST) every 6 h throughout the course of the simulation. The physics options included the RRTM scheme (Mlawer et al. 1997) for longwave radiation and Dudhia scheme (Dudhia 1989) for shortwave radiation, Noah scheme (Niu et al. 2011) for land surface processes, Monin–Obukhov Similarity scheme (Monin and Obukhov 1954) for surface heat and moisture fluxes, MRF scheme (Hong et al. 1996) for planetary boundary layer processes. The Kain-Fritsch (new eta) scheme (John 2004) was used for cumulus convection for the outer domain with 9 kmresolution, and in the inner domain, only microphysical parameterization was used, but no cumulus convective parameterization. Keeping other physics options same, four bulk CMP schemes, namely the WRF single-moment 6-class (WSM6) (Lim and Hong 2009), WRF double-moment 6-class (WDM6) (Hong et al. 2010), TAA scheme (Thompson and Eidhammer 2014), and Milbrandt (MIL) scheme (Milbrandt and Yau 2005) are considered (Table 1). The features and advantages of each of the CMP schemes taken into consideration for this study have been briefly discussed in Table 2. The model output is further post-processed with the Development Testbed Center Unified Post Processor (DTC-UPP) for analysis.

Fig. 1
figure 1

The domains selected for the simulation, the above offset indicates Domain 2 (inner domain) and the below offset indicates the analysis polygon in the inner domain. The elevation is given in m

Table 1 WRF-ARW model configuration
Table 2 Table with the features and basic advantages of the CMPs used in this study

2.2 Descriptions of the datasets

  1. A.

    ERA5 (Hersbach et al. 2018) The European Centre for Medium-Range Weather Forecast (ECMWF) Reanalysis 5th Generation dataset (ERA5) hourly data with a 0.25° spatial resolution has been used for hydrometeor validation. The values of the variables cloud liquid water; rain and ice have been employed for comparing the performance of the model. (url: https://cds.climate.copernicus.eu/cdsapp#!/dataset/reanalysis-era5-pressure-levels)

  2. B.

    TRMM 3B42 (Huffman et al. 2007) The Tropical Rainfall Measuring Mission (TRMM) is a joint venture of the National Aeronautics and Space Administration (NASA) and Japan Aerospace Exploration Agency (JAXA) which provides 3 hourly gridded rainfall data (3B42). The TRMM 3B42 precipitation data of 3-h temporal resolution and 0.25° spatial resolution has been used to find the peaks of rain rate, which were then compared to the peaks of moisture flux convergence produced by the model. (https://gpm.nasa.gov/data-access/downloads/trmm)

  3. C.

    GPM (Huffman et al. 2014) The Global Precipitation Measurement (GPM) final run merged daily rainfall data with 1° spatial resolution has been used to validate the quantity and areal span of the daily accumulated rainfall as produced by the simulations. This dataset has also been used for calculating the Equitable Threat Score of rainfall as simulated by the model. This data was also used for CRA Analysis. (url: https://gpm.nasa.gov/data-access/downloads/gpm)

  4. D.

    IMD Station data Data from India Meteorological Department (IMD) stations over Kerala are used for point validation of the model produced rainfall. Out of 88 stations, 31 stations are taken into consideration in this study.

3 Results and discussion

The model output analysis is carried out for the duration i.e. 12–15 August 2018 (day-1 to day-4 model forecasts). This section has been divided into six sub-sections illustrating the IMD station data validation, spatial distribution of rainfall predicted by different schemes and their comparison to the Global Precipitation Measurement (GPM) observations and the CRA analysis, reflectivity and hydrometeor analysis using Contour Frequency Altitude Diagram (CFAD), analysis of vertical velocity and temperature profiles, analysis of the contributions of the different terms of the moisture budget equation and the influence of orography, respectively.

3.1 Station validation

Based on the threshold amount of rainfall (3 days accumulated rainfall > 100 mm), model-derived 24 h rainfall is validated against 31 IMD stations (out of 87) for day-2 to day-4 forecasts. In addition, model-simulated three days accumulated rainfall is also validated against IMD station observations. Furthermore, the maximum rainfall was observed in the districts of Idukki and Palakkad (IMD Report 2018). Therefore, 4 stations from the district of Idukki, viz., Idukki, Thodupuzha, Munnar and Peermade, and 3 stations from the district of Palakkad, viz., Alathur, Parumbikulam, and Pattembi are considered for validation among these 31 stations. The IMD stations in the state of Kerala are shown in Fig. 2.

Fig. 2
figure 2

Source: Daily Weather Report, IMD Thiruvananthapuram. https://www.imdtvm.gov.in/index.php?option=com_content&task=view&id=21&Itemid=35

IMD stations situated in Kerala.

To investigate and analyze the error of the model simulated rainfall with IMD station observations, the Taylor diagram (Taylor 2001) is prepared (Fig. 3). In this diagram, the radial distance from the origin gives the value of standard deviation and the azimuthal position of model computed rainfall gives the correlation coefficient (CC). Also, the distance between the observation and the model computed rainfall gives the root mean square error (RMSE). It is evident from Fig. 3 WDM6 (Fig. 3c) simulated the rainfall with minimal RMSE and maximum CC for the maximum number of stations, followed by WSM6 (Fig. 3d).

Fig. 3
figure 3

a–d Taylor Diagram for station data validation (31 stations chosen on the basis of 3AR > 100) for all the schemes with respect to IMD stations for 3AR

The day-wise rainfall from day-2 to day-4 and also the 3 days accumulated rainfall from 13 to 15th August (3AR), as predicted by WDM6 with the 31 IMD station observations are given in Table 3 (the tables for MIL, TAA and WSM6 are not given). All the schemes overestimated the rainfall at most of the stations (Trivandram Aero, Aryankavu, Alathur, Chengannur, Haripad, etc.) on day-3 (14 August 2018), i.e., the day of maximum rainfall. But the 3AR was underestimated at some of the stations (Irikkur, Idukki, Kochi, Kannur, Quilandi, Kodungallur, Vadakkanchrry, Vallanikkara etc.). In general, it is noted that model experiments were able to better capture the rainfall at day-3 compared to day-4. The increase in errors for day-4 (leading to an increase in 3AR forecast errors) is due to the underestimation in the rainfall forecast. For specific discussion, three stations are chosen based on their geographical locations in the Kerala state i.e., Kannur (North), Kochi (Central), and Thiruvananthapuram (South) (see Fig. 2 for specific locations of the stations). At Kannur, all the schemes have underestimated the 3AR compared to IMD. However, over the same location, day-3 (24 h accumulated) rainfall was well forecasted by the different schemes considered in this study, but with an underestimation tendency in magnitude as compared to observations. The differences were within the range of 14–30 mm, with the highest (lowest) values 29.13 mm (14.105 mm) is noted for WDM6 (WSM6). Similarly, at Kochi, huge underestimation (> 150 mm) of 3AR is noted for all schemes with TAA and MIL predicting the least amount of rainfalls with the magnitude of differences of 177.47 mm and 166.3 mm, respectively. For the same station, the least errors are noted for WDM6 (WSM6) with 76.415 mm (95.415 mm). However, a completely different picture was seen at Thiruvananthapuram, where all the schemes except MIL underestimated the 3AR (difference of 31.24 mm). But for day-3, all the schemes overestimated the rainfall, with WDM6 producing the least error (5.655 mm) and TAA having the largest error (61.465 mm). In the district of Idukki, where the heaviest rainfall was observed, at the stations Idukki and Peermade, the 3AR was underestimated by all the schemes with MIL and TAA showing the largest error. But on day-3, all the schemes forecasted the rainfall close to that of IMD. WDM6 yielded the best performance with the estimation of day-3 and day-4 rainfall differences to be the least by only 1.685 and 5.805 mm, respectively, at Munnar. TAA produced a large error in this district. Again, in the district of Palakkad (second highest rainfall), 3AR was best predicted by WDM6 and worst by the TAA. Furthermore, WDM6 again produced the least 3AR error differences (< 50 mm) for stations such as Parumbikulam, Pattembi (Palakkad district) (Table 3). In general, it is noted that 3AR was best predicted by WDM6 consistently with the least errors (< 100 mm) at 22 stations followed by WSM6 with the least errors (< 100 mm) at 16 stations. The MIL and TAA are the worst-performing schemes with 11 stations having differences more than 150 mm. The percentage error in the prediction of 3AR for all the schemes is shown in Table 4. It is seen from Table 4 that out of the 31 stations taken into consideration, the 3AR was predicted with less than 25% error by WDM6 at 12 stations (highest) and TAA at 5 stations (lowest).

Table 3 The day wise account of the accumulated rainfall (day-2, 3, and 4) as predicted by WDM6 along with the IMD observations (31 stations over Kerala)
Table 4 Percentage error of model-produced 3AR with respect to the IMD observations at the different stations (+ ve and − ve error signifies overestimation and underestimation of rainfall by the model, respectively)

3.2 Rainfall

Figure 4a–t shows the spatial distribution of cumulative rainfall received on all the days from model experiments and observation (GPM). As there are no rainfall datasets available at a high resolution of 3 km (model resolution), the GPM Final Run dataset, which has a spatial resolution of 25 km, is used for spatial rainfall validation. Therefore, the GPM dataset is remapped (rainfall is conserved in the remapping) to the model grid resolution for validating the model-produced rainfall with the aim to capture the signatures of finer scale variabilities of rainfall seen at model resolution. Though the spatial extent of rainfall from north to south Kerala was captured by all the CMP schemes, the intense rainfall pockets were captured at different locations by the different CMP schemes. WDM6 and WSM6 are able to locate the pocket of intense rainfall zones in agreement with the observations on day-3 and day-4 (Fig. 4m and n). On day-3 and day-4, both the MIL (Fig. 4k and p) and TAA (Fig. 4l and q) have shown the rain systems to be dominant in the southern portion of Kerala more than the northern part, which does not agree with the GPM (Fig. 4o and t). WSM6 was able to capture the spread of rainfall over the adjacent coastal seas better than the other CMP schemes on day-3 (Fig. 4n). This finding is in agreement with the study over the East China Sea by Hong et al. (2010). Though WDM6 has predicted heavy rainfall over the land area splendidly on both day-3 and day-4, it was unable to capture the rainfall over the adjacent sea.

Fig. 4
figure 4

a–t Daily accumulated rainfall (mm) for all the schemes for day-1 to day-4

The zone of heavy rainfall over the state of Kerala is identified from Fig. 4. Hence, a polygon (shown in Fig. 1) over the desired area has been taken and further analyses of Equitable Threat Score (ETS) and Heidke Skill Score (HSS), hydrometeors, reflectivity, vertical velocity, latent heat, and moisture convergence have been carried out by masking the grid-points not belonging to this polygon, over the whole domain. This polygon will be termed as the analysis polygon from now on.

ETS and HSS are useful for categorical statistical validation of rainfall (Wilks 2011). To assess the performances of CMP experiments, the ETS and HSS have been calculated for all the four days (Fig. 5) over the analysis polygon. WSM6 shows the highest skill on day-1 (Fig. 5e). On day-2, the skills shown by WSM6 and WDM6 both are considerable (Fig. 5f). WDM6 shows the highest skills (both ETS and HSS) on day-3 and day-4 for almost all the thresholds of rainfall (Fig. 5g and h). MIL and TAA have consistently had poor skill scores (i.e., ETS and HSS) throughout the forecast duration. The same pattern of scores is also noted for HSS as well (Fig. 5a–d).

Fig. 5
figure 5

Day-wise HSS (ad) and ETS (eh) of rainfall of CMPs in comparison with GPM, calculated over the analysis polygon (Fig. 1)

3.2.1 Contiguous Rain Area (CRA) Analysis

The main objective of the CRA method is to evaluate systematic errors in the prediction of rain systems (Ebert and McBride 2000; Grams et al. 2006). In this spatial verification method, the forecast and observational fields are matched to be on the same spatial grid. They are merged by overlaying the forecast on the observations and taking the maximum value at each grid point (Ebert and Gallus 2009). Thus forecast and observed entities overlapping with each other can now be taken into account in a quantitative perspective in the merged field. After this, by setting up some minimum intensity threshold, an entity finder is applied to isolate distinct CRAs in the merged field.

The mean squared error (MSE) of the original forecast is decomposed in to the displacement (location), volume, and pattern error components as the following:

$${\mathrm{MSE}}_{\mathrm{total}}={\mathrm{MSE}}_{\mathrm{displacement}}+{\mathrm{MSE}}_{\mathrm{pattern}}+{\mathrm{MSE}}_{\mathrm{volume}}$$
(1)

The original decomposition, used with the minimum squared error best-fit criterion, computes the location component as the difference in the mean squared error before and after shifting the forecast, the volume error as the bias in mean intensity, and the pattern error as a residual:

$${\mathrm{MSE}}_{\mathrm{displacement}}={\mathrm{MSE}}_{\mathrm{total}}-{\mathrm{MSE}}_{\mathrm{shifted}}$$
(1a)
$${\mathrm{MSE}}_{\mathrm{volume}}={\left(\stackrel{-}{F}-\stackrel{-}{X}\right)}^{2}$$
(1b)
$${\mathrm{MSE}}_{\mathrm{pattern}}={\mathrm{MSE}}_{\mathrm{shifted}}-{\mathrm{MSE}}_{\mathrm{volume}}$$
(1c)

where \(\stackrel{-}{F}\) and \(\stackrel{-}{X}\) are the mean forecast and observed values after the shift.

The CRA analysis has been carried out for 72 h (day-2 to day-4 of forecast hours) with the model data generated by the four CMP schemes, and for observations, GPM has been considered. The threshold for the analysis has been taken as 150 mm as the rainfall event considered for this study was an HRE. The total error has been decomposed into three components, viz. displacement error, pattern error, and volume error. The number of grid points having 150 mm or more rainfall was identified differently by the different schemes. According to the GPM observations, 710 points had 150 mm or more rainfall, and the maximum number of grid points with this criterion was identified by the WSM6 (5513), followed by TAA (5213), WDM6 (4405), and MIL (2556). The magnitude of mean (maximum) rainfall in the grid points were underestimated (overestimated) by all the schemes. These results indicate that maximum rainfall zones have less spread characteristics across these experiments.

In Table 5, we can see that MIL captured two features of rainfall. The displacements of the features were slightly to the northeast and north for the two features, respectively, compared to that of the GPM. The first feature exhibited higher displacement error and lower volume error than the second feature, and the pattern error is similar for both these features. At the same time, all the other schemes are able to capture only one CRA. TAA showed second-highest amount of Mean Square Error (MSE). The CRAs captured by the TAA and WSM6 (Fig. 6b and d) have similar characteristics to another small feature detected by the GPM observations just north of 12° N, but they could not capture the actual feature in accordance with the major feature identified by GPM. WSM6 showed the highest amount of error among the four schemes. The feature identified by WDM6 was the closest to the dominant feature identified (marked green in Fig. 6c) by the GPM (marked green in Fig. 6e) and had the least volumetric error. However, both the features identified by MIL have a huge pattern error. In the aspect of location features, MIL identified a better CRA, but volumetrically, WDM6 was the best scheme in determining the CRA. It is also to be noted that the GPM identified three features, but the third feature (indicated by yellow in the plot) could not be identified by any of the CMP schemes (Fig. 6e). In all the cases, pattern error appeared to be the major contributor to the total MSE which in accordance with the findings of Chen et al. (2018).

Table 5 Error decomposition for the CRAs for all the CMP schemes
Fig. 6
figure 6

a–e CRAs predicted by different schemes along with the GPM for the total 3 days (day-2 to day-4) accumulated rainfall. The three different colours represent the three different features of rainfall obtained from GPM and the features identified by the different CMP schemes are coloured based on the similarity of those features with respect to the respective feature of GPM. The three error components for the features identified by the different CMP schemes are calculated with respect to the respective GPM feature

3.3 Hydrometeors and reflectivity

The time series of spatially averaged cloud ice, cloud liquid water, and rainwater for day-2 to day-4 over the analysis polygon (Fig. 1) for all the CMP schemes are shown in Fig. 7. For validation, the results obtained from the simulations with the four different CMP schemes are compared with the European Centre for Medium-Range Weather Forecast (ECMWF) Reanalysis 5th Generation (ERA5) data. It is evident that both MIL and TAA are not able to capture the evolution of the cloud ice at higher levels, but MIL captured the cloud ice better than the TAA with the maximum value of 6 × 10–6 kg kg−1 at 200 hPa level on the beginning of day-4. The lack of cloud ice in the case of the MIL in spite of the prominent presence of cloud liquid water till higher levels might be justified by the presence of supercooled water in the mid-levels. Both the WDM6 and WSM6 were able to capture the evolution of the cloud ice from day-2 and show a similar structure of cloud ice with a maximum value of 40 × 10–6 kg kg−1 at 300–400 hPa level from halfway of day-3 (Fig. 7c and d). The most important thing is that these two schemes were able to predict the high concentration of the ice particles up to the middle troposphere. This pattern is in agreement with ERA5, where the upper-level cloud ice is prominent, and the extent of the cloud ice content is up to the middle atmosphere where the contours of the cloud ice overlap the contours of the cloud liquid water (Fig. 7e). The closely packed contours of cloud liquid water ranging from 15 to 60 × 10–6 kg kg−1 as shown by the ERA5, were also found in the WDM6 and WSM6. The cloud liquid water and cloud ice content distribution is also similarly predicted by the WDM6 and WSM6. The WDM6 shows the highest peak of rainwater on day-3, which is well captured in terms of maximum rainfall simulated by the model. Thereafter, the magnitudes of rainwater decrease but remain significant till the end of day-4. This suggests that a better representation of liquid hydrometeors is noted in WDM6 with respect to the other schemes. This is to mention that rainwater was not at all captured by the MIL (Fig. 7a). Furthermore, WSM6 and TAA have shown a massive decrease in rainwater content on day-4, which does not agree with the ERA5. On day-4, the maximum rainwater content was shown by ERA5 (Fig. 7e), which is not shown by any of the schemes; however, WDM6 shows the nearest values on day 4 compared to the others. The model-simulated cloud snow and graupel distribution are also compared with ERA5 data. As the graupel data is not available from ERA5, only the cloud snow distribution is taken into account for ERA5. It is seen that WDM6 and WSM6 simulated almost similar cloud snow and graupel structure, whereas TAA and MIL were unable to predict these frozen hydrometeors to that extent. MIL showed a little amount of graupel in the higher levels, though the amount was considerably less than that of WDM6 and WSM6. This, along with the lack of cloud ice simulated by MIL, suggests that the generation of graupel is attributed to the riming of cloud ice, which in turn decreased the cloud ice content. The graupel produced by TAA was negligible with respect to the other schemes; hence the graupel distribution for TAA has been omitted. Hence, from the hydrometeor analysis, it is evident that the prediction of upper-level frozen hydrometeors has a huge influence on the rainfall forecast. These results suggest that mixed-phase processes played a dominant role in this HRE and the inability of MIL and TAA to capture the cold rain component contributed hugely to the decrease in skills of rainfall prediction for these two schemes on the days of heaviest rainfall (day-3 and day-4). The lack of frozen hydrometeors in the MIL and TAA simulations signify that the rainfall produced by these two schemes throughout the course of simulation was majorly driven by warm rain processes. On the other hand, WDM6 and WSM6 were able to capture the development of frozen hydrometeors, thus capturing the cold rain component, which led to the relatively realistic simulation of rainfall by these two schemes. Besides, WDM6 and WSM6 have the tendency to hold precipitation hydrometeors (i.e., rainwater, cloud snow, graupel) without precipitating them instantaneously, and this might have facilitated more accumulation of frozen hydrometeors at the upper level. In addition to that, the warm rain component is better simulated by WDM6 as, being a double moment scheme, it can realistically simulate the number concentration of the cloud condensation nuclei (CCN), cloud droplet, and raindrop (Zhang et al. 2018). Hence, capturing the warm rain component better than WSM6 must have led WDM6 to have a higher skill of rainfall prediction than WSM6 on the days of the heaviest rainfall (day-3 and day-4).

Fig. 7
figure 7

a–e Spatial averaged time series of the mixing ratios of hydrometeors over the analysis polygon (Fig. 1): (1) cloud liquid water (solid contours), (2) cloud ice water (dotted contours) and (3) rain water (shading). All of the mixing ratios are scaled by 106 and unit is kg kg−1

Figure 8 shows the spatial average of model-derived radar reflectivity along longitudes against pressure levels over the inner domain for all the CMP schemes for day-3. It is evident that strong updraft motions and high hydrometeor activity till the mid-levels are better predicted by WDM6, WSM6, and MIL (Fig. 8a, c and d), wherein TAA poorly represented these key features (Fig. 8b). For MIL, high reflectivity zones (> 30 dBz) are found at 500–600 hPa levels. High reflectivity is prominently visible at the lower as well as mid-levels in the WDM6. The column integrated radar reflectivity as simulated by the model has been validated with the observations of the IMD Doppler Weather RADAR (DWR) situated at Kochi (9.93° N, 76.26° E) for two time instants, i.e., 1200 UTC of day-3 (14 August) and day-4 (15 August) (Fig. 9). The DWR captured scattered zones of high reflectivity (30–50 dBz) in the eastern and northern portions of the state of Kerala on day-3 (Fig. 9e). On day-4 a zone high reflectivity stretching from the adjacent sea to the middle portion of the state of Kerala is seen in the DWR (Fig. 9j). All the CMP schemes captured zones of high reflectivity (> 30 dBz) in the northern portion on day-3, and the strongest (weakest) zones of reflectivity were captured by WDM6 (TAA) (Fig. 9c and b). The WDM6 (WSM6 to some extent) captures the high reflectivity zones stretching from the northern to the southern portion of Kerala on day-4, but, the cluster of zones of high reflectivity at the middle of the state of Kerala (9.5–10.5° N), as shown by the DWR, was best captured by WDM6 on day-4 (Fig. 9h). Though the locations of zones of higher reflectivity values were captured by MIL and TAA, the magnitude of reflectivity was significantly underestimated by TAA on both the days.

Fig. 8
figure 8

a–d Latitudinal averaged vertical cross section of the model derived radar reflectivity over the inner domain on day-3

Fig. 9
figure 9

The model-produced radar reflectivity for all the CMP schemes along with the observations of the IMD Doppler Weather RADAR situated at Kochi (9.93° N, 76.26° E) at 1200 UTC of day-3 (14 August 2018) (ae) and day-4 (15 August 2018) (fj). The red spot indicates the location of the DWR at Kochi

To further analyze the hydrometeor activity at different levels and to thoroughly investigate the nature of updraft for WDM6, WSM6 and MIL, the CFAD (Yuter and Houze 1995) with the model-derived radar reflectivity for all the CMP schemes over the analysis polygon (Fig. 1) are shown for day-2, day-3, and day-4 (Fig. 10). CFAD analysis helps us to locate the level of maximum reflectivity which indicates the zones of maximum microphysical activity, and thus provides a huge insight in the understanding of the inherent structure of hydrometeors in the atmosphere. MIL shows higher reflectivity near the surface compared to upper layers throughout the simulation duration. This explains the lack of upper-level ice predicted by the scheme. TAA failed to capture the reflectivity at the upper levels (Fig. 10b, f and j). WDM6 shows the highest reflectivity (> 40 dBz) contours at the upper levels (200 hPa) on day-2, but these highest reflectivity contours are seen at middle levels (500–600 hPa) on day-3 and day-4. This high reflectivity at the middle level in spite of the high amount of rainwater in the lower levels, indicates the domination of phase-change processes in the middle atmosphere. On day-2, for WSM6, very high reflectivity contours are noted at the lower levels, whereas for day-4 these high reflectivity patterns are visible at both the upper and lower levels (Fig. 10l). The CFAD of MIL suggests that the zone of maximum reflectivity stays near the ground, which is indicative of high liquid water content at the lower levels resulting in the dominance of warm rain processes. Similarly, TAA fails to capture the higher level reflectivity due to the absence of cloud ice at the upper levels. The WSM6 and the MIL scheme show similar CFADs with the higher levels showing higher reflectivity contours on day-3 and day-4. However, the reflectivity contours are attributed to cloud liquid water for MIL and cloud ice for WSM6.

Fig. 10
figure 10

a–h Contour Frequency Altitude Diagram (CFAD) of model-simulated radar reflectivity (dBz) for all the CMPs for all the days over the analysis polygon (Fig. 1). Contours represent frequency of radar reflectivity relative to the maximum absolute radar reflectivity in the model derived data samples

3.4 Vertical velocity and temperature

Pressure–time plots for the differences of the spatial averaged vertical motion i.e. omega (Pas−1) for all the schemes over the analysis polygon (Fig. 1) with respect to WDM6 from day-2 to day-4 are shown in Fig. 11. It is evident that MIL is not able to capture the updraft motion at all with respect to (w.r.t.) WDM6 (Fig. 11a). Though there are similar updraft pattern for TAA (Fig. 11b) w.r.t. WDM6 in the lower levels (upto 700 hPa) for day-3. This lack of vertical velocity in the mid-levels inhibited the transport of water vapor and liquid hydrometeors to higher levels. WSM6 has shown resemblance in vertical velocity magnitudes to WDM6 (Fig. 11c). Similar plots for specific humidity and moisture convergence are shown in Figs. 12 and 13. It is evident that all the schemes simulated less specific humidity in the lower levels than WDM6 (Fig. 12) from day-2 to day-4. However, TAA and WSM6 showed higher specific humidity in the low levels (up to 850 hPa) than WDM6 for the entire simulation duration. This is corroborated by Fig. 13, where we see TAA and WSM6 showing higher surface moisture convergence than WDM6 on day-3 and day-4, whereas MIL couldn’t capture the peaks except for one at the beginning of day-4.

Fig. 11
figure 11

a–c Difference of spatial averaged time series of vertical velocity omega (Pa s−1) calculated over the analysis polygon (Fig. 1) for all the schemes with respect to WDM6

Fig. 12
figure 12

a–c Same as Fig. 11, but for specific humidity (kg kg−1)

Fig. 13
figure 13

a–c Same as Fig. 11 but only for moisture convergence (kg kg−1 s−1)

Pressure–time plots for the differences of the spatial averaged temperature (K) for all the schemes over the analysis polygon (Fig. 1) w.r.t. WDM6 from day-2 to day-4 are shown in Fig. 14. It is evident that over the region, the upper-level temperature is warm for TAA and MIL w.r.t WDM6 (Fig. 14a and b), which creates a less favourable environment for the generation of frozen hydrometeors. However, the difference in temperature between WSM6 and WDM6 is marginal. This is in agreement with the frozen hydrometeor distribution as simulated by MIL and TAA compared to WDM6. The averaged latent heat time series for upper (200–100 hPa) and lower level (900–600 hPa) are shown in Fig. 15. Results suggest that even though the upper-level latent heat is not varying much in the different MFC scenarios, the lower level latent heating is strongly modulated by the fluctuations in MFC.

Fig. 14
figure 14

a–c Same as Fig. 11, but for temperature (K)

Fig. 15
figure 15

a–d Time series of the (1) MFC (s−1), (2) latent heat averaged between 600–900 hPa (J kg−1) and (3) latent heat averaged between 100–200 hPa (J kg−1) over the analysis polygon (Fig. 1)

3.5 Moisture budget

The Moisture Flux Convergence (MFC) have been calculated following the methodology of Banacos et al. (2005). By vector identity, horizontal MFC can be written as

$$\mathrm{MFC}=-\nabla .\left(q{V}_{h}\right)=-{V}_{h}.\nabla q-q\nabla .{V}_{h}$$
(2)
$${\text{MFC}} = \underbrace {{ - u\frac{\partial q}{{\partial x}} - v\frac{\partial q}{{\partial y}}}}_{{{\text{Advection}}\;{\text{term}}}} - \underbrace {{q\left( {\frac{\partial u}{{\partial x}} + \frac{\partial v}{{\partial x}}} \right)}}_{{{\text{Convergence}}\;{\text{term}}}}$$
(2a)

where \(q\),\({V}_{h} ,u\) and \(v\) stand for specific humidity, horizontal velocity vector, zonal velocity, and meridional velocity.

The advection term represents the horizontal advection of specific humidity, whereas the convergence term denotes the product of specific humidity and horizontal mass convergence.

In this study, the contributions of the different components of the moisture budget equation (Eq. 2a) are studied for the Kerala HRE for the threshold of 100 mm. The advection and convergence terms are calculated using the formulation in Eq. (2a). This TRMM 3B42 3-hourly rain rate data are plotted alongside the different terms of the moisture budget equation to better understand the correlation between the model-predicted moisture convergence and the rainfall observations at higher temporal intervals (Fig. 16). The peaks of the MFC are prominent for all the schemes, and the contribution from the advection term is negative. In all the cases, the contribution of the MFC term peaks before the increase in rain rate predicted by the model and TRMM. But for WDM6, peaks of MFC are the most consistent with the peaks of rain rate (Fig. 16c). The lead-lag analysis is also carried out over the analysis polygon (Fig. 1) and the results are shown in Fig. 17. The WSM6 shows the lag between increased rain rate and MFC to be 2 h, which is highest among all (Fig. 17b). MIL and TAA predicted the MFC leading the peaks of rainfall by 1 h, which is similar to WDM6. In addition, it is prominently visible that the MFC peaks are synchronized with the model-simulated rain-rate peaks in day-3 and day-4 for WDM6 and WSM6 (Fig. 17a).

Fig. 16
figure 16

a–d Time series of the different components of the moisture budget equation, viz., (1) MFC (s−1), (2) local term, (3) advection term for 100 mm threshold along with the model predicted rain rate (mm h−1) and TRMM

Fig. 17
figure 17

a, b Time series of a MFC (dotted lines) and rainrate (solid lines) for all the schemes (above) along with b the lead-lag correlation (including lag between MFC and rainrate, calculated over the analysis polygon shown in Fig. 1) for all the schemes. The colours represent the corresponding CMP schemes according to the legend of (b)

From the moisture convergence scenario, it is seen that MIL simulated less moisture convergence than WDM6 almost throughout the course of the simulation (Fig. 13). Hence the accumulation of specific humidity at the lower levels is much less than WDM6, except for the beginning of day-4 when there was a momentary peak in moisture convergence in MIL than WDM6. This sudden increase of moisture convergence also resulted in the formation of relatively higher cloud ice formation than day-3 and also justified the patch of higher latent heat for MIL during a few initial hours of day-4. However, the significant lack of moisture convergence is one of the key factors attributed to the substantial reduction in latent heat release for MIL as compared to WDM6. In addition, weaker surface convergence resulted in weaker updrafts that led to the formation of fewer amounts of frozen hydrometeors in MIL compared to WDM6; this is due to the fact that weaker updrafts (MIL) couldn’t transport the low-level liquid hydrometeors to upper levels as seen in WDM6. In addition, the rain water content simulated by MIL is much less than WDM6, leading to poor skills in rainfall prediction. On the other hand, in TAA, up to day-3, the vertical velocity structure is similar to that of WDM6 (Fig. 11); however, afterward, strong inhibition of updraft is noted in TAA compared to WDM6, leading to the poor simulation of frozen hydrometeors realistically. Simultaneously, the restriction in vertical transport of cloud liquid water and large accumulation of rainwater in TAA suggests that the autoconversion process might have played a key role in producing rainwater from cloud liquid water at the lower levels. Furthermore, TAA is not able to capture the graupel like any other schemes; hence the rainfall simulated by TAA is mostly due to the warm rain process. Analysis of temperature (Fig. 14) and latent heat (Fig. 15) results clearly suggest that the MFC is strongly influencing the low level as well as the upper level hydrometeor distribution, rainfall, and the latent heating, and in turn modulating the convection and precipitation over this region.

3.6 Orographic influence

To understand the influence of the orographic scenario in this HRE, the day-3 averaged cross sectional vertical velocity along with the cloud liquid water and cloud ice is plotted (the cross-sectional orography is shaded) for the latitudes of Kochi (10.15°), situated in the middle of the state (Fig. 18), and Thiruvananthapuram (8.47°), situated in the southern Kerala (Figure not shown). It is evident from Fig. 18 that the Western Ghats played an important role in the occurrence of this HRE. The moisture convergence patches present beyond the coast of Kerala implies the accumulation of the moisture in the foothills of the orographic scenario. This accumulated moisture is transported to the upper atmosphere by means of orographic lifting facilitated by the windward slope of the Western Ghats. In Fig. 18, we can see that all the schemes show updraft motion in the windward slope. However, WDM6 shows the strongest updraft among the four CMP schemes. The cloud ice distribution confirms the lack of frozen hydrometeors formation by MIL and TAA. A downdraft motion is also seen in the leeward side of the Western Ghats, which is corroborated by the divergence patches in the eastern longitudes. A similar picture is also seen for the other station, i.e., Thiruvananthapuram.

Fig. 18
figure 18

a–d Averaged cloud liquid water (shaded, top colorbar) and cloud ice water (dotted, middle colorbar) overlaid with vertical velocity (contours, bottom colorbar) at the latitude of Kochi (10.15° N) for all the CMP schemes on day-3. The cross sectional orography is shaded and is plotted from 1000 hPa

4 Conclusions

Numerical simulations are carried out for the Kerala HRE (2018) using WRF-ARW model by employing four different CMP schemes, viz., MIL, TAA, WDM6, and WSM6. The 3AR for each experiment is validated against IMD station observations. It is found that WDM6 (TAA) has the best (worst) forecast skills among the suite of CMPs. The differences of 3AR have increased for all the schemes due to the underestimation of rainfall at most of the stations on day-4. However, the WDM6 rainfall forecast for day-3 is close to IMD. WDM6 predicted 3AR with less than 100 mm difference at 22 out of the 31 stations considered for validation. This clearly implies that WDM6 is the best CMP scheme among the others for the heavy rainfall prediction in this scenario. Furthermore, the spatial distribution of rainfall over the land as predicted by the WDM6 and WSM6 is consistent with the GPM observations, whereas predictions from MIL and TAA are poor. For the days with the heaviest rainfall (day-3 and day-4), WDM6 shows higher skill in all the thresholds (10–150 mm). The CRA analysis has been conducted to find the decomposed errors for all the CMP schemes by keeping the threshold as 150 mm for CMP experiments and GPM. The CRA identified by WDM6 had moderate displacement and pattern errors, but the volumetric error was least among all the experiments.

From the hydrometeor analysis, it is prominent that the inability of MIL and TAA to simulate the frozen hydrometeors in the upper levels affected their rainfall prediction badly, resulting in poor skills. However, hydrometeors simulated by WDM6 and WSM6 are in agreement with the ERA5 data. The realistic simulation of frozen hydrometeors led these schemes to have a much better rainfall prediction than MIL and TAA. WDM6, being the double moment scheme, captured the warm rain processes better than the WSM6, which led to its higher accuracy than WSM6. The latitudinal averaged vertical cross section of reflectivity showed zones of high updraft in MIL, WDM6 and WSM6, with WDM6 showing the strongest updrafts. Also, the comparison of model-simulated radar reflectivity with the IMD DWR observations suggests that the strongest (weakest) zones of reflectivity were captured by WDM6 and WSM6 (TAA) on both day-3 and day-4. However, the cluster of zones of high reflectivity on day-4 at the middle portion of Kerala (9.5–10.5° N) was captured by WDM6, which is in agreement with the IMD DWR observation. Furthermore, the CFAD analysis of model-produced radar reflectivity showed that the hydrometeor activities were restricted to the lower level for MIL, which in turn, agrees with the lack of frozen hydrometeors in the MIL simulations, but presence of enough cloud liquid water in the lower levels. In the case of WDM6, the domination of phase-change processes in the middle atmosphere is prominently visible, strengthening the indication that the cold rain processes were properly captured by WDM6.

Analysis of the moisture budget indicated that there is a significant impact of MFC in the rainfall prediction. In all the schemes, MFC led the rainrate peaks. The highest lead time was seen in WSM6 (2 h), whereas, for all the other simulations, it was 1 h. WDM6 simulated the MFC peaks in consistency with the TRMM rain rate. The peaks of moisture convergence can be associated with the convergence patterns visible beyond the coast of Kerala, in the foothills of Western Ghats. Orographic lifting played an important role in transporting the moisture to higher levels. There are patches of high moisture divergence along the leeward side of the Western Ghats indicating strong downdraft along the slope of the mountain. The results demonstrate that MFC plays an important role in accumulation of the moisture in lower and middle levels with further aggravation by the orographic presence. WDM6 captured the strongest updraft among all these experiments and this characterized with the adequate moisture leading to the better rainfall forecast skills. Furthermore, the better rainfall prediction skills of WDM6 is mainly attributed to, the accurate representation of frozen as well liquid hydrometeors facilitating a realistic simulation of both warm and cold rain processes.

This study provides the characteristic baseline behavior of key microphysical parameters and processes during a HRE in the complex geomorphological scenario over the southern Indian peninsular. However, investigation of this kind of meteorological event is highly influenced by several other factors (choice of boundary layer and cumulus convection parameterizations, resolution of the model domain etc.). Hence there is scope for further research for increasing the accuracy of the forecast for events such as this HRE. It is evident from researches such as this one, and the ones by Rajesh et al. (2017) and Mohan et al. (2018), that WRF has the capability of simulating HREs with reasonable accuracy over the Indian subcontinent, may it be the Hilly regions of Uttarakhand, or a coastal city in the eastern coast India, or a state with an orographic scenario in the Western coast of the country. It is anticipated that the findings of this research will be highly beneficial to operational forecasters, policy planners, and disaster risk managers for better prediction and preparedness to minimize the losses during these kinds of extreme HREs.