Abstract
Existing approaches for determining wind speed distribution usually fit a single Weibull distribution to the entire measured data or fit multiple Weibull distributions to the data segregated by months. However, wind speed varies not only monthly or seasonally but also hourly. The present study captures monthly and diurnal variability by fitting a Weibull distribution to data measured at four locations in the central-western part of India. Two methods, a maximum likelihood (MLE) method and an empirical method, were used to estimate the Weibull parameters with several goodness-of-fit tests including R2, root mean square error (RMSE), average power density error, and average energy output error. The findings demonstrated a marginal improvement in fitting accuracy with the implementation of monthly–hourly bin segmentation and fitting calculating Weibull parameters for each dataset.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Background
Due to natural variability in the availability of wind resources, the knowledge of wind speed distribution is important for assessing economic viability of wind energy systems at any location. A two-parameter Weibull distribution is often used to represent wind speeds at a given location. Some studies used a single Weibull distribution to describe wind speeds at a particular location by treating all of the measured data as one group [1,2,3,4,5,6,7]. Since a single Weibull distribution fails to capture intra-annual variability of wind speed, some studies have segregated the measured wind data into various groups according to months of the year [2, 8,9,10,11,12,13,14,15], seasons [8, 13, 16], or hours of the day [13] to account for variations in wind speeds and estimate the Weibull parameters more accurately.
However, existing approaches to fitting a Weibull distribution to the observed wind data have ignored the fact that wind speed varies with hour of the day as well as month of the year and thus might result in poor fitting. The diurnal variations are not the same for each month as some months have higher daily wind speed than others. For example, the hourly profiles of wind speed for each month at selected locations in India are shown in Fig. 1. At these locations, diurnal profile of wind speed depends on the month. Monsoon months, from June to September, usually have higher wind speed as compared to other months for each hour of day.
The present study explored how segregating the data into different temporal resolutions affected the accuracy of representing wind speed. The study calculated the wind energy potential from the observed data and compared that with the potential estimated from a fitted Weibull distribution for the following four groupings of observed data: (i) A single Weibull distribution was used to represent the entire data. (ii) Twelve Weibull distributions were each fitted separately to the data for each month. (iii) Twenty-four Weibull distributions were each fitted separately to the data for each hour of a day. (iv) For each hour of the month, 288 (24 X 12) Weibull distributions were fitted separately to the data.
Knowledge Contribution
Existing studies on fitting Weibull distribution capture either monthly or diurnal variation but not both, lowering the accuracies of estimated Weibull parameters. This study addresses this research gap and captures diurnal variability for each month to help better assess availability of wind resources. The present study showed that fitting Weibull parameters for wind data on much more granular scale improves the accuracy of the fitting process. Such improvements may have significant impacts on the choice of suitable locations for installing wind turbines.
Method and Data
The overview of methodological steps is shown in Fig. 2 ([2, 7]). The following subsections describe the methodology used in the present work:
Parameter Estimation
A Weibull distribution describes observed wind speed data reasonably well [17,18,19,20,21]. Equation (1) expresses the probability density function of wind speed v as described by a two-parameter Weibull distribution at a particular location [18]:
where k = shape parameter (higher k means higher Vmax but lower power density), and c = scale parameter (in m/s).
Several methods have been used to estimate k and c parameters from observed wind speed data [1, 4, 8, 22,23,24]. These methods can be broadly classified into iterative and non-iterative methods. Non-iterative methods present closed-form solutions, while iterative methods require mathematical iterations to arrive at a final estimated value of parameters. Nonetheless, most methods fit a Weibull distribution to the observed wind data well, and no rule exists for choosing the best method, with “best” being defined based on goodness-of-fit criteria [21]. In the present work, a maximum likelihood (MLE) method and an empirical method were used for parameter estimation. The empirical method proposed by [22] presents a closed-form formula and requires only mean and standard deviation of wind speeds to determine the Weibull parameters, as expressed in Eqs. (2) and (3). As a scale parameter uses a somewhat complicated gamma function, several variants of this method have been proposed to approximate the value of the scale parameter [25, 26]. These variants use the same formula for k but use an approximated formula for c.
The MLE proposed by [27] (cited in Seguro and Lambert, 2000) is an iterative process and perhaps the most widely used method for estimating Weibull parameters [1, 15]. The parameters can be estimated using Eqs. (4) and (5).
The values of k and c were estimated using the two methods explained above. The fitting steps were carried out for four cases: In the first case, a Weibull distribution was fit to the whole data, and a single pair of (k, c) was estimated. In the second case, the data were separated according to the months of a year, and subsequently, a Weibull distribution was separately fitted to get monthly estimates of parameters. In this case, a total of 12 pairs of Weibull parameters were obtained. Similarly, for the third case involving fitting a Weibull function to the data, the data were segregated by hour of the day, resulting in 24 estimated pairs of Weibull parameters for all the data. Lastly, in the fourth case, the data were segregated according to the hour of the day for each month, and a Weibull distribution was fit to 288 segregated datasets, resulting in a total of 288 pairs of estimated parameters.
Goodness-of-Fit
Different statistical tests have been used for assessing how well estimated distribution describes the observed data [28]. Some tests (e.g., Chi-square test or Kolmogorov–Smirnov statistics) should be avoided because they are susceptible to small sample size and fewer number of bins [4, 29]. Therefore, these tests were not included in this study.
Root Mean Square Error
Root mean square error (RMSE) is computed using Eq. (6) [1, 12]. Equation (6) shows the modified RMSE formula to account for fitting multiple Weibull distributions to the observed data. A smaller value of RMSE indicated a better fit.
where \({{\text{y}}}_{{\text{i}}}\) : observed frequency;
\({{\text{x}}}_{{\text{i}}}\): expected frequency (from the fitted Weibull distribution);
\({\text{n}}\): number of bins of data of a group g;
G: segregated groups for a particular case (e.g., January to December for monthly group, 1 to 24 for
the hourly group); and
\({\text{N}}\): total number of data points in all groups.
R-Squared
The coefficient of determination (\({R}^{2}\)) is calculated using Eq. (7) [4, 30]. Similar to Eq. (6), Eq. (7) is modified to account for the goodness-of-fit of multiple Weibull distributions to the observed data. \({R}^{2}\) explained how well variations in the data were explained by the fitted model. \({R}^{2}\) usually lies between zero and one, with one considered the best fit.
where \(\overline{{\text{y}} }\) is the global mean of observed frequency (mean frequency of the entire data).
Average Power Density Error
Average power density error (APDE) is defined as the difference between the average power density calculated from the fitted Weibull distribution and power density calculated from the observed data. It can be denoted by Eq. (8) [16, 18]. This method has several advantages, such as simple mathematical expressions, freedom from binning errors, and better prediction accuracies [13, 30].
Average Energy Output Error
For modeling wind energy systems, Eqs. (6) to (8) may not be suitable measures of goodness-of-fit. These tests give equal weight to each wind speed, irrespective of its magnitude, but wind turbines do not always produce power. The output is zero when wind speed is below cut-in or above cut-out threshold of the wind turbines. Any fit might on average describe wind speed of certain magnitudes more accurately than other wind speeds, but it may be considered a poor fit if it fails to describe wind speeds at which turbines actually produce power [10, 31]. If the purpose of obtaining wind speed distribution at any location is to eventually predict wind power generation, suitable wind speed distribution must be able to predict wind power generation reasonably well [4, 10, 16].
Therefore, in this study, average energy output error (AEOE) was also used as an indicator of goodness-of-fit. It was computed as the difference between the estimated energy and the actual energy. The relevant mathematical expression is shown using Eq. (9). The power curve of a generic turbine was used to map wind speed to a particular power output of a wind turbine. The data interval was taken to be one hour. For a turbine of MW rating, the output of f(x) was in MWh, depending upon the turbine rating.
where f(x): wind power output of a given turbine, and x: wind speed.
Equation (5) requires as many estimated wind data points in each group as there are points in the corresponding group of observed data. Monte Carlo simulation (MCS) was used to generate data from estimated parameters to match the numbers in observed data. MCS is useful for capturing uncertainties such as natural variabilities in wind speed and obtaining a probability distribution of the goodness-of-fit metric [11, 23]. For each group, a random sample of n data points was drawn from the estimated Weibull distribution associated with that group. The procedure was repeated for 500 trials to account for natural variability of wind, resulting in 500 values of error for each group in each case. The advantage of this approach over other similar indicators, such as expected energy/power output (EEO) used by [4, 10, 23, 25], is that it produces a distribution of errors rather than a single value of error.
Data
The National Institute of Wind Energy (NIWE) maintains hourly wind speed data for various locations across India and makes the data freely available at http://niwe.res.in:8080/NIWE_WRA_DATA/). Based on the meta-data of wind monitoring stations mentioned in the Wind Atlas of India [32], the top four sites in central India, where data were observed for the longest time period, were selected for this study. A script was written in Python (www.python.org) for simulation and estimation. The linearized power curve of a commercial turbine of size 2 MW was used in the study [33].
Results and Discussion
Wind Data Summary
Wind speed data of four locations in the central region of India were analyzed. The meta-data are shown in Table 1.
The wind statistics at these locations are summarized in Table 2. Hourly and monthly variations in wind speed curves are shown in Fig. 1. Using the formula expressed by Eq. (10), the data were rescaled to estimate the speed values at the height of 100 m, the height at which wind energy potential is usually estimated in India [34]. The value of α (terrain factor) at these locations was assumed to be equal to 0.2 [26].Footnote 1 The parameters were estimated using wind speeds at 20 m.
Table 3 shows how much wind speed varied within a day for different months. The range for a particular month is the difference between the maximum and minimum average hourly wind speeds. This range shows how much wind varies throughout the day, and a higher value indicates wide diurnal variations in wind speed. The range was similar for three of the locations, but for location D, the range was much smaller, indicating that diurnal variabilities in wind speed were smaller at this location.
Estimation of Parameters
The results are presented in Table 4. The findings showed that fitting a Weibull distribution to the data segregated by the hours of each month resulted in a better fit as compared to estimating Weibull parameters from more aggregated datasets. The findings were consistent across all four locations and for all goodness-of-fit metrics except APDE. At location D, Case II was a better fit than Case IV. This could be due to overfitting, as the observed diurnal daily variations were small, or it could possibly be due to the small sample size.
RMSE found in this study was higher that reported in other studies, e.g., [2], perhaps due to smaller sample size or lower wind speeds. Studies have found that an iterative Weibull method fits better than a two-parameter Weibull method if wind data are skewed toward values close to zero [1]. Further, this study’s findings did not agree with findings of [10] that indicated that statistical measures might not be suitable for goodness-of-fit. Moreover, this study complimented the findings of [35], suggesting temporal variations for wind energy assessment.
Figures 3 and 4 show a range of estimated parameters for different months. For each month, the top and bottom of the bar represent the minimum and maximum values among 24 hourly values of the parameter. With both figures, diurnal variabilities in wind speed are significant. Estimated values for each of the 24 h of the day varied considerably for many months. This variability was not captured when the data were only segregated by months to obtain a single monthly value of the shape and scale parameters. Using multiple Weibull distributions increased the complexity of the fitting process and required computational resources [36]. However, since the costs of computational power and data storage have become affordable, the complexity of the fitting process may not be an issue, even if the commercial implications of improving the accuracies were considerable.
Conclusion
Many studies represented wind speed using a single Weibull distribution. As wind resources at any location constantly vary daily or monthly, researchers captured monthly/seasonal variations to better represent wind speeds. However, diurnal variabilities in wind speed can be significant. Using different Weibull distributions for each hour of each month provides a better fit, since R2 for location I varies between 0.81 and 0.84. Similarly, other locations also have higher R2 value, indicating a better fit. Although the improvement may be small, capturing such variability is important for making preliminary assessments of wind energy potential and optimally sizing and modeling renewable systems.
Furthermore, in energy modeling, better fit usually implies accurate estimations of energy yield, suggesting that energy-based indicators might be appropriate criteria for evaluating how well the distribution fits the observed data. Nevertheless, the results of this investigation indicated that conventional statistical tests offer a comparable level of precision to an energy yield indicator for adjusted parameters. Future research may explore advanced modeling techniques such as non-stationary modeling of wind [37] and multi-instrument observations [38] for wind energy measurement.
Notes
The Center for Wind Energy Technology is now known as the National Institute of Wind Energy (NIWE).
References
P.A.C. Rocha, R.C. de Sousa, C.F. de Andrade, M.E.V. da Silva, Comparison of seven numerical methods for determining Weibull parameters for wind energy generation in the northeast region of Brazil. Appl. Energy 89(1), 395–400 (2012)
F. Fazelpour, E. Markarian, N. Soltani, Wind energy potential and economic assessment of four locations in Sistan and Balouchestan province in Iran. Renew. Energy 109, 646–667 (2017)
W.C. Cliff, Effect of generalized wind characteristics on annual power estimates from wind turbine generators, Battelle Pacific Northwest Labs., Richland, WA (USA) (1977)
C. Carrillo, J. Cidrás, E. Díaz-Dorado, A.F. Obando-Montaño, An approach to determine the Weibull parameters for wind energy analysis: the case of Galicia (Spain). Energies 7(4), 2676–2700 (2014)
A. Dabbaghiyan, F. Fazelpour, M.D. Abnavi, M.A. Rosen, Evaluation of wind energy potential in province of Bushehr, Iran. Renew. Sustain. Energy Rev. 55, 455–466 (2016)
D. Solyali, M. Altunç, S. Tolun, Z. Aslan, Wind resource assessment of Northern Cyprus. Renew. Sustain. Energy Rev. 55, 180–187 (2016)
A. Sedaghat, A. Hassanzadeh, J. Jamali, A. Mostafaeipour, W.H. Chen, Determination of rated wind speed for maximum annual energy production of variable speed wind turbines. Appl. Energy 205, 781–789 (2017)
A.K. Azad, M.G. Rasul, T. Yusaf, Statistical diagnosis of the best Weibull methods for wind power assessment for agricultural applications. Energies 7(5), 3056–3085 (2014)
A.S. Dorvlo, Estimating wind speed distribution. Energy Convers. Manag. 43(17), 2311–2318 (2002)
A.N. Celik, On the distributional parameters used in assessment of the suitability of wind speed probability density functions. Energy Convers. Manag. 45(11), 1735–1747 (2004)
T.P. Chang, Performance comparison of six numerical methods in estimating Weibull parameters for wind energy application. Appl. Energy 88(1), 272–282 (2011)
S.A. Ahmed, Comparative study of four methods for estimating Weibull parameters for Halabja, Iraq. Int. J. Phys. Sci. 8(5), 186–192 (2013)
K. Mohammadi, A. Mostafaeipour, Using different methods for comprehensive study of wind turbine utilization in Zarrineh, Iran. Energy Convers. Manag. 65, 463–470 (2013)
A. Ucar, F. Balo, Evaluation of wind energy potential and electricity generation at six locations in Turkey. Appl. Energy 86(10), 1864–1872 (2009)
M.A. Baseer, J.P. Meyer, S. Rehman, M.M. Alam, Wind power characteristics of seven data collection sites in Jubail, Saudi Arabia using Weibull parameters. Renew. Energy 102, 35–49 (2017)
J. Wang, X. Huang, Q. Li, X. Ma, Comparison of seven methods for determining the optimal statistical distribution parameters: a case study of wind energy assessment in the large-scale wind farms of China. Energy 164, 432–448 (2018)
C.G. Justus, W.R. Hargraves, A. Yalcin, Nationwide assessment of potential output from wind-powered generators. J. Appl. Meteorol. 1962–1982, 673–678 (1976)
G. Johnson, Wind energy systems, Wind Energy Syst. (2006)
A. Garcia, J.L. Torres, E. Prieto, A. De Francisco, Fitting wind speed distributions: a case study. Sol. Energy 62(2), 139–144 (1998)
P. Wais, A review of Weibull functions in wind sector. Renew. Sustain. Energy Rev. 70, 1099–1107 (2017)
J.A. Carta, P. Ramirez, S. Velazquez, A review of wind speed probability distributions used in wind energy analysis: case studies in the Canary Islands. Renew. Sustain. Energy Rev. 13(5), 933–955 (2009)
C.G. Justus, W.R. Hargraves, A. Mikhail, D. Graber, Methods for estimating wind speed frequency distributions. J. Appl. Meteorol. 1962–1982, 350–353 (1978)
J.V. Seguro, T.W. Lambert, Modern estimation of the parameters of the Weibull wind speed distribution for wind energy analysis. J. Wind Eng. Ind. Aerodyn. 85(1), 75–84 (2000)
A.A. Amrutha, P. Balachandra, M. Mathirajan, Role of targeted policies in mainstreaming renewable energy in a resource constrained electricity system: a case study of Karnataka electricity system in India. Energy Policy 106, 48–58 (2017)
A. Balouktsis, D. Chassapis, T.D. Karapantsios, A nomogram method for estimating the energy produced by wind turbine generators. Sol. Energy 72(3), 251–259 (2002)
C-WET, Indian wind atlas. Chennai, India: C-WET (2010)
M.J.M. Stevens, P.T. Smulders, The estimation of the parameters of the Weibull wind speed distribution for wind energy utilization purposes. Wind Eng. 3(2), 132–145 (1979)
T.B.M.J. Ouarda, C. Charron, F. Chebana, Review of criteria for the selection of probability distributions for wind speed data and introduction of the moment and L-moment ratio diagram methods, with a case study. Energy Convers. Manag. 124, 247 (2016)
Y.M. Kantar, I. Usta, I. Arik, I. Yenilmez, Wind speed analysis using the extended generalized Lindley distribution. Renew. Energy 118, 1024 (2018)
S.A. Akdaǧ, A. Dinler, A new method to estimate Weibull parameters for wind energy applications. Energy Convers. Manag. 50, 1761 (2009)
S. Mathew, Wind energy: fundamentals, resource analysis and economics (Springer, 2006)
NIWE, List of wind monitoring stations. [Online]. http://niwe.res.in/assets/Docu/lwms/List_of_WMS_as_on_30.06.2018.pdf (2018)
Gamesa, Gamesa G87–2.0 MW. [Online]. Available: http://www.wind-power-program.com/Library/Turbine leaflets/Gamesa/Gamesa G87 2mw.pdf. Accessed 09 Sep 2018 (2008)
NIWE, Wind power potential at 100m agl. [Online]. http://niwe.res.in/assets/Docu/Wra_100m agl map.pdf. Accessed 10 Sep 2018 (2015)
J.Y. He et al., Assessment of urban wind energy resource in Hong Kong based on multi-instrument observations. Renew. Sustain. Energy Rev. 191, 114123 (2024)
X. Yang, X. Jiang, S. Liang, Y. Qin, F. Ye, B. Ye, X. Jiayu et al., Spatiotemporal variation of power law exponent on the use of wind energy. Appl. Energy 356, 122441 (2024)
T.B.M.J. Ouarda, C. Charron, Non-stationary statistical modelling of wind speed: a case study in eastern Canada. Energy Convers. Manag. 236, 114028 (2021)
J.Y. He, P.W. Chan, Q.S. Li, T. Huang, S.H.L. Yim, Assessment of urban wind energy resource in Hong Kong based on multi-instrument observations. Renew. Sustain. Energy Rev. 191, 114123 (2024)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Jain, S., Jain, N.K. Improvements in Fitting Accuracy of Weibull Distribution for Wind Data by Capturing Monthly and Diurnal Variability in Wind Speeds. J. Inst. Eng. India Ser. C 105, 607–615 (2024). https://doi.org/10.1007/s40032-024-01040-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40032-024-01040-4