Abstract
Exponential dispersion models, which are linear exponential families with a dispersion parameter, are the prototype response distributions for generalized linear models. The Tweedie family comprises those exponential dispersion models with power mean-variance relationships. The normal, Poisson, gamma and inverse Gaussian distributions belong to theTweedie family. Apart from these special cases, Tweedie distributions do not have density functions which can be written in closed form. Instead, the densities can be represented as infinite summations derived from series expansions. This article describes how the series expansions can be summed in an numerically efficient fashion. The usefulness of the approach is demonstrated, but full machine accuracy is shown not to be obtainable using the series expansion method for all parameter values. Derivatives of the density with respect to the dispersion parameter are also derived to facilitate maximum likelihood estimation. The methods are demonstrated on two data examples and compared with with Box-Cox transformations and extended quasi-likelihoood.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Aalen O.O. 1992. Modelling heterogeneity in survival analysis by the compound Poisson distribution. Ann. Appl. Probab. 2: 951–972.
Abramowitz M. and Stegun I.A. (Eds.) 1965. A Handbook of Mathematical Functions with Formulas, Graphs and Mathematical Tables. Dover Publications, New York.
Bar-Lev S.K. and Enis P. 1986. Reproducibility and natural exponential families with power variance functions. Ann. Statist. 14: 1507–1522.
Bar-Lev S.K. and Stramer O. 1987. Characterizations of natural exponential families with power variance functions by zero regression properties. Probab. Theory Related Fields 76: 509–522.
Box G.E.P. and Cox D.R. 1964. An analysis of transformations (with discussion). J. R. Stat. Soc. Ser. B Stat. Methodol. 26: 211–252.
Byrd R.H., Lu P., Nocedal J., and Zhu C. 1995. A limited memory algorithm for bound constrained optimization. SIAM J. Scientific Computing. 16: 1190–1208.
Cox D.R. 1961. Tests of separate families of hypotheses. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, University of California Press, Berkeley, CA Vol. 1, pp. 105–123.
Cox D.R. 1962. Further results on tests of separate families of hypotheses. Journal of the Royal Statistical Society B 24: 406–424.
Cox D.R. and Reid N. (1987). Parameter orthogonality and approximate conditional inference. J.R. Statist. Soc. B 49: 1–39.
Davidian M. 1990. Estimation of variance functions in assays with possible unequal replication and nonnormal data. Biometrika 77: 43–54.
Davidian M., Carroll R.J., and Smith W. 1988. Variance functions and the minimum detectable concentration in assays. Biometrika 75: 549–556.
de Silva H.N., Hall A.J., Tustin D.S. and Gandar P.W. 1999. Analysis of distribution of root length density of apple trees on different dwarfing rootstocks. Ann. Botany, 83: 335–345.
Dunn P.K. 2001. Likelihood-based inference for tweedie exponential dispersion models. Unpublished PhD Thesis, University of Queensland.
Dunn P.K. and Smyth G.K. 1996. Randomized quantile residuals. J. Comput. Graph. Statist. 5: 236–244.
Feller W. 1968. An Introduction to Probability Theory and its Applications, third edition. John Wiley, New York: Volume I.
Gilchrist R. and Drinkwater D. 1999. Fitting Tweedie models to data with probability of zero responses. Proceedings of the 14th International Workshop on Statistical Modelling, Graz, pp. 207–214.
Haberman S. and Renshaw A.E. 1996. Generalized linear models and actuarial science. The Statistician 45: 407–436.
Haberman S. and Renshaw A.E. 1998. Actuarial applications of generalized linear models. In: D.J. Hand and S. D. Jacka (Eds.), Statistics in Finance, Arnold, London.
Hougaard P. 1986. Survival models for heterogeneous populations derived from stable distributions. Biometrika 73: 387–396.
Hougaard P., Harvald B., and Holm N.V. 1992. Measuring the similarities between the lifetimes of adult Danish twins born between 1881–1930. J. Amer. Statist. Assoc. 87: 17–24.
Johnson N.L. and Kotz S. 1970. Continuous Univariate Distributions–I, Houghton Mifflin Company, Boston.
Jørgensen B. 1987. Exponential dispersion models (with discussion). J. R. Stat. Soc. Ser. B Stat. Methodol. 49: 127–162.
Jørgensen B. 1997. The Theory of Dispersion Models, Chapman and Hall, London.
Jørgensen B. and Paes de Souza M.C. 1994. Fitting Tweedie's compound Poisson model to insurance claims data. Scand. Actuar. J. 1: 69–93.
McCullagh P. and Nelder J.A. 1989. Generalized Linear Models, 2nd edn. Chapman and Hall, London.
Millenhall S.J. 1999. A systematic relationship between minimum bias and generalized linear models. Proceedings of the Casualty Actuarial Society, Vol. 86, pp. 393–487.
Murphy K.P., Brockman M.J., and Lee P.K.W. (2000). Using generalized linear models to build dynamic pricing systems. Casualty Actuarial Forum.
Nelder J.A. 1994. An alternative view of the splicing data. J. Roy. Statist. Soc. Ser. C 43: 469–476.
Nelder J.A. and Lee Y. 1992. Likelihood, quasi-likelihood and pseudolikelihood: some comparisons, J. R. Stat. Soc. Ser. B Stat. Methodol. 54: 273–284.
Nelder J.A. and Pregibon D. 1987. An extended quasi-likelihood function. Biometrika 74: 221–232.
Nelder J.A. and Wedderburn R.W.M. 1972. Generalized linear models. J. Roy. Statist. Soc. Ser. A 135: 370–384.
Pereira B. de B. 1977. Discriminating among separate models: A bibliography. Internat. Statist. Rev. 45: 163–172.
Perry J.N. 1981. Taylor's power law for dependence of variance on mean in animal populations. J. Roy. Statist. Soc. Ser. C 30: 254–263.
R Development Core Team. 2004. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available from http://www.R-project.org.
Renshaw A.E. 1994. Modelling the claims process in the presence of covariates. ASTIN Bulletin 24: 265–286.
Seigel A.F. 1979. The noncentral chi-squared distribution with zero degrees of freedom and testing for uniformity. Biometrika 66: 381–386.
Seigel A.F. 1985. Modelling data containing exact zeros using zero degrees of freedom. J. R. Stat. Soc. Ser. B 47: 267–271.
Smyth G.K. 1989. Generalized linear models with varying dispersion, J. R. Stat. Soc. Ser. B 51: 47–60.
Smyth G.K. 1996. Regression analysis of quantity data with exact zeros. Proceedings of the Second Australia–Japan Workshop on Stochastic Models in Engineering, Technology and Management. Technology Management Centre, University of Queensland, pp. 572–580.
Smyth G.K. and Verbyla A.P. 1999. Adjusted likelihood methods for modelling dispersion in generalized linear models. Environmetrics 10: 695–709.
Smyth G.K. and Jørgensen B. 2002. Fitting Tweedie's compound Poisson model to insurance claims data: dispersion modelling. ASTIN Bulletin 32: 143–157.
Tweedie M.C.K. 1984. An index which distinguishes between some important exponential families. Statistics: Applications and New Directions. Proceedings of the Indian Statistical Institute Golden Jubilee International Conference, Indian Statistical Institute, Calcutta, pp. 579–604.
Wright E.M. 1933. On the coefficients of power series having essential singularities. J. London Math. Soc. 8: 71–79.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Dunn, P.K., Smyth, G.K. Series evaluation of Tweedie exponential dispersion model densities. Stat Comput 15, 267–280 (2005). https://doi.org/10.1007/s11222-005-4070-y
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/s11222-005-4070-y