×

A flexible regression model for count data. (English) Zbl 1194.62091

Summary: Poisson regression is a popular tool for modeling count data and is applied in a vast array of applications from the social to the physical sciences and beyond. Real data, however, are often over- or under-dispersed and, thus, not conducive to Poisson regression. We propose a regression model based on the Conway-Maxwell-Poisson (COM-Poisson) distribution to address this problem. The COM-Poisson regression generalizes the well-known Poisson and logistic regression models, and is suitable for fitting count data with a wide range of dispersion levels. With a GLM approach that takes advantage of exponential family properties, we discuss model estimation, inference, diagnostics, and interpretation, and present a test for determining the need for a COM-Poisson regression over a standard Poisson regression. We compare the COM-Poisson to several alternatives and illustrate its advantages and usefulness using three data sets with varying dispersion.

MSC:

62J12 Generalized linear models (logistic models)
62H12 Estimation in multivariate analysis
62J20 Diagnostics, and linear inference and regression
PDFBibTeX XMLCite
Full Text: DOI arXiv

References:

[1] Ben, M. G. and Yohai, V. J. (2004). Quantile quantile plot for deviance residuals in the generalized linear model. J. Comput. Graph. Statist. 13 36-47.
[2] Boatwright, P., Borle, S. and Kadane, J. B. (2003). A model of the joint distribution of purchase quantity and timing. J. Amer. Statist. Assoc. 98 564-572. · Zbl 1045.62118
[3] Borle, S., Boatwright, P. and Kadane, J. B. (2006). The timing of bid placement and extent of multiple bidding: An empirical investigation using ebay online auctions. Statist. Sci. 21 194-205. · Zbl 1426.62368
[4] Borle, S., Boatwright, P., Kadane, J. B., Nunes, J. C. and Shmueli, G. (2005). The effect of product assortment changes on customer retention. Marketing Science 24 616-622.
[5] Borle, S., Dholakia, U., Singh, S. and Westbrook, R. (2007). The impact of survey participation on subsequent behavior: An empirical investigation. Marketing Science 26 711-726.
[6] Cui, Y., Kim, D.-Y. and Zhu, J. (2006). On the generalized Poisson regression mixture model for mapping quantitative trait loci with count data. Genetics 174 2159-2172.
[7] Davison, A. and Tsai, C.-L. (1992). Regression model diagnostics. International Statistical Review 60 337-353. · Zbl 0775.62201
[8] Famoye, F. (1993). Restricted generalized Poisson regression model. Comm. Statist. Theory Methods 22 1335-1354. · Zbl 0784.62018
[9] Famoye, F., Wulu, J. J. and K. P. Singh (2004). On the generalized Poisson regression model with an application to accident data. Journal of Data Science 2 287-295.
[10] Kadane, J. B., Krishnan, R. and Shmueli, G. (2006). A data disclosure policy for count data based on the COM-Poisson distribution. Management Science 52 1610-1617.
[11] Kadane, J. B., Shmueli, G., Minka, T. P., Borle, S. and Boatwright, P. (2005). Conjugate analysis of the Conway-Maxwell-Poisson distribution. Bayesian Anal. 1 363-374. · Zbl 1490.62058
[12] Kalyanam, K., Borle, S. and Boatwright, P. (2007). Deconstructing each item’s category contribution. Marketing Science 26 327-341.
[13] Kutner, M. H., Nachtsheim, C. J. and Neter, J. (2003). Applied Linear Regression Models , 4th ed. McGraw-Hill, New York.
[14] Lattin, J. M., Green, P. E. J. and Caroll, D. (2003). Analyzing Mulivariate Data . Duxbury, Pacific Grove, CA.
[15] Long, J. S. (1997). Regression Models for Categorical and Limited Dependent Variables . Sage, London. · Zbl 0911.62055
[16] Lord, D., Guikema, S. D. and Geedipally, S. R. (2008). Application of the Conway-Maxwell-Poisson generalized linear model for analyzing motor vehicle crashes. Accident Analysis & Prevention 40 1123-1134.
[17] McCullagh, P. and Nelder, J. A. (1997). Generalized Linear Models , 2nd ed. Chapman & Hall/CRC, London. · Zbl 0588.62104
[18] Minka, T. P., Shmueli, G., Kadane, J. B., Borle, S. and Boatwright, P. (2003). Computing with the COM-Poisson distribution. Technical Report 776, Dept. Statistics, Carnegie Mellon Univ., Pittsburgh, PA. · Zbl 1490.62058
[19] Shmueli, G., Minka, T. P., Kadane, J. B., Borle, S. and Boatwright, P. (2005). A useful distribution for fitting discrete data: Revival of the Conway-Maxwell-Poisson distribution. Appl. Statist. 54 127-142. · Zbl 1490.62058
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.