6. Discussion
This work develops a zero-inflated COM–Poisson regression to model count data containing some form of dispersion (i.e. over- or under-dispersion) and an excess number of zeroes. Such data structures appear frequently in various applications such as psychology, engineering, and business. Excess zeroes are a common cause of data over-dispersion (Hilbe, 2008). For any generated dataset, data outcomes that are zeroes add to the sample size but not to the sum total of data observations, thus diminishing the mean of the dataset. Meanwhile, these values still contribute to the variance of the dataset, thus increasing the chance that the variance is greater than the mean. However, it does not necessarily imply the overall dispersion level of a zero-inflated dataset as being overdispersed. Sellers and Shmueli (2013) provide data examples where distribution mixtures can impact the overall level of data dispersion. Because such data can be overor under-dispersed, the two-parameter COM–Poisson structure allows for more flexibility in describing the relationship between explanatory variables and the response variable—both in the count component and the zero component. In fact, we demonstrate the flexibility of the ZICMP in its ability to capture three special case zero-inflated distributions, namely the ZIP, ZIG, and logistic models. This stems from the distributional structure and statistical properties associated with the COM–Poisson distribution.