in order to arrive at the marginal density for Y; that is, the density that is no longer
conditional on υ
i
. The usual assumption is that υ
i
has a gamma density with param-
eter α
⫺1
. This is a continuous right-skewed density that resembles a chi-squared
variable. In fact, the chi-squared density is a special case of the gamma density (Hoel
et al., 1971).
Constructing the NBRM Density. “Integrating out” υ
i
means that we take the aver-
age value of density (10.6) over the distribution of υ
i
[see Greene (2003) for the
details of the integration]. The advantage of assuming a gamma density for υ
i
here
is that this integration then has a closed-form solution. The resulting marginal den-
sity of Y
i
, given the regressors and α, is the negative binomial density (Cameron and
Trivedi, 1998):
f(y
i
冟 x
i
,
ββ
,α) ⫽
ᎏ
⌫(
Γ
α
(
⫺
α
1
⫺
)⌫
1
⫹
(y
y
i
⫹
i
)
1)
ᎏ
冢
ᎏ
α
⫺
α
1
⫺
⫹
1
µ
i
ᎏ
冣
α
⫺1
冢
ᎏ
µ
i
⫹
µ
α
i
⫺1
ᎏ
冣
y
i
, (10.7)
where as before, µ
i
equals exp(冱β
k
X
ik
). The gamma function, Γ(⭈), in this expression
is defined by an integral with no closed-form solution (Hoel et al., 1971). However,
it turns out that Γ(a) ⫽ (a ⫺ 1)! if a is an integer. The term α
⫺1
is not typically an
integer. But if it were, given that y
i
is an integer, density (10.7) would have the same
form as the negative binomial density in expression (10.1), where r ⫽ α
⫺1
and
p ⫽ α
⫺1
/(α
⫺1
⫹ µ
i
). The product of density (10.7) over all n sample cases is the like-
lihood function, which is then maximized with respect to α and
ββ
to find the MLEs.
Because the conditional mean of Y
i
in density (10.7) is still µ
i
⫽ exp(冱β
k
X
ik
), the
betas still have the same interpretations as given to those in the PRM. (This holds
true for all count models discussed in this chapter.) Estimated probabilities for each
count are calculated by substituting µ
ˆ
i
⫽ exp(冱β
ˆ
k
X
ik
) and α
ˆ
into expression (10.7).
Greene (1998) presents a convenient recursion formula that can be programmed into
LIMDEP for the calculation of these probabilities.
Testing for Overdispersion. The conditional variance of Y
i
in the NBRM is
µ
i
⫹ αµ
i
2
. The parameter α, called the overdispersion parameter, is always greater
than or equal to zero. This means that the conditional variance is normally greater
than the conditional mean. If α equals zero, the conditional variance is equal to the
conditional mean and the NBRM reduces to the PRM. That is, the PRM is nested
inside the NBRM, and therefore a test for overdispersion is a test for whether α ⫽ 0.
This can be performed using either a nested chi-squared test or a Wald test of the form
z ⫽ α
ˆ
/σ
α
ˆ
. The two tests are asymptotically equivalent (Cameron and Trivedi, 1998).
However, in that α cannot be less than zero, the distribution of these test statistics is
nonstandard. Thus, when performing the chi-squared test at a given level of signifi-
cance, say δ, we use the critical value of 2δ for the test statistic as the criterion. For
the Wald test, we simply use the critical value corresponding to δ rather than δ/2. For
example, performing the chi-squared or Wald test at the .05 level for H
0
: α ⫽ 0
involves using the critical χ
2
value corresponding to the .1 level, or the critical z value
corresponding to the .05 level, and so on (Cameron and Trivedi, 1998).
366 REGRESSION MODELS FOR AN EVENT COUNT