Multinomial baseline-category logit models are a generalisation of
logistic regression, that allow to model not only binary or dichotomous
responses, but also polychotomous responses. In addition, they allow to
model responses in the form of counts that have a pre-determined sum.
These models are described in Agresti
(2002). Estimating these models is also supported by the function
multinom()
in the R package “nnet” (Venables and Ripley 2002). In the package
“mclogit”, the function to estimate these models is called
mblogit()
, which uses the infrastructure for estimating
conditional logit models, exploiting the fact that baseline-category
logit models can be re-expressed as condigional logit models.
Baseline-category logit models are constructed as follows. Suppose a categorical dependent variable or response with categories is observed for individuals . Let denote the probability that the value of the dependent variable for individual is equal to , then the baseline-category logit model takes the form:
where the first category () is the baseline category.
Equivalently, the model can be expressed in terms of log-odds, relative to the baseline-category:
Here the relevant parameters of the model are the coefficients which describe how the values of independent variables (numbered ) affect the relative chances of the response taking a value versus taking the value . Note that there is one coefficient for each independent variable and each response other than the baseline category.