In Practice: Using a Probit to Estimate the Probability of Bankruptcy
It is possible to estimate the probability of default using statistical techniques,
when there is sufficient data avaialble. For instance, if we have a database that lists all
firms that went bankrupt during a period of time, as well as firms that did not go bankrupt
during the same period, together with descriptive characteristics on these firms, a probit
analysis can be used to estimate the likelihood of bankruptcy as a function of these
characteristics. The steps involved in a probit analysis are as follows:
1. Identify the event of interest: Probits work best when the event either occurs or it
does not. For bankruptcy, the event might be the filing for bankruptcy protection
under the law.
2. Over a specified time period, collect information on all the firms that were exposed to
the event. In the bankruptcy case, this would imply collecting information on which
firms that filed for bankruptcy over a certain period (say, 5 years).
3. Based upon your knowledge of the event, and other research on it, specify measurable
and observable variables that are likely to be good predictors of that event. In the case
of bankruptcy, these might include excessive debt ratios, declining income, poor
project returns and small market capitalization.
4. Collect information on these variables for the firms that filed for bankruptcy, at the
time of the filing. Collect the same information for all other firms that were in
existence at the same time, and which have data available on them on these variables.
(If this is too data intensive, a random sampling of the firms that were not exposed to
the event can be used.) In the bankruptcy analysis, this would imply collecting
information on debt ratios, income trends, project returns and market capitalization on
the firms that filed for bankruptcy at the time of the filing, and all other firms across
the period.
5. In a probit, the dependent variable is the occurrence of the specified event (1 if it
occurs, 0 if it does not) and the independent variables are the variables specified in
step 3. The output from the probit looks very much like the output from a multiple
regression, with statistical significance attached to each of the independent variables.
24
In Warner’s study of railroad bankruptcies, the direct cost of bankruptcy seems to be about 5%.