TY - JOUR
T1 - On maximum likelihood estimation of competing risks using the cause-specific semi-parametric Cox model with time-varying covariates
T2 - An application to credit risk
AU - Thackham, Mark
AU - Ma, Jun
PY - 2022
Y1 - 2022
N2 - Credit-granting institutions need to estimate the probability of loan default, which represents the chance a customer fails to make repayments as promised. Critically this estimation is intertwined with the competing risk a customer fully repays their loan while also having key predictive drivers with values that change over time. A conventional model in this setting is a competing risks Cox Model with time-varying covariates. However partial likelihood estimation of this model has two shortcomings: (1) the baseline hazard is not estimated, so calculating probabilities requires a further estimation step; and (2) a covariance matrix for both regression coefficients and the baseline hazard is not produced. This paper caters for these shortcomings by devising a maximum likelihood technique to jointly estimate regression coefficients and the cause-specific baseline hazards using constrained optimisation to ensure the latter’s non-negativity. We show via simulation our technique produces regression coefficients estimates with lower bias in small samples with heavy censoring. When applied to a real-world credit risk dataset consisting of home loan data our Maximum Likelihood approach produces a smoother estimate of the cause-specific baseline hazards for default and redemption than those obtained using the Partial Likelihood and Breslow approach. This provides better clarity of the shape of these functions through both a less volatile central estimate as well as quantifying the error of this central estimate. We implement our method in R.
AB - Credit-granting institutions need to estimate the probability of loan default, which represents the chance a customer fails to make repayments as promised. Critically this estimation is intertwined with the competing risk a customer fully repays their loan while also having key predictive drivers with values that change over time. A conventional model in this setting is a competing risks Cox Model with time-varying covariates. However partial likelihood estimation of this model has two shortcomings: (1) the baseline hazard is not estimated, so calculating probabilities requires a further estimation step; and (2) a covariance matrix for both regression coefficients and the baseline hazard is not produced. This paper caters for these shortcomings by devising a maximum likelihood technique to jointly estimate regression coefficients and the cause-specific baseline hazards using constrained optimisation to ensure the latter’s non-negativity. We show via simulation our technique produces regression coefficients estimates with lower bias in small samples with heavy censoring. When applied to a real-world credit risk dataset consisting of home loan data our Maximum Likelihood approach produces a smoother estimate of the cause-specific baseline hazards for default and redemption than those obtained using the Partial Likelihood and Breslow approach. This provides better clarity of the shape of these functions through both a less volatile central estimate as well as quantifying the error of this central estimate. We implement our method in R.
KW - Competing risk cause specific Cox models
KW - time-varying covariates
KW - constrained maximum likelihood optimisation
KW - credit risk
UR - http://www.scopus.com/inward/record.url?scp=85089028558&partnerID=8YFLogxK
U2 - 10.1080/01605682.2020.1800418
DO - 10.1080/01605682.2020.1800418
M3 - Article
SN - 0160-5682
VL - 73
SP - 5
EP - 14
JO - Journal of the Operational Research Society
JF - Journal of the Operational Research Society
IS - 1
ER -