Multi-period credit default prediction with time-varying covariates. Walter Orth University of Cologne, Department of Statistics and Econometrics

Similar documents
Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Lecture 6: Univariate Volatility Modelling: ARCH and GARCH Models

A Non-Parametric Approach of Heteroskedasticity Robust Estimation of Vector-Autoregressive (VAR) Models

Corporate Finance Data & The Role of Dynamic Panels. Mark Flannery, University of Florida Kristine W. Hankins, University of Kentucky

Generalized Autoregressive Score Models

Clustered Defaults. Jin-Chuan Duan. (First draft: November 9, 2009; Current version: March 25, 2010)

A Guide to Modern Econometric:

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis

Solutions of the Financial Risk Management Examination

TMA 4275 Lifetime Analysis June 2004 Solution

Introduction to Econometrics

Volatility. Gerald P. Dwyer. February Clemson University

Testing an Autoregressive Structure in Binary Time Series Models

Duration Analysis. Joan Llull

9) Time series econometrics

Modelling geoadditive survival data

Beyond the Target Customer: Social Effects of CRM Campaigns

Warwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014

ECON/FIN 250: Forecasting in Finance and Economics: Section 4.1 Forecasting Fundamentals

Arma-Arch Modeling Of The Returns Of First Bank Of Nigeria

Survival Analysis. Stat 526. April 13, 2018

MAS3301 / MAS8311 Biostatistics Part II: Survival

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

Lecture 8: Multivariate GARCH and Conditional Correlation Models

Switching Regime Estimation

Lecture 2: Univariate Time Series

FORECASTING THE RESIDUAL DEMAND FUNCTION IN

xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models

Lecture 7 Time-dependent Covariates in Cox Regression

Forecasting the term structure interest rate of government bond yields

Modelling of Dependent Credit Rating Transitions

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

A Survival Analysis of GMO vs Non-GMO Corn Hybrid Persistence Using Simulated Time Dependent Covariates in SAS

GARCH Models. Eduardo Rossi University of Pavia. December Rossi GARCH Financial Econometrics / 50

Consider Table 1 (Note connection to start-stop process).

Using EViews Vox Principles of Econometrics, Third Edition

Clustered Defaults. Jin-Chuan Duan. (November 12, 2009)

Financial Econometrics and Volatility Models Copulas

Introduction to Econometrics

ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications

End-Semester Examination MA 373 : Statistical Analysis on Financial Data

RATING TRANSITIONS AND DEFAULT RATES

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

IFRS9 PD modeling Presentation MATFYZ Konstantin Belyaev

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS

Frailty Modeling for clustered survival data: a simulation study

Chapter 1. Probability, Random Variables and Expectations. 1.1 Axiomatic Probability

Analysing geoadditive regression data: a mixed model approach

Machine Learning Linear Classification. Prof. Matteo Matteucci

Comparing Predictive Accuracy, Twenty Years Later: On The Use and Abuse of Diebold-Mariano Tests

Econometric Analysis of Cross Section and Panel Data

Estimating Global Bank Network Connectedness

A simple alternative to the linear probability model for binary choice models with endogenous regressors

Logistic regression model for survival time analysis using time-varying coefficients

Functional Coefficient Models for Nonstationary Time Series Data

1 Description of variables

DEM Working Paper Series. Credit risk predictions with Bayesian model averaging

A Modified Fractionally Co-integrated VAR for Predicting Returns

Lecture 9: Markov Switching Models

Losses Given Default in the Presence of Extreme Risks

Nowcasting GDP with Real-time Datasets: An ECM-MIDAS Approach

Survival Analysis Math 434 Fall 2011

Tasmanian School of Business & Economics Economics & Finance Seminar Series 1 February 2016

Appendix 1 Model Selection: GARCH Models. Parameter estimates and summary statistics for models of the form: 1 if ɛt i < 0 0 otherwise

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Accounting for Missing Values in Score- Driven Time-Varying Parameter Models

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Regression: Ordinary Least Squares

ESRI Research Note Nowcasting and the Need for Timely Estimates of Movements in Irish Output

Wageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

2. Multivariate ARMA

MULTINOMIAL LOGISTIC REGRESSION

6. Assessing studies based on multiple regression

Econometrics. 9) Heteroscedasticity and autocorrelation

Multistate Modeling and Applications

ISQS 5349 Spring 2013 Final Exam

Bayesian modelling of real GDP rate in Romania

University of Pretoria Department of Economics Working Paper Series

Chapter 11. Regression with a Binary Dependent Variable

Statistical Inference and Methods

Financial Econometrics

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

Bootstrapping the Grainger Causality Test With Integrated Data

Intermediate Econometrics

NOWCASTING THE NEW TURKISH GDP

Financial Econometrics and Quantitative Risk Managenent Return Properties

1 Phelix spot and futures returns: descriptive statistics

Supply Chain Network Structure and Risk Propagation

A New Approach to Rank Forecasters in an Unbalanced Panel

Chapter 2 Inference on Mean Residual Life-Overview

Variable Selection in Predictive Regressions

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Homework Solutions Applied Logistic Regression

Logistic Regression with the Nonnegative Garrote

Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures

Discrete Choice Modeling

Model Selection in GLMs. (should be able to implement frequentist GLM analyses!) Today: standard frequentist methods for model selection

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Click Prediction and Preference Ranking of RSS Feeds

Transcription:

with time-varying covariates Walter Orth University of Cologne, Department of Statistics and Econometrics

2 20 Overview Introduction Approaches in the literature The proposed models Empirical analysis Conclusions

Introduction 3 20 Motivation Problem: Default prediction with a flexible multi-period time horizon Objective: Development of a model with high (out-of-sample) discriminatory power, i.e. a model that ranks the obligors according to their default probabilities accurately.

Introduction 4 20 Multi-period vs. single-period default prediction models Only a small fraction of the default prediction literature deals with multi-period predictions.

Introduction 4 20 Multi-period vs. single-period default prediction models Only a small fraction of the default prediction literature deals with multi-period predictions. Common approach: Modelling one-year default probabilities by estimating a discrete-time hazard model with covariates lagged by one year.

Introduction 4 20 Multi-period vs. single-period default prediction models Only a small fraction of the default prediction literature deals with multi-period predictions. Common approach: Modelling one-year default probabilities by estimating a discrete-time hazard model with covariates lagged by one year. Such a model cannot be easily extended to more than one year because the future values of the covariates are unknown. does not use all information if data are quarterly/monthly.

Introduction 5 20 Basic notation Y : Lifetime / Time until default Definition of hazard rate in discrete time: Definition in continuous time: λ(y) = lim y 0 λ(y) = P(Y = y Y y) P(y Y < y + y Y y) y

Introduction 5 20 Basic notation Y : Lifetime / Time until default Definition of hazard rate in discrete time: Definition in continuous time: λ(y) = lim y 0 λ(y) = P(Y = y Y y) P(y Y < y + y Y y) y We observe obligor i, i = 1,...,n, for t i periods recording the default history and time-varying covariates x it ( panel data).

Introduction 5 20 Basic notation Y : Lifetime / Time until default Definition of hazard rate in discrete time: Definition in continuous time: λ(y) = lim y 0 λ(y) = P(Y = y Y y) P(y Y < y + y Y y) y We observe obligor i, i = 1,...,n, for t i periods recording the default history and time-varying covariates x it ( panel data). Y it : Lifetime of obligor i starting at t

Introduction 5 20 Basic notation Y : Lifetime / Time until default Definition of hazard rate in discrete time: Definition in continuous time: λ(y) = lim y 0 λ(y) = P(Y = y Y y) P(y Y < y + y Y y) y We observe obligor i, i = 1,...,n, for t i periods recording the default history and time-varying covariates x it ( panel data). Y it : Lifetime of obligor i starting at t Main economic interest: Default probability P(Y it H) for various prediction horizons H given the information available until t

Approaches in the literature 6 20 Approaches that involve covariate forecasting Continuous-time model of Duffie et al. (JFE 2007): λ(t,x it ) = exp(β x it ) The (four) covariates are modelled with Gaussian panel vector autoregressions. The probability of default until time H is given by [ ( )] P(Y it H) = 1 E exp H which is approximated by numerical methods. 0 λ(t + s,x i,t+s )ds A similar approach that also involves the estimation of a covariate forecasting model is given by Hamerle et al. (JFF 2006).,

Approaches in the literature 7 20 Drawbacks of approaches with covariate forecasting Complexity: A multivariate density forecast for a vector of covariates over multiple periods is needed.

Approaches in the literature 7 20 Drawbacks of approaches with covariate forecasting Complexity: A multivariate density forecast for a vector of covariates over multiple periods is needed. This complexity either results in highly parameterized models (that may perform poorly out of sample) or very restrictive assumptions in order to reduce dimensionality.

Approaches in the literature 7 20 Drawbacks of approaches with covariate forecasting Complexity: A multivariate density forecast for a vector of covariates over multiple periods is needed. This complexity either results in highly parameterized models (that may perform poorly out of sample) or very restrictive assumptions in order to reduce dimensionality. Computational burden since closed-form solutions are usually not available.

Approaches in the literature 8 20 Stepwise lagging of covariates Campbell et al. (JF 2008) estimate discrete-time hazard models lagging the covariates by s months, s = 6, 12, 24, 36: λ(t + s,x it ) = [1 + exp(β sx it )] 1 If we extend this idea and apply a stepwise lagging procedure (SLP) by estimating the model for every s, s = 1,...,H, the H-period default probabilities are given by: P(Y it H) = 1 H [1 λ(t + s,x it )] s=1

9 20 Overview Introduction Approaches in the literature The proposed models Empirical analysis Conclusions

The proposed models 10 20 We propose to specify the hazard rate in period t + s as a function of the forecast time s and the covariates in period t. For instance, within the proportional hazard specification we get λ(t + s,x it ) = λ 0 (s)exp(β x it )

The proposed models 10 20 We propose to specify the hazard rate in period t + s as a function of the forecast time s and the covariates in period t. For instance, within the proportional hazard specification we get λ(t + s,x it ) = λ 0 (s)exp(β x it ) In this model, each covariate vector x it in our panel is connected to the corresponding lifetime Y it.

The proposed models 10 20 We propose to specify the hazard rate in period t + s as a function of the forecast time s and the covariates in period t. For instance, within the proportional hazard specification we get λ(t + s,x it ) = λ 0 (s)exp(β x it ) In this model, each covariate vector x it in our panel is connected to the corresponding lifetime Y it. Note that conventional models would be specified as λ(t,x it ) = λ 0 (t)exp(β x it ) leaving those models with the problem that the covariates are not known in t + s.

The proposed models 10 20 We propose to specify the hazard rate in period t + s as a function of the forecast time s and the covariates in period t. For instance, within the proportional hazard specification we get λ(t + s,x it ) = λ 0 (s)exp(β x it ) In this model, each covariate vector x it in our panel is connected to the corresponding lifetime Y it. Note that conventional models would be specified as λ(t,x it ) = λ 0 (t)exp(β x it ) leaving those models with the problem that the covariates are not known in t + s. The H-period default probabilities are easily calculated as P(Y it H) = 1 exp( H 0 λ(t + s,x it)ds).

The proposed models 10 20 We propose to specify the hazard rate in period t + s as a function of the forecast time s and the covariates in period t. For instance, within the proportional hazard specification we get λ(t + s,x it ) = λ 0 (s)exp(β x it ) In this model, each covariate vector x it in our panel is connected to the corresponding lifetime Y it. Note that conventional models would be specified as λ(t,x it ) = λ 0 (t)exp(β x it ) leaving those models with the problem that the covariates are not known in t + s. The H-period default probabilities are easily calculated as P(Y it H) = 1 exp( H 0 λ(t + s,x it)ds). In our specification we only have to estimate the model once in contrast to the stepwise lagging approach.

The proposed models 11 20 Estimation Clearly, the lifetimes Y it are not (conditionally) independent. For instance, Y it already covers the lifetime Y i,t+1 plus one additional period.

The proposed models 11 20 Estimation Clearly, the lifetimes Y it are not (conditionally) independent. For instance, Y it already covers the lifetime Y i,t+1 plus one additional period. However, we can consistently (n ) estimate our model treating the observations as independent. Let C it be the censoring indicator corresponding to Y it.the pseudo log likelihood function is given by log L = n t i 1 (1 C it ) log(λ(t +Y it,x it ))+log(1 F(t +Y it,x it )) i=1 t=1

The proposed models 11 20 Estimation Clearly, the lifetimes Y it are not (conditionally) independent. For instance, Y it already covers the lifetime Y i,t+1 plus one additional period. However, we can consistently (n ) estimate our model treating the observations as independent. Let C it be the censoring indicator corresponding to Y it.the pseudo log likelihood function is given by log L = n t i 1 (1 C it ) log(λ(t +Y it,x it ))+log(1 F(t +Y it,x it )) i=1 t=1 For valid inference, we have to adjust the standard errors for the clustering within the observations of each obligor.

The proposed models 12 20 The log-logistic model The proportional hazards (PH) specification given above assumes that hazard ratios are constant over forecast time. However, several studies find that hazard rates of different firms tend to approach each other.

The proposed models 12 20 The log-logistic model The proportional hazards (PH) specification given above assumes that hazard ratios are constant over forecast time. However, several studies find that hazard rates of different firms tend to approach each other. In contrast, proportional odds (PO) models imply that the hazard ratios converge monotonically towards one (Bennett, AS 1983).

The proposed models 12 20 The log-logistic model The proportional hazards (PH) specification given above assumes that hazard ratios are constant over forecast time. However, several studies find that hazard rates of different firms tend to approach each other. In contrast, proportional odds (PO) models imply that the hazard ratios converge monotonically towards one (Bennett, AS 1983). The most common PO model is the log-logistic model where the hazard rate is given by λ(t + s, x it ) = αexp(β x it ) α s α 1 1 + [exp(β x it )s] α The CDF evaluated at H (which gives the default probabilities) is P(Y it H) = 1 1 + [exp(β x it )H] α

13 20 Overview Introduction Approaches in the literature The proposed models Empirical analysis Conclusions

Empirical analysis 14 20 The dataset Default histories, balance sheet and stock market variables for North American public firms from Compustat and CRSP

Empirical analysis 14 20 The dataset Default histories, balance sheet and stock market variables for North American public firms from Compustat and CRSP Excluding financial firms we have 339,222 non-missing firm-months and 3575 firms in the time from December 1980 until March 2010.

Empirical analysis 14 20 The dataset Default histories, balance sheet and stock market variables for North American public firms from Compustat and CRSP Excluding financial firms we have 339,222 non-missing firm-months and 3575 firms in the time from December 1980 until March 2010. We observe 498 different default events, but our definition of Y it leads to 18,914 lifetimes in our sample that end with a default.

Empirical analysis 15 20 Selection of regressors Using a general-to-specific variable selection approach based on candidate variables taken from related studies we end up with the following set of regressors: Profitability: Net Income / Total Assets (NITA) Leverage: Total Liabilities / Total Assets (TLTA) Growth: Dummy for very high or very low growth of Total Assets (GRO) Stock return: Excess one-year log return over S&P 500 (RET) Volatility: Standard deviation of monthly log returns over previous year (VOLA) Size: Log of market value relative to total market value of S&P 500 (SIZE)

Empirical analysis 16 20 Estimation results Cox model (PH) Log-logistic model Coef. Std. Err. Coef. Std. Err. NITA -5.60 (1.36) -6.80 (1.27) TLTA 2.43 (0.30) 2.31 (0.25) GRO 0.21 (0.05) 0.18 (0.05) RET -0.83 (0.06) -0.81 (0.05) VOLA 6.14 (0.53) 6.06 (0.46) SIZE -0.37 (0.03) -0.34 (0.03) const. 11.99 (0.28) α 1.26 (0.02)

Empirical analysis 17 20 Evaluation of predictive power Competitors: Cox model, log-logistic model, stepwise lagging procedure (SLP) and Standard & Poor s Long Term Issuer Credit Ratings

Empirical analysis 17 20 Evaluation of predictive power Competitors: Cox model, log-logistic model, stepwise lagging procedure (SLP) and Standard & Poor s Long Term Issuer Credit Ratings Prediction horizons: 1, 3 and 5 years

Empirical analysis 17 20 Evaluation of predictive power Competitors: Cox model, log-logistic model, stepwise lagging procedure (SLP) and Standard & Poor s Long Term Issuer Credit Ratings Prediction horizons: 1, 3 and 5 years For a given sample month t, we calculate the Accuracy Ratio and Harrell s C for the out-of-sample predictions made at t. We then take a weighted average of the time series of indices using the number of firms observed in t as weights.

Empirical analysis 17 20 Evaluation of predictive power Competitors: Cox model, log-logistic model, stepwise lagging procedure (SLP) and Standard & Poor s Long Term Issuer Credit Ratings Prediction horizons: 1, 3 and 5 years For a given sample month t, we calculate the Accuracy Ratio and Harrell s C for the out-of-sample predictions made at t. We then take a weighted average of the time series of indices using the number of firms observed in t as weights. Range of t: December 1995 - March 2005.

Empirical analysis 18 20 Out-of-sample predictive power 1 year 3 years 5 years AR C AR C AR C log-logistic.8939.8862.7864.7672.7436.7104 Cox.8917.8840.7819.7628.7389.7059 SLP.8906.8829.7785.7586.7338.6993 S&P.8234.8149.7625.7338.7417.6943

Empirical analysis 19 20 Testing for significant differences Using the bootstrap we tested for significant differences in out-of-sample predictive accuracy. The tests yield the following main results: The log-logistic model has significantly more predictive power (α =.1) than all alternatives and at all horizons with the exception of Standard & Poor s for the 5-year horizon.

Empirical analysis 19 20 Testing for significant differences Using the bootstrap we tested for significant differences in out-of-sample predictive accuracy. The tests yield the following main results: The log-logistic model has significantly more predictive power (α =.1) than all alternatives and at all horizons with the exception of Standard & Poor s for the 5-year horizon. The stepwise lagging procedure (SLP) is significantly worse (α =.05) than both the log-logistic and the Cox model under all measures and horizons. This is probably due to overparameterization.

Conclusions 20 20 Main results We have derived a simple modeling approach for multi-period default predictions that does not involve the problem of forecasting covariates.

Conclusions 20 20 Main results We have derived a simple modeling approach for multi-period default predictions that does not involve the problem of forecasting covariates. The empirical part showed that our approach has high out-of-sample predictive power.

Conclusions 20 20 Main results We have derived a simple modeling approach for multi-period default predictions that does not involve the problem of forecasting covariates. The empirical part showed that our approach has high out-of-sample predictive power. The proportional odds model in the log-logistic specification was shown to fit significantly better in our application than the workhorse of survival analysis, the Cox proportional hazards model.