Model Risk With. Default. Estimates of Probabilities of. Dirk Tasche Imperial College, London. June 2015

Similar documents
RATING TRANSITIONS AND DEFAULT RATES

Rating Transitions and Defaults Conditional on Rating Outlooks Revisited:

Does quantification without adjustments work?

Issues using Logistic Regression for Highly Imbalanced data

Expected Shortfall is not elicitable so what?

U.S. Retail/Restaurants Sector: Historical Performance And Current Risks

Expected Shortfall is not elicitable so what?

How to evaluate credit scorecards - and why using the Gini coefficient has cost you money

Distribution Date: 26-Dec-07

Fisher Consistency for Prior Probability Shift

c 4, < y 2, 1 0, otherwise,

Losses Given Default in the Presence of Extreme Risks

U.S. Consumer Products And Retail/Restaurants Sector: Historical Performance And Current Risks

Penarth Master Issuer plc - Monthly Report October 2012 Combined Series Report For IPD Ending: 19 November 2012

Default & Recovery Rates of Corporate Bond Issuers

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Lecture 9: Classification, LDA

Week 2 Quantitative Analysis of Financial Markets Bayesian Analysis

Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project

Lecture 9: Classification, LDA

Penarth Master Issuer plc - Monthly Report November 2012 Combined Series Report For IPD Ending: 18 December 2012

Lecture 9: Classification, LDA

INTRODUCTION TO PATTERN RECOGNITION

Statistical Machine Learning Hilary Term 2018

Fieller Stability Measure: A Novel Model-dependent Backtesting Approach

MIRA, SVM, k-nn. Lirong Xia

Probabilities & Statistics Revision

CS798: Selected topics in Machine Learning

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Bayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework

Generalized Autoregressive Score Models

IFRS9 PD modeling Presentation MATFYZ Konstantin Belyaev

Statistical Learning Reading Assignments

15-388/688 - Practical Data Science: Basic probability. J. Zico Kolter Carnegie Mellon University Spring 2018

Claris Finance 2003 Srl

Normal Random Variables and Probability

arxiv:cond-mat/ v1 23 Jul 2002

Programme Specification (Undergraduate) Chemistry

VUS and HUM Represented with Mann-Whitney Statistic

Machine Learning Lecture 2

L11: Pattern recognition principles

Linear vs Non-linear classifier. CS789: Machine Learning and Neural Network. Introduction

Nonparametric Bayesian Methods (Gaussian Processes)

Towards Maximum Geometric Margin Minimum Error Classification

GS Mortgage Securities Corporation II Trust Pass-Through Certificates Series 2005-ROCK

Course Outline. TERM EFFECTIVE: Fall 2017 CURRICULUM APPROVAL DATE: 03/27/2017

COM336: Neural Computing

Supporting Statistical Hypothesis Testing Over Graphs

BAYESIAN DECISION THEORY

Data Mining. Practical Machine Learning Tools and Techniques. Slides for Chapter 4 of Data Mining by I. H. Witten, E. Frank and M. A.

Loss Estimation using Monte Carlo Simulation

Tagus - Sociedade de Titularizacao de Creditos

An Introduction to Machine Learning

Can macro variables improve transition matrix forecasting?

Public Disclosure Copy

Monthly Investor Report 31 October Fastnet Securities 5 Limited

Claris Finance 2003 Srl

Advanced Introduction to Machine Learning CMU-10715

Calculating credit risk capital charges with the one-factor model

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Machine Learning Lecture 2

Model Accuracy Measures

Unsupervised Learning with Permuted Data

A traffic lights approach to PD validation

Classification: Linear Discriminant Analysis

A fast and simple algorithm for training neural probabilistic language models

Probability Theory for Machine Learning. Chris Cremer September 2015

Unsupervised Learning

Mixture Models and Expectation-Maximization

Course Outline 4/10/ Santa Teresa Blvd Gilroy, CA COURSE: MATH 205A DIVISION: 10 ALSO LISTED AS: SHORT TITLE: FIRST HALF ALGEBRA

STA414/2104 Statistical Methods for Machine Learning II

SUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION

Maths for Signals and Systems Linear Algebra in Engineering

Intelligent Systems Statistical Machine Learning

This appendix provides a very basic introduction to linear algebra concepts.

DS Machine Learning and Data Mining I. Alina Oprea Associate Professor, CCIS Northeastern University

Argo Mortgages 2 S.r.l

Machine Learning Linear Models

PATTERN RECOGNITION AND MACHINE LEARNING

Probabilistic classification CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2016

INFORMATION VALUE ESTIMATOR FOR CREDIT SCORING MODELS

LECTURE NOTE #3 PROF. ALAN YUILLE

Distribution Date: 27-Aug-07

Public Disclosure Copy

Lecture 2. Introduction to Systems (Lathi )

Machine Learning Lecture 2

Discriminative Direction for Kernel Classifiers

1: PROBABILITY REVIEW

CS 188: Artificial Intelligence. Outline

Three-group ROC predictive analysis for ordinal outcomes

Classification of high dimensional data: High Dimensional Discriminant Analysis

Multivariate Distributions

Chapter 7, Part A Sampling and Sampling Distributions

Classification. 1. Strategies for classification 2. Minimizing the probability for misclassification 3. Risk minimization

Pattern Recognition and Machine Learning. Bishop Chapter 2: Probability Distributions

Clustering and Gaussian Mixtures

Clustering. Professor Ameet Talwalkar. Professor Ameet Talwalkar CS260 Machine Learning Algorithms March 8, / 26

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

Data Mining Techniques

Transcription:

Model Risk With Estimates of Probabilities of Default Dirk Tasche Imperial College, London June 2015

The views expressed in the following material are the author s and do not necessarily represent the views of the Global Association of Risk Professionals (GARP), its Membership or its Management. 2

Outline Two forecasting problems A taxonomy of dataset shift Estimation under prior probability shift assumption Estimation under invariant density ratio assumption An application to the mitigation of model risk for PD estimation Concluding remarks References Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 2 / 18

Two forecasting problems Single borrower s probability of default (PD) Moody s corporate issuer and default counts in 2008 2. Grade Caa-C B Ba Baa A Aa Aaa All Issuers 417 1151 528 1021 966 582 140 4805 Defaults 63 25 6 5 5 4 0 108 January 1, 2009: What is a Baa-rated borrower s probability to default in 2009? Natural (?) estimate: 5 1021 0.49%. 2 Source: Moody s (2015) Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 3 / 18

Rating profile known Two forecasting problems Moody s corporate issuer proportions and default rates in 2008 and issuer proportions in 2009 3. All numbers in %. Grade 2008 2009 Issuers Default rate Issuers Default rate Caa-C 8.7 15.1 11.4? B 24.0 2.2 20.9? Ba 11.0 1.1 11.0? Baa 21.2 0.5 21.9? A 20.1 0.5 20.7? Aa 12.1 0.7 11.2? Aaa 2.9 0.0 2.8? All 100.0 2.2 100.0? How to take account of the additional data? 3 Source: Moody s (2015) Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 4 / 18

Two forecasting problems Some thoughts Compared to 2008, the rating profile in January 2009 has changed. Why should the grade-level or total default rates remain the same? Invariant grade-level default rates would imply almost invariant discriminatory power: Observed accuracy ratio in 2008: 63.4% Forecast accuracy ratio for 2009: 66.6% What other ways are there to reflect almost invariant discriminatory power? Assuming an invariant accuracy ratio is not sufficient for inferring PDs if only the rating profile is known. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 5 / 18

Two forecasting problems Alternatives to invariant grade-level default rates Geometric interpretations of accuracy ratio (for continuous score): Based on area under Receiver Operating Characteristic (ROC) Based on area between Cumulative Accuracy Profile (CAP) and diagonal Derivative of CAP is (essentially) the PD curve of the scores. Derivative of ROC is (essentially) the density ratio of the scores. Is invariant density ratio a viable alternative? We also look at invariant rating profiles of defaulters and non-defaulters as an alternative assumption on almost invariant discriminatory power. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 6 / 18

A taxonomy of dataset shift The Machine Learning perspective Classification on datasets with changed distributions is a problem well-known in Machine Learning. Moreno-Torres et al. (2012) proposed a taxonomy for dataset shifts. Setting: Each item in a dataset has a class y and a covariates vector x. p test (x, y) and p trai (x, y) are the joint distributions of (x, y) on the test and training sets respectively. ptrai (x, y) is known from observation but for p test (x, y) only the marginal distribution p test (x) is observable now. How to determine unconditional class probabilities p test (y = c) and conditional class probabilities p test (y = c x)? Definition: Dataset shift occurs if p test (x, y) p trai (x, y). On Slide 4, y = default status, x = rating grade. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 7 / 18

A taxonomy of dataset shift The Moreno-Torres et al. taxonomy Four types of dataset shift p test (x, y) p trai (x, y): Covariate shift: p test (x) p trai (x), but p test (y x) = p trai (y x). Prior probability shift: p test (y) p trai (y), but p test (x y) = p trai (x y). Concept shift: p test (x) = p trai (x), but p test (y x) p trai (y x), p test (y) = p trai (y), but p test (x y) p trai (x y). or Other shifts. Assuming invariant grade-level default rates on Slide 4 is equivalent to an assumption of covariate shift. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 8 / 18

Estimation under prior probability shift assumption Moody s corporate rating profiles 2008 Frequency 0.0 0.1 0.2 0.3 0.4 0.5 Defaulters All Non defaulters Caa C B Ba Baa A Aa Aaa Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 9 / 18

Estimation under prior probability shift assumption The least-squares estimator Setting as on Slide 4: y = default status (classes D default and N non-default), x = rating grade p test (x) known, but p test (x) p trai (x) Want to determine ptest = p test (y = D) and p test (y = D x) Prior probability shift assumption: p test (x y = c) = p trai (x y = c) for c = D, N. Hence, for all x, the class probability p test should satisfy p test (x) = p test p trai (x y = D) + (1 p test ) p trai (x y = N). This is unlikely to be achievable. Therefore least squares approximation ( )( ) p test (x) p trai (x y=n) p trai (x y=d) p trai (x y=n) dx p test = ) 2dx. (1) ( p trai (x y=d) p trai (x y=n) Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 10 / 18

Estimation under prior probability shift assumption Fitted and observed Moody s corporate rating profiles Fitted profile 2009 Observed profile 2009 Frequency 0.00 0.05 0.10 0.15 0.20 Caa C B Ba Baa A Aa Aaa Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 11 / 18

Estimation under invariant density ratio assumption The invariant density ratio estimator Setting as on Slide 4: y = default status (classes D default and N non-default), x = rating grade p test (x) known, but p test (x) p trai (x) Want to determine ptest = p test (y = D) and p test (y = D x) Invariant density ratio assumption: p test (x y = D) p test (x y = N) = p trai(x y = D) def = λ(x). p trai (x y = N) Then the class probability p test must satisfy λ(x) 1 0 = 1+(λ(x) 1) p test d p test (x). (2a) There is a unique solution to (2a) if and only if λ(x) d p test (x) > 1 and λ(x) 1 d p test (x) > 1. (2b) Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 12 / 18

Estimation under invariant density ratio assumption Properties If condition (2b) is not satisfied then the profiles p test (x) and p trai (x) are so different that any inheritance of discriminatory power seems questionable. Exact fit: With the invariant density ratio estimate p test come estimates of the conditional densities p test (x y = c), c = D, N, such that λ(x) = p test (x y=d) p test (x y=n), and p test (x) = p test p test (x y = D) + (1 p test ) p test (x y = N). Call p test = p trai (y = D x) d p test (x) the covariate shift estimator of p test. Then it holds that p test = (1 π) p trai (y = D) + π p test. (3) Where 0 π 1 and π is the closer to 1 the more discriminatory power the scores x have on the training set. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 13 / 18

Estimation under invariant density ratio assumption Conditional profiles 2009: Invariant from 2008 vs. fitted Defaulters Frequency Frequency 0.0 0.2 0.4 0.6 0.00 0.10 0.20 Observed 2008 Fitted 2009 Observed 2009 Caa C B Ba Baa A Aa Aaa Non defaulters Observed 2008 Fitted 2009 Observed 2009 Caa C B Ba Baa A Aa Aaa Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 14 / 18

An application to the mitigation of model risk for PD estimation Different forecast methods Three methods of forecasting portfolio-wide default rate p test : Covariate shift estimator p test (Slide 13) Prior probability shift estimator p test, (1) Invariant density ratio estimator p test, (2a) p test and p test give the same estimate under a prior probability shift. Estimates p test and p test of p test provide estimates of the conditional default rates p test (y = D x) by p test (y = D x) = p test λ(x) 1 + (λ(x) 1) p test. (3) suggests that max(p test, p test ) could be an upper bound for next year s portfolio-wide default rate. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 15 / 18

An application to the mitigation of model risk for PD estimation Observed vs. forecast corporate default rates 4 Per cent 2 4 6 8 Observed Covariate shift forecast Invariant Density Ratio forecast 1990 1995 2000 2005 2010 2015 Year 4 Source of observed rates: Moody s (2015). Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 16 / 18

Concluding remarks Straightforward covariate shift (or invariant conditional default rate ) PD estimates sometimes may seriously underestimate future default rates. The invariant density ratio approach often provides very different estimates that may be used for model risk mitigation. The invariant density ratio approach can also be applied for the estimation of loss rates. Rating agency data like Moody s (2015) possibly are subjective, making results of the approach conservative. Further reading: Background and more details: Tasche (2013), Tasche (2014) More sophisticated approaches: Hofer and Krempl (2013), Hofer (2015) Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 17 / 18

References V. Hofer. Adapting a classification rule to local and global shift when only unlabelled data are available. European Journal of Operational Research, 243(1):177 189, 2015. V. Hofer and G. Krempl. Drift mining in data: A framework for addressing drift in classification. Computational Statistics & Data Analysis, 57(1):377 391, 2013. Moody s. Annual Default Study: Corporate Default and Recovery Rates, 1920-2014. Special comment, Moody s Investors Service, March 2015. J.G. Moreno-Torres, T. Raeder, R. Alaiz-Rodriguez, N.V. Chawla, and F. Herrera. A unifying view on dataset shift in classification. Pattern Recognition, 45(1):521 530, 2012. D. Tasche. The art of probability-of-default curve calibration. Journal of Credit Risk, 9(4):63 103, 2013. D. Tasche. Exact fit of simple finite mixture models. Journal of Risk and Financial Management, 7(4):150 164, 2014. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 18 / 18