Model Risk With. Default. Estimates of Probabilities of. Dirk Tasche Imperial College, London. June 2015

Size: px
Start display at page:

Download "Model Risk With. Default. Estimates of Probabilities of. Dirk Tasche Imperial College, London. June 2015"

Transcription

1 Model Risk With Estimates of Probabilities of Default Dirk Tasche Imperial College, London June 2015

2 The views expressed in the following material are the author s and do not necessarily represent the views of the Global Association of Risk Professionals (GARP), its Membership or its Management. 2

3 Outline Two forecasting problems A taxonomy of dataset shift Estimation under prior probability shift assumption Estimation under invariant density ratio assumption An application to the mitigation of model risk for PD estimation Concluding remarks References Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 2 / 18

4 Two forecasting problems Single borrower s probability of default (PD) Moody s corporate issuer and default counts in Grade Caa-C B Ba Baa A Aa Aaa All Issuers Defaults January 1, 2009: What is a Baa-rated borrower s probability to default in 2009? Natural (?) estimate: %. 2 Source: Moody s (2015) Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 3 / 18

5 Rating profile known Two forecasting problems Moody s corporate issuer proportions and default rates in 2008 and issuer proportions in All numbers in %. Grade Issuers Default rate Issuers Default rate Caa-C ? B ? Ba ? Baa ? A ? Aa ? Aaa ? All ? How to take account of the additional data? 3 Source: Moody s (2015) Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 4 / 18

6 Two forecasting problems Some thoughts Compared to 2008, the rating profile in January 2009 has changed. Why should the grade-level or total default rates remain the same? Invariant grade-level default rates would imply almost invariant discriminatory power: Observed accuracy ratio in 2008: 63.4% Forecast accuracy ratio for 2009: 66.6% What other ways are there to reflect almost invariant discriminatory power? Assuming an invariant accuracy ratio is not sufficient for inferring PDs if only the rating profile is known. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 5 / 18

7 Two forecasting problems Alternatives to invariant grade-level default rates Geometric interpretations of accuracy ratio (for continuous score): Based on area under Receiver Operating Characteristic (ROC) Based on area between Cumulative Accuracy Profile (CAP) and diagonal Derivative of CAP is (essentially) the PD curve of the scores. Derivative of ROC is (essentially) the density ratio of the scores. Is invariant density ratio a viable alternative? We also look at invariant rating profiles of defaulters and non-defaulters as an alternative assumption on almost invariant discriminatory power. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 6 / 18

8 A taxonomy of dataset shift The Machine Learning perspective Classification on datasets with changed distributions is a problem well-known in Machine Learning. Moreno-Torres et al. (2012) proposed a taxonomy for dataset shifts. Setting: Each item in a dataset has a class y and a covariates vector x. p test (x, y) and p trai (x, y) are the joint distributions of (x, y) on the test and training sets respectively. ptrai (x, y) is known from observation but for p test (x, y) only the marginal distribution p test (x) is observable now. How to determine unconditional class probabilities p test (y = c) and conditional class probabilities p test (y = c x)? Definition: Dataset shift occurs if p test (x, y) p trai (x, y). On Slide 4, y = default status, x = rating grade. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 7 / 18

9 A taxonomy of dataset shift The Moreno-Torres et al. taxonomy Four types of dataset shift p test (x, y) p trai (x, y): Covariate shift: p test (x) p trai (x), but p test (y x) = p trai (y x). Prior probability shift: p test (y) p trai (y), but p test (x y) = p trai (x y). Concept shift: p test (x) = p trai (x), but p test (y x) p trai (y x), p test (y) = p trai (y), but p test (x y) p trai (x y). or Other shifts. Assuming invariant grade-level default rates on Slide 4 is equivalent to an assumption of covariate shift. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 8 / 18

10 Estimation under prior probability shift assumption Moody s corporate rating profiles 2008 Frequency Defaulters All Non defaulters Caa C B Ba Baa A Aa Aaa Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 9 / 18

11 Estimation under prior probability shift assumption The least-squares estimator Setting as on Slide 4: y = default status (classes D default and N non-default), x = rating grade p test (x) known, but p test (x) p trai (x) Want to determine ptest = p test (y = D) and p test (y = D x) Prior probability shift assumption: p test (x y = c) = p trai (x y = c) for c = D, N. Hence, for all x, the class probability p test should satisfy p test (x) = p test p trai (x y = D) + (1 p test ) p trai (x y = N). This is unlikely to be achievable. Therefore least squares approximation ( )( ) p test (x) p trai (x y=n) p trai (x y=d) p trai (x y=n) dx p test = ) 2dx. (1) ( p trai (x y=d) p trai (x y=n) Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 10 / 18

12 Estimation under prior probability shift assumption Fitted and observed Moody s corporate rating profiles Fitted profile 2009 Observed profile 2009 Frequency Caa C B Ba Baa A Aa Aaa Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 11 / 18

13 Estimation under invariant density ratio assumption The invariant density ratio estimator Setting as on Slide 4: y = default status (classes D default and N non-default), x = rating grade p test (x) known, but p test (x) p trai (x) Want to determine ptest = p test (y = D) and p test (y = D x) Invariant density ratio assumption: p test (x y = D) p test (x y = N) = p trai(x y = D) def = λ(x). p trai (x y = N) Then the class probability p test must satisfy λ(x) 1 0 = 1+(λ(x) 1) p test d p test (x). (2a) There is a unique solution to (2a) if and only if λ(x) d p test (x) > 1 and λ(x) 1 d p test (x) > 1. (2b) Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 12 / 18

14 Estimation under invariant density ratio assumption Properties If condition (2b) is not satisfied then the profiles p test (x) and p trai (x) are so different that any inheritance of discriminatory power seems questionable. Exact fit: With the invariant density ratio estimate p test come estimates of the conditional densities p test (x y = c), c = D, N, such that λ(x) = p test (x y=d) p test (x y=n), and p test (x) = p test p test (x y = D) + (1 p test ) p test (x y = N). Call p test = p trai (y = D x) d p test (x) the covariate shift estimator of p test. Then it holds that p test = (1 π) p trai (y = D) + π p test. (3) Where 0 π 1 and π is the closer to 1 the more discriminatory power the scores x have on the training set. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 13 / 18

15 Estimation under invariant density ratio assumption Conditional profiles 2009: Invariant from 2008 vs. fitted Defaulters Frequency Frequency Observed 2008 Fitted 2009 Observed 2009 Caa C B Ba Baa A Aa Aaa Non defaulters Observed 2008 Fitted 2009 Observed 2009 Caa C B Ba Baa A Aa Aaa Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 14 / 18

16 An application to the mitigation of model risk for PD estimation Different forecast methods Three methods of forecasting portfolio-wide default rate p test : Covariate shift estimator p test (Slide 13) Prior probability shift estimator p test, (1) Invariant density ratio estimator p test, (2a) p test and p test give the same estimate under a prior probability shift. Estimates p test and p test of p test provide estimates of the conditional default rates p test (y = D x) by p test (y = D x) = p test λ(x) 1 + (λ(x) 1) p test. (3) suggests that max(p test, p test ) could be an upper bound for next year s portfolio-wide default rate. Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 15 / 18

17 An application to the mitigation of model risk for PD estimation Observed vs. forecast corporate default rates 4 Per cent Observed Covariate shift forecast Invariant Density Ratio forecast Year 4 Source of observed rates: Moody s (2015). Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 16 / 18

18 Concluding remarks Straightforward covariate shift (or invariant conditional default rate ) PD estimates sometimes may seriously underestimate future default rates. The invariant density ratio approach often provides very different estimates that may be used for model risk mitigation. The invariant density ratio approach can also be applied for the estimation of loss rates. Rating agency data like Moody s (2015) possibly are subjective, making results of the approach conservative. Further reading: Background and more details: Tasche (2013), Tasche (2014) More sophisticated approaches: Hofer and Krempl (2013), Hofer (2015) Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 17 / 18

19 References V. Hofer. Adapting a classification rule to local and global shift when only unlabelled data are available. European Journal of Operational Research, 243(1): , V. Hofer and G. Krempl. Drift mining in data: A framework for addressing drift in classification. Computational Statistics & Data Analysis, 57(1): , Moody s. Annual Default Study: Corporate Default and Recovery Rates, Special comment, Moody s Investors Service, March J.G. Moreno-Torres, T. Raeder, R. Alaiz-Rodriguez, N.V. Chawla, and F. Herrera. A unifying view on dataset shift in classification. Pattern Recognition, 45(1): , D. Tasche. The art of probability-of-default curve calibration. Journal of Credit Risk, 9(4):63 103, D. Tasche. Exact fit of simple finite mixture models. Journal of Risk and Financial Management, 7(4): , Dirk Tasche (Imperial College) Model Risk With Estimates of Probabilities of Default 18 / 18

RATING TRANSITIONS AND DEFAULT RATES

RATING TRANSITIONS AND DEFAULT RATES RATING TRANSITIONS AND DEFAULT RATES 2001-2012 I. Transition Rates for Banks Transition matrices or credit migration matrices characterise the evolution of credit quality for issuers with the same approximate

More information

Rating Transitions and Defaults Conditional on Rating Outlooks Revisited:

Rating Transitions and Defaults Conditional on Rating Outlooks Revisited: Special Comment December 2005 Contact Phone New York David T. Hamilton 1.212.553.1653 Richard Cantor Rating Transitions and Defaults Conditional on Rating Outlooks Revisited: 1995-2005 Summary In this

More information

Does quantification without adjustments work?

Does quantification without adjustments work? Does quantification without adjustments work? Dirk Tasche February 28, 2016 arxiv:1602.08780v1 [stat.ml] 28 Feb 2016 Classification is the task of predicting the class labels of objects based on the observation

More information

Issues using Logistic Regression for Highly Imbalanced data

Issues using Logistic Regression for Highly Imbalanced data Issues using Logistic Regression for Highly Imbalanced data Yazhe Li, Niall Adams, Tony Bellotti Imperial College London yli16@imperialacuk Credit Scoring and Credit Control conference, Aug 2017 Yazhe

More information

Expected Shortfall is not elicitable so what?

Expected Shortfall is not elicitable so what? Expected Shortfall is not elicitable so what? Dirk Tasche Bank of England Prudential Regulation Authority 1 dirk.tasche@gmx.net Finance & Stochastics seminar Imperial College, November 20, 2013 1 The opinions

More information

U.S. Retail/Restaurants Sector: Historical Performance And Current Risks

U.S. Retail/Restaurants Sector: Historical Performance And Current Risks U.S. Retail/Restaurants Sector: Historical Performance And Current Risks S&P Global Fixed Income Research Diane Vazza Managing Director +1 212 438 2760 diane.vazza@spglobal.com Nick Kraemer Senior Director

More information

Expected Shortfall is not elicitable so what?

Expected Shortfall is not elicitable so what? Expected Shortfall is not elicitable so what? Dirk Tasche Bank of England Prudential Regulation Authority 1 dirk.tasche@gmx.net Modern Risk Management of Insurance Firms Hannover, January 23, 2014 1 The

More information

How to evaluate credit scorecards - and why using the Gini coefficient has cost you money

How to evaluate credit scorecards - and why using the Gini coefficient has cost you money How to evaluate credit scorecards - and why using the Gini coefficient has cost you money David J. Hand Imperial College London Quantitative Financial Risk Management Centre August 2009 QFRMC - Imperial

More information

Distribution Date: 26-Dec-07

Distribution Date: 26-Dec-07 ABN AMRO Acct : 724532.1 Outside Parties To The Transaction Contact Information: Payment : 26-Dec-07 Prior Payment: 26-Nov-07 Issuer: First Franklin Financial Corporation Analyst: Shaun Horbochuk 714.259.6217

More information

Fisher Consistency for Prior Probability Shift

Fisher Consistency for Prior Probability Shift Journal of Machine Learning Research 18 (2017) 1-32 Submitted 1/17; Revised 4/17; Published 9/17 Fisher Consistency for Prior Probability Shift Dirk Tasche Swiss Financial Market Supervisory Authority

More information

c 4, < y 2, 1 0, otherwise,

c 4, < y 2, 1 0, otherwise, Fundamentals of Big Data Analytics Univ.-Prof. Dr. rer. nat. Rudolf Mathar Problem. Probability theory: The outcome of an experiment is described by three events A, B and C. The probabilities Pr(A) =,

More information

Losses Given Default in the Presence of Extreme Risks

Losses Given Default in the Presence of Extreme Risks Losses Given Default in the Presence of Extreme Risks Qihe Tang [a] and Zhongyi Yuan [b] [a] Department of Statistics and Actuarial Science University of Iowa [b] Smeal College of Business Pennsylvania

More information

U.S. Consumer Products And Retail/Restaurants Sector: Historical Performance And Current Risks

U.S. Consumer Products And Retail/Restaurants Sector: Historical Performance And Current Risks U.S. Consumer Products And Retail/Restaurants Sector: Historical Performance And Current Risks S&P Global Fixed Income Research Diane Vazza Managing Director +1 212 438 2760 diane.vazza@spglobal.com Nick

More information

Penarth Master Issuer plc - Monthly Report October 2012 Combined Series Report For IPD Ending: 19 November 2012

Penarth Master Issuer plc - Monthly Report October 2012 Combined Series Report For IPD Ending: 19 November 2012 Combined Series Report For IPD Ending: 19 November 2012 Reporting Date 17 November 2012 Reporting Period 01 October 2012-31 October 2012 Interest Payment Date 19 November 2012 Contact Details Name Telephone

More information

Default & Recovery Rates of Corporate Bond Issuers

Default & Recovery Rates of Corporate Bond Issuers Special Comment January 2004 Contact Phone New York David T. Hamilton 1.212.553.1653 Praveen Varma Sharon Ou Richard Cantor Default & Recovery Rates of Corporate Bond Issuers Summary A Statistical Review

More information

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS INSTITUTE AND FACULTY OF ACTUARIES Curriculum 09 SPECIMEN SOLUTIONS Subject CSA Risk Modelling and Survival Analysis Institute and Faculty of Actuaries Sample path A continuous time, discrete state process

More information

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline. Practitioner Course: Portfolio Optimization September 10, 2008 Before we define dependence, it is useful to define Random variables X and Y are independent iff For all x, y. In particular, F (X,Y ) (x,

More information

Lecture 9: Classification, LDA

Lecture 9: Classification, LDA Lecture 9: Classification, LDA Reading: Chapter 4 STATS 202: Data mining and analysis Jonathan Taylor, 10/12 Slide credits: Sergio Bacallado 1 / 1 Review: Main strategy in Chapter 4 Find an estimate ˆP

More information

Week 2 Quantitative Analysis of Financial Markets Bayesian Analysis

Week 2 Quantitative Analysis of Financial Markets Bayesian Analysis Week 2 Quantitative Analysis of Financial Markets Bayesian Analysis Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October

More information

Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project

Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project Devin Cornell & Sushruth Sastry May 2015 1 Abstract In this article, we explore

More information

Lecture 9: Classification, LDA

Lecture 9: Classification, LDA Lecture 9: Classification, LDA Reading: Chapter 4 STATS 202: Data mining and analysis October 13, 2017 1 / 21 Review: Main strategy in Chapter 4 Find an estimate ˆP (Y X). Then, given an input x 0, we

More information

Penarth Master Issuer plc - Monthly Report November 2012 Combined Series Report For IPD Ending: 18 December 2012

Penarth Master Issuer plc - Monthly Report November 2012 Combined Series Report For IPD Ending: 18 December 2012 Combined Series Report For IPD Ending: 18 December 2012 Reporting Date 17 December 2012 Reporting Period 01 November 2012-30 November 2012 Interest Payment Date 18 December 2012 Contact Details Name Telephone

More information

Lecture 9: Classification, LDA

Lecture 9: Classification, LDA Lecture 9: Classification, LDA Reading: Chapter 4 STATS 202: Data mining and analysis October 13, 2017 1 / 21 Review: Main strategy in Chapter 4 Find an estimate ˆP (Y X). Then, given an input x 0, we

More information

INTRODUCTION TO PATTERN RECOGNITION

INTRODUCTION TO PATTERN RECOGNITION INTRODUCTION TO PATTERN RECOGNITION INSTRUCTOR: WEI DING 1 Pattern Recognition Automatic discovery of regularities in data through the use of computer algorithms With the use of these regularities to take

More information

Statistical Machine Learning Hilary Term 2018

Statistical Machine Learning Hilary Term 2018 Statistical Machine Learning Hilary Term 2018 Pier Francesco Palamara Department of Statistics University of Oxford Slide credits and other course material can be found at: http://www.stats.ox.ac.uk/~palamara/sml18.html

More information

Fieller Stability Measure: A Novel Model-dependent Backtesting Approach

Fieller Stability Measure: A Novel Model-dependent Backtesting Approach Fieller Stability Measure: A Novel Model-dependent Backtesting Approach Cristián Bravo, Sebastián Maldonado Abstract Dataset shift is present in almost all real-world applications, since most of them are

More information

MIRA, SVM, k-nn. Lirong Xia

MIRA, SVM, k-nn. Lirong Xia MIRA, SVM, k-nn Lirong Xia Linear Classifiers (perceptrons) Inputs are feature values Each feature has a weight Sum is the activation activation w If the activation is: Positive: output +1 Negative, output

More information

Probabilities & Statistics Revision

Probabilities & Statistics Revision Probabilities & Statistics Revision Christopher Ting Christopher Ting http://www.mysmu.edu/faculty/christophert/ : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 January 6, 2017 Christopher Ting QF

More information

CS798: Selected topics in Machine Learning

CS798: Selected topics in Machine Learning CS798: Selected topics in Machine Learning Support Vector Machine Jakramate Bootkrajang Department of Computer Science Chiang Mai University Jakramate Bootkrajang CS798: Selected topics in Machine Learning

More information

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Cheng Soon Ong & Christian Walder. Canberra February June 2018 Cheng Soon Ong & Christian Walder Research Group and College of Engineering and Computer Science Canberra February June 2018 (Many figures from C. M. Bishop, "Pattern Recognition and ") 1of 143 Part IV

More information

Bayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework

Bayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework HT5: SC4 Statistical Data Mining and Machine Learning Dino Sejdinovic Department of Statistics Oxford http://www.stats.ox.ac.uk/~sejdinov/sdmml.html Maximum Likelihood Principle A generative model for

More information

Generalized Autoregressive Score Models

Generalized Autoregressive Score Models Generalized Autoregressive Score Models by: Drew Creal, Siem Jan Koopman, André Lucas To capture the dynamic behavior of univariate and multivariate time series processes, we can allow parameters to be

More information

IFRS9 PD modeling Presentation MATFYZ Konstantin Belyaev

IFRS9 PD modeling Presentation MATFYZ Konstantin Belyaev IFRS9 PD modeling Presentation MATFYZ 03.01.2018 Konstantin Belyaev IFRS9 International Financial Reporting Standard In July 2014, the International Accounting Standard Board (IASB) issued the final version

More information

Statistical Learning Reading Assignments

Statistical Learning Reading Assignments Statistical Learning Reading Assignments S. Gong et al. Dynamic Vision: From Images to Face Recognition, Imperial College Press, 2001 (Chapt. 3, hard copy). T. Evgeniou, M. Pontil, and T. Poggio, "Statistical

More information

15-388/688 - Practical Data Science: Basic probability. J. Zico Kolter Carnegie Mellon University Spring 2018

15-388/688 - Practical Data Science: Basic probability. J. Zico Kolter Carnegie Mellon University Spring 2018 15-388/688 - Practical Data Science: Basic probability J. Zico Kolter Carnegie Mellon University Spring 2018 1 Announcements Logistics of next few lectures Final project released, proposals/groups due

More information

Claris Finance 2003 Srl

Claris Finance 2003 Srl External Parties Joint Lead Managers AG Banca Nazionale del Lavoro S.p.A. Computation Agent AG, London Branch Table of Contents Page 1. Current Distributions and Ratings Data 2 2. Collection Summary 3

More information

Normal Random Variables and Probability

Normal Random Variables and Probability Normal Random Variables and Probability An Undergraduate Introduction to Financial Mathematics J. Robert Buchanan 2015 Discrete vs. Continuous Random Variables Think about the probability of selecting

More information

arxiv:cond-mat/ v1 23 Jul 2002

arxiv:cond-mat/ v1 23 Jul 2002 Remarks on the monotonicity of default probabilities Dirk Tasche arxiv:cond-mat/0207555v1 23 Jul 2002 July 23, 2002 Abstract The consultative papers for the Basel II Accord require rating systems to provide

More information

Programme Specification (Undergraduate) Chemistry

Programme Specification (Undergraduate) Chemistry Programme Specification (Undergraduate) BSc Chemistry This document provides a definitive record of the main features of the programme and the learning outcomes that a typical student may reasonably be

More information

VUS and HUM Represented with Mann-Whitney Statistic

VUS and HUM Represented with Mann-Whitney Statistic Communications for Statistical Applications and Methods 05, Vol., No. 3, 3 3 DOI: http://dx.doi.org/0.535/csam.05..3.3 Print ISSN 87-7843 / Online ISSN 383-4757 VUS and HUM Represented with Mann-Whitney

More information

Machine Learning Lecture 2

Machine Learning Lecture 2 Machine Perceptual Learning and Sensory Summer Augmented 15 Computing Many slides adapted from B. Schiele Machine Learning Lecture 2 Probability Density Estimation 16.04.2015 Bastian Leibe RWTH Aachen

More information

L11: Pattern recognition principles

L11: Pattern recognition principles L11: Pattern recognition principles Bayesian decision theory Statistical classifiers Dimensionality reduction Clustering This lecture is partly based on [Huang, Acero and Hon, 2001, ch. 4] Introduction

More information

Linear vs Non-linear classifier. CS789: Machine Learning and Neural Network. Introduction

Linear vs Non-linear classifier. CS789: Machine Learning and Neural Network. Introduction Linear vs Non-linear classifier CS789: Machine Learning and Neural Network Support Vector Machine Jakramate Bootkrajang Department of Computer Science Chiang Mai University Linear classifier is in the

More information

Nonparametric Bayesian Methods (Gaussian Processes)

Nonparametric Bayesian Methods (Gaussian Processes) [70240413 Statistical Machine Learning, Spring, 2015] Nonparametric Bayesian Methods (Gaussian Processes) Jun Zhu dcszj@mail.tsinghua.edu.cn http://bigml.cs.tsinghua.edu.cn/~jun State Key Lab of Intelligent

More information

Towards Maximum Geometric Margin Minimum Error Classification

Towards Maximum Geometric Margin Minimum Error Classification THE SCIENCE AND ENGINEERING REVIEW OF DOSHISHA UNIVERSITY, VOL. 50, NO. 3 October 2009 Towards Maximum Geometric Margin Minimum Error Classification Kouta YAMADA*, Shigeru KATAGIRI*, Erik MCDERMOTT**,

More information

GS Mortgage Securities Corporation II Trust Pass-Through Certificates Series 2005-ROCK

GS Mortgage Securities Corporation II Trust Pass-Through Certificates Series 2005-ROCK DATES ADMINISTRATOR Payment Date: Apr 3, 2015 Prior Payment: Mar 3, 2015 Next Payment: May 4, 2015 First Payment Date: Jun 3, 2005 Closing Date: May 26, 2005 Cut-off Date: May 1, 2005 Final Distribution

More information

Course Outline. TERM EFFECTIVE: Fall 2017 CURRICULUM APPROVAL DATE: 03/27/2017

Course Outline. TERM EFFECTIVE: Fall 2017 CURRICULUM APPROVAL DATE: 03/27/2017 5055 Santa Teresa Blvd Gilroy, CA 95023 Course Outline COURSE: MATH 430 DIVISION: 10 ALSO LISTED AS: TERM EFFECTIVE: Fall 2017 CURRICULUM APPROVAL DATE: 03/27/2017 SHT TITLE: ALGEBRA I LONG TITLE: Algebra

More information

COM336: Neural Computing

COM336: Neural Computing COM336: Neural Computing http://www.dcs.shef.ac.uk/ sjr/com336/ Lecture 2: Density Estimation Steve Renals Department of Computer Science University of Sheffield Sheffield S1 4DP UK email: s.renals@dcs.shef.ac.uk

More information

Supporting Statistical Hypothesis Testing Over Graphs

Supporting Statistical Hypothesis Testing Over Graphs Supporting Statistical Hypothesis Testing Over Graphs Jennifer Neville Departments of Computer Science and Statistics Purdue University (joint work with Tina Eliassi-Rad, Brian Gallagher, Sergey Kirshner,

More information

BAYESIAN DECISION THEORY

BAYESIAN DECISION THEORY Last updated: September 17, 2012 BAYESIAN DECISION THEORY Problems 2 The following problems from the textbook are relevant: 2.1 2.9, 2.11, 2.17 For this week, please at least solve Problem 2.3. We will

More information

Data Mining. Practical Machine Learning Tools and Techniques. Slides for Chapter 4 of Data Mining by I. H. Witten, E. Frank and M. A.

Data Mining. Practical Machine Learning Tools and Techniques. Slides for Chapter 4 of Data Mining by I. H. Witten, E. Frank and M. A. Data Mining Practical Machine Learning Tools and Techniques Slides for Chapter of Data Mining by I. H. Witten, E. Frank and M. A. Hall Statistical modeling Opposite of R: use all the attributes Two assumptions:

More information

Loss Estimation using Monte Carlo Simulation

Loss Estimation using Monte Carlo Simulation Loss Estimation using Monte Carlo Simulation Tony Bellotti, Department of Mathematics, Imperial College London Credit Scoring and Credit Control Conference XV Edinburgh, 29 August to 1 September 2017 Motivation

More information

Tagus - Sociedade de Titularizacao de Creditos

Tagus - Sociedade de Titularizacao de Creditos External Parties Joint Arrangers and Joint Lead Managers Caixa-Banco de Investimento, S.A Banco Millennium bcp Investimento, S.A Banco Espirito Santo de Investimento, S.A. Table of Contents Page 1. Current

More information

An Introduction to Machine Learning

An Introduction to Machine Learning An Introduction to Machine Learning L6: Structured Estimation Alexander J. Smola Statistical Machine Learning Program Canberra, ACT 0200 Australia Alex.Smola@nicta.com.au Tata Institute, Pune, January

More information

Can macro variables improve transition matrix forecasting?

Can macro variables improve transition matrix forecasting? Can macro variables improve transition matrix forecasting? Georges Mansourati & Anders Olsson May 2006 Abstract Transition matrices that describe the probability for firms to migrate between rating classes

More information

Public Disclosure Copy

Public Disclosure Copy Public Disclosure Authorized AFRICA Rwanda Energy & Extractives Global Practice Recipient Executed Activities Investment Project Financing FY 2017 Seq No: 3 ARCHIVED on 07-May-2018 ISR32125 Implementing

More information

Monthly Investor Report 31 October Fastnet Securities 5 Limited

Monthly Investor Report 31 October Fastnet Securities 5 Limited Monthly Investor Report 31 October 2018 Fastnet Securities 5 Limited Tranche Name Identifier Legal Maturity Date Original Tranche Balance Restructured Tranche Balance Original Rating Current Rating (S&P/Moodys)

More information

Claris Finance 2003 Srl

Claris Finance 2003 Srl External Parties Joint Lead Managers Deutsche Bank AG Banca Nazionale del Lavoro S.p.A. Computation Agent Deutsche Bank AG, London Branch Table of Contents Page 1. Current Distributions and Ratings Data

More information

Advanced Introduction to Machine Learning CMU-10715

Advanced Introduction to Machine Learning CMU-10715 Advanced Introduction to Machine Learning CMU-10715 Gaussian Processes Barnabás Póczos http://www.gaussianprocess.org/ 2 Some of these slides in the intro are taken from D. Lizotte, R. Parr, C. Guesterin

More information

Calculating credit risk capital charges with the one-factor model

Calculating credit risk capital charges with the one-factor model Calculating credit risk capital charges with the one-factor model Susanne Emmer Dirk Tasche September 15, 2003 Abstract Even in the simple Vasicek one-factor credit portfolio model, the exact contributions

More information

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Cheng Soon Ong & Christian Walder. Canberra February June 2018 Cheng Soon Ong & Christian Walder Research Group and College of Engineering and Computer Science Canberra February June 2018 Outlines Overview Introduction Linear Algebra Probability Linear Regression

More information

Machine Learning Lecture 2

Machine Learning Lecture 2 Announcements Machine Learning Lecture 2 Eceptional number of lecture participants this year Current count: 449 participants This is very nice, but it stretches our resources to their limits Probability

More information

Model Accuracy Measures

Model Accuracy Measures Model Accuracy Measures Master in Bioinformatics UPF 2017-2018 Eduardo Eyras Computational Genomics Pompeu Fabra University - ICREA Barcelona, Spain Variables What we can measure (attributes) Hypotheses

More information

Unsupervised Learning with Permuted Data

Unsupervised Learning with Permuted Data Unsupervised Learning with Permuted Data Sergey Kirshner skirshne@ics.uci.edu Sridevi Parise sparise@ics.uci.edu Padhraic Smyth smyth@ics.uci.edu School of Information and Computer Science, University

More information

A traffic lights approach to PD validation

A traffic lights approach to PD validation A traffic lights approach to PD validation Dirk Tasche May 2, 2003 Abstract As a consequence of the dependence experienced in loan portfolios, the standard binomial test which is based on the assumption

More information

Classification: Linear Discriminant Analysis

Classification: Linear Discriminant Analysis Classification: Linear Discriminant Analysis Discriminant analysis uses sample information about individuals that are known to belong to one of several populations for the purposes of classification. Based

More information

A fast and simple algorithm for training neural probabilistic language models

A fast and simple algorithm for training neural probabilistic language models A fast and simple algorithm for training neural probabilistic language models Andriy Mnih Joint work with Yee Whye Teh Gatsby Computational Neuroscience Unit University College London 25 January 2013 1

More information

Probability Theory for Machine Learning. Chris Cremer September 2015

Probability Theory for Machine Learning. Chris Cremer September 2015 Probability Theory for Machine Learning Chris Cremer September 2015 Outline Motivation Probability Definitions and Rules Probability Distributions MLE for Gaussian Parameter Estimation MLE and Least Squares

More information

Unsupervised Learning

Unsupervised Learning 2018 EE448, Big Data Mining, Lecture 7 Unsupervised Learning Weinan Zhang Shanghai Jiao Tong University http://wnzhang.net http://wnzhang.net/teaching/ee448/index.html ML Problem Setting First build and

More information

Mixture Models and Expectation-Maximization

Mixture Models and Expectation-Maximization Mixture Models and Expectation-Maximiation David M. Blei March 9, 2012 EM for mixtures of multinomials The graphical model for a mixture of multinomials π d x dn N D θ k K How should we fit the parameters?

More information

Course Outline 4/10/ Santa Teresa Blvd Gilroy, CA COURSE: MATH 205A DIVISION: 10 ALSO LISTED AS: SHORT TITLE: FIRST HALF ALGEBRA

Course Outline 4/10/ Santa Teresa Blvd Gilroy, CA COURSE: MATH 205A DIVISION: 10 ALSO LISTED AS: SHORT TITLE: FIRST HALF ALGEBRA 5055 Santa Teresa Blvd Gilroy, CA 95023 Course Outline COURSE: MATH 205A DIVISION: 10 ALSO LISTED AS: TERM EFFECTIVE: Fall 2018 Inactive Course SHT TITLE: FIRST HALF ALGEBRA LONG TITLE: First Half of Elementary

More information

STA414/2104 Statistical Methods for Machine Learning II

STA414/2104 Statistical Methods for Machine Learning II STA414/2104 Statistical Methods for Machine Learning II Murat A. Erdogdu & David Duvenaud Department of Computer Science Department of Statistical Sciences Lecture 3 Slide credits: Russ Salakhutdinov Announcements

More information

SUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION

SUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION SUPERVISED LEARNING: INTRODUCTION TO CLASSIFICATION 1 Outline Basic terminology Features Training and validation Model selection Error and loss measures Statistical comparison Evaluation measures 2 Terminology

More information

Maths for Signals and Systems Linear Algebra in Engineering

Maths for Signals and Systems Linear Algebra in Engineering Maths for Signals and Systems Linear Algebra in Engineering Lectures 13 15, Tuesday 8 th and Friday 11 th November 016 DR TANIA STATHAKI READER (ASSOCIATE PROFFESOR) IN SIGNAL PROCESSING IMPERIAL COLLEGE

More information

Intelligent Systems Statistical Machine Learning

Intelligent Systems Statistical Machine Learning Intelligent Systems Statistical Machine Learning Carsten Rother, Dmitrij Schlesinger WS2015/2016, Our model and tasks The model: two variables are usually present: - the first one is typically discrete

More information

This appendix provides a very basic introduction to linear algebra concepts.

This appendix provides a very basic introduction to linear algebra concepts. APPENDIX Basic Linear Algebra Concepts This appendix provides a very basic introduction to linear algebra concepts. Some of these concepts are intentionally presented here in a somewhat simplified (not

More information

DS Machine Learning and Data Mining I. Alina Oprea Associate Professor, CCIS Northeastern University

DS Machine Learning and Data Mining I. Alina Oprea Associate Professor, CCIS Northeastern University DS 4400 Machine Learning and Data Mining I Alina Oprea Associate Professor, CCIS Northeastern University January 17 2019 Logistics HW 1 is on Piazza and Gradescope Deadline: Friday, Jan. 25, 2019 Office

More information

Argo Mortgages 2 S.r.l

Argo Mortgages 2 S.r.l External Parties Arrangers WestLB AG Natixis S.A. UBS Investment Bank Servicer CDC IXIS Table of Contents Page 1. The Notes 2 2. Issuer Available Funds 3 3. Expenses 4 4. Amortisation Amounts 5 5. Pre-Enforcement

More information

Machine Learning Linear Models

Machine Learning Linear Models Machine Learning Linear Models Outline II - Linear Models 1. Linear Regression (a) Linear regression: History (b) Linear regression with Least Squares (c) Matrix representation and Normal Equation Method

More information

PATTERN RECOGNITION AND MACHINE LEARNING

PATTERN RECOGNITION AND MACHINE LEARNING PATTERN RECOGNITION AND MACHINE LEARNING Slide Set 3: Detection Theory January 2018 Heikki Huttunen heikki.huttunen@tut.fi Department of Signal Processing Tampere University of Technology Detection theory

More information

Probabilistic classification CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2016

Probabilistic classification CE-717: Machine Learning Sharif University of Technology. M. Soleymani Fall 2016 Probabilistic classification CE-717: Machine Learning Sharif University of Technology M. Soleymani Fall 2016 Topics Probabilistic approach Bayes decision theory Generative models Gaussian Bayes classifier

More information

INFORMATION VALUE ESTIMATOR FOR CREDIT SCORING MODELS

INFORMATION VALUE ESTIMATOR FOR CREDIT SCORING MODELS ECDM Lisbon INFORMATION VALUE ESTIMATOR FOR CREDIT SCORING MODELS Martin Řezáč Dept. of Mathematics and Statistics, Faculty of Science, Masaryk University Introduction Information value is widely used

More information

LECTURE NOTE #3 PROF. ALAN YUILLE

LECTURE NOTE #3 PROF. ALAN YUILLE LECTURE NOTE #3 PROF. ALAN YUILLE 1. Three Topics (1) Precision and Recall Curves. Receiver Operating Characteristic Curves (ROC). What to do if we do not fix the loss function? (2) The Curse of Dimensionality.

More information

Distribution Date: 27-Aug-07

Distribution Date: 27-Aug-07 ABN AMRO Acct : 722341.1 Payment : 27-Aug-07 Prior Payment: 25-Jul-07 Next Payment: 25-Sep-07 Record : 24-Aug-07 Distribution Count: Content: Pages Contact Information: 8/27/2007 0:00 Statement to Certificate

More information

Public Disclosure Copy

Public Disclosure Copy Public Disclosure Authorized AFRICA Rwanda Energy & Extractives Global Practice Recipient Executed Activities Investment Project Financing FY 2017 Seq No: 1 ARCHIVED on 15-Oct-2017 ISR29580 Implementing

More information

Lecture 2. Introduction to Systems (Lathi )

Lecture 2. Introduction to Systems (Lathi ) Lecture 2 Introduction to Systems (Lathi 1.6-1.8) Pier Luigi Dragotti Department of Electrical & Electronic Engineering Imperial College London URL: www.commsp.ee.ic.ac.uk/~pld/teaching/ E-mail: p.dragotti@imperial.ac.uk

More information

Machine Learning Lecture 2

Machine Learning Lecture 2 Machine Perceptual Learning and Sensory Summer Augmented 6 Computing Announcements Machine Learning Lecture 2 Course webpage http://www.vision.rwth-aachen.de/teaching/ Slides will be made available on

More information

Discriminative Direction for Kernel Classifiers

Discriminative Direction for Kernel Classifiers Discriminative Direction for Kernel Classifiers Polina Golland Artificial Intelligence Lab Massachusetts Institute of Technology Cambridge, MA 02139 polina@ai.mit.edu Abstract In many scientific and engineering

More information

1: PROBABILITY REVIEW

1: PROBABILITY REVIEW 1: PROBABILITY REVIEW Marek Rutkowski School of Mathematics and Statistics University of Sydney Semester 2, 2016 M. Rutkowski (USydney) Slides 1: Probability Review 1 / 56 Outline We will review the following

More information

CS 188: Artificial Intelligence. Outline

CS 188: Artificial Intelligence. Outline CS 188: Artificial Intelligence Lecture 21: Perceptrons Pieter Abbeel UC Berkeley Many slides adapted from Dan Klein. Outline Generative vs. Discriminative Binary Linear Classifiers Perceptron Multi-class

More information

Three-group ROC predictive analysis for ordinal outcomes

Three-group ROC predictive analysis for ordinal outcomes Three-group ROC predictive analysis for ordinal outcomes Tahani Coolen-Maturi Durham University Business School Durham University, UK tahani.maturi@durham.ac.uk June 26, 2016 Abstract Measuring the accuracy

More information

Classification of high dimensional data: High Dimensional Discriminant Analysis

Classification of high dimensional data: High Dimensional Discriminant Analysis Classification of high dimensional data: High Dimensional Discriminant Analysis Charles Bouveyron, Stephane Girard, Cordelia Schmid To cite this version: Charles Bouveyron, Stephane Girard, Cordelia Schmid.

More information

Multivariate Distributions

Multivariate Distributions IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Multivariate Distributions We will study multivariate distributions in these notes, focusing 1 in particular on multivariate

More information

Chapter 7, Part A Sampling and Sampling Distributions

Chapter 7, Part A Sampling and Sampling Distributions Slides Prepared by JOHN S. LOUCKS St. Edward s University Slide 1 Chapter 7, Part A Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling

More information

Classification. 1. Strategies for classification 2. Minimizing the probability for misclassification 3. Risk minimization

Classification. 1. Strategies for classification 2. Minimizing the probability for misclassification 3. Risk minimization Classification Volker Blobel University of Hamburg March 2005 Given objects (e.g. particle tracks), which have certain features (e.g. momentum p, specific energy loss de/ dx) and which belong to one of

More information

Pattern Recognition and Machine Learning. Bishop Chapter 2: Probability Distributions

Pattern Recognition and Machine Learning. Bishop Chapter 2: Probability Distributions Pattern Recognition and Machine Learning Chapter 2: Probability Distributions Cécile Amblard Alex Kläser Jakob Verbeek October 11, 27 Probability Distributions: General Density Estimation: given a finite

More information

Clustering and Gaussian Mixtures

Clustering and Gaussian Mixtures Clustering and Gaussian Mixtures Oliver Schulte - CMPT 883 2 4 6 8 1 12 14 16 18 2 4 6 8 1 12 14 16 18 5 1 15 2 25 5 1 15 2 25 2 4 6 8 1 12 14 2 4 6 8 1 12 14 5 1 15 2 25 5 1 15 2 25 detected tures detected

More information

Clustering. Professor Ameet Talwalkar. Professor Ameet Talwalkar CS260 Machine Learning Algorithms March 8, / 26

Clustering. Professor Ameet Talwalkar. Professor Ameet Talwalkar CS260 Machine Learning Algorithms March 8, / 26 Clustering Professor Ameet Talwalkar Professor Ameet Talwalkar CS26 Machine Learning Algorithms March 8, 217 1 / 26 Outline 1 Administration 2 Review of last lecture 3 Clustering Professor Ameet Talwalkar

More information

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline. MFM Practitioner Module: Risk & Asset Allocation September 11, 2013 Before we define dependence, it is useful to define Random variables X and Y are independent iff For all x, y. In particular, F (X,Y

More information

Data Mining Techniques

Data Mining Techniques Data Mining Techniques CS 6220 - Section 2 - Spring 2017 Lecture 6 Jan-Willem van de Meent (credit: Yijun Zhao, Chris Bishop, Andrew Moore, Hastie et al.) Project Project Deadlines 3 Feb: Form teams of

More information