Extreme Value Theory.

Size: px
Start display at page:

Download "Extreme Value Theory."

Transcription

1 Bank of England Centre for Central Banking Studies CEMLA 2013 Extreme Value Theory. David G. Barr November 21, 2013 Any views expressed are those of the author and not necessarily those of the Bank of England. 1

2 Contents 1 Outline. 3 2 How probable are improbable events? 3 3 Can we use the Normal distribution to calculate stock return probabilities? The data exhibit excess kurtosis i.e. fat tails So what do we do about it? Estimating probability distributions Estimating a Normal distribution Parametric vs. empirical distributions Extreme value theory A quick summary Back to work Sensible curves What makes these curves sensible? The Theory in EVT The theory in general terms: Gnedenko s result The general EVT shape: The generalised Pareto distribution Do we actually fit the density function to the histogram? To estimate the GPD do the following: Using the estimated parameters of the Generalised Pareto Distribution A simpler EVT shape: The Pareto distribution Application of the Pareto distribution to SP500 daily returns References. 27 2

3 1. Outline. 1. Introduction: Why is EVT useful? 2. Ingredients: (a) Probability distributions for EVT. (b) Gnedenko s results 3. A general application of EVT. 4. A simpler, restricted, application. 2. How probable are improbable events? The most damaging events occur rarely (fortunately?). As a result we have few observations with which to estimate their probability. We can estimate them using data, asset returns for example, from normal times and by assuming that they come from a standard distribution. This gives us an estimated parametric distribution, but it is typically very bad at providing estimated probabilities for rare events. Or we can assume that the frequencies of past data represent the probabilities of future outcomes. This gives us an empirical distribution, but this too is very bad at providing the probabilities we want. (Though it can be better than the parametric distribution.) A third option is to use Extreme Value Theory, which is a parametric estimate of the tails of the distribution only. 3

4 3. Can we use the Normal distribution to calculate stock return probabilities? 3.1. The data exhibit excess kurtosis i.e. fat tails. We hear quite a lot about stock returns being approximately lognormally distributed. I.e. ln (1 + r) N(µ, σ) (1) where ln (1 + r) is the log return. We will refer to this as just the return from now on. The following figures show the frequency of daily returns for the US and UK. They demonstrate that, to the eye at least, the lognormal is a reasonable approximation. Figure 1: SP500 4

5 Figure 2: FT100 The are three things worth noting in these charts: 1. The returns are centred very close to zero. This is because they are the actual returns from one day to the next. The means are actually slightly positive and, grossed up to annual rates, are about 15% p.a. 2. The data display fat-tails. We could observe these in (at least) two ways. (a) We could fit a more-general curve than the Normal and we would see it hover above the Normal in the tails. (We ll do this later.) (b) We could simply observe that there are several spikes where the Normal is effectively at zero (as these Figures show). 5

6 Figure 3: SP500-left tail 3. These spikes might not look important, but if we were to use a standard significance test for the returns at these points, we would reject the null hypothesis that the returns are distributed according to the Normals drawn here. 4. More dangerously in a risk management context, using the Normal makes us feel a lot safer than we actually are. 5. There are several ways to test whether an asset s returns follow a specific distribution. The Bera-Jarque test is a test against Normality (i.e. it is used to detect evidence against Normality specifically). The Kolmogorov-Smirnov test can be used to test against any specified distribution So what do we do about it? One possibility is to do nothing, but it s not very interesting, and leads to errors in VaR calculations. Use a simple distribution that makes a better job of matching the data, a Student-t with low degrees of freedom, for example: 6

7 It looks just like the normal, but it has fatter tails. Figure 4: t ( ) and Normal distributions. Unfortunately this doesn t work too well either - the mass of observations close to the mean dominate the parameter estimates. 7

8 4. Estimating probability distributions Estimating a Normal distribution. The normal has only two parameters to estimate, µ and σ. With these values for ˆµ and ˆσ we can construct the full Normal distribution. However, having these estimates does not imply that the true distribution of S t actually is Normal. Daily returns for the SP500 from 2 January 1957 to 26 April 2012 have the following estimated moments: ˆµ = %, ˆσ = % for T = observations. If the returns were Normally distributed the 1st percentile would be found from r 1% = 2.33 (2) r 1% = 2.32% (3) (4) So we should see (1% of 13928) = 139 observations lower than -2.32%. In fact there were 220 such observations. Conclusion: There are more observations in the lower tail of the actual distribution than we should expect if they are generated by a Normal distribution. 1% of the observations lie below -2.71% i.e. VaR(1%:empirical) = -2.71% 8

9 VaR(1%:Normal) = -2.32% Note that these conclusions about the normal are not based on a statistical test. Would we, for example, want to reject the Normal distribution if there were only 140 > 139 observations below -2.32%? As it happens the statistical tests also reject Normality in this case. 9

10 5. Parametric vs. empirical distributions. In the above example, it is not impossible that the returns come from a N(0.0244, ) distribution but......the number of observations below -2.32% suggests that this is unlikely. So what next? We can search for a parametric distribution that is not rejected by the data. We can assume that some of the empirical characteristics of the past observations will be reflected in the future returns. In particular we usually use the frequency distribution (or histogram ), as opposed to an estimated probability distribution. The second of these makes use of the empirical distribution. For example, we would conclude that the probability of a loss in excess of 2.32% is equal to the frequency of these lossess in the historical data i.e. p = 220/13928 = 1.58% rather than the 1% we get from the estimated Normal. The main failing of empirical distributions is that they tell us nothing about the probability of outcomes that lie outside the sample of empirical observations. 10

11 6. Extreme value theory A quick summary. EVT provides a method for fitting a sensible curve to the following observed histogram: Fitted EVT Fitted Normal Figure 5: SP500-left tail-example EVT curve. We have to do three things: 1. Pick a sensible curve. 2. Rearrange the histogram s data to simplify the estimation process. 3. Perform the estimation. 11

12 6.2. Back to work... Extreme value outcomes are not necessarily more unusual than any other events - consider a uniform distribution. In financial data however, it turns out that they are. This makes it very difficult to assess their probabilities. In particular, they tend not to fit into the distributions that do quite well at assessing the probabilities of ordinary outcomes. See the SP500 charts we looked at earlier. Since these events are also among the most dangerous, extreme value theory (EVT) has been adopted to deal with them. Hull (2012) summarises EVT s role rather well: [EVT] is a way of smoothing and extrapolating the tails of an empirical distribution. This is true of fitting most parametric distributions of course (why?) but EVT does a better job with the tails Sensible curves. We have seen that fitting a Normal to SP500 returns leads to underestimation of the tail probabilities. EVT provides a theoretical distribution that fits the tails much better, although it typically makes a mess of fitting the rest of the distribution. Does this matter? No, because we are interested only in tail risk here. 12

13 Why is this better than the empirical distribution? Because it is continuous, and can be extended beyond the most extreme empirical data point, making it more accurate for risk calculations. The core of EVT in finance is the Generalised Pareto Distribution. This will supply our sensible curve. We present it in terms of the probability that a variable x will be less than a number X, given that x is greater than a number m. Note that this is the cumulative distribution and not the density function. The numerical examples we saw earlier were presented in terms of the density function. We will have to take this change of approach into account when we perform the EVT estimation. Generalised Pareto Distribution (GPD): P (x < X x > m) GP D(X) (5) ( ) 1 (X m) φ GP D(X) = φ φ 0 (6) β ( ) X m = 1 exp φ = 0 (7) β where, β > 0, and m, φ 0. The distribution starts at x = m. Note that x is the random variable, and X represents a specific number. Many texts reverse these definitions. We return to the relative merits of these two later. For now all we need to know is that EVT based on the GPD can describe many types 13

14 of data. For financial returns in particular, the PD restrictions (i.e. fat-tailed) seem to hold. For GP D(X): φ GP D(X) becomes the... > 0 Frechet = 0 Gumbel < 0 Weibull For financial returns we expect to find φ > 0 since the Frechet exhibits fat tails. Financial data are often consistent with a another restriction m = β φ and when this is applied to the GPD/Frechet we get the Pareto Distribution. Pareto Distribution PD: a restricted version of the GPD that displays fat tails. P (x < X x > m) P D(X) (8) ( m ) α P D(X) = 1 x m, α > 0 (9) X = 1 KX α where K = m α (10) where the restriction is m = β φ, and α 1 φ > 0. We will estimate a GPD and a PD in what follows. 14

15 Figure 6: Frechet distributions. 7. What makes these curves sensible? The Theory in EVT The theory in general terms: Gnedenko s result. The key result of the theory is that, under certain conditions, the cdf of many random variables converges on a specific shape as we get further into the tails. This shape is the familiar smooth decline towards zero that we see in the Normal, t, χ 2 etc. We can apply this result even when we do not know which distribution the variables come from. 15

16 So, in finance, even though the true distribution of asset returns may remain a mystery, we can say something about the shape of the distribution in the tails, which is what matters for Value at Risk etc. And, provided that we have enough data, EVT allows us to estimate a shape that will approximate the true distribution whatever it is (subject to some conditions). More specifically, Gnedenko s result states that for a wide class of distributions the upper tail converges on the GPD above The general EVT shape: The generalised Pareto distribution. Textbooks typically present this material in terms of losses expressed as positive numbers - we do the same here. For many distributions the part to the right of a value u converges on the generalised Pareto distribution. I.e. if F u (Y ) is the probability that x lies between u and u + Y then, as u increases, ( ) 1 (Y u) φ F u (Y ) G φ,β (y) = φ β φ 0 (11) ( ) Y u = 1 exp β φ = 0 (12) I.e. as u gets larger, and we move into the tail, the distribution converges on the GPD. We use slightly different notation in contrast to (7) above to emphasize that this is an approximation to the GPD that improves as u gets larger. 16

17 In the limit, Y becomes X, and u becomes m as in (7). The parameters φ, β can then be estimated using non-linear methods, for any choice of u (or m) Do we actually fit the density function to the histogram? A histogram presents the observed observations grouped into buckets. A density function presents the probabilities for every possible individual observation i.e. it does not group them into buckets. While the buckets approach is convenient for diagrams it would be complicated (though not impossible) for estimation. We actually fit the probability (more precisely, the value of the density function) of getting each observation For the cumulative distribution for the GPD is ( ) 1 (X m) φ GP D(X) = φ φ 0 (13) β For which the density is gpd(x) = 1 β ( ) 1 (X m) φ φ β (14) which we can think of, loosely, as the probability of observing x = X To estimate the GPD do the following: Order the sample of n loss-making returns (x) as positive numbers, starting with the largest first. Chose u: Usually this will be the empirical 95th percentile, so select the losses above this level. 17

18 If we want to find VaR(q%) for example, we will need to choose u to the left (closer to the mean) of this. The 95th percentile satisfies this condition for VaR(1%). Call the number of observations in this percentile n u, so n u = 0.05 (15) n Estimate the parameters of GPD(X) using maximum likelihood. The likelihood for the sample is: L = ln(l) = = n u i=1 n u 1 n u 1 ln gpd(x i ) (16) 1 β [ 1 β ( 1 + φ (x ) 1 i u) φ 1 β ( 1 + φ(x ) ] 1/φ 1 i u) β We have to find values of φ, β to maximise ln(l). (17) (18) For this we need a numerical search process, and we have to supply a couple of starting values to get the search started. Hull (2012) p316 gives the following data for losses on an example portfolio (from a total sample of 500 losses): 18

19 [ 1 Loss ($000s) ln β ( 1 + φ(x i u) β ) 1/φ 1 ] φ=0.3,β=40 where the second column has been calculated using initial values of φ = 0.3 and β = 40. The value of u is 160. This gives n u = 22, and n u /n = 22/500 = 4.4% i.e. we are fitting the distribution to the largest 4.4% of losses. We then use the search process to find the φ, β that maximise the sum of the final column (i.e. push the sum towards zero from below). The optimising values in this case are φ = and β = The estimated φ > 0 confirms the presence of fat tails in the data Using the estimated parameters of the Generalised Pareto Distribution. VaR(q) and expected shortfall (ES(q)) where q is expressed as the weight in the tail (e.g. 1%) are ( [ V ar(q) = u + β ] φ n q 1) (19) φ n u ES(q) = V ar(q) + β φu 1 φ (20) 7.6. A simpler EVT shape: The Pareto distribution. For a lot of financial data the Pareto distribution performs well. GP D(X) = 1 KX α (21) 19

20 where K = α = 1 φ ( ) α β = m α (22) φ (23) The parameter α is known as the tail index. The larger the tail index, the fatter the tail. The tail index for the Normal is zero. Tail indices can be found for both parametric and empirical distributions. The tail index can then be used to calculate the VaR for either distribution. If the data satisfy this restriction, it is more efficient to estimate a Pareto distribution than the GPD. And we can fit the cumulative distribution function quite easily in this case Application of the Pareto distribution to SP500 daily returns. We use the same sample of SP500 data as in Section 4 above. The cumulative frequencies of deviations on both sides of the mean are shown in Table 1, in which the final column is (1 Φ(Col1)) 100. We can estimate a PD equation to fit the actual column in Table 1 as follows: 20

21 Deviation Percentage Percentage (below the mean) actual Normal distn > 1 s.d > 2 s.d > 3 s.d > 4 s.d > 5 s.d > 6 s.d > 7 s.d > 8 s.d > 9 s.d > 10 s.d Table 1: Actual and Normal one-sided frequencies. X will be the value of x measured in standard deviations. This scaling is not important - we could choose numbers for v arbitrarily. The probability that x exceeds X is KX α, which corresponds to the percentages in Table 1. Take logs of this probability to produce a linear equation for estimation i.e. ln(p r(x > X)) = ln(k) αln(x) (24) Construct the ln data for equation (24) using the first 7 observations from Table 1 (we keep the other 3 for out-of-sample tests), see Table 2. 21

22 X ln(x) P rop(x > X) ln[p rop(x > X)] in s.d. (Actual) = ln(col(3)) Table 2: Data for fitting equation (24). Plot ln(x) (column 2) against ln[p rop(x > X)] (column 4): Figure 1. These are the 2 variables for the regression. Figure 7: x axis = ln(x), y axis = ln[prop(x>x)]. 22

23 The plot is approximately linear, which is evidence in favour of our using the Pareto. Select the observation beyond which the actual frequencies appear most linear: Observation 3 in this case. So we run the regression using observations 3 to 7. This bit is is art, not science. OLS using the log linear observations generates ˆ ln(k) = (25) ˆK = 0.27 (26) ˆα = 3.31 (27) Most financial time series produce ˆα between 3 and 5. From the parameter estimates we can construct the following cumulative probabilities for comparison: Deviation Percentage: Percentage: Percentage: in s.d. fitted Normal distn actual > > > > > > > > >

24 The value for 4.5 s.d. represents an interpolation of the fitted distribution. The values for 8, 9 and 10 s.d. represent extrapolations of the fited distribution. Figures 2 and 3 show that the power distribution fits the empirical data better than the Normal in the tails. 24

25 Figure 8: Fitted distributions: 1 to 7 sigma. Figure 9: Fitted distributions: 4 to 15 sigma. 25

26 The extrapolated power distribution does not beat the empirical in these graphs due to the Black Monday loss of 23σ. Even the Pareto underestimates that probability of this size of loss. 26

27 8. References. Hull s Risk Management and Financial Institutions., 3ed (2012), is an excellent source for the material covered here. Danielsson s Financial Risk Forecasting. (2011) also covers this material but at a more advanced level. Both books are published by Wiley Finance. Working with both simultaneously can be confusing because their notations differ: Hull is the better place to start. 27

Volatility. Gerald P. Dwyer. February Clemson University

Volatility. Gerald P. Dwyer. February Clemson University Volatility Gerald P. Dwyer Clemson University February 2016 Outline 1 Volatility Characteristics of Time Series Heteroskedasticity Simpler Estimation Strategies Exponentially Weighted Moving Average Use

More information

Financial Econometrics and Volatility Models Extreme Value Theory

Financial Econometrics and Volatility Models Extreme Value Theory Financial Econometrics and Volatility Models Extreme Value Theory Eric Zivot May 3, 2010 1 Lecture Outline Modeling Maxima and Worst Cases The Generalized Extreme Value Distribution Modeling Extremes Over

More information

Week 1 Quantitative Analysis of Financial Markets Distributions A

Week 1 Quantitative Analysis of Financial Markets Distributions A Week 1 Quantitative Analysis of Financial Markets Distributions A Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Asymptotic distribution of the sample average value-at-risk

Asymptotic distribution of the sample average value-at-risk Asymptotic distribution of the sample average value-at-risk Stoyan V. Stoyanov Svetlozar T. Rachev September 3, 7 Abstract In this paper, we prove a result for the asymptotic distribution of the sample

More information

Financial Econometrics and Quantitative Risk Managenent Return Properties

Financial Econometrics and Quantitative Risk Managenent Return Properties Financial Econometrics and Quantitative Risk Managenent Return Properties Eric Zivot Updated: April 1, 2013 Lecture Outline Course introduction Return definitions Empirical properties of returns Reading

More information

Introduction to Algorithmic Trading Strategies Lecture 10

Introduction to Algorithmic Trading Strategies Lecture 10 Introduction to Algorithmic Trading Strategies Lecture 10 Risk Management Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Value at Risk (VaR) Extreme Value Theory (EVT) References

More information

Time Series Models for Measuring Market Risk

Time Series Models for Measuring Market Risk Time Series Models for Measuring Market Risk José Miguel Hernández Lobato Universidad Autónoma de Madrid, Computer Science Department June 28, 2007 1/ 32 Outline 1 Introduction 2 Competitive and collaborative

More information

Math 562 Homework 1 August 29, 2006 Dr. Ron Sahoo

Math 562 Homework 1 August 29, 2006 Dr. Ron Sahoo Math 56 Homework August 9, 006 Dr. Ron Sahoo He who labors diligently need never despair; for all things are accomplished by diligence and labor. Menander of Athens Direction: This homework worths 60 points

More information

SPRING 2007 EXAM C SOLUTIONS

SPRING 2007 EXAM C SOLUTIONS SPRING 007 EXAM C SOLUTIONS Question #1 The data are already shifted (have had the policy limit and the deductible of 50 applied). The two 350 payments are censored. Thus the likelihood function is L =

More information

Probabilities & Statistics Revision

Probabilities & Statistics Revision Probabilities & Statistics Revision Christopher Ting Christopher Ting http://www.mysmu.edu/faculty/christophert/ : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 January 6, 2017 Christopher Ting QF

More information

Efficient Estimation of Distributional Tail Shape and the Extremal Index with Applications to Risk Management

Efficient Estimation of Distributional Tail Shape and the Extremal Index with Applications to Risk Management Journal of Mathematical Finance, 2016, 6, 626-659 http://www.scirp.org/journal/jmf ISSN Online: 2162-2442 ISSN Print: 2162-2434 Efficient Estimation of Distributional Tail Shape and the Extremal Index

More information

Recall the Basics of Hypothesis Testing

Recall the Basics of Hypothesis Testing Recall the Basics of Hypothesis Testing The level of significance α, (size of test) is defined as the probability of X falling in w (rejecting H 0 ) when H 0 is true: P(X w H 0 ) = α. H 0 TRUE H 1 TRUE

More information

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu

More information

10.7 Fama and French Mutual Funds notes

10.7 Fama and French Mutual Funds notes 1.7 Fama and French Mutual Funds notes Why the Fama-French simulation works to detect skill, even without knowing the characteristics of skill. The genius of the Fama-French simulation is that it lets

More information

Fin285a:Computer Simulations and Risk Assessment Section 6.2 Extreme Value Theory Daníelson, 9 (skim), skip 9.5

Fin285a:Computer Simulations and Risk Assessment Section 6.2 Extreme Value Theory Daníelson, 9 (skim), skip 9.5 Fin285a:Computer Simulations and Risk Assessment Section 6.2 Extreme Value Theory Daníelson, 9 (skim), skip 9.5 Overview Extreme value distributions Generalized Pareto distributions Tail shapes Using power

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

Confidence intervals

Confidence intervals Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

APPROXIMATING THE GENERALIZED BURR-GAMMA WITH A GENERALIZED PARETO-TYPE OF DISTRIBUTION A. VERSTER AND D.J. DE WAAL ABSTRACT

APPROXIMATING THE GENERALIZED BURR-GAMMA WITH A GENERALIZED PARETO-TYPE OF DISTRIBUTION A. VERSTER AND D.J. DE WAAL ABSTRACT APPROXIMATING THE GENERALIZED BURR-GAMMA WITH A GENERALIZED PARETO-TYPE OF DISTRIBUTION A. VERSTER AND D.J. DE WAAL ABSTRACT In this paper the Generalized Burr-Gamma (GBG) distribution is considered to

More information

Distribution Fitting (Censored Data)

Distribution Fitting (Censored Data) Distribution Fitting (Censored Data) Summary... 1 Data Input... 2 Analysis Summary... 3 Analysis Options... 4 Goodness-of-Fit Tests... 6 Frequency Histogram... 8 Comparison of Alternative Distributions...

More information

3 Continuous Random Variables

3 Continuous Random Variables Jinguo Lian Math437 Notes January 15, 016 3 Continuous Random Variables Remember that discrete random variables can take only a countable number of possible values. On the other hand, a continuous random

More information

Regression: Ordinary Least Squares

Regression: Ordinary Least Squares Regression: Ordinary Least Squares Mark Hendricks Autumn 2017 FINM Intro: Regression Outline Regression OLS Mathematics Linear Projection Hendricks, Autumn 2017 FINM Intro: Regression: Lecture 2/32 Regression

More information

Solutions of the Financial Risk Management Examination

Solutions of the Financial Risk Management Examination Solutions of the Financial Risk Management Examination Thierry Roncalli January 9 th 03 Remark The first five questions are corrected in TR-GDR and in the document of exercise solutions, which is available

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 4.0 Introduction to Statistical Methods in Economics Spring 009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

CONTINUOUS RANDOM VARIABLES

CONTINUOUS RANDOM VARIABLES the Further Mathematics network www.fmnetwork.org.uk V 07 REVISION SHEET STATISTICS (AQA) CONTINUOUS RANDOM VARIABLES The main ideas are: Properties of Continuous Random Variables Mean, Median and Mode

More information

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests: One sided tests So far all of our tests have been two sided. While this may be a bit easier to understand, this is often not the best way to do a hypothesis test. One simple thing that we can do to get

More information

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods Robert V. Breunig Centre for Economic Policy Research, Research School of Social Sciences and School of

More information

Frequency Analysis & Probability Plots

Frequency Analysis & Probability Plots Note Packet #14 Frequency Analysis & Probability Plots CEE 3710 October 0, 017 Frequency Analysis Process by which engineers formulate magnitude of design events (i.e. 100 year flood) or assess risk associated

More information

Extreme Value Theory as a Theoretical Background for Power Law Behavior

Extreme Value Theory as a Theoretical Background for Power Law Behavior Extreme Value Theory as a Theoretical Background for Power Law Behavior Simone Alfarano 1 and Thomas Lux 2 1 Department of Economics, University of Kiel, alfarano@bwl.uni-kiel.de 2 Department of Economics,

More information

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Overview of Extreme Value Theory. Dr. Sawsan Hilal space Overview of Extreme Value Theory Dr. Sawsan Hilal space Maths Department - University of Bahrain space November 2010 Outline Part-1: Univariate Extremes Motivation Threshold Exceedances Part-2: Bivariate

More information

Chapter 9. Non-Parametric Density Function Estimation

Chapter 9. Non-Parametric Density Function Estimation 9-1 Density Estimation Version 1.1 Chapter 9 Non-Parametric Density Function Estimation 9.1. Introduction We have discussed several estimation techniques: method of moments, maximum likelihood, and least

More information

Fin285a:Computer Simulations and Risk Assessment Section 2.3.2:Hypothesis testing, and Confidence Intervals

Fin285a:Computer Simulations and Risk Assessment Section 2.3.2:Hypothesis testing, and Confidence Intervals Fin285a:Computer Simulations and Risk Assessment Section 2.3.2:Hypothesis testing, and Confidence Intervals Overview Hypothesis testing terms Testing a die Testing issues Estimating means Confidence intervals

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

Frequency Estimation of Rare Events by Adaptive Thresholding

Frequency Estimation of Rare Events by Adaptive Thresholding Frequency Estimation of Rare Events by Adaptive Thresholding J. R. M. Hosking IBM Research Division 2009 IBM Corporation Motivation IBM Research When managing IT systems, there is a need to identify transactions

More information

Section 5.4. Ken Ueda

Section 5.4. Ken Ueda Section 5.4 Ken Ueda Students seem to think that being graded on a curve is a positive thing. I took lasers 101 at Cornell and got a 92 on the exam. The average was a 93. I ended up with a C on the test.

More information

Econ 423 Lecture Notes: Additional Topics in Time Series 1

Econ 423 Lecture Notes: Additional Topics in Time Series 1 Econ 423 Lecture Notes: Additional Topics in Time Series 1 John C. Chao April 25, 2017 1 These notes are based in large part on Chapter 16 of Stock and Watson (2011). They are for instructional purposes

More information

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1 4 Hypothesis testing 4. Simple hypotheses A computer tries to distinguish between two sources of signals. Both sources emit independent signals with normally distributed intensity, the signals of the first

More information

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor 1. The regression equation 2. Estimating the equation 3. Assumptions required for

More information

Physics 509: Bootstrap and Robust Parameter Estimation

Physics 509: Bootstrap and Robust Parameter Estimation Physics 509: Bootstrap and Robust Parameter Estimation Scott Oser Lecture #20 Physics 509 1 Nonparametric parameter estimation Question: what error estimate should you assign to the slope and intercept

More information

Shape of the return probability density function and extreme value statistics

Shape of the return probability density function and extreme value statistics Shape of the return probability density function and extreme value statistics 13/09/03 Int. Workshop on Risk and Regulation, Budapest Overview I aim to elucidate a relation between one field of research

More information

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and

More information

Binary Logistic Regression

Binary Logistic Regression The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b

More information

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds Chapter 6 Logistic Regression In logistic regression, there is a categorical response variables, often coded 1=Yes and 0=No. Many important phenomena fit this framework. The patient survives the operation,

More information

Quantifying Weather Risk Analysis

Quantifying Weather Risk Analysis Quantifying Weather Risk Analysis Now that an index has been selected and calibrated, it can be used to conduct a more thorough risk analysis. The objective of such a risk analysis is to gain a better

More information

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Econometrics Working Paper EWP0402 ISSN 1485-6441 Department of Economics TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Lauren Bin Dong & David E. A. Giles Department

More information

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -27 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -27 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY Lecture -27 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. Summary of the previous lecture Frequency factors Normal distribution

More information

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments /4/008 Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University C. A Sample of Data C. An Econometric Model C.3 Estimating the Mean of a Population C.4 Estimating the Population

More information

Testing for Normality

Testing for Normality Testing for Normality For each mean and standard deviation combination a theoretical normal distribution can be determined. This distribution is based on the proportions shown below. This theoretical normal

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Applied Econometrics (QEM)

Applied Econometrics (QEM) Applied Econometrics (QEM) based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #3 1 / 42 Outline 1 2 3 t-test P-value Linear

More information

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015 MFM Practitioner Module: Quantitiative Risk Management October 14, 2015 The n-block maxima 1 is a random variable defined as M n max (X 1,..., X n ) for i.i.d. random variables X i with distribution function

More information

IT S TIME FOR AN UPDATE EXTREME WAVES AND DIRECTIONAL DISTRIBUTIONS ALONG THE NEW SOUTH WALES COASTLINE

IT S TIME FOR AN UPDATE EXTREME WAVES AND DIRECTIONAL DISTRIBUTIONS ALONG THE NEW SOUTH WALES COASTLINE IT S TIME FOR AN UPDATE EXTREME WAVES AND DIRECTIONAL DISTRIBUTIONS ALONG THE NEW SOUTH WALES COASTLINE M Glatz 1, M Fitzhenry 2, M Kulmar 1 1 Manly Hydraulics Laboratory, Department of Finance, Services

More information

Multivariate Statistical Analysis

Multivariate Statistical Analysis Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Addressing ourliers 1 Addressing ourliers 2 Outliers in Multivariate samples (1) For

More information

Chapter 9. Non-Parametric Density Function Estimation

Chapter 9. Non-Parametric Density Function Estimation 9-1 Density Estimation Version 1.2 Chapter 9 Non-Parametric Density Function Estimation 9.1. Introduction We have discussed several estimation techniques: method of moments, maximum likelihood, and least

More information

Regression Analysis. y t = β 1 x t1 + β 2 x t2 + β k x tk + ϵ t, t = 1,..., T,

Regression Analysis. y t = β 1 x t1 + β 2 x t2 + β k x tk + ϵ t, t = 1,..., T, Regression Analysis The multiple linear regression model with k explanatory variables assumes that the tth observation of the dependent or endogenous variable y t is described by the linear relationship

More information

Testing for Normality

Testing for Normality Testing for Normality For each mean and standard deviation combination a theoretical normal distribution can be determined. This distribution is based on the proportions shown below. This theoretical normal

More information

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Quantitative Methods for Economics, Finance and Management (A86050 F86050) Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge

More information

Independent Events. Two events are independent if knowing that one occurs does not change the probability of the other occurring

Independent Events. Two events are independent if knowing that one occurs does not change the probability of the other occurring Independent Events Two events are independent if knowing that one occurs does not change the probability of the other occurring Conditional probability is denoted P(A B), which is defined to be: P(A and

More information

Final Exam. Name: Solution:

Final Exam. Name: Solution: Final Exam. Name: Instructions. Answer all questions on the exam. Open books, open notes, but no electronic devices. The first 13 problems are worth 5 points each. The rest are worth 1 point each. HW1.

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur Lecture No. # 38 Goodness - of fit tests Hello and welcome to this

More information

Just Enough Likelihood

Just Enough Likelihood Just Enough Likelihood Alan R. Rogers September 2, 2013 1. Introduction Statisticians have developed several methods for comparing hypotheses and for estimating parameters from data. Of these, the method

More information

CE 3710: Uncertainty Analysis in Engineering

CE 3710: Uncertainty Analysis in Engineering FINAL EXAM Monday, December 14, 10:15 am 12:15 pm, Chem Sci 101 Open book and open notes. Exam will be cumulative, but emphasis will be on material covered since Exam II Learning Expectations for Final

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem. Statistics 1 Mathematical Model A mathematical model is a simplification of a real world problem. 1. A real world problem is observed. 2. A mathematical model is thought up. 3. The model is used to make

More information

:Effects of Data Scaling We ve already looked at the effects of data scaling on the OLS statistics, 2, and R 2. What about test statistics?

:Effects of Data Scaling We ve already looked at the effects of data scaling on the OLS statistics, 2, and R 2. What about test statistics? MRA: Further Issues :Effects of Data Scaling We ve already looked at the effects of data scaling on the OLS statistics, 2, and R 2. What about test statistics? 1. Scaling the explanatory variables Suppose

More information

Chapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis

Chapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis Chapter 12: An introduction to Time Series Analysis Introduction In this chapter, we will discuss forecasting with single-series (univariate) Box-Jenkins models. The common name of the models is Auto-Regressive

More information

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n = Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,

More information

Multivariate Distributions

Multivariate Distributions IEOR E4602: Quantitative Risk Management Spring 2016 c 2016 by Martin Haugh Multivariate Distributions We will study multivariate distributions in these notes, focusing 1 in particular on multivariate

More information

Exam C Solutions Spring 2005

Exam C Solutions Spring 2005 Exam C Solutions Spring 005 Question # The CDF is F( x) = 4 ( + x) Observation (x) F(x) compare to: Maximum difference 0. 0.58 0, 0. 0.58 0.7 0.880 0., 0.4 0.680 0.9 0.93 0.4, 0.6 0.53. 0.949 0.6, 0.8

More information

Statistical Inference with Regression Analysis

Statistical Inference with Regression Analysis Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing

More information

Interpreting Regression Results

Interpreting Regression Results Interpreting Regression Results Carlo Favero Favero () Interpreting Regression Results 1 / 42 Interpreting Regression Results Interpreting regression results is not a simple exercise. We propose to split

More information

VaR vs. Expected Shortfall

VaR vs. Expected Shortfall VaR vs. Expected Shortfall Risk Measures under Solvency II Dietmar Pfeifer (2004) Risk measures and premium principles a comparison VaR vs. Expected Shortfall Dependence and its implications for risk measures

More information

ACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H.

ACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H. ACE 564 Spring 2006 Lecture 8 Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information by Professor Scott H. Irwin Readings: Griffiths, Hill and Judge. "Collinear Economic Variables,

More information

ECO375 Tutorial 4 Introduction to Statistical Inference

ECO375 Tutorial 4 Introduction to Statistical Inference ECO375 Tutorial 4 Introduction to Statistical Inference Matt Tudball University of Toronto Mississauga October 19, 2017 Matt Tudball (University of Toronto) ECO375H5 October 19, 2017 1 / 26 Statistical

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

4 Testing Hypotheses. 4.1 Tests in the regression setting. 4.2 Non-parametric testing of survival between groups

4 Testing Hypotheses. 4.1 Tests in the regression setting. 4.2 Non-parametric testing of survival between groups 4 Testing Hypotheses The next lectures will look at tests, some in an actuarial setting, and in the last subsection we will also consider tests applied to graduation 4 Tests in the regression setting )

More information

CENTRAL LIMIT THEOREM (CLT)

CENTRAL LIMIT THEOREM (CLT) CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic

More information

Chapter 5 Confidence Intervals

Chapter 5 Confidence Intervals Chapter 5 Confidence Intervals Confidence Intervals about a Population Mean, σ, Known Abbas Motamedi Tennessee Tech University A point estimate: a single number, calculated from a set of data, that is

More information

Chapter 4: An Introduction to Probability and Statistics

Chapter 4: An Introduction to Probability and Statistics Chapter 4: An Introduction to Probability and Statistics 4. Probability The simplest kinds of probabilities to understand are reflected in everyday ideas like these: (i) if you toss a coin, the probability

More information

Chapter 5. Means and Variances

Chapter 5. Means and Variances 1 Chapter 5 Means and Variances Our discussion of probability has taken us from a simple classical view of counting successes relative to total outcomes and has brought us to the idea of a probability

More information

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).

More information

DISTRIBUTIONS USED IN STATISTICAL WORK

DISTRIBUTIONS USED IN STATISTICAL WORK DISTRIBUTIONS USED IN STATISTICAL WORK In one of the classic introductory statistics books used in Education and Psychology (Glass and Stanley, 1970, Prentice-Hall) there was an excellent chapter on different

More information

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes: Practice Exam 1 1. Losses for an insurance coverage have the following cumulative distribution function: F(0) = 0 F(1,000) = 0.2 F(5,000) = 0.4 F(10,000) = 0.9 F(100,000) = 1 with linear interpolation

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER.

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER. Two hours MATH38181 To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER EXTREME VALUES AND FINANCIAL RISK Examiner: Answer any FOUR

More information

Probability. Table of contents

Probability. Table of contents Probability Table of contents 1. Important definitions 2. Distributions 3. Discrete distributions 4. Continuous distributions 5. The Normal distribution 6. Multivariate random variables 7. Other continuous

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

Common ontinuous random variables

Common ontinuous random variables Common ontinuous random variables CE 311S Earlier, we saw a number of distribution families Binomial Negative binomial Hypergeometric Poisson These were useful because they represented common situations:

More information

[y i α βx i ] 2 (2) Q = i=1

[y i α βx i ] 2 (2) Q = i=1 Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation

More information

arxiv: v1 [stat.me] 2 Mar 2015

arxiv: v1 [stat.me] 2 Mar 2015 Statistics Surveys Vol. 0 (2006) 1 8 ISSN: 1935-7516 Two samples test for discrete power-law distributions arxiv:1503.00643v1 [stat.me] 2 Mar 2015 Contents Alessandro Bessi IUSS Institute for Advanced

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

Financial Econometrics

Financial Econometrics Financial Econometrics Multivariate Time Series Analysis: VAR Gerald P. Dwyer Trinity College, Dublin January 2013 GPD (TCD) VAR 01/13 1 / 25 Structural equations Suppose have simultaneous system for supply

More information

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc. Notes on regression analysis 1. Basics in regression analysis key concepts (actual implementation is more complicated) A. Collect data B. Plot data on graph, draw a line through the middle of the scatter

More information

Chapter 6. Estimation of Confidence Intervals for Nodal Maximum Power Consumption per Customer

Chapter 6. Estimation of Confidence Intervals for Nodal Maximum Power Consumption per Customer Chapter 6 Estimation of Confidence Intervals for Nodal Maximum Power Consumption per Customer The aim of this chapter is to calculate confidence intervals for the maximum power consumption per customer

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information