Module 7: Probability and Statistics

Similar documents
Summary of the lecture in Biostatistics

Lecture Notes Types of economic variables

Chapter 5 Properties of a Random Sample

Special Instructions / Useful Data

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Chapter 8. Inferences about More Than Two Population Central Values

CHAPTER VI Statistical Analysis of Experimental Data

Simple Linear Regression

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

MEASURES OF DISPERSION

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

Third handout: On the Gini Index

Econometric Methods. Review of Estimation

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

22 Nonparametric Methods.

ENGI 3423 Simple Linear Regression Page 12-01

Statistics MINITAB - Lab 5

X ε ) = 0, or equivalently, lim

ESS Line Fitting

Descriptive Statistics

2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen.

Lecture 2 - What are component and system reliability and how it can be improved?

Lecture 3. Sampling, sampling distributions, and parameter estimation

Class 13,14 June 17, 19, 2015

Functions of Random Variables

Chapter 2 Supplemental Text Material

Multiple Choice Test. Chapter Adequacy of Models for Regression

f f... f 1 n n (ii) Median : It is the value of the middle-most observation(s).

Analysis of Variance with Weibull Data

Chapter 13 Student Lecture Notes 13-1

Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests. Soccer Goals in European Premier Leagues

Multiple Linear Regression Analysis

Goodness of Fit Test for The Skew-T Distribution

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

Chapter 8: Statistical Analysis of Simulated Data

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Lecture 3 Probability review (cont d)

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

Point Estimation: definition of estimators

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

4. Standard Regression Model and Spatial Dependence Tests

Continuous Distributions

5.1 Properties of Random Numbers

Median as a Weighted Arithmetic Mean of All Sample Observations

Random Variate Generation ENM 307 SIMULATION. Anadolu Üniversitesi, Endüstri Mühendisliği Bölümü. Yrd. Doç. Dr. Gürkan ÖZTÜRK.

Chapter 14 Logistic Regression Models

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

A New Family of Transformations for Lifetime Data

Estimation of Stress- Strength Reliability model using finite mixture of exponential distributions

: At least two means differ SST

Lecture 1 Review of Fundamental Statistical Concepts

Chapter 5 Elementary Statistics, Empirical Probability Distributions, and More on Simulation

Simple Linear Regression

STA302/1001-Fall 2008 Midterm Test October 21, 2008

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

PTAS for Bin-Packing

Introduction to local (nonparametric) density estimation. methods

Random Variables and Probability Distributions

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

Qualifying Exam Statistical Theory Problem Solutions August 2005

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

Bootstrap Method for Testing of Equality of Several Coefficients of Variation

Parameter, Statistic and Random Samples

STATISTICAL INFERENCE

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULE 2

residual. (Note that usually in descriptions of regression analysis, upper-case

A NEW LOG-NORMAL DISTRIBUTION

Chapter 3 Sampling For Proportions and Percentages

Linear Regression with One Regressor

(Monte Carlo) Resampling Technique in Validity Testing and Reliability Testing

ρ < 1 be five real numbers. The

Chapter 4 Multiple Random Variables

Module 7. Lecture 7: Statistical parameter estimation

Correlation and Regression Analysis

Multivariate Transformation of Variables and Maximum Likelihood Estimation

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

The Mathematical Appendix

Probability and. Lecture 13: and Correlation

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

arxiv: v1 [math.st] 24 Oct 2016

Simulation Output Analysis

Logistic regression (continued)

Chapter 11 The Analysis of Variance

Permutation Tests for More Than Two Samples

IFYMB002 Mathematics Business Appendix C Formula Booklet

STK4011 and STK9011 Autumn 2016

Maximum Likelihood Estimation

The number of observed cases The number of parameters. ith case of the dichotomous dependent variable. the ith case of the jth parameter

Law of Large Numbers

C. Statistics. X = n geometric the n th root of the product of numerical data ln X GM = or ln GM = X 2. X n X 1


Chapter 4 Multiple Random Variables

To use adaptive cluster sampling we must first make some definitions of the sampling universe:

L5 Polynomial / Spline Curves

Correlation and Simple Linear Regression

Transcription:

Lecture 4: Goodess of ft tests. Itroducto Module 7: Probablty ad Statstcs I the prevous two lectures, the cocepts, steps ad applcatos of Hypotheses testg were dscussed. Hypotheses testg may be used to check the valdty of a hypothess about a populato parameter from a observed sample. Smlarly, t may be ecessary to check whether a observed dataset belogs to a partcular probablty dstrbuto. Frst part of ths lecture deals wth emprcal determato of probablty dstrbuto of a RV. The use of a probablty paper ths cotext alog wth ts method of costructo s dscussed here. Descrptos of Normal ad Logormal probablty paper ad probablty plot are preseted. Also, there are certa statstcal tests, kow as goodess-of-ft tests, to check the probablty dstrbuto that a dataset (sample) possbly follows. I real lfe scearos, some commo examples of goodess-of-ft tests are whether a sample of a dscrete varable follows a Posso dstrbuto, whether a sample of a cotuous varable follows a Normal dstrbuto or whether two samples are draw from detcal dstrbutos. The most commoly used tests are - Ch-square ( ) Test, Kolmogorov-Smrov K S Test ad Aderso-Darlg Test. These are dscussed at the later part of ths lecture.. Emprcal Determato of probablty dstrbuto of a RV I may real lfe scearos, the actual probablty dstrbuto of a radom process s ukow. O the bass of frequecy dstrbuto determed from observed data, some probablty dstrbuto may be assumed emprcally. Probablty papers are useful to check the assumpto of a partcular probablty dstrbuto of a radom varable. 3. Probablty Paper A probablty paper s a specally costructed plottg paper where the oe of the axes (where the RV s plotted) s a arthmetc axs ad the probablty axs s dstorted such a way that the cumulatve probablty dstrbuto of the RV plots as a straght le. It may be oted that separate probablty papers are eeded to plot the CDF of dfferet probablty dstrbutos as a straght le. For a partcular probablty dstrbuto wth dfferet parameters, oe probablty paper may be suffcet as the case of expoetal dstrbuto. For some probablty dstrbutos such as Gamma dstrbuto, separate probablty papers are eeded for each set of parameters. 4. Utltes of a probablty paper Whe a RV s expected to ft a certa probablty dstrbuto, ts observed CDF s plotted o the correspodg probablty paper. The resultg plot has the followg utltes: If the plot s a straght le, t ca be drectly cocluded that the RV follows the hypothetcal pdf.

If the plot devates from a straght le, the the locato of devato dcates the rego (eg. the tal, the mode etc) where the ft s ot good. If the plot s ot at all close to a straght le, t mples that the hypothetcal dstrbuto has to be rejected ad some other dstrbuto has to be tested for the ft. If the plot follows a straght le for a part of the rage ad the devates, t s the dcato of a chage dstrbuto beyod a certa rage of the RV. The slope ad tercept of the straght le plotted o the probablty paper ca be used to estmate the parameters of the dstrbuto. 5. Normal Probablty Paper The ormal probablty paper s costructed o the bass of stadard ormal probablty dstrbuto fucto. The radom varable X s represeted o the horzotal (or vertcal) axs arthmetc scale. The vertcal (or horzotal) axs represets two scales the stadard ormal varate Z X ad the cumulatve probablty values F X x ragg from 0 to. m The expermetal data pots are plotted usg Gumbel s plottg posto gve by N where N the total umber of observatos ad m rak of the data pot whe the observed values are arraged ascedg order. As the probablty scale s compressed ear the meda ad expaded towards the tals, hece a ormal varate X wth mea ad stadard devato plots as a straght le o ths paper. The straght le passes through Z X ad F X x 0. 5 ad has a slope of. Hece the parameters of the X dstrbuto ca be easly obtaed from the plot. Problem o Normal Probablty Paper Q. The observed stregths of 30 cocrete cubes are gve below. Check whether the stregth of cocrete cubes follows ormal dstrbuto or ot by plottg o ormal probablty paper. Determe the mea stregth ad the stadard devato. Sl o Stregth KN/m Sl o Stregth KN/m Sl o Stregth KN/m Sl o Stregth KN/m Sl o Stregth KN/m Sl o Stregth KN/m 5.4 6 7.48.08 6 4. 3.39 6 8.85 4.55 7 9.6 4.67 7 4.38 3.0 7 4.70 3 3.7 8 8.6 3 0.3 8 5.09 3 0.76 8.7 4 9.0 9 3.49 4 7.59 9 5.3 4 8.85 9 3.77 5 4.4 0 6.76 5 6.87 0 5.8 5 3.78 30.6

Sol. The observed stregths of the cocrete cubes are frst arraged ascedg order. The ther m plottg postos are determed by, where N 30 ad m rak of the observed data N pot whe arraged ascedg order of ther values. The ormal probablty plot s prepared ad the data s foud to plot almost as a straght le. Thus, the stregth of the cocrete cubes follows ormal dstrbuto. 0.99 0.98 Normal Probablty Plot 0.95 0.90 Probablty 0.75 0.50 0.5 0.0 0.05 0.0 0.0 6 8 0 4 6 8 30 3 Stregth (KN/m ) From the plot, the value of stregth correspodg to cumulatve probablty of 0.5 s 3 KN / m. Thus the mea stregth s 3 KN / m. The stadard devato s gve by the verse of the slope of the straght le. It s obtaed roughly as 8 KN / m. 6. Log Normal Probablty Paper The log ormal probablty paper dffers from a ormal probablty paper as follows: The horzotal axs for the radom varable X s logarthmc scale stead of arthmetc lx / x scale. The stadard ormal varate Z s gve by Z m where x m s the meda of X.

7. Geeral Probablty Paper ad Probablty plot For ay probablty dstrbuto, the probablty paper may be costructed by detfyg a sutable stadard varate Z. The use of the stadard varate esures that the costructed probablty paper s depedet of the parameters of the dstrbuto. The stadard varate Z s represeted o the vertcal axs arthmetc scale. The cumulatve probablty values are also represeted o the vertcal axs. The radom varable X s represeted o the horzotal axs arthmetc scale. The plottg posto of the expermetal data pots ca be obtaed by a umber of methods (eg. Gumbel, Haze etc). As per the Gumbel method, plottg m posto s gve by where N the total umber of observatos ad m rak of the N data pot whe the observed values are arraged ascedg order. If the plotted data pots gve rse to a straght le o the paper, the the data pots belog to the partcular probablty dstrbuto for whch the paper s costructed. Depedg o the stadard varate selected, the parameters of the dstrbuto ca be obtaed from the slope, tercept etc of the plot. 8. Goodess-of-ft tests As metoed the troducto, certa statstcal tests, kow as goodess-of-ft tests, are used to check the probablty dstrbuto that a dataset (sample) possbly follows. The most commoly used tests are - Ch-square ( ) Test, Kolmogorov-Smrov K S Test ad Aderso-Darlg Test. These are dscussed ths secto oe after aother. 8. Ch-square test The Ch Square Dstrbuto s used for testg the goodess of ft of a set of data to a specfc probablty dstrbuto. For ths, observed ad hypothetcal frequeces that follow the specfc probablty dstrbuto are compared. It ca be used for the both dscrete or cotuous radom varables. Let us cosder a sample cotag observed values of a radom varable. Let O, O, O3,..., O k be the k observed frequeces of the varates ad the correspodg frequeces from a assumed theoretcal dstrbuto be E, E, E3,..., Ek. It s requred to test whether the dffereces betwee the observed ad expected frequeces are sgfcat. Thus the goodess s checked as X k O E E As k approaches to fty, the samplg dstrbuto of k s the degrees of freedom. X teds to a v dstrbuto wth The test for goodess of ft s geerally relable f 5 k ad E 5. It may be oted that most cases the parameters of theoretcal dstrbuto are ot kow. Hece, the parameters

should be estmated from the data tself ad the statstc remas vald f the degree of freedom s reduced by oe for every ukow parameter. If k O E E C, v where C, v s value of approxmate v dstrbuto at cumulatve probablty, the the assumed theoretcal dstrbuto s a acceptable model at the sgfcace level. Problem o ch squared dstrbuto Q. Cosder a gve stato a watershed where the severe rastorms are recorded over a perod of 70 years. Out of these 70 years, years were wthout severe rastorms ad 5, 4, 6, 3 years wth,, 3 ad 4 rastorms aually. Test whether the data ca be assumed to follow Posso dstrbuto at 5% sgfcace level. Sol. Average occurrece rate of rastorms 54 63 3 4 70.857 rastorms / year Now to check the goodess of ft, we use the ch-square dstrbuto at 5% sgfcace level As the dataset s qute small, the data the class four storms/year s combed to the class of three storms/year Null hypothess H 0 : The radom varable has a Posso dstrbuto wth. 897. Alterate hypothess H : The radom varable does ot follow the dstrbuto specfed ull hypothess. Level of sgfcace: 0. 05 Here 4 k ; degree of freedom, k Crtcal rego v,0.05

From ch square table, 5. 99,0.05 No of storms/year Observed frequecy O Theoretcal frequecy E O E / E 0.309 0.09 5 5.348 0.0046 4 5.075 0.0767 3 9 8.80 0.066 Total 70 70 0.668 Thus, we get k O E 0.668 5.99,0.05 E Hece, the Posso dstrbuto s a vald model at 5% sgfcace level Decso: The ull hypothess ca ot be rejected at 5% sgfcace level 8. Kolmogorov-Smrov (KS) Test Ths test s also most commoly used to check the valdty of the assumed model for cotuous radom varables. It relates to the CDF rather tha the pdf of cotuous varables. It compares the observed or data based cumulatve frequecy wth assumed theoretcal cumulatve dstrbuto. Let the cotuous varable be X ad x, x,... x represet the ordered sample of sze, the values arraged creasg order. Now from ths ordered set the emprcal or sample dstrbuto fucto S x s developed. Ths fucto s a step fucto. Thus the cumulatve frequecy step fucto s defed as S x s the step fucto ad x S 0 x x k x) xk x xk ; k,,..., x x ( F s the proposed theoretcal dstrbuto. The dscrepacy betwee the theoretcal model ad the observed data s computed ad the maxmum dfferece D betwee S x ad F x over the etre rage of x s obtaed.

D max x F x S x S (x) F(x) 0.9 0.8 0.7 0.6 F(x) 0.5 0.4 0.3 0. S (x) D 0. 0-0 -5-0 -5 0 5 0 x Fg. S (x) ad F(x) Thus for a specfed sgfcace level,the wth the crtcal value D K S test compares the maxmum dfferece D s defed as P D D If the observed value s less tha the crtcal value, the the proposed dstrbuto s vald at the sgfcace level. 8.. Advatage of KS Test As Ch-Square Test, dvso of data to tervals s ot ecessary ths case. The test statstc s dstrbuto free ulke that of Ch-Square Test. K S test works for o-ormal data also. However, the test may fal f the data s too far from ormal. If the sample dstrbuto s large, Smrov has gve the lmtg dstrbuto of D as lm For 50 for 0. 05 ad 0. 0 P D z exp k z k 8z D. 36 ad D. respectvely

Problem o KS test The data of fracture toughess of pla cocrete specmes made wth burt brck aggregates s show the table ext slde. The data appears to fall approxmately a straght le o a Normal probablty paper wth N 0.540,0.05. Perform the Kolmogorov-Smrov test at 5% sgfcace level to statstcally justfy the assumpto for the gve data. Fracture toughess (MPa m) of pla cocrete specmes ( creasg order) m K m IC K m IC K IC 0.45 0 0.508 9 0.557 0.48 0.53 0 0.59 3 0.484 0.53 0.59 4 0.484 3 0.538 0.60 5 0.489 4 0.538 3 0.605 6 0.494 5 0.544 4 0.6 7 0.494 6 0.548 5 0.658 8 0.494 7 0.548 Sol. Null hypothess H 0 : The radom varable has a Normal Dstrbuto Alterate hypothess H : The radom varable does ot have the specfed dstrbuto. Level of sgfcace: 0. 05 0.05 Crtcal rego (from table) D D 5 0. 64 The cumulatve frequecy of the gve data s plotted the followg fgure wth respect to the equato of K S Test. The theoretcal dstrbuto fucto of ormal model s also show.

Cumulatve dstrbuto of fracture toughess 0.9 0.8 0.7 S 0.6 N(0.540,0.05) CDF 0.5 0.4 0.3 D 0. 0. 0 0.45 0.5 0.55 0.6 0.65 0.7 K Fg. S (x) ad F(x) From the fgure, the maxmum dscrepacy of two fuctos, D max 0. 348 occurrg at K IC 0.5080 MPa. m.e. the maxmum dscrepacy 0.348 s less tha the crtcal value 0.64. Therefore the ull hypothess caot be rejected at 5% sgfcace level. Thus, the N 0.540,0.05 s a vald model at 5% sgfcace level. model 8.. Kolmogorov-Smrov two-sample test The same test used the case of oe sample test ca be used to evaluate whether two samples come from the same dstrbuto. Let the maxmum absolute dfferece betwee two emprcal dstrbuto fuctos be D m, Let the two fuctos be represeted as step fuctos samples of szes m ad, respectvely. G m x ad x S based o two Thus the dfferece becomes D m, max x G m x S x

0.9 0.8 0.7 0.6 F(x) 0.5 0.4 0.3 D m, 0. 0. 0 -.5 - -.5 - -0.5 0 0.5.5 x Fg 3. Kolgomorov-Smrov two-sample goodess-of-ft test If the sample dstrbuto have large values of m ad, Smrov has gve the lmtg dstrbuto as lm P Dm, z, exp m m z k 8 m k z (replacg wth m m ) Problem o two-sample test Q. The table showg modulus of rupture data for two dfferet groups of tmber s show ext slde. Suppler delver tem two lots. The frst lot cossts of 50 samples ad the secod lot cossts of 30 samples. Both the lots were suppled by a same suppler who clams the secod lot to be superor to the frst lot. Apply the Kolmogorov-Smrov two-sample test to verfy whether the two samples are of same type (from the same populato).

LOT A (Modulus of Rupture N / mm ) 35.3 33.8 30.05 3.68 6.63 36.85 36.8 36.38 34.44 3.5 7.9 38.8 37.78 35.88 8.46 4.55 9.9 35.03 37.5 30.33 8.7 7.83 34.63 33.47 38.05 3.33 3.5 33.06 3.48 34.56 3.37 7.93 36.47 34. 36 3.56 30.0 38.64 35.58 37.65 8 33.7 8.98 36.9 8.83 5.39 8.76 3.0 33.6 3.4 LOT B (Modulus of Rupture N / mm ) 33.9 34.4 8.97 35.89 8.69 36.53 35.7 39.33 37.69 3.6 38.7 9. 5.88.87 3.76 34.49 7. 36.88 5.9 38 9.93 3.03 5.84 35.67 33.9 38.6 8.3 30.53 33.4 39. Sol. Null hypothess H 0 : The radom varables sampled by the frst 50 values ad the radom varables sampled by the ext 30 values have the same dstrbuto. Alterate hypothess H : The radom varables have dfferet dstrbutos. Level of sgfcace: 0. 05.

Calculatos: The data from each sample are raked separately wth values of the step fuctos x S x. G m ad The samples are sorted creasg order ad raked accordgly for both samples ad G m x ad S x are determed as show the followg table. The the step fucto of both samples are plotted. The maxmum absolute dfferece betwee the emprcal dstrbuto s the determed. Rak x at lot A MR at A Rak x at lot A MR at A 0.0 7.83 6 0.5 33.06 0.04 3.5 7 0.54 33.8 3 0.06 3.5 8 0.56 33.47 4 0.08 3.37 9 0.58 33.6 0.44 3.0 47 0.94 37.78 3 0.46 3.4 48 0.96 38.05 4 0.48 3.48 49 0.98 38.64 5 0.5 3.68 50 38.8 Rak x at B MR at B Rak x at B MR at B 0.033333.87 6 0.533333 33.9 0.066667 5.9 7 0.566667 33.9 3 0. 5.84 8 0.6 34.4. 0.4 3.6 7 0.9 38.6 3 0.433333 3.03 8 0.933333 38.7 4 0.466667 3.76 9 0.966667 39. 5 0.5 33.4 30 39.33 m 5030 The maxmum dfferece s D 0. ad 8. 75. m 50 30

0.9 0.8 S(x) G(x) 0.7 0.6 F(x) 0.5 0.4 0.3 0. 0. 0 5 0 5 30 35 40 x The crtcal value s 0.30 (from table). Fg 4. S(x) ad G(x) As 0. s less tha 0.30, the ull hypothess caot be rejected at 5% sgfcace level. 0. Cocludg Remarks I ths lecture, commoly used goodess of ft tests such as Ch-square ( ) test, Kolmogorov-Smrov K S test ad Aderso-Darlg test are dscussed. Example problems usg these tests are also preseted here. The ext lecture troduces regresso aalyses ad correlato.