COMPARISON OF GEOSTATISTICAL METHODS FOR ESTIMATING THE AREAL AVERAGE CLIMATOLOGICAL RAINFALL MEAN USING DATA ON PRECIPITATION AND TOPOGRAPHY

Size: px
Start display at page:

Download "COMPARISON OF GEOSTATISTICAL METHODS FOR ESTIMATING THE AREAL AVERAGE CLIMATOLOGICAL RAINFALL MEAN USING DATA ON PRECIPITATION AND TOPOGRAPHY"

Transcription

1 INTERNATIONAL JOURNAL OF CLIMATOLOGY It. J. Climatol. 18: (1998) COMPARISON OF GEOSTATISTICAL METHODS FOR ESTIMATING THE AREAL AVERAGE CLIMATOLOGICAL RAINFALL MEAN USING DATA ON PRECIPITATION AND TOPOGRAPHY EULOGIO PARDO-IGÚZQUIZA* TAMSAT, Departmet of Meteorology, Uiersity of Readig, Readig RG6 6BB, Berkshire, Eglad Receied 7 August 1997 Reised 22 February 1998 Accepted 28 February 1998 ABSTRACT The results of estimatig the areal average climatological raifall mea i the Guadalhorce river basi i souther Spai are preseted i this paper. The classical Thiesse method ad three differet geostatistical approaches (ordiary krigig, cokrigig ad krigig with a exteral drift) have bee used as estimators ad their results are compared ad discussed. The first two methods use oly raifall iformatio, while cokrigig ad krigig with a exteral drift use both precipitatio data ad orographic iformatio (easily accessible from topographic maps). I the case study preseted, krigig with a exteral drift seems to give the most coheret results i accordace with cross-validatio statistics. If there is a correlatio betwee climatological raifall mea ad altitude, it seems logical that the iclusio of topographic iformatio should improve the estimates. Krigig with a exteral drift has the advatage of requirig a less demadig variogram aalysis tha cokrigig Royal Meteorological Society. KEY WORDS: techiques; Thiesse method; ordiary krigig; cokrigig; krigig with a exteral drift; raifall; areal average; Guadalhorce river basi; Spai; geostatistical methods 1. INTRODUCTION Most of the water received by a river basi occurs as raifall evets over the basi. Differet quatities are of iterest, from total raifall i a sigle storm to log-term aggregates over time. Amog the latter, mea aual raifall is used widely i differet applicatios, ad varies from year-to-year. The mea of the aual raifall meas for all the years available i the historical record is kow as the climatological mea ad gives the expected aual raifall. However the climatological mea is usually kow oly at the locatios of raigauge statios; to estimate it at usampled locatios over the whole basi (ad the expected total water iput by raifall), a procedure is required to calculate the average spatial climatological raifall mea. To calculate average spatial statistics i climatology, the Thiesse method (Thiesse, 1911) was the stadard for about half a cetury util the developmet of geostatistics i the middle 1960s (Mathero, 1963). The geostatistical method of spatial estimatio, kow geerically as krigig, takes ito accout the spatial correlatio betwee experimetal data through the variogram fuctio. Krigig improves o the Thiesse method i two ways: (i) More precise estimates due to usig the iformatio from surroudig samples. (ii) It miimises the estimatio variace which i tur provide a measure of the ucertaity of the estimates. * Correspodece to. Departmet of Meteorology, Uiversity of Readig, Readig RG6 6BB, UK; tel.: ; fax: ; E.Pardo@readig.ac.uk Cotract grat sposor: Europea Commuity Traiig Fellowship (part of the Traiig ad Mobility of Researches (TMR) programme) CCC /98/ $ Royal Meteorological Society

2 1032 E. PARDO-IGÚZQUIZA Thus krigig has curretly become the stadard method of estimatig average raifall values at sites with o records. Early applicatios of krigig i raifall estimatio were described by Delhomme ad Delfier (1973). Sice the there have bee may applicatios ad differet kids of krigig have also bee developed. For example, multivariate techiques to deal with at least two variables that are coregioalized. This is the case where the raifall patter i a area is iflueced by physiography, i.e. orographic raifall. The umber of raigauges is usually small ad their spatial distributio sparse while, topographical iformatio is widely available. The, if the raifall is coregioalized with altitude, the topographical data may be used to improve the precisio of the raifall estimates. The aim of this paper is to show the results of the applicatio of differet geostatistical methods to estimate the areal average climatological mea usig precipitatio data ad topography. A review of the geostatistical techiques ad the results of their applicatio to the Guadalhorce river basi i the south of Spai are preseted ad discussed i the followig sectios. 2. THEORY REVIEW ON GEOSTATISTICAL METHODS Give a river basi of area A (Figure 1(a)), the expected aual iput Q by raifall is give by: Q= A p(x)dx (1) Figure 1. River basi (a) ad discretized river basi (b)

3 GEOSTATISTICAL METHODS AND CLIMATOLOGICAL RAINFALL 1033 where p(x) is the value of the climatological raifall mea at spatial locatio x (x deotig the coordiates of a poit i the plae). p(x) is oly kow at those poits where raigauge statios are located, ad the it must be estimated at the o- sampled poits. I practice, the itegral (1) is evaluated usig the discrete sum approximatio: N Q= p B (x i ) (2) where the area A of the river basi has bee discretized i N cells of equal area B (Figure 1(b)); ad p B (x i ) is the areal average climatological raifall mea over cell B with cetroid x i. It is clear that p B (x i ) is a areal average estimate (also referred to as a block estimate): p B (x i )= 1 B B(x i ) p(x) dx where B is called the support of the estimate, beig: N A= B(x i ) (4) the whole regio, ad B(x i ) deotes the support B at the locatio x i. Block krigig provides estimates of Equatio (3) from the experimetal data with puctual support (for example data measured i raigauge statios): pˆb(x i )= j p(x j ) (5) with p(x j ) beig the climatological mea at locatio x j. I equatio (5) oly raifall data (the climatological mea) are used. There are differet possibilities for performig such a estimatio usig geostatistics; there are also differet ways of icorporatig the topographical iformatio for the purpose of improvig estimates pˆb(x i ), ad the Q. The methods of estimatio are briefly explaied below Thiesse method The Thiesse polygos method is equivalet to the earest eighbor method; all the poits iside the polygo have the same value which is equal to the closest sample value; p(x) i Equatio (3) is the estimated as the earest experimetal poit. I practice, this is solved by the well-kow method of drawig Vorooï polygos aroud each experimetal data. However, a easier procedure has bee adopted here as it is more suitable for computatio. The support B is subdivided ito smaller square uits ad each small uit is estimated by the earest eighbor method. If each subuit is sufficietly small this procedure gives a good approximatio to the classical triagulatio method Ordiary krigig (OK) I ordiary krigig the estimate pˆb(x o ) is give by equatio (5) where the weights i are obtaied by solvig to the ordiary krigig system (Jourel ad Huijbregts, 1978): j p (x i, x j )+= p(x i, B(x o )) j =1 Ad the estimatio variace is give by: ;,..., (6) (3)

4 1034 E. PARDO-IGÚZQUIZA ˆ 2= i p(x i, B(x o ))+ (7) where, Lagrage multiplier; p (x i, x j ), variogram fuctio betwee poits x i ad x j ; p(x i, B(x o )), mea variogram fuctio betwee poit x i ad support B with cetroid x o ; ad p, deotes variogram of the climatological raifall mea. The two previous methods use oly raifall data (the climatological mea). Iformatio o altitude may be icluded i the estimatio procedure i differet ways, two of them beig cokrigig ad krigig with a exteral drift Cokrigig (CoK) The CoK estimate of p B (x o ) is give by: pˆb(x o )= m i p(x i )+ j y(x j ) (8) where {y(x j );,...,m} are experimetal data o altitude. The weights i, j are obtaied as solutio of the CoK system: m m j p (x i, x j )+ k py (x i, x k )+ 1 = p(x i, B(x o )) k=1 m j py (x k, x j )+ t Y (x k, x t )+ 2 = py (x k, B(x o )) t=1 j =1 j =0,..., k=1,...,m (9) with estimatio variace: ˆ 2= m i p(x i, B(x o ))+ k py (x k, B(x o ))+ 1 (10) k=1 where: 1, 2, Lagrage parameters; p (h), the variogram fuctio of the climatological mea; Y (h), variogram of altitude; py (h), cross-variogram of climatological mea ad altitude. The bar over the variogram deotes the mea variogram betwee a poit ad a support B Krigig with a exteral drift (KED) I KED the mathematical expectatio of the climatological raifall mea radom fuctio P(x) is expressed as a liear fuctio of altitude Y(x): E[P(x)]=a 1 +a 2 Y(x) (11) The estimate pˆb(x o ) has the form give i Equatio (4), with the weights i obtaied as solutios of the KED system (Ahmed ad De Marsily, 1987; Hudso ad Wackeragel, 1994):

5 GEOSTATISTICAL METHODS AND CLIMATOLOGICAL RAINFALL 1035 j p (x i, x j ) y(x i )= (x i, B(x o )) j y(x j )=y(x o ) with estimatio variace:,..., (12) ˆ 2= i p(x i, B(x o )) y(x o ) (13) where all the otatio has bee defied previously. It is clear from Equatio (12) how the values of the coefficiets a 1 ad a 2 are ot eeded; it is ot ecessary to kow them i order to apply KED. I order to apply KED it is ecessary to kow the value of the mea altitude of the block cetered at each experimetal locatio, as well the mea altitude of the block to be estimated. While CoK requires the variograms of climatological mea ad raifall as well as the crossvariogram, KED ad OK require oly the variogram of the climatological mea. 3. CASE STUDY The study area is the Guadalhorce river basi, located betwee logitude 4 ad 6 west ad betwee latitude 36 ad 38 orth (Figure 2). The area of the basi is discretized to squares of 4 km side, ad is 2864 Km 2. There are 51 raigauge statios located withi or aroud the river basi, as depicted i Figure 3; i the figure the coordiates are give i km with a arbitrary origi. The climatological mea (mea aual raifall calculated from historical records coverig more tha 20 years i all cases), ad the altitude are kow for every raigauge statio. Figure 4 shows the climatological mea plotted agaist altitude; the liear correlatio coefficiet is O average, the climatological mea icreases with altitude statistically. The physical explaatio of this correlatio coefficiet is complex because there are differet meteorological factors that ifluece the raifall patter i this basi, amog them the wid patters ad the effect of the Mediterraea Sea i the south of the basi (Figure 2). Figure 5 shows a cotour map of the mea altitude over blocks of 4 4 km calculated from a topographical map by averagig the poit altitudes of a regular grid of poits over each block. The highest altitudes are located at the wester ad easter borders of the river basi. The expected aual resources iput i the basi is obtaied by the evaluatio of pˆb(x i ) for N=179 blocks of 4 4 Km 2 i Equatio (2). The cotour maps of the estimates usig Thiesse polygos, OK, CoK ad KED are show i Figure 6(a d), respectively. I the figures, the climatological raifall mea is give i meters ad the isohyet iterval is 50 mm (0.05 m). The average areal climatological mea over the whole river basi is 586.9, 591.1, ad mm for the four methods, respectively. Thus the estimates of the expected aual iput of water resources are 1681, 1693, 1738 ad 1812 hm 3 obtaied for Thiesse, OK, CoK ad KDE, respectively. I decreasig order of the magitude of their estimates the estimators are KED, CoK, OK ad Thiesse; KED gives 4% more aual iput resources tha CoK, 7% more tha OK ad 8% more tha the Thiesse method; CoK gives 3% more iput resources tha OK ad the Thiesse method. The differece betwee OK ad the Thiesse method is less tha 1%. Figure 6 shows a great similarity betwee the Thiesse estimates (Figure 6(a)) ad OK estimates (Figure 6(d)). The isohyet map of KED (Figure

6 1036 E. PARDO-IGÚZQUIZA Figure 2. Locatio of Guadalhorce river basi i the south of Spai 6(d)) has greater similarity with the topography (Figure 5) ad the CoK isohyet map also resembles the topography but to a lesser extet tha KED. For the applicatio of CoK ad KED, the values used have bee 309 of mea altitudes over blocks 4 4 km which correspods to a grid of blocks over the area represeted i Figure 3, after discoutig the blocks over the sea with altitude 0. The experimetal variograms of climatological mea ad altitude ad the experimetal cross-variogram betwee climatological mea ad altitude, as well as the fitted models used i the differet krigig systems, are show i Figure 7(a c). The correspodet models are: p (h)= Sph(h) 40 (14) Y (h)= Sph(h) 40 (15) py (h)= Sph(h) 40 (16) where c Sph(h) a deotes a spherical variogram with sill c ad rage a. These variograms have bee estimated by fittig a model to the oparametric variogram estimates show i Figure 7. The variograms have bee modelled by the same spherical structure i order to apply the liear model of coregioalizatio i CoK (Jourel ad Huijbregts, 1978). The models fitted to the oparametric estimates (Equatios 14 16) are i agreemet well with the parameters estimated by maximum likelihood (Pardo-Igúzquiza, 1997), the results of which are show i Table I. From the results i Table I several cosideratios may be extracted: The Akaike iformatio criterio (Akaike, 1974) suggests that the models without ugget effect for both regioalized variables, climatological mea ad topography are more plausible; ad, eve whe the ugget effect is estimated, its value is very small i relatio to the variace.

7 GEOSTATISTICAL METHODS AND CLIMATOLOGICAL RAINFALL 1037 Figure 3. Guadalhorce river basi ad locatio of the 51 raigauge statios Figure 4. Climatological raifall mea versus altitude

8 1038 E. PARDO-IGÚZQUIZA Figure 5. Cartography of the mea altitude over block (4 4 Km) support The maximum likelihood estimates for the rage are 39.3 km with stadard error of 5.33 km, ad 34.7 km with stadard error of 4.5 km for climatological raifall ad altitude, respectively. The maximum likelihood estimates for the variace are m 2 with stadard error of m 2 ad km 2 with stadard error of km 2. The model Equatio (14) for the climatological raifall variogram is coheret with the model estimated by maximum likelihood. For topography the model suggested by maximum likelihood has a smaller rage but the model give i Equatio (15) is still withi the 95% cofidece iterval for the rage estimated by maximum likelihood. The sample size (51 locatios i Figure 3) is large from a hydrological poit of view; to have 51 raigauges iside a 80 78=6240 km 2 (area of the rectagle of Figure 3) is a high desity etwork. From a statistical poit of view, it is still a adequate umber of data; Pardo-Igúzquiza (1997) shows how with more tha 30 data, the maximum likelihood estimator gives good results for the iferece of spatial covariace parameters. What is more importat, a stadard error is attached to the estimated parameters. The ucertaity of the climatological raifall mea estimates are assessed by the estimatio variace, which is ot available for the Thiesse method. A represetatio of the square root of the estimatio variace (the S.D. of the estimatio error) kow as krigig error, or stadard error, has bee represeted i Figure 8(a c) for OK, CoK ad KED, respectively. The stadard error of the global average (the average for the whole river basi) climatological mea is 79.86mm, mm ad mm for OK, CoK ad KED, respectively (see Appedix 1). The stadard errors for the global average are almost the same, but whe we cosider the local stadard errors, as ca be see from Figure 8(a c), the differeces are oticeable. KED gives a larger krigig error tha CoK ad OK, while OK gives a krigig error larger tha CoK. The latter case is perfectly logical because CoK uses additioal iformatio o altitude as well as the variogram of altitude, ad the cross- variogram betwee climatological mea ad altitude; a decrease of the ucertaity i the estimates is to be expected.

9 GEOSTATISTICAL METHODS AND CLIMATOLOGICAL RAINFALL 1039 Figure 6. Cartography of the areal average climatological raifall mea estimated by: (a) Thiesse; (b) OK; (c) CoK; ad (d) KED

10 1040 E. PARDO-IGÚZQUIZA Figure 7. Experimetal (*) ad model (cotiuous lie) for: (a) climatological mea; (b) altitude; ad (c) cross-variogram betwee climatological mea ad altitude

11 GEOSTATISTICAL METHODS AND CLIMATOLOGICAL RAINFALL 1041 Table I. Variogram parameters estimated by maximum likelihood Regioalized variable Rage S.E. Nugget S.E. Variace S.E. NLLF effect AIC Raifall NA NA Raifall Altitude NA NA Altitude S.E.: stadard error; NLLF: egative loglikelihood fuctio; NA: ot applicable. Rage i km; variace ad ugget effect i m 2 ad km 2 for raifall ad ugget effect, respectively. AIC: Akaike iformatio parameter; AIC= log L+2k=NLLF+2k, where L: likelihood fuctio, NLLF: egative loglikelihood fuctio ad k is the umber of parameters fitted to the model (equal to 3 (mea, rage, variace) ad 4 whe the ugget effect is fitted (mea, rage, variace, ugget effect). I the KED system give i Equatio (12), the variogram of the climatological mea p (h) is used, though the variogram that should be used is the variogram of the ukow residuals r(x i )=p(x i ) a 1 a 2 y(x i ). This approximatio does ot modify the estimates of p(x), but overestimates the variace of the estimatio error (Ahmed ad De Marsily, 1987). 4. CROSS-VALIDATION RESULTS Cross-validatio is frequetly used to validate or compare variogram models. The procedure, as is well-kow, cosists of the estimatio of each experimetal value usig the remaider of the experimetal iformatio (i.e. excludig itself). This makes the calculatio of the true error i the estimatio procedure possible: i =pˆ (x i ) p(x i ) (17) where: is the error; pˆ (x) is the estimated value ad p(x) is the true value. Thus, repeatig this estimatio for the umber of experimetal data, the followig cross-validatio statistics may be calculated. Mea error: ME= 1 i mea squared error MSE= 1 i 2 (19) mea stadardized square error MSSE= 1 i 2 ˆ i2 (20) It is assumed that if the variogram model is correct, ME should be almost zero, MSE a small value ad MSSE close to 1. It should be oted that ME is useless, as ME=0 is imposed i the krigig system by the ubiased coditios, o matter which variogram model is used. The value of cross-validatio has bee criticized by some authors (Jourel ad Rossi, 1989). If cross-validatio works efficietly, it ca be used for estimatio purposes. This is the miimum iterpolatio error method of Lebel ad Basti (1985), which etails choosig the values of the variogram parameters that give miimum MSE ad MSSE equal to 1. Pardo-Igúzquiza (1998) shows that whe five variogram estimators are compared, the oe usig cross-validatio (the previous method) gives the worst results. Therefore cross-validatio must be used with cautio, small differeces i the cross-validatio results may be give just by samplig variability. The cross-validatio statistics for OK, CoK ad KED with the 51 experimetal data of climatological (18)

12 1042 E. PARDO-IGÚZQUIZA Figure 8. Cartography of the krigig error for: (a) OK; (b) CoK; ad (c) KED

13 GEOSTATISTICAL METHODS AND CLIMATOLOGICAL RAINFALL 1043 Table II. Cross-validatio statistics ME MSE MSSE OK CoK KED mea are show i Table II. The KED approach gives the most cogruet results, with the smallest MSE ad the MSSE closest to 1. The values of MSSE for OK ad CoK suggest that the sill of the variograms was uderestimated, but that the estimatio variace obtaied by KED was ot. If the distributio of the errors is Gaussia (which is geerally the case, at least approximately by the cetral limit theorem), 95, 68, 38 ad 19% cofidece itervals for the climatological raifall mea estimates may be costructed as [pˆ 2ˆ ], [pˆ ˆ ], [pˆ 0.5ˆ ] ad [pˆ 0.25ˆ ], where pˆ is the estimate ad ˆ is its krigig error. It ca the be calculated i which percetages from amog the previous radom itervals the true values are cotaied. Because there are 51 experimetal data, the expected umber of times that the radom itervals will cotai the true values are 48, 35, 19 ad 10 for the previous itervals, respectively. The results for OK, CoK ad KED are show i Table III. The best results are obtaied with KED. Other represetatios of the cross-validatio results are give i Figures 9 ad 10. I Figure 9(a d), the estimated values versus the true values of climatological mea are represeted for Thiesse, OK, CoK ad KED, respectively. The poits should be aroud the 45 lie. Agai the best results are obtaied for KED with a liear correlatio coefficiet of The liear correlatio coefficiet for Thiesse, OK ad CoK are 0.75, 0.74 ad 0.77, respectively. Figure 10(a d) show the errors give by equatio (17) versus the estimated values. The magitude of the errors should be idepedet of the magitude of the estimated values. All the methods yield the larger errors whe the magitude of the estimate is very high, but KED gives roughly the best results. 5. CONCLUSIONS Amog the geostatistical methods used i the estimatio of the areal average climatological raifall mea o a river basi i the south of Spai, ordiary krigig uses oly values o the raifall while cokrigig ad krigig with exteral drift use both iformatio o raifall ad o topography. Cokrigig has more requiremets tha krigig with a exteral drift, as two direct variogram models ad a crossvariogram model eed to be fitted istead of oly oe direct variogram model i krigig with a exteral drift ad ordiary krigig. Cosiderig the cross-validatio results, krigig with a exteral drift seems to give the most coheret results i the case study preseted; therefore, the estimates of the expected total resources iput over the Guadalhorce river basi give by krigig with a exteral drift ca be cosidered the most appropriate. Table III. Results from radom cofidece itervals [ẑ2ˆ ] (%) [ẑˆ ] (%) [ẑ0.56ˆ ] (%) [ẑ0.25ˆ ] (%) Theoretical 45 (95) 35 (68) 19 (38) 10 (19) OK 42 (82) 29 (56) 18 (35) 8 (15) CoK 41 (80) 24 (47) 15 (29) 11 (21) KED 47 (92) 32 (62) 20 (39) 11 (21)

14 1044 E. PARDO-IGÚZQUIZA Figure 9. Estimated value versus true value for: (a) Thiesse method; (b) OK; (c) CoK; ad (d) KED

15 GEOSTATISTICAL METHODS AND CLIMATOLOGICAL RAINFALL 1045 Figure 10. Error vs. estimated value for: (a) Thiesse method; (b) OK; (c) CoK; ad (d) KED

16 1046 E. PARDO-IGÚZQUIZA ACKNOWLEDGEMENTS The work of the author has bee supported by a Europea Commuity Traiig Fellowship i the Departmet of Meteorology (Readig Uiversity, UK) as part of the Traiig ad Mobility of Researches (TMR) programme. The author would like to thak the reviewers for providig very costructive commets. APPENDIX A. CALCULATION OF THE ESTIMATION VARIANCE OF THE GLOBAL MEAN FROM COMBINATION OF LOCAL ESTIMATION VARIANCES Figure A1 shows a area V composed of N elemetary uits of equal area u i : N V= u i (A1) Over the whole area V, there is defied a regioalized variable Z that is oly kow i experimetal locatios. There are observatios that are assumed as realizatios of radom variables {Z i, i= 1,...,}. Let Z V be the global mea that is give by Z V = 1 N Z V= 1 N N u Z i N where Z V = Z u i ad Z iu is the mea value of Z i the i-thm cell of size (or support) u ad estimated as a weighted sum of the experimetal data o poit support: Z iu = i k Z k (A3) k=1 i where k is the weight give to the k-thm experimetal data i the estimatio the mea value of Z over the support u i ad calculated as the solutio of the krigig system. Krigig also provides the variace of the estimatio error: s i2 =var(r iu )=var(z iu Z iu )=E{[Z iu Z iu ] 2 } Although a ubiased estimate of global mea i Equatio (A2) ca be easily foud from the local estimates i Equatio (A3) as: Z V = 1 N Z V= 1 N N u Z i (A2) (A4) (A5) Figure A1. Global domai V as sum of N=6 equal elemetary domais of support u ad =5 experimetal values () of puctual support

17 GEOSTATISTICAL METHODS AND CLIMATOLOGICAL RAINFALL 1047 the calculatio of the variace of the estimatio error of the global mea is ot so straightforward, i.e. because the estimatio variaces of the differet u uits are correlated the: var(r V )=var(z V Z V )=E{[Z V Z V ] 2 } 1 N N 2 s i The calculatio of var(r V ) is a little more cumbersome that just the arithmetic mea of estimatio variaces. It is clear that (A6) var(r V )= 1 N 2 var(r V* )= 1 N 2 var(z V Z V )= 1 N 2 E{[Z V Z V ] 2 } (A7) where var(x) is variace of the variable X. It the requires oly some algebra to show that: N var(r* V )=C VV + N k=1 l=1 N ki lj C kl 2 k=1 ki C kv where C VV is the mea covariace iside the support V; C kl is the covariace betwee Z k ad Z l ; C kv is the mea covariace betwee Z k ad the support V. If the covariace model is pure ugget effect, i.e. there is o correlatio betwee the experimetal samples, for the covariace fuctios: C VV =0 C kv =0 (A9) C kl =0ifkl C kl =var(z) ifkl ad for the weights: (A8) ki = 1 i, k (A10) which gives ad the var(r V * )=N 2 var(z) 2 (A11) var(r V )= var(z) (A12) which is the classical statistical result that gives the variace of the mea of a radom variable. The fact that the observatios are correlated icreases the stadard error (squared root of the estimatio variace) of the mea. I the example preseted i the text, the stadard error for the global mea, evaluated by ay of the krigig procedures preseted, is aroud 80 mm while usig stadard theory cosiderig that they are idepedet observatios we have, with =51, ad var(z) =0.0313, that the stadard error is / m=25 mm. REFERENCES Ahmed, S. ad De Marsily, G Compariso of geostatistical methods for estimatig trasmissivity usig data o trasmissivity ad specific capacity, Water Resourc. Res., 23(9), Akaike, H A ew look at the statistical model idetificatio, IEEE Tras. Auto. Cotrol, 79, Delhomme, J.P. ad Delfier, P Applicatio du krigeage a l optimisatio d ue campage pluviometrique e zoe aride, i Proceedigs, Symposium o Desig of Water Resources Projects with Iadequate Data. Vol. 2, UNESCO, Madrid, pp Hudso, G. ad Wackeragel, H Mappig temperature usig krigig with exteral drift: theory ad a example from Scotlad, It. J. Climatol., 14, Jourel, A.G. ad Huijbregts, Ch Miig Geostatistics. Academic Press, New York, 600 p. Jourel, A.G. ad Rossi, M.E Whe do we eed a tred model i krigig?, Math. Geol., 21(7), Lebel, T. ad Basti, G Variogram idetificatio by the mea-square iterpolatio error method, J. Hydrol., 77, Mathero, G Priciples of geostatistics, Eco. Geol., 58, Pardo-Igúzquiza, E., MLREML: a computer program for the iferece of spatial covariace parameters by maximum likelihood ad restricted maximum likelihood, Comput. Geosci., 23(2), Pardo-Igúzquiza, E Maximum likelihood estimatio of spatial covariace parameters, Math. Geol., 30(1), Thiesse, A.H Precipitatio for large areas, Mo. Wea. Re., 39,

Example 3.3: Rainfall reported at a group of five stations (see Fig. 3.7) is as follows. Kundla. Sabli

Example 3.3: Rainfall reported at a group of five stations (see Fig. 3.7) is as follows. Kundla. Sabli 3.4.4 Spatial Cosistecy Check Raifall data exhibit some spatial cosistecy ad this forms the basis of ivestigatig the observed raifall values. A estimate of the iterpolated raifall value at a statio is

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Patter Recogitio Classificatio: No-Parametric Modelig Hamid R. Rabiee Jafar Muhammadi Sprig 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Ageda Parametric Modelig No-Parametric Modelig

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

Chapter 11 Output Analysis for a Single Model. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

Chapter 11 Output Analysis for a Single Model. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation Chapter Output Aalysis for a Sigle Model Baks, Carso, Nelso & Nicol Discrete-Evet System Simulatio Error Estimatio If {,, } are ot statistically idepedet, the S / is a biased estimator of the true variace.

More information

1.010 Uncertainty in Engineering Fall 2008

1.010 Uncertainty in Engineering Fall 2008 MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval

More information

Lecture 7: Density Estimation: k-nearest Neighbor and Basis Approach

Lecture 7: Density Estimation: k-nearest Neighbor and Basis Approach STAT 425: Itroductio to Noparametric Statistics Witer 28 Lecture 7: Desity Estimatio: k-nearest Neighbor ad Basis Approach Istructor: Ye-Chi Che Referece: Sectio 8.4 of All of Noparametric Statistics.

More information

GUIDELINES ON REPRESENTATIVE SAMPLING

GUIDELINES ON REPRESENTATIVE SAMPLING DRUGS WORKING GROUP VALIDATION OF THE GUIDELINES ON REPRESENTATIVE SAMPLING DOCUMENT TYPE : REF. CODE: ISSUE NO: ISSUE DATE: VALIDATION REPORT DWG-SGL-001 002 08 DECEMBER 2012 Ref code: DWG-SGL-001 Issue

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

Activity 3: Length Measurements with the Four-Sided Meter Stick

Activity 3: Length Measurements with the Four-Sided Meter Stick Activity 3: Legth Measuremets with the Four-Sided Meter Stick OBJECTIVE: The purpose of this experimet is to study errors ad the propagatio of errors whe experimetal data derived usig a four-sided meter

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS

NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS STRUCTURE OF EXAMINATION PAPER. There will be oe 2-hour paper cosistig of 4 questios.

More information

(all terms are scalars).the minimization is clearer in sum notation:

(all terms are scalars).the minimization is clearer in sum notation: 7 Multiple liear regressio: with predictors) Depedet data set: y i i = 1, oe predictad, predictors x i,k i = 1,, k = 1, ' The forecast equatio is ŷ i = b + Use matrix otatio: k =1 b k x ik Y = y 1 y 1

More information

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers Chapter 4 4-1 orth Seattle Commuity College BUS10 Busiess Statistics Chapter 4 Descriptive Statistics Summary Defiitios Cetral tedecy: The extet to which the data values group aroud a cetral value. Variatio:

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Section A assesses the Units Numerical Analysis 1 and 2 Section B assesses the Unit Mathematics for Applied Mathematics

Section A assesses the Units Numerical Analysis 1 and 2 Section B assesses the Unit Mathematics for Applied Mathematics X0/70 NATIONAL QUALIFICATIONS 005 MONDAY, MAY.00 PM 4.00 PM APPLIED MATHEMATICS ADVANCED HIGHER Numerical Aalysis Read carefully. Calculators may be used i this paper.. Cadidates should aswer all questios.

More information

There is no straightforward approach for choosing the warmup period l.

There is no straightforward approach for choosing the warmup period l. B. Maddah INDE 504 Discrete-Evet Simulatio Output Aalysis () Statistical Aalysis for Steady-State Parameters I a otermiatig simulatio, the iterest is i estimatig the log ru steady state measures of performace.

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

The Method of Least Squares. To understand least squares fitting of data.

The Method of Least Squares. To understand least squares fitting of data. The Method of Least Squares KEY WORDS Curve fittig, least square GOAL To uderstad least squares fittig of data To uderstad the least squares solutio of icosistet systems of liear equatios 1 Motivatio Curve

More information

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day

Number of fatalities X Sunday 4 Monday 6 Tuesday 2 Wednesday 0 Thursday 3 Friday 5 Saturday 8 Total 28. Day LECTURE # 8 Mea Deviatio, Stadard Deviatio ad Variace & Coefficiet of variatio Mea Deviatio Stadard Deviatio ad Variace Coefficiet of variatio First, we will discuss it for the case of raw data, ad the

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

THE SYSTEMATIC AND THE RANDOM. ERRORS - DUE TO ELEMENT TOLERANCES OF ELECTRICAL NETWORKS

THE SYSTEMATIC AND THE RANDOM. ERRORS - DUE TO ELEMENT TOLERANCES OF ELECTRICAL NETWORKS R775 Philips Res. Repts 26,414-423, 1971' THE SYSTEMATIC AND THE RANDOM. ERRORS - DUE TO ELEMENT TOLERANCES OF ELECTRICAL NETWORKS by H. W. HANNEMAN Abstract Usig the law of propagatio of errors, approximated

More information

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Estimation of a population proportion March 23,

Estimation of a population proportion March 23, 1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

10-701/ Machine Learning Mid-term Exam Solution

10-701/ Machine Learning Mid-term Exam Solution 0-70/5-78 Machie Learig Mid-term Exam Solutio Your Name: Your Adrew ID: True or False (Give oe setece explaatio) (20%). (F) For a cotiuous radom variable x ad its probability distributio fuctio p(x), it

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

Circle the single best answer for each multiple choice question. Your choice should be made clearly. TEST #1 STA 4853 March 6, 2017 Name: Please read the followig directios. DO NOT TURN THE PAGE UNTIL INSTRUCTED TO DO SO Directios This exam is closed book ad closed otes. There are 32 multiple choice questios.

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5 CS434a/54a: Patter Recogitio Prof. Olga Veksler Lecture 5 Today Itroductio to parameter estimatio Two methods for parameter estimatio Maimum Likelihood Estimatio Bayesia Estimatio Itroducto Bayesia Decisio

More information

3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense,

3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense, 3. Z Trasform Referece: Etire Chapter 3 of text. Recall that the Fourier trasform (FT) of a DT sigal x [ ] is ω ( ) [ ] X e = j jω k = xe I order for the FT to exist i the fiite magitude sese, S = x [

More information

Basics of Probability Theory (for Theory of Computation courses)

Basics of Probability Theory (for Theory of Computation courses) Basics of Probability Theory (for Theory of Computatio courses) Oded Goldreich Departmet of Computer Sciece Weizma Istitute of Sciece Rehovot, Israel. oded.goldreich@weizma.ac.il November 24, 2008 Preface.

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

THE KALMAN FILTER RAUL ROJAS

THE KALMAN FILTER RAUL ROJAS THE KALMAN FILTER RAUL ROJAS Abstract. This paper provides a getle itroductio to the Kalma filter, a umerical method that ca be used for sesor fusio or for calculatio of trajectories. First, we cosider

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Confidence Intervals

Confidence Intervals Cofidece Itervals Berli Che Deartmet of Comuter Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Referece: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chater 5 & Teachig Material Itroductio

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Stochastic Simulation

Stochastic Simulation Stochastic Simulatio 1 Itroductio Readig Assigmet: Read Chapter 1 of text. We shall itroduce may of the key issues to be discussed i this course via a couple of model problems. Model Problem 1 (Jackso

More information

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + 62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

Chapter 6. Sampling and Estimation

Chapter 6. Sampling and Estimation Samplig ad Estimatio - 34 Chapter 6. Samplig ad Estimatio 6.. Itroductio Frequetly the egieer is uable to completely characterize the etire populatio. She/he must be satisfied with examiig some subset

More information

ANALYSIS OF EXPERIMENTAL ERRORS

ANALYSIS OF EXPERIMENTAL ERRORS ANALYSIS OF EXPERIMENTAL ERRORS All physical measuremets ecoutered i the verificatio of physics theories ad cocepts are subject to ucertaities that deped o the measurig istrumets used ad the coditios uder

More information

Lecture 27. Capacity of additive Gaussian noise channel and the sphere packing bound

Lecture 27. Capacity of additive Gaussian noise channel and the sphere packing bound Lecture 7 Ageda for the lecture Gaussia chael with average power costraits Capacity of additive Gaussia oise chael ad the sphere packig boud 7. Additive Gaussia oise chael Up to this poit, we have bee

More information

Maximum likelihood estimation from record-breaking data for the generalized Pareto distribution

Maximum likelihood estimation from record-breaking data for the generalized Pareto distribution METRON - Iteratioal Joural of Statistics 004, vol. LXII,. 3, pp. 377-389 NAGI S. ABD-EL-HAKIM KHALAF S. SULTAN Maximum likelihood estimatio from record-breakig data for the geeralized Pareto distributio

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

AP Statistics Review Ch. 8

AP Statistics Review Ch. 8 AP Statistics Review Ch. 8 Name 1. Each figure below displays the samplig distributio of a statistic used to estimate a parameter. The true value of the populatio parameter is marked o each samplig distributio.

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is: PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Robust Default Correlation for Cost Risk Analysis

Robust Default Correlation for Cost Risk Analysis Robust Default Correlatio for Cost Risk Aalysis Christia Smart, Ph.D., CCEA Director, Cost Estimatig ad Aalysis Missile Defese Agecy Preseted at the 03 ICEAA Professioal Developmet ad Traiig Workshop Jue,

More information

Riemann Sums y = f (x)

Riemann Sums y = f (x) Riema Sums Recall that we have previously discussed the area problem I its simplest form we ca state it this way: The Area Problem Let f be a cotiuous, o-egative fuctio o the closed iterval [a, b] Fid

More information

REGRESSION (Physics 1210 Notes, Partial Modified Appendix A)

REGRESSION (Physics 1210 Notes, Partial Modified Appendix A) REGRESSION (Physics 0 Notes, Partial Modified Appedix A) HOW TO PERFORM A LINEAR REGRESSION Cosider the followig data poits ad their graph (Table I ad Figure ): X Y 0 3 5 3 7 4 9 5 Table : Example Data

More information

WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? ABSTRACT

WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? ABSTRACT WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? Harold G. Loomis Hoolulu, HI ABSTRACT Most coastal locatios have few if ay records of tsuami wave heights obtaied over various time periods. Still

More information

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the

More information

A Risk Comparison of Ordinary Least Squares vs Ridge Regression

A Risk Comparison of Ordinary Least Squares vs Ridge Regression Joural of Machie Learig Research 14 (2013) 1505-1511 Submitted 5/12; Revised 3/13; Published 6/13 A Risk Compariso of Ordiary Least Squares vs Ridge Regressio Paramveer S. Dhillo Departmet of Computer

More information

EE / EEE SAMPLE STUDY MATERIAL. GATE, IES & PSUs Signal System. Electrical Engineering. Postal Correspondence Course

EE / EEE SAMPLE STUDY MATERIAL. GATE, IES & PSUs Signal System. Electrical Engineering. Postal Correspondence Course Sigal-EE Postal Correspodece Course 1 SAMPLE STUDY MATERIAL Electrical Egieerig EE / EEE Postal Correspodece Course GATE, IES & PSUs Sigal System Sigal-EE Postal Correspodece Course CONTENTS 1. SIGNAL

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Stat 139 Homework 7 Solutions, Fall 2015

Stat 139 Homework 7 Solutions, Fall 2015 Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,

More information

Soo King Lim Figure 1: Figure 2: Figure 3: Figure 4: Figure 5: Figure 6: Figure 7:

Soo King Lim Figure 1: Figure 2: Figure 3: Figure 4: Figure 5: Figure 6: Figure 7: 0 Multivariate Cotrol Chart 3 Multivariate Normal Distributio 5 Estimatio of the Mea ad Covariace Matrix 6 Hotellig s Cotrol Chart 6 Hotellig s Square 8 Average Value of k Subgroups 0 Example 3 3 Value

More information

Access to the published version may require journal subscription. Published with permission from: Elsevier.

Access to the published version may require journal subscription. Published with permission from: Elsevier. This is a author produced versio of a paper published i Statistics ad Probability Letters. This paper has bee peer-reviewed, it does ot iclude the joural pagiatio. Citatio for the published paper: Forkma,

More information

Linear Regression Models

Linear Regression Models Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect

More information

Binomial Distribution

Binomial Distribution 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

TR/46 OCTOBER THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION A. TALBOT

TR/46 OCTOBER THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION A. TALBOT TR/46 OCTOBER 974 THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION by A. TALBOT .. Itroductio. A problem i approximatio theory o which I have recetly worked [] required for its solutio a proof that the

More information

Basis for simulation techniques

Basis for simulation techniques Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios

More information

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion Poit Estimatio Poit estimatio is the rather simplistic (ad obvious) process of usig the kow value of a sample statistic as a approximatio to the ukow value of a populatio parameter. So we could for example

More information

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution Iteratioal Mathematical Forum, Vol., 3, o. 3, 3-53 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/.9/imf.3.335 Double Stage Shrikage Estimator of Two Parameters Geeralized Expoetial Distributio Alaa M.

More information

Final Review for MATH 3510

Final Review for MATH 3510 Fial Review for MATH 50 Calculatio 5 Give a fairly simple probability mass fuctio or probability desity fuctio of a radom variable, you should be able to compute the expected value ad variace of the variable

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ), Cofidece Iterval Estimatio Problems Suppose we have a populatio with some ukow parameter(s). Example: Normal(,) ad are parameters. We eed to draw coclusios (make ifereces) about the ukow parameters. We

More information