Confidence Regions For The Ratio Of Two Percentiles

Size: px
Start display at page:

Download "Confidence Regions For The Ratio Of Two Percentiles"

Transcription

1 Confidence Regions For The Ratio Of Two Percentiles Richard Johnson Joint work with Li-Fei Huang and Songyong Sim January 28, 2009

2 OUTLINE Introduction Exact sampling results normal linear model case Other general distributions are treated by large sample methods. Nonparametric Bayes expansion of posterior distribution of quantile.

3 Introduction Wood engineers use the ratio of the fifth or other lower percentiles when comparing lumber of two different grades, moisture content, or sorted under two different grading systems. Motivated by this application, we develop some exact, and some approximate, confidence regions for the ratio of percentiles from two different populations. This work generalizes earlier work on inferences concerning the ratio of two means.

4 4 Most of the existing literature on ratios deals with the ratio of means and, in particular, ratios of means of normal distributions. Fieller(1954) treats bivariate normal distributions. McDonald(1981) obtained confidence regions for the ratio of means in the normal theory setting of fitting a straight line.

5 Hwang(1995) used a resampling approach to construct confidence regions for this same ratio. Huang and Johnson(2004) study the two sample normal theory case. Evans, Johnson, Green and Verrill obtain large sample confidence intervals for the ratio of Weibull percentiles.

6 6 Fieller(1954), using a fudical approach, considered T (θ) = n(x θy ) S11 2θS 12 + θ 2 S 22 which has a central t distribution when θ = µ 1 /µ 2 prevails and (X i, Y i ) bivariate normal. A confidence region consists of all θ 0 such that H 0 : θ = θ 0 is not rejected by a two-sided test using the central t distribution. Unfortunately, T (θ) is not monotone in θ. This sometimes results in a region that is the complement of a finite interval or even the whole real line.

7 Some exact results Assumptions X 1,..., X n1 is a random sample from the cumulative distribution function(cdf) F 1 ( ) Y 1,..., Y n2 is a random sample from F 2 ( ), The two random samples be independent. Initially we allow for different percentiles, or p 1 and p 2, to be specified. ξ 1p1 = inf{x : F 1 (x) p 1 } ξ 2p2 = inf{y : F 2 (y) p 2 } The ratio of population percentiles is θ = ξ 1p1 /ξ 2p2

8 Normal distributions with equal variances Further, let F i be N(µ i, σi 2 ), for i = 1, 2. Then, ξ ipi = µ i + σ i Φ 1 (p i ) is the lower 100p i -th percentile i = 1, 2 where Φ( ) is the standard normal cdf. We also let z pi be the upper 100p i -th percentile of the standard normal distribution, so ξ ipi = µ i z pi σ i for i = 1, 2.

9 θ = ξ 1p1 /ξ 2p2 with ξ ipi = µ i z pi σ i for i = 1, 2. Theorem If the value of k = σ 1 is known, σ 2 T (θ, k) = 1 n 1 + θ2 n 2 k 2 1 X θy /k k 2 n 1 i=1 (X i X ) 2 + n 2 j=1 (Y i Y ) 2 n 1 + n 2 2 follows the non-central t distribution with n 1 + n 2 2 degrees of freedom and non-centrality parameter 1 δ(θ, k) = 1 + θ2 (z n 1 n 2 k 2 p1 θ k z p 2 ) 9

10 10 Remark If the two variances are the same, that is, k = 1, then T (θ) = 1 + θ2 n 1 n 2 1 X θy n1 i=1 (X i X ) 2 + n 2 j=1 (Y i Y ) 2 n 1 +n 2 2 follows the non-central t distribution with n 1 + n 2 2 degrees of freedom and non-centrality parameter 1 δ(θ) = 1 + θ2 (z p1 θz p2 ). n 1 n 2

11 Proof Here k = σ 1 σ 2 and θ = µ 1 z p1 σ 1 µ 2 z p2 σ 2. Under the assumptions, the random quantity X θy kσ 2 distributed as N(0, 1 n 1 + µ 1 µ 2θ σ 1 = z p1 θ σ 1 k z p 2 and n1 i=1 (X i X ) 2 σ 2 1 is θ2 n 2 k2) plus the constant + n2 j=1 (Y j Y ) 2 n1 i=1 = (X i X ) 2 n2 j=1 k 2 σ2 2 + (Y j Y ) 2 σ2 2 is independently distributed as χ 2 n 1 +n 2 2(0). σ 2 2

12 12 The result follows by the definition of the non-central t as a ratio of these random quantities. ( see Johnson and Kotz(1970), pp ) A 100(1 α)% confidence region for θ consists of all θ 0 such that the equal-tailed level α test of H 0 : θ = θ 0 versus H 1 : θ θ 0 does not reject H 0. The confidence region can be a finite interval, the complement of an interval, or the real line.

13

14 Table 1 illustrates possible confidence regions for the ratio of percentiles from two normal distributions having equal variances. Possible regions include a bounded interval, the complement of an interval, and the whole real line. The latter two cases occur when the estimated percentile in the denominator and both percentiles are too close to zero, respectively. Table 1 also shows that the regions for ξ 2,p2 /ξ 1,p1 could be obtained for the ratio ξ 1,p1 /ξ 2,p2. 14

15 95% confidence regions normal populations with equal variances. n 1 = n 2 = 10. s pooled = 2 x = 5, y = 10 x = 10, y = 5 means (p 1 = p 2 = 0.5) (0.359, 0.659) (1.518, 2.786) 20th percentiles (p 1 = p 2 = 0.2) (0.206, 0.574) (1.741, 4.870) 10th percentiles (p 1 = p 2 = 0.1) (0.075, 0.515) (1.939, ) 5th percentiles (p 1 = p 2 =.05) ( 0.087, 0.458) 15 (, ) (2.183, )

16 Large-Sample Confidence Regions When large samples are available, we can obtain confidence intervals for the ratio of percentiles even when the ratio of variances, k 2, is unknown. The first result still requires normal populations. The 100p i -th percentile of population i is µ i z pi σ i for i = 1, 2.

17 It always gives an interval. 17 Let X 1,..., X n1 be a random sample from a normal distribution with mean µ 1 and variance σ1 2; let Y 1,..., Y n2 be a random sample from a normal distribution with mean µ 2 and variance σ2 2 ; and let the samples be independent. n 1 If lim = λ, 0 < λ < 1, and ξ 2p2 0, n 1,n 2 n 1 + n 2 an application of the delta method gives an approximate 100(1 α)% confidence interval for θ. x z p1 s 1 ±z α/2 y z p2 s 2 ( 1 + z ) p n 1 s 2 1 (y z p2 s 2 ) 2 + ( 1 + z 2 p 2 2 n 2 ) s 2 2 (x z p 1 s 1 ) 2 (y z p2 s 2 ) 4

18 An Alternative Large-Sample Approach We begin by considering the random quantity X z p1 S 1 θ(y z p2 S 2 ) and derive large sample confidence regions that may be intervals, complements of intervals, or sometimes even the whole real line.

19 Theorem When µ 1 z p1 σ 1 0 and n 1 lim = λ, 0 < λ < 1, n 1,n 2 n 1 + n 2 n1 + n 2 ( X z p1 S 1 θ(y z p2 S 2 ) ( ξ 1,p1 θ ξ 2,p2 ) ) converges in distribution to the normal with mean 0 and variance 1 λ (1 + z 2 p 1 2 )σ2 1 + θ 2 1 (1 λ) (1 + z 2 p 2 2 )σ2 2

20 20 Under the null hypotheses H 0 : θ = ξ 1,p1 /ξ 2,p2, the test statistic Z n1,n 2 (θ) = X z p 1 S 1 θ(y z p2 S 2 ) (1 + z p )S 1 m + θ2 (1 + z p )S 2 n is, asymptotically, standard normal. (1)

21 A large sample 100 (1 α)% confidence region for θ is then given by the collection of θ where {θ : Z n1,n 2 (θ) z α/2 } As a function of θ, Z n1,n 2 (θ) has a single maximum which is, in absolute value, greater than the asymptotes. Let θ = argmax Z n1,n 2 (θ) lim Z n 1,n 2 (θ) = Y z p 2 S 2 θ (1 + z 2 p 2 2 )S 2 2 n 2 lim Z n 1,n 2 (θ) = Y z p 2 S 2 θ 21 (1 + z 2 p 2 2 )S 2 2 n 2

22 22 Further, the derivative converges to 0 as θ ± and has the same sign as X z p1 S 1 for sufficiently large negative θ. The confidence region is (1) an interval if z α/2 < (Y z p2 S 2 ) < Z (1 + z n1 p 2 2,n 2 (θ ) 2 2 )S 2 n 2

23 23 (2) the complement of an interval Y z p2 S 2 if z (1 + z p 2 2 α/2 < Z n1,n 2 (θ ) 2 2 )S 2 n 2 (3) the whole real line Y z p2 S 2 if < Z (1 + z n1 p 2 2,n 2 (θ ) z α/2 2 2 )S 2 n 2

24 24 We see that the region fails to be a bounded interval if the percentile in the denominator is not significantly different from zero, or, Y z p2 S 2 z α/2 (1 + z 2 p 2 2 )S 2 2 n 2

25 A Large Sample Nonparametric Approach When sample sizes are large, we can also take a nonparametric approach. Let [ w ] denote the smallest integer that is greater than or equal to w. We estimate the ξ 1p1 by order statistics. ˆξ 1p1 = X (r1 ) where r 1 = [ n 1 p 1 ] ξ 2,p2 ˆ = Y (r2 ) where r 2 = [ n 2 p 2 ]. Johnson and Huang (2004) consider a delta method approach and a modified approach based on the normal limit for n1 + n 2 ( ˆξ 1p1 θ ˆξ 2p2 ( ξ 1,p1 θ ξ 2,p2 ) )

26 Examples and Simulation We consider data on the modulus of rupture(mor) of Douglas Fir specimens, provided to us by J. Evans, whose percentile estimates were given in Aplin et. al. (1986). The normal plots suggest that data are normally distributed and it is reasonable to assume the variances are equal. The two data sets are

27 MOR(lb/in 2 ) of 2 4 inches, Grade 2, Green(30 % moisture content)

28 MOR(lb/in 2 ) of 2 6 inches, Select Structural, Green ( 30 % moisture content)

29 29 From the summary statistics n 1 = 107 x = s 2 1 = 2, 354, 470 n 2 = 103 y = s 2 2 = 2, 528, 864 s pooled = x (6) = y (6) = we determine 95% confidence intervals for ratio of 5th percentile of the MOR of 2 4 s to that of 2 6 s. Pametric Nonparametric (.395,.628) (.400,.635)

30 Regions other than Intervals Simulation with 5000 trials. n 1 = n 2 = 100 µ 1 = 1.93, σ 1 = 2, ξ 1,0.05 = 1.36 µ 2 = 1.93, σ 2 = 1, ξ 2,0.05 = where the denominator percentile is close to zero Estimated Coverage Probabilities ( estimated standard deviations in parentheses.) Large sample Modified Z n1,n 2 (θ) Z n1,n 2 (θ) 87.58% 94.86% (.4664) (.3123) Large sample Modified Nonparametric Nonparametric 88.82% 93.18% (.4456) (.3565)

31 Since the unmodified versions do not have good coverage, it is interesting to compare with the behavior of the two modified procedures. Modified outside Z n1,n 2 (θ) interval interval real line overall number N i =2329 N c =2669 N r =2 N t =5000 coverage number percent 92.19% 97.19% 100% 94.86% coverage (0.3795%) (0.2337%) (0) (0.3123%) Modified outside Nonparametric interval interval real line overall number N i =1582 N c =3376 N r =42 N t =5000 coverage number percent 85.97% 96.48% 100% 93.18% coverage (0.4912%) (0.2606%) (0) (0.3565%)

32 A BAYESIAN NONPARAMETRIC APPROACH In the nonparametric setting, we select a Dirichlet process prior for the single sample problem. We begin by showing how to obtain a sample of several different percentiles fromthe posterior distribution. Examples of approximation to the joint posterior distribution of one and two percentiles are given. We also investigate the large sample behavior of the posterior quantile process for a single population.

33 33 To compare percentiles from two different populations, we select separate Dirichlet process prior distributions for each of the populations. An observation is selected from each of the posterior distributions and the appropriate function of the percentiles formed. By repeated sampling, we obtain an approximation to the posterior distribution of, for instance, the ratio of percentiles. An example is given. For any cumulative distribution function(cdf) F, the 100p-th percentile is ξ p = F 1 (p) = inf{x F (x) p} 0 < p < 1

34 34 The posterior distribution will be based on a random sample X = (X 1, X 2,..., X n ) from a distribution G( ). We consider a Dirichlet process, with prior measure α 0, over the possible cdf s F. Let t 1 < t 2 < < t k. Given the observations X 1, X 2,..., X n, we set α(t i ) = α(, t i ] = #(X j t i ) + α 0 (t i ) for 1 i k. For simplicity, we set α(t i, t i+1 ) = α(, t i+1 ] α(, t i ] and denote α R = α 0 (R).

35 35 By Ferguson (1973), the posterior distribution of F is a Dirichlet process with the measure α. Then, for i < j, it is well known that F (t i ) Beta(α(t i ), n + α R α(t i )) F (t j ) F (t i ) Beta(α(t i, t j ), n + α R α(t i, t j )) F (t j ) F (t i ) 1 F (t i ) F (ti ) Beta(α(t i, t j ), n+α R α(t i ) α(t i, t j )).

36 Generation of Values of a Percentile from the Posterior Distribution By Ferguson(1973), the posterior distribution of F (t 1 ) is Beta(α(t 1 ), n + α R α(t 1 )) so Pr[ξ p1 t 1 ] = Pr[p 1 F (t 1 )] Hence we generate ξ p1 as follows; 1 Generate a value u 1 from U(0, 1). 2 Numerically, find t 1 such that Γ(n + α R ) u 1 = Γ(α(t 1 ))Γ(n + α R α(t 1 )) 1 p 1 Take t 1 as ξ p1. z α(t 1) 1 (1 z) n+α R α(t 1 ) 1 dz 36

37 Illustration Consider the ratio of 5th percentiles. As data, we generated a sample of size 100 from a N(4, 1) population representing the first population. Let x (1) x (2) x (100) be the order statistics. We also generated a second sample of size 100 from the same normal population to represent the sample from the second population.

38 Next, we select a value u from a uniform(0,1) distribution and then numerically solve u = P 1 [ξ 1p1 t] for t which is then an observed value from the posterior distribution of the 5th percentile from the first population. 38

39 39 Most of the time, the generated value is one of the order statistics near the sample 5th percentile. Table 1 shows the first 1000 values selected from the posterior distribution. Notice that 98.1 % are actual values in the data set. Table of first 1,000 generated values from the first posterior distribution when the data are 100 N(4, 1) observations. Out of 1,000 generations, 981 are the data values.

40 40 Value Value of ξ 0.05 Freq Property of ξ 0.05 Freq Property x (1) interpolated interpolated x (6) x (2) interpolated interpolated interpolated interpolated interpolated interpolated x (7) interpolated interpolated interpolated x (8) x (3) x (9) x (4) interpolated interpolated x (10) interpolated interpolated interpolated x (11) interpolated x (12) interpolated x (13) interpolated x (14) x (5)

41 Suppose we wish to explore the joint posterior distribution of two percentiles (ξ p1, ξ p2 ) with p 1 < p 2. To generate random values for (ξ p1, ξ p2 ) from the posterior distribution 1 Generate a value t 1 from Beta(α(t 1 ), n + α R α(t 1 )) and call it ξ p1. Implementation is as above. 2 Given the value ξ p1 generated in the previous step, we next generate a value from Beta(α(t 1, t 2 ), n + α R α(t 1 ) α(t 1, t 2 )) and call it η. Take ξ p1 + η as ξ p2. Here η = t 2 t 1.

42 To implement this procedure, we note that the conditional posterior probability Pr[ξ p2 ξ p1 η F (t 1 ) = p 1 ] = Pr[ξ p2 t 1 + η F (t 1 ) = p 1 ] = Pr[p 2 F (t 1 + η) F (t 1 ) = p 1 ] = Pr[ p2 p 1 1 p 1 F (t 1+η) p 1 1 p 1 F (t1 ) = p 1 ] = Pr[ p2 p 1 1 p 1 F (t 1+η) F (t 1 ) 1 F (t 1 ) ] F (t 1 ) = p 1 and a value of ξ p2 given ξ p1 = F 1 (p 1 ) = t 1 can be generated as follows; 42

43 1 Generate U 2 = u 2 from U(0, 1). 2 Find η such that 1 u 2 = (p 2 p 1 )/(1 p 1 ) Γ(n + α R α(t 1 )) Γ(α(t 1, t 1 + η))γ(n + α R α(t 1 ) α(t 1, t 1 + η)) = z α(t 1,t 1 +η) 1 (1 z) n+α R α(t 1 ) α(t 1,t 1 +η) 1 dz 1 (p 2 p 1 )/(1 p 1 ) Γ(n + α R α(t 1 )) Γ(α(t 1, t 1 + η))γ(n + α R α(t 1 + η)) z α(t 1,t 1 +η) 1 (1 z) n+α R α(t 1 +η) 1 dz (3) Take t 1 + η as a value t 2 for ξ p2. 43

44 We apply our algorithm to data on the modulus of rupture(mor) of Douglas Fir specimens observations(sorted) of MOR(lb/in 2 of 2 4 inches, Grade 2, Green(30% moisture content) 44

45 The prior measure α 0 used in our simulation study is the uniform on (0, 9500) and we consider p 1 = 0.05 and p 2 = 0.1, 0.2, 0.5 and Table Basic statistics from 5,000(each) vectors of percentiles generated from the joint posterior distribution. (ξ 0.05, ξ 0.10 ) (ξ 0.05, ξ 0.20 ) Mean vector (2327.9, ) (2322.1, ) sd (263.1, 225.3) (265.8, 268.0) Correlation (ξ 0.05, ξ 0.50 ) (ξ 0.05, ξ 0.95 ) Mean vector (2317.5, ) ( ) sd (267.7, 189.5) (268.1, 248.3) Correlation

46 cdf x Figure: Posterior distribution of 5-th percentile based on a sample of size 1000 standard normal variables.

47 cdf atxi Figure: Posterior distribution of 50-th percentile based on a sample of size 100 standard normal variables. Dots are approximation. 47

48 cdf atxi Figure: Posterior distribution of 50-th percentile based on a sample of size 1000 standard normal variables.

49 Figure: Estimated Joint posterior of 5-th and 10-th percentiles. Based on 10,000 selections using Lumber data density*1,000, % %

50 Figure: Estimated contours of joint posterior of 5-th and 10-th percentiles. Based on 10,000 selections using Lumber data.

51 Histogram of Q10 Q5 Frequency Q10 Q5 Figure: Estimated posterior distribution of Q10 - Q5. Based on 10,000 selections using Lumber data.

52 density*1,000, % % Figure: Estimated Joint posterior of 25-th and 75-th percentiles. Based on 10,000 selections using Lumber data. 52

53 Figure: Estimated contours of joint posterior of 25-th and 75-th percentiles. Based on 10,000 selections using Lumber data.

54 Histogram of IQR Frequency Figure: Estimated posterior distribution for interquartile range. Based on 10,000 selections using Lumber data. IQR

55 55 N(4,1)/N(4,1) Frequency ratio: 5 th percentile Figure: Estimated posterior distribution for ratio of 5-th percentiles. Samples of size 100 N(4, 1). 10,000 generated pairs of percentiles.

56 N(2,1)/N(2,1) Frequency ratio: 5 th percentile Figure: Estimated posterior distribution for ratio of 5-th percentiles. Samples of size 100 N(2, 1). 10,000 generated pairs of percentiles. 56

57 Posterior Distributions-Asymptotics We need the following property of a special density estimator. Lemma 1. Let X 1, X 2,..., X n,... be independently distributed as G and assume that G has a derivative g that is uniformly continuous on R. Let η p satisfy p = G(η p ) and assume that g(η p ) > 0.

58 58 Let G n (t) = #(X j t) + α 0 (t) n + α R and t n = inf{t p G n (t)}. Then for w 0, almost surely, as n ˆf w (t n ) = #(t n < X j t n + w/ n) n w/ n = g(η p ) + o(1) Bertrand-Retali(1978) Silverman(1986) p71 72

59 59 This holds for all (countable) rational w. Remark. By Lemma 1 and the fact that G n (t n ) = p + O(n 1 ) as n, G n (t n + w/ n) p w/ n g(η p ) almost surely, for all rational w 0. For several percentiles p 1 < p 2 <... < p k, we define t in = inf{t p i G n (t)} for i = 1, 2,..., k

60 Theorem Assume that the X 1, X 2,... are independently distributed as G which has a derivative g that is uniformly continuous on R. Further assume that p = (p 1, p 2,..., p k ) has distinct ordered elements and g(η i ) > 0, for i = 1, 2,..., k, where p i = G(η i ), for i = 1, 2,..., k, Then, almost surely on the set where

61 Theorem continued G n (t in + w i / n) p i w i / n as n,we have g(η i ) for i = 1, 2,..., k Pr[ n(ξ p1 t 1n ) w 1,..., n(ξ pk t kn ) w k ] where V g = N k (0, V g ) { pi (1 p j )/[g(η i )g(η j )] i j p i (1 p j )/[g(η i )g(η j )] i > j

62 Correction Terms to Normal Limit Theorem Let Φ and φ be the standard normal cdf and pdf, respectively, and write G n = G n (t n + w/ n). Then, as n, Γ(n + α R ) Γ((n + α R )G n )Γ((n + α R )(1 G n )) 1 p = 1 Φ(k n ) + z (n+α R)G n 1 (1 z) (n+α R)(1 G n ) 1 dz 1 n + αr (1 2G n )(k 2 n 1) 3 G n (1 G n ) φ(k n) + d n n + α R φ(k n ) + O(n 3/2 )

63 63 where n + αr k n = G n (1 G n ) (p G n), k n and d n each converge to constants.

64 cdf x Figure: Posterior distribution of 5-th percentile based on a sample of size 1000 standard normal variables. Correction shown

65 cdf atxi Figure: Posterior distribution of 50-th percentile based on a sample of size 100 standard normal variables. Correction shown

66 cdf atxi Figure: Posterior distribution of 50-th percentile based on a sample of size 1000 standard normal variables. Correction shown

67 67 We next consider the joint distribution of two quantiles. Theorem Let p 1 < p 2 and t i = t in be the solutions of p i = G n (t in ), for i = 1, 2, where G n (t) = #(X j t) + α 0 (t) n + α R Write G 1n = G n (t 1 + w 1 / n) and G 2n = G n (t 2 + w 2 / n).

68 +O(n 1 ) 68 Then Pr[ n(ξ p1 t 1 ) w 1, n(ξ p2 t 2 ) w 2 ] = 1 2π V n 1/2 n+αr (p 1 G 1n ) n+αr (p 2 G 2n ) e 1 2 y V 1 n y dy2 dy 1 1 [ 2z10 z + 20 n + αr z 10 (z 20 z 10 ) I z 20 z 10 (z 20 z 10 )(1 z 20 ) I 01 + z 20(z 20 2z 10 ) 1 3z10 2 (z 20 z 10 ) 2I 30 + (z 20 z 10 ) 2I 21 1 (z 20 z 10 ) 2I 12 + (1 z 10)(1 2z 20 + z 10 ) ] 3(z 20 z 10 ) 2 (1 z 20 ) I 2 03

Chapter 9: Hypothesis Testing Sections

Chapter 9: Hypothesis Testing Sections Chapter 9: Hypothesis Testing Sections 9.1 Problems of Testing Hypotheses 9.2 Testing Simple Hypotheses 9.3 Uniformly Most Powerful Tests Skip: 9.4 Two-Sided Alternatives 9.6 Comparing the Means of Two

More information

Lecture 15: Inference Based on Two Samples

Lecture 15: Inference Based on Two Samples Lecture 15: Inference Based on Two Samples MSU-STT 351-Sum17B (P. Vellaisamy: STT 351-Sum17B) Probability & Statistics for Engineers 1 / 26 9.1 Z-tests and CI s for (µ 1 µ 2 ) The assumptions: (i) X =

More information

STAT 512 sp 2018 Summary Sheet

STAT 512 sp 2018 Summary Sheet STAT 5 sp 08 Summary Sheet Karl B. Gregory Spring 08. Transformations of a random variable Let X be a rv with support X and let g be a function mapping X to Y with inverse mapping g (A = {x X : g(x A}

More information

A first look at the performances of a Bayesian chart to monitor. the ratio of two Weibull percentiles

A first look at the performances of a Bayesian chart to monitor. the ratio of two Weibull percentiles A first loo at the performances of a Bayesian chart to monitor the ratio of two Weibull percentiles Pasquale Erto University of Naples Federico II, Naples, Italy e-mail: ertopa@unina.it Abstract. The aim

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

MATH4427 Notebook 4 Fall Semester 2017/2018

MATH4427 Notebook 4 Fall Semester 2017/2018 MATH4427 Notebook 4 Fall Semester 2017/2018 prepared by Professor Jenny Baglivo c Copyright 2009-2018 by Jenny A. Baglivo. All Rights Reserved. 4 MATH4427 Notebook 4 3 4.1 K th Order Statistics and Their

More information

Summary of Chapters 7-9

Summary of Chapters 7-9 Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two

More information

Continuous Distributions

Continuous Distributions Chapter 3 Continuous Distributions 3.1 Continuous-Type Data In Chapter 2, we discuss random variables whose space S contains a countable number of outcomes (i.e. of discrete type). In Chapter 3, we study

More information

Inference on distributions and quantiles using a finite-sample Dirichlet process

Inference on distributions and quantiles using a finite-sample Dirichlet process Dirichlet IDEAL Theory/methods Simulations Inference on distributions and quantiles using a finite-sample Dirichlet process David M. Kaplan University of Missouri Matt Goldman UC San Diego Midwest Econometrics

More information

ORDER STATISTICS, QUANTILES, AND SAMPLE QUANTILES

ORDER STATISTICS, QUANTILES, AND SAMPLE QUANTILES ORDER STATISTICS, QUANTILES, AND SAMPLE QUANTILES 1. Order statistics Let X 1,...,X n be n real-valued observations. One can always arrangetheminordertogettheorder statisticsx (1) X (2) X (n). SinceX (k)

More information

UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables

UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables To be provided to students with STAT2201 or CIVIL-2530 (Probability and Statistics) Exam Main exam date: Tuesday, 20 June 1

More information

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

SPRING 2007 EXAM C SOLUTIONS

SPRING 2007 EXAM C SOLUTIONS SPRING 007 EXAM C SOLUTIONS Question #1 The data are already shifted (have had the policy limit and the deductible of 50 applied). The two 350 payments are censored. Thus the likelihood function is L =

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

One-Sample Numerical Data

One-Sample Numerical Data One-Sample Numerical Data quantiles, boxplot, histogram, bootstrap confidence intervals, goodness-of-fit tests University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 4.0 Introduction to Statistical Methods in Economics Spring 009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Robust Parameter Estimation in the Weibull and the Birnbaum-Saunders Distribution

Robust Parameter Estimation in the Weibull and the Birnbaum-Saunders Distribution Clemson University TigerPrints All Theses Theses 8-2012 Robust Parameter Estimation in the Weibull and the Birnbaum-Saunders Distribution Jing Zhao Clemson University, jzhao2@clemson.edu Follow this and

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

18.05 Practice Final Exam

18.05 Practice Final Exam No calculators. 18.05 Practice Final Exam Number of problems 16 concept questions, 16 problems. Simplifying expressions Unless asked to explicitly, you don t need to simplify complicated expressions. For

More information

Visual interpretation with normal approximation

Visual interpretation with normal approximation Visual interpretation with normal approximation H 0 is true: H 1 is true: p =0.06 25 33 Reject H 0 α =0.05 (Type I error rate) Fail to reject H 0 β =0.6468 (Type II error rate) 30 Accept H 1 Visual interpretation

More information

Chapter 7. Confidence Sets Lecture 30: Pivotal quantities and confidence sets

Chapter 7. Confidence Sets Lecture 30: Pivotal quantities and confidence sets Chapter 7. Confidence Sets Lecture 30: Pivotal quantities and confidence sets Confidence sets X: a sample from a population P P. θ = θ(p): a functional from P to Θ R k for a fixed integer k. C(X): a confidence

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Class 5 AMS-UCSC Tue 24, 2012 Winter 2012. Session 1 (Class 5) AMS-132/206 Tue 24, 2012 1 / 11 Topics Topics We will talk about... 1 Confidence Intervals

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

Uniformly Most Powerful Bayesian Tests and Standards for Statistical Evidence

Uniformly Most Powerful Bayesian Tests and Standards for Statistical Evidence Uniformly Most Powerful Bayesian Tests and Standards for Statistical Evidence Valen E. Johnson Texas A&M University February 27, 2014 Valen E. Johnson Texas A&M University Uniformly most powerful Bayes

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

Estimation of Quantiles

Estimation of Quantiles 9 Estimation of Quantiles The notion of quantiles was introduced in Section 3.2: recall that a quantile x α for an r.v. X is a constant such that P(X x α )=1 α. (9.1) In this chapter we examine quantiles

More information

STAT 6385 Survey of Nonparametric Statistics. Order Statistics, EDF and Censoring

STAT 6385 Survey of Nonparametric Statistics. Order Statistics, EDF and Censoring STAT 6385 Survey of Nonparametric Statistics Order Statistics, EDF and Censoring Quantile Function A quantile (or a percentile) of a distribution is that value of X such that a specific percentage of the

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida First Year Examination Department of Statistics, University of Florida August 19, 010, 8:00 am - 1:00 noon Instructions: 1. You have four hours to answer questions in this examination.. You must show your

More information

1 Statistical inference for a population mean

1 Statistical inference for a population mean 1 Statistical inference for a population mean 1. Inference for a large sample, known variance Suppose X 1,..., X n represents a large random sample of data from a population with unknown mean µ and known

More information

Tables Table A Table B Table C Table D Table E 675

Tables Table A Table B Table C Table D Table E 675 BMTables.indd Page 675 11/15/11 4:25:16 PM user-s163 Tables Table A Standard Normal Probabilities Table B Random Digits Table C t Distribution Critical Values Table D Chi-square Distribution Critical Values

More information

Probability Theory and Statistics. Peter Jochumzen

Probability Theory and Statistics. Peter Jochumzen Probability Theory and Statistics Peter Jochumzen April 18, 2016 Contents 1 Probability Theory And Statistics 3 1.1 Experiment, Outcome and Event................................ 3 1.2 Probability............................................

More information

Lecture 12: Small Sample Intervals Based on a Normal Population Distribution

Lecture 12: Small Sample Intervals Based on a Normal Population Distribution Lecture 12: Small Sample Intervals Based on a Normal Population MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 24 In this lecture, we will discuss (i)

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 00 MODULE : Statistical Inference Time Allowed: Three Hours Candidates should answer FIVE questions. All questions carry equal marks. The

More information

Was there under-representation of African-Americans in the jury pool in the case of Berghuis v. Smith?

Was there under-representation of African-Americans in the jury pool in the case of Berghuis v. Smith? 1. The United States Supreme Court recently considered Berghuis v. Smith, the case of Diapolis Smith, an African-American man convicted in 1993 of second-degree murder by an all-white jury in Kent County,

More information

inferences on stress-strength reliability from lindley distributions

inferences on stress-strength reliability from lindley distributions inferences on stress-strength reliability from lindley distributions D.K. Al-Mutairi, M.E. Ghitany & Debasis Kundu Abstract This paper deals with the estimation of the stress-strength parameter R = P (Y

More information

Hypothesis Testing - Frequentist

Hypothesis Testing - Frequentist Frequentist Hypothesis Testing - Frequentist Compare two hypotheses to see which one better explains the data. Or, alternatively, what is the best way to separate events into two classes, those originating

More information

The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations

The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations John R. Michael, Significance, Inc. and William R. Schucany, Southern Methodist University The mixture

More information

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I 1 / 16 Nonparametric tests Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I Nonparametric one and two-sample tests 2 / 16 If data do not come from a normal

More information

Bootstrap inference for the finite population total under complex sampling designs

Bootstrap inference for the finite population total under complex sampling designs Bootstrap inference for the finite population total under complex sampling designs Zhonglei Wang (Joint work with Dr. Jae Kwang Kim) Center for Survey Statistics and Methodology Iowa State University Jan.

More information

TUTORIAL 8 SOLUTIONS #

TUTORIAL 8 SOLUTIONS # TUTORIAL 8 SOLUTIONS #9.11.21 Suppose that a single observation X is taken from a uniform density on [0,θ], and consider testing H 0 : θ = 1 versus H 1 : θ =2. (a) Find a test that has significance level

More information

5 Introduction to the Theory of Order Statistics and Rank Statistics

5 Introduction to the Theory of Order Statistics and Rank Statistics 5 Introduction to the Theory of Order Statistics and Rank Statistics This section will contain a summary of important definitions and theorems that will be useful for understanding the theory of order

More information

BTRY 4090: Spring 2009 Theory of Statistics

BTRY 4090: Spring 2009 Theory of Statistics BTRY 4090: Spring 2009 Theory of Statistics Guozhang Wang September 25, 2010 1 Review of Probability We begin with a real example of using probability to solve computationally intensive (or infeasible)

More information

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages Name No calculators. 18.05 Final Exam Number of problems 16 concept questions, 16 problems, 21 pages Extra paper If you need more space we will provide some blank paper. Indicate clearly that your solution

More information

Non-parametric Inference and Resampling

Non-parametric Inference and Resampling Non-parametric Inference and Resampling Exercises by David Wozabal (Last update. Juni 010) 1 Basic Facts about Rank and Order Statistics 1.1 10 students were asked about the amount of time they spend surfing

More information

Chapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations

Chapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations Chapter 5 Statistical Models in Simulations 5.1 Contents Basic Probability Theory Concepts Discrete Distributions Continuous Distributions Poisson Process Empirical Distributions Useful Statistical Models

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Probability Density Functions

Probability Density Functions Probability Density Functions Probability Density Functions Definition Let X be a continuous rv. Then a probability distribution or probability density function (pdf) of X is a function f (x) such that

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part : Sample Problems for the Elementary Section of Qualifying Exam in Probability and Statistics https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part 2: Sample Problems for the Advanced Section

More information

Bahadur representations for bootstrap quantiles 1

Bahadur representations for bootstrap quantiles 1 Bahadur representations for bootstrap quantiles 1 Yijun Zuo Department of Statistics and Probability, Michigan State University East Lansing, MI 48824, USA zuo@msu.edu 1 Research partially supported by

More information

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part 1: Sample Problems for the Elementary Section of Qualifying Exam in Probability and Statistics https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part 2: Sample Problems for the Advanced Section

More information

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data

Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data International Mathematical Forum, 3, 2008, no. 33, 1643-1654 Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data A. Al-khedhairi Department of Statistics and O.R. Faculty

More information

A Note on Hypothesis Testing with Random Sample Sizes and its Relationship to Bayes Factors

A Note on Hypothesis Testing with Random Sample Sizes and its Relationship to Bayes Factors Journal of Data Science 6(008), 75-87 A Note on Hypothesis Testing with Random Sample Sizes and its Relationship to Bayes Factors Scott Berry 1 and Kert Viele 1 Berry Consultants and University of Kentucky

More information

Solutions to Practice Test 2 Math 4753 Summer 2005

Solutions to Practice Test 2 Math 4753 Summer 2005 Solutions to Practice Test Math 4753 Summer 005 This test is worth 00 points. Questions 5 are worth 4 points each. Circle the letter of the correct answer. Each question in Question 6 9 is worth the same

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random

More information

Confidence Intervals in Ridge Regression using Jackknife and Bootstrap Methods

Confidence Intervals in Ridge Regression using Jackknife and Bootstrap Methods Chapter 4 Confidence Intervals in Ridge Regression using Jackknife and Bootstrap Methods 4.1 Introduction It is now explicable that ridge regression estimator (here we take ordinary ridge estimator (ORE)

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness

More information

Asymptotic Statistics-VI. Changliang Zou

Asymptotic Statistics-VI. Changliang Zou Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous

More information

1 Probability theory. 2 Random variables and probability theory.

1 Probability theory. 2 Random variables and probability theory. Probability theory Here we summarize some of the probability theory we need. If this is totally unfamiliar to you, you should look at one of the sources given in the readings. In essence, for the major

More information

P Values and Nuisance Parameters

P Values and Nuisance Parameters P Values and Nuisance Parameters Luc Demortier The Rockefeller University PHYSTAT-LHC Workshop on Statistical Issues for LHC Physics CERN, Geneva, June 27 29, 2007 Definition and interpretation of p values;

More information

CH.9 Tests of Hypotheses for a Single Sample

CH.9 Tests of Hypotheses for a Single Sample CH.9 Tests of Hypotheses for a Single Sample Hypotheses testing Tests on the mean of a normal distributionvariance known Tests on the mean of a normal distributionvariance unknown Tests on the variance

More information

Primer on statistics:

Primer on statistics: Primer on statistics: MLE, Confidence Intervals, and Hypothesis Testing ryan.reece@gmail.com http://rreece.github.io/ Insight Data Science - AI Fellows Workshop Feb 16, 018 Outline 1. Maximum likelihood

More information

Efficient and Robust Scale Estimation

Efficient and Robust Scale Estimation Efficient and Robust Scale Estimation Garth Tarr, Samuel Müller and Neville Weber School of Mathematics and Statistics THE UNIVERSITY OF SYDNEY Outline Introduction and motivation The robust scale estimator

More information

Propagation of Uncertainties in Measurements: Generalized/ Fiducial Inference

Propagation of Uncertainties in Measurements: Generalized/ Fiducial Inference Propagation of Uncertainties in Measurements: Generalized/ Fiducial Inference Jack Wang & Hari Iyer NIST, USA NMIJ-BIPM Workshop, AIST-Tsukuba, Japan, May 18-20, 2005 p. 1/31 Frameworks for Quantifying

More information

and Comparison with NPMLE

and Comparison with NPMLE NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA and Comparison with NPMLE Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506 USA http://ms.uky.edu/

More information

Unit 14: Nonparametric Statistical Methods

Unit 14: Nonparametric Statistical Methods Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based

More information

Distribution-Free Procedures (Devore Chapter Fifteen)

Distribution-Free Procedures (Devore Chapter Fifteen) Distribution-Free Procedures (Devore Chapter Fifteen) MATH-5-01: Probability and Statistics II Spring 018 Contents 1 Nonparametric Hypothesis Tests 1 1.1 The Wilcoxon Rank Sum Test........... 1 1. Normal

More information

Empirical Likelihood

Empirical Likelihood Empirical Likelihood Patrick Breheny September 20 Patrick Breheny STA 621: Nonparametric Statistics 1/15 Introduction Empirical likelihood We will discuss one final approach to constructing confidence

More information

Math489/889 Stochastic Processes and Advanced Mathematical Finance Solutions for Homework 7

Math489/889 Stochastic Processes and Advanced Mathematical Finance Solutions for Homework 7 Math489/889 Stochastic Processes and Advanced Mathematical Finance Solutions for Homework 7 Steve Dunbar Due Mon, November 2, 2009. Time to review all of the information we have about coin-tossing fortunes

More information

Chapter 9: Interval Estimation and Confidence Sets Lecture 16: Confidence sets and credible sets

Chapter 9: Interval Estimation and Confidence Sets Lecture 16: Confidence sets and credible sets Chapter 9: Interval Estimation and Confidence Sets Lecture 16: Confidence sets and credible sets Confidence sets We consider a sample X from a population indexed by θ Θ R k. We are interested in ϑ, a vector-valued

More information

Math 494: Mathematical Statistics

Math 494: Mathematical Statistics Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Statistics - Lecture One. Outline. Charlotte Wickham  1. Basic ideas about estimation Statistics - Lecture One Charlotte Wickham wickham@stat.berkeley.edu http://www.stat.berkeley.edu/~wickham/ Outline 1. Basic ideas about estimation 2. Method of Moments 3. Maximum Likelihood 4. Confidence

More information

parameter space Θ, depending only on X, such that Note: it is not θ that is random, but the set C(X).

parameter space Θ, depending only on X, such that Note: it is not θ that is random, but the set C(X). 4. Interval estimation The goal for interval estimation is to specify the accurary of an estimate. A 1 α confidence set for a parameter θ is a set C(X) in the parameter space Θ, depending only on X, such

More information

LECTURE NOTES FYS 4550/FYS EXPERIMENTAL HIGH ENERGY PHYSICS AUTUMN 2013 PART I A. STRANDLIE GJØVIK UNIVERSITY COLLEGE AND UNIVERSITY OF OSLO

LECTURE NOTES FYS 4550/FYS EXPERIMENTAL HIGH ENERGY PHYSICS AUTUMN 2013 PART I A. STRANDLIE GJØVIK UNIVERSITY COLLEGE AND UNIVERSITY OF OSLO LECTURE NOTES FYS 4550/FYS9550 - EXPERIMENTAL HIGH ENERGY PHYSICS AUTUMN 2013 PART I PROBABILITY AND STATISTICS A. STRANDLIE GJØVIK UNIVERSITY COLLEGE AND UNIVERSITY OF OSLO Before embarking on the concept

More information

Estimating and Testing Quantile-based Process Capability Indices for Processes with Skewed Distributions

Estimating and Testing Quantile-based Process Capability Indices for Processes with Skewed Distributions Journal of Data Science 8(2010), 253-268 Estimating and Testing Quantile-based Process Capability Indices for Processes with Skewed Distributions Cheng Peng University of Southern Maine Abstract: This

More information

Inference for Binomial Parameters

Inference for Binomial Parameters Inference for Binomial Parameters Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 58 Inference for

More information

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu

More information

STT 843 Key to Homework 1 Spring 2018

STT 843 Key to Homework 1 Spring 2018 STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ

More information

Review of basic probability and statistics

Review of basic probability and statistics Probability: basic definitions Review of basic probability and statistics A random variable is the outcome of a natural process that can not be predicted with certainty. Examples: the maximum temperature

More information

Additional Problems Additional Problem 1 Like the http://www.stat.umn.edu/geyer/5102/examp/rlike.html#lmax example of maximum likelihood done by computer except instead of the gamma shape model, we will

More information

Statistical Inference: Uses, Abuses, and Misconceptions

Statistical Inference: Uses, Abuses, and Misconceptions Statistical Inference: Uses, Abuses, and Misconceptions Michael W. Trosset Indiana Statistical Consulting Center Department of Statistics ISCC is part of IU s Department of Statistics, chaired by Stanley

More information

Some Curiosities Arising in Objective Bayesian Analysis

Some Curiosities Arising in Objective Bayesian Analysis . Some Curiosities Arising in Objective Bayesian Analysis Jim Berger Duke University Statistical and Applied Mathematical Institute Yale University May 15, 2009 1 Three vignettes related to John s work

More information

Unobservable Parameter. Observed Random Sample. Calculate Posterior. Choosing Prior. Conjugate prior. population proportion, p prior:

Unobservable Parameter. Observed Random Sample. Calculate Posterior. Choosing Prior. Conjugate prior. population proportion, p prior: Pi Priors Unobservable Parameter population proportion, p prior: π ( p) Conjugate prior π ( p) ~ Beta( a, b) same PDF family exponential family only Posterior π ( p y) ~ Beta( a + y, b + n y) Observed

More information

A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 2011

A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 2011 A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 2011 Reading Chapter 5 (continued) Lecture 8 Key points in probability CLT CLT examples Prior vs Likelihood Box & Tiao

More information

Math Review Sheet, Fall 2008

Math Review Sheet, Fall 2008 1 Descriptive Statistics Math 3070-5 Review Sheet, Fall 2008 First we need to know about the relationship among Population Samples Objects The distribution of the population can be given in one of the

More information

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and

More information

557: MATHEMATICAL STATISTICS II HYPOTHESIS TESTING: EXAMPLES

557: MATHEMATICAL STATISTICS II HYPOTHESIS TESTING: EXAMPLES 557: MATHEMATICAL STATISTICS II HYPOTHESIS TESTING: EXAMPLES Example Suppose that X,..., X n N, ). To test H 0 : 0 H : the most powerful test at level α is based on the statistic λx) f π) X x ) n/ exp

More information

NEW APPROXIMATE INFERENTIAL METHODS FOR THE RELIABILITY PARAMETER IN A STRESS-STRENGTH MODEL: THE NORMAL CASE

NEW APPROXIMATE INFERENTIAL METHODS FOR THE RELIABILITY PARAMETER IN A STRESS-STRENGTH MODEL: THE NORMAL CASE Communications in Statistics-Theory and Methods 33 (4) 1715-1731 NEW APPROXIMATE INFERENTIAL METODS FOR TE RELIABILITY PARAMETER IN A STRESS-STRENGT MODEL: TE NORMAL CASE uizhen Guo and K. Krishnamoorthy

More information

Summarizing Measured Data

Summarizing Measured Data Summarizing Measured Data 12-1 Overview Basic Probability and Statistics Concepts: CDF, PDF, PMF, Mean, Variance, CoV, Normal Distribution Summarizing Data by a Single Number: Mean, Median, and Mode, Arithmetic,

More information

NONINFORMATIVE NONPARAMETRIC BAYESIAN ESTIMATION OF QUANTILES

NONINFORMATIVE NONPARAMETRIC BAYESIAN ESTIMATION OF QUANTILES NONINFORMATIVE NONPARAMETRIC BAYESIAN ESTIMATION OF QUANTILES Glen Meeden School of Statistics University of Minnesota Minneapolis, MN 55455 Appeared in Statistics & Probability Letters Volume 16 (1993)

More information

Financial Econometrics and Volatility Models Copulas

Financial Econometrics and Volatility Models Copulas Financial Econometrics and Volatility Models Copulas Eric Zivot Updated: May 10, 2010 Reading MFTS, chapter 19 FMUND, chapters 6 and 7 Introduction Capturing co-movement between financial asset returns

More information

1 General problem. 2 Terminalogy. Estimation. Estimate θ. (Pick a plausible distribution from family. ) Or estimate τ = τ(θ).

1 General problem. 2 Terminalogy. Estimation. Estimate θ. (Pick a plausible distribution from family. ) Or estimate τ = τ(θ). Estimation February 3, 206 Debdeep Pati General problem Model: {P θ : θ Θ}. Observe X P θ, θ Θ unknown. Estimate θ. (Pick a plausible distribution from family. ) Or estimate τ = τ(θ). Examples: θ = (µ,

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

Post-exam 2 practice questions 18.05, Spring 2014

Post-exam 2 practice questions 18.05, Spring 2014 Post-exam 2 practice questions 18.05, Spring 2014 Note: This is a set of practice problems for the material that came after exam 2. In preparing for the final you should use the previous review materials,

More information