1 Hypothesis test of a mean vector

Size: px
Start display at page:

Download "1 Hypothesis test of a mean vector"

Transcription

1 THE UNIVERSITY OF CHICAGO Booth School of Busiess Busiess 41912, Sprig Quarter 2010, Mr Ruey S Tsay Lecture: Iferece about sample mea Key cocepts: 1 Hotellig s T 2 test 2 Likelihood ratio test 3 Various cofidece regios 4 Applicatios 5 Missig values 6 Impact of serial correlatios 1 Hypothesis test of a mea vector Let x 1,, x be a radom sample from a p-dimesioal ormal populatio with mea µ ad positive-defiite variace-covariace matrix Σ Cosider the testig problem: H o : µ = µ o versus H a : µ µ o, where µ o is a kow vector This is a geeralizatio of the oe-sample t test of the uivariate case Whe p = 1, the test statistic is t = x µ o s/, with s2 = 1 (x i x) 2 1 The statistic follows a Studet-t distributio with 1 degrees of freedom Oe rejects H o if the p-value of t is less tha the type-i error deoted by α For geeralizatio, we rewrite the test as ( ) s t = ( x µ o ) ( x µ o ) Oe rejects H o if ad oly if t 2 t 2 1(α/2), the upper 100(α/2) percetile of t-distributio with 1 degrees of freedom A atural geeralizatio of this test statistic is T 2 = ( x µ o ) ( S ) 1 ( x µ o ), (1) where S = 1 (x 1 i x)(x i µ o ) is the sample covariace matrix

2 This is the Hotellig s T 2 statistic It is distributed as ( 1)p F p p, p, where F u,v deotes the F -distributio with degrees of freedom u ad v Recall that [t 1 (α/2)] 2 = F 1, 1 (α), where F u,v (α) is the upper 100α percetile of the F -distributio with u ad v degrees of freedom Thus, whe p = 1, T 2 reduces to the usual oe-sample t statistic Example 1 Cosider the mothly log returs of Boeig (BA), Abbott Labs (ABT), Motorola (MOT) ad Geeral Motors (GM) from Jauary 1998 to December 2007 The log returs are i percetages Let µ o = 0 Test the ull hypothesis that the mea vector of the log returs is zero Aswer: Except for three possible outlyig observatios, the chi-square QQ-plot idicates that the ormal assumptio is reasoable See the QQ-plot i Figure 1 > setwd("c:/users/rst/teachig/ama") > x=readtable("m-ba4c9807txt") > head(x) V1 V2 V3 V > source("qqchi2r") <== Check ormality > qqchi2(x) [1] "correlatio coefficiet:" [1] ## Summary statistics > xave=mea(x) > xave V1 V2 V3 V > sqrt(apply(x,2,var)) V1 V2 V3 V > S=var(x) > S V1 V2 V3 V4 V V V

3 V > Si=solve(S) <== Fid the iverse of S > T2=row(x)*t(xave)%*%Si%*%xave > T2 [,1] [1,] ### Results of margial tests > ttest(x[,1]) Oe Sample t-test data: x[, 1] t = , df = 119, p-value = alterative hypothesis: true mea is ot equal to 0 95 percet cofidece iterval: sample estimates: mea of x > ttest(x[,2]) Oe Sample t-test data: x[, 2] t = 07465, df = 119, p-value = alterative hypothesis: true mea is ot equal to 0 95 percet cofidece iterval: sample estimates: mea of x > ttest(x[,3]) Oe Sample t-test data: x[, 3] t = 11524, df = 119, p-value =

4 d Quatile of chi square Figure 1: Chi-square QQ plot for mothly log returs of 4 stocks alterative hypothesis: true mea is ot equal to 0 95 percet cofidece iterval: sample estimates: mea of x Example 2 Cosider the Perspiratio data of 20 observatios i Table 51 of the text There are three measuremets, amely sweat rate(x 1 ), Sodium(X 2 ), ad Potassium(X 3 ) Test H o : µ = (4, 50, 10) versus H a : µ (4, 50, 10) > z=readtable("t5-1dat") > dim(z) [1] 20 3 > colames(z) <- c("rate","sodium","potassium") > qqchi2(z) [1] "correlatio coefficiet:" <== Normality seems reasoable 4

5 [1] > source("hotelligr") > Hotellig(z,c(4,50,10)) [,1] Hoteliig-T pvalue <== Caot reject the ull hypo at the 5% level Remark: Let y = Cx + d, where C is a p p o-sigular matrix ad d is a p-dimesioal vector The, testig H o : µ = µ o of x is equivalet to testig H o : µ y = Cµ o + d of y It turs out that, as expected, the Hotellig T 2 statistic is idetical Proof: ȳ = C x + d ad S y = CSC T 2 = (ȳ µ yo ) S 1 y (ȳ µ yo ) = (C x + d Cµ o d) (CSC ) 1 (C x + d Cµ o d) = [C( x µ o )] C 1 S 1 C 1 [C( x µ o )] = ( x µ o ) S 1 ( x µ o ) 2 Likelihood ratio test Key result: Hotellig T 2 test statistic is equivalet to likelihood ratio test statistic Geeral theory: Let θ be the parameter vector of a likelihood fuctio L(θ) with observatios x 1,, x Suppose that the parameter space is Θ Cosider the ull hypothesis θ Θ o versus H a : θ Θ o, where Θ o is a subspace of Θ The likelihood ratio statistic is Λ = max θ Θ o L(θ) (2) max θ Θ L(θ) Oe rejects H o if Λ < c, where c is some critical value depedig o the type-i error Ituitively, oe rejects H o if the maximized likelihood fuctio over the subspace Θ o is much smaller tha that over the parameter space Θ, idicatig that it is ulikely for θ to be i Θ o Specifically, uder the hull hypothesis H o, the likelihood ratio statistic where v = dim(θ) ad v o = dim(θ o ) 2 l(λ) χ 2 v v o, as, Remark: Θ o Θ idicates that the ull model is ested i the alterative model For o-ested models, likelihood ratio test does ot apply [See some recet papers o the geeralized likelihood ratio test] Normal case: For multivariate ormal distributio, the likelihood ratio statistic is relatively simple because the limitig distributio of Λ is available Specifically, cosider H o : µ = µ o 5

6 versus H a : µ µ o Here θ = (µ, σ 11, σ 21, σ 22,, σ p1,, σ pp ) is of dimesio p+p(p+1)/2 Uder H o, µ is fixed at µ o so that Θ o cosists of the space for the elemets of Σ I this case, Λ = max Σ L(µ o, Σ) max µ,σ L(µ, Σ) Recall that, uder Θ, max µ,σ L(µ, Σ) = L( µ, Σ) = 1 (2π) p/2 Σ /2 e p/2, where µ = x = 1 x i ad Σ = 1 (x i x)(x i x) Uder Θ o, the likelihood fuctio becomes [ 1 L(µ o, Σ) = exp 1 ] (x (2π) p/2 Σ /2 i µ 2 o ) Σ 1 (x i µ o ) By the same method as before, we ca show that max Σ L(µ o, Σ) = L(µ o, Σ o ) = where Σ o = 1 (x i µ o )(x i µ o ) Cosequetly, Λ = ( Σ ) /2 Σ o The statistic Λ 2/ = Σ / Σ o is called the Wilks lambda 1 (2π) p/2 Σ o /2 e p/2, Result 51 Let X 1,, X be a radom sample from a N p (µ, Σ) populatio Cosider the hypothesis testig: H o : µ = µ o versus H a : µ µ o The, the likelihood ratio statistic becomes ( Λ 2/ = 1 + T 2 ) 1, 1 where T 2 is the Hotellig s T 2 statistic Proof The followig equality of determiats is useful ( 1) (x i x)(x i x) + ( x µ o )( x µ o ) ( ) = (x i x)(x i x) 1 ( x µ o ) 1 (x x)(x i x) ( x µ o ) This follows directly from the idetity A = A 22 A 11 A 12 A 1 22 A 21 = A 11 A 22 A 21 A 1 11 A 12 with [ (x A = i x)(x i x) ] [ ] ( x µo ) A11 A ( x µo ) = 12 1 A 21 A 22 6

7 Note also that Therefore, That is, Cosequetly, (x i µ o )(x i µ o ) = (x i x)(x i x) + ( x µ o )( x µ o ) ( ( 1) (x i µ o )(x i µ o ) = (x x)(x i x) ( 1) 1 + T 2 ) 1 ( Σ o = Σ 1 + T 2 ) 1 Λ 2/ = Σ ( Σ o = 1 + T 2 ) 1 1 Remark: The prior result suggests that there is o eed to calculate S 1 i computig the Hotellig s T 2 statistic Ideed, 3 Cofidece regio T 2 = ( 1) Σ o Σ ( 1) Let θ be the parameter vector of iterest ad Θ be the paramter space For a radom sample X 1,, X, let X deote the data A 100(1 α)% cofidece regio R(X) is defied as P r[θ R(X)] = 1 α, where the probability is evaluated at the ukow true parameter θ That is, the regio R(X) will cover θ with probability 1 α For the mea vector µ of a multivariate ormal distributio, the cofidece regio (CR) ca be obtaied by the result T 2 ( 1)p p F p, p That is, P r [ ( X µ) S 1 ( X µ) ] ( 1)p p F p, p(α) = 1 α, whatever the values of µ ad Σ, where S = 1 (X 1 i X)(X i X) I other words, if we measure the distace usig the variace-covariace matrix 1 S, the X will be withi [( 1)pF p, p (α)/( p)] 1/2 of µ 7

8 Remark: The quatity ( X µ) (S/) 1 ( X µ) ca be cosidered as the Mahalaobis distace of µ from X, because the covariace matrix of X is 1 Σ, which is cosistetly estimated by 1 S Compared with the Euclidea distace, Mahalaobis distace takes ito cosideratio the covariace structure It is a distace measure based o correlatios betwee variables For ormal distributio, CR ca be viewed as ellipsoids cetered at µ ad have axes determied by the eigevalues ad eigevectors of S For N p (µ, Σ) distributio, the cotours of costat desity are ellipsoids defied by x such that (x µ) Σ 1 (x µ) = c 2 These ellipsoids are cetered at µ ad have axes ±c λ i e i, where Σe i = λ i e i for i = 1,, p Cosequetly, the CR are cetered at x ad the axes of the cofidece ellipsoid are ± λ ( 1)p i ( p) F p, p(α)e i, where Se i = λ i e i for i = 1,, p Simultaeous cofidece itervals: Suppose that X N p (µ, Σ) For ay o-zero p-dimesioal vector a, Z = a X N(a µ, a Σa) If x 1,, x are a radom sample from X, {z i = a x i } is a radom sample of Z The sample mea ad variace are z = a x ad s 2 z = a Sa, where S is the sample covariace matrix of x i If a is fixed, the 100(1 α)% cofidece iterval of a µ is a x t 1 (α/2) a Sa a µ a x + t 1 (α/2) a Sa This iterval cosists of a µ values for which (a x a µ) a Sa t 1(α/2), or, equivaletly, [a ( x µ)] 2 t 2 a Sa 1(α/2) If we cosider all values of a for which the prior iequality holds, the we are aturally led to the determiatio of [a ( x µ)] 2 max a a Sa Usig the Maximizatio Lemma (see Eq (250) of the text, p 80) with x = a, d = x µ ad B = S, we obtai [a ( x µ)] 2 [ [a ( x µ)] 2 ] max = max = ( x µ) a a Sa a S 1 ( x µ) = T 2, a Sa 8

9 with the maximum occurrig for a proportioal to S 1 ( x µ) Notig that T 2 F p, p, we have the followig result ( 1)p p Result 53 Let X 1,, X be a radom sample from a N p (µ, Σ) populatio with Σ beig positive defiite The, simultaeously for all a, the iterval a X p( 1) ( p) F p, p(α)a Sa, a X + p( 1) ( p) F p, p(α)a Sa will cotai a µ with probability 1 α For coveiece, we refer to the simultaeous cofidece itervals of Result 53 as T 2 - itervals I particular, the choices of a = (1, 0,, 0), (0, 1, 0,, 0),, (0,, 0, 1) allow us to coclude that x i p( 1) ( p) F sii p, p(α) µ i x i + p( 1) ( p) F sii p, p(α), hold simultaeously with cofidece coefficiet 1 α for all i = 1,, p Furthermore, by choosig a = (0,, 0, 1, 0,, 0, 1, 0,, 0) with 1 at the ith positio ad 1 at the kth positio, where i k, we have a µ = µ i µ k ad a Sa = s ii 2s ik + s kk, ad the T 2 -itervals x i x k p( 1) ( p) F sii 2s ik + s kk p, p(α) µ i µ k x i x k + p( 1) ( p) F sii 2s ik + s kk p, p(α) Compariso A alterative approach to costruct cofidece itervals for the compoet meas is to cosider the compoet oe at a time This approach igores the covariace structure of the variables ad leads to the itervals x i t 1 (α/2) sii µ i x i + t 1 (α/2) for i = 1,, p Here each iterval has a cofidece coefficiet 1 α However, we do ot kow the cofidece coefficiet that all itervals cotai their respective µ i, except for the case that the variables X i are idepedet I the latter case, the cofidece coefficiet is (1 α) p Obviously, the T 2 -iterval is wider tha the idividual iterval for each µ i The table below gives some comparisos of critical multipliers for oe-at-a-time t-itervals ad the T 2 itervals for selected ad p whe the coverage probability is (1 α) = 095 sii, 9

10 Table 1: Critical multipliers for oe-at-a-time t-itervals ad the T 2 -itervals for selected sample size ad dimesio p, where (1 α) = 095 p( 1) p p, p(005) t 1 (0025) p = 3 p = 5 p = Boferroi method for multiple comparisos I some applicatios, the T 2 itervals might be too wide to be of practical value ad oe may oly cocer about a fiite umber of liear combiatios, say a 1,, a m I this case, we may cotruct m cofidece itervals that are shorter tha the T 2 itervals This method to multiple comparisos is called the Boferroi method Let C i deote a cofidece statemet about the value of a iµ with P r[c i true ] = 1 α i, where i = 1,, m The, P r[all C i true] = 1 P r[at least oe C i false] m m 1 P r[c i false] = 1 [1 P r(c i true)] = 1 (α α m ) This is a special case of the Boferroi iequality ad allows us to cotrol the overall type-i error rate α α m The choices of α i make the method flexible, depedig o the prior kowledge o the importace of each iterval If there is o prior iformatio, the α i = α/m is ofte used Let z i = a ix i ad s zi = a isa i The, each of the itervals szi z i ± t 1 (α/(2m)), i = 1,, m, cotais a iµ with probability 1 α/m Joitly, all a iµ are i their respective itervals with probability p 1 m(α/m) = 1 α For istace, if m = p ad a i is the ith uit vector, the the itervals sii x i t 1 (α/(2p)) µ sii i x i + t 1 (α/(2p)) hold with probability at least 1 α We refer to these itervals as the Boferroi itervals 10

11 Table 2: Ratio of the legths betwee Boferroi iterval ad T 2 -iterval for selected sample size ad dimesio p whe the probability of the iterval is 095 p I geeral, if we use α i = α/p, the we ca compare the legth of Boferroi itervals with those of the T 2 itervals The ratio is Legth of Boferroi iterval Legth of T 2 iterval = t 1(α/(2p)) p( 1) p F p, p(α) This ratio does ot deped o the sample mea or covariace matrix Table 2 gives the prior ratio for some selected ad p See also, Table 54 of the text( page 234) 4 Large sample case Whe the sample size is large, we ca apply the cetral limit theory so that the populatio may ot be ormal Recall that, whe is large, ( X µ) S 1 ( X µ) χ 2 p Thus, P r[( X µ) S 1 ( X µ) χ 2 p(α)] 1 α, where χ 2 p(α) is the upper 100α percetile of a chi-square distributio with p degrees of freedom This property ca be used (a) to costruct asymptotic cofidece itervals for the meas µ i ad (b) to perform hypothesis testig about µ Asymptotic cofidece itervals: Let X 1,, X be a radom sample from a populatio with mea µ ad positive covariace matrix Σ If p is sufficietly large, a X ± χ 2 p(α) a Sa will cotai a µ, for every a, with probability approximately 1 α Sice the chi-square distributio does ot deped o a, the above cofidece itervals are simultaeous cofidece itervals This result ca be used to compare meas of differet compoets of X 11

12 Asymptotic testig: Let X 1,, X be a radom sample from a populatio with mea µ ad positive covariace matrix Σ Whe p is sufficietly large, the hypothesis H o : µ = µ o is rejected i favor of H a : µ µ o, at the sigificace level α if ( x µ o ) S 1 ( x µ o ) > χ 2 p(α) Remark: A simple R fuctio is available o the course web that computes various types of cofidece itervals for the meas of the compoets of X To use the program, do the followig: (a) Dowload the program ito your workig directory, (2) use the commad source( r-cregiotxt ) to compile the fuctio The commad cofreg is ready to use You ca modify the fuctio to costruct cofidece regios for differeces betwee meas of compoets of X Example: Cosider the mothly log returs of four Midwest compaies used before The various cofidece itervals are obtaied as follows: > setwd("c:/users/rst/teachig/ama") > x=readtable("m-ba4c9807txt") > source("cregior") > cofreg(x) [1] "CR based o T^2" [,1] [,2] [1,] [2,] [3,] [4,] [1] "CR based o idividual t" [,1] [,2] [1,] [2,] [3,] [4,] [1] "CR based o Boferroi" [,1] [,2] [1,] [2,] [3,] [4,] [1] "Asymp simu CR" [,1] [,2] [1,] [2,] [3,] [4,] > 12

13 Figure 2: T-sq chart for the mothly log returs of four Midwest compaies: The limits are 95% ad 99% quatiles t Multivariate cotrol charts We discuss two cases I the first case, the goal is to idetify uusual observatios i a sample, icludig the possibility of drift over time This is called the T 2 -chart, which uses the distace d i discussed before i assessig ormality assumptio For the ith observatio, the d 2 i statistic is d 2 i = (x i x) S 1 (x i x) The upper cotrol limit is the set by the upper quatiles of χ 2 p Typically, 95th ad 99th percetiles are used to set the upper cotrol limit ad the lower cotrol limit is zero Oce a poit is foud to be outside the cotrol limit, the idividual cofidece itervals for the compoet meas ca be used to idetify the source of the deviatio As a illustratio, cosider the mothly log returs of the four Midwest compaies from 1998 to 2007 The T 2 -chart is give i Figure 2 About three data poits are outside of the 99% limit, idicatig a volatility period I the secod case, we cosider a cotrol chart for future obsevratios The theory behid this type of cotrol chart is the followig result Result 56 Let X 1,, X be idepedetly distributed as N p (µ, Σ) ad let X be a 13

14 future observatio from the same populatio The, T 2 = + 1 (X X) S 1 (X X) p( 1) p F p, p, ad a 100(1 α)% p-dimesioal predictio ellipsoid is give by all x satisfyig (x x) S 1 (x x) p(2 1) ( p) F p, p(α) Proof: First, E(X X) = 0 Sice X ad X are idepedet, Cov(X X) = Σ + 1 Σ = + 1 Σ /( + 1)(X X) N p (0, Σ) Furthermore, by the result i Eq (56) of the Thus, textbook, + 1 (X X) S (X X) has a scaled F distributio as stated T 2 -chart for future observatios For each ew observatio, plot T 2 = + 1 (x x) S 1 (x x), agaist time order Set the lower cotrol limit to zero ad the upper cotrol limit as UCL = ( 1)p p F p, p(001) For illustratio, cosider the mothly log retur series of the four Midwest compaies We start with iitial = 40 observatios The T 2 -chart for future observatios is give i Figure 3 Remark: R fuctios for the two cotrol charts are available o the course web 6 Missig values i ormal radom sample A key assumptio: Missig at radom Two methods are available: 1 The EM algorithm: Dempster, Laird ad Rubi (1977, JRSSB) 2 Markov chai Mote Carlo (MCMC) method Some refereces 14

15 Figure 3: T 2 Future Chart for mothly log returs of four Midwest compaies: Future Chart: the limits are 95% ad 99% quatiles T tdx 15

16 1 The EM Alorithm ad Extesios by G J McLachla ad T Krisha (2008), Joh Wiley 2 The EM Algorithm ad Related Statistical Models by M Wataabe ad K Yamaguchi (2003), CRC Press 3 Bayesia Data Aalysis, 2d Ed, by Gelma, Carli, Ster ad Rubi (2003), CRC Press 4 Bayes ad Empirical Bayes Methods for Data Aalysis, 2d Ed, by BP Carli ad TA Louis (2001), CRC Press EM algorithm: Iterate betwee Expectatio step ad Maximizatio step E-step: For each data poit with missig values, use the coditioal distributio X 1 X 2 = x 2 N k (µ 1 + Σ 12 Σ 1 22 (x 2 µ 2 ), Σ 11 Σ 12 Σ 1 22 Σ 21 ), where k = dim(x 1 ) ad the partitio is based o X = (X 1, X 2) Here X 1 deotes the missig compoets ad X 2 the observed compoet M-step: Perform MLE estimatio based o the complete data It is helpful to make use of sufficiet statistics MCMC method: Also makes use of the same coditioal distributio However, istead of usig the expectatio, oe draws a radom sample from the coditioal distributio to fill the missig values Basic Result used i MCMC method: Suppose that x 1,, x form a radom sample from N(µ, Σ), where Σ is kow Suppose that the prior distributio of µ is N(µ o, Σ o ) The the posterior distributio of µ is also multivariate ormal with mea µ ad covariace matrix Σ, where where x = x i / Σ 1 = Σ 1 o + Σ 1, µ = Σ (Σ 1 o µ o + Σ 1 x), 7 Impact of serial correlatios Cosider the case i which X i is serially correlated such as it follows a vector AR(1) model X t µ = Φ(X t 1 µ) + a t, a t iid N p (0, Σ), 16

17 where E(X t ) = µ ad all eigevalues of Φ are less tha 1 i modulus Let X 1,, X be cosecutive observatios of the model Defie the lag-l autocovariace matrix of X t as Γ l = E(X t µ)(x t l µ), l = 0, ±1, ±2, It is easy to see that Γ l = Γ l Also, by repeatedly substitutio, we have Therefore, X t µ = a t + Φa t 1 + Φ 2 a t 2 + Γ 0 = Φ i Σ(Φ i ) For l > 0, post-multiplyig the model by (X t l µ), takig expectatio, ad usig the fact that X t l is ucorrelated with a t, we get Γ l = ΦΓ l 1, l = 1, 2, Takig the traspose of the above equatio, we have Usig Γ l = Γ l, we obtai which is equivalet to Γ l = Γ l 1Φ Γ l = Γ (l 1) Φ, l > 1, Γ l = Γ l+1 Φ, l = 1, 2, For the vector AR(1) model, we ca show the followig properties: 1 E( X) = µ 2 Cov( 1/2 t=1 X t ) p Ω, where Ω = (I Φ) 1 Γ 0 + Γ 0 (I Φ ) 1 Γ 0, as 3 S = 1 t=1 (X 1 t X)(X t X) p Γ 0 as Usig these properties, we ca show that ( X µ) is approximately ormal with mea 0 ad covariace Ω This implies that ( X µ) Ω 1 ( X µ) χ 2 p, ot the usual statistic ( X µ) S 1 ( X µ) See Table 510 of the text (page 257) for the differece i coverage probability due to the effect of AR(1) serial correlatios 17

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2. SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample

More information

Exponential Families and Bayesian Inference

Exponential Families and Bayesian Inference Computer Visio Expoetial Families ad Bayesia Iferece Lecture Expoetial Families A expoetial family of distributios is a d-parameter family f(x; havig the followig form: f(x; = h(xe g(t T (x B(, (. where

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Introductory statistics

Introductory statistics CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Stat 200 -Testing Summary Page 1

Stat 200 -Testing Summary Page 1 Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece

More information

Soo King Lim Figure 1: Figure 2: Figure 3: Figure 4: Figure 5: Figure 6: Figure 7:

Soo King Lim Figure 1: Figure 2: Figure 3: Figure 4: Figure 5: Figure 6: Figure 7: 0 Multivariate Cotrol Chart 3 Multivariate Normal Distributio 5 Estimatio of the Mea ad Covariace Matrix 6 Hotellig s Cotrol Chart 6 Hotellig s Square 8 Average Value of k Subgroups 0 Example 3 3 Value

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

4. Hypothesis testing (Hotelling s T 2 -statistic)

4. Hypothesis testing (Hotelling s T 2 -statistic) 4. Hypothesis testig (Hotellig s T -statistic) Cosider the test of hypothesis H 0 : = 0 H A = 6= 0 4. The Uio-Itersectio Priciple W accept the hypothesis H 0 as valid if ad oly if H 0 (a) : a T = a T 0

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Summary. Recap ... Last Lecture. Summary. Theorem

Summary. Recap ... Last Lecture. Summary. Theorem Last Lecture Biostatistics 602 - Statistical Iferece Lecture 23 Hyu Mi Kag April 11th, 2013 What is p-value? What is the advatage of p-value compared to hypothesis testig procedure with size α? How ca

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

TAMS24: Notations and Formulas

TAMS24: Notations and Formulas TAMS4: Notatios ad Formulas Basic otatios ad defiitios X: radom variable stokastiska variabel Mea Vätevärde: µ = X = by Xiagfeg Yag kpx k, if X is discrete, xf Xxdx, if X is cotiuous Variace Varias: =

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

APPLIED MULTIVARIATE ANALYSIS

APPLIED MULTIVARIATE ANALYSIS ALIED MULTIVARIATE ANALYSIS FREQUENTLY ASKED QUESTIONS AMIT MITRA & SHARMISHTHA MITRA DEARTMENT OF MATHEMATICS & STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANUR X = X X X [] The variace covariace atrix

More information

1.010 Uncertainty in Engineering Fall 2008

1.010 Uncertainty in Engineering Fall 2008 MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval

More information

STATISTICAL INFERENCE

STATISTICAL INFERENCE STATISTICAL INFERENCE POPULATION AND SAMPLE Populatio = all elemets of iterest Characterized by a distributio F with some parameter θ Sample = the data X 1,..., X, selected subset of the populatio = sample

More information

Lecture 33: Bootstrap

Lecture 33: Bootstrap Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece

More information

Lecture 19: Convergence

Lecture 19: Convergence Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may

More information

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the

More information

Lecture 11 and 12: Basic estimation theory

Lecture 11 and 12: Basic estimation theory Lecture ad 2: Basic estimatio theory Sprig 202 - EE 94 Networked estimatio ad cotrol Prof. Kha March 2 202 I. MAXIMUM-LIKELIHOOD ESTIMATORS The maximum likelihood priciple is deceptively simple. Louis

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes. Term Test October 3, 003 Name Math 56 Studet Number Directio: This test is worth 50 poits. You are required to complete this test withi 50 miutes. I order to receive full credit, aswer each problem completely

More information

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters

More information

Mathematical Statistics - MS

Mathematical Statistics - MS Paper Specific Istructios. The examiatio is of hours duratio. There are a total of 60 questios carryig 00 marks. The etire paper is divided ito three sectios, A, B ad C. All sectios are compulsory. Questios

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Department of Mathematics

Department of Mathematics Departmet of Mathematics Ma 3/103 KC Border Itroductio to Probability ad Statistics Witer 2017 Lecture 19: Estimatio II Relevat textbook passages: Larse Marx [1]: Sectios 5.2 5.7 19.1 The method of momets

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9 Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I

More information

Chi-Squared Tests Math 6070, Spring 2006

Chi-Squared Tests Math 6070, Spring 2006 Chi-Squared Tests Math 6070, Sprig 2006 Davar Khoshevisa Uiversity of Utah February XXX, 2006 Cotets MLE for Goodess-of Fit 2 2 The Multiomial Distributio 3 3 Applicatio to Goodess-of-Fit 6 3 Testig for

More information

Statistical Properties of OLS estimators

Statistical Properties of OLS estimators 1 Statistical Properties of OLS estimators Liear Model: Y i = β 0 + β 1 X i + u i OLS estimators: β 0 = Y β 1X β 1 = Best Liear Ubiased Estimator (BLUE) Liear Estimator: β 0 ad β 1 are liear fuctio of

More information

Last Lecture. Wald Test

Last Lecture. Wald Test Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2 82 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Defiitio: et X be a radom sample with joit p.m/d.f. f X x θ. The geeralised likelihood ratio test g.l.r.t. of the NH : θ H 0 agaist the alterative AH : θ H 1,

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

32 estimating the cumulative distribution function

32 estimating the cumulative distribution function 32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio

More information

Kurskod: TAMS11 Provkod: TENB 21 March 2015, 14:00-18:00. English Version (no Swedish Version)

Kurskod: TAMS11 Provkod: TENB 21 March 2015, 14:00-18:00. English Version (no Swedish Version) Kurskod: TAMS Provkod: TENB 2 March 205, 4:00-8:00 Examier: Xiagfeg Yag (Tel: 070 2234765). Please aswer i ENGLISH if you ca. a. You are allowed to use: a calculator; formel -och tabellsamlig i matematisk

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15 17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

Lecture Notes 15 Hypothesis Testing (Chapter 10)

Lecture Notes 15 Hypothesis Testing (Chapter 10) 1 Itroductio Lecture Notes 15 Hypothesis Testig Chapter 10) Let X 1,..., X p θ x). Suppose we we wat to kow if θ = θ 0 or ot, where θ 0 is a specific value of θ. For example, if we are flippig a coi, we

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Chapter 13: Tests of Hypothesis Section 13.1 Introduction Chapter 13: Tests of Hypothesis Sectio 13.1 Itroductio RECAP: Chapter 1 discussed the Likelihood Ratio Method as a geeral approach to fid good test procedures. Testig for the Normal Mea Example, discussed

More information

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008 Chapter 6 Part 5 Cofidece Itervals t distributio chi square distributio October 23, 2008 The will be o help sessio o Moday, October 27. Goal: To clearly uderstad the lik betwee probability ad cofidece

More information

Lecture 23: Minimal sufficiency

Lecture 23: Minimal sufficiency Lecture 23: Miimal sufficiecy Maximal reductio without loss of iformatio There are may sufficiet statistics for a give problem. I fact, X (the whole data set) is sufficiet. If T is a sufficiet statistic

More information

Estimation of the Mean and the ACVF

Estimation of the Mean and the ACVF Chapter 5 Estimatio of the Mea ad the ACVF A statioary process {X t } is characterized by its mea ad its autocovariace fuctio γ ), ad so by the autocorrelatio fuctio ρ ) I this chapter we preset the estimators

More information

Topic 18: Composite Hypotheses

Topic 18: Composite Hypotheses Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

Bayesian Methods: Introduction to Multi-parameter Models

Bayesian Methods: Introduction to Multi-parameter Models Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested

More information

Statistical Analysis on Uncertainty for Autocorrelated Measurements and its Applications to Key Comparisons

Statistical Analysis on Uncertainty for Autocorrelated Measurements and its Applications to Key Comparisons Statistical Aalysis o Ucertaity for Autocorrelated Measuremets ad its Applicatios to Key Comparisos Nie Fa Zhag Natioal Istitute of Stadards ad Techology Gaithersburg, MD 0899, USA Outlies. Itroductio.

More information

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Lecture 6 Simple alternatives and the Neyman-Pearson lemma STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

Basis for simulation techniques

Basis for simulation techniques Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios

More information

1 Review of Probability & Statistics

1 Review of Probability & Statistics 1 Review of Probability & Statistics a. I a group of 000 people, it has bee reported that there are: 61 smokers 670 over 5 960 people who imbibe (drik alcohol) 86 smokers who imbibe 90 imbibers over 5

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

n outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n,

n outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n, CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 9 Variace Questio: At each time step, I flip a fair coi. If it comes up Heads, I walk oe step to the right; if it comes up Tails, I walk oe

More information

MA Advanced Econometrics: Properties of Least Squares Estimators

MA Advanced Econometrics: Properties of Least Squares Estimators MA Advaced Ecoometrics: Properties of Least Squares Estimators Karl Whela School of Ecoomics, UCD February 5, 20 Karl Whela UCD Least Squares Estimators February 5, 20 / 5 Part I Least Squares: Some Fiite-Sample

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

There is no straightforward approach for choosing the warmup period l.

There is no straightforward approach for choosing the warmup period l. B. Maddah INDE 504 Discrete-Evet Simulatio Output Aalysis () Statistical Aalysis for Steady-State Parameters I a otermiatig simulatio, the iterest is i estimatig the log ru steady state measures of performace.

More information

Simulation. Two Rule For Inverting A Distribution Function

Simulation. Two Rule For Inverting A Distribution Function Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump

More information

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63. STT 315, Summer 006 Lecture 5 Materials Covered: Chapter 6 Suggested Exercises: 67, 69, 617, 60, 61, 641, 649, 65, 653, 66, 663 1 Defiitios Cofidece Iterval: A cofidece iterval is a iterval believed to

More information

Binomial Distribution

Binomial Distribution 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

Chi-squared tests Math 6070, Spring 2014

Chi-squared tests Math 6070, Spring 2014 Chi-squared tests Math 6070, Sprig 204 Davar Khoshevisa Uiversity of Utah March, 204 Cotets MLE for goodess-of fit 2 2 The Multivariate ormal distributio 3 3 Cetral limit theorems 5 4 Applicatio to goodess-of-fit

More information