4. Hypothesis testing (Hotelling s T 2 -statistic)

Size: px
Start display at page:

Download "4. Hypothesis testing (Hotelling s T 2 -statistic)"

Transcription

1 4. Hypothesis testig (Hotellig s T -statistic) Cosider the test of hypothesis H 0 : = 0 H A = 6= 0 4. The Uio-Itersectio Priciple W accept the hypothesis H 0 as valid if ad oly if H 0 (a) : a T = a T 0 is accepted for all liear compouds a: [i some sese the uio of all such hypotheses] For give a; we set y = a T x so that i the populatio uder H 0 ; ad i our sample E (y) = a T 0 V ar (y) = a T a y = a T x s s s y s:e: (y) = = a T S u a The uivariate t-statistic for testig H 0 (a) agaist the alterative (y) 6= a T 0 is t (a) = y at 0 s:e: (y) p a T (x = 0 ) p a T Sa where S is uderstood to deote the ubiased estimator S u for the rest of this subsectio. The acceptace threshold for H 0 (a) takes the form t (a) R for some R. The multivariate acceptace regio is the itersectio \ t (a) R (4.) which is true if ad oly if max a t (a) R: Therefore we adopt T = max t (a) a as the test statistic for H 0 : Write d = x 0 ; the t (a) = at dd T a a T Sa which is the ratio of two quadratic forms. This is a classical extremal problem kow as the geeralized eigevalue problem As this ratio is idepedet of the scale of a we ca set the deomiator arbitrarily to have the value.

2 This gives a equivalet formulatio Maximize a T (x 0 ) (x 0 ) T a subject to a T Sa = (4.) i.e. of the form: max f (a) subject to g (a) = 0. We itroduce a Lagragea multiplier ad seek to determie ad a to satisfy d h i a T (x da 0 ) (x 0 ) T a a T Sa = 0 dd T a Sa = 0 (4.3a) S dd T I a = 0 (4.3b) js dd T Ij = 0 (4.3c) (4.3b) ca be writte Ma = a showig that a is a eigevector of M = S dd T. (4.3c) is the determiatal equatio satis ed by the eigevalues of S dd T. Premultiplyig (4.3a) by a T gives a T dd T a a T Sa = 0 = at dd T a a T Sa = t (a) Therefore i order to maximize t (a) we choose to be the largest eigevalue of M = S dd T : This is a rak matrix with the sigle o-zero eigevalue tr S dd T = d T S d ad the maximum of (4.) is kow as Hotellig s T statistic T = (x 0 ) T S (x 0 ) (4.4) which is the sample Mahalaobis distace betwee x ad 0. Note that M = uv T u = S d ad v = d are rak matrices (vectors). Mu = u v T u = u where so u = S d is the correspodig eigevector of M: Thus a = u = S d de es the liear compoud y = a T x givig the maximum discrepacy from the ull hypothesis H 0 : 4. Distributio of T Uder H 0 it ca be show that T s p p F p; p (4.5)

3 where F p; p is the F distributio o p ad p degrees of freedom. The correspodig F statistic is therefore F = p : T p : If H 0 is true the F has a F distributio o p ad p degrees of freedom (d.f.), while if H 0 is ot true the the distributio has a o-cetral F distributio. NB. depedig o the covariace matrix used, T has slightly di eret forms ( T ( ) (x = 0 ) T Sm (x 0 ) (x 0 ) T Su (x 0 ) where S m is the ML estimate of (with divisor ) ad S u is the ubiased estimator of (with divisor ). Example I a ivestigatio of adult itelligece, scores were obtaied o two tests "verbal" ad "performace" for = 0 subjects aged 60 to 64. Doppelt ad Wallace (955) reported the followig mea score ad covariace matrix: x 55:4 = x 34:97 0:54 6:99 S u = 6:99 9:68 At the = :0 (%) level, test the hypothesis that 60 = 50 ad We rst compute S u = :039 :0400 :0400 :03 d = x 0 = 4:76 5:03 T The T statistic is the T = 4:76 5:03 :039 :0400 4:76 0 :0400 :03 5:03 4:76 = :039 4:76 5:03 : :03 :03 = 3:54 This gives F = p : T p = 99 3:54 = 76:9 3

4 The earest tabulated % value correspods to F ;60 ad is Therefore we coclude the ull hypothesis should be rejected. The sample probably arose from a populatio with a much lower mea vector, rather closer to the sample mea. Example The chage i levels of free fatty acid (FFA) were measured o 5 hypotised subjects who had bee asked to experiece fear, depressio ad ager e ects while uder hyposis. The mea FFA chages were x = :699 x = :78 x 3 = :558 Give that the covariace matrix of the stress di ereces y i = x i x i ad y i = x i x i3 is :7343 :666 S u = :666 :7733 Su 0:804 0:338 = 0:338 :7733 test at the 0.05 level of sigi cace, whether each e ect produced the same chage i FFA. [T = :68 ad F = :4 with degrees of freedom,3. Do ot reject the hypothesis "o emotio e ect" at the = :05 level] 4.3 Ivariace of T T is ua ected by chages i the scale or origi of the (respose) variables. Cosider where C is (p p) ad o-sigular. y = Cx + d The ull hypothesis H 0 : x = 0 is equivalet to H 0 : y = C 0 + d. trasformatio Uder this liear y = C x + d S y = CSC T so that T y = y y T S y y y = (x 0 ) T C T CSC T C (x 0 ) = (x 0 ) T C T C T S C C (x 0 ) = (x 0 ) T S (x 0 ) which demostrates ivariace. 4

5 4.4 Co dece iterval for a mea A co dece regio for ca be obtaied give the distributio of T by substitutig the data values x ad S : I Example above we have (x ) T S (x ) s p p F p; p (4.6) x = (55:4; 34:97) T 00S :3 :40 = :40 :3 ad F ;99 (:0) is approximately 4.83 (by iterpolatio). A 99% co dece regio for is therefore the set of ( ; ) satisfyig :3 ( 55:4) :80 ( 5:4) ( 34:97) + :3 ( 34:97) 00 4:83 = 9:76: 99 This represets the iterior of a ellipse i p = dimesios which ca be plotted. I higher dimesios a ellipsoidal co dece regio is obtaied. 4.5 Likelihood ratio testig Give a data matrix X of observatios o a radom vector x whose distributio depeds o a vector of parameters, the likelihood ratio for testig the ull hypothesis H 0 : 0 H : agaist the alterative is de ed as = sup 0 L sup L where L = L (; X) is the likelihood fuctio. I a likelihood ratio test (LRT) we reject H 0 for low values of ; i.e. if < c where c is chose so that the probability of Type I error is : If we de e l0 = log L 0 where L 0 is the value of the umerator ad similarly l = log L, the rejectio criterio takes the form L log = log 0 Result L (4.7) = l 0 l > k (4.8) Whe H 0 is true ad for large the log likelihood ratio (4.8) has the -distributio o r degrees of freedom, r, where r equals the umber of free parameters uder H mius the umber of free parameters uder H 0 : 5

6 4.6 LRT for a mea whe is kow H 0 : = 0 a give value whe is kow Give a radom sample from N (; ) resultig i x ad S the likelihood give i (3.8b) is (to withi a additive costat) l (; ) = log jj + tr S o + (x ) T (x ) (4.9) Uder H 0 the value of is kow ad l0 = l ( 0 ; ) = log jj + tr S o + (x 0 ) T (x 0 ) Uder H with o restrictio o ; the m.l.e. of is ^ = x: Thus l = log jj + tr S Therefore log = l 0 l = (x 0 ) T (x 0 ) (4.0) which is times the Mahalaobis distace of x from 0. Note the similarity with Hotellig s T statistic. Give the distributio of x uder H 0 is x s N p 0 ; ad (4.0) may be writte usig the trasformatio y = idepedet N (0; ) variates as (x 0 ) to a stadard set of px log = y T y = yi (4.) we have the exact distributio showig that i this case the asymptotic distributio of Example log s p (4.) log is exact for the small sample case. Measuremets of the legth of skull were made o a sample of rst ad secod sos from 5 families. 85:7 x = 83:84 9:48 66:88 S = 96:78 Assumig that i fact =

7 test at the = :05 level the hypothesis H 0 : = 8 8 T Solutio log = 5 3:7 :84 :0 0 3:7 0 :0 :84 = 0:5 3:7 + :84 = 4:3 Sice (:05) = 5:99 do ot reject H LRT for whe is ukow (Hotellig0s) Cosider the LRT test of the hypothesis H 0 : = 0 whe is ukow. H : 6= 0 I this case must be estimated uder H 0 ad also uder H : Uder H 0 : ( = 0 ) l ( 0 ; ) = log jj + tr S o + (x 0 ) T (x 0 ) = log jj + tr S + d T 0 d 0 = log jj + tr S + d 0 d T 0 (4.3a) (4.3b) writig d 0 for x 0 : Note that (as before) we used the followig: d T 0 d 0 = tr d T 0 d 0 = tr d 0 d T 0 Uder H : l (^; ) = log jj + tr S o + (x ^) T (x ^) = log jj + tr S (4.4a) Substitutig m.l.e. s ^ = x ad ^ = S obtaied previously, gives l ^; ^ = log jsj + tr S S = flog jsj + tr (I p )g l = log jsj + p (4.4b) 7

8 Comparig (4:3b) with (4:4b) it is clear that the m.l..e. of uder H 0 must be ^ = S + d 0 d T 0 ad that the correspodig value of l = log L is l 0 = log js + d 0 d T 0 j + p (4.4c) The log likelihood ratio is l 0 l = log js + d 0 d T 0 j log jsj = log js j + log js + d 0 d T 0 j = log js S + d 0 d T 0 j = log ji p +S d 0 d T 0 j = log + d T 0 S d 0 (4.5) makig use of the useful matrix result proved i (:8:3) that ji p +uv T j = + v T u : Sice log = log + T (4.6) we see that ad T are mootoically related. Therefore we ca coclude that the LRT of H 0 : = 0 whe is ukow is equivalet to use of Hotellig s T statistic. 4.8 LRT for = 0 with ukow H 0 : = 0 whe is ukow. H : 6= 0 Uder H 0 we substitute ^ = x ito o l (^; 0 ) = log j 0 j + tr 0 S + (x ^) T 0 (x ^) givig l 0 = log j 0 j + tr 0 S (4.7) Uder H we substitute the urestricted m.l.e. s ^ = x ad ^ = S givig as i (4:4e) l = log jsj + p (4.8) l0 l = log j 0 j + tr 0 S log jsj p = log j0 Sj+ tr 0 S p (4.9) This statistic depeds oly o the eigevalues of the positive de ite matrix 0 S ad has the property that l0 l = log! 0 as S approaches 0: 8

9 Let A be the arithmetic mea ad G the geometric mea of the eigevalues of 0 S the tr 0 = pa Sj = Gp j 0 log = fpa p log G pg The geeral result for the distributio of (4:0) for large gives = p fa log G g (4.0) l 0 l s r (4.) where r = p (p + ) is the umber of idepedet parameters i : 4.0 Test for sphericity A covariace matrix is said to have the property of "sphericity" if = ki p (4.) for some k: We see that this is a special case of the more geeral situatio = k 0 treated i Sectio (3.3.3). The same procedure ca be applied. The geeral likelihood expressio for a sample from the MVN distributio is: log L = log jj + tr S + dd T Uder H 0 : = ki p ad ^ = x so log L = log jki p j + tr k S = p log k + k tr S [ log L] = 0 at a miimum p k k tr S = 0 ^k = tr S p (4.4) which is i fact the arithmetic mea A of the eigevalues of S: Substitute back ito (4.3) gives l0 = p (log A + ) Uder H : ^ = x ad ^ = S l = log jsj + p = p (log G + ) 9

10 thus log = l 0 l = p log A G (4.5) The umber of free parameters cotaied i is uder H 0 ad p (p + ) uder H : Hece the appropriate distributio for comparig log is r where r = p (p + ) = (p ) (p + ) (4.6) 4. Test for idepedece Idepedece of the variables x ; :::; x p is maifest by a diagoal covariace matrix = diag ( ; :::; pp ) (4.7) We cosider H 0 : is diagoal H :. is urestricted agaist the geeral alterative Uder H 0 it is clear i fact that we will d ^ ii = s ii because the estimators of ii for each x i are idepedet. We ca also show this formally = log jj + tr S + dd T ( px = log ii + px s ii ii ii ( log L) = 0 ii s ii ii = 0 b ii = s ii Therefore ( px ) = log s ii + p = flog jdj + pg where D = diag (s ; :::; s pp ) : Uder H as before we d l = log jsj + p 0

11 Therefore l 0 l = [log jdj log jsj] = log jd Sj = log jd SD j = log jrj (4.8) The umber of free parameters cotaied i is p uder H 0 ad p (p + ) uder H : Hece the appropriate distributio for comparig log is r where r = p (p + ) p = p (p ) (4.9) 4. Simultaeous co dece itervals (Sche e, Roy & Bose) The uio-itersectio method for derivig Hotellig s T statistic provides "simultaeous co - dece itervals" for the parameters whe is ukow. Followig Sectio 4. let T = ( ) (x ) T S (x ) (4.30) where is the ukow (true) mea. Let t (a) be the uivariate t liear compoud y = a T x: The max a t (a) = T ad for all p where vectors a statistic correspodig to the t (a) T (4.3) t (a) = y y s y = p p a T (x ) = p a T Sa (4.3) so From Sectio 4. the distributio of T is T s p p F p; p Pr T ( ) p p F p; p () = therefore from (4.3), for all p vectors a Pr t (a) ( ) p p F p; p () = (4.33) Substitutig from (4.3), the co dece statemet i (4.33) is:

12 With probability for all p vectors a where K is the costat ( ) p ja T x a T j p F p; p () s a = K T Sa = s a T Sa say, (4.34) ( ) p = K = p F p; p () (4.35) A 00 ( ) % co dece iterval for the liear compoud a T is therefore s a T a x K T Sa (4.36) How ca we apply this result? We might be iterested i a de ed set of liear combiatios (liear compouds) of : The i th compoet of is for example the liear compoud de ed by a T = (0; :::; ; :::0) the uit vector with a sigle i the i th positio. For a large umber of such sets of CI s we would expect 00 ( ) % to cotai o mis-statemets while 00% would cotai at least oe mis-statemet. We ca relate the T co dece itervals to the T test of H 0 : = 0. If this H 0 is rejected at sigi cace level the there exists at least oe vector a such that the iterval (4.36) does ot iclude the value a T 0 : NB. If the covariace matrix S u (with deomiator ) is supplied, the i (4.36) r a T S u a may be replaced by : r a T Sa 4.3 The Boferroi method This provides aother way to costruct simultaeous CI s for a small umber of liear compouds of whilst cotrollig the overall level of co dece. Cosider a set of evets A ; A ; :::; A m Pr (A \ ::: \ A m ) = Pr A [ ::: [ A m From the additive law of probabilities X m Pr A [ ::: [ A m Pr A i Therefore Pr (A \ ::: \ A m ) mx Pr A i (4.37)

13 Let C k deote a co dece statemet about the value of some liear compoud a T k with Pr [C k true] = k : Pr (all C k true) ( + ::: + m ) (4.38) Therefore we ca cotrol the overall error rate give by + ::: + m = say. For example, i order to costruct simultaeous 00 ( ) % CI s for all p compoets k of we could choose k = p (k = ; :::; p) leadig to if s ii derives from S u : Example x t x p t p. p r s r spp Itelligece scores data o = 0 subjects: x x = = x S U = 55:4 34:97 0:54 6:99 6:99 9:68. Costruct 99% simultaeous co dece itervals for ; ad : For take a T = (; 0) a T x = 0 55:4 = 55:4 34:97 a T S u a = 0:54 Now take = :0 K = ( ( ) p p) F p; p () = 00 F ;99 (:0) 99 = 3: takig F ;99 (:0) = 4:83 (approx). Therefore the CI for is r 0:54 55:4 3: 0 = 55:4 4:50 givig a iterval = (50:7; 59:7) For we already have K, take a T = (0; ) the a T x = 34:97 a T S u a = 9:68 3

14 The CI for is r 9:68 34:97 3: = 34:97 3:40 0 givig a iterval = (3:6; 38:4) For take a T = [; ] a T 55:4 x = [; ] = 0:7 34:97 a T 0:54 6:99 S u a = [; ] 6:99 9:68 = 0:54 6:9 + 9:68 = 76:4 CI for is 0:7 3: r 76:4 0 = 0:7 :7 = (7:6; 3:0). Costruct CI s for ; by Boferroi method. Use = :0: Idividual CI s are costructed usig k = :0 = :005 (k = ; ) : The k t 00 = t 00 (:005) ' (:9975) CI for is = :8 55:4 :8 r 0:54 0 = 55:4 4:06 = (5:; 59:3) ad for is 34:97 :8 r 9:68 0 = 34:97 3:06 = (3:9; 38:0) Comparig CI s obtaied by the two methods we see that the simultaeous CI s for ad ad are 8.7% wider tha the coirrespodig Boferroi CI s. NB. If we had required 99% Boferroi CI s for ; ad the m = 3 i (4.38) ad m = :0 = :007: The correspodig percetage poit of t would be 6 t 00 (:007) ' (:9983) = :93 leadig to a slightly wider CI Tha obtaied above. 4

15 4.4 Two sample procedures Suppose we have two idepedet radom samples fx ; :::; x g fx ; :::; x g of size ; from two populatios. : x s N p ( ; ) : x s N p ( ; ) givig rise to sample meas x ; x ad sample covariace matrices S ; S. Note the assumptio of a commo covariace matrix : We cosider testig H 0 : = agaist H : 6= Let d = x x : Uder H 0 d s N 0; + (a) Case of kow Aalogously to the oe sample case d + s N (0; Ip ) dt d s p where = + (b) Case of ukow We have the Wishart distributed quatities S s W p (; ) S s W p (; ) Let S p = S + S be the pooled estimator of the covariace matrix : The from the additive properties of the Wishart distributio ( ) S p has the Wishart distributio W p (; ) ad d s N (0; ) It may be show that T = d T Sp d (4.39) has the distributio of a Hotellig s T statistic. I fact T s ( ) p p F p; p (4.40) 5

16 4.5 Multi-sample procedures (MANOVA) We cosider the case of k samples from populatios ; :::; k : The sample from populatio i is of size i : By aalogy with the uivariate case we ca decompose the SSP matrix ito orthogoal parts. This decompositio ca be represeted as a Multivariate Aalysis of Variace (MANOVA) table. The MANOVA model is x ij = + i + e ij j = ; :::; i ad i = ; :::; k (4.4) where e ij are idepedet N p (0; ) variables. Here the parameter vector is the overall (grad) mea ad the i is the i th treatmet e ect with kx i i = 0 De itios Let the i th sample mea be with = X k x i = i i: The Grad Mea is x = X i x ij j= X k ix i. The Betwee Groups sum of squares ad cross-products (SSP) matrix is ad the Total SSP matrix is B = T = kx i (x i x) (x i x) T (4.4) kx X i (x ij x) (x ij x) (4.43) j= It ca be show algebraically that T = B + W where W is the Withi Groups (or residual) SSP matrix give by kx X i W = (x ij x i ) (x ij x i ) T (4.44) The MANOVA table is j= Source Matrix of SS ad Degrees of of variatio cross-products (SSP) freedom (d.f.) Treatmet Residual B = X k i (x i x) (x i x) T k W = X k X i j= (x ij x i ) (x ij x i ) T X k i k Total (corrected for the mea) T = X k X i (x ij j= x) (x ij x) 6 X k i

17 We are iterested i testig the hypothesis H 0 : = = ::: = k (4.4) whether the samples i fact come from the same populatio agaist the geeral alterative H : 6= 6= ::: 6= k (4.43) 4.5. Wilk s statistic We ca derive a likelihood ratio test statistic kow as Wilk s : Uder H 0 the m.l.e. s are ^ = x ^ = S leadig to the maximized log likelihood (miimum of log L) where Uder H the m.l.e. s are This follows from l = mi ;d i = mi ( ( W = l 0 = p + log jsj (4.44) ^ i = x i ^ = W kx W i = log jj + kx i S i kx i tr log jj + tr S i + d i d T ) i!) kx i S i sice ^d i = x i ^ i = 0. Hece ^ = W ad l = p + log W (4.45) Therefore sice T = S jw j l0 l = log jt j = log (4.46) where is kow as Wilk s statistic. We reject H 0 for small values of or large values of log : Asymptotically, the rejectio regio is the upper tail of a p(k ). Uder H 0 the ukow has p parameters ad uder H the umber of parameters for ; :::; k is pk: Hece the d.f. of the is p (k ). Apart from this asymptotic result, other approximate distributios (otably Bartlett s approximatio) are available, but the details are outside the scope of this course. 7

18 4.5. Calculatio of Wilk s Result Let ; :::; p be the eigevalues of W B the Proof py = ( + j ) (4.47) j= = T W = (W + B) W = W (W + B) = I + W B py = ( + j ) (4.48) j= by the useful idetity proved earlier i the otes Case k = We show that use of Wilk s for k = groups is equivalet to usig Hotellig s T statistic. Speci cally, we show that is a mootoic fuctio of T. Thus to reject H 0 for < is equivalet to rejectig H 0 for T > (for some costats ; ): Proof For k = we ca show (Ex.) that where d = x x. The B = ddt (4.49) I + W B = I + W dd T = + dt W d Now W is just ( ) S p where S p is the pooled estimator of : Thus comparig with (4.39) where T (two sample case) is give we see = + T (4.50) 8

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2. SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

x = Pr ( X (n) βx ) =

x = Pr ( X (n) βx ) = Exercise 93 / page 45 The desity of a variable X i i 1 is fx α α a For α kow let say equal to α α > fx α α x α Pr X i x < x < Usig a Pivotal Quatity: x α 1 < x < α > x α 1 ad We solve i a similar way as

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

IIT JAM Mathematical Statistics (MS) 2006 SECTION A

IIT JAM Mathematical Statistics (MS) 2006 SECTION A IIT JAM Mathematical Statistics (MS) 6 SECTION A. If a > for ad lim a / L >, the which of the followig series is ot coverget? (a) (b) (c) (d) (d) = = a = a = a a + / a lim a a / + = lim a / a / + = lim

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2 82 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Defiitio: et X be a radom sample with joit p.m/d.f. f X x θ. The geeralised likelihood ratio test g.l.r.t. of the NH : θ H 0 agaist the alterative AH : θ H 1,

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is: PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,

More information

The Method of Least Squares. To understand least squares fitting of data.

The Method of Least Squares. To understand least squares fitting of data. The Method of Least Squares KEY WORDS Curve fittig, least square GOAL To uderstad least squares fittig of data To uderstad the least squares solutio of icosistet systems of liear equatios 1 Motivatio Curve

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

Chapter 13, Part A Analysis of Variance and Experimental Design

Chapter 13, Part A Analysis of Variance and Experimental Design Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

Matrix Representation of Data in Experiment

Matrix Representation of Data in Experiment Matrix Represetatio of Data i Experimet Cosider a very simple model for resposes y ij : y ij i ij, i 1,; j 1,,..., (ote that for simplicity we are assumig the two () groups are of equal sample size ) Y

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Random Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices

Random Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices Radom Matrices with Blocks of Itermediate Scale Strogly Correlated Bad Matrices Jiayi Tog Advisor: Dr. Todd Kemp May 30, 07 Departmet of Mathematics Uiversity of Califoria, Sa Diego Cotets Itroductio Notatio

More information

Lecture 7: Properties of Random Samples

Lecture 7: Properties of Random Samples Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ

More information

STAT431 Review. X = n. n )

STAT431 Review. X = n. n ) STAT43 Review I. Results related to ormal distributio Expected value ad variace. (a) E(aXbY) = aex bey, Var(aXbY) = a VarX b VarY provided X ad Y are idepedet. Normal distributios: (a) Z N(, ) (b) X N(µ,

More information

APPLIED MULTIVARIATE ANALYSIS

APPLIED MULTIVARIATE ANALYSIS ALIED MULTIVARIATE ANALYSIS FREQUENTLY ASKED QUESTIONS AMIT MITRA & SHARMISHTHA MITRA DEARTMENT OF MATHEMATICS & STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANUR X = X X X [] The variace covariace atrix

More information

Summary. Recap ... Last Lecture. Summary. Theorem

Summary. Recap ... Last Lecture. Summary. Theorem Last Lecture Biostatistics 602 - Statistical Iferece Lecture 23 Hyu Mi Kag April 11th, 2013 What is p-value? What is the advatage of p-value compared to hypothesis testig procedure with size α? How ca

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

Lecture Notes 15 Hypothesis Testing (Chapter 10)

Lecture Notes 15 Hypothesis Testing (Chapter 10) 1 Itroductio Lecture Notes 15 Hypothesis Testig Chapter 10) Let X 1,..., X p θ x). Suppose we we wat to kow if θ = θ 0 or ot, where θ 0 is a specific value of θ. For example, if we are flippig a coi, we

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9 Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

Last Lecture. Wald Test

Last Lecture. Wald Test Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig

More information

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Chapter 13: Tests of Hypothesis Section 13.1 Introduction Chapter 13: Tests of Hypothesis Sectio 13.1 Itroductio RECAP: Chapter 1 discussed the Likelihood Ratio Method as a geeral approach to fid good test procedures. Testig for the Normal Mea Example, discussed

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

1 General linear Model Continued..

1 General linear Model Continued.. Geeral liear Model Cotiued.. We have We kow y = X + u X o radom u v N(0; I ) b = (X 0 X) X 0 y E( b ) = V ar( b ) = (X 0 X) We saw that b = (X 0 X) X 0 u so b is a liear fuctio of a ormally distributed

More information

Comparing your lab results with the others by one-way ANOVA

Comparing your lab results with the others by one-way ANOVA Comparig your lab results with the others by oe-way ANOVA You may have developed a ew test method ad i your method validatio process you would like to check the method s ruggedess by coductig a simple

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

Statistical Inference Based on Extremum Estimators

Statistical Inference Based on Extremum Estimators T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

Asymptotic Results for the Linear Regression Model

Asymptotic Results for the Linear Regression Model Asymptotic Results for the Liear Regressio Model C. Fli November 29, 2000 1. Asymptotic Results uder Classical Assumptios The followig results apply to the liear regressio model y = Xβ + ε, where X is

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3 Itroductio to Ecoometrics (3 rd Updated Editio) by James H. Stock ad Mark W. Watso Solutios to Odd- Numbered Ed- of- Chapter Exercises: Chapter 3 (This versio August 17, 014) 015 Pearso Educatio, Ic. Stock/Watso

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab Sectio 12 Tests of idepedece ad homogeeity I this lecture we will cosider a situatio whe our observatios are classified by two differet features ad we would like to test if these features are idepedet

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

Soo King Lim Figure 1: Figure 2: Figure 3: Figure 4: Figure 5: Figure 6: Figure 7:

Soo King Lim Figure 1: Figure 2: Figure 3: Figure 4: Figure 5: Figure 6: Figure 7: 0 Multivariate Cotrol Chart 3 Multivariate Normal Distributio 5 Estimatio of the Mea ad Covariace Matrix 6 Hotellig s Cotrol Chart 6 Hotellig s Square 8 Average Value of k Subgroups 0 Example 3 3 Value

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the

More information

Homework Set #3 - Solutions

Homework Set #3 - Solutions EE 15 - Applicatios of Covex Optimizatio i Sigal Processig ad Commuicatios Dr. Adre Tkaceko JPL Third Term 11-1 Homework Set #3 - Solutios 1. a) Note that x is closer to x tha to x l i the Euclidea orm

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

y ij = µ + α i + ɛ ij,

y ij = µ + α i + ɛ ij, STAT 4 ANOVA -Cotrasts ad Multiple Comparisos /3/04 Plaed comparisos vs uplaed comparisos Cotrasts Cofidece Itervals Multiple Comparisos: HSD Remark Alterate form of Model I y ij = µ + α i + ɛ ij, a i

More information

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators. IE 330 Seat # Ope book ad otes 120 miutes Cover page ad six pages of exam No calculators Score Fial Exam (example) Schmeiser Ope book ad otes No calculator 120 miutes 1 True or false (for each, 2 poits

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

STATISTICAL INFERENCE

STATISTICAL INFERENCE STATISTICAL INFERENCE POPULATION AND SAMPLE Populatio = all elemets of iterest Characterized by a distributio F with some parameter θ Sample = the data X 1,..., X, selected subset of the populatio = sample

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments: Recall: STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Commets:. So far we have estimates of the parameters! 0 ad!, but have o idea how good these estimates are. Assumptio: E(Y x)! 0 +! x (liear coditioal

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Lecture 6 Simple alternatives and the Neyman-Pearson lemma STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull

More information

Introductory statistics

Introductory statistics CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

STAC51: Categorical data Analysis

STAC51: Categorical data Analysis STAC51: Categorical data Aalysis Mahida Samarakoo Jauary 28, 2016 Mahida Samarakoo STAC51: Categorical data Aalysis 1 / 35 Table of cotets Iferece for Proportios 1 Iferece for Proportios Mahida Samarakoo

More information

Stat 200 -Testing Summary Page 1

Stat 200 -Testing Summary Page 1 Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Definitions and Theorems. where x are the decision variables. c, b, and a are constant coefficients.

Definitions and Theorems. where x are the decision variables. c, b, and a are constant coefficients. Defiitios ad Theorems Remember the scalar form of the liear programmig problem, Miimize, Subject to, f(x) = c i x i a 1i x i = b 1 a mi x i = b m x i 0 i = 1,2,, where x are the decisio variables. c, b,

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Study the bias (due to the nite dimensional approximation) and variance of the estimators

Study the bias (due to the nite dimensional approximation) and variance of the estimators 2 Series Methods 2. Geeral Approach A model has parameters (; ) where is ite-dimesioal ad is oparametric. (Sometimes, there is o :) We will focus o regressio. The fuctio is approximated by a series a ite

More information

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the

More information

Principle Of Superposition

Principle Of Superposition ecture 5: PREIMINRY CONCEP O RUCUR NYI Priciple Of uperpositio Mathematically, the priciple of superpositio is stated as ( a ) G( a ) G( ) G a a or for a liear structural system, the respose at a give

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

Chapter 12 EM algorithms The Expectation-Maximization (EM) algorithm is a maximum likelihood method for models that have hidden variables eg. Gaussian

Chapter 12 EM algorithms The Expectation-Maximization (EM) algorithm is a maximum likelihood method for models that have hidden variables eg. Gaussian Chapter 2 EM algorithms The Expectatio-Maximizatio (EM) algorithm is a maximum likelihood method for models that have hidde variables eg. Gaussia Mixture Models (GMMs), Liear Dyamic Systems (LDSs) ad Hidde

More information

Machine Learning for Data Science (CS 4786)

Machine Learning for Data Science (CS 4786) Machie Learig for Data Sciece CS 4786) Lecture & 3: Pricipal Compoet Aalysis The text i black outlies high level ideas. The text i blue provides simple mathematical details to derive or get to the algorithm

More information

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes. Term Test October 3, 003 Name Math 56 Studet Number Directio: This test is worth 50 poits. You are required to complete this test withi 50 miutes. I order to receive full credit, aswer each problem completely

More information

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator Slide Set 13 Liear Model with Edogeous Regressors ad the GMM estimator Pietro Coretto pcoretto@uisa.it Ecoometrics Master i Ecoomics ad Fiace (MEF) Uiversità degli Studi di Napoli Federico II Versio: Friday

More information