Chi-squared tests Math 6070, Spring 2014

Size: px
Start display at page:

Download "Chi-squared tests Math 6070, Spring 2014"

Transcription

1 Chi-squared tests Math 6070, Sprig 204 Davar Khoshevisa Uiversity of Utah March, 204 Cotets MLE for goodess-of fit 2 2 The Multivariate ormal distributio 3 3 Cetral limit theorems 5 4 Applicatio to goodess-of-fit 8 4 Testig for a fiite distributio 8 42 Testig for a Desity 9 Chi-squared tests refers to a family of statistical test whose large-sample asymptotics are related i oe way or aother to the χ 2 family of distributios These are our first examples of o-parametric procedures Let me metio oe particularly useful chi-squared test here before we get started properly Cosider a populatio whose distributio is believed to follow a kow desity fuctio f say, expoetial(2) The there is a chi-squared test that ca be used to check our hypothesis This chi-squared test is easy to implemet, ad works well for large-sized samples First, we study a related parametric estimatio problem The we fid a cosistet MLE Ad fially, we use our estimate to devise a test for our hypothesis This will be doe i several stages It is likely that you already kow somethig of the resultig χ 2 -test (page 8) But here you will fid a rigorous proof, i lie with the spirit of this course, of the fact that the said χ 2 -test actually works A [covicig but] o-rigorous justificatio of the same fact ca be foud i pages of the excellet moograph of P Bickel ad K Doksum [Mathematical Statistics, Holde Day, First editio, 977] There, a likelihood priciple argumet is devised, which usually works

2 MLE for goodess-of fit Suppose X,, X is a idepedet sample from a fiite populatio More precisely, let t,, t m be m distict umbers (or types ), ad θ := (θ,, θ m ) a m-dimesioal probability vector Cosider iid radom variables X,, X such that P{X = t i } = θ i for all i m That is, each X is of type t i with probability θ i We wish to fid a (actually, the ) MLE for the mass fuctio of the X i s That is, the MLE for the vector θ := (θ,, θ m ) Note that, here, Θ is the collectio of all θ such that 0 θ,, θ m ad θ + + θ m = This is a kid of parametric problem, of course, ad we proceed by computig the MLE Defie N := (N,, N m ) to be the cout statistics for the various types Usig colum-vector otatio, this meas that j= I {X j = t } N = () j= I {X j = t m } I words, N k deotes the umber of occuraces of type k i the sample X := (X,, X ) Note that m j= N j =, so the N j s are obviously depedet rv s Let us calculate their joit mass fuctio Let = (,, m ) be a m-vector of positive itegers such that m j= j = Such s are the possible values of N Moreover, P θ {N = } = P θ {X = s,, X = s }, (2) where the sum is take over all s = (s,, s ) such that of them are of type t, 2 of type t 2, etc For all such s, P θ {X = s,, X = s } = θ θm m (3) Moreover, the umber of all such s s is exactly ( )! :=,, m! m! (4) Therefore, we have derived the followig Lemma The radom -vector N has the multiomial distributio with parameters (, θ) That is, for all itegers,, m such that + + m =, ( ) P θ {N = } = θ,, θm m (5) m Cosider the special case = 2 The, N biomial(, θ), ad N 2 = N So the multiomial distributio is a extesio of the biomial i this sese We write N Multiomial(, θ) i place of the statemet that N is multiomial with parameters (, θ) 2

3 It turs out that we ca compute the MLE for θ by usig N as the basic data To do so, we use Lemma ad first fid the log-likelihood as follows: L(θ) = A + m N i l θ i, (6) where A does ot deped o θ The case where some θ i {0, } is trivial ad ca be ruled out easily Therefore, we may assume that 0 < θ i < for all i =,, m Thus, for all l =,, m, m ( Ni L(θ) = θ ) i (7) θ l θ i θ l As θ varies i Θ, θ,, θ m roam freely subject to the sigle coditio that θ m = m θ i I other words, ( θ i / θ l ) = 0 for all i =,, m differet from l, but ( θ m / θ l ) = This proves that ( L(θ)/ θ l ) = (N m /θ m ) (N l /θ l ) for all l =,, m Set it equal to zero (for MLE) to see that there must exist γ such that N j = γθ j for all j =,, m But m j= N j =, whereas m j= θ j = Therefore, γ = We have proved the followig Lemma 2 The MLE for θ based o N,, N m is give by N / ˆθ := (8) N m / [To prove that L/ θ l = 0 ideed yields a maximum, you take secod derivatives ad apply similar argumets] 2 The Multivariate ormal distributio We have see already the multiomial distributio, which is a probability distributio for discrete m-dimesioal radom vectors You should ote that the multiomial distributio is a geeralizatio of the biomial distributio Ad, as it turs out, multiomial distributios have a multivariate ormal approximatio that is a geeralizatio of the better-kow cetral limit theorem for uivariate radom variables I this sectio, I will hash out a rough outlie of the theory of multivariate ormal distributios [ad radom vectors] The mathematical uderpiigs of this theory is explaied i math 600, ad should ot be missed Defiitio 3 Let X := (X,, X m ) deote a m-dimesioal radom vector It mea vector is defied as E(X ) E(X) :=, E(X m ) 3

4 ad its covariace matrix is defied as the matrix whose (i, j)th etry is Cov(X i, X j ); sice Cov(X i, X i ) = Var(X i ), Var(X ) Cov(X, X 2 ) Cov(X, X m ) Cov(X 2, X ) Var(X 2 ) Cov(X 2, X m ) Cov(X) := Cov(X m, X ) Cov(X m, X 2 ) Var(X m ) Of course, we are tacitly assumig that the meas, variaces, ad covariaces are fiite If x is a m-dimesioal oradom vector, the x Cov(X)x is always a scalar Oe ca show that, i fact, that scalar is equal to the variace of the radom scalar quatity x X That is, Var (x X) = x Cov(X)x Sice the variace of a radom variable is always 0, it follows that if Σ := Cov(X), the x Qx 0 for all x Ay matrix that has the precedig property is called positive semidefiite, ad we just saw that all covariace matrices are positive semidefiite The coverse is also true, but harder to prove Propositio 4 Let Q be a symmetric m m matrix The Q is the covariace of a radom vector X if ad oly if Q is positive semidefiite Also, a symmetric m m matrix Q is positive semidefiite if ad oly if all of eigevalues are 0 We say that Q is positive defiite if x Qx > 0 for all x Propositio 5 A symmetric m m matrix Q is positive defiite if ad oly if all of its eigevalues are > 0 I particular, such a matrix is ivertible Defiitio 6 Give a m-dimesioal vector µ ad a m m-dimesioal covariace matrix Q, the multivariate ormal pdf with mea vector µ ad covariace matrix Q is f(x) := exp (2π) m/2 det Q /2 ( ) 2 (x µ) Q (x µ) This is a proper pdf o R m We say that X N m (µ, Q), whe X = (X,, X m ), ad the joit pdf of (X,, X m ) is f The precedig fuctio f is ideed a proper joit desity i the sese that f(x) 0 for all x := (x,, x m ) ad So it makes sese to say X N m (µ, Q) f(x,, x m ) dx dx m = 4

5 Propositio 7 If X N m (µ, Q) for some vector µ := (µ,, µ m ) ad matrix Q, the µ = E(X), ad Cov(X) = Q Thus, we refer to µ ad Q respectively as the mea vector ad covariace matrix of the N m (µ, Q) distributio The followig states that a radom vector is distributed as a multivariate ormal if ad oly if all of possible [oradom] liear combiatios of its compoets are distributed as [uivariate] ormals Propositio 8 X N m (µ, Q) if ad oly if t X N(t µ, t Qt) for all oradom m-vectors t := (t,, t m ) I other words, the precedig states that the property about the liear combiatios of the compoets characterizes completely the multivariate ormal distributio A advatage of the precedig characterizatio is that the osigularity of Q is ot metioed aywhere explicitly Therefore, we ca defie the followig slight geeralizatio of multivariate ormals to the case that Q is positive semidefiite [as opposed to beig positive defiite]: Defiitio 9 Let X be a radom m-vector We say that X N m (µ, Q), for some µ = (µ,, µ m ) ad some m m symmetric positive semidefiite matrix Q, whe t X N(t µ, t Qt) for all t := (t,, t m ) 3 Cetral limit theorems Choose ad fix a probability vector θ, ad assume it is the true oe so that we write P, E, etc i place of the more cumbersome P θ, E θ, etc We ca use () to say a few more thigs about the distributio of N Note first that () is equivalet to the followig: N = I {X j = t } I {X j = t m } j= We ca compute directly to fid that := Y j (9) j= EY = θ, (0) ad θ ( θ ) θ θ 2 θ θ m θ θ 2 θ 2 ( θ 2 ) θ 2 θ m CovY = θ m θ θ m θ 2 θ m ( θ m ) := Q 0 () 5

6 Because Y,, Y are iid radom vectors, the followig is a cosequece of the precedig, cosidered i cojuctio with the multidimesioal CLT Theorem 0 As, N θ d Nm (0, Q 0 ) (2) There is a variat of Theorem 0 which is more useful to us Defie (N θ )/ θ Ñ := (N m θ m )/ (3) θ m This too is a sum of iid radom vectors Ỹ,, Ỹ, ad has EỸ := 0 ad CovỸ = Q, (4) where Q is the (m m)-dimesioal covariace matrix give below: θ θ θ 2 θ θ m θ θ 2 θ 2 θ 2 θ m Q = θ m θ (5) θ m θ 2 θ m The we obtai, Theorem As, Ñ d N m (0, Q) (6) Remark 2 Oe should be a little careful at this stage because Q is sigular [For example, cosider the case m = 2] However, N m (0, Q) makes sese eve i this case The reaso is this: If C is a o sigular covariace matrix, the we kow what it meas for X N m (0, C), ad a direct computatio reveals that for all real umbers t ad all m-vectors ξ, [ ( )] E [exp (itξ X)] = E exp t2 2 ξ Cξ (7) This defies the characteristic fuctio of the radom variable ξ X uiquely There is a theorem (Ma Wold device) that says that a uique descriptio of the characteristic fuctio of ξ X for all ξ also describes the distributio of X uiquely This meas that ay radom vector X that satisfies (7) for all t R ad ξ R m is N m (0, C) Moreover, this approach does ot require C to be ivertible To lear more about multivariate ormals, you should atted Math 600 6

7 As a cosequece of Theorem we have the followig: As, Ñ 2 d Nm (0, Q) 2 (8) [Recall that x 2 = x x 2 m] Next, we idetify the distributio of N m (0, Q) 2 A direct (ad somewhat messy) calculatio shows that Q is a projectio matrix That is, Q 2 := QQ = Q It follows fairly readily from this that its eigevalues are all zeros ad oes Here is the proof: Let λ be a eigevalue ad x a eigevector Without loss of geerality, we may assume that x = ; else, cosider the eigevector y := x/ x istead This meas that Qx = λx Therefore, Q 2 x = λ 2 x Because Q 2 = Q, we have prove that λ = λ 2, whece it follows that λ {0, }, as promised Write Q i its spectral form That is, λ 0 0 Q = P 0 λ 2 0 P, (9) 0 0 λ m where P is a orthogoal (m m) matrix [This is because Q is positive semidefiite You ca lear more o this i Math 600] Defie λ λ2 0 Λ := P (20) 0 0 λm The, Q = Λ Λ, which is why Λ is called the square root of Q Let Z be a m- vector of iid stadard ormals ad cosider ΛZ This is multivariate ormal (check characteristic fuctios), E[ΛZ] = 0, ad Cov(ΛZ) = Λ Cov(Z)Λ = Λ Λ = Q I other words, we ca view N m (0, Q) as ΛZ Therefore, m N m (0, Q) 2 = d ΛZ 2 = λ j Zi 2 (2) Because λ i {0, }, we have proved that N m (0, Q) χ 2 r, where r = rak(q) Aother messy but elemetary computatio shows that the rak of Q is m [Roughly speakig, this is because the θ i s have oly oe source of liear depedece Namely, that θ + + θ m = ] This proves that Ñ 2 d χ 2 m (22) Write out the left-had side explicitly to fid the followig: Theorem 3 As, m (N i θ i ) 2 θ i d χ 2 m (23) It took a while to get here, but we will soo be rewarded for our effort 7

8 4 Applicatio to goodess-of-fit 4 Testig for a fiite distributio I light of the developmet of Sectio, ca paraphrase Theorem 3 as sayig that uder P θ, as, X (θ) d χ 2 m, where X (θ) := ) 2 m (ˆθi θ i (24) θ i Therefore, we are i a positio to derive a asymptotic ( α)-level cofidece set for θ Defie for all a > 0, { C (a) := θ Θ : X (θ) a } (25) Equatio (24) shows that for all θ Θ, lim P θ {θ C (a)} = P { χ 2 m a } (26) We ca fid (eg, from χ 2 -tables) a umber χ 2 (α) such that It follows that P { χ 2 m > χ 2 m (α) } = α (27) C ( χ 2 m (α) ) is a asymptotic level ( α) cofidece iterval for θ (28) The statistics X (θ) is a little awkward to work with, ad this makes the computatio of C (χ 2 m (α)) difficult Nevertheless, it is easy eough to use C (χ 2 m (α)) to test simple hypotheses for θ Suppose we are iterested i the test H 0 : θ = θ 0 versus H : θ θ 0, (29) where θ 0 = (θ 0,, θ 0m ) is a (kow) probability vector i Θ The we ca reject H 0 (at asymptotic level ( α)) if θ 0 C (χ 2 m (α)) I other words, we reject H 0 if X (θ 0 ) > χ2 m (α) (30) Because X (θ 0 ) is easy to compute, (29) ca be carried out with relative ease Moreover, X (θ 0 ) has the followig iterpretatio: Uder H 0, X (θ 0 ) = m (Obs i Exp i ) 2, (3) Exp i The statistic X (θ) was discovered by K Pearso i 900, ad is therefore called the Pearso χ 2 statistic Icidetally, the term stadard deviatio is also due to Karl Pearso (893) 8

9 where Obs i deotes the umber of observatios i the sample that are of type i, ad Exp i := θ 0i is the expected umber uder H 0 of sample poits that are of type i Next we study two examples Example 4 A certai populatio cotais oly the digits 0,, ad 2 We wish to kow if every digit is equally likely to occur i this populatio That is, we wish to test H 0 : θ = ( 3, 3, 3 ) versus H : θ ( 3, 3, 3 ) (32) To this ed, suppose we make, 000 idepedet draws from this populatio, ad fid that we have draw 300 zeros (ˆθ = 03), 340 oes (ˆθ 2 = 034), ad 360 twos (ˆθ 3 = 036) That is, Types Obs Exp (Obs Exp) 2 /Exp / / / Total (approx) Suppose we wish to test (32) at the traditioal (asymptotic) level 95% Here, m = 3 ad α = 005, ad we fid from chi-square tables that χ 2 2(005) 5999 Therefore, accordig to (30), we reject if ad oly if X (θ 0 ) is greater tha χ 2 2(005)/ Thus, we do ot reject the ull hypothesis at asymptotic level 95% O the other had, χ 2 2(0) 4605, ad X (θ 0 ) is greater tha χ 2 2(0)/ Thus, had we performed the test at asymptotic level 90% we would ideed reject H 0 I fact, this discussio has show that the asymptotic P -value of our test is somewhere betwee 005 ad 0 It is more importat to report/estimate P -values tha to make blaket declaratios such as accept H 0 or reject H 0 Cosult your Math text(s) To illustrate this example further, suppose the table of observed vs expected remais the same but = 2000 The, X (θ 0 ) is a good deal greater tha χ 2 2(005)/ ad we reject H 0 soudly at the asymptotic level 95% I fact, i this case the P -value is somewhere betwee 0005 ad (check this!) 42 Testig for a Desity Suppose we wish to kow if a certai data-set comes from the expoetial(2) distributio (say!) So let X,, X be a idepedet sample We have H 0 : X expoetial(2) versus H : X expoetial(2) (33) 9

10 Now take a fie mesh 0 < z < z 2 < < z m < of real umbers The, P H0 {z j X z j+ } = 2 zj+ z j e 2r dr = e 2zj e 2zj+ (34) These are θ 0j s, ad we ca test to see whether or ot θ = θ 0, where θ j is the true probability P{z j X z j+ } for all j =,, m The χ 2 -test of the previous sectio hadles just such a situatio That is, let ˆθ j deote the proportio of the data X,, X that lies i [z j, z j+ ] The (30) provides us with a asymptotic ( α)-level test for H 0 that the data is expoetial(2) Other distributios are hadled similarly, but θ 0j is i geeral z j+ z j f 0 (x) dx, where f 0 is the desity fuctio uder H 0 Example 5 Suppose we wish to test (33) Let m = 5, ad defie z j = (j )/0 for j =,, 5 The, { j θ 0j = P H0 0 X j } ( ) ( 6(j ) = exp exp 6j ) (35) That is, j θ 0j Cosider the followig (hypothetical) data: = 200, ad j θ 0j ˆθj The, we form the χ 2 -table, as before, ad obtai Types Exp Obs (Obs Exp) 2 /Exp Total (approx) 0

11 Thus, X (θ 0 ) You should check that 005 P -value 0 I fact, the P -value is much closer to 0 So, at the asymptotic level 95%, we do ot have eough evidece agaist H 0 Some thigs to thik about: I the totals for Exp, you will see that the total is ad ot Ca you thik of why this has happeed? Does this pheomeo reder our coclusios false (or weaker tha they may appear)? Do we eed to fix this by icludig more types? How ca we address it i a sesible way? Would addressig this chage the fial coclusio?

Chi-Squared Tests Math 6070, Spring 2006

Chi-Squared Tests Math 6070, Spring 2006 Chi-Squared Tests Math 6070, Sprig 2006 Davar Khoshevisa Uiversity of Utah February XXX, 2006 Cotets MLE for Goodess-of Fit 2 2 The Multiomial Distributio 3 3 Applicatio to Goodess-of-Fit 6 3 Testig for

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15 17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig

More information

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2 82 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Defiitio: et X be a radom sample with joit p.m/d.f. f X x θ. The geeralised likelihood ratio test g.l.r.t. of the NH : θ H 0 agaist the alterative AH : θ H 1,

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22 CS 70 Discrete Mathematics for CS Sprig 2007 Luca Trevisa Lecture 22 Aother Importat Distributio The Geometric Distributio Questio: A biased coi with Heads probability p is tossed repeatedly util the first

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Convergence of random variables. (telegram style notes) P.J.C. Spreij Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Lecture 33: Bootstrap

Lecture 33: Bootstrap Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Last Lecture. Wald Test

Last Lecture. Wald Test Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig

More information

Lecture 7: Properties of Random Samples

Lecture 7: Properties of Random Samples Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Distribution of Random Samples & Limit theorems

Distribution of Random Samples & Limit theorems STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to

More information

Simulation. Two Rule For Inverting A Distribution Function

Simulation. Two Rule For Inverting A Distribution Function Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M. MATH1005 Statistics Lecture 24 M. Stewart School of Mathematics ad Statistics Uiversity of Sydey Outlie Cofidece itervals summary Coservative ad approximate cofidece itervals for a biomial p The aïve iterval

More information

Summary. Recap ... Last Lecture. Summary. Theorem

Summary. Recap ... Last Lecture. Summary. Theorem Last Lecture Biostatistics 602 - Statistical Iferece Lecture 23 Hyu Mi Kag April 11th, 2013 What is p-value? What is the advatage of p-value compared to hypothesis testig procedure with size α? How ca

More information

STATISTICAL INFERENCE

STATISTICAL INFERENCE STATISTICAL INFERENCE POPULATION AND SAMPLE Populatio = all elemets of iterest Characterized by a distributio F with some parameter θ Sample = the data X 1,..., X, selected subset of the populatio = sample

More information

Kurskod: TAMS11 Provkod: TENB 21 March 2015, 14:00-18:00. English Version (no Swedish Version)

Kurskod: TAMS11 Provkod: TENB 21 March 2015, 14:00-18:00. English Version (no Swedish Version) Kurskod: TAMS Provkod: TENB 2 March 205, 4:00-8:00 Examier: Xiagfeg Yag (Tel: 070 2234765). Please aswer i ENGLISH if you ca. a. You are allowed to use: a calculator; formel -och tabellsamlig i matematisk

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

Estimation of the Mean and the ACVF

Estimation of the Mean and the ACVF Chapter 5 Estimatio of the Mea ad the ACVF A statioary process {X t } is characterized by its mea ad its autocovariace fuctio γ ), ad so by the autocorrelatio fuctio ρ ) I this chapter we preset the estimators

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2. SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the

More information

Discrete Mathematics for CS Spring 2005 Clancy/Wagner Notes 21. Some Important Distributions

Discrete Mathematics for CS Spring 2005 Clancy/Wagner Notes 21. Some Important Distributions CS 70 Discrete Mathematics for CS Sprig 2005 Clacy/Wager Notes 21 Some Importat Distributios Questio: A biased coi with Heads probability p is tossed repeatedly util the first Head appears. What is the

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Probability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].

Probability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)]. Probability 2 - Notes 0 Some Useful Iequalities. Lemma. If X is a radom variable ad g(x 0 for all x i the support of f X, the P(g(X E[g(X]. Proof. (cotiuous case P(g(X Corollaries x:g(x f X (xdx x:g(x

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

Binomial Distribution

Binomial Distribution 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible

More information

Lecture 20: Multivariate convergence and the Central Limit Theorem

Lecture 20: Multivariate convergence and the Central Limit Theorem Lecture 20: Multivariate covergece ad the Cetral Limit Theorem Covergece i distributio for radom vectors Let Z,Z 1,Z 2,... be radom vectors o R k. If the cdf of Z is cotiuous, the we ca defie covergece

More information

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the

More information

TAMS24: Notations and Formulas

TAMS24: Notations and Formulas TAMS4: Notatios ad Formulas Basic otatios ad defiitios X: radom variable stokastiska variabel Mea Vätevärde: µ = X = by Xiagfeg Yag kpx k, if X is discrete, xf Xxdx, if X is cotiuous Variace Varias: =

More information

Lecture 11 and 12: Basic estimation theory

Lecture 11 and 12: Basic estimation theory Lecture ad 2: Basic estimatio theory Sprig 202 - EE 94 Networked estimatio ad cotrol Prof. Kha March 2 202 I. MAXIMUM-LIKELIHOOD ESTIMATORS The maximum likelihood priciple is deceptively simple. Louis

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

Introductory statistics

Introductory statistics CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Lecture Note 8 Point Estimators and Point Estimation Methods. MIT Spring 2006 Herman Bennett

Lecture Note 8 Point Estimators and Point Estimation Methods. MIT Spring 2006 Herman Bennett Lecture Note 8 Poit Estimators ad Poit Estimatio Methods MIT 14.30 Sprig 2006 Herma Beett Give a parameter with ukow value, the goal of poit estimatio is to use a sample to compute a umber that represets

More information

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman: Math 224 Fall 2017 Homework 4 Drew Armstrog Problems from 9th editio of Probability ad Statistical Iferece by Hogg, Tais ad Zimmerma: Sectio 2.3, Exercises 16(a,d),18. Sectio 2.4, Exercises 13, 14. Sectio

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

32 estimating the cumulative distribution function

32 estimating the cumulative distribution function 32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

Lecture 19: Convergence

Lecture 19: Convergence Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may

More information

This section is optional.

This section is optional. 4 Momet Geeratig Fuctios* This sectio is optioal. The momet geeratig fuctio g : R R of a radom variable X is defied as g(t) = E[e tx ]. Propositio 1. We have g () (0) = E[X ] for = 1, 2,... Proof. Therefore

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

1.010 Uncertainty in Engineering Fall 2008

1.010 Uncertainty in Engineering Fall 2008 MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if LECTURE 14 NOTES 1. Asymptotic power of tests. Defiitio 1.1. A sequece of -level tests {ϕ x)} is cosistet if β θ) := E θ [ ϕ x) ] 1 as, for ay θ Θ 1. Just like cosistecy of a sequece of estimators, Defiitio

More information

Probability and statistics: basic terms

Probability and statistics: basic terms Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

STA Object Data Analysis - A List of Projects. January 18, 2018

STA Object Data Analysis - A List of Projects. January 18, 2018 STA 6557 Jauary 8, 208 Object Data Aalysis - A List of Projects. Schoeberg Mea glaucomatous shape chages of the Optic Nerve Head regio i aimal models 2. Aalysis of VW- Kedall ati-mea shapes with a applicatio

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

Section 14. Simple linear regression.

Section 14. Simple linear regression. Sectio 14 Simple liear regressio. Let us look at the cigarette dataset from [1] (available to dowload from joural s website) ad []. The cigarette dataset cotais measuremets of tar, icotie, weight ad carbo

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Random Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices

Random Matrices with Blocks of Intermediate Scale Strongly Correlated Band Matrices Radom Matrices with Blocks of Itermediate Scale Strogly Correlated Bad Matrices Jiayi Tog Advisor: Dr. Todd Kemp May 30, 07 Departmet of Mathematics Uiversity of Califoria, Sa Diego Cotets Itroductio Notatio

More information

11 THE GMM ESTIMATION

11 THE GMM ESTIMATION Cotets THE GMM ESTIMATION 2. Cosistecy ad Asymptotic Normality..................... 3.2 Regularity Coditios ad Idetificatio..................... 4.3 The GMM Iterpretatio of the OLS Estimatio.................

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

Lecture 7: Density Estimation: k-nearest Neighbor and Basis Approach

Lecture 7: Density Estimation: k-nearest Neighbor and Basis Approach STAT 425: Itroductio to Noparametric Statistics Witer 28 Lecture 7: Desity Estimatio: k-nearest Neighbor ad Basis Approach Istructor: Ye-Chi Che Referece: Sectio 8.4 of All of Noparametric Statistics.

More information

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences. Norwegia Uiversity of Sciece ad Techology Departmet of Mathematical Scieces Corrected 3 May ad 4 Jue Solutios TMA445 Statistics Saturday 6 May 9: 3: Problem Sow desity a The probability is.9.5 6x x dx

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014. Product measures, Toelli s ad Fubii s theorems For use i MAT3400/4400, autum 2014 Nadia S. Larse Versio of 13 October 2014. 1. Costructio of the product measure The purpose of these otes is to preset the

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Basis for simulation techniques

Basis for simulation techniques Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

Exponential Families and Bayesian Inference

Exponential Families and Bayesian Inference Computer Visio Expoetial Families ad Bayesia Iferece Lecture Expoetial Families A expoetial family of distributios is a d-parameter family f(x; havig the followig form: f(x; = h(xe g(t T (x B(, (. where

More information

Confidence Intervals for the Population Proportion p

Confidence Intervals for the Population Proportion p Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

Statistical Inference Based on Extremum Estimators

Statistical Inference Based on Extremum Estimators T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information