Section 14. Simple linear regression.

Size: px
Start display at page:

Download "Section 14. Simple linear regression."

Transcription

1 Sectio 14 Simple liear regressio. Let us look at the cigarette dataset from [1] (available to dowload from joural s website) ad []. The cigarette dataset cotais measuremets of tar, icotie, weight ad carbo mooide (CO) cotet for 5 brads of domestic cigarettes. We are goig to try to predict CO as a fuctio of tar ad icotie cotet. To visualize the data let us plot each of these variable agaist others, see figure Sice the variables seem to have a liear relatioship we fit a least-squares lie, which we will eplai below, to fit the data usig Matlab tool polytool. For eample, if our vectors are ic for icotie, tar for tar ad carb for CO the, for eample, usig polytool(ic,carb,1) will produce figure 14.1 (a), etc. We ca also perform statistical aalysis of these fits, i a sese that will gradually be eplaied below, usig Matlab regress fuctio. For carbo mooide vs. tar: [b,bit,r,rit,stats]=regress(carb,[oes(5,1),tar]); b =.7433 bit = stats = , for carbo mooide vs. icotie [b,bit,r,rit,stats]=regress(carb,[oes(5,1),ic]); b = bit = stats =

2 35 Carbo mooide Carbo mooide frag replacemets 5 Nicotie PSfrag replacemets Tar Nicotie frag replacemets 0 5 Tar Figure 14.1: Least-squares lie (solid lie). (a) Carbo mooide cotet (mg.) vs. icotie cotet (mg.). (b) Carbo mooide vs. tar cotet. (c) Tar cotet vs. icotie cotet. ad for icotie vs. tar [b,bit,r,rit,stats]=regress(tar,[oes(5,1),ic]); b = bit = stats = The output of regress gives a vector b of parameters of a fitted least-squares lie, 95% cofidece itervals bit for these parameters, ad stats cotais i order: R statistic, F statistic, p-value of F statistic, MLE πˆ of the error variace. All of these will be eplaied below. 9

3 Simple liear regressio model. Suppose that we have a pair of variables (X, Y ) ad a variable Y is a liear fuctio of X plus radom oise: Y = f(x) + χ = β 0 + β 1 X + χ, where a radom oise χ is assumed to have ormal distributio N(0, π ). A variable X is called a predictor variable, Y - a respose variable ad a fuctio f() = β 0 + β 1 - a liear regressio fuctio. Suppose that we are give a sequece of pairs (X 1, Y 1 ),..., (X, Y ) that are described by the above model: Y i = β 0 + β 1 X i + χ i ad χ 1,..., χ are i.i.d. N(0, π ). We have three ukow parameters - β 0, β 1 ad π - ad we wat to estimate them usig a give sample. The poits X 1,..., X ca be either radom or o radom, but from the poit of view of estimatig liear regressio fuctio the ature of Xs is i some sese irrelevat so we will thik of them as fied ad o radom ad assume that the radomess comes from the oise variables χ i. For a fied X i, the distributio of Y i is equal to N(f(X i ), π ) with p.d.f. 1 (y f (X i )) απ e σ ad the likelihood fuctio of the sequece Y 1,..., Y is: 1 ) 1 P e σ (Yi f(x i )) 1 ) = e 1 P i=1 σ απ απ i=1 (Y i β 0 β 1 X i ). Let us fid the maimum likelihood estimates of β 0, β 1 ad π that maimize this likelihood fuctio. First of all, it is obvious that for ay π we eed to miimize L := (Y i β 0 β 1 X i ) i=1 over β 0, β 1. The lie that miimizes the sum of squares L is called the least-squares lie. To fid the critical poits we write: If we itroduce the otatios L = (Y i (β 0 + β 1 X i )) = 0 β 0 i=1 L β 1 = (Y i (β 0 + β 1 X i ))X i = 0 i= X X 1 X = X i, Y = Y i, = i, XY = X iy i the the critical poit coditios ca be rewritte as X β 0 + β 1 X = Y ad β 0 X + β 1 = XY. 93

4 Solvig for β 0 ad β 1 we get the MLE X βˆ = X ad ˆ = XY Y 0 Y βˆ1 β 1. X X These estimates are used to plot least-squares regressio lies i figure Fially, to fid the MLE of π we maimize the likelihood over π ad get: πˆ = 1 (Y i βˆ0 βˆ1x i ). i=1 The differeces r i = Y i Ŷ i betwee observed respose variables Y i ad the values predicted by the estimated regressio lie Ŷ i = βˆ0 + βˆ1x i are called the residuals. The R statistic i the eamples above is defied as R i=1 = 1 (Y i Ŷ i ). i=1 (Y i Ȳ ) The umerator i the last sum is the sum of squares of the residuals ad the umerator is the variace of Y ad R is usually iterpreted as the proportio of variability i the data eplaied by the liear model. The higher R the better our model eplais the data. Net, we would like to do statistical iferece about the liear model. 1. Costruct cofidece itervals for parameters of the model β 0, β 1 ad π.. Costruct predictio itervals for Y give ay poit X (dotted lies i figure 14.1). 3. Test hypotheses about parameters of the model. For eample, F -statistic i the output of Matlab fuctio regress comes from a test of the hypothesis H 0 : β 0 = 0, β 1 = 0 that the respose Y is ot correlated with a predictor variable X. I spirit all these problems are similar to statistical iferece about parameters of ormal distributio such as t-tests, F -tests, etc. so as a startig poit we eed to fid a joit distributio of the estimates βˆ0, βˆ1 ad ˆπ. To compute the joit distributio of βˆ0 ad βˆ1 is very easy because they are liear combiatios of Y i s which have ormal distributios ad, as a result, βˆ0 ad βˆ1 will have ormal distributios. All we eed to do is fid their meas, variaces ad covariace, which is a straightforward computatio. However, we will obtai this as a part of a more geeral computatio that will also give us joit distributio of all three estimates βˆ0, βˆ1 ad ˆπ. Let us deote the sample variace of Xs by The we will prove the followig: π X = X. 94

5 1. ) π 1 X ) ) π ) βˆ1 N β 1,, βˆ0 N β π X 0, + = N β 0,, π π π Xπ Cov( βˆ0, βˆ1) =. π. πˆ is idepedet of βˆ0 ad βˆ1. 3. πˆ has π distributio with degrees of freedom. Remark. Lie 1 meas that ( βˆ0, βˆ1) have joitly ormal distributio with mea (β 0, β 1 ) ad covariace matri ( ) π Σ = X X π 1. X Proof. Let us cosider two vectors 1 1 ) a 1 = (a 11,..., a 1 ) =,..., ad X i X a = (a 1,..., a ) where a i =. π It is easy to check that both vectors have legth 1 ad they are orthogoal to each other sice their scalar product is 1 X i X a 1 a = a 1i a i = π i=1 i=1 = 0. Let us choose vectors a 3,..., a so that a 1,..., a is orthoormal basis ad, as a result, the matri a11 a 1 a 1 a A =... a 1 a is orthogoal. Let us cosider vectors Y = (Y 1,..., Y ), µ = EY = (EY 1,..., EY ) ad ) Y = (Y 1,..., Y ) = Y µ Y1 EY 1 =,..., Y EY π π π so that the radom variables Y 1,..., Y are i.i.d. stadard ormal. We proved before that if we cosider a orthogoal trasformatio of i.i.d. stadard ormal sequece: Z = (Z 1,..., Z ) = Y A 95

6 the Z 1,..., Z will also be i.i.d. stadard ormal. Sice Y µ ) Z = Y A = A = Y A µa π π this implies that Y A = πz + µa. Let us defie a vector Z = (Z 1,..., Z ) = Y A = πz + µa. Each Z i is a liear combiatio of Y i s ad, therefore, it has a ormal distributio. Sice we made a specific choice of the first two colums of the matri A we ca write dow eplicitly the first two coordiates Z 1 ad Z of vector Z. We have, Z 1 = a i1 Y i = 1 Yi = Y = (βˆ0 + βˆ1x) i=1 i=1 ad the secod coordiate Z = (Xi X)Y i a i Y i = π i=1 i=1 = (X π i X)Y i = π βˆ1. π i=1 Solvig these two equatios for βˆ0 ad βˆ1 we ca epress them i terms of Z 1 ad Z as βˆ1 = 1 Z ad βˆ0 = 1 Z X 1 Z. π π This easily implies claim 1. Net we will show how ˆπ ca also be epressed i terms of Z i s. πˆ = (Yi βˆ0 βˆ1x i ) = ) (Y i Ȳ ) βˆ1(x i X ) {sice βˆ0 = Ȳ βˆ1x } i=1 i=1 = (Yi Ȳ ) i=1 βˆ1π (Y i Ȳ )(X i X ) π +βˆ1 (X i X ) i=1 {{ i=1 = (Yi Ȳ ) βˆ1 π = Y i (Ȳ ) βˆ1 π {{ {{ i=1 i=1 βˆ1 Z Z 1 = Yi Z 1 Z = Z i Z 1 Z = Z Z. i=1 i=1 I the last lie we used the fact that Z = Y A is a orthogoal trasformatio of Y ad sice orthogoal trasformatio preserves the legth of a vector we have, Z i = Y i. i=1 i=1 96

7 If we ca show that Z 3,..., Z are i.i.d. with distributio N(0, π ) the πˆ Z ) 3 Z ) = π π π has -distributio with degrees of freedom, because Z i /π N(0, 1). Sice we showed above that Z = µa + πz Z i = (µa) i + πz i, the fact that Z 1,..., Z are i.i.d. stadard ormal implies that Z i s are idepedet of each other ad Z i N((µA) i, π ). Let us compute the mea EZ i = (µa) i : (µa) i = EZ i = E a ji Y j = a ji EY j = a ji (β 0 + β 1 X j ) j=1 j=1 j=1 = a ji (β 0 + β 1 X + β 1 (X j X)) j=1 = (β0 + β 1 X) a ji + β 1 a ji (X j X). j=1 j=1 Sice the matri A is orthogoal its colums are orthogoal to each other. Let a i = (a 1i,..., a i ) be the vector i the ith colum ad let us cosider i 3. The the fact that a i is orthogoal to the first colum gives 1 a i a 1 = a j1 a ji = a ji = 0 j=1 j=1 ad the fact that a i is orthogoal to the secod colum gives a i a = 1 π j=1 (X j X)a ji = 0. This show that for i 3 a ji = 0 ad j=1 j=1 a ji (X j X) = 0 ad this proves that EZ i = 0 for i 3 ad Z i N(0, π ) for i 3. As we metioed above this also proves that πˆ/π. Fially, πˆ is idepedet of βˆ0 ad βˆ1 because πˆ ca be writte as a fuctio of Z 3,..., Z ad βˆ0 ad βˆ1 ca be writte as fuctios of Z 1 ad Z. Statistical iferece i simple liear regressio. Suppose ow that we wat to fid the cofidece itervals for ukow parameters of the model β 0, β 1 ad π. This is 97

8 straightforward ad very similar to the cofidece itervals for parameters of ormal distributio. For eample, usig that πˆ/π, if we fid the costats c 1 ad c such that (0, c 1 ) = 1 α ad (c, + ) = 1 α the with probability α we have c 1 πˆ/π c. Solvig this for π we fid the α cofidece iterval: πˆ πˆ π. Similarly, we fid the α cofidece iterval for β 1. Sice / π πˆ (βˆ1 β 1 ) π N(0, 1) ad π the π / 1 πˆ (βˆ1 β 1 ) t π π has Studet t -distributio with degrees of freedom. Simplifyig, we get ( )π (βˆ1 β 1 ) t. (14.0.1) πˆ Therefore, if we fid c such that t ( c, c) = α the with probability α: ( )π c (βˆ1 β 1 ) c πˆ ad solvig for β 1 we obtai the α cofidece iterval: βˆ1 c ad α cofidece iterval for β 0 is: πˆ X ) πˆ X ) βˆ0 c 1 + β 0 βˆ0 + c 1 +. π π c c 1 πˆ πˆ. ( )π β 1 βˆ1 + c ( )π Similarly, to fid the cofidece iterval for β 0 we use that βˆ0 β / 0 1 πˆ / πˆ ) π = ( βˆ0 β 0 ) 1 + X π 1 + X π σ ) t (14.0.) We ca ow costruct various t-tests based o t-statistics (14.0.1) ad (14.0.). 98

9 Liear combiatios of parameters. More geerally, let us compute the distributio of a liear combiatio c 0 βˆ0 + c 1 βˆ1 of the estimates. This will allow us to costruct cofidece itervals ad t-tests for liear combiatios of parameters c 0 β 0 + c 1 β 1. Clear, the distributio of this liear combiatio will be ormal with mea E ( c βˆ + c β ˆ ) = c β + c β. We compute its variace: Var(c 0 βˆ0 + c 1 βˆ1) = E(c 0 βˆ0 + c 1 βˆ1 c 0 β 0 c 1 β 1 ) = E(c 0 (βˆ0 β 0 ) + c 1 (βˆ1 β 1 )) ˆ ˆ = c E(β β ) + E(β β ) + E(βˆ β )(βˆ c1 1 1 c0c β 1) {{ {{ {{ variace of βˆ variace of ˆ 1 β0 covariace ) 1 X π Xπ = c0 + π + c1 c 0 c 1 π π π c0 (c 0 X c 1 ) ) = π +. π This proves that βˆ0 + c 1 βˆ1 N c 0 β 0 + c 1 β 1, π c0 + (c 0X c 1 ) )) c 0. (14.0.3) π Usig (c 0, c 1 ) = (1, 0) or (0, 1), will give the distributios of βˆ0 ad βˆ1. Predictio Itervals. Suppose ow that we have a ew observatio X for which Y is ukow ad we wat to predict Y or fid the cofidece iterval for Y. Accordig to simple regressio model, Y = β 0 + β 1 X + χ ad it is atural to take Ŷ = βˆ0 + βˆ1x as the predictio of Y. Let us fid the distributio of their differece Ŷ Y. Clearly, the differece will have ormal distributio so we oly eed to compute the mea ad the variace. The mea is E(Ŷ Y ) = Eβˆ0 + Eβˆ1X β 0 β 1 X Eχ = β 0 + β 1 X β 0 β 1 X 0 = 0. Sice a ew pair (X, Y ) is idepedet of the prior data we have that Y is idepedet of Ŷ. Therefore, sice the variace of the sum or differece of idepedet radom variables is equal to the sum of their variaces, we get Var( Ŷ Y ) = Var( Ŷ ) + Var(Y ) = π + Var( Ŷ ), where we also used that Var(Y ) = Var(χ) = π. To compute the variace of Ŷ we ca use the formula above with (c 0, c 1 ) = (1, X) 1 (X Var( Ŷ ) = Var( βˆ0 + Xβˆ1) = π X) ) +. π 99

10 Therefore, we showed that As a result, we have: ˆ 0, π (X X) )) Y Y N. π Ŷ Y / 1 πˆ ) (X X) π π σ π t ad the 1 α predictio iterval for Y is π Ŷ c (X X) ) π Y Ŷ + c (X These are the dashed curves created by Matlab polytool fuctio. π X) ). Simultaeous cofidece set for (β 0, β 1 ) ad F -test. We will ow costruct a statistic that will allow us to give a cofidece set for both parameters β 0, β 1 at the same time ad test the hypothesis of the type H 0 : β 0 = 0 ad β 1 = 0. (14.0.4) The values (0, 0) could be replaced by ay other predetermied values. Lookig at the proof of the joit distributio of the estimates, as a itermediate step we showed that estimates βˆ0 ad βˆ1 ca be related to Z 1 = (βˆ0 + βˆ1x) ad Z = π ˆ β 1 where ormal radom variables Z 1, Z are idepedet of each other ad idepedet of πˆ π. Also, Z 1 ad Z have variace π. Stadardizig these radom variables we get π A = ((βˆ0 β 0 ) + ( βˆ1 β 1 )X ) N(0, 1) ad B = (βˆ1 β 1 ) N(0, 1) π π which implies that A + B -distributio. By defiitio of F -distributio, / (A πˆ + B ) F,. π Simplifyig the left-had side we get F := ) (βˆ0 β 0 ) + X (βˆ1 β 1 ) + X (βˆ0 β 0 )(βˆ1 β 1 ) F,. ˆπ 100

11 This allows us to obtai a joit cofidece set (ellipse) for parameters β 0, β 1. Give a cofidece level α [0, 1] is we defie a threshold c by F, (0, c) = α the with probability α we have F := ) (βˆ0 β 0 ) + X (βˆ1 β 1 ) + X (βˆ0 β 0 )(βˆ1 β 1 ) c. ˆπ This iequality defies a ellipse for (β 0, β 1 ). To test the hypothesis (14.0.4), we use the fact that uder H 0 the statistic ad defie a decisio rule by F := (βˆ0 + X βˆ1 + X βˆ0βˆ1) F, ˆπ { H β = 0 : F c H 1 : F > c, where F, (c, ) = α - a level of sigificace. F -statistic output by Matlab regress fuctio will be eplaied i the et sectio. Refereces. [1] Usig Cigarette Data for A Itroductio to Multiple Regressio. by Laure McItyre, Joural of Statistics Educatio v.,.1 (1994). [] Medehall, W., ad Sicich, T. (199), Statistics for Egieerig ad the Scieces (3rd ed.), New York: Delle Publishig Co. 101

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments: Recall: STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Commets:. So far we have estimates of the parameters! 0 ad!, but have o idea how good these estimates are. Assumptio: E(Y x)! 0 +! x (liear coditioal

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Simple Linear Regression

Simple Linear Regression Simple Liear Regressio 1. Model ad Parameter Estimatio (a) Suppose our data cosist of a collectio of pairs (x i, y i ), where x i is a observed value of variable X ad y i is the correspodig observatio

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2. SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample

More information

Linear Regression Models

Linear Regression Models Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences. Norwegia Uiversity of Sciece ad Techology Departmet of Mathematical Scieces Corrected 3 May ad 4 Jue Solutios TMA445 Statistics Saturday 6 May 9: 3: Problem Sow desity a The probability is.9.5 6x x dx

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes. Term Test October 3, 003 Name Math 56 Studet Number Directio: This test is worth 50 poits. You are required to complete this test withi 50 miutes. I order to receive full credit, aswer each problem completely

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab Sectio 12 Tests of idepedece ad homogeeity I this lecture we will cosider a situatio whe our observatios are classified by two differet features ad we would like to test if these features are idepedet

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

1.010 Uncertainty in Engineering Fall 2008

1.010 Uncertainty in Engineering Fall 2008 MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5 CS434a/54a: Patter Recogitio Prof. Olga Veksler Lecture 5 Today Itroductio to parameter estimatio Two methods for parameter estimatio Maimum Likelihood Estimatio Bayesia Estimatio Itroducto Bayesia Decisio

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 6 9/23/2013. Brownian motion. Introduction

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 6 9/23/2013. Brownian motion. Introduction MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/5.070J Fall 203 Lecture 6 9/23/203 Browia motio. Itroductio Cotet.. A heuristic costructio of a Browia motio from a radom walk. 2. Defiitio ad basic properties

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests

More information

32 estimating the cumulative distribution function

32 estimating the cumulative distribution function 32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio

More information

Lecture 7: Properties of Random Samples

Lecture 7: Properties of Random Samples Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ

More information

REGRESSION WITH QUADRATIC LOSS

REGRESSION WITH QUADRATIC LOSS REGRESSION WITH QUADRATIC LOSS MAXIM RAGINSKY Regressio with quadratic loss is aother basic problem studied i statistical learig theory. We have a radom couple Z = X, Y ), where, as before, X is a R d

More information

Lecture 11 and 12: Basic estimation theory

Lecture 11 and 12: Basic estimation theory Lecture ad 2: Basic estimatio theory Sprig 202 - EE 94 Networked estimatio ad cotrol Prof. Kha March 2 202 I. MAXIMUM-LIKELIHOOD ESTIMATORS The maximum likelihood priciple is deceptively simple. Louis

More information

STA6938-Logistic Regression Model

STA6938-Logistic Regression Model Dr. Yig Zhag STA6938-Logistic Regressio Model Topic -Simple (Uivariate) Logistic Regressio Model Outlies:. Itroductio. A Example-Does the liear regressio model always work? 3. Maximum Likelihood Curve

More information

TAMS24: Notations and Formulas

TAMS24: Notations and Formulas TAMS4: Notatios ad Formulas Basic otatios ad defiitios X: radom variable stokastiska variabel Mea Vätevärde: µ = X = by Xiagfeg Yag kpx k, if X is discrete, xf Xxdx, if X is cotiuous Variace Varias: =

More information

Topics Machine learning: lecture 2. Review: the learning problem. Hypotheses and estimation. Estimation criterion cont d. Estimation criterion

Topics Machine learning: lecture 2. Review: the learning problem. Hypotheses and estimation. Estimation criterion cont d. Estimation criterion .87 Machie learig: lecture Tommi S. Jaakkola MIT CSAIL tommi@csail.mit.edu Topics The learig problem hypothesis class, estimatio algorithm loss ad estimatio criterio samplig, empirical ad epected losses

More information

THE KALMAN FILTER RAUL ROJAS

THE KALMAN FILTER RAUL ROJAS THE KALMAN FILTER RAUL ROJAS Abstract. This paper provides a getle itroductio to the Kalma filter, a umerical method that ca be used for sesor fusio or for calculatio of trajectories. First, we cosider

More information

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x

More information

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Chapter 13: Tests of Hypothesis Section 13.1 Introduction Chapter 13: Tests of Hypothesis Sectio 13.1 Itroductio RECAP: Chapter 1 discussed the Likelihood Ratio Method as a geeral approach to fid good test procedures. Testig for the Normal Mea Example, discussed

More information

Matrix Representation of Data in Experiment

Matrix Representation of Data in Experiment Matrix Represetatio of Data i Experimet Cosider a very simple model for resposes y ij : y ij i ij, i 1,; j 1,,..., (ote that for simplicity we are assumig the two () groups are of equal sample size ) Y

More information

Kurskod: TAMS11 Provkod: TENB 21 March 2015, 14:00-18:00. English Version (no Swedish Version)

Kurskod: TAMS11 Provkod: TENB 21 March 2015, 14:00-18:00. English Version (no Swedish Version) Kurskod: TAMS Provkod: TENB 2 March 205, 4:00-8:00 Examier: Xiagfeg Yag (Tel: 070 2234765). Please aswer i ENGLISH if you ca. a. You are allowed to use: a calculator; formel -och tabellsamlig i matematisk

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

9. Simple linear regression G2.1) Show that the vector of residuals e = Y Ŷ has the covariance matrix (I X(X T X) 1 X T )σ 2.

9. Simple linear regression G2.1) Show that the vector of residuals e = Y Ŷ has the covariance matrix (I X(X T X) 1 X T )σ 2. LINKÖPINGS UNIVERSITET Matematiska Istitutioe Matematisk Statistik HT1-2015 TAMS24 9. Simple liear regressio G2.1) Show that the vector of residuals e = Y Ŷ has the covariace matrix (I X(X T X) 1 X T )σ

More information

Bayesian Methods: Introduction to Multi-parameter Models

Bayesian Methods: Introduction to Multi-parameter Models Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ), Cofidece Iterval Estimatio Problems Suppose we have a populatio with some ukow parameter(s). Example: Normal(,) ad are parameters. We eed to draw coclusios (make ifereces) about the ukow parameters. We

More information

Regression with quadratic loss

Regression with quadratic loss Regressio with quadratic loss Maxim Ragisky October 13, 2015 Regressio with quadratic loss is aother basic problem studied i statistical learig theory. We have a radom couple Z = X,Y, where, as before,

More information

Simple Regression. Acknowledgement. These slides are based on presentations created and copyrighted by Prof. Daniel Menasce (GMU) CS 700

Simple Regression. Acknowledgement. These slides are based on presentations created and copyrighted by Prof. Daniel Menasce (GMU) CS 700 Simple Regressio CS 7 Ackowledgemet These slides are based o presetatios created ad copyrighted by Prof. Daiel Measce (GMU) Basics Purpose of regressio aalysis: predict the value of a depedet or respose

More information

Logit regression Logit regression

Logit regression Logit regression Logit regressio Logit regressio models the probability of Y= as the cumulative stadard logistic distributio fuctio, evaluated at z = β 0 + β X: Pr(Y = X) = F(β 0 + β X) F is the cumulative logistic distributio

More information

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15 17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

Chapter 1 Simple Linear Regression (part 6: matrix version)

Chapter 1 Simple Linear Regression (part 6: matrix version) Chapter Simple Liear Regressio (part 6: matrix versio) Overview Simple liear regressio model: respose variable Y, a sigle idepedet variable X Y β 0 + β X + ε Multiple liear regressio model: respose Y,

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Stat 139 Homework 7 Solutions, Fall 2015

Stat 139 Homework 7 Solutions, Fall 2015 Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,

More information

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution Iteratioal Mathematical Forum, Vol., 3, o. 3, 3-53 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/.9/imf.3.335 Double Stage Shrikage Estimator of Two Parameters Geeralized Expoetial Distributio Alaa M.

More information

Stochastic Simulation

Stochastic Simulation Stochastic Simulatio 1 Itroductio Readig Assigmet: Read Chapter 1 of text. We shall itroduce may of the key issues to be discussed i this course via a couple of model problems. Model Problem 1 (Jackso

More information

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22 CS 70 Discrete Mathematics for CS Sprig 2007 Luca Trevisa Lecture 22 Aother Importat Distributio The Geometric Distributio Questio: A biased coi with Heads probability p is tossed repeatedly util the first

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

It should be unbiased, or approximately unbiased. Variance of the variance estimator should be small. That is, the variance estimator is stable.

It should be unbiased, or approximately unbiased. Variance of the variance estimator should be small. That is, the variance estimator is stable. Chapter 10 Variace Estimatio 10.1 Itroductio Variace estimatio is a importat practical problem i survey samplig. Variace estimates are used i two purposes. Oe is the aalytic purpose such as costructig

More information

Lecture 12: September 27

Lecture 12: September 27 36-705: Itermediate Statistics Fall 207 Lecturer: Siva Balakrisha Lecture 2: September 27 Today we will discuss sufficiecy i more detail ad the begi to discuss some geeral strategies for costructig estimators.

More information

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACATÓLICA Quatitative Methods Miguel Gouveia Mauel Leite Moteiro Faculdade de Ciêcias Ecoómicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACatólica 006/07 Métodos Quatitativos

More information

The picture in figure 1.1 helps us to see that the area represents the distance traveled. Figure 1: Area represents distance travelled

The picture in figure 1.1 helps us to see that the area represents the distance traveled. Figure 1: Area represents distance travelled 1 Lecture : Area Area ad distace traveled Approximatig area by rectagles Summatio The area uder a parabola 1.1 Area ad distace Suppose we have the followig iformatio about the velocity of a particle, how

More information

Mathematical Statistics - MS

Mathematical Statistics - MS Paper Specific Istructios. The examiatio is of hours duratio. There are a total of 60 questios carryig 00 marks. The etire paper is divided ito three sectios, A, B ad C. All sectios are compulsory. Questios

More information

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process. Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike

More information

University of California, Los Angeles Department of Statistics. Simple regression analysis

University of California, Los Angeles Department of Statistics. Simple regression analysis Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100C Istructor: Nicolas Christou Simple regressio aalysis Itroductio: Regressio aalysis is a statistical method aimig at discoverig

More information

Lecture 15: Learning Theory: Concentration Inequalities

Lecture 15: Learning Theory: Concentration Inequalities STAT 425: Itroductio to Noparametric Statistics Witer 208 Lecture 5: Learig Theory: Cocetratio Iequalities Istructor: Ye-Chi Che 5. Itroductio Recall that i the lecture o classificatio, we have see that

More information

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N. 3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

Least-Squares Regression

Least-Squares Regression MATH 482 Least-Squares Regressio Dr. Neal, WKU As well as fidig the correlatio of paired sample data {{ x 1, y 1 }, { x 2, y 2 },..., { x, y }}, we also ca plot the data with a scatterplot ad fid the least

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

Notes on iteration and Newton s method. Iteration

Notes on iteration and Newton s method. Iteration Notes o iteratio ad Newto s method Iteratio Iteratio meas doig somethig over ad over. I our cotet, a iteratio is a sequece of umbers, vectors, fuctios, etc. geerated by a iteratio rule of the type 1 f

More information

10-701/ Machine Learning Mid-term Exam Solution

10-701/ Machine Learning Mid-term Exam Solution 0-70/5-78 Machie Learig Mid-term Exam Solutio Your Name: Your Adrew ID: True or False (Give oe setece explaatio) (20%). (F) For a cotiuous radom variable x ad its probability distributio fuctio p(x), it

More information

Algebra of Least Squares

Algebra of Least Squares October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal

More information

ECE 901 Lecture 14: Maximum Likelihood Estimation and Complexity Regularization

ECE 901 Lecture 14: Maximum Likelihood Estimation and Complexity Regularization ECE 90 Lecture 4: Maximum Likelihood Estimatio ad Complexity Regularizatio R Nowak 5/7/009 Review : Maximum Likelihood Estimatio We have iid observatios draw from a ukow distributio Y i iid p θ, i,, where

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

Joint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { }

Joint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { } UCLA STAT A Applied Probability & Statistics for Egieers Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistat: Neda Farziia, UCLA Statistics Uiversity of Califoria, Los Ageles, Sprig

More information

Statistics 203 Introduction to Regression and Analysis of Variance Assignment #1 Solutions January 20, 2005

Statistics 203 Introduction to Regression and Analysis of Variance Assignment #1 Solutions January 20, 2005 Statistics 203 Itroductio to Regressio ad Aalysis of Variace Assigmet #1 Solutios Jauary 20, 2005 Q. 1) (MP 2.7) (a) Let x deote the hydrocarbo percetage, ad let y deote the oxyge purity. The simple liear

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2 82 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Defiitio: et X be a radom sample with joit p.m/d.f. f X x θ. The geeralised likelihood ratio test g.l.r.t. of the NH : θ H 0 agaist the alterative AH : θ H 1,

More information

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators. IE 330 Seat # Ope book ad otes 120 miutes Cover page ad six pages of exam No calculators Score Fial Exam (example) Schmeiser Ope book ad otes No calculator 120 miutes 1 True or false (for each, 2 poits

More information

Machine Learning Theory (CS 6783)

Machine Learning Theory (CS 6783) Machie Learig Theory (CS 6783) Lecture 2 : Learig Frameworks, Examples Settig up learig problems. X : istace space or iput space Examples: Computer Visio: Raw M N image vectorized X = 0, 255 M N, SIFT

More information