1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments

Size: px
Start display at page:

Download "1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments"

Transcription

1 /4/008 Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University C. A Sample of Data C. An Econometric Model C.3 Estimating the Mean of a Population C.4 Estimating the Population Variance and Other Moments C.5 Interval Estimation Slide C- C.6 Hypothesis Tests About a Population Mean C.7 Some Other Useful Tests C.8 Introduction to Maximum Likelihood Estimation C.9 Algebraic Supplements Slide C-3

2 /4/008 Slide C-4 Figure C. Histogram of Hip Sizes Slide C-5 EY [ ] =μ (C.) var( Y) = E[ Y E( Y)] = E[ Y μ ] =σ (C.) Slide C-6

3 /4/008 y = yi (C.3) Y = Yi / i= (C.4) Slide C-7 y = yi (C.3) Y = Yi / i= (C.4) Slide C-8 Slide C-9 3

4 /4/008 Y Y = Yi = Y+ Y Y i= (C.5) E[ Y] = E Y + E Y E Y = EY [ ] + EY [ ] EY [ ] = μ+ μ μ =μ Slide C-0 Y var ( Y) = var Y+ Y Y = var Y var Y... var Y = σ + σ σ σ = ( Y ) + ( Y ) + + ( Y ) (C.6) Slide C- Y Figure C. Increasing Sample Size and Sampling Distribution of Y Slide C- 4

5 /4/008 Y Y μ P μ Y μ+ = P σ/ σ/ σ/ = P Z.5.5 = P Z =.9544 [ ] If we draw a random sample of size = 40 from a normal population with variance 0, the least squares estimator will provide an estimate within inch of the true value about 95% of the time. If = 80 the probability that Y is within inch of μ increases to Slide C-3 Central Limit Theorem: If Y,,Y are independent and identically distributed random variables with mean μ and variance σ Y μ, and Y = Yi /, then Z = σ has a probability distribution that converges to the standard normal (0,) as. Slide C-4 f ( y) y 0< y< = 0 otherwise Y /3 Z = /8 Slide C-5 5

6 /4/008 Figure C.3 Central Limit Theorem Slide C-6 A powerful finding about the estimator of the population mean is that it is the best of all possible estimators that are both linear and unbiased. A linear estimator is simply one that is a weighted average of the Y i s, such as Y = ay, where the a i i i are constants. Best means that it is the linear unbiased estimator with the smallest possible variance. Slide C-7 μ r = E Y ( μ) r ( ) ( ) μ = μ = μ = E Y E Y 0 ( Y ) μ = E μ =σ μ 3 = E μ 4 = E ( Y μ) ( Y μ) 3 4 Slide C-8 6

7 /4/008 var ( Y) =σ = E[ Y μ] σ = ( Yi Y) ˆ σ = ( Y ) i Y (C.7) Slide C-9 ( ) var Y =σˆ (C.8) se Y = var Y =σˆ / ( ) ( ) (C.9) Slide C-0 μ r = E Y ( μ) r In statistics the Law of Large umbers says that sample means converge to population averages (expected values) as the sample size. 3 4 ( i ) ( i ) ( i ) μ = Y Y =σ μ = μ = 3 Y Y 4 Y Y Slide C- 7

8 /4/008 μ = = σ 3 skewness S 3 4 kurtosis K μ = = σ 4 Slide C- ( yi y) ( yi ) σ ˆ = = = = σˆ var ( Y ) = = = ( Y) se =σ ˆ =.556 Slide C-3 ( Y ) i Y σ= = = 3 4 ( i ) μ = Y Y = ( i ) μ = Y Y = Slide C-4 8

9 /4/008 Y μ 8 μ PY ( > 8) = P > σ σ Y y > 8 > = >.4659 =.307 ˆ σ.8070 P ( Y ) P P ( Z ) Y y y* 7.58 y* 7.58 PY ( y* ) P P Z.95 ˆ = = σ y * 7.58 =.645 y * = Slide C-5 C.5. Interval Estimation: σ Known Y = Y i i= ( ) Y ~ μ, σ Z Y μ = = σ σ Y μ ~ 0, ( ) (C.0) P[ Z z] =Φ( z) Slide C-6 Figure C.4 Critical Values for the (0,) Distribution Slide C-7 9

10 /4/008 [ ] PZ [ ] PZ.96 =.96 =.05 [ ] P.96 Z.96 =.05 =.95 (C.) P Y.96 Y.96 σ μ + σ =.95 Slide C-8 σ σ P Y zc μ Y + zc = α (C.) Y ± z c σ (C.3) Slide C-9 Slide C-30 0

11 /4/008 Slide C-3 Any one interval estimate may or may not contain the true population parameter value. If many samples of size are obtained, and intervals are constructed using (C.3) with ( α) =.95, then 95% of them will contain the true parameter value. A 95% level of confidence is the probability that the interval estimator will provide an interval containing the true parameter value. Our confidence is in the procedure, not in any one interval estimate. Slide C-3 When σ is unknown it is natural to replace it with its estimator σˆ. ( Y ) i Y i= σ ˆ = Y μ t = t σ ˆ ( ) (C.4) Slide C-33

12 /4/008 Y μ P tc tc = α σˆ σˆ σˆ P Y tc μ Y + tc = α σˆ Y ± tc or Y ± tcse Y ( ) (C.5) Slide C-34 Remark: The confidence interval (C.5) is based upon the assumption that the population is normally distributed, so that Y is normally distributed. If the population p is not normal, then we invoke the central limit theorem, and say that Y is approximately normal in large samples, which from Figure C.3 you can see might be as few as 30 observations. In this case we can use (C.5), recognizing that there is an approximation error introduced in smaller samples. Slide C-35 Slide C-36

13 /4/008 Given a random sample of size = 50 we estimated the mean U.S. hip width to be = 7.58 inches. ˆ 3.65 therefore ˆ.807 σ = σ= σ ˆ = =.556 σˆ.807 y± t c = 7.58 ±.0 = , [ ] Slide C-37 Components of Hypothesis Tests A null hypothesis, H 0 An alternative hypothesis, H A test statistic A rejection region A conclusion Slide C-38 The ull Hypothesis The null hypothesis, which is denoted H 0 (H-naught), specifies a value c for a parameter. We write the null hypothesis as H : μ = c. A null hypothesis is the belief we 0 will maintain until we are convinced by the sample evidence that it is not true, in which case we reject the null hypothesis. Slide C-39 3

14 /4/008 The Alternative Hypothesis H : μ > c If we reject the null hypothesis that μ = c, we accept the alternative that μ is greater than c. H : μ < c If we reject the null hypothesis that μ = c, we accept the alternative that μ is less than c. H : μ c If we reject the null hypothesis that μ = c, we accept the alternative that μ takes a value other than (not equal to) c. Slide C-40 The Test Statistic A test statistic s probability distribution is completely known when the null hypothesis is true, and it has some other distribution if the null hypothesis is not true. Y μ t = ~ t( ) σ ˆ If H : μ = c is true then 0 Y c t = ~ t( ) σ ˆ (C.6) Slide C-4 Remark: The test statistic distribution in (C.6) is based on an assumption that the population is normally distributed. If the population is not normal, then we invoke the central limit theorem, and say that Y is approximately normal in large samples. We can use (C.6), recognizing that there is an approximation error introduced if our sample is small. Slide C-4 4

15 /4/008 The Rejection Region If a value of the test statistic is obtained that falls in a region of low probability, then it is unlikely that the test statistic has the assumed distribution, and thus it is unlikely that the null hypothesis is true. If the alternative hypothesis is true, then values of the test statistic will tend to be unusually large or unusually small, determined by choosing a probability α, called the level of significance of the test. The level of significance of the test α is usually chosen to be.0,.05 or.0. Slide C-43 A Conclusion When you have completed a hypothesis test you should state your conclusion, whether you reject, or do not reject, the null hypothesis. Say what the conclusion means in the economic context of the problem you are working on, i.e., interpret the results in a meaningful way. Slide C-44 Figure C.5 The rejection region for the one-tail test of H : μ = c against H : μ > c Slide C-45 5

16 /4/008 Figure C.6 The rejection region for the one-tail test of H : μ = c against H : μ < c Slide C-46 Figure C.7 The rejection region for a test of H : μ = c against H : μ c Slide C-47 The null hypothesis is H : μ = The alternative hypothesis is H : μ > 6.5. Y 6.5 The test statistic t = ~ t ˆ hypothesis is true. σ ( ) if the null The level of significance α=.05. = t = tc (.95,49).6766 Slide C-48 6

17 /4/008 The value of the test statistic is t = = Conclusion: Since t =.5756 >.68 we reject the null hypothesis. The sample information we have is incompatible with the hypothesis that μ = 6.5. We accept the alternative that the population mean hip size is greater than 6.5 inches, at the α=.05 level of significance. Slide C-49 The null hypothesis is H : μ = 7. 0 The alternative hypothesis is H : μ 7. Y 7 The test statistic t = ~ t ˆ hypothesis is true. σ ( ) if the null The level of significance α=.05, therefore = t = tc (.975,49).0 α =.05. Slide C-50 The value of the test statistic is t = = Conclusion: Since.0 < t =.69<.0 we do not reject the null hypothesis. The sample information we have is compatible with the hypothesis that the population mean hip size μ = 7. Slide C-5 7

18 /4/008 Warning: Care must be taken here in interpreting the outcome of a statistical test. One of the basic precepts of hypothesis testing is that finding a sample value of the test statistic in the non-rejection region does not make the null hypothesis true! The weaker statements we do not reject the null hypothesis, or we fail to reject the null hypothesis, do not send a misleading message. Slide C-5 p-value rule: Reject the null hypothesis when the p- value is less than, or equal to, the level of significance α. That is, if p α then reject H 0. If p > α then do not reject H 0 Slide C-53 How the p-value is computed depends on the alternative. If t is the calculated value [not the critical value t c ] of the t- statistic with degrees of freedom, then: if H : μ > c, p = probability to the right of t if H : μ < c, p = probability to the left of t if H : μ c, p = sum of probabilities to the right of t and to the left of t Slide C-54 8

19 /4/008 Figure C.8 The p-value for a right-tail test Slide C-55 Figure C.9 The p-value for a two-tailed test Slide C-56 A statistical test procedure cannot prove the truth of a null hypothesis. When we fail to reject a null hypothesis, all the hypothesis test can establish is that the information in a sample of data is compatible with the null hypothesis. On the other hand, a statistical test can lead us to reject the null hypothesis, with only a small probability, α, of rejecting the null hypothesis when it is actually true. Thus rejecting a null hypothesis is a stronger conclusion than failing to reject it. Slide C-57 9

20 /4/008 Correct Decisions The null hypothesis is false and we decide to reject it. The null hypothesis is true and we decide not to reject it. Incorrect Decisions The null hypothesis is true and we decide to reject it (a Type I error) The null hypothesis is false and we decide not to reject it (a Type II error) Slide C-58 The probability of a Type II error varies inversely with the level of significance of the test, α, which is the probability of a Type I error. If you choose to make α smaller, the probability of a Type II error increases. If the null hypothesis is μ = c, and if the true (unknown) value of μ is close to c, then the probability of a Type II error is high. The larger the sample size, the lower the probability of a Type II error, given a level of Type I error α. Slide C-59 H : μ= c 0 H : μ c If we fail to reject the null hypothesis at the α level of significance, then the value c will fall within a 00( α)% confidence interval estimate of μ. If we reject the null hypothesis, then c will fall outside the 00( α)% confidence interval estimate of μ. Slide C-60 0

21 /4/008 We fail to reject the null hypothesis when when Y c tc t σˆ c t t t c, c or σˆ Y tc c Y + t c σˆ Slide C-6 C.7. Testing the population variance ( ) Y ~ μσ,, Y =Yi σ ˆ = ( Yi Y) ( ) H : σ =σ 0 0 ( )ˆ σ V = ~ χ σ 0 ( ) Slide C-6 If H : σ >σ, then the null hypothesis is rejected if V χ 0. (.95, ) If H: σ σ0, then we carry out a two tail test, and the null hypothesis is rejected if V χ(.975, ) or if V χ(.05, ). Slide C-63

22 /4/008 Case : Population variances are equal p ( ) ˆ ( ) σ + σˆ p + σ ˆ = σ =σ =σ If the null hypothesis H : μ μ = c is true then t = ( ) 0 Y Y c ~ t σ ˆ p + ( + ) Slide C-64 Case : Population variances are unequal df ( ) Y Y c * t = σˆ ˆ σ + ( σ ˆ +σˆ ) ( σˆ ) ( σˆ ) = + Slide C-65 ( ) σˆ σ ( ) σˆ σ ~ F (, ) ( ˆ ) σ σ σˆ σ ( ) F = = Slide C-66

23 /4/008 The normal distribution is symmetric, and has a bell-shape with a peakedness and tail-thickness leading to a kurtosis of 3. We can test for departures from normality by checking the skewness and kurtosis from a sample of data. μ skewness = S = σ μ 4 kurtosis = K = σ Slide C-67 The Jarque-Bera test statistic allows a joint test of these two characteristics, ( K ) JB = 6 S If we reject the null hypothesis then we know the data have nonnormal characteristics, but we do not know what distribution the population might have. Slide C-68 For the Hip data, ( K ) ( ) JB = S + = (.038) + = p= P χ().935 =.673 Slide C-69 3

24 /4/008 Figure C.0 Wheel of Fortune Game Slide C-70 For wheel A, with p=/4, the probability of observing WI, WI, LOSS is = 64 = For wheel B, with p=3/4, the probability of observing WI, WI, LOSS is = = Slide C-7 If we had to choose wheel A or B based on the available data, we would choose wheel B because it has a higher probability of having produced the observed data. It is more lk likelyl that wheel B was spun than wheel A, and p ˆ = 34is called the maximum likelihood estimate of p. The maximum likelihood principle seeks the parameter values that maximize the probability, or likelihood, of observing the outcomes actually obtained. Slide C-7 4

25 /4/008 Suppose p can be any probability between zero and one. The probability of observing WI, WI, LOSS is the likelihood L, and is 3 L( p) = p p ( p) = p p (C.7) We would like to find the value of p that maximizes the likelihood of observing the outcomes actually obtained. Slide C-73 Figure C. A Likelihood Function Slide C-74 ( ) dl p = p 3p dp ( ) p 3 p = 0 p 3 p = 0 = = There are two solutions to this equation, p=0 or p=/3. The value that maximizes L(p) is p ˆ = 3, which is the maximum likelihood estimate. Slide C-75 5

26 /4/008 Let us define the random variable X that takes the values x= (WI) and x=0 (LOSS) with probabilities p and p. x [ ] f ( p ) p ( p ) x P X = x = f x p = p p, x = 0, (,, ) = ( ) ( ) f x x p f x p f x p = p xi ( p) (,, x ) = L p x xi (C.8) Slide C-76 Figure C. A Log-Likelihood Function Slide C-77 ( ) = ( i ) ln L p ln f x p i= i i i= i= ( ) ( ) = x ln p + x ln p (C.9) ( ) dln L p xi xi = dp p p Slide C-78 6

27 /4/008 xi xi = 0 pˆ pˆ ( ) i ( i) p ˆ x p ˆ x = 0 xi pˆ = = x (C.0) Slide C-79 ( θ ) = ( i θ) ln L ln f x i= ( θv) θˆ ~ a, (C.) θ ˆ c a t = ~ t( ) se ( θˆ ) (C.) Slide C-80 REMARK: The asymptotic results in (C.) and (C.) hold only in large samples. The distribution of the test statistic can be approximated by a t-distribution with degrees of freedom. If is truly large then the t (-) distribution converges to the standard normal distribution (0,). When the sample size may not be large, we prefer using the t-distribution critical values, which are adjusted for small samples by the degrees of freedom correction, when obtaining interval estimates and carrying out hypothesis tests. Slide C-8 7

28 /4/008 ( ˆ d ln L V = var θ ) = E dθ ( θ) (C.3) Slide C-8 Figure C.3 Two Log-Likelihood Functions Slide C-83 ( ) ( ) d ln L p xi xi = dp p p (C.4) ( ) ( ) 0 ( 0) 0 ( ) E x = P x = + P x = = p+ p = p i i i Slide C-84 8

29 /4/008 ( ) ( i) ( i) = ( ) E d ln L p E x E x dp p p p p = p = p ( p) ( p) Slide C-85 d ln L p p p ( ) ( ) V = var ( pˆ ) = E = dp ( ) a p p pˆ ~ p, Slide C-86 ( pˆ) ˆ pˆ V = se ( pˆ ) = Vˆ = pˆ ( pˆ) Slide C-87 9

30 /4/008 ( pˆ) pˆ se ( pˆ ) = = = pˆ t = = =.7303 se.034 ( pˆ ) ( pˆ) ( ) [ ] pˆ ±.96 se =.375 ± =.3075,.445 Slide C-88 C.8.4a The likelihood ratio (LR) test The likelihood ratio statistic which is twice the difference between L ( θˆ ) L ( c ) ln and ln. ( ˆ ) ( ) LR= lnl θ lnl c (C.5) Slide C-89 Figure C.4 The Likelihood Ratio Test Slide C-90 30

31 /4/008 Figure C.5 Critical Value for a Chi-Square Distribution Slide C-9 ln L( pˆ) = x ln ˆ ln( ˆ i p+ xi p) i= i= ( ) = pˆln pˆ + pˆ ln( pˆ) ( ) = pˆln pˆ + pˆ ln( pˆ) Slide C-9 For the cereal box problem p ˆ =.375 and = 00. [ ] ln L( p ˆ) = ln(.375) + (.375)ln(.375) = 3.36 Slide C-93 3

32 /4/008 The value of the log-likelihood function assuming is true is: ln L (.4) = xi ln(.4) + xi ln(.4) i= i= = 75 ln(.4) + (00 75) ln(.6) = H : p=.4 0 Slide C-94 The problem is to assess whether 3.36 is significantly different from The LR test statistic (C.5) is: ( ) LR = [ln L( pˆ ) ln L(.4)] = 3.36 ( 3.575) =.547 The critical value is χ = (.95,) Since.547 < 3.84 we do not reject the null hypothesis. Slide C-95 Figure C.6 The Wald Statistic Slide C-96 3

33 /4/008 ( ˆ c) d ln L W = θ dθ ( θ) (C.6) If the null hypothesis is true then the Wald statistic (C.6) has a distribution, and we reject the null hypothesis if W χ χ(). ( α,) Slide C-97 ( θ) d ln L I ( θ ) = E = V dθ (C.7) ( ˆ ) ( ) W = θ c I θ (C.8) ( ˆ ) ( ˆ ) W = θ c V = θ c V (C.9) Slide C-98 ˆ V ( ˆ ) = I θ (C.30) θ ˆ c θ ˆ c W = = = t Vˆ se ( θˆ ) Slide C-99 33

34 /4/008 In the blue box-green box example: 00 ( ˆ ) ˆ I p = V = = = pˆ ( pˆ ).375 (.375 ) ( ) ( ) ( ) W = pˆ c I pˆ = =.5333 Slide C-00 Figure C.7 Motivating the Lagrange multiplier test Slide C-0 dln L s( θ ) = dθ ( θ) (C.3) ( ) ( θ) s c LM = = s( c) I ( θ) I (C.3) ( ) ( ) LM = s c I c ( ˆ ) ( ˆ) W = θ c I θ Slide C-0 34

35 /4/008 In the blue box-green box example: xi xi s(.4) = = = c c I (.4) = = = c ( c).4(.4) ( ) ( ) [ ] [ ] LM = s.4 I.4 = =.508 Slide C-03 C.9. Derivation of Least Squares Estimator S = ( y μ) i= i d = ( y μ) i i d = ( y μ) i i Slide C-04 ( μ ) = i = ( i μ) S d y i= i= ( μ ) = i μ i + μ = 0 μ+ μ S y y a a a i= i= a = y = , a = y = , a = = 50 0 i i Slide C-05 35

36 /4/008 Figure C.8 The Sum of Squares Parabola For the Hip Data Slide C-06 ( ) ds μ = a + a μ dμ a + a μ= ˆ 0 a a i i= μ= ˆ = = Y y i i= μ= ˆ = Y y Slide C-07 For the hip data in Table C. yi i= μ= ˆ = = Thus we estimate that the average hip size in the population is 7.58 inches. Slide C-08 36

37 /4/008 Y = Yi / = Y+ Y Y i= = ay + ay ay = ay i i i= Slide C-09 a iyi i= Y = a i = ai + ci = + ci Slide C-0 Y = a Y = + c Y i i i i i= i= = Y + cy i i i i= i= = Y + cy i i i= Slide C- 37

38 /4/008 E Y = E Y + cy =μ+ ce Y [ ] i i i i i= i= =μ+μc i i= Slide C- i i i i i i i= i= i= var( Y ) = var a Y = var + c Y = + c var( Y) ci c i ci ci ci i= i= i= i= =σ + =σ + + =σ + + ci i= =σ +σ (since c = 0) i= i = var( Y) +σ ci i= Slide C-3 alternative hypothesis asymptotic distribution BLUE central limit theorem central moments estimate estimator experimental design information measure interval estimate Lagrange multiplier test Law of large numbers level of significance likelihood function likelihood ratio test linear estimator log likelihood function maximum likelihood estimation null hypothesis point estimate population parameter p-value random sample rejection region sample mean sample variance sampling distribution sampling variation standard error standard error of the mean standard error of the estimate statistical inference test statistic two-tail tests Type I error Type II error unbiased estimators Wald test Slide C-4 38

+ Specify 1 tail / 2 tail

+ Specify 1 tail / 2 tail Week 2: Null hypothesis Aeroplane seat designer wonders how wide to make the plane seats. He assumes population average hip size μ = 43.2cm Sample size n = 50 Question : Is the assumption μ = 43.2cm reasonable?

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

LECTURE 10: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING. The last equality is provided so this can look like a more familiar parametric test.

LECTURE 10: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING. The last equality is provided so this can look like a more familiar parametric test. Economics 52 Econometrics Professor N.M. Kiefer LECTURE 1: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING NEYMAN-PEARSON LEMMA: Lesson: Good tests are based on the likelihood ratio. The proof is easy in the

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Applied Econometrics (QEM)

Applied Econometrics (QEM) Applied Econometrics (QEM) based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #3 1 / 42 Outline 1 2 3 t-test P-value Linear

More information

Chapter 8 Heteroskedasticity

Chapter 8 Heteroskedasticity Chapter 8 Walter R. Paczkowski Rutgers University Page 1 Chapter Contents 8.1 The Nature of 8. Detecting 8.3 -Consistent Standard Errors 8.4 Generalized Least Squares: Known Form of Variance 8.5 Generalized

More information

Semester 2, 2015/2016

Semester 2, 2015/2016 ECN 3202 APPLIED ECONOMETRICS 2. Simple linear regression B Mr. Sydney Armstrong Lecturer 1 The University of Guyana 1 Semester 2, 2015/2016 PREDICTION The true value of y when x takes some particular

More information

Bias Variance Trade-off

Bias Variance Trade-off Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

Single Sample Means. SOCY601 Alan Neustadtl

Single Sample Means. SOCY601 Alan Neustadtl Single Sample Means SOCY601 Alan Neustadtl The Central Limit Theorem If we have a population measured by a variable with a mean µ and a standard deviation σ, and if all possible random samples of size

More information

AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY

AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY Econometrics Working Paper EWP0401 ISSN 1485-6441 Department of Economics AN EMPIRICAL LIKELIHOOD RATIO TEST FOR NORMALITY Lauren Bin Dong & David E. A. Giles Department of Economics, University of Victoria

More information

4.1 Least Squares Prediction 4.2 Measuring Goodness-of-Fit. 4.3 Modeling Issues. 4.4 Log-Linear Models

4.1 Least Squares Prediction 4.2 Measuring Goodness-of-Fit. 4.3 Modeling Issues. 4.4 Log-Linear Models 4.1 Least Squares Prediction 4. Measuring Goodness-of-Fit 4.3 Modeling Issues 4.4 Log-Linear Models y = β + β x + e 0 1 0 0 ( ) E y where e 0 is a random error. We assume that and E( e 0 ) = 0 var ( e

More information

STAT Chapter 8: Hypothesis Tests

STAT Chapter 8: Hypothesis Tests STAT 515 -- Chapter 8: Hypothesis Tests CIs are possibly the most useful forms of inference because they give a range of reasonable values for a parameter. But sometimes we want to know whether one particular

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE

Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE Eric Zivot Winter 013 1 Wald, LR and LM statistics based on generalized method of moments estimation Let 1 be an iid sample

More information

Basic Concepts of Inference

Basic Concepts of Inference Basic Concepts of Inference Corresponds to Chapter 6 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT) with some slides by Jacqueline Telford (Johns Hopkins University) and Roy Welsch (MIT).

More information

Testing and Model Selection

Testing and Model Selection Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses

More information

Review. December 4 th, Review

Review. December 4 th, Review December 4 th, 2017 Att. Final exam: Course evaluation Friday, 12/14/2018, 10:30am 12:30pm Gore Hall 115 Overview Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 6: Statistics and Sampling Distributions Chapter

More information

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Quantitative Methods for Economics, Finance and Management (A86050 F86050) Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will

More information

Finite-sample quantiles of the Jarque-Bera test

Finite-sample quantiles of the Jarque-Bera test Finite-sample quantiles of the Jarque-Bera test Steve Lawford Department of Economics and Finance, Brunel University First draft: February 2004. Abstract The nite-sample null distribution of the Jarque-Bera

More information

ECON 4160, Autumn term Lecture 1

ECON 4160, Autumn term Lecture 1 ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least

More information

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test 1 Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test Learning Objectives After this section, you should be able to DESCRIBE the relationship between the significance level of a test, P(Type

More information

Assumptions of classical multiple regression model

Assumptions of classical multiple regression model ESD: Recitation #7 Assumptions of classical multiple regression model Linearity Full rank Exogeneity of independent variables Homoscedasticity and non autocorrellation Exogenously generated data Normal

More information

Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics)

Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics) Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics) Probability quantifies randomness and uncertainty How do I estimate the normalization and logarithmic slope of a X ray continuum, assuming

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y

More information

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

y ˆ i = ˆ  T u i ( i th fitted value or i th fit) 1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u

More information

SPRING 2007 EXAM C SOLUTIONS

SPRING 2007 EXAM C SOLUTIONS SPRING 007 EXAM C SOLUTIONS Question #1 The data are already shifted (have had the policy limit and the deductible of 50 applied). The two 350 payments are censored. Thus the likelihood function is L =

More information

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Sampling A trait is measured on each member of a population. f(y) = propn of individuals in the popn with measurement

More information

Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT):

Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT): Lecture Three Normal theory null distributions Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT): A random variable which is a sum of many

More information

STAT Chapter 11: Regression

STAT Chapter 11: Regression STAT 515 -- Chapter 11: Regression Mostly we have studied the behavior of a single random variable. Often, however, we gather data on two random variables. We wish to determine: Is there a relationship

More information

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Econometrics Working Paper EWP0402 ISSN 1485-6441 Department of Economics TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Lauren Bin Dong & David E. A. Giles Department

More information

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015 AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 6 Sampling and Sampling Distributions Ch. 6-1 6.1 Tools of Business Statistics n Descriptive statistics n Collecting, presenting, and describing data n Inferential

More information

LECTURE 5. Introduction to Econometrics. Hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will

More information

CENTRAL LIMIT THEOREM (CLT)

CENTRAL LIMIT THEOREM (CLT) CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

The Logit Model: Estimation, Testing and Interpretation

The Logit Model: Estimation, Testing and Interpretation The Logit Model: Estimation, Testing and Interpretation Herman J. Bierens October 25, 2008 1 Introduction to maximum likelihood estimation 1.1 The likelihood function Consider a random sample Y 1,...,

More information

Introduction to Statistical Data Analysis III

Introduction to Statistical Data Analysis III Introduction to Statistical Data Analysis III JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? The

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions

More information

Introduction Large Sample Testing Composite Hypotheses. Hypothesis Testing. Daniel Schmierer Econ 312. March 30, 2007

Introduction Large Sample Testing Composite Hypotheses. Hypothesis Testing. Daniel Schmierer Econ 312. March 30, 2007 Hypothesis Testing Daniel Schmierer Econ 312 March 30, 2007 Basics Parameter of interest: θ Θ Structure of the test: H 0 : θ Θ 0 H 1 : θ Θ 1 for some sets Θ 0, Θ 1 Θ where Θ 0 Θ 1 = (often Θ 1 = Θ Θ 0

More information

POLI 443 Applied Political Research

POLI 443 Applied Political Research POLI 443 Applied Political Research Session 6: Tests of Hypotheses Contingency Analysis Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Exercises Chapter 4 Statistical Hypothesis Testing

Exercises Chapter 4 Statistical Hypothesis Testing Exercises Chapter 4 Statistical Hypothesis Testing Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans December 5, 013 Christophe Hurlin (University of Orléans) Advanced Econometrics

More information

CH.9 Tests of Hypotheses for a Single Sample

CH.9 Tests of Hypotheses for a Single Sample CH.9 Tests of Hypotheses for a Single Sample Hypotheses testing Tests on the mean of a normal distributionvariance known Tests on the mean of a normal distributionvariance unknown Tests on the variance

More information

Formal Statement of Simple Linear Regression Model

Formal Statement of Simple Linear Regression Model Formal Statement of Simple Linear Regression Model Y i = β 0 + β 1 X i + ɛ i Y i value of the response variable in the i th trial β 0 and β 1 are parameters X i is a known constant, the value of the predictor

More information

Statistics and econometrics

Statistics and econometrics 1 / 36 Slides for the course Statistics and econometrics Part 10: Asymptotic hypothesis testing European University Institute Andrea Ichino September 8, 2014 2 / 36 Outline Why do we need large sample

More information

Course 4 Solutions November 2001 Exams

Course 4 Solutions November 2001 Exams Course 4 Solutions November 001 Exams November, 001 Society of Actuaries Question #1 From the Yule-Walker equations: ρ φ + ρφ 1 1 1. 1 1+ ρ ρφ φ Substituting the given quantities yields: 0.53 φ + 0.53φ

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness

More information

Statistical Distribution Assumptions of General Linear Models

Statistical Distribution Assumptions of General Linear Models Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions

More information

Frequency Distribution Cross-Tabulation

Frequency Distribution Cross-Tabulation Frequency Distribution Cross-Tabulation 1) Overview 2) Frequency Distribution 3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability iii. Measures of Shape

More information

Introductory Econometrics. Review of statistics (Part II: Inference)

Introductory Econometrics. Review of statistics (Part II: Inference) Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing

More information

Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you.

Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you. ISQS 5347 Final Exam Spring 2017 Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you. 1. Recall the commute

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random

More information

Statistical inference

Statistical inference Statistical inference Contents 1. Main definitions 2. Estimation 3. Testing L. Trapani MSc Induction - Statistical inference 1 1 Introduction: definition and preliminary theory In this chapter, we shall

More information

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Florian Pelgrin HEC September-December 2010 Florian Pelgrin (HEC) Constrained estimators September-December

More information

Analysis of Cross-Sectional Data

Analysis of Cross-Sectional Data Analysis of Cross-Sectional Data Kevin Sheppard http://www.kevinsheppard.com Oxford MFE This version: November 8, 2017 November 13 14, 2017 Outline Econometric models Specification that can be analyzed

More information

8. Hypothesis Testing

8. Hypothesis Testing FE661 - Statistical Methods for Financial Engineering 8. Hypothesis Testing Jitkomut Songsiri introduction Wald test likelihood-based tests significance test for linear regression 8-1 Introduction elements

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Asymptotics Asymptotics Multiple Linear Regression: Assumptions Assumption MLR. (Linearity in parameters) Assumption MLR. (Random Sampling from the population) We have a random

More information

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions Journal of Modern Applied Statistical Methods Volume 8 Issue 1 Article 13 5-1-2009 Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error

More information

Introduction to Statistical Inference

Introduction to Statistical Inference Introduction to Statistical Inference Dr. Fatima Sanchez-Cabo f.sanchezcabo@tugraz.at http://www.genome.tugraz.at Institute for Genomics and Bioinformatics, Graz University of Technology, Austria Introduction

More information

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner

More information

Harvard University. Rigorous Research in Engineering Education

Harvard University. Rigorous Research in Engineering Education Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected

More information

Empirical Economic Research, Part II

Empirical Economic Research, Part II Based on the text book by Ramanathan: Introductory Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna December 7, 2011 Outline Introduction

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong Statistics Primer ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong 1 Quick Overview of Statistics 2 Descriptive vs. Inferential Statistics Descriptive Statistics: summarize and describe data

More information

Lecture 7: Hypothesis Testing and ANOVA

Lecture 7: Hypothesis Testing and ANOVA Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis

More information

Linear Models and Estimation by Least Squares

Linear Models and Estimation by Least Squares Linear Models and Estimation by Least Squares Jin-Lung Lin 1 Introduction Causal relation investigation lies in the heart of economics. Effect (Dependent variable) cause (Independent variable) Example:

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

Functional Form. So far considered models written in linear form. Y = b 0 + b 1 X + u (1) Implies a straight line relationship between y and X

Functional Form. So far considered models written in linear form. Y = b 0 + b 1 X + u (1) Implies a straight line relationship between y and X Functional Form So far considered models written in linear form Y = b 0 + b 1 X + u (1) Implies a straight line relationship between y and X Functional Form So far considered models written in linear form

More information

Maximum-Likelihood Estimation: Basic Ideas

Maximum-Likelihood Estimation: Basic Ideas Sociology 740 John Fox Lecture Notes Maximum-Likelihood Estimation: Basic Ideas Copyright 2014 by John Fox Maximum-Likelihood Estimation: Basic Ideas 1 I The method of maximum likelihood provides estimators

More information

Hypothesis Testing Problem. TMS-062: Lecture 5 Hypotheses Testing. Alternative Hypotheses. Test Statistic

Hypothesis Testing Problem. TMS-062: Lecture 5 Hypotheses Testing. Alternative Hypotheses. Test Statistic Hypothesis Testing Problem TMS-062: Lecture 5 Hypotheses Testing Same basic situation as befe: Data: random i. i. d. sample X 1,..., X n from a population and we wish to draw inference about unknown population

More information

Lecture 3: Inference in SLR

Lecture 3: Inference in SLR Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals

More information

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Financial Econometrics and Quantitative Risk Managenent Return Properties

Financial Econometrics and Quantitative Risk Managenent Return Properties Financial Econometrics and Quantitative Risk Managenent Return Properties Eric Zivot Updated: April 1, 2013 Lecture Outline Course introduction Return definitions Empirical properties of returns Reading

More information

A3. Statistical Inference Hypothesis Testing for General Population Parameters

A3. Statistical Inference Hypothesis Testing for General Population Parameters Appendix / A3. Statistical Inference / General Parameters- A3. Statistical Inference Hypothesis Testing for General Population Parameters POPULATION H 0 : θ = θ 0 θ is a generic parameter of interest (e.g.,

More information

Space Telescope Science Institute statistics mini-course. October Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses

Space Telescope Science Institute statistics mini-course. October Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses Space Telescope Science Institute statistics mini-course October 2011 Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses James L Rosenberger Acknowledgements: Donald Richards, William

More information

Session 3 The proportional odds model and the Mann-Whitney test

Session 3 The proportional odds model and the Mann-Whitney test Session 3 The proportional odds model and the Mann-Whitney test 3.1 A unified approach to inference 3.2 Analysis via dichotomisation 3.3 Proportional odds 3.4 Relationship with the Mann-Whitney test Session

More information

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

Unobservable Parameter. Observed Random Sample. Calculate Posterior. Choosing Prior. Conjugate prior. population proportion, p prior:

Unobservable Parameter. Observed Random Sample. Calculate Posterior. Choosing Prior. Conjugate prior. population proportion, p prior: Pi Priors Unobservable Parameter population proportion, p prior: π ( p) Conjugate prior π ( p) ~ Beta( a, b) same PDF family exponential family only Posterior π ( p y) ~ Beta( a + y, b + n y) Observed

More information

Categorical Data Analysis Chapter 3

Categorical Data Analysis Chapter 3 Categorical Data Analysis Chapter 3 The actual coverage probability is usually a bit higher than the nominal level. Confidence intervals for association parameteres Consider the odds ratio in the 2x2 table,

More information

Data Mining Chapter 4: Data Analysis and Uncertainty Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University

Data Mining Chapter 4: Data Analysis and Uncertainty Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Data Mining Chapter 4: Data Analysis and Uncertainty Fall 2011 Ming Li Department of Computer Science and Technology Nanjing University Why uncertainty? Why should data mining care about uncertainty? We

More information

You may not use your books/notes on this exam. You may use calculator.

You may not use your books/notes on this exam. You may use calculator. MATH 450 Fall 2018 Review problems 12/03/18 Time Limit: 60 Minutes Name (Print: This exam contains 6 pages (including this cover page and 5 problems. Check to see if any pages are missing. Enter all requested

More information

Econometrics A. Simple linear model (2) Keio University, Faculty of Economics. Simon Clinet (Keio University) Econometrics A October 16, / 11

Econometrics A. Simple linear model (2) Keio University, Faculty of Economics. Simon Clinet (Keio University) Econometrics A October 16, / 11 Econometrics A Keio University, Faculty of Economics Simple linear model (2) Simon Clinet (Keio University) Econometrics A October 16, 2018 1 / 11 Estimation of the noise variance σ 2 In practice σ 2 too

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

LECTURE 5 HYPOTHESIS TESTING

LECTURE 5 HYPOTHESIS TESTING October 25, 2016 LECTURE 5 HYPOTHESIS TESTING Basic concepts In this lecture we continue to discuss the normal classical linear regression defined by Assumptions A1-A5. Let θ Θ R d be a parameter of interest.

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2008 Prof. Gesine Reinert 1 Data x = x 1, x 2,..., x n, realisations of random variables X 1, X 2,..., X n with distribution (model)

More information

Probability & Statistics - FALL 2008 FINAL EXAM

Probability & Statistics - FALL 2008 FINAL EXAM 550.3 Probability & Statistics - FALL 008 FINAL EXAM NAME. An urn contains white marbles and 8 red marbles. A marble is drawn at random from the urn 00 times with replacement. Which of the following is

More information

Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen. Hypothesis testing. Anna Wegloop Niels Landwehr/Tobias Scheffer

Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen. Hypothesis testing. Anna Wegloop Niels Landwehr/Tobias Scheffer Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen Hypothesis testing Anna Wegloop iels Landwehr/Tobias Scheffer Why do a statistical test? input computer model output Outlook ull-hypothesis

More information

Problem 1 (20) Log-normal. f(x) Cauchy

Problem 1 (20) Log-normal. f(x) Cauchy ORF 245. Rigollet Date: 11/21/2008 Problem 1 (20) f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.0 0.2 0.4 0.6 0.8 4 2 0 2 4 Normal (with mean -1) 4 2 0 2 4 Negative-exponential x x f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.5

More information