TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

Similar documents
Topic 9: Sampling Distributions of Estimators

Stat 319 Theory of Statistics (2) Exercises

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)

Estimation for Complete Data

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2

Linear Regression Models

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

Algebra of Least Squares

11 Correlation and Regression

Properties and Hypothesis Testing

Statistics 20: Final Exam Solutions Summer Session 2007

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Topic 9: Sampling Distributions of Estimators

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

1 Inferential Methods for Correlation and Regression Analysis

SDS 321: Introduction to Probability and Statistics

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Section 14. Simple linear regression.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Statistics 203 Introduction to Regression and Analysis of Variance Assignment #1 Solutions January 20, 2005

Topic 9: Sampling Distributions of Estimators

1 Models for Matched Pairs

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

Simulation. Two Rule For Inverting A Distribution Function

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Regression, Inference, and Model Building

Mathematical Statistics - MS

Topic 18: Composite Hypotheses

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

Binomial Distribution

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

(all terms are scalars).the minimization is clearer in sum notation:

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

Data Analysis and Statistical Methods Statistics 651

Geometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT

IIT JAM Mathematical Statistics (MS) 2006 SECTION A

Mathematical Notation Math Introduction to Applied Statistics

Stat 200 -Testing Summary Page 1

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

There is no straightforward approach for choosing the warmup period l.

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Common Large/Small Sample Tests 1/55

Composite Hypotheses

TAMS24: Notations and Formulas

Problem Set 4 Due Oct, 12

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

Exercise 4.3 Use the Continuity Theorem to prove the Cramér-Wold Theorem, Theorem. (1) φ a X(1).

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

MATH 472 / SPRING 2013 ASSIGNMENT 2: DUE FEBRUARY 4 FINALIZED

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Stat 139 Homework 7 Solutions, Fall 2015

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

STAC51: Categorical data Analysis

Summary. Recap ... Last Lecture. Summary. Theorem

6.3 Testing Series With Positive Terms

Simple Regression. Acknowledgement. These slides are based on presentations created and copyrighted by Prof. Daniel Menasce (GMU) CS 700

Expectation and Variance of a random variable

Homework for 4/9 Due 4/16

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

(7 One- and Two-Sample Estimation Problem )

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

Power and Type II Error

Lecture 2: Monte Carlo Simulation

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

A statistical method to determine sample size to estimate characteristic value of soil parameters

Question 1: Exercise 8.2

Machine Learning Brett Bernstein

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

1 Review of Probability & Statistics

University of California, Los Angeles Department of Statistics. Hypothesis testing

Probability and Statistics

STAT431 Review. X = n. n )

IE 230 Probability & Statistics in Engineering I. Closed book and notes. No calculators. 120 minutes.

Stat 421-SP2012 Interval Estimation Section

Efficient GMM LECTURE 12 GMM II

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

Math 140 Introductory Statistics

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740

6 Sample Size Calculations

1036: Probability & Statistics

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Introductory statistics

STA6938-Logistic Regression Model

Notes on Hypothesis Testing, Type I and Type II Errors

Chapter 13, Part A Analysis of Variance and Experimental Design

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

Transcription:

Norwegia Uiversity of Sciece ad Techology Departmet of Mathematical Scieces Corrected 3 May ad 4 Jue Solutios TMA445 Statistics Saturday 6 May 9: 3: Problem Sow desity a The probability is.9.5 6x x dx =.9.5 6x 6x dx = [3x x 3 ].9.5 =.47. b The likelihood fuctio is give by Lβ = ββ + x i x i β = β β + x i x i β, i= i= i= ad the log likelihood which has derivative l Lβ = l β + lβ + + l x i + β l x i, i= i= l L β = β + β + + l x i. l L is decreasig o, ad the sum of two first terms teds to whe β + ad to whe β, so that l L will have a sigle zero the third term is egative for β > ad be positive left of the zero ad egative right of the zero. This meas that L has its maximum at this zero. Solvig for the zero, β i= l x i + + i= i= l x i β + =,

TMA445, 6 May Solutios Page of 5 we get β = i= l x i ± 4 + i= l x i i= l x i = i= l x i ± i= + l x i 4. We choose the larger zero sice l L has oly oe zero for positive argumets the other we foud must be egative, ad get the maximum likelihood estimator i= + l i 4 i= l i = + l 4 l. For = ad i= l x i = 4. the estimate is.545. /.4 + /4+/.4 / = The discussio of actual attaimet of maximum at the zero ad of which zero to be chose, is ot required. Problem Temperature i March ad April a We ca get a impressio of ormality of or by meas of a histogram Figur a, ad a more accurate assessmet by a ormal quatile quatile plot, which should show liear relatioships betwee ordered observatios agaist ormal quatiles, or a ormal probability plot Figure b could be used, which should show liear relatioships. To assess whether ad are idepedet, we ca plot the values of agaist Figure c. No special patter should emerge, for example the poits beig close to a o-horizotal lie which idicates correlatio. Also a plot of the values of the i ad the i agaist i i the same graph could reveal depedece Figure d. b µ/s/ has the t-distributio with degrees of freedom, so P t α/ < µ S/ < t α/ = α, with degrees of freedom for t α/. Solvig the double iequality for µ, we get 99% cofidece bouds x ± t α/ s/ for µ. Here, =, x = x i / = 9./ =.758, α =., t.5 = 3.6, s = x i x i = 77.7 9. = 6.379, givig bouds.758 ± 3.6 6.379/ =.758 ±.65, ad a cofidece iterval.5, 3..

TMA445, 6 May Solutios Page 3 of 5.5.4.3.. 4 4.7.6.5.4.3.. a Histogram 3 4 Probability.95.9.75.5.5..5 Normal Probability Plot.5.5 Data, Probability b Normal probability Normal Probability Plot.95.9.75.5.5..5 3 Data,.9.8.7.6.5.4.3....4.6.8.9.8.7.6.5.4.3 c y agaist x...4.6.8 Mothly averages 6 5 4 3 4 6 8 ear Mothly averages d x i ad y i agaist i 8 6 4 4 6 8 ear Figure : a Histograms ad b ormal probability plots. Data come from a ormal distributio at the left but ot at the right. c Plots of y agaist x ad d of the x i ad y i agaist i. ad are idepedet at the left, but ot at the right. c We cosider the ull hypothesis H : µ a µ m = 5 or H : µ a µ m 5 ad the alterative hypothesis H : µ a µ m < 5. The observatios come aturally i pairs, ad there is reaso to believe that the March ad the April temperatures of a year are depedet. Therefore we choose a paired test, which is performed as a sigle sample test usig the differeces d i = y i x i. Uder the ull hypothesis, the test statistic T = D 5/S D / has the t distibutio with degrees of freedom. A small value of T is idicative of H, ad the critical value is t.5 =.796. With our data, d = ȳ x = 67.4 9./ = 4.86 ad s D = d i d i = 364.53 67.4 9. = 7.39. So t = 4.86 5/ 7.39/ =.8, which is ot i the critical regio, ad we do ot reject H. At the.5 sigificace level there is ot evidece to state that µ a µ m < 5.

TMA445, 6 May Solutios Page 4 of 5 Problem 3 Chemical factory a The probability that the lamp lights up whe the procedure is performed oce, is P > 5 = P /4 > 5 /4 = P Z >.5 = P Z <.5 =.56, where Z has the stadard ormal distributio. If the procedure is performed three times, let the amouts of by-product be,, 3. The the probability that the lamp lights up at least oce is P > 5 > 5 3 > 5 = P 5, 5, 3 5 = P i 5 3 = P i > 5 3 =.56 3 =.85. Let be the umber of times the lamp lights up whe the procedure is performed times. The has the biomial distributio with parameters = ad p =.56, ad P 5 = P 4.5 = P p p p 4.5.56.56.8944 P Z >.8 = P Z.8 =., usig the ormal approximatio with cotiuity correctio. The exact biomial probability is.4. ou are ot pealized if you have ot applied the cotiuity correctio. If you use 5 istead of 4.5 i the ormal approximatio, you get.74, ad if you use 4 arisig from P 5 = P 4, you get.3 ot very good approximatios. b Let be the umber of times the procedure is performed, ad let,,..., be the amouts of by-product. The total amout of by-product, = i= i, has the ormal distributio with mea ad variace 4. We wat to fid such that. P 5 = P 4 5 4 = P Z 5 4, that is, 5 /4 z. =.36, or 5 5 z., ad we have a quadratic iequality 5 + z. 5 i. The left had side is a dowwardpoitig parabola as a fuctio of havig zeros z. ± z. + 4 5 5 / 5, that is, 5.4 ad 4.77, meaig that 4.77 ad.8, that is, sice is a iteger.

TMA445, 6 May Solutios Page 5 of 5 Problem 4 Barbecue toight? a A Ve diagram is show to the right. P A B > implies that A B, ad A ad B are ot disjoit. P AP B =.4.4 =.6. = P A B, so A ad B are ot idepedet. Note that C = A B. P C = P A B = P A B = P A + P B P A B =.4 +.4. =.4. C A B A C = A A B = A A B A A =, so A ad C are disjoit this is is also obvious from the descriptio of the evets. The coditioal probability of barbeque give o rai is P C A = P C A /P A = P C/ P A =.4/.4 =.67. b ˆα = Ȳ ˆβ x. Sice Ȳ is a liear combiatio of the i, i, ad also ˆβ is a liear combiatio of the i, also ˆα ca be writte as a liear combiatio of the i, which are mutually idepedet ad have ormal distributios, thus ˆα has the ormal distributio. E ˆα = EȲ ˆβ x = Ei β x = α + βxi β x = α + β x β x = α, ad Var ˆα = VarȲ ˆβ x = VarȲ + x Var ˆβ = σ /+ x σ / x i x = / + x / x i x σ which ca be show to be equal to σ x i / x i x. The assumptios are that the i are idepedet variables havig the ormal distributio with mea α + βx i ad variace σ. The Figure idicates that relatioship betwee E ad x might ot be liear, as most poits havig a small or large x i lie above the estimated regressio lie ad most other poits lie below. This could also be due to depedece betwee the i. The assumptio of costat variace seems to be OK. c A estimate of the temperature at : if the temperature at 3: is 5 C, is ˆα+ ˆβ 5 = 6.57 +.43 5 = 4.9. For makig a 95% predictio iterval for a ew observatio correspodig to x, we cosider the variable Ŷ = ˆα + ˆβx α βx ɛ. Sice ˆα ad ˆβ are liear combiatios of,...,, Ŷ is a liear combiatio of,..., ad ɛ, which are all idepedet, so Ŷ has a ormal distributio. Its expected value is EŶ = Eˆα + ˆβx α βx ɛ = ad its variace VarŶ = VarȲ + ˆβx x ɛ = σ + / + x x / i= x i x ˆβ ad Ȳ are idepedet, leadig to a statistic Ŷ / ˆσ + / + x x / i= x i x, which has the t-distributio with degrees of freedom. So Ŷ P t.5 < ˆσ + / + x x / i= x i x < t.5 =.95, with degrees of freedom for t.5. Solvig the double iequality for, we get 95% predictio bouds ŷ ± t.5ˆσ + / + x x / i= x i x for y. Here, = 8, x = 5.5, ŷ = 4.9, t.5 =. = 8 degrees of freedom, givig bouds 4.9±..4 + / + 5 5.5 /5.7 = 4.9±4.39, ad a predictio iterval.5, 9.3.