Statistics Revision Solutions

Similar documents
10-701/ Machine Learning Mid-term Exam Solution

Pearson Edexcel Level 3 Advanced Subsidiary and Advanced GCE in Statistics

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Background Information

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Properties and Hypothesis Testing

Solutions to Odd Numbered End of Chapter Exercises: Chapter 4

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Chapter 6 Sampling Distributions

Intermediate Math Circles November 4, 2009 Counting II

Chapter 13, Part A Analysis of Variance and Experimental Design

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

Topic 9: Sampling Distributions of Estimators

M1 for method for S xy. M1 for method for at least one of S xx or S yy. A1 for at least one of S xy, S xx, S yy correct. M1 for structure of r

Common Large/Small Sample Tests 1/55

Sampling Error. Chapter 6 Student Lecture Notes 6-1. Business Statistics: A Decision-Making Approach, 6e. Chapter Goals

Linear Regression Demystified

Module 1 Fundamentals in statistics

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 4

(6) Fundamental Sampling Distribution and Data Discription

Joint Probability Distributions and Random Samples. Jointly Distributed Random Variables. Chapter { }

Topic 10: Introduction to Estimation

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Central Limit Theorem the Meaning and the Usage

Confidence Intervals for the Population Proportion p

Thomas Whitham Form Centre

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Estimation for Complete Data

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

STP 226 EXAMPLE EXAM #1

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

Matrix Representation of Data in Experiment

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Lecture 3. Properties of Summary Statistics: Sampling Distribution

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

MA Advanced Econometrics: Properties of Least Squares Estimators

Sampling Distributions, Z-Tests, Power

7.1 Convergence of sequences of random variables

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

Expectation and Variance of a random variable

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).

INTRODUCTORY MATHEMATICS AND STATISTICS FOR ECONOMISTS

Mathematical Notation Math Introduction to Applied Statistics

MA238 Assignment 4 Solutions (part a)

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

Linear Regression Models

Chapter 1 (Definitions)

11 Correlation and Regression

Stat 139 Homework 7 Solutions, Fall 2015

1 Inferential Methods for Correlation and Regression Analysis

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

Distribution of Random Samples & Limit theorems

Binomial Distribution

Lecture 11 Simple Linear Regression

Data Analysis and Statistical Methods Statistics 651

IE 230 Probability & Statistics in Engineering I. Closed book and notes. No calculators. 120 minutes.

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Chapter 8: Estimating with Confidence

4. Hypothesis testing (Hotelling s T 2 -statistic)

Statistics 511 Additional Materials

(all terms are scalars).the minimization is clearer in sum notation:

Approximations and more PMFs and PDFs

Chapter 7 Student Lecture Notes 7-1

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Algebra of Least Squares

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

7.1 Convergence of sequences of random variables

Regression, Inference, and Model Building

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

Lecture 7: Properties of Random Samples

Statistics 20: Final Exam Solutions Summer Session 2007

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Statistics 203 Introduction to Regression and Analysis of Variance Assignment #1 Solutions January 20, 2005

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

AMS570 Lecture Notes #2

Statistics 3. Revision Notes

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

Comparing your lab results with the others by one-way ANOVA

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

Topic 9: Sampling Distributions of Estimators

MATHEMATICAL SCIENCES PAPER-II

[ 11 ] z of degree 2 as both degree 2 each. The degree of a polynomial in n variables is the maximum of the degrees of its terms.

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all!

n outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n,

STAT 516 Answers Homework 6 April 2, 2008 Solutions by Mark Daniel Ward PROBLEMS

5. A formulae page and two tables are provided at the end of Part A of the examination PART A

Statistical inference: example 1. Inferential Statistics

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

Combining. random variables

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

3sin A 1 2sin B. 3π x is a solution. 1. If A and B are acute positive angles satisfying the equation 3sin A 2sin B 1 and 3sin 2A 2sin 2B 0, then A 2B

Transcription:

Statistics Revisio Solutios (i) H ~N (00, ) ad W ~N (7, 9 ) P ( 7.<W < 7) 0.0 (ii) H + H W ~ (0, () + (8) ) N ( H + H W > 0) 0. 978 P (iii) H + W ~N (7, ) P ( H + W > A) > 0.9 P( H + W < A) < 0.0 A< ivnorm(0.0, 7, ) 0. 8 ad A 0 (i) M ~N (80, ) ad W ~N(, ) max T M + M + M + W + W + W + W ~ N(80 +, + 7) P ( T > 00) 0.08 (ii) T ( M + M + M ) ( W + W + W + W ) ~ (, ' N 7) P ( T ' > 0) 0.78. P ( X > b) 0. P( X < b) 0. b ivnorm(0., 0, ). P ( X < a) 0. a ivnorm(0., 0, ) 0.7 ( a, b) (, ) (i) X ~N(0, ) P ( <X < ) 0.7 (ii) Y ~B(0, 0.) P ( Y ) 0. (i) S ~N(0,. ) ad T ~N(8, ) P ( S < 0) P( T < ) 0.

(ii) T S ~N (8, 8.) P ( T S > ) 0.787 (iii) S T ~ N(,. +.) P S T > 0 0.. X ~N(, ) ad Y ~N(, ) X + X Y ~ N(, ) ( X + X Y < ) P( < X + X Y < ) 0. 89 P P X ~N(0, ) ad ~N(, P Y ( 8) 0) T 0 PX + P i Y ~ N(00+ 0 0, 0 + 0 70) i j j P ( T > 80) 0.707 (i) X ~N(0, 0 ) [ P( X )] P ( X < m) P( X > m) < m P ( X < m) or P ( X <m) m ivnorm(, 0, 0) 0.8 (ii) Let Y be the rv deotig the umber of orages (out of 0) each havig mass more tha m grams. The Y ~B(0, ) 0 Sice p.7< ad p < 0., Y ~ P o (.7) approximately. P( more tha orages have mass less thamgrams) P( less tha orages have mass more thamgrams) P ( Y < ) P( Y ) 0.8

7(i) Possible digit combiatio 9 9 7 8 9 7 9 9 8 8 8 8 9 8 8 9 9 9 9 9 Probability (icludig permutatio)!!!!!!!!! 0 0 0 0 0 0 000 000 000 000 000 000 0 000 0 Total 000 0 (ii) P( umbers are idetical sum of umbers is at least ) + 000 000 0 0 (Note that 888 ad 999 are the oly two combiatios which satisfy the umerator, ie the umbers are idetical ad their sum is at least ) 8 (i) Number of ways to seat childre (without ay restrictios) i a circle ( )! 0! Firstly, umber of ways to arrage the boys! Secodly, umber of ways to slot the girls betwee the boys C!! Required probability!! 0! (ii) Groupig J, C ad D together as a sigle uit, umber of ways to arrage the childre such that J sat betwee C ad D ( 9 )!! 8!! (ote that C ad D ca switch sides)

Number of ways to arrage the childre such that J, C ad D simply sat together ( )! 8!!! 9 (ote that J, C ad D ca be freely permutated withi the uit) Required probability 8! 8! 9 (i) Su(Y) Sat(Y) Su (N) Fri (Y) Sat(N) Su(Y) Su(N) Su(Y) Sat(Y) Su (N) Fri (N) Sat(N) Su(Y) Su(N) (Y): go o that day (N): ot goig o that day P(A goes o Su) 0 + + + (ii) P( A will go o Fri but ot o Su) +

P (A will go o Fri) Hece, P (A will ot go o Su A will go o Fri) (iii) P(A goes o both Fri ad Su) + 7 0 Hece, P( A goes o Fri A goes o Su) 7 7 0 0 9 0 (i) Ubiased estimate of populatio mea. 9 00 9 Ubiased estimate of populatio variace 779. 99 00. (ii) Sice 00 is large, by CLT, X ~N.9, approx 00 P ( X <.7) 0.9 (iii) Sice 0 is large, by CLT, T X + X +... + X 0 ~ N(.9 0 7.,. 0.) approx P ( T < 70) 0.. X ~ N( µ, σ ) µ P ( X < ) 0.07 PZ < 0.07 σ µ ivnorm(0.07).7 () σ. µ P ( X >.) 0.0 P( X <.) 0.9 PZ < 0.9 σ. µ ivnorm(0.9). () σ Solvig () ad () gives σ 0.9, µ.8kg P( exactly large, exactly small, exactly betwee large ad small)

! (0.0)(0.07)( 0.0 0.07) 0.08 (i) To test: H : µ 7. H : µ 7. 0 x.9, σ.8, 0 < Usig the Z test, by the GC, p 0.09> 0. 0 H 0 is ot rejected ad there is isufficiet evidece at the % level to suggest that the compay is overstatig the mass of coffee per jar. x µ (ii) Test statistic Z < ivnorm( 0.0). 07 σ x 7. <.07 x<.7.8 0 (iii) Sice X ~ N µ σ,, the σ X µ ~ N 0, 0. 0. ( X µ < 0.) P( 0.< X < 0.) P < Z < 0. 98 P µ.8.8 0. 0.98 The above ca be re-iterpreted as P Z < 0. 0.8 0. ivnorm(0.0) 70.8 (iv) Sample Variace s.8 Hece ubiased estimate of the populatio variace s (.8) 8 0 9

x µ Test statistic Z. σ (i) Sice ( y) ( where σ 8) x, lie o the regressio lie ad it is give that x 00, The sample mea of weights y 0.09(00) 90 (ii) Sample variace for heights ( x x) [ ] 0 ( x x) 0 () Sample variace for weights ( y y) Also, b ( x x)( y y) ( x x) 0.09 [ ] ( y y) () Substitutig () ito the above gives ( x x)( y y) 0.09( 0 ) () Substitutig () ad () ito ( x x)( y y) ( y y) 0.09 0 d gives ( ) d 9 Equatio of lie of regressio of x o yis x c+ dy c+ 9y Sice (00. ) lies o the lie, c 00 9() ad hece x + 9y (iii) r bd 0.8 r 0. 9 A strog value for the coefficiet r deotes a high degree of similarity (coicidece) betwee both regressio lies. (iv) y 0.09(00) 90 kg (a) 0)., 0 x (x ( 0) 797( 0.) 79. 7 ( x 0)( y 0) 97(0.) 9. 7 r S S xx xy S yy 9.7. 79.7 0 (.) (88) 0 88 0 0.7

It appears that there is a strog egative liear associatio betwee the scores o a fitess test ad the weights of the participatig studets. (b) L a+ bw () b S S LW WW LW W L ( W) W 0.77 00.0 7.00 L 0.0, W 8. 8 0 0 Substitutig the above quatities ito (), a L bw 0.0 0.77(8.8). Hece, equatio of regressio lie is give by L.+ 0. 77W.. X ~ N, P ( X >.) < 0.0 P( X <.) > 0. 99 P Z.. < > 0.99 > ivnorm(0.99)... Solvig gives > 97. 98 Yes, cetral limit theorem (CLT) was used i approximatig X to a ormal distributio based o the fact that the sample size (umber of observatios) was large. (a) A B 0 x x x 0 (i) P ( A)

(ii) Number of boys that are shortsighted 0 (this is represeted by ( A B)' ) ( 0 x ) + ( x) + ( x) + x A B ( A B ) 0 + ( ) 80 80 P P (iii) ( A B ) (iv) P( A B) P ( A B) P( B) 9 (b) (i) 9 77 (ii)!! 80 79 8 (iii) (!) 0 70 7 shortsighted o-shortsighted shortsighted girl girl boy 7(i) Let X be the radom variable deotig the umber of days a studet is late from Moday to Wedesday. The X ~B(, 0.09) Required probability P ( X ) (0.009) 0. 0009 (ii) ( P studet is ot late o days) ( 0.009) 0. 9 Let Y be the radom variable deotig the umber of studets (out of a class of ) who are ot late for a stretch of days. The Y ~B(, 0.9) ad P ( Y ) 0.

(iii) P ( at least latecomer i a class of i a give school week of days) 0. 0.77 Let Y ' be the radom variable deotig the umber of weeks (out of 80 weeks) where there is at least oe latecomer. The Y ' ~ B(80, 0.77) Sice p.> ad q.8>, Y ' ~ N(., 7.9) approx. P ( Y ' > 0) P( Y' > 0.) 0.809 (cotiuity correctio has to be used) 8 (a) X ~N(0, 0. ) T X + X + X + X + X ~ (0, 0. 0.) N P ( 0.<T <.) 0. (b) Y ~N(0, 0. ) S ~ N( 0 0, 0. 0.), W ~ N( 0 0, 0. 0.) ad S W ~N(0, 0.9) P ( S < W + ) P( S W < ) 0.97 (c) 0. L ~N 0,. 0 L 0 ~ N(0,. 0 ) P ( L 0 a) 0.0 P( L 0 a) + P( L 0 a) 0. 0 Sice the distributio of ( L 0) is cetred at zero, P ( L 0 a) 0. 0 P Z a. 0 0.0 P ( Z a) 0.0 a ivnorm(0.0). a 0.09