THE ROYAL STATISTICAL SOCIETY 2015 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 3

Similar documents
THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

THE ROYAL STATISTICAL SOCIETY 2002 EXAMINATIONS SOLUTIONS ORDINARY CERTIFICATE PAPER II

Chapter 7 Comparison of two independent samples

Analysis of variance (ANOVA) Comparing the means of more than two groups

Chapter 12: Inference about One Population

THE ROYAL STATISTICAL SOCIETY 2007 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 2 PROBABILITY MODELS

Lecture 7: Hypothesis Testing and ANOVA

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

STA Module 10 Comparing Two Proportions

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

You may not use your books/notes on this exam. You may use calculator.

Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

Preview from Notesale.co.uk Page 3 of 63

Chapter 5 Confidence Intervals

General Certificate of Education Advanced Level Examination June 2014

Lecture 10 Multiple Linear Regression

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION

Visual interpretation with normal approximation

THE ROYAL STATISTICAL SOCIETY ORDINARY CERTIFICATE PAPER II

Chapter 9 Inferences from Two Samples

Statistical Methods in Natural Resources Management ESRM 304

BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference

Lectures 5 & 6: Hypothesis Testing

Data analysis and Geostatistics - lecture VII

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis

Mathematical Notation Math Introduction to Applied Statistics

Sampling Distributions: Central Limit Theorem

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means

Single Sample Means. SOCY601 Alan Neustadtl

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

State Examinations Commission. Coimisiún na Scrúduithe Stáit. Leaving Certificate Marking Scheme. Mathematics.

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE EXAMINATION MODULE 2

Module 10: Analysis of Categorical Data Statistics (OA3102)

Practice Questions: Statistics W1111, Fall Solutions

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling

4.1 Hypothesis Testing

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

Multiple comparisons - subsequent inferences for two-way ANOVA

STA Module 11 Inferences for Two Population Means

STA Rev. F Learning Objectives. Two Population Means. Module 11 Inferences for Two Population Means

Correlation Analysis

Lecture 3: Inference in SLR

Table 1: Fish Biomass data set on 26 streams

Statistics Handbook. All statistical tables were computed by the author.

STAT Chapter 8: Hypothesis Tests

STATISTICS 4, S4 (4769) A2

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

ANOVA Analysis of Variance

16.400/453J Human Factors Engineering. Design of Experiments II

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

10: Crosstabs & Independent Proportions

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

CH.9 Tests of Hypotheses for a Single Sample

POLI 443 Applied Political Research

Data Analysis and Statistical Methods Statistics 651

CHAPTER 7. Hypothesis Testing

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

Practice Problems Section Problems

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

STA 101 Final Review

Review of Statistics 101

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Mathematical Notation Math Introduction to Applied Statistics

Smoking Habits. Moderate Smokers Heavy Smokers Total. Hypertension No Hypertension Total

Chapter Eight: Assessment of Relationships 1/42

Statistics for Managers using Microsoft Excel 6 th Edition

Module 17: Two-Sample t-tests, with equal variances for the two populations

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Permutation Tests. Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods

QUEEN S UNIVERSITY FINAL EXAMINATION FACULTY OF ARTS AND SCIENCE DEPARTMENT OF ECONOMICS APRIL 2018

Statistics in medicine

Chapter 8 Heteroskedasticity

10.2: The Chi Square Test for Goodness of Fit

CBA4 is live in practice mode this week exam mode from Saturday!

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

Statistical Calculations and Tests Using the TI 83/84.

Non-parametric (Distribution-free) approaches p188 CN

Table 3: Lives of 100 Electric Bulbs

2010 Applied Maths. Advanced Higher Statistics. Finalised Marking Instructions

Section VII. Chi-square test for comparing proportions and frequencies. F test for means

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

Rama Nada. -Ensherah Mokheemer. 1 P a g e

9231 FURTHER MATHEMATICS

1/24/2008. Review of Statistical Inference. C.1 A Sample of Data. C.2 An Econometric Model. C.4 Estimating the Population Variance and Other Moments

MATH 240. Chapter 8 Outlines of Hypothesis Tests

Confidence Intervals, Testing and ANOVA Summary

Transcription:

THE ROYAL STATISTICAL SOCIETY 015 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 3 The Society is providing these solutions to assist candidates preparing for the examinations in 017. The solutions are intended as learning aids and should not be seen as "model answers". Users of the solutions should always be aware that in many cases there are valid alternative methods. Also, in the many cases where discussion is called for, there may be other valid points that could be made. While every care has been taken with the preparation of these solutions, the Society will not be responsible for any errors or omissions. The Society will not enter into any correspondence in respect of these solutions.

1. (i) (ii) x 138.6 x 11.55 cm. 1 (11.55) n 1 1 x 1 138.6 s x 1611.34 0.955 n 1 n 11 1 So s = 0.9775 cm. 1 (0.9775) (Accept both answers if direct from calculator) TOTAL s 95% C.I. for is given by x t. (t on 11 d.f.) 1 (method) n 1 (11 d.f. EITHER here OR in (iii)) 0.9775 which is 11.55.01 i.e. (10.99, 1.171) 1 (.01) 1 1(10.99), 1(1.171) (n 1)s (n 1)s 95% C.I. for given by, ( values on 11 d.f.).1 (method).05.975 110.955 110.955 which is, i.e. (0.479,.75). 1 (1.90), 1 (3.816) 1.90 3.816 Hence 95% C.I. for is (0.69, 1.660). 1 (0.69), 1 (1.660) (iii) (a) H 0: 11 vs H 1: 11. x 0 11.55 11 Test statistic t = t 1.949. s 0.9775 n 1 1 (method), 1 (1.949) Compare with upper 5% point of t 11 which is 1.796. 1 (1.796) Test statistic is significant, so reject H 0 Implies mean assembly time is significantly longer than required. (b) H : 0.7 vs H : 0.7 0 1 (n 1)s 11 (0.9775) Test statistic is 1.450. 0 0.7 1 (method), 1(1.450) Compare with upper 5% point of which is 19.675. 1 (19.675) Test statistic is highly significant, so reject H. 0 Implies assembly times are too variable. 11 TOTAL 8

. (i) Test H : 0 distribution of deliveries is uniform against 1(null hyp correct) distribution is not uniform. Total number of deliveries is 94, so Ei 4 for each day. 1 (E = 4) ( Oi Ei) 17 5 9 6 6 3 1 60 14.76. 1 (method) E 4 4 5% point of i 1(14.76) distribution on 6 degrees of freedom is 1.59.1 (6 df), 1 (1.59) So our result is significant, we reject the null hypothesis and conclude that the distribution of deliveries does not appear to be uniform. 1 (conclusion correct) TOTAL 7 5 30 pˆ(1 pˆ) 0.18710.819 (ii) pˆ 0.1871. s. e.( pˆ ) 0.074. So 95% 94 n 94 confidence interval is 0.1871 1.96 0.074 1 (0.1871 or 55/94), 1 (1.96) 1 (CI formula) i.e. (0.145, 0.317). 1(0.145), 1 (0.317) If the distribution was uniform, then we would expect a proportion 0.857 7 of deliveries to be made at weekends. Our confidence interval lies well below this value, indicating that fewer deliveries than expected are made at weekends. 1 (comment) TOTAL 6 68 (iii) We have pˆ 1 0.1871 as above and pˆ 0.145. 1 (0.145 or 68/317) 317 Wish to test H0 : p1 p against H1 : p1 p. Pooled sample proportion is 55 68 pˆ 0.013 94 317 and 1 (0.013 or 13/611) 1 1 1 1 s. e.( p ˆ ˆ 1 p) p(1 p) 0.013 0.7987 n1 n 94 317 = 0.035. 1 (method), 1 (0.035) pˆ1 pˆ 0.074 Then Z = 0.843. s. e.( pˆ pˆ ) 0.035 1 1 (-0.843) Comparing with the lower 5% point of N(0, 1) which is -1.645, our value is not significant so we do not reject the null hypothesis. Thus we have no evidence that the proportion of births at weekends has risen significantly. 1 (c.f. -1.645) 1 (conclusion correct) TOTAL 7

3(i) Assuming equal population variances, the confidence interval for A B is given by 1 1 ( xa xb) t. s where s is the pooled sample variance. 1 (method) n n A B sa sb We have s (equal sample sizes) = 45.045s 6.71. 1 (method) 1 (6.71 or 45.045) From tables, the t value on 18 degrees of freedom is.101. So we have interval1 (18 df) 1 (.101) (6.65 33.04).101 6.71 1 1 = 6.39 6.307 10 10 1 (method) i.e. (-1.697, -0.083). 1 (-1.697), 1 (-0.083) (0.083 to 1.697 also acceptable) The CI does not include zero, indicating that the mean anxiety levels produced by the two versions may differ. Version A, with the easy questions first, appears to produce lower levels of anxiety. 1 (means differ), 1 (lower levels in A) (ii) H : 0 the two samples come from the same distribution (equal medians).1 (null hyp) the two samples come from different distributions. 1 (alt hyp) The ranks are: 1 (correct test attempted) Version A Value 4.6 39.3 16.3 3.8 8.0 0.6 1.1 6.7 4. 3.9 Rank 7 19 1 1 9 3 8 6 13 Version B Value 38.6 34.0 3.6 30.3 35.9.9 9.5 39. 4.9 33.5 Rank 17 15 5 11 16 4 10 18 0 14 1 (correct ranks) The sums of the ranks are 80 (Version A) and 130 (Version B). 1 (80), 1 (130) The lower of these two values is 80. 1 (choose lower) We compare this with the value from tables with sample sizes 10 and 10 which is 78. 1 (78) Since our value is higher than this critical value we do not reject the null hypothesis. 1 (do not reject null) i.e. the median anxiety levels for the two versions do not appear to be significantly different 1 (conclusion)

4(i) H : 0 there is no association between charging discrepancies and normal/special offer. there is an association. 1 (null hyp), 1( alternative hyp) row total column total For each table cell, expected frequency =. These are given in grand total brackets below: 1 (method or implied by values below) 0 7 1 (all E s correct) 7 (13.81) (13.19) 15 9 44 (.51) (1.49) 384 364 748 (38.68) (365.3) 419 400 819 ( Oi Ei) 1 1 1 1 1 1 6.19 7.51 1.3 E i 13.81 13.19.51 1.49 38.68 365.3 = 5.679 + 5.130 + 0.009 = 10.818. 1 (method), 1(10.818) Comparing with tables and using degrees of freedom, the critical 5% point is 5.991. The calculated value is thus highly significant. So we have strong evidence to reject the null hypothesis in favour of the alternative hypothesis. i.e. the incidence of discrepancy in checkout and shelf prices is highly associated with whether or not items are on special offer. 1 ( df), 1 (5.991), 1 (reject null hyp) (ii) Now have Normal Special offer Incorrect price 35 36 Correct price 384 364 Total 419 400 1 (table adaptation) 35 36 pˆ ˆ 1 p 0.0835 0.09 0.0065. 1 (0.0835 or 35/419), 1 (0.09 or 36/400) 419 400 0.0835 0.9165 0.090.91 Estimated s.e. of this difference is 0.0197 1 (method) 419 400 So the 95% confidence interval is 0.0065 1.96 0.0197 which is (-0.045, 0.03). 1 (CI method), 1 (1.96), 1 (-0.045), 1 (0.03) TOTAL 8 (iii) In (i) we found a difference in price accuracy between the normal priced and special offer items. However, the interval in (ii) contains zero, suggesting that there is no difference in the proportions of incorrect prices for the two categories. This is because the main discrepancy (i.e. largest contributions to overall ) in (i) was between the overcharged/undercharged items whereas in (ii) these categories have been amalgamated. 1 ( appear inconsistent ) 1 (valid explanation) TOTAL