This page intentionally left blank

Size: px
Start display at page:

Download "This page intentionally left blank"

Transcription

1

2

3 This page intentionally left blank

4

5 Copyright 006 New Age International (P) Ltd., Publishers Published by New Age International (P) Ltd., Publishers All rights reserved. No part of this ebook may be reproduced in any form, by photostat, microfilm, xerography, or any other means, or incorporated into any information retrieval system, electronic or mechanical, without the written permission of the publisher. All inquiries should be ed to ISBN : PUBLISHING FOR ONE WORLD NEW AGE INTERNATIONAL (P) LIMITED, PUBLISHERS 4835/4, Ansari Road, Daryaganj, New Delhi Visit us at

6 PREFACE Statistics is a subject used in research and analysis of data in almost all fields. Official government statistics are our old records and creates historical evidences. Many people have contributed to the refinement of statistics, which we use today in various fields. It is a long process of development. Today we have many statistical tools for application and analysis of data in various fields like business, medicine, engineering, agriculture, management etc. Many people feel difficult to find which statistical technique is to be applied and where. Even though computer softwares have minimized the work, a basic knowledge is must for proper application. This book is providing the important and widely used statistical tests with worked out examples and exercises in real life applications. It is presented in a simple way in an understandable manner. It will be useful for the researchers to apply these tests for their data analysis. The statisticians also find it useful for easy reference. It is good companion for all who need statistical tools for their field. The author is greatly indebted to the Authorities of Annamalai University for permitting to publish this book. V. Rajagopalan

7 This page intentionally left blank

8 CONTENTS Preface... v. INTRODUCTION PARAMETRIC TESTS Test Test for a Population Proportion... 9 Test Test for a Population Mean (Population variance is known)...3 Test 3 Test for a Population Mean (Population variance is unknown)...6 Test 4 Test for a Population Variance (Population mean is known)...0 Test 5 Test for a Population Variance (Population mean is unknown)...4 Test 6 Test for Goodness of Fit...7 Test 7 Test for Equality of two Population Proportions...30 Test 8 Test for Equality of two Population Means (Population variances are equal and known)...33 Test 9 Test for Equality of two Population Means (Population variances are unequal and known)...36 Test 0 Test for Equality of two Population Means (Population variances are equal and unknown)...39 Test Test for Paired Observations...4 Test Test for Equality of two Population Standard Deviations...45 Test 3 Test for Equality of two Population Variances...48 Test 4 Test for Consistency in a table...53 Test 5 Test for Homogeneity of Several Population Proportions...56 Test 6 Test for Homogeneity of Several Population Variances (Bartlett's test)...60 Test 7 Test for Homogeneity of Several Population Means...65 Test 8 Test for Independence of Attributes...70 Test 9 Test for Population Correlation Coefficient Equals Zero...74 Test 0 Test for Population Correlation Coefficient Equals a Specified Value...78 Test Test for Population Partial Correlation Coefficient...8 Test Test for Equality of two Population Correlation Coefficients...83 Test 3 Test for Multiple Correlation Coefficient...86

9 viii Contents Test 4 Test for Regression Coefficient...88 Test 5 Test for Intercept in a Regression ANALYSIS OF VARIANCE TESTS Test 6 Test for Completely Randomized Design...97 Test 7 ANOCOVA Test for Completely Randomized Design... 0 Test 8 Test for Randomized Block Design Test 9 Test for Randomized Block Design... 5 (More than one observation per cell) Test 30 ANOCOVA Test for Randomized Block Design... 0 Test 3 Test for Latin Square Design... 7 Test 3 Test for Factorial Design... 3 Test 33 Test for 3 Factorial Design Test 34 Test for Split Plot Design... 4 Test 35 ANOVA Test for Strip Plot Design MULTIVARIATE TESTS Test 36 Test for Population Mean Vectors (Covariance matrix is known)...57 Test 37 Test for Population Mean Vector (Covariance matrix is known) Test 38 Test for Equality of Population Mean Vectors (Covariance matrices are equal and known) Test 39 Test for Equality of Population Mean Vectors (Covariance matrices are equal and unknown) Test 40 Test for Equality of Population Mean Vectors (Covariance matrices are unequal and unknown) NON-PARAMETRIC TESTS Test 4 Sign Test for Median Test 4 Sign Test for Medians (Paired observations) Test 43 Median Test Test 44 Median Test for two Populations... 8 Test 45 Median Test for K Populations Test 46 Wald Wolfowitz Run Test Test 47 Kruskall Wallis Rank Sum Test (H Test) Test 48 Mann Whitney Wilcoxon Rank Sum Test... 9 Test 49 Mann Whitney Wilcoxon U-Test Test 50 Kolmogorov Smirnov Test for Goodness of Fit Test 5 Kolmogorov Smirnov Test for Comparing two Populations Test 5 Spearman Rank Correlation Test... 0 Test 53 Test for Randomness Test 54 Test for Randomness of Rank Correlation Test 55 Friedman's Test for Multiple Treatment of a Series of Objects... 07

10 Contents ix 6. SEQUENTIAL TESTS Test 56 Sequential Test for Population Mean (Variance is known)... 3 Test 57 Sequential Test for Standard Deviation (Mean is known)...6 Test 58 Sequential Test for Dichotomous Classification... 8 Test 59 Sequential Test for the Parameter of a Bernoulli Population... 0 Test 60 Sequential Probability Ratio Test TABLES REFERENCES

11 CHAPTER INTRODUCTION Testing of Statistical hypotheses is a remarkable aspect of statistical theory, which helps us to make decisions where there is a lack of uncertainty. There are many real life situations where we would like to take a decision for further action. Further, there are some problems, for which we would like to determine whether the claims are acceptable or not. Suppose that we are interested to test the following claims:. The average consumption of electricity in city A is 75 units per month.. Bath soap B reduces the rate of skin infections by 50%. 3. Oral polio vaccine is more potent than parenteral polio vaccine. 4. A new variety of paddy yields 6.5 tones per hectare. 5. Drug C produces less drug dependence than drug D. 6. Health drink E improves weight gain by 5% for children. 7. Plant produced by cloning grows 50% faster than the ordinary one. 8. Door-to-door campaign increases the sales of a washing powder by 0%. 9. Machine F produces items within specifications than Machine G. 0. The defective items in a large consignment of coconut is less than 4%. These are a few of the many varieties of problems, which can be solved, only with the help of statisticians. To solve such problems, we need the following basic and important concept in statistics theory, as follows.. POPULATION In any statistical investigation, the interest usually lies in the assessment of general magnitude with respect to one or more characters relating to individuals belonging to a group. Such group of individuals under study is called population. The number of units in any population is known as population size, which may be either finite or infinite. In a finite population, the size is denoted by, N. Thus in statistics, population is an aggregate of objects, animate or inanimate under study. In statistical survey, complete enumeration of population is tedious, if the population size is too large or infinite. In some situations, even though, 00% inspection is possible, the units are destroyable during the course of inspection. As there are various constraints in conducting complete enumeration namely man-power, time factor, expenditure etc., we take the help of sampling.

12 Selected Statistical Tests. SAMPLE A finite, small subset of units of a population is called a sample and the number of units in a sample is called sample size and is denoted by n. The process of selecting a sample is known as sampling. Every member of a sample is called sample unit and the numerical values of such sample units are called observations. If each unit of population has an equal chance of being included in it, then such a sample is called random sample. A sample of n observations be denoted by X, X,, X n. 3. PARAMETERS The statistical measures namely mean, standard deviation, variance, correlation coefficient etc., if they are calculated based on the population are called parameters. If the population information is neither available completely nor finite, parameters cannot be evaluated. In such cases, the parameters are termed as unknown. 4. STATISTICS The statistical measures, if they are obtained, based on the sample alone, they are called statistics. Any function of sample observations is also known as a statistic. The following are the list of standard symbols used for parameters and statistics: Statistical measures Parameter Statistic Mean µ X Median M m Standard deviation σ s Variance σ s Proportion P p Correlation coefficient ρ r Regression coefficient β b 5. SAMPLING ERROR Errors arise because only a part of the population is (i.e., sample) used to estimate the parameters and drawing inferences about the population. Such error is called sampling error. 6. STATISTICAL INFERENCE The process of ascertaining or arriving valid conclusions to the population based on a sample or samples is called statistical inference. It has two major divisions namely, estimation and testing of hypothesis. 7. ESTIMATION When the parameters are unknown, they are estimated by their respective statistics based on the samples. Such a process is called estimation. If an unknown parameter is estimated by a specific statistic, it is called an estimator. For example, the sample mean is an estimator to the population mean. If a specific value is used for estimating, the unknown parameter is called an estimate. It is broadly classified into two types namely point and interval estimation.

13 Introduction 3 8. POINT AND INTERVAL ESTIMATION If a single value is used as an estimate to the unknown parameter, it is called as point estimate and if we choose two values a and b (a < b) so that the unknown parameter is expected to lie in between a and b. Such an interval (a, b), found for estimating the parameter is called as an interval estimate. 9. TESTING OF HYPOTHESIS Hypothesis testing begins with an assumption or hypothesized value that we make about the unknown population parameter. The sample data are collected and sample statistics are obtained from it. These statistics are used to test the assumption about the parameter whether we made is correct. The difference between the hypothesized value and the actual value of the sample statistic is determined. Then we decide whether the difference is significant or not. The smaller the difference, the greater the likelihood, that our hypothesized value is correct. We cannot accept or reject the hypothesized value about a population parameter simply by intuition. The statistical tests for testing the significance of the difference between the hypothesized value and the actual value of the sample statistic or the difference between any set of sample statistics are called tests of significance. 0. STANDARD ERROR The standard deviation of any statistic is known as its standard error and it is abbreviated as S.E. It plays an important role in statistical tests. List of standard errors of some well-known statistics for large samples are given below: S.No. Statistic Standard error X σ / n p PQ / n 3 s σ / n 4 s σ / n 5 r ( ρ )/ n 6 ( X X ) σ n σ + n 7 ( s s ) 8 ( p p ) σ σ + n n P Q n P Q + n. PARAMETRIC TESTS The statistical tests for testing the parameters of the population are called parametric tests. The different kinds of parametric tests are studied in Chapter.

14 4 Selected Statistical Tests The following are the test procedures that we adopt in studying the parametric tests in a systematic manner:. Null Hypothesis It is a tentative statement about the unknown population parameter. It is to be tested based on the sample data. It is always of no difference between the hypothesized value and the actual value of the sample statistic. It is to be tested, for possible rejection under the assumption that it is true. It is usually denoted by H 0.. Alternative Hypothesis Any hypothesis, which is complementary to the null hypothesis, is called an alternative hypothesis. It is usually denoted by H..3 Type-I and Type-II Errors In hypothesis testing, we draw valid inferences about the population parameters on the basis of the sample data alone. Due to sampling errors, there may be a possibility of rejecting a true null hypothesis, called as Type-I error and of accepting a false null hypothesis, called as Type-II error are tabulated as follows: Situation H 0 is true H 0 is false Conclusion (H is false) (H is true) H 0 is accepted Correct Type-II (H is rejected) Decision Error H 0 is rejected Type-I Correct (H is accepted) Error Decision The acceptance or rejection of H 0 depend on the test criterion that is used in hypothesis testing. In any hypothesis testing, we would like to control both Type-I and Type-II errors. The probability of committing Type-I error is denoted by α and the probability of committing Type-II error is denoted by β..4 Level of Significance There is no standard or universal level of significance for testing hypotheses. In some instances, a 5 percent level or percent of significance are used. However, the choice of the level of significance must be at minimum. The higher the significance level leads to higher the probability of rejecting a null hypothesis when it is true. Usually, the level of significance is the size of the Type-I error, i.e., either 5% or %, is to be fixed in advance before collecting the sample information..5 Critical Region A region corresponding to a statistic, t in the sample space S which amounts to rejecting of H 0 is termed as region of rejection or critical region. If ω is the critical region and if t is a statistic based on a sample of size, n then P (t ω H 0 ) = α. That is, the null hypothesis is rejected, if the observed value falls in the critical region. The boundary value of the critical region is called as critical value. Let it be Z α..6 One-sided and Two-sided Tests In any test, the critical region is represented by a portion of area under the probability curve of the sampling distribution of the statistic. In a statistical test, if the alternative hypothesis is one-sided (left-

15 Introduction 5 sided or right-sided) is called a one-sided test. For example, a test for testing the mean of a population, H 0 : µ = µ 0 against the alternative hypothesis H : µ < µ 0 (left-sided) or H : µ > µ 0 (right-sided) and for testing H 0 against H : µ µ 0 (two-sided) is known as two-sided test..7 Test Statistic A statistical test is conducted by means of a test statistic for which the probability distribution is determined by the assumption that the null hypothesis is true. It is based on the statistic, the expected value of the statistic (hypothesized value assumed in H 0 ) and the standard error of the statistic. The value so obtained as test statistic value based on the observed data is called observed value of the test statistic, let it be Z, and we use this value for arriving conclusion..8 Conclusion By comparing the two values namely, the observed value of the test statistic and the critical value, the conclusion is arrived at. If Z Z α, we conclude that there is no evidence against the null hypothesis H 0 and hence it may be accepted. If Z > Z α, we conclude that there is evidence against the null hypothesis H 0 and in favor of H. Hence, H 0 is rejected and alternatively, H is accepted.. ANALYSIS OF VARIANCE It is a powerful statistical tool in tests of significance. In parametric tests, we discussed the statistical tests relating to mean of a population or equality of means of two populations. In situations, when we have three or more samples to consider at a time, an alternative procedure is needed for testing the hypothesis that all the samples are drawn from the same populations, which have the same mean. Analysis of variance (ANOVA) was introduced by R.A. Fisher to deal the problem in the analysis of agricultural data. Variations in the observations are inherent in nature. The total variation in the observed data is due to the following two causes namely, (i) assignable causes, and (ii) chance causes. By this technique, the total variation in the sample data can be bifurcated into variation between sample and variation within samples. The second kind of variation is due to experimental error. These kinds of tests are very much applicable in agricultural field experiments, where they want to know the yield of different kinds of seeds, fertilizers adopted, pesticides used, different irrigation, cultivation method etc., accordingly there are different types of ANOVA tests available and are provided in Chapter 3. In ANOVA tests, we need the following terms with their definitions:. Treatments Various factors or methods that we adopted in a comparative experiment are termed as treatments. For example, in field experiments, different varieties of paddy seeds, different kinds of fertilizers, different methods of cultivation etc., are called treatments.. Experimental Unit A small area of experimental material is used for applying the treatment is called an experimental unit. In agricultural experiments, a cultivated land, usually called as experimental material is divided into smaller areas of plots in which, different treatment can be applied in it. Such kind of plots are called experimental units.

16 6 Selected Statistical Tests.3 Blocks In field experiments, the experimental material is firstly divided into relatively homogeneous divisions, known as Blocks. All the blocks are further divided into small plots of experimental units..4 Replication The repetition of the treatments to the experimental units more number of times under investigation is called replication. In agricultural experiments, each block will receive all the treatments and in every block the similar treatments are repeated according to the number of blocks available. Hence, in analysis, the number of blocks will be same as number of replications..5 Randomization The adoption of various treatments to the experimental units in a random manner is called randomization. Different kinds of randomization will be adopted in the ANOVA tests, namely, complete randomization, randomization within blocks, row-wise, column-wise etc., according to the types of experimental designs. 3. MULTIVARIATE DATA ANALYSIS The data and analysis that we consider for more than one character (variable) plays an important role in the theory of statistics, usually called as multivariate analysis. Such kind of data will be in two dimensions. For example, in the study of physical characters namely, age (X ), height (X ), weight (X 3 ) of N individuals, it can be arranged into a two dimensional data in the form of a matrix of order, 3 N observations, the one direction being the sample numbers and the other being the variables. Hence, matrix theory has a major role in multivariate data analysis and the readers should have knowledge on matrix algebra. The tests of significance relating to multivariate data are provided in Section NON-PARAMETRIC METHODS The hypothesis tests mentioned above have made inferences about population parameters. These parametric tests have used the parametric statistics of samples that came from the population being tested. For those tests, we made the assumption about the population from which the samples were drawn. There are tests, which do not have any restriction or assumption about the population from which we sampled. They are known as distribution free or non-parametric tests. The hypotheses of non-parametric tests are concerned with something other than the value of a population parameter. Such different kinds of non-parametric tests are discussed in Chapter SEQUENTIAL TESTS The statistical tests mentioned earlier are based on fixed sample size. That is, the number of sample observations for those tests are constants. However, in sequential tests, the number of observations required depends on the outcome of the observations and is therefore, not pre-determined, but a random variable. The sequential test for testing hypothesis, H 0 against H is described as follows. At each stage of the experiment, the sample observation is drawn and making any one of the following three decisions namely (i) accepting H 0, (ii) rejecting H 0 ( or accepting H ) and (iii) continue the experiment by making an additional observation. Thus, such a test procedure is carried out sequentially. Some of the sequential tests are provided in Chapter 6.

17 CHAPTER PARAMETRIC TESTS

18 THIS PAGE IS BLANK

19 TEST TEST FOR A POPULATION PROPORTION Aim To test the population proportion, P be regarded as P 0, based on a random sample. That is, to investigate the significance of the difference between the observed sample proportion p and the assumed population proportion P 0. Source If X is the number of occurrences of an event in n independent trials with constant probability P of occurrences of that event for each trial, then E(X ) = np and V(X ) = npq, where Q = P, is the probability of non-occurrence of that event. It has proved that for large n, the binomial distribution tends to normal distribution. Hence, the normal test can be applied. In a random sample of size n, let X be the number of persons possessing the given attribute. Then the observed proportion in the sample be X = p, n (say), then E(p) = P and S.E(p) = Var( p) P( P) =. n Assumption The sample size must be sufficiently large (i.e., n > 30) to justify the normal approximation to binomial. Null Hypothesis H 0 : The population proportion (P) is regarded as P 0. That is, there is no significant difference between the observed sample proportion p and the assumed population proportion P 0. i.e., H 0 : P = P 0. Alternative Hypotheses H () : P P 0 H () : P > P 0 H (3) : P < P 0

20 0 Selected Statistical Tests Level of Significance ( a) and Critical Region () Z > Z such that P { Z > Z } = α α α α/ α/ 0 Zα/ Z α/ () Z > Z α such that P {Z > Z α } = α 0 Zα α (3) Z < Z α such that P {Z < Z α } = α α Zα 0

21 Parametric Tests Critical Values ( Z a ) Critical value Level of Significance (α) (Z α ) % 5% 0%. Two-sided test Z α =.58 Z α =.96 Z α =.645. Right-sided test Z α =.33 Z α =.645 Z α =.8 3. Left-sided test Z α =.33 Z α =.645 Z α =.8 Test Statistic Z = p P P( P) n The statistic Z follows Standard Normal Distribution. Conclusions (Under H 0 : P = P 0 ). If Z Z α, we conclude that the data do not provide us any evidence against the null hypothesis H 0. Hence, it may be accepted at α% level of significance. Otherwise reject H 0 or accept H ().. If Z Z α, we conclude that the data do not provide us any evidence against the null hypothesis H 0 and hence it may be accepted at α% level of significance. Otherwise reject H 0 or accept H (). 3. If Z Zα, we conclude that the data do not provide us any evidence against the null hypothesis H 0 and hence it may be accepted at α% level of significance. Otherwise reject H 0 or accept H (3). Example Hindustan Lever Ltd. Company expects that more than 30% of the households in Delhi city will consume its product if they manufacture a new face cream. A random sample of 500 households from the city is surveyed, 63 are favorable in manufacturing the product. Examine whether the expectation of the company would be met at % level. Solution Aim: To test the HLL Company s manufacture of a new product of face cream will be consumed by 30% of the households in New Delhi or more. H 0 : The HLL Company s manufacture of a new product of face cream will be consumed by 30% of the households in New Delhi. i.e., H 0 : P = 0.3. H : The HLL Company s manufacture of a new product of face cream will be consumed by more than 30% of the households in New Delhi. i.e., H : p > 0.3

22 Selected Statistical Tests Level of Significance: α = 0.05 and Critical Value: Z α =.645 Based on the above data, we observed that, n = 500, p = (63/500) = 0.36 Test Statistic: Z = p P P( P) n (Under H 0 : P = 0.3) = (0.3)(0.7) 500 =.7 Conclusion: Since Z < Z α, we conclude that the data do not provide us any evidence against the null hypothesis H 0. Hence, accept H 0 at 5% level of significance. That is, the HLL Company s manufacture of a new product of face cream will be consumed by 30% of the households in New Delhi. Example A plastic surgery department wants to know the necessity of mesh repair of hernia. They think that 5% of the hernia patients only need mesh. In a sample of 50 hernia patients from hospitals, 4 only needed mesh. Test at % level of significance that the expectation of the department for mesh repair of hernia patients is true. Solution Aim: To test the necessity of hernia repair with mesh is 5% or not. H 0 : The necessity of mesh repair of hernia is 5%. i.e., H 0 : P = 0.5 H : The necessity of mesh repair of hernia is not 5%. i.e., H : P 0.5 Level of Significance: α = 0.0 and Critical Value: Z α =.33 Based on the above data, we observed that, n = 50, p = (4/50) = 0.36 Test Statistic: Z = p P P( P) n (Under H 0 : P = 0.5) = (0.5)(0.85) 50 = 0.80 Conclusion: Since Z < Z α, we conclude that the data do not provide us any evidence against the null hypothesis H 0. Hence, accept H 0 at % level of significance. That is, the necessity of mesh repair of hernia as expected by the plastic surgery department 5% is true. EXERCISES. A random sample of 400 apples was taken from large consignment and 35 were found to be bad. Examine whether the bad items in the lot will be 7% at % level.. 50 people were attacked by a disease of which 5 died. Will you reject the hypothesis that the death rate, if attacked by this disease is 3% against the hypothesis that it is more, at 5% level?

23 TEST TEST FOR A POPULATION MEAN (Population Variance is Known) Aim To test the population mean µ be regarded as µ 0, based on a random sample. That is, to investigate the significance of the difference between the sample mean X and the assumed population mean µ 0. Source Let X be the mean of a random sample of n independent observations drawn from a population whose mean µ is unknown and variance σ is known. Assumptions (i) (ii) Null Hypothesis The population from which, the sample drawn, is assumed as Normal distribution. The population variance σ is known. H 0 : The sample has been drawn from a population with mean µ be µ 0. That is, there is no significant difference between the sample mean X and the assumed population mean µ 0. i.e., H 0 : µ = µ 0. Alternative Hypotheses H () : µ µ 0 H () : µ > µ 0 H (3) : µ < µ 0 Level of Significance (a) and Critical Region: (As in Test )

24 4 Selected Statistical Tests Test Statistic Z = X µ The Statistic Z follows Standard Normal distribution. Conclusions (As in Test ) Example σ / n (Under H 0 : µ = µ 0 ) The daily wages of a Factory s workers are assumed to be normally distributed. A random sample of 50 workers has the average daily wage of rupees 0. Test whether the average daily wages of that factory be regarded as rupees 5 with a standard deviation of rupees 0 at 5% level of significance. Solution Aim: Our aim is to test the null hypothesis that the average daily wage of the Factory s workers be regarded as rupees 5 with standard deviation of rupees 0. H 0 : The average daily wage of the Factory s workers is 5 rupees. i.e., H 0 : µ = 5. H : The average daily wage of the Factory s workers is not 5 rupees. i.e., H : µ 5. Level of Significance: α = 0.05 and Critical Value: Z α =.96 Test Statistic: Z = X µ σ / n (Under H 0 : µ = 5) = 0 5 0/ 50 =.77. Conclusion: Since the observed value of the test statistic Z =.77, is smaller than the critical value.96 at 5% level of significance, the data do not provide us any evidence against the null hypothesis H 0. Hence it is accepted and concluded that the average daily wage of the Factory s workers be regarded as rupees 5 with a standard deviation of rupees 0. Example A bulb manufacturing company hypothesizes that the average life of its product is,450 hours. They know that the standard deviation of bulbs life is 0 hours. From a sample of 00 bulbs, the company finds the sample mean of,390 hours. At a % level of significance, should the company conclude that the average life of the bulbs is less than the hypothesized,450 hours? Solution Aim: Our aim is to test whether the average life of bulbs is regarded as,450 hours or less. H 0 : The average life of bulbs is,450 hours. i.e., H 0 : µ = 450. H : The average life of bulbs is below,450 hours. i.e., H : µ < 450. Level of Significance: α = 0.0 and Critical Value: Z α =.33

25 Parametric Tests 5 Test Statistic: Z = X µ σ / n (Under H 0 : µ = 450) = =.86 0 / 00 Conclusion: Since the observed value of the test statistic Z =.86, is smaller than the critical value.33 at % level of significance, the data provide us evidence against the null hypothesis H 0 and in favor of H. Hence, H is accepted and concluded that the average life of the bulbs is significantly less than the hypothesized,450 hours. EXERCISES. A Film producer knows that his movies ran an average of 00 days in each cities of Tamilnadu, and the corresponding standard deviation was 8 days. A researcher randomly chose 80 theatres in southern districts and found that they ran the movie an average of 86 days. Test the hypotheses at % significance level.. A sample of 50 children observed from rural areas of a district has an average birth weight of.85 kg. The past record shows that the standard deviation of birth weight in the district is 0.3 kg. Can we expect that the average birth weight of the children in the district will be more than 3 kg at 5% level?

26 TEST 3 TEST FOR A POPULATION MEAN (Population Variance is Unknown) Aim To test that the population mean µ be regarded as µ 0, based on a random sample. That is, to investigate the significance of the difference between the sample mean X and the assumed population mean µ 0. Source A random sample of n observations X i, (i =,,, n) be drawn from a population whose mean µ and variance σ are unknown. Assumptions (i) (ii) Null Hypothesis The population from which, the sample drawn is Normal distribution. The population variance σ is unknown. (Since σ is unknown, it is replaced by its unbiased estimate S ) H 0 : The sample has been drawn from a population with mean µ be µ 0. That is, there is no significant difference between the sample mean X and the assumed population mean µ 0. i.e., H 0 : µ = µ 0. Alternative Hypotheses H (): µ µ 0 H (): µ > µ 0 H (3): µ < µ 0

27 Parametric Tests 7 Level of Significance ( a) and Critical Region () t > t α,n such that P{ t > t α, n } = α α/ t α/, n 0 t α/, n α/ () t > tα, n such that P { t > tα, n } = α 0 t α,n α (3) t < tα, n such that P { t < tα, n } = α α t α, n 0 Critical Values (t α, n ) are obtained from Table.

28 8 Selected Statistical Tests Test Statistic t = X µ S / n (Under H 0 : µ = µ 0 ) n X = i= n X i n, S = ( X i X ) n i= The Statistic t follows t distribution with (n ) degrees of freedom. Conclusions. If t t α, we conclude that the data do not provide us any evidence against the null hypothesis H 0, and hence it may be accepted at α% level of significance. Otherwise reject H 0 or accept H ().. If t tα, we conclude that the data do not provide us any evidence against the null hypothesis H 0, and hence it may be accepted at α% level of significance. Otherwise reject H 0 or accept H (). 3. If t tα, we conclude that the data do not provide us any evidence against the null hypothesis H 0, and hence it may be accepted at α% level of significance. Otherwise reject H 0 or accept H (3). Example A sample of students from a school has the following scores in an I.Q. test Do this data support that the mean I.Q. mark of the school students is 80? Test at 5% level. Solution Aim: To test the mean I.Q. marks of the school students be regarded as 80 or not. H 0 : The mean I.Q. mark of the school students is 80. i.e., H 0 : µ=80. H : The mean I.Q. mark of the school students is not 80. i.e., H : µ 80. Level of Significance: α = 0.05 and Critical Value: t 0.05, =.0 Test Statistic: t = X µ S / n (Under H 0 : µ = 80) = 7.0/ = 0.5 Conclusion: Since t <.0, we conclude that the data do not provide us any evidence against the null hypothesis H 0. Hence, accept H 0, at 5% level of significance. That is, the mean I.Q. mark of the school students is regarded as 80.

29 Parametric Tests 9 Example The average breaking strength of steel rods is specified as.5 kg. To test this, a sample of 0 rods was examined. The mean and standard deviations obtained were.35 kg and.5 respectively. Is the result of the experiment significant at 5% level? Solution Aim: To test the average breaking strength of steel rods specified as.5 kg is true or not. H 0 : The average breaking strength of steel rods specified as.5 kg is true. i.e., H 0 : µ =.5. H : The average breaking strength of steel rods specified as.5 kg is not true. i.e., H : µ.5. Level of Significance: α = 0.05 and Critical Value: t 0.05,9 =.09 Test Statistic: t = X µ S / n (Under H 0 : µ =.5).35.5 = = Conclusion: Since t <.09, we conclude that the data do not provide us any evidence against the null hypothesis H 0 and hence it may be accepted at 5% level of significance. That is, the average breaking strength of steel rods specified as.5 kg is true. EXERCISES. A sales person says that the average sales of pickle in a week will be 0 numbers. A sample of sales on 8 weeks observed as Examine whether the claim of the salesman is true at % significance level.. A sample of 0 coconut has the following yield of coconuts from a grove in a season are Shall we conclude that the average yield of coconuts from the grove is 65? Test at % level.

30 TEST 4 TEST FOR A POPULATION VARIANCE (Population Mean is Known) Aim To test the population variance σ be regarded as σ, based on a random sample. That is, to 0 investigate the significance of the difference between the assumed population variance σ and the 0 sample variance s. Source A random sample of n observations X i, (i =,,, n) be drawn from a normal population with known mean µ and unknown variance σ. Assumption The population from which, the sample drawn is normal distribution. Null Hypothesis H 0 : The population variance σ is σ. That is, there is no significant difference between the 0 assumed population variance σ and the sample variance s. i.e., H 0 0 : σ = σ. 0 Alternative Hypotheses H () : σ σ 0 H () : σ > σ 0 H (3) : σ < σ 0

31 Parametric Tests Level of Significance ( a) and Critical Region () χ < χ (α/),n χ > χ (α/), n such that P{χ < χ (α/),n χ > χ (α/), n } = α () > χ α, n α/ 0 ( α/ ), n χ such that P { > χ α,n} χ χ = α χ ( α/ ), n α/ 0 (3) χ < χ α, n such that P {χ < χ α, n } = α. χ α,n fi a α 0 χ ( α), n

32 Selected Statistical Tests The critical values of Left sided test and Right sided test are provided as a and b are obtained from Table 3. Test Statistic χ = n i= ( X i σ 0 µ ) The statistic χ follows χ distribution with n degrees of freedom. Conclusions. If χ (α/) χ χ (α/), we conclude that the data do not provide us any evidence against the null hypothesis H 0, and hence it may be accepted at α% level of significance. Otherwise reject H 0 or accept H ().. If χ χ α, we conclude that the data do not provide us any evidence against the null hypothesis H 0, and hence it may be accepted at α% level of significance. Otherwise reject H 0 or accept H (). 3. If χ χ α, we conclude that the data do not provide us any evidence against the null hypothesis H 0, and hence it may be accepted at α% level of significance. Otherwise reject H 0 or accept H (3). Example An agriculturist expects that the average yield of coconut is 63 per coconut tree and variance is 0.5 per year from a coconut grove. A random sample of 0 coconut trees has the following yield in a year: Test the variance is significant at 5% level of significance. Solution Aim: To test the variance yield of coconut from the grove is significant with the sample variance or not. H 0 : The variance of the yield of coconut in the grove is 0.5. i.e., H 0 : σ = 0.5 H : The variance of the yield of coconut in the grove is not 0.5. i.e., H : σ 0.5 Level of Significance: α = 0.05 Critical Values: χ (.975), 0 = 3.47 & χ (.05), 0 = Critical Region: P (χ (.975), 0 < 3.47) + P (χ (.05), 0 >0.483) = 0.0 i= Test Statistic: χ = σ n ( X i 0 µ ) = 49. =

33 Parametric Tests 3 Conclusion: Since χ (α/) < χ < χ (α/), we conclude that the data do not provide us any evidence against the null hypothesis H 0. Hence, H 0 is accepted at 5% level of significance. That is, the variance of the yield of coconut in the grove be regarded as 0.5. Example The variation of birth weight (as measured by the variance) of children in a region is expected to be more than 0.6. The mean of the birth weight is known, which is.4 Kg. A sample of children is selected, whose birth weight is obtained as follows. Solution Weight (in Kgs.): Set up the hypotheses and for testing the expectedness at 5% level of significance. Aim: To test the variance of the birth weight of the children be 0.6 or more. H 0 : The variance of the birth weight of children in the region is 0.6. i.e., H 0 : σ = 0.6 H : The variance of the birth weight of children in the region is more than 0.6. i.e., H : σ > 0.6 Level of Significance: α = 0.05 and Critical Value: χ 0.05, = i= Test Statistic: χ = σ n ( X i 0 µ ) = =.94 Conclusion: Since χ < χ α, we conclude that the data do not provide us any evidence against the null hypothesis H 0. Hence, H 0 is accepted at 5% level of significance. That is, the variance of the birth weight of children in the region is 0.6. EXERCISES. A psychologist is aware of studies showing that the mean and variability (measured as variance) of attention, spans of 5-year-olds can be summarized as 80 and 64 minutes respectively. She wants to study whether the variability of attention span of 6-year-olds is different. A sample of 0 6-yearolds has the following attention spans in minutes: State explicit null and alternative hypotheses and test at 5% level.. The average and variance of daily expenditure of office going women is known as Rs.30 and Rs.0 respectively. A sample of 0 office going women is selected whose daily expenditure is obtained as Test whether the variance of the daily expenditure of office going women is 0 at % level of significance.

34 TEST 5 TEST FOR A POPULATION VARIANCE (Population Mean is Unknown) Aim To test the population variance σ be regarded as σ, based on a random sample. That is, to 0 investigate the significance of the difference between the assumed population variance σ and the 0 sample variance s. Source A random sample of n observations X i, (i =,,, n) be drawn from a normal population with mean µ and variance σ (both are unknown). The unknown population mean µ is estimated by its unbiased estimate X. Assumption The population from which, the sample drawn is normal distribution. Null Hypothesis H 0 : The population variance σ is σ. That is, there is no significant difference between the 0 assumed population variance σ and the sample variance s. i.e., H 0 0 : σ = σ. 0 Alternative Hypotheses H () : σ σ 0 H () : σ > σ 0 H (3) : σ < σ 0 Level of Significance ( a) and Critical Region: (As in Test 4)

35 Parametric Tests 5 Test Statistic n i= ( X i X ) χ = σ The statistic χ follows χ distribution with (n ) degrees of freedom. Conclusions (As in Test 4) Example 0 A Statistics Professor conducted an examination to the class of 3 freshmen and sophomores. The mean score was 7.7 and the sample standard deviation was 5.9. Past experience to the Professor to believe that, a standard deviation of about 3 points on a 00-point examination indicates that the exam does a good job. Does this exam meet his goodness criterion at 0% level? Solution Aim: To test that, the examination meets the professor s goodness criterion or not. H 0 : The variance of the score on the exam is regarded as 3 (=69). i.e., H 0 : σ = 69 H : The variance of the score on the exam is not 69. i.e., H : σ 69 Level of Significance: α = 0.0 Critical Values: χ (.95), 30 = & χ (.05), 30 = Critical Region: P (χ (.95),30 < 8.493) + P (χ (.05),30 i= ( X i X ) Test Statistic: χ = σ n 0 ns = σ0 3 (5.9) = 3 > ) = 0.0 = Conclusion: Since χ > χ (α/), we conclude that the data provide us evidence against the null hypothesis H 0 and in favor of H. Hence, H is accepted at 0% level of significance. That is, this examination does not meet his goodness criterion of believing the standard deviation to be 3. Example The variation of daily sales in a vegetable mart is reported as Rs.00. A sample of 0 day s was observed with variance as Rs.60. Test whether the variance of the sales in the vegetable mart be regarded as Rs.00 or not at % level of significance. Solution Aim: To test the variance of the sales in the vegetable mart be regarded as Rs.00 or not. H 0 : The variance of the sales in the vegetable mart is Rs.00. i.e., H 0 : σ = 00 H 0 : The variance of the sales in the vegetable mart is not Rs.00. i.e., H : σ 00 Level of Significance: α = 0.05 Critical Values: χ (.975), 9 = & χ (.05), 9 = 3.85

36 6 Selected Statistical Tests Critical Region: P (χ (.975), 9 < 8.907) + P (χ (.05), 9 > 3.85) = 0.05 i= ( X i X ) Test Statistic: χ = σ n = = 3 00 Conclusion: Since χ (α/) < χ < χ (α/), we conclude that the data do not provide us any evidence against the null hypothesis H 0. Hence, H 0 is accepted at 5% level of significance. That is, the variance of the sales in the vegetable mart is Rs.00. EXERCISES. A manufacturer claims that the lifetime of a certain brand of batteries produced by his company has a variance more than 6800 hours. A sample of 0 batteries selected from the production department of that company has a variance of 5000 hours. Test the manufacturer s claim at 5% level.. A manufacturer recorded the cut-off bias (volt) of a sample of 0 tubes as follows: The variability of cut-off bias for tubes of a standard type as measured by the standard deviation is 0.0 volts. Is the variability of new tube with respect to cut-off bias less than that of the standard type at % level?

37 TEST 6 TEST FOR GOODNESS OF FIT Aim To test that, the observed frequencies are good for fit with the theoretical frequencies. That is, to investigate the significance of the difference between the observed frequencies and the expected frequencies, arranged in K classes. Source Let O i, (i =,,, K) is a set of observed frequencies on K classes based on any experiment and E i (i =,,, K) is the corresponding set of expected (theoretical or hypothetical) frequencies. Assumptions (i) The observed frequencies in the K classes should be independent. K (ii) O i = E i= K i= i = N. (iii) The total frequency, N should be sufficiently large (i.e., N > 50). (iv) Each expected frequency in the K classes should be at least 5. Null Hypothesis H 0 : The observed frequencies are good for fit with the theoretical frequencies. That is, there is no significant difference between the observed frequencies and the expected frequencies, arranged in K classes. Alternative Hypothesis H : The observed frequencies are not good for fit with the theoretical frequencies. That is, there is a significant difference between the observed frequencies and the expected frequencies, arranged in K classes.

38 8 Selected Statistical Tests Level of Significance ( a) and Critical Region χ > χ α,(k ) such that P{χ > χ α,(k ) } = α Test Statistic χ = K i= Oi E Ei The Statistic χ follows χ distribution with (K ) degrees of freedom. Conclusion If χ χ α,(k ), we conclude that the data do not provide us any evidence against the null hypothesis H 0 and hence it may be accepted at α% level of significance. Otherwise reject H 0 or accept H. Example The sales of milk from a milk booth are varying from day-to-day. A sample of one-week sales (Number of Liters) is observed as follows. Day: Monday Tuesday Wednesday Thursday Friday Saturday Sunday Sales: Examine whether the sales of milk are same over the entire week at % level of significance. Solution Aim: To test the sales of milk is same over the entire week or not. H 0 : The sale of milk is same over the entire week. H : The sale of milk is not same over the entire week. Level of Significance: α = 0.0 Critical value: χ 0.0,6 = 6.8 i Day Frequency Observed (O i ) Expected (E i ) ( Oi Ei ) ( Oi Ei) E i Monday Tuesday Wednesday Thursday Friday Saturday Sunday Test Statistic: χ = K i= Oi E Ei i = 7.05

39 Parametric Tests 9 Conclusion: Since χ < χ α,(k ), we conclude that the data do not provide us any evidence against the null hypothesis H 0. Hence, H 0 is accepted at % level of significance. That is, the sales of milk are same over the entire week. Example In an experiment on pea breeding, Mendal obtained the following frequencies of seeds from 560 seeds: 3 rounded and yellow (RY), 04 wrinkled and yellow (WY); round and green (RG), 3 wrinkled and green (WG). Theory predicts that the frequencies should be in the proportion 9:3:3: respectively. Set up the hypothesis and test it for % level. Solution Aim: To test the observed frequencies of the pea breeding in the ratio 9:3:3:. H 0 : The observed frequencies of the pea breeding are in the ratio 9:3:3:. H : The observed frequencies of the pea breeding are not in the ratio 9:3:3:. Level of Significance: α = 0.0 Critical value: χ 0.0,3 =.345 Seed type Frequency Observed (O i ) Expected (E i ) O E ) ( i i ( Oi Ei) E RY WY RG WG Test Statistic: χ = K i= Oi E Ei i = Conclusion: Since χ < χ α,(k ), we conclude that the data do not provide us any evidence against the null hypothesis H 0. Hence, H 0 is accepted at % level of significance. That is, the observed frequencies of the pea breeding are in the ratio 9:3:3:. i EXERCISES. A chemical extract plant processes seawater to collect sodium chloride and magnesium. It is known that seawater contains sodium chloride, magnesium and other elements in the ratio of 6:4:34. A sample of 300 hundred tones of seawater has resulted in 95 tones of sodium chloride and 9 tones of magnesium. Are these data consistent with the known composition of seawater at 0% level?. Among 80 off springs of a certain cross between guinea pigs, 4 were red, 6 were black and were white. According to genetic model, these numbers should be in the ratio 9:3:4. Are these consistent with the model at % level of significance?

40 TEST 7 TEST FOR EQUALITY OF TWO POPULATION PROPORTIONS Aim To test the two population proportions P and P be equal, based on two random samples. That is, to investigate the significance of the difference between the two sample proportions p and p. Source From a random sample of n observations, X observations possessing an attribute A whose sample proportion p is X /n. Let the corresponding proportion in the population be denoted by P, which is unknown. From another sample of n observations, X observations possessing the attribute A whose sample proportion p is X /n. Let the corresponding proportion in the population be denoted by P, which is unknown. Assumption The sample sizes of the two samples are sufficiently large (i.e., n, n 30 ) to justify the normal approximation to the binomial. Null Hypothesis H 0 : The two population proportions P and P are equal. That is, there is no significant difference between the two sample proportions p and p. i.e., H 0 : P = P. Alternative Hypotheses H () : P P H () : P > P H (3) : P < P Level of Significance ( a) and Critical Region: (As in Test )

41 Parametric Tests 3 Test Statistic Z = ( p p ) ( P P ) P( P) + n n (Under H 0 : P = P ) n p + n p P = n + n The statistic Z follows Standard Normal distribution. Conclusions (As in Test ) Example Random samples of 300 male and 400 female students were asked whether they like to introduce CBCS system in their university. 60 male and 30 female were in favor of the proposal. Test the hypothesis that proportions of male and female in favor of the proposal are equal or not at % level. Solution Aim: To test the proportion of male and female students are equal or not, in introducing CBCS system in their university. H 0 : The proportion of male (P ) and female (P ) students are equal, in favour of the proposal of introducing CBCS system in their university. i.e., H 0 : P = P. H : The proportion of male and female students is not equal, in favour of the propasal of introducing CBCS system in their university. i.e., H : P P Level of Significance: α = 0.0 and Critical Value: Z α =.33 6 Based on the data, we observed that n = 300, p = = 0.53, n = 400, p = = P = Test Statistic: Z = n p ( p + n n + n p p P( P) + n n ) ( P ( ) + ( ) = = P ) (Under H 0 : P = P ) Z = ( ) =.3 Conclusion: Since Z < Z α, we conclude that the data do not provide us any evidence against the null hypothesis H 0 and hence it is accepted at % level of significance. That is, the proportion of male and female students are equal, in favour of the propsal of introducing CBCS system in their university.

42 3 Selected Statistical Tests Example From a random sample of 000 children selected from rural areas of a district in Tamilnadu, it is found that five are affected by polio. Another sample of 500 from urban areas of the same district, three of them is affected. Will it be reasonable to claim that the proportion of polio-affected children in rural area is more than urban area at % level? Solution Aim: To test the proportion of polio-affected children in rural area is same as in urban area or more than urban area. H 0 : The proportion of polio-affected children in rural (P ) and urban (P ) areas are equal i.e., H 0 : P = P. H : The proportion of polio-affected children in rural area is more than urban area. i.e., H : P > P. Level of Significance: α = 0.0 and Critical Value: Z α =.33 5 Based on the data, we observed that n = 000, p = = 0.005, n = 500, p = = P = n p + n p n + n = ( ) + ( ) = ( p p) ( P P ) Test Statistic: Z = P( P) + n n (Under H 0 : P = P ) Z = ( ) =.30 Conclusion: Since Z < Z α, we conclude that the data do not provide us any evidence against the null hypothesis H 0 and hence it is accepted at % level of significance. That is, the proportions of polio-affected children in rural and urban areas are equal. EXERCISES. From a sample of 300 pregnancies in city-a in a year, 63 births are females. Another sample of 50 pregnancies in city-b in the same year, 3 births are females. Test whether the female births in both cities are equal at % level of significance.. A sample of 500 persons were selected from a city in Tamilnadu, 0 are tea drinkers. Another sample of 300 persons from a city of Kerala, 60 persons are tea drinkers. Test the hypothesis that the tea drinkers in Tamilnadu are less than that of Kerala at 0% level.

Analysis of Variance and Co-variance. By Manza Ramesh

Analysis of Variance and Co-variance. By Manza Ramesh Analysis of Variance and Co-variance By Manza Ramesh Contents Analysis of Variance (ANOVA) What is ANOVA? The Basic Principle of ANOVA ANOVA Technique Setting up Analysis of Variance Table Short-cut Method

More information

[ z = 1.48 ; accept H 0 ]

[ z = 1.48 ; accept H 0 ] CH 13 TESTING OF HYPOTHESIS EXAMPLES Example 13.1 Indicate the type of errors committed in the following cases: (i) H 0 : µ = 500; H 1 : µ 500. H 0 is rejected while H 0 is true (ii) H 0 : µ = 500; H 1

More information

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,

More information

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Tribhuvan University Institute of Science and Technology 2065

Tribhuvan University Institute of Science and Technology 2065 1CSc. Stat. 108-2065 Tribhuvan University Institute of Science and Technology 2065 Bachelor Level/First Year/ First Semester/ Science Full Marks: 60 Computer Science and Information Technology (Stat. 108)

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Bus 216: Business Statistics II Introduction Business statistics II is purely inferential or applied statistics.

Bus 216: Business Statistics II Introduction Business statistics II is purely inferential or applied statistics. Bus 216: Business Statistics II Introduction Business statistics II is purely inferential or applied statistics. Study Session 1 1. Random Variable A random variable is a variable that assumes numerical

More information

11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should

11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should 11 CHI-SQUARED Chapter 11 Chi-squared Objectives After studying this chapter you should be able to use the χ 2 distribution to test if a set of observations fits an appropriate model; know how to calculate

More information

Kumaun University Nainital

Kumaun University Nainital Kumaun University Nainital Department of Statistics B. Sc. Semester system course structure: 1. The course work shall be divided into six semesters with three papers in each semester. 2. Each paper in

More information

Contents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47

Contents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47 Contents 1 Non-parametric Tests 3 1.1 Introduction....................................... 3 1.2 Advantages of Non-parametric Tests......................... 4 1.3 Disadvantages of Non-parametric Tests........................

More information

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000 Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E Salt Lake Community College MATH 1040 Final Exam Fall Semester 011 Form E Name Instructor Time Limit: 10 minutes Any hand-held calculator may be used. Computers, cell phones, or other communication devices

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

6. For any event E, which is associated to an experiment, we have 0 P( 7. If E 1

6. For any event E, which is associated to an experiment, we have 0 P( 7. If E 1 CHAPTER PROBABILITY Points to Remember :. An activity which gives a result is called an experiment.. An experiment which can be repeated a number of times under the same set of conditions, and the outcomes

More information

NONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12

NONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12 NONPARAMETRIC TESTS LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-1 lmb@iasri.res.in 1. Introduction Testing (usually called hypothesis testing ) play a major

More information

Random Variable And Probability Distribution. Is defined as a real valued function defined on the sample space S. We denote it as X, Y, Z,

Random Variable And Probability Distribution. Is defined as a real valued function defined on the sample space S. We denote it as X, Y, Z, Random Variable And Probability Distribution Introduction Random Variable ( r.v. ) Is defined as a real valued function defined on the sample space S. We denote it as X, Y, Z, T, and denote the assumed

More information

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics SEVERAL μs AND MEDIANS: MORE ISSUES Business Statistics CONTENTS Post-hoc analysis ANOVA for 2 groups The equal variances assumption The Kruskal-Wallis test Old exam question Further study POST-HOC ANALYSIS

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Instructor Name Time Limit: 120 minutes Any calculator is okay. Necessary tables and formulas are attached to the back of the exam.

More information

Chapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics

Chapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics Understand Difference between Parametric and Nonparametric Statistical Procedures Parametric statistical procedures inferential procedures that rely

More information

CONTINUOUS RANDOM VARIABLES

CONTINUOUS RANDOM VARIABLES the Further Mathematics network www.fmnetwork.org.uk V 07 REVISION SHEET STATISTICS (AQA) CONTINUOUS RANDOM VARIABLES The main ideas are: Properties of Continuous Random Variables Mean, Median and Mode

More information

Math 101: Elementary Statistics Tests of Hypothesis

Math 101: Elementary Statistics Tests of Hypothesis Tests of Hypothesis Department of Mathematics and Computer Science University of the Philippines Baguio November 15, 2018 Basic Concepts of Statistical Hypothesis Testing A statistical hypothesis is an

More information

PROBABILITY.

PROBABILITY. PROBABILITY PROBABILITY(Basic Terminology) Random Experiment: If in each trial of an experiment conducted under identical conditions, the outcome is not unique, but may be any one of the possible outcomes,

More information

Psych 230. Psychological Measurement and Statistics

Psych 230. Psychological Measurement and Statistics Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State

More information

Chapter 12: Inference about One Population

Chapter 12: Inference about One Population Chapter 1: Inference about One Population 1.1 Introduction In this chapter, we presented the statistical inference methods used when the problem objective is to describe a single population. Sections 1.

More information

VTU Edusat Programme 16

VTU Edusat Programme 16 VTU Edusat Programme 16 Subject : Engineering Mathematics Sub Code: 10MAT41 UNIT 8: Sampling Theory Dr. K.S.Basavarajappa Professor & Head Department of Mathematics Bapuji Institute of Engineering and

More information

MIT Arts, Commerce and Science College, Alandi, Pune DEPARTMENT OF STATISTICS. Question Bank. Statistical Methods-I

MIT Arts, Commerce and Science College, Alandi, Pune DEPARTMENT OF STATISTICS. Question Bank. Statistical Methods-I Q1 Q2 Q3 Q4 Q5 Q6 Q7 Q8 Q9 MIT Arts, Commerce and Science College, Alandi, Pune DEPARTMENT OF STATISTICS Question Bank Statistical Methods-I Questions for 2 marks Define the following terms: a. Class limits

More information

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course. Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course

More information

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01 An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there

More information

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Overview Introduction 10-1 Scatter Plots and Correlation 10- Regression 10-3 Coefficient of Determination and

More information

STA2601. Tutorial letter 203/2/2017. Applied Statistics II. Semester 2. Department of Statistics STA2601/203/2/2017. Solutions to Assignment 03

STA2601. Tutorial letter 203/2/2017. Applied Statistics II. Semester 2. Department of Statistics STA2601/203/2/2017. Solutions to Assignment 03 STA60/03//07 Tutorial letter 03//07 Applied Statistics II STA60 Semester Department of Statistics Solutions to Assignment 03 Define tomorrow. university of south africa QUESTION (a) (i) The normal quantile

More information

Statistical Inference Theory Lesson 46 Non-parametric Statistics

Statistical Inference Theory Lesson 46 Non-parametric Statistics 46.1-The Sign Test Statistical Inference Theory Lesson 46 Non-parametric Statistics 46.1 - Problem 1: (a). Let p equal the proportion of supermarkets that charge less than $2.15 a pound. H o : p 0.50 H

More information

green green green/green green green yellow green/yellow green yellow green yellow/green green yellow yellow yellow/yellow yellow

green green green/green green green yellow green/yellow green yellow green yellow/green green yellow yellow yellow/yellow yellow CHAPTER PROBLEM Did Mendel s results from plant hybridization experiments contradict his theory? Gregor Mendel conducted original experiments to study the genetic traits of pea plants. In 1865 he wrote

More information

Statistics for IT Managers

Statistics for IT Managers Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample

More information

41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes

41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes Tests Concerning a Single Sample 41.2 Introduction This Section introduces you to the basic ideas of hypothesis testing in a non-mathematical way by using a problem solving approach to highlight the concepts

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving

More information

ANOVA - analysis of variance - used to compare the means of several populations.

ANOVA - analysis of variance - used to compare the means of several populations. 12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.

More information

Paper Reference(s) 6683 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary Thursday 5 June 2003 Morning Time: 1 hour 30 minutes

Paper Reference(s) 6683 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary Thursday 5 June 2003 Morning Time: 1 hour 30 minutes Paper Reference(s) 6683 Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary Thursday 5 June 2003 Morning Time: 1 hour 30 minutes Materials required for examination Answer Book (AB16) Graph Paper (ASG2)

More information

Engineering Mathematics IV(15MAT41) Module-V : SAMPLING THEORY and STOCHASTIC PROCESS

Engineering Mathematics IV(15MAT41) Module-V : SAMPLING THEORY and STOCHASTIC PROCESS Engineering Mathematics IV(15MAT41) Module-V : SAMPLING THEORY and STOCHASTIC PROCESS By Dr. K.S.BASAVARAJAPPA Professor and Head, Department of Mathematics, Bapuji Institute of Engineering and Technology,

More information

Hypothesis testing. Data to decisions

Hypothesis testing. Data to decisions Hypothesis testing Data to decisions The idea Null hypothesis: H 0 : the DGP/population has property P Under the null, a sample statistic has a known distribution If, under that that distribution, the

More information

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015 AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking

More information

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding

More information

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of

More information

Section 6.2 Hypothesis Testing

Section 6.2 Hypothesis Testing Section 6.2 Hypothesis Testing GIVEN: an unknown parameter, and two mutually exclusive statements H 0 and H 1 about. The Statistician must decide either to accept H 0 or to accept H 1. This kind of problem

More information

Statistics Handbook. All statistical tables were computed by the author.

Statistics Handbook. All statistical tables were computed by the author. Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance

More information

Statistics and Measurement Concepts with OpenStat

Statistics and Measurement Concepts with OpenStat Statistics and Measurement Concepts with OpenStat William Miller Statistics and Measurement Concepts with OpenStat William Miller Urbandale, Iowa USA ISBN 978-1-4614-5742-8 ISBN 978-1-4614-5743-5 (ebook)

More information

1. A machine produces packets of sugar. The weights in grams of thirty packets chosen at random are shown below.

1. A machine produces packets of sugar. The weights in grams of thirty packets chosen at random are shown below. No Gdc 1. A machine produces packets of sugar. The weights in grams of thirty packets chosen at random are shown below. Weight (g) 9.6 9.7 9.8 9.9 30.0 30.1 30. 30.3 Frequency 3 4 5 7 5 3 1 Find unbiased

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

Practice Final Exam ANSWER KEY Chapters 7-13

Practice Final Exam ANSWER KEY Chapters 7-13 Practice Final Exam ANSWER KEY Chapters 7-13 1. 0.0410 (normal) 2. 0.7580 (normal) 3. 66.09 (reverse normal) 4. The critical value for a sample size of n = 6 is 0.888. Since the value of r (0.974) is greater

More information

Sample Problems for the Final Exam

Sample Problems for the Final Exam Sample Problems for the Final Exam 1. Hydraulic landing assemblies coming from an aircraft rework facility are each inspected for defects. Historical records indicate that 8% have defects in shafts only,

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

CHAPTER 14 THEORETICAL DISTRIBUTIONS

CHAPTER 14 THEORETICAL DISTRIBUTIONS CHAPTER 14 THEORETICAL DISTRIBUTIONS THEORETICAL DISTRIBUTIONS LEARNING OBJECTIVES The Students will be introduced in this chapter to the techniques of developing discrete and continuous probability distributions

More information

SLOW LEARNERS MATERIALS BUSINESS MATHEMATICS SIX MARKS QUESTIONS

SLOW LEARNERS MATERIALS BUSINESS MATHEMATICS SIX MARKS QUESTIONS SLOW LEARNERS MATERIALS BUSINESS MATHEMATICS SIX MARKS QUESTIONS 1. Form the differential equation of the family of curves = + where a and b are parameters. 2. Find the differential equation by eliminating

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

psychological statistics

psychological statistics psychological statistics B Sc. Counselling Psychology 011 Admission onwards III SEMESTER COMPLEMENTARY COURSE UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION CALICUT UNIVERSITY.P.O., MALAPPURAM, KERALA,

More information

37.3. The Poisson Distribution. Introduction. Prerequisites. Learning Outcomes

37.3. The Poisson Distribution. Introduction. Prerequisites. Learning Outcomes The Poisson Distribution 37.3 Introduction In this Section we introduce a probability model which can be used when the outcome of an experiment is a random variable taking on positive integer values and

More information

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1 MAT 2379, Introduction to Biostatistics Sample Calculator Problems for the Final Exam Note: The exam will also contain some problems

More information

One- and Two-Sample Tests of Hypotheses

One- and Two-Sample Tests of Hypotheses One- and Two-Sample Tests of Hypotheses 1- Introduction and Definitions Often, the problem confronting the scientist or engineer is producing a conclusion about some scientific system. For example, a medical

More information

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F. Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #27 Estimation-I Today, I will introduce the problem of

More information

Everything is not normal

Everything is not normal Everything is not normal According to the dictionary, one thing is considered normal when it s in its natural state or conforms to standards set in advance. And this is its normal meaning. But, like many

More information

green green green/green green green yellow green/yellow green yellow green yellow/green green yellow yellow yellow/yellow yellow

green green green/green green green yellow green/yellow green yellow green yellow/green green yellow yellow yellow/yellow yellow CHAPTER PROBLEM Did Mendel s results from plant hybridization experiments contradict his theory? Gregor Mendel conducted original experiments to study the genetic traits of pea plants. In 1865 he wrote

More information

Do not copy, post, or distribute

Do not copy, post, or distribute 14 CORRELATION ANALYSIS AND LINEAR REGRESSION Assessing the Covariability of Two Quantitative Properties 14.0 LEARNING OBJECTIVES In this chapter, we discuss two related techniques for assessing a possible

More information

Hypothesis testing. 1 Principle of hypothesis testing 2

Hypothesis testing. 1 Principle of hypothesis testing 2 Hypothesis testing Contents 1 Principle of hypothesis testing One sample tests 3.1 Tests on Mean of a Normal distribution..................... 3. Tests on Variance of a Normal distribution....................

More information

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49 4 HYPOTHESIS TESTING 49 4 Hypothesis testing In sections 2 and 3 we considered the problem of estimating a single parameter of interest, θ. In this section we consider the related problem of testing whether

More information

2. A music library has 200 songs. How many 5 song playlists can be constructed in which the order of the songs matters?

2. A music library has 200 songs. How many 5 song playlists can be constructed in which the order of the songs matters? Practice roblems for final exam 1. A certain vault requires that an entry code be 8 characters. If the first 4 characters must be letters (repeated letters are allowed) and the last 4 characters are numeric

More information

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data?

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data? Data Analysis: Agonistic Display in Betta splendens By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether

More information

SMTX1014- PROBABILITY AND STATISTICS UNIT V ANALYSIS OF VARIANCE

SMTX1014- PROBABILITY AND STATISTICS UNIT V ANALYSIS OF VARIANCE SMTX1014- PROBABILITY AND STATISTICS UNIT V ANALYSIS OF VARIANCE STATISTICAL QUALITY CONTROL Introduction and Process Control Control Charts for X and R Control Charts for X and S p Chart np Chart c Chart

More information

Test of Hypothesis for Small and Large Samples and Test of Goodness of Fit

Test of Hypothesis for Small and Large Samples and Test of Goodness of Fit Quest Journals Journal of Software Engineering and Simulation Volume 2 ~ Issue 7(2014) pp: 08-15 ISSN(Online) :2321-3795 ISSN (Print):2321-3809 www.questjournals.org Research Paper Test of Hypothesis for

More information

Statistical Hypothesis Testing with SAS and R

Statistical Hypothesis Testing with SAS and R Statistical Hypothesis Testing with SAS and R Statistical Hypothesis Testing with SAS and R Dirk Taeger Institute for Prevention and Occupational Medicine of the German Social Accident Insurance, Institute

More information

Performance Evaluation and Comparison

Performance Evaluation and Comparison Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Introduction 2 Cross Validation and Resampling 3 Interval Estimation

More information

79 Wyner Math Academy I Spring 2016

79 Wyner Math Academy I Spring 2016 79 Wyner Math Academy I Spring 2016 CHAPTER NINE: HYPOTHESIS TESTING Review May 11 Test May 17 Research requires an understanding of underlying mathematical distributions as well as of the research methods

More information

MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL

MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL 1). Two urns are provided as follows: urn 1 contains 2 white chips and 4 red chips, while urn 2 contains 5 white chips and 3 red chips. One chip is chosen

More information

MATH Notebook 3 Spring 2018

MATH Notebook 3 Spring 2018 MATH448001 Notebook 3 Spring 2018 prepared by Professor Jenny Baglivo c Copyright 2010 2018 by Jenny A. Baglivo. All Rights Reserved. 3 MATH448001 Notebook 3 3 3.1 One Way Layout........................................

More information

Course Review. Kin 304W Week 14: April 9, 2013

Course Review. Kin 304W Week 14: April 9, 2013 Course Review Kin 304W Week 14: April 9, 2013 1 Today s Outline Format of Kin 304W Final Exam Course Review Hand back marked Project Part II 2 Kin 304W Final Exam Saturday, Thursday, April 18, 3:30-6:30

More information

HYPERGEOMETRIC and NEGATIVE HYPERGEOMETIC DISTRIBUTIONS

HYPERGEOMETRIC and NEGATIVE HYPERGEOMETIC DISTRIBUTIONS HYPERGEOMETRIC and NEGATIVE HYPERGEOMETIC DISTRIBUTIONS A The Hypergeometric Situation: Sampling without Replacement In the section on Bernoulli trials [top of page 3 of those notes], it was indicated

More information

Rank-Based Methods. Lukas Meier

Rank-Based Methods. Lukas Meier Rank-Based Methods Lukas Meier 20.01.2014 Introduction Up to now we basically always used a parametric family, like the normal distribution N (µ, σ 2 ) for modeling random data. Based on observed data

More information

STAT Section 5.8: Block Designs

STAT Section 5.8: Block Designs STAT 518 --- Section 5.8: Block Designs Recall that in paired-data studies, we match up pairs of subjects so that the two subjects in a pair are alike in some sense. Then we randomly assign, say, treatment

More information

= 1 i. normal approximation to χ 2 df > df

= 1 i. normal approximation to χ 2 df > df χ tests 1) 1 categorical variable χ test for goodness-of-fit ) categorical variables χ test for independence (association, contingency) 3) categorical variables McNemar's test for change χ df k (O i 1

More information

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Probability and Statistics A.M. Mathai Associate Professor of Mathematics and Statistics McGill University, A-fontreal, Canada P. N. Rathie Professor of Statistics, Instituto

More information

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data? Agonistic Display in Betta splendens: Data Analysis By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether

More information

Worksheet 1. CBSE-Class-VI-Mathematics. Knowing Our Numbers. 1. Tick ( ) the greatest and cross ( ) smallest number :

Worksheet 1. CBSE-Class-VI-Mathematics. Knowing Our Numbers. 1. Tick ( ) the greatest and cross ( ) smallest number : Knowing Our Numbers Worksheet 1 1. Tick ( ) the greatest and cross ( ) smallest number : 2117, 5719, 10036, 2159, 7315, 41083, 80751, 32190, 76019, 1256 2. Make the smallest and the greatest 4-digit number

More information

*X202/13/01* X202/13/01. APPLIED MATHEMATICS ADVANCED HIGHER Statistics NATIONAL QUALIFICATIONS 2013 TUESDAY, 14 MAY 1.00 PM 4.

*X202/13/01* X202/13/01. APPLIED MATHEMATICS ADVANCED HIGHER Statistics NATIONAL QUALIFICATIONS 2013 TUESDAY, 14 MAY 1.00 PM 4. X0/1/01 NATIONAL QUALIFICATIONS 01 TUESDAY, 1 MAY 1.00 PM.00 PM APPLIED MATHEMATICS ADVANCED HIGHER Statistics Read carefully 1. Calculators may be used in this paper.. Candidates should answer all questions.

More information

Statistical Theory 1

Statistical Theory 1 Statistical Theory 1 Set Theory and Probability Paolo Bautista September 12, 2017 Set Theory We start by defining terms in Set Theory which will be used in the following sections. Definition 1 A set is

More information

SL - Binomial Questions

SL - Binomial Questions IB Questionbank Maths SL SL - Binomial Questions 262 min 244 marks 1. A random variable X is distributed normally with mean 450 and standard deviation 20. Find P(X 475). Given that P(X > a) = 0.27, find

More information

Chapter 11. Correlation and Regression

Chapter 11. Correlation and Regression Chapter 11. Correlation and Regression The word correlation is used in everyday life to denote some form of association. We might say that we have noticed a correlation between foggy days and attacks of

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 1 1-1 Basic Business Statistics 11 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Basic Business Statistics, 11e 009 Prentice-Hall, Inc. Chap 1-1 Learning Objectives In this chapter,

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

Comparison of Two Samples

Comparison of Two Samples 2 Comparison of Two Samples 2.1 Introduction Problems of comparing two samples arise frequently in medicine, sociology, agriculture, engineering, and marketing. The data may have been generated by observation

More information

Lecture 14: Introduction to Poisson Regression

Lecture 14: Introduction to Poisson Regression Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why

More information

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week

More information

3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability

3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability Random variable The outcome of each procedure is determined by chance. Probability Distributions Normal Probability Distribution N Chapter 6 Discrete Random variables takes on a countable number of values

More information

Inferences Based on Two Samples

Inferences Based on Two Samples Chapter 6 Inferences Based on Two Samples Frequently we want to use statistical techniques to compare two populations. For example, one might wish to compare the proportions of families with incomes below

More information