LECTURE NOTES. INTSTA2 Introductory Statistics 2. Francis Joseph H. Campeña, De La Salle University Manila

Size: px
Start display at page:

Download "LECTURE NOTES. INTSTA2 Introductory Statistics 2. Francis Joseph H. Campeña, De La Salle University Manila"

Transcription

1 LECTURE NOTES INTSTA Introductory Statistics Francis Joseph H. Campeña, De La Salle University Manila

2 Contents 1 Normal Distribution 1.1 Normal Distribution Sampling and Sampling Distribution 8.1 Sampling and Sampling Distribution Estimation Estimating the Population Mean (µ) Estimating the Population Proportion (π) Estimating the Population Variance (σ ) Errors in Estimation and Sample Size Determination Estimation of Two Parameters Estimating Difference of Two Means Estimating Difference of Two Proportions Estimating the Ratio of Two variances: σ 1 σ Statistical Test of Hypothesis Hypothesis Testing Bibliography 34 1

3 Chapter 1 Normal Distribution Recall that a continuous random variable has a probability of zero of assuming exactly any of its values. And due to the nature of the random variable, we cannot enumerate all of its possible values. Thus when we consider continuous random variables and their probabilities, we only look at probabilities of the random variable have a value in a specified interval. However, we will only consider one type of continuous random variable, the Normal random variable and its associated probability distribution. 1.1 Normal Distribution The normal distribution is one of the most important continuous distribution in the entire field of statistics. And the graph of this distribution is called the normal curve. This distribution is sometimes called the Gaussian distribution in honor of Karl Friedrich Gauss, who derived its equation.

4 CHAPTER 1. NORMAL DISTRIBUTION 3 Remark Properties of the normal curve: 1. It has a bell-shaped curve.. The mode, which is the point on the horizontal axis where the curve is a maximum, occurs at x = µ. 3. The curve is symmetric about a vertical axis through the mean, µ. 4. The normal curve approaches the horizontal axis asymptotically as we proceed in either direction away from the mean. (The graph approaches the x-axis but the graph will never intersect the x-axis). 5. The total area under the curve and above the horizontal axis is equal to 1. Definition A continuous random variable X having the bell-shaped distribution is called a normal random variable. The mathematical equation for the probability distribution of the normal random variable depends on two parameters µ and σ ; its mean and standard deviation. Thus we denote the probability density of X by N(x; µ; σ). If X is a normal random variable with mean µ and variance σ, then the equation of the normal curve is N(x; µ; σ) = 1 πσ e 1 ( x µ σ ) for < x <. Remark It is difficult to compute for the probabilities of a normal random variable using the above formula. However, another way of calculating such probabilities is through the transformation of a normal random variable to its corresponding standard normal random variable. By transforming a normal random variable to a standard normal random variable we can now determine probabilities of the said random variable. Thus we define the standard normal random variable and its distribution.

5 CHAPTER 1. NORMAL DISTRIBUTION 4 Definition The distribution of a normal random variable with mean µ = 0 and standard deviation σ = 1 is called a standard normal distribution. In order to transform a normal random variable to a standard normal one, we use the following formula: X µ Z=. σ By using the table for the standard normal random variable, we can now determine the probability of any normal random variable by transforming the given random variable to its corresponding standard normal random variable. Example Given a normally distributed random variable X with mean 18 and standard deviation of.5, find 1. P (X < 15).. P (17 < X < 1). Solution: (a) P (X < 15) = P Z < = P (Z < 1.) = Refer to the standard normal table: (b) P (17 < X < 1) = P <Z<.5.5 P (Z < 1.) P ( 0.4) = = = P (0.4 < Z < 1.) =

6 CHAPTER 1. NORMAL DISTRIBUTION 5 Example An electrical firm manufacturers light bulbs that have a length of life that is normally distributed with mean equal to 800 hours and standard deviation of 40 hours. Find the probability that the bulb burns between 778 and 834 hours. Solution: The distribution of the light bulbs is illustrated by the figure below: The z values corresponding to x 1 = 778 and x = 834 are Hence, z 1 = z = = 0.55, = P (778 < X < 834) = P ( 0.55 < Z < 0.85) = P (Z < 0.85) P (Z < 0.55) = =

7 CHAPTER 1. NORMAL DISTRIBUTION 6 Exercises (1) Given a normally distributed random variable X with mean 18 and standard deviation of.5, find the value of k such that () A certain type of storage battery last on the average 3.0 years, with a standard deviation of 0.5 years. Assuming that the battery lives are normally distributed, find the probability that a given battery will last less than.3 years. (3) An electrical firm manufactures light bulbs that have a length of life that is normally distributed with mean equal to 800 hours and a standard deviation of 40 hours. Find the probability that a bulb burns between 778 and 834 hours. (4) If the average height of miniature poodles is 30 centimeters, with a standard deviation of 4.1 cm, what percentage of miniature poodles exceeds 35 cm in height, assuming that the height follows a normal distribution and can be measured to any desired degree of accuracy? (5) A set of final examination grades in an introductory statistics course was found to be normally distributed, with a mean of 73 and a variance of 64. (a) What is the probability of getting a grade of 91 or less in this exam? (b) What percentage of students scored between 81 and 89? (c) Only 5% of the students taking the test scored higher than what grade? (6) Plastic bags used for packaging produce re manufactured so that the breaking strength of the bag is normally distributed with a mean of 5 pounds per square inch and a standard deviation of 1.5 pounds per square inch. (a) What proportion of the bags produced have a mean breaking strength of between 5 and 5.5 pounds per square inch? (b) What is the probability that a randomly selected bag will have a mean breaking strength of at least 6 pounds per square inch?

8 CHAPTER 1. NORMAL DISTRIBUTION 7 (c) What percentage of the bags have a mean breaking strength of less than 4.17 pound per square inch? (d) Between what two values symmetrically distributed around the mean will 95% of the breaking strengths fall? (7) If we know that the length of time it takes a college student to find a parking spot in the university parking lot follows a normal distribution with a mean of 3.5 minutes and a standard deviation of 1 minute, find the probability that if we select 36 randomly selected college students, the average time it would take for them to find a parking spot is (a) less than 3. minutes? (b) between 3.4 and 3.7 minutes? (c) more than 3.8 minutes? (8) The time needed to complete a final exam in a particular college course is normally distributed with mean 80 minutes and standard deviation of 10 minutes. (a) What is the probability of completing the exam in an hour or less? (b) What is the probability that a student will complete the exam in more than 60 minutes but less than 75 minutes? (c) Assume that the class has 60 students and that the examination period is 90 minutes in length. How many of the students do you expect will be unable to complete the exam in the allotted time?

9 Chapter Sampling and Sampling Distribution Recall that one of the objectives of statistics is to make inferences concerning a population. And these inferences are based only in partial information regarding the population, since the information of the statistics is based on the sample. And the value of our statistics may vary from sample to sample. Because of this, we need to understand first the variations that are associated with the statistic involved in our inference. Another concern regarding inference based on sample information is the factor of how the samples are taken and how large the sample size is so that meaningful interpretations can be drawn from the sample. This concern is addressed in specialized study of statistics, Sampling Theory, which is beyond the scope of our study in this course. But an overview of terms and concepts in sampling theory are discussed in section Sampling and Sampling Distribution A statistic is a numerical descriptive measure derived from a sample. However, there are random samples thus producing different values for a certain statistic. Since statistic varies from sample to sample then we can say that a statistic is also a random variable. Recall that we can construct a probability distribution for a random variable hence probability distribution for a statistic can also be constructed. We call the probability distribution of a statistic a sampling distribution. 8

10 CHAPTER. SAMPLING AND SAMPLING DISTRIBUTION 9 Definition The sampling distribution of a statistic is the probability distribution for the possible values of the statistic that results when random samples of size n are repeatedly drawn from the population. Example A population consists of N = 5 numbers :1,, 3, 4, and 5. If a random sample of size n = 3 is selected, find the sampling distributions for the sample mean. Solution: Computation of the population mean and variance will give us µ = 3 and σ =. Since there are only 5 distinct and equally likely elements in our population the probability that one will occur is the same for all elements in the population, that is, P (x) = 1. Since we are only choosing 3 from the 5 population there are only 5 C 3 = 10 different possible samples and they are as follows: No. Sample Sample Mean x 1 1,,3 1,, ,, ,3, ,3, ,4, ,3,4 3 8,3, ,4, ,4,5 4 Thus the sampling distribution of the sample mean is x f( x) Notice that if we take the average of all the sample means we will get the value 3 and a variance of 1. But if we increase our sample size say n = 4 and 3 compute for the sampling distribution of x again, we will still get a mean of 3 but a variance of 0.15.

11 CHAPTER. SAMPLING AND SAMPLING DISTRIBUTION 10 Remark We can notice that µ x the mean of the sample means is equal to the population mean, and the variance σ x or the standard deviation σ x will decrease as our sample size increases. If all possible random samples of size n are drawn, without replacement, from a finite population of size N with mean µ and standard deviation σ, then the sampling distribution of the sample mean will be approximately normally distributed and the mean and standard deviation is given by The factor N n N 1 µ x = µ and σ x = σ n N n N 1. is called the finite correction factor. For large or infinite populations, this correction factor will be approximately equal to 1. Hence σ x = σ n The above notion regarding the sampling distribution of the sample mean gives us the foundation of the next theorem; the central limit theorem. The central limit theorem states that in general situations and condition sums and means of samples of random observations that are drawn from a population of any distribution tends to possess, approximately, a bell shaped distribution in repeated sampling. And thus the distribution can be assumed approximately normal. One of the significance of the central limit theorem is that it explains why some of the observations in the real world tends to possess an approximately a normal distribution. To illustrate this significance, consider the weight of a person. Weight can be affected by many factors whether environmental or genetics for instance, family lineage such as the parents weights. Another factor can be the physical activities of the person. All this possibilities may really affect the weight of a person but the central limit theorem together with other theorems applicable to the normal distribution provides an explanation of this events. Another significance of the central limit theorem and probably the most important attribute is its application to statistical inference. Many statistical estimators that are used to make inferences about a population have parameters that are sums and averages of sample observations.

12 CHAPTER. SAMPLING AND SAMPLING DISTRIBUTION 11 Theorem Central Limit Theorem If random samples of size n are drawn from a large or infinite population with mean µ and variance σ, then the sampling distribution of the sample mean is approximately normally distributed with mean and standard deviation µ x and σ x = σ n where µ and sigma are the mean and standard deviation of the population, respectively. Thus, z = ( x µ x σ x = σ n ) is a value of a standard normal random variable Z. Remark 1. If samples are taken from a population having a normal distribution, then the sampling distribution of the sample mean will have a normal distribution no matter what n is.. If samples are taken from a population which is not normally distributed, then the sampling distribution of the sample mean will have an approximate normal distribution only for large samples, that is, when n The standard deviation of the sampling distribution of x,σ x, is called the standard error of the sample mean.

13 Chapter 3 Estimation Procedures and formulas used in estimating values of unknown population parameters that are based on information provided in a sample data are based on the theory of sampling distributions and the methods used to collect these sample. The sampling distributions allow us to associates specific levels of confidence with each statistical inference. And thus enabling us to quantify how much confidence we place in a sample statistic correctly estimating the population parameter. Definition An estimator is a rule, usually expressed as a formula that tells us how to calculate an estimate based on information in the sample. We can classify estimators into two, point estimators and interval estimators. 1. Point estimation - Based on sample data, a single number is calculated to estimate the population parameter. The rule or formula that describes this calculation is called the point estimator, and the resulting number is called the point estimate.. Interval estimation - Based on sample data, two numbers are calculated to form an interval within which the parameter is expected to lie. The rule or formula that describes this calculation is called the interval estimator, and the resulting pair of numbers is called an interval estimate or confidence interval. 1

14 CHAPTER 3. ESTIMATION Estimating the Population Mean (µ) A. Point Estimate for µ (1) The best point estimate for the population mean, µ, is the sample mean, x. () The point estimator x is unbiased with standard error given by SE = σ n. (3) The margin of error of the point estimate, x, is given by±1.96se. (4) If σ is unknown and n 30, the sample standard deviation s can be used to approximate σ. B. Interval Estimate for µ To construct and interval estimate for the population mean, we consider two cases. One case is when the standard deviation of the population is known or unknown by the sample size is large enough, that is, n 30. The other case is when the standard deviation is not known and the sample size is less than 30. (a) CASE 1: If σ is known or σ unknown but n 30, a (1 α)100% confidence interval for a population mean,µ is given by: x ± Z α ( ) σ n where: x = sample mean Z α = z-score with an area of α to the right n = sample size σ = population standard deviation (b) CASE : If σ unknown and n 30, a (1 α)100% confidence interval for a population mean,µ is given by: x ± t α ( s n )

15 CHAPTER 3. ESTIMATION 14 where: x = sample mean t α = critical t-value with an area of α to the right and a degree of freedom n 1 n = sample size s = sample standard deviation Remark (1) If x is used as an estimate of µ, we ( can then be (1 α)100% confident that the error will not exceed Z α σ n ). () If x is used as an estimate of µ, we can then be (1 α)100% confident that the error will not exceed a specified amount e when the sample size ( Z α ) σ is n =. Example e The mean and standard deviation for the quality grade point averages of a random sample of 36 college seniors are calculated to be.6 and 0.3 respectively. Find the 95% confidence interval for the mean of the entire senior class. Solution: The following are known: x =.6, s = 0.3, n = 36, α = 5% for a 95% C.I. A 95% confidence interval for the quality grade point average of the entire senior class ( ) is given by: ( ) ( ) σ x ± Z α 0.3 n.6 ± Z ± = (.50,.698) 3. Estimating the Population Proportion (π) A. Point Estimate for π (1) The best point estimate for the population proportion,π, is the sample proportion, ˆp. () The point estimator ˆp is unbiased with standard error given by SE = ˆpˆq n. (3) The margin of error of the point estimate, ˆp, is given by±1.96se.

16 CHAPTER 3. ESTIMATION 15 (4) The maximum error in estimating π using ˆp, is given by±z α. B. Interval Estimate for π To construct and interval estimate for the population proportion we have the following formula: A (1 α)100% confidence interval for a population proportion,π is given by: where: ˆp = sample proportion ˆq = 1 ˆp Z α ˆp ± Z α ( ) ˆpˆq n = z-score with an area of α to the right n = sample size Example In a random sample of 500 people eating lunch at a hospital cafeteria on various Fridays, it was found that 160 preferred seafood. Find a 95% confidence interval for the actual proportion of people who eat seafood on Fridays at this cafeteria. Solution: The following are known: x = 160, n = 500, α = 5% for a 95% C.I. The point estimate of π is ˆp = = 0.3 A 95% confidence interval for the proportion of people who prefers seafood on Fridays at this cafeteria is given by: ( ) ( ) ˆpˆq (0.3)(0.68) ˆp ± Z α 0.3 ± Z 0.05 n 500 ( ) (0.3)(0.68) 0.3 ± (1.96) < π <

17 CHAPTER 3. ESTIMATION Estimating the Population Variance (σ ) Suppose a sample of size n is drawn from a normal population with variance σ. The point estimate for the population variance,σ, is the sample variance, s and a (1 α)100% confidence interval for the population variance is given by (n 1)s χ α < σ < (n 1)s χ 1 α where χ α and χ 1 are values with n 1 degrees of freedom leaving areas α of α and 1 α, respectively, to the right. Example The following are the volumes, in deciliters, of 10 cans of peaches distributed by a certain company: 46.4, 46.1, 45.8, 47.0, 46.1, 45.9, 45.8, 46.9, 45., and Find a 95% confidence interval for the variance of all such cans of peaches distributed by this company, assuming that the volume is normally distributed random variable. Solution: The following are known: n = 10, α = We compute s to be and the chi-square values to be χ α = χ 0.05 = and χ 1 = χ α =.700. Using these values, a95% confidence interval for the variance of the volume of canned peaches by this company is: (9)(0.934) < σ < (9)(0.934) < σ < Errors in Estimation and Sample Size Determination We note that a (1 α)100% confidence interval provides an estimate of the accuracy of our point estimates. If the parameter is actually at the center of the interval estimate then the point estimate estimates the parameter without error. However, this will not always be the case. Hence, we provide the following theorems.

18 CHAPTER 3. ESTIMATION 17 Theorem Error in Estimating µ If x is used as an estimate ( of µ, ) we can then be σ. n (1 α)100% confident that the error will not exceed Z α Theorem Sample Size for Estimating µ If x is used as an estimate of µ, we can then be (1 α)100% confident that the error will not exceed a specified amount ( σ ) e when the sample size is n = Z α. e Theorem Error in Estimating π If ˆp is used as an estimate ( of π, we ) can then be ˆpˆq. n (1 α)100% confident that the error will not exceed Z α Theorem Sample Size for Estimating π If ˆp is used as an estimate of π, we can then be (1 α)100% confident that( the ) error will not exceed a specified amount ˆpˆq e when the sample size is n = Z α. e

19 CHAPTER 3. ESTIMATION 18 Exercises 1. A scientist interested in monitoring chemical contaminants in food, and thereby the accumulation of contaminants in human diets, selected a random sample of n = 50 male adults. It was found that the average daily intake of dairy products was 756 grams per day with a standard deviation of 35 grams per day. Construct a 95% confidence interval for the mean daily intake of dairy products for men.. A random sample of 1 female students in a certain dormitory showed an average weekly expenditure of P400 for snack foods, with a standard deviation of P1.50. Construct a 90% confidence interval for the average amount spent on snack foods by female students living in this dormitory, assuming the expenditures to be normally distributed. 3. The contents of 7 similar containers of sulfuric acid are 9.8, 10., 10.4, 9.8, 10.0, 10., and 9.6 liters. Find a 95% confidence interval for the mean content of all such containers, assuming an approximate normal distribution for container contents. 4. The mean and standard deviation for the quality grade point averages of a random sample of 36 college seniors are calculated to be.6 and 0.3 respectively. Find the 95% and 99% confidence intervals for the mean of the entire senior class. 5. The following data were collected based from a sample in an experiment: n = 64, x =.5 and s = 3.4. (a) What is the point estimate of µ? (b) What is the margin of error associated with the point estimate of µ? (c) Construct a 99% confidence interval for µ. (d) What is the maximum error of the estimate for (c)? 6. A telephone answering service completes a report in which the length of the call is recorded, at the end of each call. A random sample of 9 reports yields a mean length of call of 1. minutes. Construct a 95% confidence interval for the mean length of call for the whole telephone answering service company if it is known that the population is normally distributed with a standard deviation of 0.6 minutes.

20 CHAPTER 3. ESTIMATION A random sample of 10 chocolate bars has an average of 30 calories with a standard deviation of 15 calories. Assuming that the distribution of the calories is approximately normal. (a) Construct a 99% confidence mean calories content of this chocolate bar. (b) How large a sample is needed if we wish to be 99% confident that our sample mean will be within 5 calories of the true mean? 8. A sample selected from a population gave a sample proportion equal to 0.73 (a) Make a 99% confidence interval for π assuming n = 100. (b) Make a 99% confidence interval for π assuming n = 600. (c) Make a 99% confidence interval for π assuming n = (d) Does the width of the confidence interval constructed for a-c decrease as the sample size increases? If yes, explain why. 9. In poll of 617 workers conducted for Ernst and Young, 5% said that they had observed their co-workers stealing products or cash from their employers. (a) What is the point estimate of the corresponding population proportion? (b) What is the margin of error associated with the point estimate? (c) Find a 95% confidence interval for the proportion of all such workers who have observed their co-workers stealing productgs or cash from their employers. 10. In a random sample of 500 teenagers 1 to 17 years old, it was found that 330 have regular access to computers and the Internet. (a) What is the point estimate of the corresponding population proportion? (b) Construct a 95%confidence interval for the true proportion of teenagers 1 to 17 years old who have regular access to computers and the Internet.

21 CHAPTER 3. ESTIMATION 0 (c) What can you assert with 95% confidence about the possible size of the error e if you estimate the true proportion of teenagers 1 to 17 years old who have regular access to computers and the Internet to be equal to = 0.66? 11. A random sample of 985 likely voters who are likely to vote in the upcoming election were polled during a phonathon conducted by the Liberal party. Of those surveyed, 59 indicated ha they intend to vote for the Liberal candidate in the upcoming election. (a) Construct a 90% confidence interval for the proportion of likely voters in the population who intend to vote for a liberal candidate. (b) What can we assert with a 90% confidence about the possible size of error if we estimate the fraction of voters who intend to vote for the Liberal candidate is (c) How large a sample is needed if we want to be 90% confident that our estimate of p is within 0.1?

22 Chapter 4 Estimation of Two Parameters 4.1 Estimating Difference of Two Means Let µ 1 and σ 1 be the mean and standard deviation, respectively, of the first population and µ and σ be the mean and standard deviation, respectively, of the second population. Random samples of size n 1 are taken from the first population and random samples of size n are taken from the second population. A. Point estimation for µ 1 µ : (1) The best point estimate for the difference between two population means, µ 1 µ, is given by the difference between their sample means, x 1 x. () The point estimator, x 1 x, is unbiased with standard error given σ1 by SE = + σ. n 1 n (3) The margin of error of the point estimate is given by 1.96SE. (4) If σ 1 and σ are unknown but both n 1 and n are 30 are more, then the sample variances s 1 and s can be used. B. Interval estimation for µ 1 µ : We consider four cases in constructing a (1 α)100% Confidence Interval for the difference between two population means. 1

23 CHAPTER 4. ESTIMATION OF TWO PARAMETERS Case 1: Large Sample Case σ 1 and σ are known or σ 1 and σ are unknown but n 1 n 30. ( x 1 x ) ± Z α σ 1 + σ n 1 n 30 and Case : Small Sample Case, Equal Variance σ 1 and σ are unknown and n 1 < 30 and n < 30 but σ 1 = σ. ( 1 ( x 1 x ) ± t α s p + 1 ) n 1 n (n 1 1)s 1 + (n 1)s where, v = n 1 + n and s p = n 1 + n Case 3: Small Sample Case, Unequal Variance σ 1 and σ are unknown and n 1 < 30 and n < 30 but σ 1 σ. ( x 1 x ) ± t α where, v = df = s 1 + s n 1 n ( ) s 1 n 1 + s n ( s 1 n 1 ) n ( s n ) n 1

24 CHAPTER 4. ESTIMATION OF TWO PARAMETERS 3 Case 4: Paired Sample σ 1 and σ are unknown and n 1 < 30 and n < 30 but σ 1 σ. s d ± t α d a n where, d is the mean of the differences and s d is the standard deviation n of the differences computed as d = ( n ) d i n d i=1 i n i=1 s d = n 1 i=1 n 4. Estimating Difference of Two Proportions Suppose independent random samples of size n 1 and n are taken from two populations and let x 1 and x be the number of successes in the first and second populations, respectively. A. Point estimation for π 1 π : d i and (1) The best point estimate for the difference between two population proportion, π 1 π, is given by the difference between their sample proportions, ˆp 1 ˆp. () The point estimator, ˆp 1 ˆp, is unbiased with standard error given ˆp1 ˆq 1 by SE = + ˆp ˆq. n 1 n (3) The margin of error of the point estimate is given by 1.96SE. B. Interval estimation for π 1 π : We consider four cases in constructing a (1 α)100% Confidence Interval for the difference between two population proportion.

25 CHAPTER 4. ESTIMATION OF TWO PARAMETERS 4 ( ˆp 1 ˆp ) ± Z α ( ˆp1 ˆq 1 n 1 + ˆp ˆq n ) 4.3 Estimating the Ratio of Two variances: σ 1 σ For any two independent random samples of size n 1 and n selected from two normal populations, the ratio of the sample variances, s 1, is computed and s the following (1 α)100% confidence interval for σ 1 is given by σ s 1 s f α 1 (v 1, v ) < σ 1 σ < s 1 f s α (v, v 1 ) where f α (v 1, v ) is an f value with a v 1 = n 1 1 and v = n 1 degrees of freedom leaving an area of α to the right. Exercises 1. The wearing qualities of two types of automobile tires were compared by road testing samples of 100 tires of each type. The number of miles until wear out was defined as a specific amount of tire wear. The test results are given below: T ire1 T ire x 1 = 6, 000 x 1 = 5, 100 s 1 = 1, 440, 000 s = 1, 960, 000 Estimate the difference in mean miles to wear out, µ 1 µ.. A standardized chemistry test was given to a random sample of 50 girls and 75 boys. The girls made an average grade of 76 with a standard deviation of 6 while the boys made an average grade of 8 with a standard deviation of 8. Find a 90% confidence interval for the difference µ 1 µ where µ 1 is the mean sore of all boys and µ is the mean of all girls who might take the test.

26 CHAPTER 4. ESTIMATION OF TWO PARAMETERS 5 3. A course in Mathematics is taught to 1 students by the conventional classroom procedure. A second group of 10 students was given the same course by means of programmed materials. At the end of the term, the same examination was given to each group. The 1 students meeting in the classroom made an average group of 85 with a standard deviation of 4 while the 10 students using programmed materials made an average of 81 with a standard deviation of 5. Find a 90% confidence interval for the difference between the population means, assuming the population approximates a normal distribution with equal variances. 4. Records for the past 15 years have shown the average rainfall in a certain region of the country for the month of May to be 4.93 cm with a standard deviation of 1.14 cm. A second region of the country has had an average rainfall in May of.64 cm with a standard deviation of 0.06 cm during the past 10 years. Find a 95% confidence interval for the difference of the true average rainfalls in these two regions assuming that the observations come from normal populations with different variances. 5. It is claimed that a new diet will reduce a persons weight by 4.5 kilograms on the average in a period of weeks. The weights of 7 women who followed this diet were recorded before and after a -week period: Bef ore Af ter (a) Find the average of the differences in weights before and after the weight loss program for the 4 women who participated. (b) Find a 95% Confidence interval for the differences in weights before and after the weight loss program for the 4 women who participated. (c) 6. A poll is taken among the residents of a city and the rounding county to determine the feasibility of a proposal to construct a civic center. If 400 of 5000 city residents favor the proposal and 100 of 000 county residents favor it, find a 90% confidence interval for the true difference in the fractions favoring the proposal to construct the civic center.

27 CHAPTER 4. ESTIMATION OF TWO PARAMETERS 6 7. A geneticist is interested in the proportion of males and females in the population that have a certain minor blood disorder. In a random sample of 100 males, 4 are found to be afflicted, whereas 13 out of 100 females tested appear to have the disorder. Compute a 99% confidence interval for the difference between proportion of males and females that have this blood disorder. 8. In a study of the relationship between birth order and college success, an investigator found that 16 in a sample of 180 college students were firstborn or only child. In a sample of 100 non-graduates of comparable age and socio-economic background, the number of firstborn or only child was 54. Find a point estimate for the difference between the proportions of firstborn or only child in the two populations from which these samples were drawn. 9. An efficiency expert wishes to determine the average time that it takes to drill 3 holes in a certain metal clamp. How large should a sample will be needed for the expert to be 95% confident that his sample mean will be within 15 seconds of the true mean? 10. The government awarded grants to the agricultural departments of nine universities to test the yield capabilities of two new varieties of wheat. Each variety was planted on plots of equal area at each university and the yields, in kilograms per plot, were recorded as follows: V ariety V ariety Find a 95% confidence interval for the mean difference between the yields of the two varieties assuming the distributions of yileds to be approximately normal. 11. A random sample of 1 female students in a certain dormitory showed an average weekly expenditure of Php for snack foods, with a standard deviation of Php (a) What is the point estimate for the average weekly expenditure of females in this dormitory?

28 CHAPTER 4. ESTIMATION OF TWO PARAMETERS 7 (b) What is the standard error in estimating the average weekly expenditure of females in this dormitory? (c) Construct a 90% confidence interval for the average amount spend each week on snack foods by female students living in this dormitory, assuming the expenditures to be approximately normally distributed. 1. Two kinds of thread are being compared for strength. Fifty pieces of each type of thread are tested under similar conditions. Brand A had an average tensile strength of 78.3 kilograms with a standard deviation of 5.6 kilograms, while Brand B had an average tensile strength of 87. kilograms with a standard deviation of 6.3 kilograms. (Use µ 1 for Brand A and µ for Brand B ) (a) What is the point estimate for the difference in the average tensile strength of the two threads? (b) What is the standard error in estimating the difference of the average tensile strength of the two kinds of thread? (c) Construct a 95% confidence interval for the difference of the population means. (d) What is the maximum error in estimating the difference of the average tensile strength of the two kinds of thread? 13. A new rocket-launching system is being considered for deployment of small short-range launches. The existing system has π = 0.8 as the probability of a successful launch. A sample of 40 experimental launches is made with the new system and 34 are successful. (a) Construct a 95% confidence interval for π. (b) Would you conclude that the new system is better? answer. Explain your 14. A study is made to determine if a cold climate results in more students being absent from school during a semester than for a warmer climate. Two groups of students are selected at random, one group from Vermont and the other groups from Georgia. Of the 300 studnets from Vermont, 64 were absent at least 1 day during the semester, and of the 400 students from Georgia, 51 were absent 1 day or more days. Find a 95% confidence

29 CHAPTER 4. ESTIMATION OF TWO PARAMETERS 8 interval for the difference between the fractions of the students who are absent in the two states. 15. A random sample of 100 homes in a certain city, it is found that 68 are heated by natural gas. Find the 98% confidence interval for the fraction of homes in this city that are heated by natural gas. 16. A random sample of 75 colleges students is selected and 16 are found to have cars on campus. Use a 95% confidence interval to estimate the fraction of students who have cars on campus.

30 Chapter 5 Statistical Test of Hypothesis In a certain perspective, we can view hypothesis testing just like a jury in a court trial. In a jury trial, the null hypothesis is similar to the jury making a decision of not- guilty, and the alternative is the guilty verdict. Here we assume that in a jury trial that the defendant isn t guilty unless the prosecution can show beyond a reasonable doubt that defendant is guilty. If it has been established that there is evidence beyond a reasonable doubt and the jury believes that there is enough evidence to refute the null hypothesis, the jury gives a verdict in favor of the alternative hypothesis, which is a guilty verdict. In general, when performing hypothesis testing, we set up the null (H o ) and alternative (H a ) hypothesis in such a way that we believe that Ho is true unless there is sufficient evidence (information from a sample; statistics) to show otherwise. 5.1 Hypothesis Testing A statistical hypothesis is an assertion or conjecture concerning one or more populations. 9

31 CHAPTER 5. STATISTICAL TEST OF HYPOTHESIS 30 Remark 1. Null hypothesis - the hypothesis that we wish to focus our attention on. Generally this is a statement that a population parameter has a specified value. The hypothesis that is tested and the one which the researcher wishes to reject or not to reject. Specifies an exact value of the population parameter. Denoted by H o.. Alternative hypothesis - the hypothesis that is accepted if the null hypothesis is rejected. Allows for the possibility of several values. Denoted by H a or H 1. May be directional (quantifier < or >) or non-directional (quantifier is ). A test of hypothesis is the method to determine whether the statistical hypothesis is true or not. In performing statistical test of hypothesis we consider the following situations: REJECT DO NOT REJECT N ullhypothesis TRUE FALSE T ypeierror CorrectDecision CorrectDecision T ypeiierror The probability of committing a TYPE I error is also called the level of significance and is denoted by a small Greek symbol alpha, α. Some of the common values used for the level of significance are 0.1, 0.05, and For example, if α = 0.1 for a certain test, and the null hypothesis is rejected, then it means that we are 90% confident that this is the correct decision.

32 CHAPTER 5. STATISTICAL TEST OF HYPOTHESIS 31 Remark The following are some important properties pertaining to α and β. The Type I error and Type II error are related. A decrease in the probability of one results in the increase in the porbability of the other. The size of the critical region, and therefore the probability of committing a Type I error, can always be reduced by adjusting the critical values. An increase in the sample size will reduce α and β simultaneously. If the null hypothesis is false, β is a maximum when the true value of a parameter is close to the hypothesized value. The greater the distance between the true value and the hypothesized value, the smaller β will be. Remark

33 CHAPTER 5. STATISTICAL TEST OF HYPOTHESIS 3 The following are some important terms and concepts in performing a test of hypothesis. 1. Level of significance,α. The level of significance,α, is the probability of committing an error of rejecting the null hypothesis when, in fact, it is true.. One-tailed tests v.s. Two-tailed tests. One Tailed Test. A one tailed test is performed when the alternative hypothesis is concerned with values specifically below or above an exact value of the null hypothesis. The alternative hypothesis is directional(i.e. < or >). Two Tailed Test. A two-tailed test is performed when the alternative hypothesis is concerned with values that are not equal to an exact value of the null hypothesis. The alternative hypothesis is non-directional. 3. Test Statistic The value generated from sample data. Test value to be compared with the critical values. 4. Critical Region (Region of rejection/region of acceptance) Depends on the type of test to be performed. If test is one tailed, then the critical region is concentrated on either the left tail (for<) or the right tail of the distribution (for >). If test is two tailed, then the critical region is distributed on each tail of the distribution. Critical values are obtained depending on the type of test to be performed. If the test is one tailed, the significance level will be the area either on the left tail or on the right tail of the distribution. If the test is two tailed, the area in each tail of the distribution will be α.

34 CHAPTER 5. STATISTICAL TEST OF HYPOTHESIS 33 The following are the steps in performing a test of hypothesis: (1) Setup the null and alternative hypothesis. () Indicate the level of significance. (3) Determine the critical region and the corresponding critical values. (4) Compute the value of the test statistic. (5) Make a decision. (6) Draw appropriate conclusion.

35 Bibliography [1] R. Walpole Introduction to Statistics. Pearson Education South Asia Pte Ltd.004. [] R. Walpole, R. Myers, K. Ye, and S. Myers Probability and Statistics for Engineers and Scientists. Pearson Education International.007. [3] L. Stephens Schaums s Outline of Theory and Problems in Beginning Statistics. The McGraw-Hill Companies, Inc [4] L. Kazmier Schaums s Easy Outlines: Business Statistics. The McGraw- Hill Companies, Inc.003. [5] L. Gonick and W. Smith Cartoon Guide to Statistics. HarperCollins Publisher, [6] A. Graham Developing Thinking in Statistics Paul Chapman Publishing 006. [7] R. Khazanie Elementary Statistics: In a World of Applications. Goodyear Publishing Inc.,

Chapter 6 ESTIMATION OF PARAMETERS

Chapter 6 ESTIMATION OF PARAMETERS Chapter 6 ESTIMATION OF PARAMETERS Recall that one of the objectives of statistics is to make inferences concerning a population. And these inferences are based only in partial information regarding the

More information

[ z = 1.48 ; accept H 0 ]

[ z = 1.48 ; accept H 0 ] CH 13 TESTING OF HYPOTHESIS EXAMPLES Example 13.1 Indicate the type of errors committed in the following cases: (i) H 0 : µ = 500; H 1 : µ 500. H 0 is rejected while H 0 is true (ii) H 0 : µ = 500; H 1

More information

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except on problems 1 & 2. Work neatly.

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except on problems 1 & 2. Work neatly. Introduction to Statistics Math 1040 Sample Exam III Chapters 8-10 4 Problem Pages 3 Formula/Table Pages Time Limit: 90 Minutes 1 No Scratch Paper Calculator Allowed: Scientific Name: The point value of

More information

their contents. If the sample mean is 15.2 oz. and the sample standard deviation is 0.50 oz., find the 95% confidence interval of the true mean.

their contents. If the sample mean is 15.2 oz. and the sample standard deviation is 0.50 oz., find the 95% confidence interval of the true mean. Math 1342 Exam 3-Review Chapters 7-9 HCCS **************************************************************************************** Name Date **********************************************************************************************

More information

Math 101: Elementary Statistics Tests of Hypothesis

Math 101: Elementary Statistics Tests of Hypothesis Tests of Hypothesis Department of Mathematics and Computer Science University of the Philippines Baguio November 15, 2018 Basic Concepts of Statistical Hypothesis Testing A statistical hypothesis is an

More information

Chapter 9. Hypothesis testing. 9.1 Introduction

Chapter 9. Hypothesis testing. 9.1 Introduction Chapter 9 Hypothesis testing 9.1 Introduction Confidence intervals are one of the two most common types of statistical inference. Use them when our goal is to estimate a population parameter. The second

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1 MAT 2379, Introduction to Biostatistics Sample Calculator Problems for the Final Exam Note: The exam will also contain some problems

More information

One- and Two-Sample Tests of Hypotheses

One- and Two-Sample Tests of Hypotheses One- and Two-Sample Tests of Hypotheses 1- Introduction and Definitions Often, the problem confronting the scientist or engineer is producing a conclusion about some scientific system. For example, a medical

More information

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E Salt Lake Community College MATH 1040 Final Exam Fall Semester 011 Form E Name Instructor Time Limit: 10 minutes Any hand-held calculator may be used. Computers, cell phones, or other communication devices

More information

Hypothesis testing for µ:

Hypothesis testing for µ: University of California, Los Angeles Department of Statistics Statistics 10 Elements of a hypothesis test: Hypothesis testing Instructor: Nicolas Christou 1. Null hypothesis, H 0 (always =). 2. Alternative

More information

Chapter # classifications of unlikely, likely, or very likely to describe possible buying of a product?

Chapter # classifications of unlikely, likely, or very likely to describe possible buying of a product? A. Attribute data B. Numerical data C. Quantitative data D. Sample data E. Qualitative data F. Statistic G. Parameter Chapter #1 Match the following descriptions with the best term or classification given

More information

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving

More information

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution

More information

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Remember: Z.05 = 1.645, Z.01 = 2.33 We will only cover one-sided hypothesis testing (cases 12.3, 12.4.2, 12.5.2,

More information

EXAM 3 Math 1342 Elementary Statistics 6-7

EXAM 3 Math 1342 Elementary Statistics 6-7 EXAM 3 Math 1342 Elementary Statistics 6-7 Name Date ********************************************************************************************************************************************** MULTIPLE

More information

Inferences Based on Two Samples

Inferences Based on Two Samples Chapter 6 Inferences Based on Two Samples Frequently we want to use statistical techniques to compare two populations. For example, one might wish to compare the proportions of families with incomes below

More information

Point Estimation and Confidence Interval

Point Estimation and Confidence Interval Chapter 8 Point Estimation and Confidence Interval 8.1 Point estimator The purpose of point estimation is to use a function of the sample data to estimate the unknown parameter. Definition 8.1 A parameter

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities: CHAPTER 9, 10 Hypothesis Testing Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities: The person is guilty. The person is innocent. To

More information

2.57 when the critical value is 1.96, what decision should be made?

2.57 when the critical value is 1.96, what decision should be made? Math 1342 Ch. 9-10 Review Name SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 9.1 1) If the test value for the difference between the means of two large

More information

QUIZ 4 (CHAPTER 7) - SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100%

QUIZ 4 (CHAPTER 7) - SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100% QUIZ 4 (CHAPTER 7) - SOLUTIONS MATH 119 SPRING 013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100% 1) We want to conduct a study to estimate the mean I.Q. of a pop singer s fans. We want to have 96% confidence

More information

Name: Exam: In-term Two Page: 1 of 8 Date: 12/07/2018. University of Texas at Austin, Department of Mathematics M358K - Applied Statistics TRUE/FALSE

Name: Exam: In-term Two Page: 1 of 8 Date: 12/07/2018. University of Texas at Austin, Department of Mathematics M358K - Applied Statistics TRUE/FALSE Exam: In-term Two Page: 1 of 8 Date: 12/07/2018 Name: TRUE/FALSE 1.1 TRUE FALSE University of Texas at Austin, Department of Mathematics M358K - Applied Statistics MULTIPLE CHOICE 1.2 TRUE FALSE 1.3 TRUE

More information

8.1 Frequency Distribution, Frequency Polygon, Histogram page 326

8.1 Frequency Distribution, Frequency Polygon, Histogram page 326 page 35 8 Statistics are around us both seen and in ways that affect our lives without us knowing it. We have seen data organized into charts in magazines, books and newspapers. That s descriptive statistics!

More information

Math 2000 Practice Final Exam: Homework problems to review. Problem numbers

Math 2000 Practice Final Exam: Homework problems to review. Problem numbers Math 2000 Practice Final Exam: Homework problems to review Pages: Problem numbers 52 20 65 1 181 14 189 23, 30 245 56 256 13 280 4, 15 301 21 315 18 379 14 388 13 441 13 450 10 461 1 553 13, 16 561 13,

More information

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COURSE: CBS 221 DISCLAIMER The contents of this document are intended for practice and leaning purposes at the undergraduate

More information

15: CHI SQUARED TESTS

15: CHI SQUARED TESTS 15: CHI SQUARED ESS MULIPLE CHOICE QUESIONS In the following multiple choice questions, please circle the correct answer. 1. Which statistical technique is appropriate when we describe a single population

More information

Statistics for IT Managers

Statistics for IT Managers Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample

More information

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Instructor Name Time Limit: 120 minutes Any calculator is okay. Necessary tables and formulas are attached to the back of the exam.

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval

More information

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49 4 HYPOTHESIS TESTING 49 4 Hypothesis testing In sections 2 and 3 we considered the problem of estimating a single parameter of interest, θ. In this section we consider the related problem of testing whether

More information

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved Chapter 3 Numerically Summarizing Data Section 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data

More information

STP 226 EXAMPLE EXAM #3 INSTRUCTOR:

STP 226 EXAMPLE EXAM #3 INSTRUCTOR: STP 226 EXAMPLE EXAM #3 INSTRUCTOR: Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned. Signed Date PRINTED

More information

Chapter Six: Two Independent Samples Methods 1/51

Chapter Six: Two Independent Samples Methods 1/51 Chapter Six: Two Independent Samples Methods 1/51 6.3 Methods Related To Differences Between Proportions 2/51 Test For A Difference Between Proportions:Introduction Suppose a sampling distribution were

More information

Mt. Douglas Secondary

Mt. Douglas Secondary Foundations of Math 11 Calculator Usage 207 HOW TO USE TI-83, TI-83 PLUS, TI-84 PLUS CALCULATORS FOR STATISTICS CALCULATIONS shows it is an actual calculator key to press 1. Using LISTS to Calculate Mean,

More information

SMAM 314 Exam 42 Name

SMAM 314 Exam 42 Name SMAM 314 Exam 42 Name Mark the following statements True (T) or False (F) (10 points) 1. F A. The line that best fits points whose X and Y values are negatively correlated should have a positive slope.

More information

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 Winter 2012 Lecture 13 (Winter 2011) Estimation Lecture 13 1 / 33 Review of Main Concepts Sampling Distribution of Sample Mean

More information

Math 10 - Compilation of Sample Exam Questions + Answers

Math 10 - Compilation of Sample Exam Questions + Answers Math 10 - Compilation of Sample Exam Questions + Sample Exam Question 1 We have a population of size N. Let p be the independent probability of a person in the population developing a disease. Answer the

More information

Chapter 8: Confidence Intervals

Chapter 8: Confidence Intervals Chapter 8: Confidence Intervals Introduction Suppose you are trying to determine the mean rent of a two-bedroom apartment in your town. You might look in the classified section of the newspaper, write

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. describes the.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. describes the. Practice Test 3 Math 1342 Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) The term z α/2 σn describes the. 1) A) maximum error of estimate

More information

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation

More information

Simple Linear Regression: One Qualitative IV

Simple Linear Regression: One Qualitative IV Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression

More information

Problem Set 4 - Solutions

Problem Set 4 - Solutions Problem Set 4 - Solutions Econ-310, Spring 004 8. a. If we wish to test the research hypothesis that the mean GHQ score for all unemployed men exceeds 10, we test: H 0 : µ 10 H a : µ > 10 This is a one-tailed

More information

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15 Chapter 22 Comparing Two Proportions Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 15 Introduction In Ch.19 and Ch.20, we studied confidence interval and test for proportions,

More information

Exam 2 (KEY) July 20, 2009

Exam 2 (KEY) July 20, 2009 STAT 2300 Business Statistics/Summer 2009, Section 002 Exam 2 (KEY) July 20, 2009 Name: USU A#: Score: /225 Directions: This exam consists of six (6) questions, assessing material learned within Modules

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3 Review Chapter 3: Examining Relationships 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study

More information

Inferences About Two Proportions

Inferences About Two Proportions Inferences About Two Proportions Quantitative Methods II Plan for Today Sampling two populations Confidence intervals for differences of two proportions Testing the difference of proportions Examples 1

More information

DSST Principles of Statistics

DSST Principles of Statistics DSST Principles of Statistics Time 10 Minutes 98 Questions Each incomplete statement is followed by four suggested completions. Select the one that is best in each case. 1. Which of the following variables

More information

SL - Binomial Questions

SL - Binomial Questions IB Questionbank Maths SL SL - Binomial Questions 262 min 244 marks 1. A random variable X is distributed normally with mean 450 and standard deviation 20. Find P(X 475). Given that P(X > a) = 0.27, find

More information

# of 6s # of times Test the null hypthesis that the dice are fair at α =.01 significance

# of 6s # of times Test the null hypthesis that the dice are fair at α =.01 significance Practice Final Exam Statistical Methods and Models - Math 410, Fall 2011 December 4, 2011 You may use a calculator, and you may bring in one sheet (8.5 by 11 or A4) of notes. Otherwise closed book. The

More information

Inference About Means and Proportions with Two Populations. Chapter 10

Inference About Means and Proportions with Two Populations. Chapter 10 Inference About Means and Proportions with Two Populations Chapter 10 Two Populations? Chapter 8 we found interval estimates for the population mean and population proportion based on a random sample Chapter

More information

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance Chapter 8 Student Lecture Notes 8-1 Department of Economics Business Statistics Chapter 1 Chi-square test of independence & Analysis of Variance ECON 509 Dr. Mohammad Zainal Chapter Goals After completing

More information

(8 One-and Two-Sample Test Of Hypothesis)

(8 One-and Two-Sample Test Of Hypothesis) (8 One-and Two-Sample Test Of Hypothesis) Single Mean: Q1) Suppose that we are interested in making some statistical inferences about the mean, μ, of a normal population with standard deviation σ = 2.0.

More information

Inferences About Two Population Proportions

Inferences About Two Population Proportions Inferences About Two Population Proportions MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018 Background Recall: for a single population the sampling proportion

More information

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly.

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly. Introduction to Statistics Math 1040 Sample Final Exam - Chapters 1-11 6 Problem Pages Time Limit: 1 hour and 50 minutes Open Textbook Calculator Allowed: Scientific Name: The point value of each problem

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 7 Inferences Based on Two Samples: Confidence Intervals & Tests of Hypotheses Content 1. Identifying the Target Parameter 2. Comparing Two Population Means:

More information

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests Overall Overview INFOWO Statistics lecture S3: Hypothesis testing Peter de Waal Department of Information and Computing Sciences Faculty of Science, Universiteit Utrecht 1 Descriptive statistics 2 Scores

More information

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3) STAT 515 -- Chapter 9: Two-Sample Problems Paired Differences (Section 9.3) Examples of Paired Differences studies: Similar subjects are paired off and one of two treatments is given to each subject in

More information

Math 2200 Fall 2014, Exam 3 You may use any calculator. You may use a 4 6 inch notecard as a cheat sheet.

Math 2200 Fall 2014, Exam 3 You may use any calculator. You may use a 4 6 inch notecard as a cheat sheet. 1 Math 2200 Fall 2014, Exam 3 You may use any calculator. You may use a 4 6 inch notecard as a cheat sheet. Warning to the Reader! If you are a student for whom this document is a historical artifact,

More information

Chapter. Hypothesis Testing with Two Samples. Copyright 2015, 2012, and 2009 Pearson Education, Inc. 1

Chapter. Hypothesis Testing with Two Samples. Copyright 2015, 2012, and 2009 Pearson Education, Inc. 1 Chapter 8 Hypothesis Testing with Two Samples Copyright 2015, 2012, and 2009 Pearson Education, Inc 1 Two Sample Hypothesis Test Compares two parameters from two populations Sampling methods: Independent

More information

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Quantitative Methods for Economics, Finance and Management (A86050 F86050) Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge

More information

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α Chapter 8 Notes Section 8-1 Independent and Dependent Samples Independent samples have no relation to each other. An example would be comparing the costs of vacationing in Florida to the cost of vacationing

More information

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Overview Introduction 10-1 Scatter Plots and Correlation 10- Regression 10-3 Coefficient of Determination and

More information

Statistical Inference for Means

Statistical Inference for Means Statistical Inference for Means Jamie Monogan University of Georgia February 18, 2011 Jamie Monogan (UGA) Statistical Inference for Means February 18, 2011 1 / 19 Objectives By the end of this meeting,

More information

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1 Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. 9.1-1 Chapter 9 Inferences

More information

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr. Department of Economics Business Statistics Chapter 1 Chi-square test of independence & Analysis of Variance ECON 509 Dr. Mohammad Zainal Chapter Goals After completing this chapter, you should be able

More information

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline)

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) 1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) Gujarati D. Basic Econometrics, Appendix A.8 Barrow M. Statistics

More information

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc. Notes on regression analysis 1. Basics in regression analysis key concepts (actual implementation is more complicated) A. Collect data B. Plot data on graph, draw a line through the middle of the scatter

More information

Estimating a Population Mean

Estimating a Population Mean Estimating a Population Mean MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Objectives At the end of this lesson we will be able to: obtain a point estimate for

More information

1)I have 4 red pens, 3 purple pens and 1 green pen that I use for grading. If I randomly choose a pen,

1)I have 4 red pens, 3 purple pens and 1 green pen that I use for grading. If I randomly choose a pen, Introduction to Statistics 252 1I have 4 red pens, 3 purple pens and 1 green pen that I use for grading. If I randomly choose a pen, a what is the probability that I choose a red pen? 4 8 1 2 b what is

More information

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000 Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Unit 19 Formulating Hypotheses and Making Decisions

Unit 19 Formulating Hypotheses and Making Decisions Unit 19 Formulating Hypotheses and Making Decisions Objectives: To formulate a null hypothesis and an alternative hypothesis, and to choose a significance level To identify the Type I error and the Type

More information

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Inference for Proportions, Variance and Standard Deviation

Inference for Proportions, Variance and Standard Deviation Inference for Proportions, Variance and Standard Deviation Sections 7.10 & 7.6 Cathy Poliak, Ph.D. cathy@math.uh.edu Office Fleming 11c Department of Mathematics University of Houston Lecture 12 Cathy

More information

Problems Pages 1-4 Answers Page 5 Solutions Pages 6-11

Problems Pages 1-4 Answers Page 5 Solutions Pages 6-11 Part III Practice Problems Problems Pages 1-4 Answers Page 5 Solutions Pages 6-11 1. In estimating population mean or proportion what is the width of an interval? 2. If 25 college students out of 80 graduate

More information

79 Wyner Math Academy I Spring 2016

79 Wyner Math Academy I Spring 2016 79 Wyner Math Academy I Spring 2016 CHAPTER NINE: HYPOTHESIS TESTING Review May 11 Test May 17 Research requires an understanding of underlying mathematical distributions as well as of the research methods

More information

Chapter 7: Hypothesis Testing - Solutions

Chapter 7: Hypothesis Testing - Solutions Chapter 7: Hypothesis Testing - Solutions 7.1 Introduction to Hypothesis Testing The problem with applying the techniques learned in Chapter 5 is that typically, the population mean (µ) and standard deviation

More information

Hypothesis testing: Steps

Hypothesis testing: Steps Review for Exam 2 Hypothesis testing: Steps Exam 2 Review 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region 3. Compute

More information

Chapter 6: SAMPLING DISTRIBUTIONS

Chapter 6: SAMPLING DISTRIBUTIONS Chapter 6: SAMPLING DISTRIBUTIONS Read Section 1.5 Graphical methods may not always be sufficient for describing data. Numerical measures can be created for both populations and samples. Definition A numerical

More information

Chapter 6. Estimates and Sample Sizes

Chapter 6. Estimates and Sample Sizes Chapter 6 Estimates and Sample Sizes Lesson 6-1/6-, Part 1 Estimating a Population Proportion This chapter begins the beginning of inferential statistics. There are two major applications of inferential

More information

QUEEN S UNIVERSITY FINAL EXAMINATION FACULTY OF ARTS AND SCIENCE DEPARTMENT OF ECONOMICS APRIL 2018

QUEEN S UNIVERSITY FINAL EXAMINATION FACULTY OF ARTS AND SCIENCE DEPARTMENT OF ECONOMICS APRIL 2018 Page 1 of 4 QUEEN S UNIVERSITY FINAL EXAMINATION FACULTY OF ARTS AND SCIENCE DEPARTMENT OF ECONOMICS APRIL 2018 ECONOMICS 250 Introduction to Statistics Instructor: Gregor Smith Instructions: The exam

More information

Code No: RT051 R13 SET - 1 II B. Tech II Semester Regular/Supplementary Examinations, April/May-017 PROBABILITY AND STATISTICS (Com. to CSE, IT, CHEM, PE, PCE) Time: 3 hours Max. Marks: 70 Note: 1. Question

More information

(8 One- and Two-Sample Test Of Hypothesis)

(8 One- and Two-Sample Test Of Hypothesis) 324 Stat Lecture Notes (8 One- and Two-Sample Test Of ypothesis) ( Book*: Chapter 1,pg319) Probability& Statistics for Engineers & Scientists By Walpole, Myers, Myers, Ye Definition: A statistical hypothesis

More information

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015 AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking

More information

Hypothesis testing: Steps

Hypothesis testing: Steps Review for Exam 2 Hypothesis testing: Steps Repeated-Measures ANOVA 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region

More information

Do not copy, post, or distribute

Do not copy, post, or distribute 14 CORRELATION ANALYSIS AND LINEAR REGRESSION Assessing the Covariability of Two Quantitative Properties 14.0 LEARNING OBJECTIVES In this chapter, we discuss two related techniques for assessing a possible

More information

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2016-17 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This

More information

Chapter 9 Inferences from Two Samples

Chapter 9 Inferences from Two Samples Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review

More information

With our knowledge of interval estimation, we can consider hypothesis tests

With our knowledge of interval estimation, we can consider hypothesis tests Chapter 10 Hypothesis Testing 10.1 Testing Hypotheses With our knowledge of interval estimation, we can consider hypothesis tests An Example of an Hypothesis Test: Statisticians at Employment and Immigration

More information

CHAPTER EIGHT Linear Regression

CHAPTER EIGHT Linear Regression 7 CHAPTER EIGHT Linear Regression 8. Scatter Diagram Example 8. A chemical engineer is investigating the effect of process operating temperature ( x ) on product yield ( y ). The study results in the following

More information

CONTINUOUS RANDOM VARIABLES

CONTINUOUS RANDOM VARIABLES the Further Mathematics network www.fmnetwork.org.uk V 07 REVISION SHEET STATISTICS (AQA) CONTINUOUS RANDOM VARIABLES The main ideas are: Properties of Continuous Random Variables Mean, Median and Mode

More information

Chapter 5: HYPOTHESIS TESTING

Chapter 5: HYPOTHESIS TESTING MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate

More information

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL - MAY 2005 EXAMINATIONS STA 248 H1S. Duration - 3 hours. Aids Allowed: Calculator

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL - MAY 2005 EXAMINATIONS STA 248 H1S. Duration - 3 hours. Aids Allowed: Calculator UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL - MAY 2005 EXAMINATIONS STA 248 H1S Duration - 3 hours Aids Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: There are 17 pages including

More information

Confidence intervals CE 311S

Confidence intervals CE 311S CE 311S PREVIEW OF STATISTICS The first part of the class was about probability. P(H) = 0.5 P(T) = 0.5 HTTHHTTTTHHTHTHH If we know how a random process works, what will we see in the field? Preview of

More information

ANOVA - analysis of variance - used to compare the means of several populations.

ANOVA - analysis of variance - used to compare the means of several populations. 12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.

More information

Ch. 7: Estimates and Sample Sizes

Ch. 7: Estimates and Sample Sizes Ch. 7: Estimates and Sample Sizes Section Title Notes Pages Introduction to the Chapter 2 2 Estimating p in the Binomial Distribution 2 5 3 Estimating a Population Mean: Sigma Known 6 9 4 Estimating a

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should

11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should 11 CHI-SQUARED Chapter 11 Chi-squared Objectives After studying this chapter you should be able to use the χ 2 distribution to test if a set of observations fits an appropriate model; know how to calculate

More information

ECON Introductory Econometrics. Lecture 2: Review of Statistics

ECON Introductory Econometrics. Lecture 2: Review of Statistics ECON415 - Introductory Econometrics Lecture 2: Review of Statistics Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 2-3 Lecture outline 2 Simple random sampling Distribution of the sample

More information