The Central Limit Theorem

Similar documents
Introductory Statistics

6 THE NORMAL DISTRIBUTION

In this chapter, you will study the normal distribution, the standard normal, and applications associated with them.

4/1/2012. Test 2 Covers Topics 12, 13, 16, 17, 18, 14, 19 and 20. Skipping Topics 11 and 15. Topic 12. Normal Distribution

Unit 8: Statistics. SOL Review & SOL Test * Test: Unit 8 Statistics

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67

23.3. Sampling Distributions. Engage Sampling Distributions. Learning Objective. Math Processes and Practices. Language Objective

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)

Describing distributions with numbers

Mt. Douglas Secondary

Probability and Samples. Sampling. Point Estimates

Sampling Distribution of a Sample Proportion

( )( ) of wins. This means that the team won 74 games.

Francine s bone density is 1.45 standard deviations below the mean hip bone density for 25-year-old women of 956 grams/cm 2.

May 2015 Timezone 2 IB Maths Standard Exam Worked Solutions

7.1: What is a Sampling Distribution?!?!

Lecture 11. Data Description Estimation

Exploring the Normal Probability Distribution

(i) The mean and mode both equal the median; that is, the average value and the most likely value are both in the middle of the distribution.

3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability

The empirical ( ) rule

UNIVERSITY OF TORONTO MISSISSAUGA. SOC222 Measuring Society In-Class Test. November 11, 2011 Duration 11:15a.m. 13 :00p.m.

Sections 6.1 and 6.2: The Normal Distribution and its Applications

Introductory Statistics

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Math 2311 Sections 4.1, 4.2 and 4.3

Section 5.1: Probability and area

Standards of Learning Content Review Notes. Grade 7 Mathematics 2 nd Nine Weeks,

CHAPTER 7. Parameters are numerical descriptive measures for populations.

Homework 7. Name: ID# Section

Describing distributions with numbers

IM1: UNIT 3. HOMEWORK PACKET

Quiz 2 covered materials in Chapter 5 Chapter 6 Chapter 7. Normal Probability Distribution. Continuous Probability. distribution 11/9/2010.

Continuous Probability Distributions

Section 3.2 Measures of Central Tendency

Lesson 19: Understanding Variability When Estimating a Population Proportion

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays

Statistics 100 Exam 2 March 8, 2017

Wednesday, February 7, 2018 Warm Up

SESSION 5 Descriptive Statistics

Chapter 7: Random Variables

Introductory Statistics

Continuous random variables

Chapter 7 Discussion Problem Solutions D1 D2. D3.

Chapter 3. Measuring data

The ACCUPLACER (Elementary Algebra) is a 12 question placement exam. Its purpose is to make sure you are put in the appropriate math course.

Honors Algebra 1 - Fall Final Review

Homework 4 Solutions Math 150

LessonPlan A10 MEP NUMERACY SUMMER SCHOOL

What are the mean, median, and mode for the data set below? Step 1

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018

(i) The mean and mode both equal the median; that is, the average value and the most likely value are both in the middle of the distribution.

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management

Topic 2 Part 3 [189 marks]

Section 2 Topic 1 Equations: True or False?

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?!

Practice Test 1 BLACKLINE MASTERS

The Shape, Center and Spread of a Normal Distribution - Basic

CHAPTER 2 Modeling Distributions of Data

+ Check for Understanding

Statistics Revision Questions Nov 2016 [175 marks]

Determining the Spread of a Distribution

Test 2 VERSION B STAT 3090 Spring 2017

Determining the Spread of a Distribution

MATH 1150 Chapter 2 Notation and Terminology

Chapter 3. Equations and Inequalities. 10/2016 LSowatsky 1

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved

Lecture 7: Confidence interval and Normal approximation

Section 5.4. Ken Ueda

Final Exam Review (Math 1342)

National 5 Portfolio Applications Standard Deviation and Scattergraphs

[ z = 1.48 ; accept H 0 ]

Elementary Statistics

Solving and Graphing Linear Inequalities Chapter Questions. 2. Explain the steps to graphing an inequality on a number line.

The Normal Distribution. Chapter 6

Section 2 Equations and Inequalities

Aim The Cat Nap introduces on-the-hour times by showing an analogue clock face to follow a sequence of events.

Sampling, Frequency Distributions, and Graphs (12.1)

Section 7.2 Homework Answers

AP Statistics Review Ch. 7

Chapter 8: Confidence Intervals

their contents. If the sample mean is 15.2 oz. and the sample standard deviation is 0.50 oz., find the 95% confidence interval of the true mean.

6. Cold U? Max = 51.8 F Range = 59.4 F Mean = 33.8 F s = 12.6 F med = 35.6 F IQR = 28.8 F

The chart below shows the fraction and decimal forms of some rational numbers. Write,, or in each blank to make a true sentence.

Directions: This is a practice final exam which covers all chapters in this course. (A) (B) 3 10 (C) 10 3 (D) (E) None of the above

Lesson 8: Representing Proportional Relationships with Equations

Announcements. Unit 3: Foundations for inference Lecture 3: Decision errors, significance levels, sample size, and power.

Algebra I Solving & Graphing Inequalities

Sample Problems for the Final Exam

Section 1: Expressions

Stat 101 Exam 1 Important Formulas and Concepts 1

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly.

20 Hypothesis Testing, Part I

Dimensional Analysis Exercising Problem Solving Skills

Chapter 4: Probability and Probability Distributions

Review of the Normal Distribution

Chapters 4-6: Estimation

Estadística I Exercises Chapter 4 Academic year 2015/16

Chapter. The Normal Probability Distribution 7/24/2011. Section 7.1 Properties of the Normal Distribution

Transcription:

The Central Limit Theorem for Sums By: OpenStaxCollege Suppose X is a random variable with a distribution that may be known or unknown (it can be any distribution) and suppose: 1. μ X = the mean of Χ 2. σ Χ = the standard deviation of X If you draw random samples of size n, then as n increases, the random variable ΣX consisting of sums tends to be normally distributed and ΣΧ ~ N((n)(μ Χ ), ( n)(σ Χ )). The central limit theorem for sums says that if you keep drawing larger and larger samples and taking their sums, the sums form their own normal distribution (the sampling distribution), which approaches a normal distribution as the sample size increases. The normal distribution has a mean equal to the original mean multiplied by the sample size and a standard deviation equal to the original standard deviation multiplied by the square root of the sample size. The random variable ΣX has the following z-score associated with it: 1. Σx is one sum. 2. z = Σx (n)(μ X ) ( n)(σ X ) 1. (n)(μ X ) = the mean of ΣX 2. ( n)(σ X ) = standard deviation of ΣX To find probabilities for sums on the calculator, follow these steps. 2 nd DISTR 2:normalcdf normalcdf(lower value of the area, upper value of the area, (n)(mean), ( n)(standard deviation)) 1/10

where: mean is the mean of the original distribution standard deviation is the standard deviation of the original distribution sample size = n An unknown distribution has a mean of 90 and a standard deviation of 15. A sample of size 80 is drawn randomly from the population. 1. Find the probability that the sum of the 80 values (or the total of the 80 values) is more than 7,500. 2. Find the sum that is 1.5 standard deviations above the mean of the sums. Let X = one value from the original unknown population. The probability question asks you to find a probability for the sum (or total of) 80 values. ΣX = the sum or total of 80 values. Since μ X = 90, σ X = 15, and n = 80, ΣX ~ N((80)(90), ( 80)(15)) mean of the sums = (n)(μ X ) = (80)(90) = 7,200 standard deviation of the sums = ( n)(σ X ) = ( 80)(15) sum of 80 values = Σx = 7,500 a. Find P(Σx > 7,500) P(Σx > 7,500) = 0.0127 normalcdf(lower value, upper value, mean of sums, stdev of sums) The parameter list is abbreviated(lower, upper, (n)(μ X, ( n)(σ X )) normalcdf (7500,1E99,(80)(90),( 80)(15)) = 0.0127 Reminder 2/10

1E99 = 10 99. Press the EE key for E. b. Find Σx where z = 1.5. Σx = (n)(μ X ) + (z)( n)(σ Χ ) = (80)(90) + (1.5)( 80)(15) = 7,401.2 Try It An unknown distribution has a mean of 45 and a standard deviation of eight. A sample size of 50 is drawn randomly from the population. Find the probability that the sum of the 50 values is more than 2,400. 0.0040 To find percentiles for sums on the calculator, follow these steps. 2 nd DIStR 3:invNorm k = invnorm (area to the left of k, (n)(mean), ( n)(standard deviation)) where: k is the k th percentile mean is the mean of the original distribution standard deviation is the standard deviation of the original distribution sample size = n In a recent study reported Oct. 29, 2012 on the Flurry Blog, the mean age of tablet users is 34 years. Suppose the standard deviation is 15 years. The sample of size is 50. 1. What are the mean and standard deviation for the sum of the ages of tablet users? What is the distribution? 2. Find the probability that the sum of the ages is between 1,500 and 1,800 years. 3. Find the 80 th percentile for the sum of the 50 ages. 1. μ Σx = nμ x = 50(34) = 1,700 and σ Σx = nσ x = ( 50 )(15) = 106.01 The distribution is normal for sums by the central limit theorem. 2. P(1500 < Σx < 1800) = normalcdf (1,500, 1,800, (50)(34), ( 50 )(15)) = 0.7974 3. Let k = the 80 th percentile. k = invnorm(0.80,(50)(34),( 50 )(15)) = 1,789.3 3/10

Try It In a recent study reported Oct.29, 2012 on the Flurry Blog, the mean age of tablet users is 35 years. Suppose the standard deviation is ten years. The sample size is 39. 1. What are the mean and standard deviation for the sum of the ages of tablet users? What is the distribution? 2. Find the probability that the sum of the ages is between 1,400 and 1,500 years. 3. Find the 90 th percentile for the sum of the 39 ages. 1. μ Σx = nμ x = 1,365 and σ Σx = nσ x = 62.4 The distribution is normal for sums by the central limit theorem. 2. P(1400 < Σ x < 1500) = normalcdf (1400,1500,(39)(35),( 39)(10)) = 0.2723 3. Let k = the 90 th percentile. k = invnorm(0.90,(39)(35),( 39) (10)) = 1445.0 The mean number of minutes for app engagement by a tablet user is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample of size 70. 1. What are the mean and standard deviation for the sums? 2. Find the 95 th percentile for the sum of the sample. Interpret this value in a complete sentence. 3. Find the probability that the sum of the sample is at least ten hours. 1. μ Σx = nμ x = 70(8.2) = 574 minutes and σ Σx = ( n)(σ x) = ( 70 )(1) = 8.37 minutes 2. Let k = the 95 th percentile. k = invnorm (0.95,(70)(8.2),( 70)(1)) = 587.76 minutes Ninety five percent of the app engagement times are at most 587.76 minutes. 3. ten hours = 600 minutes P(Σx 600) = normalcdf(600,e99,(70)(8.2),( 70)(1)) = 0.0009 The mean number of minutes for app engagement by a table use is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample size of 70. 1. What is the probability that the sum of the sample is between seven hours and ten hours? What does this mean in context of the problem? 2. Find the 84 th and 16 th percentiles for the sum of the sample. Interpret these values in context. 1. 7 hours = 420 minutes 10 hours = 600 minutes normalcdf P(420 Σx 600) = normalcdf(420, 600, (70)(8.2), 70(1)) = 0.9991 4/10

This means that for this sample sums there is a 99.9% chance that the sums of usage minutes will be between 420 minutes and 600 minutes. 2. invnorm(0.84, (70)(8.2), 70(1)) = 582.32 invnorm(0.16, (70)(8.2), 70(1)) = 565.68 Since 84% of the app engagement times are at most 582.32 minutes and 16% of the app engagement times are at most 565.68 minutes, we may state that 68% of the app engagement times are between 565.68 minutes and 582.32 minutes. References Farago, Peter. The Truth About Cats and Dogs: Smartphone vs Tablet Usage Differences. The Flurry Blog, 2013. Posted October 29, 2012. Available online at http://blog.flurry.com (accessed May 17, 2013). Chapter Review The central limit theorem tells us that for a population with any distribution, the distribution of the sums for the sample means approaches a normal distribution as the sample size increases. In other words, if the sample size is large enough, the distribution of the sums can be approximated by a normal distribution even if the original population is not normally distributed. Additionally, if the original population has a mean of μ X and a standard deviation of σ x, the mean of the sums is nμ x and the standard deviation is ( n) (σ x ) where n is the sample size. Formula Review The Central Limit Theorem for Sums: X ~ N[(n)(μ x ),( n)(σ x )] Mean for Sums ( X): (n)(μ x ) The Central Limit Theorem for Sums z-score and standard deviation for sums: zfor the sample mean = Σx (n)(μ X ) ( n)(σ X ) Standard deviation for Sums ( X): ( n)(σ x ) Use the following information to answer the next four exercises: An unknown distribution has a mean of 80 and a standard deviation of 12. A sample size of 95 is drawn randomly from the population. Find the probability that the sum of the 95 values is greater than 7,650. 0.3345 5/10

Find the probability that the sum of the 95 values is less than 7,400. Find the sum that is two standard deviations above the mean of the sums. 7,833.92 Find the sum that is 1.5 standard deviations below the mean of the sums. Use the following information to answer the next five exercises: The distribution of results from a cholesterol test has a mean of 180 and a standard deviation of 20. A sample size of 40 is drawn randomly. Find the probability that the sum of the 40 values is greater than 7,500. 0.0089 Find the probability that the sum of the 40 values is less than 7,000. Find the sum that is one standard deviation above the mean of the sums. 7,326.49 Find the sum that is 1.5 standard deviations below the mean of the sums. Find the percentage of sums between 1.5 standard deviations below the mean of the sums and one standard deviation above the mean of the sums. 77.45% Use the following information to answer the next six exercises: A researcher measures the amount of sugar in several cans of the same soda. The mean is 39.01 with a standard deviation of 0.5. The researcher randomly selects a sample of 100. Find the probability that the sum of the 100 values is greater than 3,910. Find the probability that the sum of the 100 values is less than 3,900. 0.4207 Find the probability that the sum of the 100 values falls between the numbers you found in [link] and [link]. 6/10

Find the sum with a z score of 2.5. 3,888.5 Find the sum with a z score of 0.5. Find the probability that the sums will fall between the z-scores 2 and 1. 0.8186 Use the following information to answer the next four exercise: An unknown distribution has a mean 12 and a standard deviation of one. A sample size of 25 is taken. Let X = the object of interest. What is the mean of ΣX? What is the standard deviation of ΣX? 5 What is P(Σx = 290)? What is P(Σx > 290)? 0.9772 True or False: only the sums of normal distributions are also normal distributions. In order for the sums of a distribution to approach a normal distribution, what must be true? The sample size, n, gets larger. What three things must you know about a distribution to find the probability of sums? An unknown distribution has a mean of 25 and a standard deviation of six. Let X = one object from this distribution. What is the sample size if the standard deviation of ΣX is 42? 49 An unknown distribution has a mean of 19 and a standard deviation of 20. Let X = the object of interest. What is the sample size if the mean of ΣX is 15,200? 7/10

Use the following information to answer the next three exercises. A market researcher analyzes how many electronics devices customers buy in a single purchase. The distribution has a mean of three with a standard deviation of 0.7. She samples 400 customers. What is the z-score for Σx = 840? 26.00 What is the z-score for Σx = 1,186? What is P(Σx < 1,186)? 0.1587 Use the following information to answer the next three exercises: An unkwon distribution has a mean of 100, a standard deviation of 100, and a sample size of 100. Let X = one object of interest. What is the mean of ΣX? What is the standard deviation of ΣX? 1,000 What is P(Σx > 9,000)? Homework Which of the following is NOT TRUE about the theoretical distribution of sums? 1. The mean, median and mode are equal. 2. The area under the curve is one. 3. The curve never touches the x-axis. 4. The curve is skewed to the right. Suppose that the duration of a particular type of criminal trial is known to have a mean of 21 days and a standard deviation of seven days. We randomly sample nine trials. 1. In words, ΣX = 2. ΣX ~ (, ) 3. Find the probability that the total length of the nine trials is at least 225 days. 8/10

4. Ninety percent of the total of nine of these types of trials will last at least how long? 1. the total length of time for nine criminal trials 2. N(189, 21) 3. 0.0432 4. 162.09; ninety percent of the total nine trials of this type will last 162 days or more. Suppose that the weight of open boxes of cereal in a home with children is uniformly distributed from two to six pounds with a mean of four pounds and standard deviation of 1.1547. We randomly survey 64 homes with children. 1. In words, X = 2. The distribution is. 3. In words, ΣX = 4. ΣX ~ (, ) 5. Find the probability that the total weight of open boxes is less than 250 pounds. 6. Find the 35 th percentile for the total weight of open boxes of cereal. Salaries for teachers in a particular elementary school district are normally distributed with a mean of $44,000 and a standard deviation of $6,500. We randomly survey ten teachers from that district. 1. In words, X = 2. X ~ (, ) 3. In words, ΣX = 4. ΣX ~ (, ) 5. Find the probability that the teachers earn a total of over $400,000. 6. Find the 90 th percentile for an individual teacher's salary. 7. Find the 90 th percentile for the sum of ten teachers' salary. 8. If we surveyed 70 teachers instead of ten, graphically, how would that change the distribution in part d? 9. If each of the 70 teachers received a $3,000 raise, graphically, how would that change the distribution in part b? 1. X = the salary of one elementary school teacher in the district 2. X ~ N(44,000, 6,500) 3. ΣX ~ sum of the salaries of ten elementary school teachers in the sample 4. ΣX ~ N(44000, 20554.80) 5. 0.9742 6. $52,330.09 7. 466,342.04 8. Sampling 70 teachers instead of ten would cause the distribution to be more spread out. It would be a more symmetrical normal curve. 9/10

9. If every teacher received a $3,000 raise, the distribution of X would shift to the right by $3,000. In other words, it would have a mean of $47,000. 10/10