(i) The mean and mode both equal the median; that is, the average value and the most likely value are both in the middle of the distribution.

Similar documents
(i) The mean and mode both equal the median; that is, the average value and the most likely value are both in the middle of the distribution.

The Chi-Square Distributions

The Chi-Square Distributions

6 THE NORMAL DISTRIBUTION

Francine s bone density is 1.45 standard deviations below the mean hip bone density for 25-year-old women of 956 grams/cm 2.

Section 5.4. Ken Ueda

What does a population that is normally distributed look like? = 80 and = 10

In this chapter, you will study the normal distribution, the standard normal, and applications associated with them.

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67

Chapter. The Normal Probability Distribution 7/24/2011. Section 7.1 Properties of the Normal Distribution

Looking at data: distributions - Density curves and Normal distributions. Copyright Brigitte Baldi 2005 Modified by R. Gordon 2009.

Exploring the Normal Probability Distribution

Continuous random variables

Normal distributions

Statistics Lecture 3

Section 8.5 The Normal Distribution

Using Tables and Graphing Calculators in Math 11

Mt. Douglas Secondary

4/1/2012. Test 2 Covers Topics 12, 13, 16, 17, 18, 14, 19 and 20. Skipping Topics 11 and 15. Topic 12. Normal Distribution

Continuous Random Variables

CHAPTER 6 SOME CONTINUOUS PROBABILITY DISTRIBUTIONS. 6.2 Normal Distribution. 6.1 Continuous Uniform Distribution

9/19/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1

Density Curves and the Normal Distributions. Histogram: 10 groups

Elementary Statistics

( )( ) of wins. This means that the team won 74 games.

Statistics 100 Exam 2 March 8, 2017

Math 2311 Sections 4.1, 4.2 and 4.3

Essential Question: What are the standard intervals for a normal distribution? How are these intervals used to solve problems?

Unit 4 Probability. Dr Mahmoud Alhussami

3.1 Measure of Center

Continuous Probability Distributions

Chapter 3: The Normal Distributions

Reminders. Homework due tomorrow Quiz tomorrow

Page 312, Exercise 50

Chapter 23. Inference About Means

Complement: 0.4 x 0.8 = =.6

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation

Chapter 2 Solutions Page 15 of 28

3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability

EQ: What is a normal distribution?

Essential Statistics Chapter 6

Unit 8: Statistics. SOL Review & SOL Test * Test: Unit 8 Statistics

Continuous RVs. 1. Suppose a random variable X has the following probability density function: π, zero otherwise. f ( x ) = sin x, 0 < x < 2

The empirical ( ) rule

Exam #2 Results (as percentages)

Business Statistics. Chapter 6 Review of Normal Probability Distribution QMIS 220. Dr. Mohammad Zainal

1) What is the probability that the random variable has a value less than 3? 1)

Recall that the standard deviation σ of a numerical data set is given by

The Standard Deviation as a Ruler and the Normal Model

and the Sample Mean Random Sample

Section 5.1: Probability and area

Chapter (7) Continuous Probability Distributions Examples

Practice problems from chapters 2 and 3

STA 218: Statistics for Management

The Normal Model. Copyright 2009 Pearson Education, Inc.

This is Continuous Random Variables, chapter 5 from the book Beginning Statistics (index.html) (v. 1.0).

STT 315 This lecture is based on Chapter 2 of the textbook.

Q 1 = 23.8 M = Q 3 = 29.8 IQR = 6 The numbers are in order and there are 18 pieces of data so the median is the average of the 9th and 10th

Review. Midterm Exam. Midterm Review. May 6th, 2015 AMS-UCSC. Spring Session 1 (Midterm Review) AMS-5 May 6th, / 24

Section 7.2 Homework Answers

Standard Normal Calculations

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010

Math 140 Introductory Statistics

Math 140 Introductory Statistics

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Chapter 3 additional notes

The Shape, Center and Spread of a Normal Distribution - Basic

Will Landau. Feb 28, 2013

AP Statistics Ch 7 Random Variables

Standard Deviation and z Scores

Stat 20 Midterm 1 Review

Chapter 6 Group Activity - SOLUTIONS

NORMAL CURVE STANDARD SCORES AND THE NORMAL CURVE AREA UNDER THE NORMAL CURVE AREA UNDER THE NORMAL CURVE 9/11/2013

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.

Describing distributions with numbers

UNIVERSITY OF TORONTO MISSISSAUGA. SOC222 Measuring Society In-Class Test. November 11, 2011 Duration 11:15a.m. 13 :00p.m.

Chapter # classifications of unlikely, likely, or very likely to describe possible buying of a product?

STAT 200 Chapter 1 Looking at Data - Distributions

Calculating Normal Distribution Probabilities

Quiz 2 covered materials in Chapter 5 Chapter 6 Chapter 7. Normal Probability Distribution. Continuous Probability. distribution 11/9/2010.

What are the mean, median, and mode for the data set below? Step 1

Advanced/Advanced Subsidiary. You must have: Mathematical Formulae and Statistical Tables (Pink)

Section 7.1 Properties of the Normal Distribution

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data

23.3. Sampling Distributions. Engage Sampling Distributions. Learning Objective. Math Processes and Practices. Language Objective

Density curves and the normal distribution

Chapter 8: Confidence Intervals

Part 3: Parametric Models

12.4. The Normal Distribution: A Problem-Solving Tool

Using a Graphing Calculator

6. Cold U? Max = 51.8 F Range = 59.4 F Mean = 33.8 F s = 12.6 F med = 35.6 F IQR = 28.8 F

Chapter 9 - Correlation and Regression

Online Practice Quiz KEY Chapter 2: Modeling Distributions of Data

Content by Week Week of October 14 27

Exercises from Chapter 3, Section 1

Sampling, Frequency Distributions, and Graphs (12.1)

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Continuous random variables

Graphing Skill #1: What Type of Graph is it? There are several types of graphs that scientists often use to display data.

8.1 Frequency Distribution, Frequency Polygon, Histogram page 326

Transcription:

MATH 183 Normal Distributions Dr. Neal, WKU Measurements that are normally distributed can be described in terms of their mean µ and standard deviation!. These measurements should have the following properties: (i) The mean and mode both equal the median; that is, the average value and the most likely value are both in the middle of the distribution. (ii) The measurements are symmetric about the mean. (iii) (The 68 95 99.7 Rule): Around 68% of the measurements should be within one standard deviation of average, around 95% should be within two standard deviations of average, and around 99.7% of the measurements should be within three standard deviations of average. (iv) A histogram of measurements create a Bell-Shaped Curve with the percentages at the high and low ends dropping off exponentially. Such a measurement is denoted by X ~ N (µ,! ). When µ = 0 and! = 1, then we have the standard normal distribution that is denoted by Z ~ N (0, 1). Typical Histogram Theoretical Shape Example 1. ( Good Final Exam Question ): In the U.S., birth weights are normally distributed with a mean of 7 pounds and a standard deviation of 2 pounds. Explain what this means in terms of the properties of a normal distribution. Solution. Let! be all babies born in the U.S., and let X denote the birth weight. Then X ~ N (7, 2) lbs. That is, (i) In the U.S, the average birth weight, the most likely birth weight, and the median birth weight are all 7 lbs. (ii) Birth weights as a whole are symmetric about the weight 7 lbs. (iii) Around 68% of newborn weights are from 5 to 9 lbs (µ ±! ); around 95% of newborn weights are from 3 to 11 lbs (µ ± 2! ); and around 99.7% of newborn weights are from 1 to 13 lbs (µ ± 3! ). (iv) A histogram of newborn birth weights create a Bell-Shaped Curve with the percentages of high and low weights dropping off exponentially.

Example 2 A hospital weighs all the babies that are born in the maternity ward. The weights in pounds for one particular week are as follows: 5.64 5.39 5.86 7.99 7.89 11.27 7.31 6.54 7.53 4.15 2.93 7.08 5.08 7.46 9.71 9.17 7.50 7.10 9.30 4.69 8.38 2.74 6.02 6.94 6.89 6.33 6.07 5.96 8.05 6.81 8.04 6.57 7.26 6.88 8.05 4.91 Do these weights actually appear to be normally distributed? Here let! be all babies born at this hospital and let X be the birth weight. To study the sample, we first shall construct a histogram with range [2, 12] on the x -axis with bins of length 0.5. We see that the histogram somewhat resembles a bell-shaped curve, with symmetry about the middle, the most likely values in the middle, and the numbers of measurements somewhat tailing off at both extreme ends. If we compute the statistics, we obtain a mean of µ 6.819, a standard deviation of! 1.733, and a median of 6.915. So the mean is close to the median, and the most likely values occur in the bin from 6.5 lbs. to 7 lbs. which also contains the mean and the median. Thus, it may be safe to say that the mean and mode both equal the median. The range µ ±! is 5.086 to 8.552 lbs. and contains 26 out of 36 or about 72.2% of the weights. The range µ ± 2! is 3.353 to 10.285 lbs. and contains 33 out of 36 or about 91.67% of the weights. The range µ ± 3! is 1.62 to 12.018 lbs. and contains 100% of the weights. So the 68-95-99.7 rule is slightly violated, but overall these baby weights seem to be close to normally distributed. By including many more measurements over more weeks, we would possibly come to a stronger belief that new-born baby weights are in fact normally distributed. It is possible to use a chi-square test to determine whether or not a measurement is normally distributed based only upon a sample of measurements. (Details of the chi-square test are to come later in the course.)

Calculator Exercise We can use the TI-83/84 to generate a list of normally distributed measurements. (a) Choose values for the mean µ and the standard deviation! of your desired distribution X ~ N (µ,! ). Under MATH PRB find the command randnorm( (item 6). Enter the command randnorm(µ, σ, 100) L1 with your values of µ and! in order to enter 100 measurements from your distribution into list L1. (b) Draw and label a histogram of your measurements with appropriately sized bins. Does your histogram appear to follow a bell-shaped curve? Generally use an X range of µ 3! to µ + 3!. (c) Is the median of your measurements close to the true mean µ and are your measurements somewhat symmetric about the true mean µ? (d) With your values of µ and!, find the bounds µ ±!, µ ± 2!, and µ ± 3!. (e) Sort your measurements in your list and find the actual percentages of your measurements that are within each bound µ ±!, µ ± 2!, and µ ± 3!. (f) Do you conclude that your calculator generated a somewhat normal distribution? Normal Calculations Given a normal distribution X ~ N (µ,! ), we wish to find various probabilities of where an arbitrary measurement may lie. For instance, we could find P( a! X! b), which is the probability that an random measurement X lies between a and b. P( a! X! b) P(X! k) P(X! k) a b k k We also may wish to find the proportion less than a value k (or at most k ), denoted by P(X < k) (or P(X! k) ). Finally, we may want the proportion greater than k (or at least k ), denoted by P(X > k) (or P(X! k) ). These proportions can be computed with the built-in normalcdf( command (item 2) from the DISTR menu:

P(a! X! b) = normalcdf(a, b, µ,! ) P(X < k) = P(X! k) = normalcdf( 1E99, k, µ,! ) P(X > k) = P(X! k) = normalcdf(k, 1E99, µ,! ) The bounds 1E99 and E99 are used as estimates of and +. Inverse Normal Calculation To find the value x for which P(X! x ) equals a desired proportion p (an inverse normal calculation), we use the command invnorm( p,µ,! ). The invnorm( command is also found in the DISTR (2nd Vars) menu. Example 3. The lengths of human pregnancies are approximately normally distributed with a mean of 266 days and a standard deviation of 16 days. (a) What is the population!? What is the measurement X and its distribution? (b) What percent of pregnancies last at most 240 days? (c) What percent of pregnancies last from 240 to 270 days? (d) How long do the longest 20% of pregnancies last? Solution. (a) Here! is the population of all women who have given birth and X is the measurement of how many days the pregnancy lasted. Then X N(266, 16). (b) For X ~ N(266, 16), we wish to find P(X! 240). We use the command normalcdf( 1E99, 240, 266, 16). We see that around 5.2% of pregnancies last at most 240 days. (c) To find P(240! X! 270), enter the command normalcdf(240, 270, 266, 16). We see that around 54.66% of pregnancies last from 240 to 270 days (d) To find how long the longest 20% of pregnancies last, we must find the value x for which P(X! x ) = 0.20. But to use the invnorm( command, we instead must find x such that P(X! x ) = 0.80. Using the command invnorm(.80, 266, 16), we see that the longest 20% of pregnancies last at least 279.46 days.

Example 4. Heights of adult women are normally distributed with a mean of 65.5 inches and a standard deviation of 2.75 inches. What percentage of women are (a) at least 70 in. tall? (b) at most 63 in. tall? (c) from 64 to 68 in. tall? (d) What height is such that 95% of all women are below this height? (e) What height is such that 90% of all women are above this height? (f) What two heights, symmetric about the mean, contain 50% of all heights? Solution. Here,! = All adult women and X = height in inches. Then X ~ N(65.5, 2.75). (a) At least 70, meaning 70 or more: Enter the command normalcdf(70, 1 E99, 65.5, 2.75) to obtain P(X! 70) 0.05088. So about 5.09% of women are at least 70 inches tall. (b) At most 63, meaning up to 63: Enter the command normalcdf( 1 E99, 63, 65.5, 2.75) to obtain P(X! 63) 0.18165. So about 18.165% of women are at most 63 inches tall. (c) P(64! X! 68) 0.5256. So about 52.56% of women are from 64 to 68 inches tall. (d) We must find x such that P(X < x) = 0.95. To do so, enter invnorm(.95, 65.5, 2.75) to obtain x 70 inches. (e) We must find x such that P(X > x) = 0.90 or equivalently such that P(X! x) = 0.10. To do so, enter invnorm(.10, 65.5, 2.75) to obtain x 61.976 inches. (f) If we want 50% of heights in the middle, then we need x and y such that P(x! X! y) = 0.50. But then we need 25% of the heights at each tail. So we need x such that P(X! x) = 0.25 and y such that P(X! y) = 0.75. Enter invnorm(.25, 65.5, 2.75) to obtain x 63.645 inches and invnorm(.75, 65.5, 2.75) to obtain y 67.355 in.

Standard Normal Distribution Suppose X ~ N (µ,! ) is a normally distributed measurement. For example IQ scores are such that X ~ N (100, 15). Then most measurements (about 99.7%) are within µ ± 3!, which is 55 to 145 for IQ scores. If we subtract µ from every measurement, then we still have a normal distribution, but most values will be between 3! and 3!. By subtracting µ, the result is N (0,! ). Next, suppose we divide the new values by!. Then most values will be between 1 and 1. The result is now N (0, 1). By subtracting µ from every measurement and then dividing by!, we have standardized the values and have obtained the standard normal distribution Z ~ N (0, 1). Let X ~ N (µ,! ) and Z = X! µ ". Then Z ~ N (0, 1). For Z ~ N (0, 1), we still can compute the various probabilities and inverse calculations using the normalcdf( and invnorm commands. For example, P(Z! "1.22) and P(!0.85 " Z " 1.05) are shown below. 1 1 Z ~ N (0, 1) P(Z! "1.22) P(!0.85 " Z " 1.05) Example 5. Let Z ~ N (0, 1). (a) Find the number z such that P(Z! z) = 0.05. (b) Find the numbers w and z such that P(w! Z! z) = 0.95. Solution. For Part (a), we actually need P(Z! z) = 0.95. So enter the command invnorm(.95, 0, 1) to obtain z 1.645. For Part (b), we need w and z such that P(Z! w) = 0.025 and P(Z! z) = 0.975. So enter the commands invnorm(.025, 0, 1) and invnorm(.025, 0,1 ) to obtain w 1.96 and z 1.96.

By converting different normal distributions X and Y to standard normal distributions, then X and Y can be placed on the same scale. Values from X and Y then can be compared without any probability calculation. Example 6. IQ scores are X ~ N(100, 15) and baby birth weights are Y ~ N(7, 2) (lbs). Which is less likely, an IQ of at least 145 or a new-born weighing at least 10 pounds? Solution. Simply convert each value to a standard normal scale: X! 145 " Z! 145 #100 15 " Z! 3 Y! 10 " Z! 10 # 7 2 " Z! 1.5 Z 1.5 3 The range Z! 3 creates less probability than Z!1.5, so an IQ of at least 145 is less likely than a newborn weighing at least 10 pounds.

Practice Exercises Dr. Neal, WKU 1. Students in a Psychology Masters Program are given an IQ test. The scores are generally found to be normally distributed with a mean of 112 and a standard deviation of 9. (a) Give the population! under consideration and the measurement X. What is the notation for the distribution of X? (b) Compute (i) P(X!105) (ii) P(100! X! 124) (iii) P(X!130) (c) What scores x and y, symmetric about the mean, are such that P(x! X! y) = 0.66? 2. Let Z ~ N (0, 1). (a) Compute (i) P(Z! 1. 45) (ii) P(Z! "2.12) (iii) P(!2 " Z " 2) (b) Find the numbers w and z such that P(w! Z! z) = 0.97. (c) Find the number z such that P(Z! z) = 0.01. 3. ACT scores are X ~ N(22.4, 3.2) and SAT scores are Y ~ N(1020, 160). (a) Which is a better score, an ACT of 28 or an SAT of 1400? (b) Which happens less often, an ACT of at most 14 or an SAT of at least 1400? 4. A sample of 100 watt GE light bulbs are tested for lifetime during everyday use. The lifetimes in hours before burning out were as follows: 617 670.7 817.6 724 683.4 844.7 630.4 614.2 652.5 615.9 606 706.4 609.7 614.7 730.2 615 741.6 684.6 602.2 633 1139.1 773.6 625.5 728.1 633.3 616 717.2 711.4 624.3 813.2 629.8 620.4 626.4 628.4 877.5 623.1 733.6 751 613.8 602 776.4 833.9 653.4 618.5 655.5 640.9 (a) Give the population! under consideration and the measurement X. (b) Find the sample mean and deviation. (c) What percentage of these measurements are within x ± S? Within x ± 2S? Do these times appear to be normally distributed?

Answers 1.! = All students in this Psychology Masters Program; X = IQ score; X ~ N(112, 9) (b) P(X!105) 0.21835 P(100! X! 124) 0.817577 P(X!130) 0.02275 (c) x = invnorm(.17, 112, 9) 103.4125 and y = invnorm(.83, 112, 9) 120.5875 2. (a) P(Z! 1. 45) 0.07353 P(Z! "2.12) 0.017 P(!2 " Z " 2) 0.9545 (b) w 2.17 and z 2.17 ( invnorm(.015, 0, 1) and invnorm(.985, 0, 1) ) (c) z = invnorm(.99, 0, 1) 2.326 3. (a) Convert each score to standard normal: X = 28! Z = 28 " 22.4 3. 2! Z = 1. 75 Y = 1400! Z = 1400 " 1020 160! Z = 2.375 So an SAT of Y = 1400 produces the higher score on a standard scale. 14 # 22.4 1400 # 1020 (b) X! 14 " Z! " Z! #2.625 Y! 1400 " Z! " Z! 2.375 3.2 160 Here, the left tail Z! "2. 625 creates less probability than the right tail Z! 2.375; so an ACT score of at X! 14 is less likely to occur. 4. (a)! = All 100 watt GE light bulbs (during this production); X = lifetime of bulb (b) x 688.698 and S 101.168 (c) x ± S is 587.53 hrs to 789.866 hrs, which contains 40/46 or about 87% of the times x ± 2S is 486.362 hrs to 891.034 hrs, which contains 45/46 or about 97.8% of the times A histogram shows that the times are not at all normally distributed.