Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67
|
|
- Clarissa Warner
- 5 years ago
- Views:
Transcription
1 Chapter 6 The Standard Deviation as a Ruler and the Normal Model 1 /67
2 Homework Read Chpt 6 Complete Reading Notes Do P129 1, 3, 5, 7, 15, 17, 23, 27, 29, 31, 37, 39, 43 2 /67
3 Objective Students calculate a z-score and demonstrate the effect of adding to and scaling a data distribution 3 /67
4 The Standard Deviation as a Unit of Measure Remember when we compared the shoe sizes for adult men and women. We know that women tend to have smaller feet, and male and female shoe sizes do not represent the same size foot. A man s size 9 is not the same as a woman s size 9. But it is possible to compare some measurement of a woman s size relative to other women with a similar measure for men, one comparing his size with the sizes of other men? Of course the answer is yes. I would not have asked otherwise. 4 /67
5 The Standard Deviation as a Unit of Measure The unit of measure we can use for comparing differing data values is the standard deviation. The standard deviation tells us how the distribution of data values varies (in relation to the mean of the distribution), so it is ideal for comparing one individual to a group and for comparing groups with differing means. I have repeatedly said that statistics is the study of variation. The most commonly used measure of variation is the standard deviation and it forms the basis for nearly all the statistics we will do this year. 5 /67
6 The Standard Score A data value is converted to a standard score to provide a means for comparison. We convert individual data values, relative to their standard deviation, using the following formula: z = observed expected = score mean = Y Y standard deviation standard deviation S or Y µ σ The resulting values (standardized values), denoted as z, are called z-scores 6 /67
7 Standard Score Remember! z = observed expected standard deviation 7 /67
8 The Standard Score z = Y Y S or Y µ σ Remember, this is to convert a raw score into a standardized score. Raw scores above the mean will have a z-score > 0, and raw scores below the mean will have a z-score < 0. 8 /67
9 The Standard Score Standardized values (z-scores) have no units. z-scores are the distance of an individual data value from the mean in standard deviations. In other words, a z-score is the number of standard deviations away from the mean for a data value. A negative z-score tells us that the data value is below the mean. A positive z-score tells us that the data value is above the mean. 9 /67
10 Standard Score A Z score indicates the number of standard deviations an observation falls from the mean. 10/67
11 Why Do We Bother? Standardized values have been converted from their original units to standard statistical unit of standard deviations from the mean. Thus, we can compare values that are measured on different scales, with different units, or from different populations. Now we can compare the shoe sizes for men and women. The values give us a measure of how a data value compares to other data values of like kind. A measure of how different from an expected value is a given data value. 11/67
12 Benefits of Standardizing The 2015 SAT test had a mean score of 511 for math. The standard deviation for the math scores was 120. An SAT score of 550 would have a z-score of: z = =.325 That tells us the score of 550 was 1/3 of a standard deviation above the mean. How would that score compare to an ACT score of 23 on the math portion? 12/67
13 Benefits of Standardizing The ACT results for that same year had a mean of 20.8 and standard deviation of 5.2. The z-score for a 23 on the ACT would be z = =.4231 A z-score of.4231 tells us that a 23 is about.4 standard deviations above the mean ACT score. Thus a score of 23 on the ACT is better than a 550 on the SAT, relative to the mean for each test. 13/67
14 Shifting Data Shifting data (horizontal translation): Adding (or subtracting) a constant amount to each and every value in a distribution just adds (or subtracts) the same constant to (from) the mean, median and other measures of position. This has the effect of shifting the distribution horizontally. Measures of spread (variance, IQR, range), however, are not affected by adding a constant to every value in the distribution. 14/67
15 Shifting Data The United States of Amurica has been half-heartedly trying to implement the metric system of measurements. Students were asked to estimate the length of their classroom in meters. The following data shows the results. Collection 1 Dot Plot Another plot shows the amount of Collection 1 Dot Plot error in the guesses. It was found by Error subtracting the correct value (13) from each estimate Guess Compare the resulting dot plots. 15/67
16 Rescaling Data Rescaling data (stretch or compress): When we divide or multiply all the data values in a distribution by any constant value, all measures of position (such as the mean, median and percentiles) and measures of spread (such as the range, IQR, and standard deviation) are divided or multiplied by that same constant value. Collection 1 Dot Plot Error 16/67
17 Rescaling Data (cont.) The errors in meters can also be measured in feet. 1 meter = ft. So we multiply each value by Now compare the plots Collection 1 Dot Plot Collection 1 Dot Plot Error error_ft 17/67
18 18/67
19 TI-84 Enter the data into L 1. Find the mean, standard deviation, and 5-number summary Create L 2 : subtract 13 from every entry in L 1. This gives the errors. Find the mean, standard deviation, and 5-number summary. Create L 3: multiply every entry in L 1 by Now we have feet estimates. Find the mean, standard deviation, and 5-number summary. Create L 4 : multiply every entry in L 2 by Errors in feet. Find the mean, standard deviation, and 5-number summary 19/67
20 TI-84 These are the results. Collection 1 mean count std dev std err missing min Q1 Med Q3 max Guess Error Feet error_ft x (x3.28) Note the relationships between the values. Adding a constant only affects measures of position like mean and median. Multiplying (scaling) affects all measures. 20/67
21 Back to z-scores Standardizing data into z-scores shifts the data by subtracting the mean and rescales the values by dividing by their standard deviation. Standardizing into z-scores does not change the shape of the distribution. What is the z-score for the mean of a distribution? Standardizing into z-scores changes the center by making the mean = 0. What is the z-score for the score one standard deviation above the mean? Standardizing into z-scores changes the spread by making the standard deviation = 1. 21/67
22 Mean and Standard Deviation To find the mean of the z-score distribution. We have subtracted the mean (a constant) from every score and divided by the standard deviation (another constant). Both transformations effect the mean The result is a mean of the z distribution of 0. Z = X X s = 0 The standard deviation is only effected by the multiplication. s z = s s = 1 So the standard deviation of the z distribution is 1. ( ) X z = X + s s = 1 22/67
23 When is Unusual Really Unusual? A z-score gives us an indication of how unusual a value is because it tells us how far the raw score is from the mean. A data value that sits right at the mean, has a z-score equal to 0. A z-score of 1 means the original data value is 1 standard deviation above the mean. A z-score of 1 means the original data value is 1 standard deviation below the mean. 23/67
24 When is Unusual Really Unusual? How far from 0 does a z-score have to be to be interesting or unusual? There is no universal standard, but the more extreme a z-score (negative or positive), the more unusual it is. We will have a more definitive answer soon. Remember that a negative z-score tells us that the data value is below the mean, while a positive z-score tells us that the data value is above the mean. 24/67
25 Are We Normal? There are an infinite number of data distributions. Change the mean or standard deviation of the data and you have a new distribution. For every distribution there is a corresponding z distribution. Remember, the shape does not change. There is a family of distributions (models) of data values that are extremely useful. The ideal of these models is called the Normal model (commonly called bell-shaped curves ). Normal models are appropriate for distributions whose shapes are unimodal and sufficiently symmetric. These distributions allow us to determine how extreme a z-score (and thus, the raw score) is. 25/67
26 Are We Normal? N(µ,σ) There is a Normal model (of raw data values) for every possible combination of mean and standard deviation. We write N(µ,σ) to represent a Normal model with a mean of µ and a standard deviation of σ. We use Greek letters for population parameters. The mean, µ, and standard deviation, σ, are not numerical summaries of the data. They are part of the model. They do not come from sample data. They are numbers that we choose (or know) to specify the model. Our statistics, x and s, are our best estimates of those values. 26/67
27 Are We Normal? Summaries of data taken from samples, like the sample mean and sample standard deviation, are written with Latin letters. These sample summaries of data are called sample statistics. When we standardize Normal data, the standardized value is a z-score, and: z = y µ σ You should note that only some data distributions are sufficiently unimodal and symmetric to be modeled by the normal distribution. Normal models approximate the shape of those distribution and occur often when there are many factors influencing the values of the data. 27/67
28 Are We Normal? Once we have standardized, we need only one model: The N(0,1) model is called the Standard Normal model (or the Standard Normal distribution). We cannot use a Normal model for just any data set. The data must be unimodal and sufficiently symmetric for us to use a normal model. Standardizing does not change the shape of the distribution. 28/67
29 Are We Normal? When we use the Normal model, we are assuming the population distribution is Normal. (Unimodal, and sufficiently symmetric). We cannot truly know the assumption of normality is valid, so we check the following condition: Nearly Normal Condition: The shape of the data s distribution is sufficiently unimodal and symmetric. This condition can be checked with a histogram or a Normal Probability Plot (NPP to be explained later). 29/67
30 The Rule When our data is unimodal, and sufficiently symmetric, a Normal model gives us an idea of how extreme a given data value is by telling us how likely we are to find a data value that far from the mean. To do this we use a Normal curve, under which the area is set to 1. The rule (also known as the Empirical Rule), provides an estimate of these values at 1, 2, and 3 standard deviations from the mean. These are approximations of values that come from evaluations of the standardized normal curve. 30/67
31 The Standardized Normal Curve 2 (x µ) f (x) = e 2σ 2 σ 2π Z score We can, and will (repeatedly), find these numbers accurately, but we can use this simplified rule about the Normal model to give us the ability to quickly estimate 31/67
32 The Rule It turns out that in a Normal model: about 68% of the values fall within one standard deviation of the mean; about 95% of the values fall within two standard deviations of the mean; and, about 99.7% of the values fall within three standard deviations of the mean. 32/67
33 The Rule The following shows what the Rule tells us: 99.7% 95% 68% Z score 33/67
34 The First Three Rules for Working with Normal Models Draw a pichur. Draw a pichur. Draw a pichur. And, when we have data, one necessary picture is a histogram to check that the distribution is sufficiently unimodal and symmetric to ensure we can use the Normal model to model the distribution. Additionally, I (and AP) will expect to see a normal model, appropriately shaded, on nearly everything you do from this point on. 34/67
35 Describing the Appropriate Model When asked to describe the model used in analyzing data, provide the properly labeled graphic with the shown. 99.7% 95% 68% IQ Scores 35/67
36 Finding Normal Percentiles by Table When a data value doesn t fall exactly 1, 2, or 3 standard deviations from the mean, we can look it up in a table of Normal percentiles. (Actually, we will use the calculator). Table Z in Appendix D provides us with normal percentiles. I can provide you with a z table if you prefer that to the calculator. 36/67
37 Finding Normal Percentiles by Table There are two forms of the z-table. On the tables I pass out you will find two sides. These are the two forms of the z-tables. One table gives the area to the left of the given z value. One table gives the area between the mean and the given z value. 37/67
38 Finding Normal Percentiles by by Table Hand This table shows the area under the curve between the z-score and the mean % /67
39 Finding Normal Percentiles by Table This table shows the area under the curve to the left of the z-score % /67
40 Finding Normal Percentiles by Table To find the area between the mean and 1.27 standard deviations above the mean, we locate z = 1.2 in the first column, then scroll across to the column under.07 ( = 1.27) 39.80% /67
41 Finding Normal Percentiles by Table Thus the area under the curve between the mean and 39.80% 1.27 standard deviations above the mean is In other words 39.80% of the total area falls in the region between z = 0 and z = Due to the symmetry of the normal model % 39.80% of the total area also falls in the region between z = 0 and z = /67
42 Finding Normal Percentiles by Table To find the percentage of scores between z =.44 and z = 1.50, find the area to the left of each z-score: (p(z <.44) =.6700, p(z < 1.5) =.9332) and subtract. z =.44 z = =.2632 or 26.32% p(.44 < z < 1.5) = % /67
43 Finding Normal Percentiles by Table To find the percentage of scores between z = -.43 and and z = 0.78, find the area to the left of z =.43 [p(z <.43) =.3336], and to the left of z = 0.78 p(z <.78 =.7823), then subtract the two values =.4487 or 44.87% z = -.43 z =.78 p(-.43 < z <.78) = % /67
44 Tails To find the area in the tails above a positive z-score or below a negative z-score, simply find the corresponding area for the z-score and subtract from 1.0. Percent of scores below z = 1.5 p(z < 1.5) =.9332 z = =.0668 = 6.68% 6.68% Percent of scores above z = 1.5 p(z > 1.5) = /67
45 Finding Normal Percentiles Now that we have spent all that time, let us get real. There is really no reason to use a table because we have a calculator that will do the job much more accurately and much more simply. 45/67
46 TI-84 To find the probability (area) between two x values: 2nd - Distr - 2:normalcdf(x min,x max,µ,σ) (normal cumulative distribution function) 2nd distr VARS 2:normalcdf lower: upper: μ: σ: Paste If you wish to find the probability between two x values and see the graph: 2nd - Distr - Draw - 1:ShadeNorm(x min,x max,µ,σ) Be sure to set your window xmin, xmax (depending on data values), ymin = 0, ymax =.4 46/67
47 TI-84 To find probabilities from a standardized normal distribution on the calculator is exactly the same as with a non-standardized normal distribution. Simply use µ = 0 and σ = 1. 2nd - Distr - 2:normalcdf(z min,z max,0,1) 2nd distr VARS 2:normalcdf lower: upper: μ: 0 σ: 1 Paste 47/67
48 TI-84 To find the area under the curve below a z-score of nd - Distr - 2:normalcdf(-10^99,1.45,0,1) = nd distr VARS 2:normalcdf lower:-10^99 upper: 1.45 μ: 0 σ: 1 Paste To find the area under the curve above a z-score of nd - Distr - 2:normalcdf(1.45,10^99 0,1) = nd distr VARS 2:normalcdf lower: 1.45 upper: 10^99 μ: 0 σ: 1 Paste 48/67
49 TI-84 To find the area under the curve between a z-score of and nd - Distr - 2:normalcdf(-1.45, 1.45,0,1) = nd distr VARS 2:normalcdf lower: upper: 1.45 μ: 0 σ: 1 Paste 49/67
50 From Percentiles to Scores: z in Reverse Sometimes we start with area (probability) and need to find the corresponding z-score or the original observation value. Example: What z-score represents the first quartile in a Normal model? 25% 25%?? 50/67
51 From Percentiles to Scores: z in Reverse via Table Look in Table Z for an area of The exact area is not there, but 0.25 is between and This figure is associated with z = 0.675, so the first quartile is about standard deviations below the mean. 51/67
52 Probability to z-score with a table To find a z-score corresponding to an area, or to know what z-score lies above, or below a given percentage, simply work backwards. In the table, first find the percentage (probability) in the field of the table. Then determine the z-score by combining the the row and column headings. 52/67
53 Probability to z-score To find the z-score below which falls 72% of the scores (72nd percentile) locate.72 in the table =.58 72% of scores fall below a z-score of.58 53/67
54 Probability to z-score on TI-84 Back to reality To find the z-score when you know the percentile: 2nd - Distr - 3:invNorm(Area, µ,σ) 2nd VARS 3:invNorm To find the z-score below which falls 72% of the scores distr area: μ: σ: Paste (72nd percentile): 2nd - Distr - 3:invNorm(.72, 0,1) = % of the distribution falls below a z score of about.58. 2nd distr area:.72 3:invNorm μ: 0 σ: 1 Paste VARS 54/67
55 Probability to z-score on TI-84 Remember, the TI-84 requires the area (probability) BELOW the desired score. If you are given the area ABOVE, you must subtract from 1 to find the area below. To find the z-score above which falls 25% of the scores (3rd quartile): 2nd - Distr - 3:invNorm(.75, 0,1) = nd distr area:.75 3:invNorm μ: 0 σ: 1 Paste VARS 25% of the normal distribution falls above a z score of about /67
56 When Are You Normal Enough? When you have your own data, you must verify whether a Normal model is reasonable and appropriate. AP and I will require you to verify the appropriateness of the methodology by ensuring the data distribution is sufficiently unimodal and symmetric. Looking at a histogram of the data (sample set) is probably the best way to check that the underlying distribution is sufficiently unimodal and symmetric. 56/67
57 What is Normal Enough? Another graphical display that can help you determine whether a Normal model is appropriate is the Normal Probability Plot (NPP). If the distribution of the data is roughly Normal, the Normal probability plot approximates a diagonal straight line. Significant deviations from a straight line indicate that the distribution is not Normally distributed. 57/67
58 Normal Probability Plot To create a Normal Probability Plot (NPP) (which you will not.) Order the data in ascending order. Calculate the percentile (cumulative percentage) for each data value. Find the z-score from a normal distribution matching the calculated percentile. (Note: this is NOT the z-score calculated from the observation, but the z score determined by the position of the observation in the distribution.) Plot the points (z x, z p ), where z x is the score calculated for the observation value and z p is the z-score for the corresponding percentile. Check for linearity. A good fit to a straight line suggests a normal distribution. 58/67
59 TI-84 A better way to create a NPP is to let the calculator do it. Statplot Choose the last icon Enter the list for your data Select x or y for your data axis (it really does not matter) Choose your marks (bigger is better for old eyes) Zoom 9: ZoomStat to see your NPP If the line appears to be linear, you got normal. 59/67
60 Testing for Normalcy Nearly Normal data have a histogram and a Normal probability plot that look somewhat like this example of 100 values randomly sampled from a normal distribution.: normal probability plot 60/67
61 Testing for Normalcy Data Sampled From a Left Skewed Distribution normal probability plot 61/67
62 Testing for Normalcy Data Sampled From a Right Skewed Distribution normal probability plot 62/67
63 Hot Beverage Dispenser Suppose a vending machine dispenses hot beverages into 8 ounce cups. The machine is set to dispense with a standard deviation of 0.2 ounces. Providing too much beverage risks burning the customer. Providing too little beverage risks frustrating the customer. σ X = What mean setting do you recommend for the machine? If we want to keep the vast majority of customers happy we should ensure that 95% of customers get at least 8 ounces. 63/67
64 Hot Beverage Dispenser If we want to keep the vast majority of customers happy we should ensure that 95% of customers get at least 8 ounces. σ X = 0.2 We need to find the mean such that 95% of our data falls above 8 oz. To find the mean, we must first find the z-score above which fall 95%. invnorm(.05, 0, 1) = Why.05? 95% To find the mean, we must convert the z-score. 5%? 0 z = x µ X σ X = 8 µ X 0.2 µ X = ? To ensure 95% of customers get at least 8 oz, the machine should be set with a mean of oz. 64/67
65 Hot Beverage Dispenser The company is not happy with our recommendation of ounces. That would cost too much. We must come up with another recommendation to ensure 95% of customers get at least the full 8 oz. What do you suggest??% Of course we could lower the mean volume to 8.1 oz. but that would leave a lot of customers unhappy. What percentage of customers would get cheated? Normalcdf( 10^99, 8, 8.1,.2) =.3085 Normalcdf( 9,.5, 0, 1) =.3085 OR z = = % of the vendors customers would be shorted. 65/67
66 Hot Beverage Dispenser Naturally the company is not pleased. The 8.1 oz is acceptable to the company but the percent below 8 is much too high. You just cannot seem to make some people happy. So what else might we do to make the company and customers happy? 95% We could recommend the company produce vending 5% machines that are more consistent in distribution. That means reducing the variability in the data What would be an appropriate standard deviation to ensure 95% of customers get at least 8.0 ounces with a mean of 8.1 oz? invnorm(.05, 0, 1) = σ = = σ The standard deviation would need to be.0608 oz. 66/67
67 Requirement From this point forward it will no longer be sufficient simply to find the requested values. You must conclude every problem with a concluding sentence. We found the standard deviation of.0608 so our concluding sentence would be something on the order of: To ensure 95% of customers receive at least 8 oz with a mean of 8.1 oz, the standard deviation would need to be.0608 oz. 67/67
Chapter 6 The Standard Deviation as a Ruler and the Normal Model
Chapter 6 The Standard Deviation as a Ruler and the Normal Model Overview Key Concepts Understand how adding (subtracting) a constant or multiplying (dividing) by a constant changes the center and/or spread
More information6 THE NORMAL DISTRIBUTION
CHAPTER 6 THE NORMAL DISTRIBUTION 341 6 THE NORMAL DISTRIBUTION Figure 6.1 If you ask enough people about their shoe size, you will find that your graphed data is shaped like a bell curve and can be described
More informationSection 5.4. Ken Ueda
Section 5.4 Ken Ueda Students seem to think that being graded on a curve is a positive thing. I took lasers 101 at Cornell and got a 92 on the exam. The average was a 93. I ended up with a C on the test.
More informationElementary Statistics
Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:
More information(i) The mean and mode both equal the median; that is, the average value and the most likely value are both in the middle of the distribution.
MATH 382 Normal Distributions Dr. Neal, WKU Measurements that are normally distributed can be described in terms of their mean µ and standard deviation σ. These measurements should have the following properties:
More information(i) The mean and mode both equal the median; that is, the average value and the most likely value are both in the middle of the distribution.
MATH 183 Normal Distributions Dr. Neal, WKU Measurements that are normally distributed can be described in terms of their mean µ and standard deviation!. These measurements should have the following properties:
More informationIntroduction to Probability and Statistics Twelfth Edition
Introduction to Probability and Statistics Twelfth Edition Robert J. Beaver Barbara M. Beaver William Mendenhall Presentation designed and written by: Barbara M. Beaver Introduction to Probability and
More informationStat 101 Exam 1 Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative
More informationObjective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.
Chapter 3 Numerically Summarizing Data Chapter 3.1 Measures of Central Tendency Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. A1. Mean The
More informationMATH 1150 Chapter 2 Notation and Terminology
MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the
More information3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability
Random variable The outcome of each procedure is determined by chance. Probability Distributions Normal Probability Distribution N Chapter 6 Discrete Random variables takes on a countable number of values
More informationIn this chapter, you will study the normal distribution, the standard normal, and applications associated with them.
The Normal Distribution The normal distribution is the most important of all the distributions. It is widely used and even more widely abused. Its graph is bell-shaped. You see the bell curve in almost
More informationContinuous random variables
Continuous random variables A continuous random variable X takes all values in an interval of numbers. The probability distribution of X is described by a density curve. The total area under a density
More information( )( ) of wins. This means that the team won 74 games.
AP Statistics Ch. 2 Notes Describing Location in a Distribution Often, we are interested in describing where one observation falls in a distribution in relation to the other observations. The pth percentile
More informationMt. Douglas Secondary
Foundations of Math 11 Calculator Usage 207 HOW TO USE TI-83, TI-83 PLUS, TI-84 PLUS CALCULATORS FOR STATISTICS CALCULATIONS shows it is an actual calculator key to press 1. Using LISTS to Calculate Mean,
More informationSection 4.4 Z-Scores and the Empirical Rule
Section 4.4 Z-Scores and the Empirical Rule 1 GPA Example A sample of GPAs of 40 freshman college students appear below (sorted in increasing order) 1.40 1.90 1.90 2.00 2.10 2.10 2.20 2.30 2.30 2.40 2.50
More informationMath 140 Introductory Statistics
Math 140 Introductory Statistics Professor Silvia Fernández Chapter 2 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Visualizing Distributions Recall the definition: The
More informationMath 140 Introductory Statistics
Visualizing Distributions Math 140 Introductory Statistics Professor Silvia Fernández Chapter Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Recall the definition: The
More informationChapter 5. Understanding and Comparing. Distributions
STAT 141 Introduction to Statistics Chapter 5 Understanding and Comparing Distributions Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 27 Boxplots How to create a boxplot? Assume
More information9/19/2012. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1
PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 The aspect of the data we want to describe/measure is relative position z scores tell us how many standard deviations above or below
More informationFrancine s bone density is 1.45 standard deviations below the mean hip bone density for 25-year-old women of 956 grams/cm 2.
Chapter 3 Solutions 3.1 3.2 3.3 87% of the girls her daughter s age weigh the same or less than she does and 67% of girls her daughter s age are her height or shorter. According to the Los Angeles Times,
More informationPerformance of fourth-grade students on an agility test
Starter Ch. 5 2005 #1a CW Ch. 4: Regression L1 L2 87 88 84 86 83 73 81 67 78 83 65 80 50 78 78? 93? 86? Create a scatterplot Find the equation of the regression line Predict the scores Chapter 5: Understanding
More informationSTT 315 This lecture is based on Chapter 2 of the textbook.
STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their
More informationChapter 4. Displaying and Summarizing. Quantitative Data
STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range
More informationPercentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included:
AP Statistics Chapter 2 Notes 2.1 Describing Location in a Distribution Percentile: The pth percentile of a distribution is the value with p percent of the observations (If your test score places you in
More informationUsing Tables and Graphing Calculators in Math 11
Using Tables and Graphing Calculators in Math 11 Graphing calculators are not required for Math 11, but they are likely to be helpful, primarily because they allow you to avoid the use of tables in some
More information3.1 Measure of Center
3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects
More informationUnit 8: Statistics. SOL Review & SOL Test * Test: Unit 8 Statistics
Name: Block: Unit 8: Statistics Day 1 Sequences Day 2 Series Day 3 Permutations & Combinations Day 4 Normal Distribution & Empirical Formula Day 5 Normal Distribution * Day 6 Standard Normal Distribution
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency
Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:
More information20 Hypothesis Testing, Part I
20 Hypothesis Testing, Part I Bob has told Alice that the average hourly rate for a lawyer in Virginia is $200 with a standard deviation of $50, but Alice wants to test this claim. If Bob is right, she
More informationWhat is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected
What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types
More informationThe Standard Deviation as a Ruler and the Normal Model
The Standard Deviation as a Ruler and the Normal Model Al Nosedal University of Toronto Summer 2017 Al Nosedal University of Toronto The Standard Deviation as a Ruler and the Normal Model Summer 2017 1
More informationThe Normal Distribution (Pt. 2)
Chapter 5 The Normal Distribution (Pt 2) 51 Finding Normal Percentiles Recall that the Nth percentile of a distribution is the value that marks off the bottom N% of the distribution For review, remember
More informationChapter 3: The Normal Distributions
Chapter 3: The Normal Distributions http://www.yorku.ca/nuri/econ2500/econ2500-online-course-materials.pdf graphs-normal.doc / histogram-density.txt / normal dist table / ch3-image Ch3 exercises: 3.2,
More informationCHAPTER 1. Introduction
CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing
More information+ Check for Understanding
n Measuring Position: Percentiles n One way to describe the location of a value in a distribution is to tell what percent of observations are less than it. Definition: The p th percentile of a distribution
More information4/1/2012. Test 2 Covers Topics 12, 13, 16, 17, 18, 14, 19 and 20. Skipping Topics 11 and 15. Topic 12. Normal Distribution
Test 2 Covers Topics 12, 13, 16, 17, 18, 14, 19 and 20 Skipping Topics 11 and 15 Topic 12 Normal Distribution 1 Normal Distribution If Density Curve is symmetric, single peaked, bell-shaped then it is
More informationThe empirical ( ) rule
The empirical (68-95-99.7) rule With a bell shaped distribution, about 68% of the data fall within a distance of 1 standard deviation from the mean. 95% fall within 2 standard deviations of the mean. 99.7%
More informationSTAT 200 Chapter 1 Looking at Data - Distributions
STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the
More informationHistograms allow a visual interpretation
Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called
More informationTOPIC: Descriptive Statistics Single Variable
TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency
More informationDensity Curves and the Normal Distributions. Histogram: 10 groups
Density Curves and the Normal Distributions MATH 2300 Chapter 6 Histogram: 10 groups 1 Histogram: 20 groups Histogram: 40 groups 2 Histogram: 80 groups Histogram: 160 groups 3 Density Curve Density Curves
More informationChapter 2: Tools for Exploring Univariate Data
Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is
More informationTest 3 SOLUTIONS. x P(x) xp(x)
16 1. A couple of weeks ago in class, each of you took three quizzes where you randomly guessed the answers to each question. There were eight questions on each quiz, and four possible answers to each
More informationEssential Statistics Chapter 6
1 Essential Statistics Chapter 6 By Navidi and Monk Copyright 2016 Mark A. Thomas. All rights reserved. 2 Continuous Probability Distributions chapter 5 focused upon discrete probability distributions,
More informationThe Normal Model. Copyright 2009 Pearson Education, Inc.
The Normal Mol Copyright 2009 Pearson Education, Inc. The trick in comparing very different-looking values is to use standard viations as our rulers. The standard viation tells us how the whole collection
More informationChapter 6 Group Activity - SOLUTIONS
Chapter 6 Group Activity - SOLUTIONS Group Activity Summarizing a Distribution 1. The following data are the number of credit hours taken by Math 105 students during a summer term. You will be analyzing
More informationFinding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data
Finding Quartiles. Use the median to divide the ordered data set into two halves.. If n is odd, do not include the median in either half. If n is even, split this data set exactly in half.. Q1 is the median
More informationMath 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore
Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Chapter 3 continued Describing distributions with numbers Measuring spread of data: Quartiles Definition 1: The interquartile
More informationZ score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound
1 EDUR 8131 Chat 3 Notes 2 Normal Distribution and Standard Scores Questions Standard Scores: Z score Z = (X M) / SD Z = deviation score divided by standard deviation Z score indicates how far a raw score
More informationDescribing distributions with numbers
Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central
More informationØ Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.
Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific
More informationSection 7.1 Properties of the Normal Distribution
Section 7.1 Properties of the Normal Distribution In Chapter 6, talked about probability distributions. Coin flip problem: Difference of two spinners: The random variable x can only take on certain discrete
More informationMath 2311 Sections 4.1, 4.2 and 4.3
Math 2311 Sections 4.1, 4.2 and 4.3 4.1 - Density Curves What do we know about density curves? Example: Suppose we have a density curve defined for defined by the line y = x. Sketch: What percent of observations
More informationLecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:
Lecture 2 Quantitative variables There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Stemplot (stem-and-leaf plot) Histogram Dot plot Stemplots
More informationSTA 218: Statistics for Management
Al Nosedal. University of Toronto. Fall 2017 My momma always said: Life was like a box of chocolates. You never know what you re gonna get. Forrest Gump. Simple Example Random Experiment: Rolling a fair
More information(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables)
3. Descriptive Statistics Describing data with tables and graphs (quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) Bivariate descriptions
More informationFREQUENCY DISTRIBUTIONS AND PERCENTILES
FREQUENCY DISTRIBUTIONS AND PERCENTILES New Statistical Notation Frequency (f): the number of times a score occurs N: sample size Simple Frequency Distributions Raw Scores The scores that we have directly
More informationare the objects described by a set of data. They may be people, animals or things.
( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms
More informationReview. Midterm Exam. Midterm Review. May 6th, 2015 AMS-UCSC. Spring Session 1 (Midterm Review) AMS-5 May 6th, / 24
Midterm Exam Midterm Review AMS-UCSC May 6th, 2015 Spring 2015. Session 1 (Midterm Review) AMS-5 May 6th, 2015 1 / 24 Topics Topics We will talk about... 1 Review Spring 2015. Session 1 (Midterm Review)
More informationChapter 18. Sampling Distribution Models /51
Chapter 18 Sampling Distribution Models 1 /51 Homework p432 2, 4, 6, 8, 10, 16, 17, 20, 30, 36, 41 2 /51 3 /51 Objective Students calculate values of central 4 /51 The Central Limit Theorem for Sample
More information1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.
1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions
More informationUsing a graphic display calculator
12 Using a graphic display calculator CHAPTER OBJECTIVES: This chapter shows you how to use your graphic display calculator (GDC) to solve the different types of problems that you will meet in your course.
More informationChapter 4.notebook. August 30, 2017
Sep 1 7:53 AM Sep 1 8:21 AM Sep 1 8:21 AM 1 Sep 1 8:23 AM Sep 1 8:23 AM Sep 1 8:23 AM SOCS When describing a distribution, make sure to always tell about three things: shape, outliers, center, and spread
More informationChapter 23. Inference About Means
Chapter 23 Inference About Means 1 /57 Homework p554 2, 4, 9, 10, 13, 15, 17, 33, 34 2 /57 Objective Students test null and alternate hypotheses about a population mean. 3 /57 Here We Go Again Now that
More informationThe area under a probability density curve between any two values a and b has two interpretations:
Chapter 7 7.1 The Standard Normal Curve Introduction Probability density curve: The area under a probability density curve between any two values a and b has two interpretations: 1. 2. The region above
More information1 Probability Distributions
1 Probability Distributions In the chapter about descriptive statistics sample data were discussed, and tools introduced for describing the samples with numbers as well as with graphs. In this chapter
More informationQUANTITATIVE DATA. UNIVARIATE DATA data for one variable
QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE
More informationChapter 3. Measuring data
Chapter 3 Measuring data 1 Measuring data versus presenting data We present data to help us draw meaning from it But pictures of data are subjective They re also not susceptible to rigorous inference Measuring
More informationMath 140 Introductory Statistics
Box Plots Math 140 Introductory Statistics Professor B. Ábrego Lecture 6 Sections 2.3, 2.4, and 2.5 11,12,20,25,30,30,30,32,35, 39,40,40,40,42,45,48,50,70. = 11 Q 1 = 30 Median = 37 Q 3 = 42 = 70. = 70
More informationWhat does a population that is normally distributed look like? = 80 and = 10
What does a population that is normally distributed look like? = 80 and = 10 50 60 70 80 90 100 110 X Empirical Rule 68% 95% 99.7% 68-95-99.7% RULE Empirical Rule restated 68% of the data values fall within
More informationNote that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).
Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something
More informationStandard Normal Calculations
Standard Normal Calculations Section 4.3 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 10-2311 Cathy Poliak, Ph.D. cathy@math.uh.edu
More information4.2 The Normal Distribution. that is, a graph of the measurement looks like the familiar symmetrical, bell-shaped
4.2 The Normal Distribution Many physiological and psychological measurements are normality distributed; that is, a graph of the measurement looks like the familiar symmetrical, bell-shaped distribution
More informationEQ: What is a normal distribution?
Unit 5 - Statistics What is the purpose EQ: What tools do we have to assess data? this unit? What vocab will I need? Vocabulary: normal distribution, standard, nonstandard, interquartile range, population
More informationHow spread out is the data? Are all the numbers fairly close to General Education Statistics
How spread out is the data? Are all the numbers fairly close to General Education Statistics each other or not? So what? Class Notes Measures of Dispersion: Range, Standard Deviation, and Variance (Section
More informationGRACEY/STATISTICS CH. 3. CHAPTER PROBLEM Do women really talk more than men? Science, Vol. 317, No. 5834). The study
CHAPTER PROBLEM Do women really talk more than men? A common belief is that women talk more than men. Is that belief founded in fact, or is it a myth? Do men actually talk more than women? Or do men and
More informationExam #2 Results (as percentages)
Oct. 30 Assignment: Read Chapter 19 Try exercises 1, 2, and 4 on p. 424 Exam #2 Results (as percentages) Mean: 71.4 Median: 73.3 Soda attitudes 2015 In a Gallup poll conducted Jul. 8 12, 2015, 1009 adult
More informationDescribing Distributions
Describing Distributions With Numbers April 18, 2012 Summary Statistics. Measures of Center. Percentiles. Measures of Spread. A Summary Statement. Choosing Numerical Summaries. 1.0 What Are Summary Statistics?
More informationChapter 2 Solutions Page 15 of 28
Chapter Solutions Page 15 of 8.50 a. The median is 55. The mean is about 105. b. The median is a more representative average" than the median here. Notice in the stem-and-leaf plot on p.3 of the text that
More informationNumerical Measures of Central Tendency
ҧ Numerical Measures of Central Tendency The central tendency of the set of measurements that is, the tendency of the data to cluster, or center, about certain numerical values; usually the Mean, Median
More informationAnswers Part A. P(x = 67) = 0, because x is a continuous random variable. 2. Find the following probabilities:
Answers Part A 1. Woman s heights are normally distributed with a mean of 63.6 inches and a standard deviation of 2.5 inches. Find the probability that a single randomly selected woman will be 67 inches
More informationRemember your SOCS! S: O: C: S:
Remember your SOCS! S: O: C: S: 1.1: Displaying Distributions with Graphs Dotplot: Age of your fathers Low scale: 45 High scale: 75 Doesn t have to start at zero, just cover the range of the data Label
More informationUNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2004
UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 50W - Introduction to Biostatistics Fall 00 Exercises with Solutions Topic Summarizing Data Due: Monday September 7, 00 READINGS.
More informationCOMPLEMENTARY EXERCISES WITH DESCRIPTIVE STATISTICS
COMPLEMENTARY EXERCISES WITH DESCRIPTIVE STATISTICS EX 1 Given the following series of data on Gender and Height for 8 patients, fill in two frequency tables one for each Variable, according to the model
More informationStatistics and Sampling distributions
Statistics and Sampling distributions a statistic is a numerical summary of sample data. It is a rv. The distribution of a statistic is called its sampling distribution. The rv s X 1, X 2,, X n are said
More informationConfidence Intervals. - simply, an interval for which we have a certain confidence.
Confidence Intervals I. What are confidence intervals? - simply, an interval for which we have a certain confidence. - for example, we are 90% certain that an interval contains the true value of something
More informationRecall that the standard deviation σ of a numerical data set is given by
11.1 Using Normal Distributions Essential Question In a normal distribution, about what percent of the data lies within one, two, and three standard deviations of the mean? Recall that the standard deviation
More informationFurther Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data
Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)
More informationMEASURING THE SPREAD OF DATA: 6F
CONTINUING WITH DESCRIPTIVE STATS 6E,6F,6G,6H,6I MEASURING THE SPREAD OF DATA: 6F othink about this example: Suppose you are at a high school football game and you sample 40 people from the student section
More informationPractice problems from chapters 2 and 3
Practice problems from chapters and 3 Question-1. For each of the following variables, indicate whether it is quantitative or qualitative and specify which of the four levels of measurement (nominal, ordinal,
More informationExample 2. Given the data below, complete the chart:
Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is
More informationChapter. The Normal Probability Distribution 7/24/2011. Section 7.1 Properties of the Normal Distribution
Chapter The Normal Probability Distribution 3 7 Section 7.1 Properties of the Normal Distribution 2010 Pearson Prentice Hall. All rights 2010 reserved Pearson Prentice Hall. All rights reserved 7-2 Illustrating
More informationØ Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.
Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number
More informationAlgebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves)
Algebra 2 Outliers Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves) Algebra 2 Notes #1 Chp 12 Outliers In a set of numbers, sometimes there will be
More informationStatistics 528: Homework 2 Solutions
Statistics 28: Homework 2 Solutions.4 There are several gaps in the data, as can be seen from the histogram. Minitab Result: Min Q Med Q3 Max 8 3278 22 2368 2624 Manual Result: Min Q Med Q3 Max 8 338 22.
More informationChapters 1 & 2 Exam Review
Problems 1-3 refer to the following five boxplots. 1.) To which of the above boxplots does the following histogram correspond? (A) A (B) B (C) C (D) D (E) E 2.) To which of the above boxplots does the
More informationChapter 1. Looking at Data
Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,
More informationUnit 4 Probability. Dr Mahmoud Alhussami
Unit 4 Probability Dr Mahmoud Alhussami Probability Probability theory developed from the study of games of chance like dice and cards. A process like flipping a coin, rolling a die or drawing a card from
More informationBusiness Statistics: A Decision-Making Approach, 6e. Chapter Goals
Chapter 4 Student Lecture Notes 4-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 4 Using Probability and Probability Distributions Fundamentals of Business Statistics Murali Shanker
More information