Chapter 6 Group Activity - SOLUTIONS

Similar documents
Chapter 5. Understanding and Comparing. Distributions

Describing Distributions

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Describing Distributions With Numbers

Percentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included:

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things.

Chapter 2: Tools for Exploring Univariate Data

MATH 1150 Chapter 2 Notation and Terminology

Chapter 4. Displaying and Summarizing. Quantitative Data

Chapter 1. Looking at Data

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

Math 140 Introductory Statistics

Math 140 Introductory Statistics

CHAPTER 1. Introduction

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Histograms allow a visual interpretation

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?!

STATISTICS/MATH /1760 SHANNON MYERS

Lecture 2 and Lecture 3

+ Check for Understanding

Math 138 Summer Section 412- Unit Test 1 Green Form, page 1 of 7

Density Curves and the Normal Distributions. Histogram: 10 groups

Describing distributions with numbers

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

Chapter2 Description of samples and populations. 2.1 Introduction.

Lecture 1: Description of Data. Readings: Sections 1.2,

Stat 101 Exam 1 Important Formulas and Concepts 1

MEASURING THE SPREAD OF DATA: 6F

STAT 200 Chapter 1 Looking at Data - Distributions

2011 Pearson Education, Inc

Example 2. Given the data below, complete the chart:

Units. Exploratory Data Analysis. Variables. Student Data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

AP Final Review II Exploring Data (20% 30%)

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

Chapter 3. Measuring data

Section 3. Measures of Variation

Name: Class: Date: Mini-Unit. Data & Statistics. Investigation 1: Variability & Associations in Numerical Data. Practice Problems

Statistics for Managers using Microsoft Excel 6 th Edition

Math 082 Final Examination Review

Statistics and parameters

Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Chapter 3. Data Description

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

MATH 117 Statistical Methods for Management I Chapter Three

Describing Distributions With Numbers Chapter 12

The Normal Distribution. Chapter 6

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore

6 THE NORMAL DISTRIBUTION

Elementary Statistics

The empirical ( ) rule

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.

Section 5.4. Ken Ueda

IB Questionbank Mathematical Studies 3rd edition. Grouped discrete. 184 min 183 marks

1 Measures of the Center of a Distribution

Comparing Measures of Central Tendency *

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Practice Questions for Exam 1

Essential Question: What are the standard intervals for a normal distribution? How are these intervals used to solve problems?

Chapter 5: Exploring Data: Distributions Lesson Plan

Chapter 4.notebook. August 30, 2017

Vocabulary: Samples and Populations

Chapter 6 The Normal Distribution

1. Exploratory Data Analysis

Performance of fourth-grade students on an agility test

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Spring 2008

3.1 Measure of Center

Section 7.2 Homework Answers

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 3.1-1

Measures of. U4 C 1.2 Dot plot and Histogram 2 January 15 16, 2015

Archdiocese of Washington Catholic Schools Academic Standards Mathematics

Algebra Calculator Skills Inventory Solutions

Complement: 0.4 x 0.8 = =.6

Statistics I Chapter 2: Univariate data analysis

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Q 1 = 23.8 M = Q 3 = 29.8 IQR = 6 The numbers are in order and there are 18 pieces of data so the median is the average of the 9th and 10th

A C E. Answers Investigation 4. Applications

200 participants [EUR] ( =60) 200 = 30% i.e. nearly a third of the phone bills are greater than 75 EUR

Descriptive Univariate Statistics and Bivariate Correlation

a table or a graph or an equation.

Section 2.3: One Quantitative Variable: Measures of Spread

Statistics I Chapter 2: Univariate data analysis

In this investigation you will use the statistics skills that you learned the to display and analyze a cup of peanut M&Ms.

Chapter 4: Displaying and Summarizing Quantitative Data

The Normal Distribution (Pt. 2)

Chapters 1 & 2 Exam Review

Math 3339 Homework 2 (Chapter 2, 9.1 & 9.2)

CHAPTER 2: Describing Distributions with Numbers

MATH 2560 C F03 Elementary Statistics I Lecture 1: Displaying Distributions with Graphs. Outline.

Algebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves)

A is one of the categories into which qualitative data can be classified.

Teaching a Prestatistics Course: Propelling Non-STEM Students Forward

CHAPTER 2 Description of Samples and Populations

FSA Algebra I End-of-Course Review Packet

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data

Transcription:

Chapter 6 Group Activity - SOLUTIONS Group Activity Summarizing a Distribution 1. The following data are the number of credit hours taken by Math 105 students during a summer term. You will be analyzing the shape, center and spread of this data, and whether there are any outliers. a. On the grid, make a histogram of the data using a bin-width of 2 credits. Give your histogram a descriptive title and label the units on the bottom. Label the vertical axis. Number of Credit Hours Taken in Summer Term 8 16 12 7 6 8 8 6 12 Histogram of Credits Taken by Mth 105 Students in Summer Term 0 2 6 8 10 12 1 16 18 Credit Hours b. What is the shape of the histogram? If you are not sure yet, compute and compare the mean and the median in the next question. The data are skewed to the right. The mean should be higher than the median. c. Find the mean, median and mode, including units. Mean = (8++16+12++7+++6+8+8++6+12)/1 = 103/1 = 7. credits. Median: Put the numbers in order:,,,,, 6, 6, 7, 8, 8, 8, 12, 12, 16. The average of the middle two, 6 and 7 is 6.5 credits. Mode: The center of the highest bar is 5 credits; however, no one took 5 credits. I would say the mode is credits because all 5 people in that bar are taking credits. Cara Lee Page 1

d. Find the 5-number summary, IQR and range, including units. [,,,,, 6, 6] [7, 8, 8, 8, 12, 12, 16] 5-Number summary (Min, Q1, Med, Q3, Max) 5-Number summary: (,, 6.5, 8, 16) credits IQR = Q3-Q1 = 8 = credits Range = Max Min = 16 = 12 credits e. Use the 5-number summary to draw a boxplot on the grid below. Make your horizontal scale match your histogram for comparison. Give your boxplot a descriptive title and label the horizontal axis. Boxplot of Credits Taken by Mth 105 Students in Summer Term 0 2 6 8 10 12 1 16 18 Credit Hours f. How does the shape of the boxplot compare with the shape of the histogram? Can you see the shape of the distribution on the boxplot? From the boxplot we can see that the scores are skewed to the right because the box and tail are longer on the right side. g. Do you think any of the students are outliers? Why or why not? The student taking 16 credits may be an outlier. We will learn a test for outliers in Math 23. (Fences) h. Write a few complete sentences summarizing the four characteristics of this distribution. The distribution of the number of credits taken by Math 105 students in the summer is unimodal and skewed to the right. The median number of credits taken is 6.5 credits with the middle 50% taking -8 credits for an IQR of credits. The student taking 16 credits may be an outlier. Cara Lee Page 2

Comparing Distributions with Boxplots 2. Below are fictitious student test scores from a Math 105 midterm in two different classes. You will be making a boxplot for each to compare their distributions. Class 1: 72, 86, 65, 99, 86, 71, 55, 86, 92, 73, 95, 71 points [55, 65, 71, 71, 72, 73], [86, 86, 86, 92, 95, 99] Class 2: 75, 9, 82, 81, 69, 71, 85, 92, 88, 78, 73, 65, 66 points [65, 66, 69, 71, 73, 75], 78, [81, 82, 85, 88, 92, 9] a. Find the mean, 5-number summary, IQR and range for each class, including units. Class 1: Class 2: Mean: 79.3 points Mean: 78. points 5-Number summary: (55, 71, 79.5, 89, 99) points (65, 70, 78, 86.5, 9) points IQR = Q3-Q1 = 89 71 = 18 points Range = Max Min = 99 55 = points IQR = Q3-Q1 = 86.5 70 = 16.5 points Max Min = 9 65 = 29 points b. Draw the boxplot for each class using the same scale. Boxplot for Class 1 30 0 50 60 70 80 90 100 Points Boxplot for Class 2 30 0 50 60 70 80 90 100 Points Cara Lee Page 3

c. What is the shape of the data for each class? How can you tell? The shapes of both are approximately symmetric. The boxplots are fairly symmetric, and the medians are very close to the means for both classes. Calculating Standard Deviation, s d. Using your means rounded to one decimal place, find the standard deviation for each class, including units. The variable n refers to the number of data values. Class 1: Mean = 79.3 points, n = 12 students Test Score (points) Deviation from the mean Squared deviation 72 79.3 7.3 53.29 86 79.3 6.7.89 65 79.3 1.3 20.9 99 79.3 19.7 388.09 86 79.3 6.7.89 71 79.3 8.3 68.89 55 79.3 2.3 590.9 86 79.3 6.7.89 92 79.3 12.7 161.29 73 79.3 6.3 39.69 95 79.3 15.7 26.9 71 79.3 8.3 68.89 Sum of the squared deviations (numerator) 1956.28 s = ( x mean) 2 n 1 = 1956.28 12 1 13.3 points Cara Lee Page

75, 9, 82, 81, 69, 71, 85, 92, 88, 78, 73, 65, 66 Class 2: Mean = 78. points, n = 13 students Test Score (points) Deviation from the mean Squared deviation 75 78. 3. 11.56 9 78. 15.6 23.36 82 78. 3.6 12.96 81 78. 2.6 6.76 69 78. 9. 88.36 71 78. 7. 5.76 85 78. 6.6 3.56 92 78. 13.6 18.96 88 78. 9.6 92.16 78 78. 0. 0.16 73 78. 5. 29.16 65 78. 13. 179.56 66 78. 12. 153.76 Sum of the squared deviations (numerator) 1101.08 s = ( x mean) 2 n 1 = 1101.08 13 1 9.58 points e. Write a few complete sentences summarizing the four characteristics of the distribution of class 1. The test scores for class 1 are approximately symmetric with a mean of 79.3 points and a standard deviation of 13.35 points. The student who scored 55 points may be an outlier. Cara Lee Page 5

f. Which class did better on the test? Use the vocabulary and values for center and spread in your answer. Class 1 had a slightly higher mean (79.3 vs 78. points) and median (79.5 vs 78 points) and students scored higher, but some students also scored lower. It could be said that Class 1 did better. It could also be said that Class 2 did better because they have a smaller range (29 vs points) and standard deviation (9.58 vs 13.3 points), so their scores were less spread out and more consistent. The lowest score in class 2 was 65. The lowest score in Class 1 was 55 points which may be an outlier due to a student having a bad day, illness or emergency, etc. More on Shapes (Distributions) of Data Measurements like heights and weights of people tend to follow a Normal distribution. Measurements of machined parts or test scores on very complex tests, like the SAT s, also follow a Normal distribution. Variables that have a limit on one side tend to be skewed away from that side. For example, the number of tattoos people have, tends to be skewed to the right, because there is a lower limit of 0. 3. Match the description of the data with its most likely shape: a. The amount of liquid in soda cans (ml) i. Normally distributed b. Lengths of newborn babies (in) ii. Skewed to the right c. Household incomes in the U.S. (dollars) iii. Skewed to the left. For each quantitative variable in the class survey, write the shape of the distribution. Write n/a for qualitative variables. a. How do you identify your gender? n/a, qualitative b. What is your age? Skewed to the right c. How many credits are you taking this term? Skewed to the right d. What is your main mode of transportation to campus? n/a qualitative e. How long is your pinky finger to the nearest half-centimeter? Symmetric Cara Lee Page 6

The Normal Distribution Theoretically, pinky measurements should follow a Normal distribution. From several terms of Math 105 students, the mean pinky length is 6 cm with a standard deviation of 0.86 cm. 5. Label the horizontal scale of the Normal model to represent the pinky length of PCC students. Then divide the curve into sections and write the probabilities from the empirical rule in each section. 0.0015 0.0235 0.135 0.3 0.3 0.135 0.0235 0.0015 3.2.28 5.1 6 6.86 7.72 8.58 Student Pinky Lengths in cm 6. Using the Empirical rule, find the percentage of PCC students who would have pinky lengths a. Less than 5.1 cm 0.0015 + 0.0235 + 0.135 = 0.16 or 16% b. Between 6 and 7.72 cm 0.3 + 0.135 = 0.75 or 7.5% c. Greater than 6.86 cm 0.135 + 0.0235 + 0.0015 = 0.16 or 16% Z-Scores 7a. Calculate the Z-score for a person with a pinky length of 7.72 cm. What does this mean? 7.72 6 Z = 2 standard deviations. This person is 2 standard deviations 0.86 above the mean, which is rare. b. Calculate the Z-score for a person with a pinky length of 5.57 cm. What does this mean? 5.57 6 Z = -0.5 standard deviations. This person is half a standard 0.86 deviation below the mean. Cara Lee Page 7

Percentiles 8a. In what percentile is a student with a pinky length of 5.1 cm? What does this mean? 0.135 + 0.0235 + 0.0015 = 0.16. This person is in the 16 th percentile, which means their pinky length is longer than 16% of the population. b. What pinky length is the 8 th percentile? What does this mean? 0.0015 + 0.0235 + 0.135 + 0.3 + 0.3 = 0.8 or 8% The 8 th percentile corresponds to one standard deviation above the mean, so the 8 th percentile is 6.86 cm. More Practice The thickness of the plastic on an interior car door part follows a Normal distribution with a mean of 3 mm and a standard deviation of 0.5 mm. 9. Label the horizontal scale of the Normal model to represent the pinky length of PCC students. Then divide the curve into sections and write the probabilities from the empirical rule in each section. 0.0015 0.0235 0.135 0.3 0.3 0.135 0.0235 0.0015 1.5 2 2.5 3 3.5.5 Plastic Thickness in mm 10. Using the Empirical rule, find the percentage of the parts that have a thickness a. Between 2.5 and mm 0.3 + 0.3 + 0.135 = 0.815 or 81.5% b. Greater than 2mm 0.135 + 0.3 + 0.3 + 0.135 + 0.0235 + 0.0015 = 97.5% c. Less than 3.5 mm 0.3 + 0.3 + 0.135 + 0.0235 + 0.0015 = 0.8 or 8% Cara Lee Page 8

Z-Scores 11a. Calculate the Z-score for a part with a thickness of 3.75 mm. What does that mean? Z 3.75 3 0.5 = 1.5 standard deviations. This part is 1.5 standard deviations above the mean. b. Calculate the Z-score for a part with a thickness of 1.5 mm. What does that mean? Z 1.5 3 0.5 = -3 standard deviations. This part is 3 standard deviations below the mean, which is extremely rare. Percentiles 12a. In what percentile is a part with a thickness of.5 mm? 1 0.0015 = 0.9985. (Or add up all the decimals to the left of.5.) This part is in the 99.85 th percentile, which means the part s is thicker than 99.85% of the parts. b. What thickness is the 16 th percentile? 0.0015 + 0.0235 + 0.135 = 0.16 or 16% The 16 th percentile corresponds to one standard deviation below the mean, so the 16 th percentile is a part with thickness of 2.5 mm. That means a part with a thickness of 2.5 mm is thicker than 16% of the parts. Cara Lee Page 9