Instructor: Doug Ensley Course: MAT Applied Statistics - Ensley


Student:
Date:
Instructor: Doug Ensley
Course: MAT117 01 Applied Statistics - Ensley
Assignment: Online 04 - Sections 2.5 and 2.6

1. A travel magazine recently presented data on the annual number of vacation days averaged by residents of eight different countries. They reported 41 days for Italy, 38 for France, 35 for Germany, 32 for Brazil, 27 for Britain, 26 for Canada, 24 for Japan, and 10 for the United States. Complete parts (a) through (d).

a. Report the median. ____ days
b. By finding the median of the four values below the median, report the first quartile. ____ days
c. Find the third quartile. ____ days
d. Interpret the values found in parts (a)-(c) in the context of these data.
____% of the countries have residents who take fewer than 25.0 vacation days, half of the countries have residents who take fewer than ____ vacation days, and 75% of the countries have residents who take (1) ____ 36.5 vacation days per year. The middle 50% of the countries have residents who take an average of between ____ and 36.5 vacation days annually.
(1) fewer than / more than / exactly

ID: 2.5.63
9/2/15, 1:12 PM
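The quartile rule described in part (b) of problem 1 (take the median of the values below the overall median, and likewise above it) can be sketched in Python. This is only an illustration: the function name quartiles_by_halves is hypothetical, and dropping the middle value from both halves when the count is odd is an assumption, since the problem only covers an even count.

```python
from statistics import median

def quartiles_by_halves(values):
    """Return (Q1, median, Q3) using the median-of-halves rule."""
    data = sorted(values)
    half = len(data) // 2
    lower = data[:half]
    # Assumption: with an odd count, the middle value belongs to neither half.
    upper = data[half:] if len(data) % 2 == 0 else data[half + 1:]
    return median(lower), median(data), median(upper)

# Vacation days reported in problem 1.
vacation_days = [41, 38, 35, 32, 27, 26, 24, 10]
q1, med, q3 = quartiles_by_halves(vacation_days)
print(q1, med, q3)
```

Other software may use a different quartile convention and return slightly different values for the same data.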

2. The given data table is divided by categories (Cat) and number of observations (Obs) of general data. Use the data table to answer the following questions.

a. Find and interpret the median.
b. Find the first quartile (Q1) and the third quartile (Q3).
c. Find and interpret the mean.

Full data set
Cat  Obs    Cat  Obs    Cat  Obs    Cat  Obs
A    3.8    F    17.1   L    30.1   R    43.5
B    6.4    G    20.6   M    31.9   S    45.2
C    9.9    H    23.9   N    33.1   T    48.4
D    13.2   J    25.3   P    35.7   U    51.3
E    15.3   K    26.7   Q    39.5   V    52.9

a. The median is ____. (Round to the nearest tenth as needed.)
What information is given by the median?
A. Approximately 50% of the data lie below this value.
B. Approximately 50% of the data lie within 1 standard deviation of this value.
C. This value is the average of the data set.
D. Approximately 50% of the data are more than 1 standard deviation away from this value.

b. The first quartile, Q1, is ____. (Round to the nearest tenth as needed.)
The third quartile, Q3, is ____. (Round to the nearest tenth as needed.)

c. The mean is ____. (Round to the nearest tenth as needed.)
What information is given by the mean?
A. This value is the average of the data set.
B. Approximately 50% of the data are more than 1 standard deviation away from this value.
C. Approximately 50% of the data lie within 1 standard deviation of this value.
D. Approximately 50% of the data lie below this value.

ID: 2.5.64

3. The "high school female athletes" data file has data for 57 high school female athletes on the maximum number of pounds they were able to bench press, which is a measure of strength. For these data, mean x̄ = 86.2, Q1 = 76, median = 86, Q3 = 96. Complete parts (a) and (b).

a. Interpret the quartiles.
One (1) ____ had a maximum bench press less than ____ pounds, and one fourth had a maximum bench press (2) ____ than 96 pounds.
(1) fourth / fifth / third
(2) greater / less

b. Would you guess that the distribution is skewed, or roughly symmetric?
A. There's not enough evidence to decide.
B. Skewed
C. Roughly symmetric

ID: 2.5.65

4. Here is the five-number summary for the distribution of a cigarette tax (in cents) for all the states in a certain country. Use this information to answer parts a through d.

Minimum = 9, Q1 = 30, Median = 47, Q3 = 99, Maximum = 150

a. About what proportion of the states have cigarette taxes (i) greater than 30 cents and (ii) greater than 99 cents?
(i) About ____% of the states have cigarette taxes greater than 30 cents.
(ii) About ____% of the states have cigarette taxes greater than 99 cents.

b. Between what two values are the middle 50% of the observations found?
The lower bound of the middle 50% is ____ and the upper bound of the middle 50% is ____. (Type a whole number.)

c. Find and interpret the interquartile range.
The interquartile range (IQR) is ____.
What is the relevance of the IQR?
A. The IQR summarizes the range for the lower half of the data.
B. The IQR summarizes the range for the upper half of the data.
C. The IQR summarizes the range within one standard deviation of the mean.
D. The IQR summarizes the range for the middle half of the data.

d. Based on the summary, do you think this distribution was bell-shaped? If so, why? If not, why not, and what shape would you expect?
A. The distribution is skewed right because the median is closer to Q1. Further proof is given by the values of the minimum and maximum relative to Q1 and Q3, respectively.
B. The distribution is bell-shaped because the median is exactly between Q1 and Q3. Further proof is given by the value [...] maximum relative to Q1 and Q3, respectively.
C. The distribution is skewed left because the median is closer to Q3. Further proof is given by the values of the minimum [...] to Q1 and Q3, respectively.

ID: 2.5.68
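For problem 4, the interquartile range and the position of the median inside the box can be computed directly from the quartiles. The sketch below is a rough heuristic rather than a formal test of shape, and the function name iqr_and_shape is hypothetical.

```python
def iqr_and_shape(q1, med, q3):
    """Return the IQR and a rough shape guess from the quartiles alone."""
    iqr = q3 - q1
    lower_gap = med - q1   # distance from Q1 up to the median
    upper_gap = q3 - med   # distance from the median up to Q3
    if upper_gap > lower_gap:
        shape = "skewed right"   # median sits closer to Q1
    elif lower_gap > upper_gap:
        shape = "skewed left"    # median sits closer to Q3
    else:
        shape = "roughly symmetric"
    return iqr, shape

# Cigarette tax summary from problem 4 (in cents).
print(iqr_and_shape(30, 47, 99))
```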

5. During a recent semester at a large national university, students having accounts on a mainframe computer had hard drive use (in kilobytes) described by the five-number summary, minimum = 430, Q1 = 444, median = 596, Q3 = 1053, and maximum = 430,000. Complete parts a and b below.

a. Would you expect this distribution to be symmetric, skewed to the right, or skewed to the left? Explain. Fill in the blanks to complete the statement below.
The distribution is (1) ____, because the median is (2) ____
(1) symmetric / skewed to the right / skewed to the left
(2) closer to the minimum. / closer to the maximum. / exactly halfway between the minimum and the maximum.

b. Use the 1.5 × IQR criterion to determine all potential outliers that are present. Choose the correct answer below.
A. Since the minimum value is not within 1.5 × IQR of Q1, there is at least one outlier.
B. Since the maximum value is not within 1.5 × IQR of Q3, there is at least one outlier.
C. Since the minimum value is within 1.5 × IQR of Q1 and the maximum value is within 1.5 × IQR of Q3, there are no [...]
D. Since the minimum value is not within 1.5 × IQR of Q1 and the maximum value is not within 1.5 × IQR of Q3, there [...]

ID: 2.5.72

6. The scores on an exam have mean = 87, standard deviation = 13, minimum = 66, Q1 = 77, median = 81, Q3 = 103, and maximum = 120. State which of these values are used in a box plot and then sketch the box plot.

Which of these values are used in the box plot? Select all that apply.
A. Q3
B. maximum
C. mean
D. minimum
E. standard deviation
F. Q1
G. median

Choose the correct box plot below.
[Four candidate box plots, A through D, each drawn on an axis from 65 to 125; figures not reproduced.]

ID: 2.5.74
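The 1.5 × IQR criterion in problem 5 amounts to computing two cutoffs, often called fences, and checking whether the minimum or maximum falls outside them. A minimal sketch follows, with the hypothetical helper name iqr_fences and the multiplier exposed as a parameter k for illustration.

```python
def iqr_fences(q1, q3, k=1.5):
    """Return (lower, upper) cutoffs that lie k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# Hard drive use summary from problem 5 (in kilobytes).
minimum, q1, q3, maximum = 430, 444, 1053, 430_000
lower, upper = iqr_fences(q1, q3)

print("fences:", lower, upper)
print("minimum is a potential outlier:", minimum < lower)
print("maximum is a potential outlier:", maximum > upper)
```

Only the five-number summary is available here, so the check can confirm that at least one value lies beyond a fence but cannot say how many do.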

7. A survey was conducted to determine how many miles per day employees of a company used public transportation. The sample values are below. Identify the five-number summary, and draw a box plot.

0 0 0 0 0 0 0 5 8 10

Identify the five-number summary.
minimum = ____
Q1 = ____
median = ____
Q3 = ____
maximum = ____

Choose the correct box plot below.
[Four candidate box plots, A through D, drawn on differing axes; figures not reproduced.]

ID: 2.5.75

8. The unemployment rates for various countries range from 4.8 to 11.4, with Q1 = 5.9, median = 6.2, Q3 = 9.8, a mean of 6.9, and standard deviation of 3.3. Use this information to answer parts a through c.

a. In a box plot, what would be the values at the outer edges of the box, and what would be the values to which the whiskers extend?
The lower edge of the box is ____ and the upper edge of the box is ____. The lower whisker extends down to ____ and the upper whisker extends up to ____.

b. The highest unemployment rate was 11.4. Is it an outlier according to the three standard deviation criterion? Explain.
A. No, because it is less than three standard deviations from the mean.
B. Yes, because it is more than three standard deviations from the mean.
C. Yes, because it is less than three standard deviations from the mean.
D. No, because it is more than three standard deviations from the mean.

c. What unemployment value for a country would have a z-score equal to 0?
A country with a z-score of 0 would have an unemployment rate of ____.

ID: 2.5.77

9. The carbon dioxide emissions of a group of nations had a mean of 8.5 and standard deviation of 2.9.
a. One country's observation was 14.4. Find and interpret its z-score relative to the distribution of values for the group of nations.
b. Another country's observation was 2.2. Find and interpret its z-score.

a. Find the z-score for the observation of 14.4.
z = ____ (Round to two decimal places as needed.)
What does this z-score imply?
A. The observation 14.4 is not an outlier because it is less than 3 standard deviations from the mean.
B. The observation 14.4 is not an outlier because its z-score is positive.
C. The observation 14.4 is an outlier because it is greater than 3 standard deviations from the mean.
D. The observation 14.4 is an outlier because its z-score is negative.

b. Find the z-score for the observation of 2.2.
z = ____ (Round to two decimal places as needed.)
What does this z-score imply?
A. The observation 2.2 is an outlier because its z-score is negative.
B. The observation 2.2 is not an outlier because its z-score is positive.
C. The observation 2.2 is an outlier because it is greater than 3 standard deviations from the mean.
D. The observation 2.2 is not an outlier because it is less than 3 standard deviations from the mean.

ID: 2.5.78

10. For a sample of 297 female heights, the mean was 64.3 inches and the standard deviation was 2.2 inches. The shortest person in this sample had a height of 55 inches.
a. Find the z-score for the height of 55 inches.
b. What does the negative sign for the z-score represent?
c. Is this observation a potential outlier according to the three standard deviation distance criterion? Explain.

a. Find the z-score.
z = ____ (Round to one decimal place as needed.)

b. What does the negative sign for the z-score represent?
A. The observation is not a potential outlier.
B. The observation is a potential outlier.
C. The observation is above the mean.
D. The observation is below the mean.

c. Is this observation a potential outlier according to the three standard deviation distance criterion? Explain.
A. Yes, because the z-score is negative.
B. No, because the z-score is negative.
C. Yes, because it is greater than three standard deviations from the mean.
D. No, because it is less than three standard deviations from the mean.

ID: 2.5.79
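The z-scores asked for in problems 9 and 10 all follow the same formula, z = (observation - mean) / standard deviation, and the three standard deviation criterion flags an observation when the absolute value of z exceeds 3. A short sketch, with hypothetical function names:

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x lies from the mean."""
    return (x - mean) / sd

def is_potential_outlier(z, threshold=3.0):
    """Three standard deviation criterion: only the distance matters, not the sign."""
    return abs(z) > threshold

# Problem 9: carbon dioxide emissions, mean 8.5 and standard deviation 2.9.
for x in (14.4, 2.2):
    z = z_score(x, 8.5, 2.9)
    print(x, round(z, 2), is_potential_outlier(z))

# Problem 10: female heights, mean 64.3 and standard deviation 2.2 inches.
z_height = z_score(55, 64.3, 2.2)
print(55, round(z_height, 1), is_potential_outlier(z_height))
```

A negative z-score only says that the observation is below the mean; whether it is a potential outlier depends on its size.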

11. The MINITAB vertical side-by-side box plots shown below compare the values reported by the UN of per capita carbon dioxide emissions for two nations in a certain year. Complete parts (a) through (c).

[Side-by-side box plots of CO2 for Nation 1 and Nation 2 on a vertical axis from 0 to 25, with an outlier marked; figure not reproduced.]

a. Give the approximate value of carbon dioxide emissions for the outlier shown.
____ (Round to the nearest integer as needed.)

b. What shape would you predict for the distribution in Nation 2? Why?
A. Skewed to the left since the distance between Q2 and [...] large and the lower whisker is much shorter than the [...]
B. Skewed to the right since the distance between Q2 and Q3 is much larger than between Q1 and Q2 and the upper whisker is much longer than the lower one
C. There can be many values represented by the shorter [...] and just a few represented by the longer one. So, the [...] enough evidence for prediction.

c. Summarize how the carbon dioxide emissions compare in the two nations.
A. The emissions levels in Nation 1 and in Nation 2 are [...] because their whiskers overlap.
B. The emissions are much higher in Nation 1 than in Nation 2. Roughly 75% of the levels reported in Nation 2 are less than the smallest level reported in Nation 1.
C. The emissions are much lower in Nation 2 than in Nation 1. [...] of the levels reported in Nation 2 are less than those [...] Nation 1.

ID: 2.5.83

12. The six full-time employees of a tanning salon near campus had annual incomes last year of $8100, $8300, $8300, $8800, $9200, $9900. The owner made $450,000.
a. For the seven annual incomes at the salon, report the mean and the median.
b. Why is it misleading for the owner to boast to her friends that the average salary at the salon is more than $70,000?

a. The mean salary is $____.
The median salary is $____.

b. Why is the boast misleading?
A. It is misleading because no one actually makes $70,000.
B. It is misleading because the mean is so influenced by the owner's salary that it is not a typical value.
C. It is misleading because the mean is actually less than $70,000.
D. The boast is not misleading.

ID: 2.6.84
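The effect described in problem 12, where one very large income pulls the mean far away from a typical salary while the median barely moves, can be checked with Python's standard statistics module. The variable names below are illustrative only.

```python
from statistics import mean, median

# Annual incomes from problem 12: six employees, then the owner.
employees = [8100, 8300, 8300, 8800, 9200, 9900]
owner = 450_000
all_incomes = employees + [owner]

print("mean with owner:   ", mean(all_incomes))
print("median with owner: ", median(all_incomes))
print("mean without owner:", round(mean(employees), 2))
```

Comparing the mean with and without the owner shows why the median is the more resistant measure of center for these data.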

13. A college newspaper reported the results of a survey of students taken on campus. One question asked was "Do you think the current war has made us safer?" The figure shows the way the magazine reported the results. Use this information to complete parts a and b.

[Bar chart titled "Has War Made Us Safer?" with a vertical axis running from 30 to 65; the bars show 36% Yes and 61% No. Figure not reproduced.]

a. Explain what is wrong with the way this bar chart was constructed.
A. The graph uses figures, so the relative percentages [...] misleading.
B. The graph does not have a heading, so it is unclear [...] graph is about.
C. The graph is portraying more than one group when [...] differ greatly, so the size of the bars is misleading.
D. The vertical axis does not start at zero, so the relative percentages are visually misleading.

b. Explain why you would not see this error made with a pie chart.
A. A pie chart can always display more than one group.
B. A pie chart never uses figures, so the relative percentages [...] always be clear.
C. A pie chart always has a heading, so it will always be [...] what the pie chart is depicting.
D. A pie chart always uses percentages, so the relative sizes of the slices will represent the relative percentages in each category.

ID: 2.6.87
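To see how much a truncated vertical axis can exaggerate the comparison in problem 13, compare the ratio of the drawn bar heights with the ratio of the percentages themselves. The sketch assumes the bars start at 30, the lowest tick recoverable from the figure; the function name drawn_ratio is hypothetical.

```python
def drawn_ratio(a, b, baseline=0.0):
    """Ratio of bar heights as drawn when the axis starts at `baseline`."""
    return (a - baseline) / (b - baseline)

no_pct, yes_pct = 61, 36

# Assumption: the axis starts at 30, the lowest tick on the chart.
print("apparent ratio:", round(drawn_ratio(no_pct, yes_pct, baseline=30), 2))
print("actual ratio:  ", round(drawn_ratio(no_pct, yes_pct), 2))
```

Under that assumption the "No" bar is drawn several times taller than the "Yes" bar even though the percentage itself is less than twice as large.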

1. a. 29.5
b. 25.0
c. 36.5
d. 25; 29.5; (1) fewer than; 25.0

2. a. 28.4
A. Approximately 50% of the data lie below this value.
b. Q1 = 16.2, Q3 = 41.5
c. 28.7
A. This value is the average of the data set.

3. a. (1) fourth; 76; (2) greater
b. C. Roughly symmetric

4. a. (i) 75 (ii) 25
b. 30 and 99
c. 69
D. The IQR summarizes the range for the middle half of the data.
d. A. The distribution is skewed right because the median is closer to Q1. Further proof is given by the values of the minimum and maximum relative to Q1 and Q3, respectively.

5. a. (1) skewed to the right (2) closer to the minimum.
b. B. Since the maximum value is not within 1.5 × IQR of Q3, there is at least one outlier.

6. A. Q3, B. maximum, D. minimum, F. Q1, G. median
Box plot B (axis from 65 to 125)

7. minimum = 0, Q1 = 0, median = 0, Q3 = 5, maximum = 10
Box plot D (axis ticks 0, 4, 8, 12)

8. a. 5.9; 9.8; 4.8; 11.4
b. A. No, because it is less than three standard deviations from the mean.
c. 6.9

9. a. 2.03
A. The observation 14.4 is not an outlier because it is less than 3 standard deviations from the mean.
b. -2.17
D. The observation 2.2 is not an outlier because it is less than 3 standard deviations from the mean.

10. a. -4.2
b. D. The observation is below the mean.
c. C. Yes, because it is greater than three standard deviations from the mean.

11. a. 21
b. B. Skewed to the right since the distance between Q2 and Q3 is much larger than between Q1 and Q2 and the upper whisker is much longer than the lower one
c. B. The emissions are much higher in Nation 1 than in Nation 2. Roughly 75% of the levels reported in Nation 2 are less than the smallest level reported in Nation 1.

12. a. 71,800; 8800
b. B. It is misleading because the mean is so influenced by the owner's salary that it is not a typical value.

13. a. D. The vertical axis does not start at zero, so the relative percentages are visually misleading.
b. D. A pie chart always uses percentages, so the relative sizes of the slices will represent the relative percentages in each category.