Practice problems from chapters 2 and 3

Similar documents
Section 3. Measures of Variation

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

2011 Pearson Education, Inc

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved

GRACEY/STATISTICS CH. 3. CHAPTER PROBLEM Do women really talk more than men? Science, Vol. 317, No. 5834). The study

UNIVERSITY OF MASSACHUSETTS Department of Biostatistics and Epidemiology BioEpi 540W - Introduction to Biostatistics Fall 2004

Chapter 2: Tools for Exploring Univariate Data

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

Chapter 3 Data Description

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

3.1 Measure of Center

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

MATH 117 Statistical Methods for Management I Chapter Three

A is one of the categories into which qualitative data can be classified.

Stats Review Chapter 3. Mary Stangler Center for Academic Success Revised 8/16

Exercises from Chapter 3, Section 1

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

Recap: Ø Distribution Shape Ø Mean, Median, Mode Ø Standard Deviations

QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 FALL 2012 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS

STAT 200 Chapter 1 Looking at Data - Distributions

MATH 1150 Chapter 2 Notation and Terminology

AP Final Review II Exploring Data (20% 30%)

The Empirical Rule, z-scores, and the Rare Event Approach

Chapter 2 Class Notes Sample & Population Descriptions Classifying variables

Elementary Statistics

Exam: practice test 1 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Resistant Measure - A statistic that is not affected very much by extreme observations.

Slide 1. Slide 2. Slide 3. Pick a Brick. Daphne. 400 pts 200 pts 300 pts 500 pts 100 pts. 300 pts. 300 pts 400 pts 100 pts 400 pts.

TOPIC: Descriptive Statistics Single Variable

Section 1.1. Data - Collections of observations (such as measurements, genders, survey responses, etc.)

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

MgtOp 215 Chapter 3 Dr. Ahn

P8130: Biostatistical Methods I

equal to the of the. Sample variance: Population variance: **The sample variance is an unbiased estimator of the

Full file at

Section 3.2 Measures of Central Tendency

3 Lecture 3 Notes: Measures of Variation. The Boxplot. Definition of Probability

Unit 2: Numerical Descriptive Measures

University of Jordan Fall 2009/2010 Department of Mathematics

Complement: 0.4 x 0.8 = =.6

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest:

3.3. Section. Measures of Central Tendency and Dispersion from Grouped Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.

Chapter. Numerically Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Lecture 2 and Lecture 3

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data

The area under a probability density curve between any two values a and b has two interpretations:

Measures of the Location of the Data

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 3.1-1

Example 2. Given the data below, complete the chart:

1.3.1 Measuring Center: The Mean

Chapter 01 : What is Statistics?

Section 2.3: One Quantitative Variable: Measures of Spread

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22

CHAPTER 2 Description of Samples and Populations

Describing distributions with numbers

Introduction to Statistics

are the objects described by a set of data. They may be people, animals or things.

STT 315 This lecture is based on Chapter 2 of the textbook.

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart

Chapter 1 - Lecture 3 Measures of Location

Determining the Spread of a Distribution

The empirical ( ) rule

6 THE NORMAL DISTRIBUTION

Determining the Spread of a Distribution

CHAPTER 1. Introduction

2 Descriptive Statistics

Averages How difficult is QM1? What is the average mark? Week 1b, Lecture 2

Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users

STAT 155 Introductory Statistics. Lecture 6: The Normal Distributions (II)

Chapter 3. Data Description

(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables)

Lecture 3: Chapter 3

Math 14 Lecture Notes Ch Percentile

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

Recall that the standard deviation σ of a numerical data set is given by

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Sampling, Frequency Distributions, and Graphs (12.1)

Determining the Spread of a Distribution Variance & Standard Deviation

Range The range is the simplest of the three measures and is defined now.

Lecture 2. Descriptive Statistics: Measures of Center

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?!

CHAPTER 2: Describing Distributions with Numbers

Quantitative Tools for Research

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

Chapter (3) Describing Data Numerical Measures Examples

Descriptive Statistics-I. Dr Mahmoud Alhussami

Units. Exploratory Data Analysis. Variables. Student Data

Stats Review Chapter 6. Mary Stangler Center for Academic Success Revised 8/16

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67

1. Exploratory Data Analysis

Practice Questions for Exam 1

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012

Exercise 1. Exercise 2. Lesson 2 Theoretical Foundations Probabilities Solutions You ip a coin three times.

Chapter 6 Assessment. 3. Which points in the data set below are outliers? Multiple Choice. 1. The boxplot summarizes the test scores of a math class?

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

Remember your SOCS! S: O: C: S:

Transcription:

Practice problems from chapters and 3 Question-1. For each of the following variables, indicate whether it is quantitative or qualitative and specify which of the four levels of measurement (nominal, ordinal, interval, and ratio) is most appropriate. a) Class standing (i.e., letter grades) of students of a statistics class b) Admitting diagnosis of patients admitted to a mental health clinic c) Weights of babies born in a hospital during a year d) Gender of babies born in a hospital during a year e) Under-arm temperature of day-old infants born in a hospital. Answer: a) Qualitative and Ordinal; b) Qualitative and Nominal; c) Quantitative and Ratio; d) Qualitative and Nominal; e) Quantitative and Interval. Question-. Consider the following the following sample data set 0.3, 0.6, 0.9, 1.3, 0.4, 0.6, 1., 1.4, 1.1, 0., 0. a) Find the mean, median, standard deviation, and range. b) Find the interquartile range. c) Find the 45th and 87th percentiles. Answer: Sort data in ascending order: 0., 0., 0.3, 0.4, 0.6, 0.6, 0.9, 1.1, 1., 1.3, 1.4 I want to remind you that L k = location of kth percentile in the sorted data and P k = data value at the location of kth percentile of sorted data a) Mean=0.75; Median: since n=11 is odd number, the median is the 6th number=0.6 Standard deviation: n( x i s = ) ( x i ) = n(n 1) 11(8.16) (8.) 11(11 1) = 0.45 Range=1.4-0.=1. b) Interquartile range = Q 3 Q 1. To find P 5 or the first quartile, find L 5 = kn 100, where k=5 and n=11. L 5 = 0.5 11 =.75. It is a fraction number. So the third value in the sorted data, 1

i.e., P 5 = Q 1 = 0.3; To find P 75 or the third quartile, find L 75 = kn 100, where k=75 and n=11. L 75 = 0.75 11 = 8.5. It is a fraction number. So 9th value in the sorted data, i.e., P 75 = Q 3 = 1.. Interquartile range=1.-0.3= 0.9. c) Find P 45 and P 87. To find P 45, calculate L 45 = kn 100, where k=45 and n=11. L 45 = 0.45 11 = 4.95. It is a fraction number. So 5th value in the sorted data, i.e., P 45 = 0.6 To find P 87, calculate L 87 = kn 100, where k=87 and n=11. L 87 = 0.87 11 = 9.57. It is a fraction number. So 10th values in the sorted data i.e., P 87 = 1.3 Question-3. A study of physical fitness tests for 1 randomly selected Pre-Medical students measured their exercise capacity (in minutes). The following data resulted: 34, 19, 33, 30, 43, 36, 3, 41, 31, 31, 37, 18 a) Find the mean, the median, and the mode for the students exercise capacity. b) Find the standard deviation and the variance for the sample data of the students exercise capacity. c) Provide the five number summary for the students exercise capacity. d) Find the percentile corresponding to 36 minutes. e) Find P 4. Answer: a) Mean=3.08; Median= 6th value+7th value = 3+33 = 3.5; Mode=31 b) Standard deviation: n( x i s = ) ( x i ) = n(n 1) 1(1971) (385) 1(1 1) = 56.6 = 7.5, s = 56.6 c) Five number of summary: Min=18, Q 1 = 30.5, Q = 3.5, Q 3 = 36.5, Max=43. e) The number of data points less than 36=8. Percentile of 36 = 8 100 = 67 1 f) P 4 = 3rd value of sorted data=30

Question-4. Suppose we call unusual observations in a population of normally distributed data that are either at least standard deviation above the mean or about standard deviation below the mean. What percent are unusual? Answer: 5% Question-5. Suppose the distribution of grades in your statistics class is normal, with mean = 83.4, s = 7.0. There are 10 students in the class. If your score is 97.4 in the class, roughly how many students have scores higher than you? Answer: Here z score = 97.4 83.4 7 =. If z-score= (you may think that z-score is the same as k in the 68 95 99.7 rule, that is, k = ). In this rule, when k =, you are in the 95th position, that is,.5% (consider both side of normal curve, 5%/=.5%) of the population has a score greater than you (and therefore a higher exam score). If there are 10 people in the class then about (.05)*(10) = 3 students have higher scores. Question-6. Listed below are the thorax lengths of (in millimeters) of a sample of male fruit flies. Based on these sample values, is a thorax length of 0.68 mm unusual? Why or why not? 0.7, 0.90, 0.84, 0.68, 0.84, 0.90, 0.9, 0.84, 0.64, 0.84, 0.76 Answer: Mean x = 0.81, standard deviation s = 0.094, z-score= x x s 1.38. This score lies between - and. So 0.68 is not unusual. = 0.68 0.81 0.094 = Question-7. A woman wrote to Dear Abby and claimed that she gave birth 305 days from a visit from her husband, who was in Navy. Lengths of pregnancies have a mean of 68 days and a standard deviation of 15 days. Find the z-score for 305 days. Is such a length unusual? What do you conclude? Answer: Mean x = 68, standard deviation s = 15, z-score= x x = 305 68 =.47. s 15 This score is greater than but less than 3. So 305 is somewhat unusual. We can call 305 days long pregnancy as an unusual. Question-8. If a sample has a mean 55 and a standard deviation 6, use the empirical rule (show normal curve) to determine interval that you would expect 95% of the data to lie. Assume the data show a bell-shaped distribution. 3

Answer: x ± ks. Here k =, x = 55, and s = 6. Interval is: 55 ± 6 = [43, 67] Question-9. Use the sample data listed below to find the coefficient of variation for each of the two samples. Compare and interpret the result in your own words? Heights (in.) of men: 71, 66, 7, 69, 68, 69 Lengths (mm) of cuckoo eggs: 19.7, 1.7, 1.9,.1,.1,.3,.7,.9, 3.9 Answer:: Height of Men: mean x = 69.17, standard deviation s =.14, CV= 100 = s x.14 100 = 3.09%. 69.17 Lengths of cuckoo eggs: mean x =.14, standard deviation s = 1.13, CV= 100 = s x 1.13 100 = 5.1%..14 The relative variation to the mean in the cuckoo eggs length in greater by a factor of 1.7 times than the relative variation to the mean of the men s heights. Question-10. Use Chebyshev s theorem to find what percent of the values will fall between 10 and 6 for a data set with mean of 18 and standard deviation of. Answer: As we don t know the population distribution, we have to use Tchebychev s inequality. Here x = 18 and s =. Consider the interval [ x ks, x + ks]. The length of this interval= ( x + ks) ( x ks) = x + ks x + ks = ks. According to the question we have ks = 6 10 k = 16 4k = 16 4k = 16 4 4 k = 4 To get proportion of values, use Tchebychev s rule and which is = 1 1 = 1 1 = k 4 1 1 = 0.94. That is, at least 94% values lies between the interval 10 to 6. 16 Question-11. The box plot is created for wait time (in minutes) from the hospital s Emergency Room. These wait times are based on a sample of 160 patients during the month of January. (See Figure 1) a) What is an approximate value of the inter-quartile range for the above data? 4

Figure 1: Boxplot Answer: IQR = 14 8.5 = 5.5 b) If a patient waited for minutes, can this time be declared as potential outlier? Justify your answer. Answer: Yes, because is greater than the upper fence value 0. c) Estimate the number of patients who waited less than 14 minutes. Comment on the shape of the distribution of wait times. Answer: 75%. The shape of the distribution is skewed to the left. Question-1. How many different 9-letter code words can be made using the symbols %, %, %, %, &, &, &, +, +? 9! Answer: = 160 as there are nine items where four are alike, three are alike, 4!3!! and two are alike. Question-13. How many different ways can 5 identical tubes of tartar control toothpaste, 3 identical tubes of bright white toothpaste, and 4 identical tubes of mint toothpaste be arranged in a grocery counter display? (Answer: 7,70). 5

1! Answer: = 770 as there twelve items in all where five are alike, three are 5!3!4! alike, and four are alike. Question-14. Six men and seven women apply for two identical jobs. If the jobs are filled at random, find the following: (a) The probability that both are filled by men. (b) The probability that both are filled by women. (c) The probability that one man and one woman are hired. (d) The probability that the one man and the one woman who are twins are hired. Ans. Answer: (a) The random variable X, counts the number of men in a sample of two drawn without replacement from a population of size 13. As probability is the relative frequency of the event of interest in the sample space, we need to compute a ratio. The denominator of the ratio counts the number of ways two applicants can be drawn without replacement from a pool of 13 applicants. Order is not important here. In the numerator, we compute the number of ways two men can be selected from the six male applicants and none of the women can be selected from the seven female applicants. Then we apply the counting rule that says in how many ways two men can be selected and none of the women can be selected together. The 7 0) probability is then ( 13 ) = 0.19. ( 6 (b) (c) (d) ( 6 0 7 ( ) 13 ) = 0.69. ( 6 1 7 ( 1) 13 ) = 0.538 ( 1 1 1 1 5 0 6 ( 0) 13 ) = 0.013 6