Lecture 11. Data Description Estimation

Size: px
Start display at page:

Download "Lecture 11. Data Description Estimation"

Transcription

1 Lecture 11 Data Description Estimation

2 Measures of Central Tendency (continued, see last lecture) Sample mean, population mean Sample mean for frequency distributions The median The mode The midrange

3 3-22 The Weighted Mean The weighted mean is used when the values in a data set are not all equally represented. The weighted mean of a variable X is found by multiplying each value by its corresponding weight and dividing the sum of the products by the sum of the weights.

4 3-22 The Weighted Mean The weighted mean X = w X + w X w X n w + w w 1 2 wx = w where w, w,..., w are the weights 1 2 for the values X, X,..., X. n 1 2 n n n

5 Distribution Shapes Frequency distributions can assume many shapes. The three most important shapes are positively skewed,, symmetrical, and negatively skewed.

6 Positively Skewed Y Positively Skewed Mode < Median < Mean X

7 Symmetrical Y Symmetrical Mean = Media n = Mode X

8 Negatively Skewed Y Negatively Skewed Mean < Median < Mode X

9 3-33 Measures of Variation - Range The range is defined to be the highest value minus the lowest value. The symbol R is used for the range. R = highest value lowest value. Extremely large or extremely small data values can drastically affect the range.

10 3-33 Measures of Variation - Population Variance The variance is the average of the squares of the distance each value is from the mean. The symbol for the population variance is 2 σ ( σ is the Greek lowercase letter sigma) 2 2 ( X µ ) σ =, where N X = individual value µ = population mean N = population size

11 3-33 Measures of Variation - Population Standard Deviation The standard deviation is the square root of the variance. σ = 2 σ = 2 ( X µ ). N

12 3-33 Measures of Variation - Example Consider the following data to constitute the population: 10, 60, 50, 30, 40, 20. Find the mean and variance. The mean µ = ( )/6 = 210/6 = 35. The variance σ 2 = 1750/6 = See next slide for computations.

13 3-33 Measures of Variation - Example X X -- µ (( X -- µ 2 ) 2 )

14 3-33 Measures of Variation - Sample Variance The unbiased estimator of the population variance or the sample variance is a statistic whose value approximates the expected value of a population variance. It is denoted by s 2, where 2 2 ( X X ) s =, and n 1 X = sample mean n = sample size

15 3-33 Measures of Variation - Sample Standard Deviation The sample standard deviation is the square root of the sample variance. s = s 2 = 2 ( X X ). n 1

16 3-3 Shortcut Formula for the Sample Variance and the Standard Deviation s 2 = 2 2 X ( X ) / n n 1 s = 2 2 X ( X ) / n n 1

17 3-33 Sample Variance - Example Find the variance and standard deviation for the following sample: 16, 19, 15, 15, 14. ΣX = = 79. ΣX 2 = = 1263.

18 3-33 Sample Variance - Example s X ( X ) / n = n (79) / 5 = = s = 3.7 = 19..

19 3-33 Sample Variance for Grouped and Ungrouped Data For grouped data, use the class midpoints for the observed value in the different classes. For ungrouped data, use the same formula (see next slide) with the class midpoints, X m, replaced with the actual observed X value.

20 3-33 Sample Variance for Grouped and Ungrouped Data The samplevariance for grouped data: s 2 = 2 2 f X m [( f X m) / n]. n 1 For ungrouped data, replace X m with the observed X value.

21 3-33 Sample Variance for Grouped Data - Example X ff f X f X f X f X n = Σf X = Σf X 2 2 = 1500

22 3-33 Sample Variance for Ungrouped Data - Example The sample variance and standard deviation: s 2 2 f X [( f X ) / n] 2 s = n [(186)2/ 24] = = = = 16..

23 3-33 Coefficient of Variation The coefficient of variation is defined to be the standard deviation divided by the mean. The result is expressed as a percentage. s CVar= 100% or CVar = σ 100%. X µ

24 The Empirical (Normal) Rule For any bell shaped distribution: Approximately 68% of the data values will fall within one standard deviation of the mean. Approximately 95% will fall within two standard deviations of the mean. Approximately 99.7% will fall within three standard deviations of the mean.

25 The Empirical (Normal) Rule µ ± 1σ 68% µ ± 2σ -- 95% µ ± 3σ 99.7% µ 3σ µ 2σ µ 1σ µ µ +1σ µ +2σ µ +3σ

26 3-44 Measures of Position z score The standard score or z score for a value is obtained by subtracting the mean from the value and dividing the result by the standard deviation. The symbol z is used for the z score.

27 3-44 Measures of Position z-score The z score represents the number of standard deviations a data value falls above or below the mean. For samples: X X z = s For populations: X µ z =. σ.

28 3-44 z-score z - Example A student scored 65 on a statistics exam that had a mean of 50 and a standard deviation of 10. Compute the z-score. z = (65 50)/10 = 1.5. That is, the score of 65 is 1.5 standard deviations above the mean. Above - since the z-score is positive.

29 3-44 Quartiles Quartiles divide the data set into 4 groups. Quartiles are denoted by Q 1, Q 2, and Q 3. The median is the same as Q 2.

30 3-44 Outliers and the Interquartile Range (IQR) An outlier is an extremely high or an extremely low data value when compared with the rest of the data values. The Interquartile Range, IQR = Q 3 Q. 1

31 3-55 Exploratory Data Analysis - Box Plot When the data set contains a small number of values, a box plot is used to graphically represent the data set. These plots involve five values: the minimum value, the lower hinge (Q( 1 ), the median, the upper hinge (Q( 3 ), and the maximum value.

32 3-55 Exploratory Data Analysis - Box Plot The lower hinge is the other name for the lower quartile Q 1, denoted by the symbol LH. The upper hinge is the other name for the upper quartile Q 3, denoted by the symbol UH.

33 Exploratory Data Analysis - Box Plot - Example (Cardiograms data) LH UH MINIMUM MAXIMUM MEDIAN

34 Information Obtained from a Box Plot If the median is near the center of the box, the distribution is approximately symmetric. If the median falls to the left of the center of the box, the distribution is positively skewed. If the median falls to the right of the center of the box, the distribution is negatively skewed.

35 Information Obtained from a Box Plot If the lines are about the same length, the distribution is approximately symmetric. If the right line is larger than the left line, the distribution is positively skewed. If the left line is larger than the right line, the distribution is negatively skewed.

36 Estimations

37 Outline 8-1 Introduction 8-2 Confidence Intervals for the Mean [σ Known or n 30] and Sample Size 8-3 Confidence Intervals for the Mean [σ Unknown and n < 30]

38 8-1 Introduction One aspect of inferential statistics is estimation, which is the process of estimating the value of a parameter from information obtained from a sample. An important question: How large should the sample be in order to make an accurate estimate?

39 8-1 Introduction Two types of estimations: A point estimate is a specific numerical value estimate of a parameter. An interval estimate of a parameter is an interval or a range of values used to estimate the parameter.

40 8-2 Point estimates A point estimate is a specific numerical value estimate of a parameter. The best estimate of the population mean µ is the sample mean X.

41 8-22 Three Properties of a Good Estimator The estimator must be an unbiased estimator. That is, the expected value or the mean of the estimates obtained from samples of a given size is equal to the parameter being estimated.

42 8-22 Three Properties of a Good Estimator The estimator must be consistent. For a consistent estimator, as sample size increases, the value of the estimator approaches the value of the parameter estimated.

43 8-22 Three Properties of a Good Estimator The estimator must be a relatively efficient estimator. That is, of all the statistics that can be used to estimate a parameter, the relatively efficient estimator has the smallest variance.

44 8-22 Confidence Intervals An interval estimate interval estimate of a parameter is an interval or a range of values used to estimate the parameter. This estimate may or may not contain the value of the parameter being estimated.

45 8-22 Confidence Intervals A confidence interval confidence interval is a specific interval estimate of a parameter determined by using data obtained from a sample and the specific confidence level of the estimate.

46 8-22 Confidence Intervals The confidence level confidence level of an interval estimate of a parameter is the probability that the interval estimate will contain the parameter.

47 8-2 Confidence Intervals for the Mean (σ Known or n 30) and Sample Size Three common confidence intervals are used: the 90%, the 95%, and the 99% confidence intervals. When the sample size is large, approximately 95% of the sample means will fall within ±1.96 standard errors of the population mean:

48 8-2 Confidence Intervals for the Mean (σ Known or n 30) and Sample Size If a specific sample mean is selected, say, there is a 95% probability that it falls within the range of Likewise, there is a 95% probability that the interval specified by will contain µ.

49 8-2 Confidence Intervals for the Mean (σ Known or n 30) and Sample Size

50 8-22 Formula for the Confidence Interval of the Mean for a Specific α The confidence level is the percentage equivalent to the decimal value of 1 α. σ σ X z < µ < X + n α 2 α 2 z n

51 8-22 Maximum Error of Estimate The maximum error of estimate is the maximum difference between the point estimate of a parameter and the actual value of the parameter.

52 8-22 Confidence Intervals - Example The president of a large university wishes to estimate the average age of the students presently enrolled. From past studies, the standard deviation is known to be 2 years. A sample of 50 students is selected, and the mean is found to be 23.2 years. Find the 95% confidence interval of the population mean.

53 8-22 Confidence Intervals - Example Since the 95% confidence interval is desired, z = Hence, α 2 substituting in the formula σ X zα < µ < X + z n one gets 2 α 2 σ n

54 8-22 Confidence Intervals - Example (1.96)( ) < µ < (1.96)( ) < µ < < µ < 238. or 23.2 ± 0.6 years. Hence, the president can say, with 95% confidence, that the average age of the students is between and 238. years, based on 50 students.

55 8-22 Confidence Intervals - Example A certain medication is known to increase the pulse rate of its users. The standard deviation of the pulse rate is known to be 5 beats per minute. A sample of 30 users had an average pulse rate of 104 beats per minute. Find the 99% confidence interval of the true mean.

56 8-22 Confidence Intervals - Example Since the 99% confidence interval is desired, z = Hence, α 2 substituting in the formula σ X z < µ < α X + z n one gets 2 α 2 σ n

57 8-22 Confidence Intervals - Example (2.58). ( ) < µ < (2.58)( ) < µ < < µ < Hence, one can say, with 99% confidence, that the average pulse rate is between and beats per minute, based on 30 users.

58 The meaning of 95% confidence

59 8-2 Formula for the Minimum Sample Size Needed for an Interval Estimate of the Population Mean n = of estimate. z α where E is the σ 2 E If necessary, round the answer up to obtain a whole number. 2 maximum error

60 8-22 Minimum Sample Size Needed for an Interval Estimate of the Population Mean - Example The college president asks the statistics teacher to estimate the average age of the students at their college. How large a sample is necessary? The statistics teacher decides the estimate should be accurate within 1 year and be 99% confident. From a previous study, the standard deviation of the ages is known to be 3 years.

61 8-22 Minimum Sample Size Needed for an Interval Estimate of the Population Mean - Example Since α = ( or ), z = 2. 58, and E = 1, substituting in n n α 2 = z α σ 2 E 2 gives = ( )( 3 ) 2 =

62 8-33 Characteristics of the t Distribution The t distribution shares some characteristics of the normal distribution and differs from it in others. The t distribution is similar to the standard normal distribution in the following ways: It is bell-shaped. It is symmetrical about the mean.

63 8-33 Characteristics of the t Distribution The mean, median, and mode are equal to 0 and are located at the center of the distribution. The curve never touches the x axis. The t distribution differs from the standard normal distribution in the following ways:

64 8-33 Characteristics of the t Distribution The variance is greater than 1. The t distribution is actually a family of curves based on the concept of degrees of freedom, which is related to the sample size. As the sample size increases, the t distribution approaches the standard normal distribution.

65 8-33 Standard Normal Curve and the t Distribution

66 8-33 Confidence Interval for the Mean (σ Unknown and n < 30) - Example Ten randomly selected automobiles were stopped, and the tread depth of the right front tire was measured. The mean was 0.32 inch, and the standard deviation was 0.08 inch. Find the 95% confidence interval of the mean depth. Assume that the variable is approximately normally distributed.

67 8-33 Confidence Interval for the Mean (σ Unknown and n < 30) - Example Since σ is unknown and s must replace it, the t distribution must be used with α = Hence, with 9 degrees of freedom, t α/2 = (see Table F in text). From the next slide, we can be 95% confident that the population mean is between 0.26 and 0.38.

68 8-33 Confidence Interval for the Mean (σ Unknown and n < 30) - Example Thus the confidence of the population mean is found by substituting in X t 95% s < µ < X + t n α 2 α 2 interval s n 0.32 (2.262) 0.08 < µ < (2.262) < µ < 0. 38

69 When to use the z or t distribution?

Chapter 3 Data Description

Chapter 3 Data Description Chapter 3 Data Description Section 3.1: Measures of Central Tendency Section 3.2: Measures of Variation Section 3.3: Measures of Position Section 3.1: Measures of Central Tendency Definition of Average

More information

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- # Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview 3-2 Measures

More information

Range The range is the simplest of the three measures and is defined now.

Range The range is the simplest of the three measures and is defined now. Measures of Variation EXAMPLE A testing lab wishes to test two experimental brands of outdoor paint to see how long each will last before fading. The testing lab makes 6 gallons of each paint to test.

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3-2 Measures

More information

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. Chapter 3 Numerically Summarizing Data Chapter 3.1 Measures of Central Tendency Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. A1. Mean The

More information

3.1 Measures of Central Tendency: Mode, Median and Mean. Average a single number that is used to describe the entire sample or population

3.1 Measures of Central Tendency: Mode, Median and Mean. Average a single number that is used to describe the entire sample or population . Measures of Central Tendency: Mode, Median and Mean Average a single number that is used to describe the entire sample or population. Mode a. Easiest to compute, but not too stable i. Changing just one

More information

Lecture 2. Descriptive Statistics: Measures of Center

Lecture 2. Descriptive Statistics: Measures of Center Lecture 2. Descriptive Statistics: Measures of Center Descriptive Statistics summarize or describe the important characteristics of a known set of data Inferential Statistics use sample data to make inferences

More information

Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept.

Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept. Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3- Measures of Center 3-3 Measures of Variation Section 3-1: Overview Descriptive Statistics summarize or describe the important

More information

Determining the Spread of a Distribution Variance & Standard Deviation

Determining the Spread of a Distribution Variance & Standard Deviation Determining the Spread of a Distribution Variance & Standard Deviation 1.3 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3 Lecture 3 1 / 32 Outline 1 Describing

More information

MATH 117 Statistical Methods for Management I Chapter Three

MATH 117 Statistical Methods for Management I Chapter Three Jubail University College MATH 117 Statistical Methods for Management I Chapter Three This chapter covers the following topics: I. Measures of Center Tendency. 1. Mean for Ungrouped Data (Raw Data) 2.

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 2 Methods for Describing Sets of Data Summary of Central Tendency Measures Measure Formula Description Mean x i / n Balance Point Median ( n +1) Middle Value

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

Section 3.2 Measures of Central Tendency

Section 3.2 Measures of Central Tendency Section 3.2 Measures of Central Tendency 1 of 149 Section 3.2 Objectives Determine the mean, median, and mode of a population and of a sample Determine the weighted mean of a data set and the mean of a

More information

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature

More information

Stats Review Chapter 3. Mary Stangler Center for Academic Success Revised 8/16

Stats Review Chapter 3. Mary Stangler Center for Academic Success Revised 8/16 Stats Review Chapter Revised 8/16 Note: This review is composed of questions similar to those found in the chapter review and/or chapter test. This review is meant to highlight basic concepts from the

More information

3.1 Measure of Center

3.1 Measure of Center 3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:

More information

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved Chapter 3 Numerically Summarizing Data Section 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data

More information

Section 3. Measures of Variation

Section 3. Measures of Variation Section 3 Measures of Variation Range Range = (maximum value) (minimum value) It is very sensitive to extreme values; therefore not as useful as other measures of variation. Sample Standard Deviation The

More information

CHAPTER 2: Describing Distributions with Numbers

CHAPTER 2: Describing Distributions with Numbers CHAPTER 2: Describing Distributions with Numbers The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner Lecture PowerPoint Slides Chapter 2 Concepts 2 Measuring Center: Mean and Median Measuring

More information

Determining the Spread of a Distribution

Determining the Spread of a Distribution Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative

More information

Midrange: mean of highest and lowest scores. easy to compute, rough estimate, rarely used

Midrange: mean of highest and lowest scores. easy to compute, rough estimate, rarely used Measures of Central Tendency Mode: most frequent score. best average for nominal data sometimes none or multiple modes in a sample bimodal or multimodal distributions indicate several groups included in

More information

Determining the Spread of a Distribution

Determining the Spread of a Distribution Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative

More information

Review: Central Measures

Review: Central Measures Review: Central Measures Mean, Median and Mode When do we use mean or median? If there is (are) outliers, use Median If there is no outlier, use Mean. Example: For a data 1, 1.2, 1.5, 1.7, 1.8, 1.9, 2.3,

More information

Resistant Measure - A statistic that is not affected very much by extreme observations.

Resistant Measure - A statistic that is not affected very much by extreme observations. Chapter 1.3 Lecture Notes & Examples Section 1.3 Describing Quantitative Data with Numbers (pp. 50-74) 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar)

More information

The Normal Distribution Review

The Normal Distribution Review The Normal Distribution Review A. Properties of the Normal Distribution Many continuous variables have distributions that are bell-shaped and are called approximately normally distributed variables. The

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures

More information

3 Lecture 3 Notes: Measures of Variation. The Boxplot. Definition of Probability

3 Lecture 3 Notes: Measures of Variation. The Boxplot. Definition of Probability 3 Lecture 3 Notes: Measures of Variation. The Boxplot. Definition of Probability 3.1 Week 1 Review Creativity is more than just being different. Anybody can plan weird; that s easy. What s hard is to be

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing

More information

After completing this chapter, you should be able to:

After completing this chapter, you should be able to: Chapter 2 Descriptive Statistics Chapter Goals After completing this chapter, you should be able to: Compute and interpret the mean, median, and mode for a set of data Find the range, variance, standard

More information

Chapter. Numerically Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.

Chapter. Numerically Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc. Chapter 3 Numerically Summarizing Data Section 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data

More information

Section 2.4. Measuring Spread. How Can We Describe the Spread of Quantitative Data? Review: Central Measures

Section 2.4. Measuring Spread. How Can We Describe the Spread of Quantitative Data? Review: Central Measures mean median mode Review: entral Measures Mean, Median and Mode When do we use mean or median? If there is (are) outliers, use Median If there is no outlier, use Mean. Example: For a data 1, 1., 1.5, 1.7,

More information

Algebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves)

Algebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves) Algebra 2 Outliers Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves) Algebra 2 Notes #1 Chp 12 Outliers In a set of numbers, sometimes there will be

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 3 Numerical Descriptive Measures 3-1 Learning Objectives In this chapter, you learn: To describe the properties of central tendency, variation,

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Section 1.3 with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1 Exploring Data Introduction: Data Analysis: Making Sense of Data 1.1

More information

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Chapter 6 The Standard Deviation as a Ruler and the Normal Model Chapter 6 The Standard Deviation as a Ruler and the Normal Model Overview Key Concepts Understand how adding (subtracting) a constant or multiplying (dividing) by a constant changes the center and/or spread

More information

additionalmathematicsstatisticsadditi onalmathematicsstatisticsadditionalm athematicsstatisticsadditionalmathem aticsstatisticsadditionalmathematicsst

additionalmathematicsstatisticsadditi onalmathematicsstatisticsadditionalm athematicsstatisticsadditionalmathem aticsstatisticsadditionalmathematicsst additionalmathematicsstatisticsadditi onalmathematicsstatisticsadditionalm athematicsstatisticsadditionalmathem aticsstatisticsadditionalmathematicsst STATISTICS atisticsadditionalmathematicsstatistic

More information

Chapter 1 - Lecture 3 Measures of Location

Chapter 1 - Lecture 3 Measures of Location Chapter 1 - Lecture 3 of Location August 31st, 2009 Chapter 1 - Lecture 3 of Location General Types of measures Median Skewness Chapter 1 - Lecture 3 of Location Outline General Types of measures What

More information

Chapter 3. Data Description

Chapter 3. Data Description Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.

More information

Lecture 2 and Lecture 3

Lecture 2 and Lecture 3 Lecture 2 and Lecture 3 1 Lecture 2 and Lecture 3 We can describe distributions using 3 characteristics: shape, center and spread. These characteristics have been discussed since the foundation of statistics.

More information

OBJECTIVES INTRODUCTION

OBJECTIVES INTRODUCTION M7 Chapter 3 Section 1 OBJECTIVES Suarize data using easures of central tendency, such as the ean, edian, ode, and idrange. Describe data using the easures of variation, such as the range, variance, and

More information

Math 14 Lecture Notes Ch Percentile

Math 14 Lecture Notes Ch Percentile .3 Measures of the Location of the Data Percentile g A measure of position, the percentile, p, is an integer (1 p 99) such that the p th percentile is the position of a data value where p% of the data

More information

Sampling, Frequency Distributions, and Graphs (12.1)

Sampling, Frequency Distributions, and Graphs (12.1) 1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which

More information

Ch. 17. DETERMINATION OF SAMPLE SIZE

Ch. 17. DETERMINATION OF SAMPLE SIZE LOGO Ch. 17. DETERMINATION OF SAMPLE SIZE Dr. Werner R. Murhadi www.wernermurhadi.wordpress.com Descriptive and Inferential Statistics descriptive statistics is Statistics which summarize and describe

More information

Slide 1. Slide 2. Slide 3. Pick a Brick. Daphne. 400 pts 200 pts 300 pts 500 pts 100 pts. 300 pts. 300 pts 400 pts 100 pts 400 pts.

Slide 1. Slide 2. Slide 3. Pick a Brick. Daphne. 400 pts 200 pts 300 pts 500 pts 100 pts. 300 pts. 300 pts 400 pts 100 pts 400 pts. Slide 1 Slide 2 Daphne Phillip Kathy Slide 3 Pick a Brick 100 pts 200 pts 500 pts 300 pts 400 pts 200 pts 300 pts 500 pts 100 pts 300 pts 400 pts 100 pts 400 pts 100 pts 200 pts 500 pts 100 pts 400 pts

More information

How spread out is the data? Are all the numbers fairly close to General Education Statistics

How spread out is the data? Are all the numbers fairly close to General Education Statistics How spread out is the data? Are all the numbers fairly close to General Education Statistics each other or not? So what? Class Notes Measures of Dispersion: Range, Standard Deviation, and Variance (Section

More information

Chapter 4. Displaying and Summarizing. Quantitative Data

Chapter 4. Displaying and Summarizing. Quantitative Data STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range

More information

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives F78SC2 Notes 2 RJRC Algebra It is useful to use letters to represent numbers. We can use the rules of arithmetic to manipulate the formula and just substitute in the numbers at the end. Example: 100 invested

More information

Measures of the Location of the Data

Measures of the Location of the Data Measures of the Location of the Data 1. 5. Mark has 51 films in his collection. Each movie comes with a rating on a scale from 0.0 to 10.0. The following table displays the ratings of the aforementioned

More information

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things. (c) Epstein 2013 Chapter 5: Exploring Data Distributions Page 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms Individuals are the objects described by a set of data. These individuals

More information

MgtOp 215 Chapter 3 Dr. Ahn

MgtOp 215 Chapter 3 Dr. Ahn MgtOp 215 Chapter 3 Dr. Ahn Measures of central tendency (center, location): measures the middle point of a distribution or data; these include mean and median. Measures of dispersion (variability, spread):

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

The Normal Distribution. Chapter 6

The Normal Distribution. Chapter 6 + The Normal Distribution Chapter 6 + Applications of the Normal Distribution Section 6-2 + The Standard Normal Distribution and Practical Applications! We can convert any variable that in normally distributed

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

Sections 6.1 and 6.2: The Normal Distribution and its Applications

Sections 6.1 and 6.2: The Normal Distribution and its Applications Sections 6.1 and 6.2: The Normal Distribution and its Applications Definition: A normal distribution is a continuous, symmetric, bell-shaped distribution of a variable. The equation for the normal distribution

More information

Chapter 3. Data Description. McGraw-Hill, Bluman, 7 th ed, Chapter 3

Chapter 3. Data Description. McGraw-Hill, Bluman, 7 th ed, Chapter 3 Chapter 3 Data Description McGraw-Hill, Bluman, 7 th ed, Chapter 3 1 Chapter 3 Overview Introduction 3-1 Measures of Central Tendency 3-2 Measures of Variation 3-3 Measures of Position 3-4 Exploratory

More information

Topic-1 Describing Data with Numerical Measures

Topic-1 Describing Data with Numerical Measures Topic-1 Describing Data with Numerical Measures Central Tendency (Center) and Dispersion (Variability) Central tendency: measures of the degree to which scores are clustered around the mean of a distribution

More information

Measures of Location. Measures of position are used to describe the relative location of an observation

Measures of Location. Measures of position are used to describe the relative location of an observation Measures of Location Measures of position are used to describe the relative location of an observation 1 Measures of Position Quartiles and percentiles are two of the most popular measures of position

More information

A is one of the categories into which qualitative data can be classified.

A is one of the categories into which qualitative data can be classified. Chapter 2 Methods for Describing Sets of Data 2.1 Describing qualitative data Recall qualitative data: non-numerical or categorical data Basic definitions: A is one of the categories into which qualitative

More information

CHAPTER 3. YAKUP ARI,Ph.D.(C)

CHAPTER 3. YAKUP ARI,Ph.D.(C) CHAPTER 3 YAKUP ARI,Ph.D.(C) math.stat.yeditepe@gmail.com REMEMBER!!! The purpose of descriptive statistics is to summarize and organize a set of scores. One of methods of descriptive statistics is to

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

Describing Data: Numerical Measures. Chapter 3

Describing Data: Numerical Measures. Chapter 3 Describing Data: Numerical Measures Chapter 3 Learning Objectives Calculate the arithmetic mean, weighted mean median, and the mode. Explain the characteristics, uses, advantages, and disadvantages of

More information

STATISTICS. 1. Measures of Central Tendency

STATISTICS. 1. Measures of Central Tendency STATISTICS 1. Measures o Central Tendency Mode, median and mean For a sample o discrete data, the mode is the observation, x with the highest requency,. 1 N F For grouped data in a cumulative requency

More information

Instructor: Doug Ensley Course: MAT Applied Statistics - Ensley

Instructor: Doug Ensley Course: MAT Applied Statistics - Ensley Student: Date: Instructor: Doug Ensley Course: MAT117 01 Applied Statistics - Ensley Assignment: Online 04 - Sections 2.5 and 2.6 1. A travel magazine recently presented data on the annual number of vacation

More information

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest:

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest: 1 Chapter 3 - Descriptive stats: Numerical measures 3.1 Measures of Location Mean Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size Example: The number

More information

Measures of Dispersion

Measures of Dispersion Measures of Dispersion MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Introduction Recall that a measure of central tendency is a number which is typical of all

More information

MEASURING THE SPREAD OF DATA: 6F

MEASURING THE SPREAD OF DATA: 6F CONTINUING WITH DESCRIPTIVE STATS 6E,6F,6G,6H,6I MEASURING THE SPREAD OF DATA: 6F othink about this example: Suppose you are at a high school football game and you sample 40 people from the student section

More information

a) The runner completes his next 1500 meter race in under 4 minutes: <

a) The runner completes his next 1500 meter race in under 4 minutes: < I. Let X be the time it takes a runner to complete a 1500 meter race. It is known that for this specific runner, the random variable X has a normal distribution with mean μ = 250.0 seconds and standard

More information

Section 2.3: One Quantitative Variable: Measures of Spread

Section 2.3: One Quantitative Variable: Measures of Spread Section 2.3: One Quantitative Variable: Measures of Spread Objectives: 1) Measures of spread, variability a. Range b. Standard deviation i. Formula ii. Notation for samples and population 2) The 95% rule

More information

Numerical Measures of Central Tendency

Numerical Measures of Central Tendency ҧ Numerical Measures of Central Tendency The central tendency of the set of measurements that is, the tendency of the data to cluster, or center, about certain numerical values; usually the Mean, Median

More information

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Histograms: Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Sep 9 1:13 PM Shape: Skewed left Bell shaped Symmetric Bi modal Symmetric Skewed

More information

GRACEY/STATISTICS CH. 3. CHAPTER PROBLEM Do women really talk more than men? Science, Vol. 317, No. 5834). The study

GRACEY/STATISTICS CH. 3. CHAPTER PROBLEM Do women really talk more than men? Science, Vol. 317, No. 5834). The study CHAPTER PROBLEM Do women really talk more than men? A common belief is that women talk more than men. Is that belief founded in fact, or is it a myth? Do men actually talk more than women? Or do men and

More information

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are

More information

Chapter 2 Class Notes Sample & Population Descriptions Classifying variables

Chapter 2 Class Notes Sample & Population Descriptions Classifying variables Chapter 2 Class Notes Sample & Population Descriptions Classifying variables Random Variables (RVs) are discrete quantitative continuous nominal qualitative ordinal Notation and Definitions: a Sample is

More information

= n 1. n 1. Measures of Variability. Sample Variance. Range. Sample Standard Deviation ( ) 2. Chapter 2 Slides. Maurice Geraghty

= n 1. n 1. Measures of Variability. Sample Variance. Range. Sample Standard Deviation ( ) 2. Chapter 2 Slides. Maurice Geraghty Chapter Slides Inferential Statistics and Probability a Holistic Approach Chapter Descriptive Statistics This Course Material by Maurice Geraghty is licensed under a Creative Commons Attribution-ShareAlike.

More information

1.3: Describing Quantitative Data with Numbers

1.3: Describing Quantitative Data with Numbers 1.3: Describing Quantitative Data with Numbers Section 1.3 Describing Quantitative Data with Numbers After this section, you should be able to MEASURE center with the mean and median MEASURE spread with

More information

6.2A Linear Transformations

6.2A Linear Transformations 6.2 Transforming and Combining Random Variables 6.2A Linear Transformations El Dorado Community College considers a student to be full time if he or she is taking between 12 and 18 credits. The number

More information

Exercises from Chapter 3, Section 1

Exercises from Chapter 3, Section 1 Exercises from Chapter 3, Section 1 1. Consider the following sample consisting of 20 numbers. (a) Find the mode of the data 21 23 24 24 25 26 29 30 32 34 39 41 41 41 42 43 48 51 53 53 (b) Find the median

More information

MAT Mathematics in Today's World

MAT Mathematics in Today's World MAT 1000 Mathematics in Today's World Last Time 1. Three keys to summarize a collection of data: shape, center, spread. 2. Can measure spread with the fivenumber summary. 3. The five-number summary can

More information

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 3.1-1

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 3.1-1 Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola 3.1-1 Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview

More information

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE Course Title: Probability and Statistics (MATH 80) Recommended Textbook(s): Number & Type of Questions: Probability and Statistics for Engineers

More information

Chapter 2. Mean and Standard Deviation

Chapter 2. Mean and Standard Deviation Chapter 2. Mean and Standard Deviation The median is known as a measure of location; that is, it tells us where the data are. As stated in, we do not need to know all the exact values to calculate the

More information

1.3.1 Measuring Center: The Mean

1.3.1 Measuring Center: The Mean 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar) of a set of observations, add their values and divide by the number of observations. If the n observations

More information

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound 1 EDUR 8131 Chat 3 Notes 2 Normal Distribution and Standard Scores Questions Standard Scores: Z score Z = (X M) / SD Z = deviation score divided by standard deviation Z score indicates how far a raw score

More information

Estimating a Population Mean

Estimating a Population Mean Estimating a Population Mean MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Objectives At the end of this lesson we will be able to: obtain a point estimate for

More information

Answers Part A. P(x = 67) = 0, because x is a continuous random variable. 2. Find the following probabilities:

Answers Part A. P(x = 67) = 0, because x is a continuous random variable. 2. Find the following probabilities: Answers Part A 1. Woman s heights are normally distributed with a mean of 63.6 inches and a standard deviation of 2.5 inches. Find the probability that a single randomly selected woman will be 67 inches

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

1. Exploratory Data Analysis

1. Exploratory Data Analysis 1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be

More information

Measures of average are called measures of central tendency and include the mean, median, mode, and midrange.

Measures of average are called measures of central tendency and include the mean, median, mode, and midrange. CHAPTER 3 Data Description Objectives Suarize data using easures of central tendency, such as the ean, edian, ode, and idrange. Describe data using the easures of variation, such as the range, variance,

More information

Lecture 3: Chapter 3

Lecture 3: Chapter 3 Lecture 3: Chapter 3 C C Moxley UAB Mathematics 26 January 16 3.2 Measurements of Center Statistics involves describing data sets and inferring things about them. The first step in understanding a set

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Measures of Central Tendency and their dispersion and applications. Acknowledgement: Dr Muslima Ejaz

Measures of Central Tendency and their dispersion and applications. Acknowledgement: Dr Muslima Ejaz Measures of Central Tendency and their dispersion and applications Acknowledgement: Dr Muslima Ejaz LEARNING OBJECTIVES: Compute and distinguish between the uses of measures of central tendency: mean,

More information

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different

More information

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE

More information