MgtOp 215 Chapter 3 Dr. Ahn

Similar documents
Unit 2. Describing Data: Numerical

2011 Pearson Education, Inc

Unit 2: Numerical Descriptive Measures

Chapter 3. Data Description

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

Statistics for Managers using Microsoft Excel 6 th Edition

Section 3. Measures of Variation

P8130: Biostatistical Methods I

After completing this chapter, you should be able to:

Lecture 2 and Lecture 3

Quantitative Tools for Research

TOPIC: Descriptive Statistics Single Variable

Descriptive Univariate Statistics and Bivariate Correlation

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

MATH 117 Statistical Methods for Management I Chapter Three

Describing Distributions

A is one of the categories into which qualitative data can be classified.

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Spring 2008

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Topic-1 Describing Data with Numerical Measures

GRACEY/STATISTICS CH. 3. CHAPTER PROBLEM Do women really talk more than men? Science, Vol. 317, No. 5834). The study

equal to the of the. Sample variance: Population variance: **The sample variance is an unbiased estimator of the

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

3 Lecture 3 Notes: Measures of Variation. The Boxplot. Definition of Probability

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest:

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Lecture 2. Descriptive Statistics: Measures of Center

2.1 Measures of Location (P.9-11)

Chapter 2 Descriptive Statistics

Statistics I Chapter 2: Univariate data analysis

Range The range is the simplest of the three measures and is defined now.

Chapter 3. Measuring data

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives

Statistics I Chapter 2: Univariate data analysis

Describing Distributions With Numbers

Describing distributions with numbers

Practice problems from chapters 2 and 3

Measures of center. The mean The mean of a distribution is the arithmetic average of the observations:

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation

Lecture 1: Descriptive Statistics

1. Exploratory Data Analysis

Stats Review Chapter 3. Mary Stangler Center for Academic Success Revised 8/16

Slide 1. Slide 2. Slide 3. Pick a Brick. Daphne. 400 pts 200 pts 300 pts 500 pts 100 pts. 300 pts. 300 pts 400 pts 100 pts 400 pts.

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Chapter 4. Displaying and Summarizing. Quantitative Data

Lecture 11. Data Description Estimation

Determining the Spread of a Distribution

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Chapter 5. Understanding and Comparing. Distributions

Determining the Spread of a Distribution

Tastitsticsss? What s that? Principles of Biostatistics and Informatics. Variables, outcomes. Tastitsticsss? What s that?

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved

BNG 495 Capstone Design. Descriptive Statistics

= n 1. n 1. Measures of Variability. Sample Variance. Range. Sample Standard Deviation ( ) 2. Chapter 2 Slides. Maurice Geraghty

Chapter 1. Looking at Data

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

Determining the Spread of a Distribution Variance & Standard Deviation

Chapter 2: Descriptive Analysis and Presentation of Single- Variable Data

Introduction to Statistics

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

GRAPHS AND STATISTICS Central Tendency and Dispersion Common Core Standards

Descriptive Data Summarization

Section 3.2 Measures of Central Tendency

Chapter 1 - Lecture 3 Measures of Location

STAT 200 Chapter 1 Looking at Data - Distributions

Descriptive Statistics-I. Dr Mahmoud Alhussami

Chapter 3 Data Description

Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept.

3.1 Measure of Center

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

College Mathematics

Introduction to Statistics

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the

Let's Do It! What Type of Variable?

Describing Distributions With Numbers Chapter 12

Numerical Measures of Central Tendency

CHAPTER 2: Describing Distributions with Numbers

MEASURING THE SPREAD OF DATA: 6F

CIVL 7012/8012. Collection and Analysis of Information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Describing distributions with numbers

1 Measures of the Center of a Distribution

Chapter 2: Tools for Exploring Univariate Data

WELCOME!! LABORATORY MATH PERCENT CONCENTRATION. Things to do ASAP: Concepts to deal with:

1.3: Describing Quantitative Data with Numbers

ECLT 5810 Data Preprocessing. Prof. Wai Lam

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS

Chapter 1: Exploring Data

REVIEW: Midterm Exam. Spring 2012

Recap: Ø Distribution Shape Ø Mean, Median, Mode Ø Standard Deviations

Units. Exploratory Data Analysis. Variables. Student Data

Elementary Statistics

Preliminary Statistics course. Lecture 1: Descriptive Statistics

Stat 101 Exam 1 Important Formulas and Concepts 1

More on Variability. Overview. The Variance is sensitive to outliers. Marriage Data without Nevada

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:

3.3. Section. Measures of Central Tendency and Dispersion from Grouped Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.

Transcription:

MgtOp 215 Chapter 3 Dr. Ahn Measures of central tendency (center, location): measures the middle point of a distribution or data; these include mean and median. Measures of dispersion (variability, spread): measures the extent to which the observations are scattered; these include standard deviation and range. Parameter: a numerical descriptive measure computed from the population measurements (census data). Statistic: a numerical descriptive measure computed from sample measurements. Throughout we denote N: the number of observations in the population, that is the population size, n: the number of observations in a sample, that is, the sample size, and x i : the i-th observation. Measures of Central Tendency (Location) Population mean Sample mean 1 1 Formulas> Insert Function ( ) >Statistical>AVERAGE Note that the population mean and sample mean are arithmetic means or average, and computed in the same manner. Example 1. Find the sample mean of the following data. 5, 7, 2, 0, 4 Deviation from the mean: observation minus the mean, i.e. or. The mean is a measure of the center in the sense that it balances the deviations from the mean. Median: The middle value of data when ordered from smallest to largest Formulas> Insert Function ( ) >Statistical>MEDIAN Example 2. Find the sample median of the data in Example 1. The median is a measure of the center in the sense that it balances the number of observations on both sides of the median. 1

Example 3. Find the median of the following data. 5, 7, 2, 4, 2, 1 The mode is useful as a measure of central tendency for qualitative data. Graphical comparison of the mean, median, and mode Example 4. Find the mean and median of the following data and compare the results with those of Example 1. 5, 70, 2, 0, 4 HC Homework: 3.1-a & d only, 3.2-a & d only on p.117. Geometric mean: the n-th root of the product of n values. Formulas>Insert Function ( ) >Statistical>GEOMEAN Geometric mean rate of return (change): an average rate of change; a good way to sort out effects that are multiplicative 1 1 1 1 where the rate of change in decimals of i-th period. Example 5. Suppose the interest rates on a savings account during a five-year period are given below. Find the average interest rate per year during the five year period. Year 1 2 3 4 5 Interest rate 3.7% 2.7% 2.4% 2% 1.3% For the above example the numbers the you enter for GEOMEAN in Excel are 1.037, 1.027, 1.024, 1.02, and 1.013. MSL Homework: 3.21 HC Homework: 3.22 Measures of Dispersion Range=largest observation - smallest observation Mean absolute deviation (MAD): 1 for census data and 1 for sample data Formulas >Insert Function ( )>Statistical>AVEDEV Example 6. Find the mean absolute deviation of the data: 40, 55, 75, 95, 95 2

Population variance: 1 1 The (population) variance is the average of the squared deviations from the population mean. Formulas >Insert Function ( )>Statistical>VAR.P Population standard deviation: Formulas >Insert Function ( )>Statistical>STDEV.P Sample variance: 1 1 1 1 The (sample) variance is the average of the squared deviations from the (sample) mean. Formulas >Insert Function ( )>Statistical>VAR.S Sample standard deviation: Formulas >Insert Function ( )>Statistical>STDEV.S Example 7. Find the variance and standard deviation of the sample data in Example 6. Example 8. Find the standard deviation of the data in Example 7 assuming they are census data. Coefficient of variation: 100 % for population and 100 % % for sample a relative measure of the amount of variation with respect to the mean. Note the standard deviation is an absolute measure of variation. Example 9. Find the coefficient of variation of the data in Example 6. HC Homework: 3.1-b only, 3.2-b only on p.117 Example 10. Suppose based on the history of two stocks A and B we have the following information. Find the C.V. s of the stocks. Stock A Stock B Mean Price $100/share $10/share St. Dev. $5 $5 You may get summary measures using MS Excel: Data >Data Analysis.> Descriptive Statistics 3

Note that the standard deviation (as well as variance) given by the above is always based on the sample formula. Therefore for census data we need to make the following correction for the correct standard deviation. (St. Dev. from Excel) or use the function wizard as follows: Formulas >Insert Function ( )>Statistical>STDEV.P. Standardized score: for population data and for sample data, sometimes called a z-score represents the signed distance from the mean measured in units of standard deviation. The standardized variable has mean zero and standard deviation one and does not have a measurement unit. Formulas >Insert Function ( )>Statistical>STANDARDIZE. Example 11. Suppose the test score of a student is 75 on the accounting test with mean 70 and standard deviation 10 and the test score of the student is 65 on the statistics test with mean 60 and standard deviation 5. Compute the standardized test scores of the accounting and statistics test scores of this student, and interpret the standardized scores. HC Homework: 3.1-c only, 3.2-c only on p.117. MSL Homework: 3.13, 3.14, 3.17 Chebychev s Theorem: For any sample or population data the proportion of observations that lie within k standard deviations from the mean is at least 11/. This theorem describes the distribution of data using the mean and the standard deviation together. Example 12. When the sample mean and standard deviation are 77 and 9.04, respectively, find an interval around the mean that covers at least 75% of the data. Also find an interval that covers at least 95% of the data. Empirical rule: When the distribution of (sample or population) data is approximately bell shaped (or mound-shaped), then approximately 68% of the data are within 1 standard deviation from the mean, approximately 95% of the data are within 2 standard deviation from the mean, and approximately 99% of the data are within 3 standard deviation from then mean. Example 13. When the sample mean and standard deviation are 77 and 9.04, respectively, find an interval around the mean that covers approximately 95 % of the data. Assume the distribution of the data is bell shaped. Outliers: Extreme observations not conforming to the rest of the observation. As a rule of thumb observations that are three standard deviations above or below the mean are considered as outliers. MSL Homework: 3.37 HC Homework: 3.40 4

Suppose the observations,,, have been arranged in ascending order. The p-th percentile is the value denoted by such that at least p percent of the observations are less than or equal to and at least (100-p) percent of the observations are greater than or equal to. Formulas >Insert Function ( )>Statistical>PERCENTILE.INC Procedure for calculating percentiles: 1. Arrange n observations in ascending order. 2. Calculate i=np/100. 3. If i is an integer, the p-th percentile is the arithmetic average of and. 4. If i is not an integer, the p-th percentile is, where j is the smallest integer greater than i. Example 14: Suppose we have data 40, 37, 50, 26, 30, 59, 8, 50 a. Find a 25-th percentile. b. Find a 90-th percentile. c. Find a 75th percentile. The first quartile, also called lower quartile and denoted by, is the 25 th percentile. The third quartile, also called upper quartile and denoted by, is the 75 th percentile. Formulas >Insert Function ( )>Statistical>QUARTILE.INC The inter-quartile range (IQR) is, and is a measure of variability. Midhinge:, a measure of location. Example 15: In Example 14, find the inter-quartile range and midhinge. A box-(and whisker) plot is a graphical method of displaying the quartiles, ranges, and extreme values of data. In a box plot the left I (also called the left hinge) is at, the right I (right hinge) at, and I inside of the box is at the median, the right (left) whisker extends out from the right (left) hinge to the largest (smallest) observation with 1.5 IQR above (below) the right (left) hinge, observations beyond the reach of the whiskers are considered outlying observation. The Inner fences are located 1.5 IQR below and above. The Outer fences are located 3 IQR below and above. HC Homework: 3.27 and Calculate 20 th and 90 th MSL Homework: 3.32 percentile of the data for the problem. 5

Example of MS Excel output for the 227 observations of the 1-year return % of the Growth Funds in the data file Retirement Funds. Growth Mean 14.27797357 Standard Error 0.332135457 Median 14.18 Mode 16.95 Standard Deviation 5.004125231 Sample Variance 25.04126933 Kurtosis 5.147918131 Skewness 0.203923123 Range 45.26 Minimum 11.28 Maximum 33.98 Sum 3241.1 Count 227 f n >Statistical>AVEDEV 3.45945 AVERAGE 14.27797 MEDIAN 14.18 QUARTILE.INC 11.790 (with quart 1) 16.635 (with quart 3) STDEV.P 4.993 (Is this appropriate for the data?) PERCENTILE.INC 18.86 (90 th percentile with k=0.9) (Do you get the same value when you apply the procedure for percentiles discussed in class?) 6