Chapter 5. Understanding and Comparing. Distributions

Size: px
Start display at page:

Download "Chapter 5. Understanding and Comparing. Distributions"

Transcription

1 STAT 141 Introduction to Statistics Chapter 5 Understanding and Comparing Distributions Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

2 Boxplots How to create a boxplot? Assume we are given the histogram and 5-number summary. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

3 Step 1: draw a box with bottom Q1 and top Q3, then insert a line at Q2. Note: The red lines and labels of Q1,Q2,Q3 are NOT necessary, for illustration only. Step 2: draw two fences : upper fence = Q IQR, lower fence = Q1 1.5 IQR. Step 3: draw whiskers -draw lines from the ends of the box to the largest and smallest values within the fences. Step 4: add outliers, observations out of the fences, with special symbols. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

4 Summary of Boxplots The bottom line and the top line of the box are Q1 and Q3. The height of the box is IQR. The line insider the box is the median. If the median line is centred, then the distribution is symmetric. If the median line is closer to the bottom (Q1), equivalently Q2 Q1 < Q3 Q2, the distribution is right skewed. If the median line is closer to the top (Q3), equivalently Q2 Q1 > Q3 Q2, the distribution is left skewed. Boxplots can be drawn horizontally. Bin Zou STAT 141 University of Alberta Winter / 27

5 Comparing Groups with Boxplots Conclusions: wind speeds are low in the summer. The tendency is to go down from Jan to Aug, and then go up. Jan has the strongest winds with the largest spread. Bin Zou STAT 141 University of Alberta Winter / 27

6 Chapter 6 The Standard Deviation and the Normal Model Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

7 z-score z-score, also called standardized value, is a measure of relative standing. Assume y is an observation from a sample with mean ȳ and standard deviation s. Then z-score of y is defined as z = y ȳ. s This is the most important formula for the midterm. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

8 Bin Zou STAT 141 University of Alberta Winter / 27

9 z-score tells: how many standard deviations away from the mean does the measurement lie and in which direction? Positive z-score: observation is greater than the mean. Negative z-score: observation is smaller than the mean. Zero z-score: observation is equal to the mean. Bin Zou STAT 141 University of Alberta Winter / 27

10 Shifting Data Add (or subtract) a constant c to each value of the data. Results: all measures of position (centre, percentiles, minimum, maximum) will increase (or decrease) by the same constant. However, the spread (range, IQR, standard deviation) does NOT change. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

11 Rescaling Data Multiply (or divide) all the data values by a constant d. In formula, vspace-1.5ex y new = d y original. Result: position new = d position original. spread new = d spread original. Standardizing into z-scores involves shifting down by the value of the mean and rescaling (dividing) by the value of the standard deviation. Standardizing into z-scores changes the centre by making the mean 0. Standardizing into z-scores changes the spread by making the standard deviation 1. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

12 Density Curve Note: The area enclosed by the density and the x-axis is always 1. Why? Relative frequency adds up to 1. Bin Zou STAT 141 University of Alberta Winter / 27

13 Histogram VS Density Both describe the overall shape of the data, but density curve is smooth (without sharp corners). You can think density curve as a limit case of histogram when the class width approaches 0 (rectangles get narrower). Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

14 As shown by the graph, the area between a and b under the density curve is the proportion (percentage) of observations that fall in [a, b]. What if we want to know the proportion of observations that lie below a or above b? Note: we do NOT discuss the proportion of observations that hit exactly a or b in a density curve. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

15 Normal Model The density curve of a normal distribution/model is bell-shaped, symmetric and unimodal. Its shape is determined by two parameters: the mean µ (also the median and the mode) and the standard deviation σ. The above graph is the density curve of the standard normal distribution with µ = 0 and σ = 1. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

16 Standard Normal Model Recall what we have learnt from shifting and rescaling data. Assume we are given a normal model with the mean µ and the standard deviation σ (short notation N(µ,σ), where N stands for normal distribution). By subtracting µ and dividing by σ for all values (exactly the same as z-score), we obtain the standard normal model: z = y µ σ. Thus, only the distribution of the standard normal is provided in the table. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

17 The Rule Does this graph look somehow familiar to you? If NO, go back to the slide of Empirical rules in Chapter 4. In a normal model, approximately 68% of the values fall within one standard deviation of the mean. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

18 Normal Table Important! You must know how to use the normal table! The normal table provides proportion of the left tail (shadowed area) of the standard normal model below a given value z. The value of z is provided by two side bars: integer part and the first decimal by the vertical bar while the second decimal by the horizontal bar. Example: to find the proportion of values below 3.65, we first locate the row of 3.6 from the rightmost column, next locate the second decimal 0.05 from the top row. Then the unique intersection (0.01% if converted into percentage) gives the answer. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

19 The first page of the z-table covers negative z values from to 0, while the second page goes to the positive side. But, still left tail. What if question asks you to find the proportion of observations above a number (right tail), say greater than 0.19? From the table, we can obtain that the proportion of observations that fall below 0.19 is Since the total area is 1, the area of the right tail is = = 42.47%. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

20 Some people have no interest in either tails, they rather care about the middle portion of the standard normal model. Example: what is the proportion of the values between and 1.19 in the standard normal model? From the table, we find two numbers: (from -0.52) and (from 1.19). (Please check!) Of course, these two numbers are the area of the left tails of and To get the area between these two values, we only need to do subtraction: =0.5815=58.15%. Bin Zou STAT 141 University of Alberta Winter / 27

21 Quick Summary Find the proportion of values in an interval. Interval can only take three types. z < a or the values below a (left tail): directly report the number found from the table. a < z < b or the values between a and b (middle interval): bigger number (found from using b) - smaller number (found from using a). z > b or the values above b (right tail): 1 - the number found from the table. Not standard normal? Convert into the standard normal by z = y µ σ. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

22 From Percentiles to Scores We just learnt how to find proportion from z-scores. Now we study how to go backwards, finding z-scores for given percentiles. 1 Obtain the proportion below z (left tail). Think of the three cases discussed in the previous slide. 2 In the normal table, find the number (with four decimals) which is closest to the proportion. 3 From the position of the number, identity the value of z-score. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

23 Examples Example 6.1 Suppose we want to find the z-score, z, that makes up the smallest 2% in the standard normal model. Smallest indicates the left tail. So in this question, the proportion of the left tail is directly given, which is 2%, or The closest number to in the normal table is Do NOT look for 0.2 on the leftmost column under z. Proportion is known, but z-score is unknown. From the position of , look to the rightmost column, we get -2.0, to the topmost row Hence, the z-score is Bin Zou STAT 141 University of Alberta Winter / 27

24 Examples Example 6.2 Suppose now we are interested in the largest 5%. largest =right tail. So we are looking for z such that the area of (z > z ) is 5%, or The corresponding area of the left tail is then From the table, we find and , both are equally closest to 0.95 among all numbers. Notice that gives z = 1.64 and gives z = In this special case, since 0.95 is exactly the middle of and , we take z-score to be the middle of 1.64 and 1.65 as well. The solution is then Note: remember this example! You need the result million times throughout the course. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

25 Examples Example 6.3 Now we want to find z-scores that given the proportion of 95% in the middle. We are looking for z such that the area between z and z is Can you tell why these two statements are equivalent? Do you remember normal distributions are all symmetric, including the standard normal model. After partitioning the middle 95% out, we are left with 5% for two tails with equal area. Hence, each tail accounts for 2.5%. Using the proportion of , we find z = 1.96, then z = Another way: area below z is =0.975, which yields the same z-score z = Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

26 Examples If the normal model in the question is not standard, then using the standardization z = y µ σ to convert the non-standard into standard. Example 6.4 Assume that the length of a human pregnancy follows a normal distribution with mean 266 (days) and standard deviation 16 (days). What is the proportion that a human pregnancy lasts longer than 280 days. Denote y the length of a human pregnancy, then y N(266,16). What is the area (y > 280)? Using the standardization z = y µ σ, we convert y into z (the standard normal). Area (y > 280) = area (z > = ) = = Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

27 Examples Example 6.5 Assume a variable y is normally distributed with µ = 10 and σ = 2. Find the value that makes up the smallest 10% of this distribution. Find y such that area (y < y ) = 0.1. Equivalently, find z such that area (z < z ) = 0.1, where z = y µ σ. Note: After standardization, y becomes z, and y becomes z. But the inequality direction stays the same. From the proportion of 10%, we obtain z = Rewriting z = y µ σ gives y = µ + σ z. Hence, y = ( 1.28) = Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter / 27

Chapter 4. Displaying and Summarizing. Quantitative Data

Chapter 4. Displaying and Summarizing. Quantitative Data STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range

More information

Chapter 6 Group Activity - SOLUTIONS

Chapter 6 Group Activity - SOLUTIONS Chapter 6 Group Activity - SOLUTIONS Group Activity Summarizing a Distribution 1. The following data are the number of credit hours taken by Math 105 students during a summer term. You will be analyzing

More information

Chapter 7. Scatterplots, Association, and Correlation

Chapter 7. Scatterplots, Association, and Correlation Chapter 7 Scatterplots, Association, and Correlation Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 29 Objective In this chapter, we study relationships! Instead, we investigate

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 2 Methods for Describing Sets of Data Summary of Central Tendency Measures Measure Formula Description Mean x i / n Balance Point Median ( n +1) Middle Value

More information

Describing Distributions With Numbers

Describing Distributions With Numbers Describing Distributions With Numbers October 24, 2012 What Do We Usually Summarize? Measures of Center. Percentiles. Measures of Spread. A Summary Statement. Choosing Numerical Summaries. 1.0 What Do

More information

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things. (c) Epstein 2013 Chapter 5: Exploring Data Distributions Page 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms Individuals are the objects described by a set of data. These individuals

More information

Stats Review Chapter 3. Mary Stangler Center for Academic Success Revised 8/16

Stats Review Chapter 3. Mary Stangler Center for Academic Success Revised 8/16 Stats Review Chapter Revised 8/16 Note: This review is composed of questions similar to those found in the chapter review and/or chapter test. This review is meant to highlight basic concepts from the

More information

Describing Distributions

Describing Distributions Describing Distributions With Numbers April 18, 2012 Summary Statistics. Measures of Center. Percentiles. Measures of Spread. A Summary Statement. Choosing Numerical Summaries. 1.0 What Are Summary Statistics?

More information

Essential Statistics Chapter 6

Essential Statistics Chapter 6 1 Essential Statistics Chapter 6 By Navidi and Monk Copyright 2016 Mark A. Thomas. All rights reserved. 2 Continuous Probability Distributions chapter 5 focused upon discrete probability distributions,

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

Chapter 3. Data Description

Chapter 3. Data Description Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Percentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included:

Percentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included: AP Statistics Chapter 2 Notes 2.1 Describing Location in a Distribution Percentile: The pth percentile of a distribution is the value with p percent of the observations (If your test score places you in

More information

Unit 2: Numerical Descriptive Measures

Unit 2: Numerical Descriptive Measures Unit 2: Numerical Descriptive Measures Summation Notation Measures of Central Tendency Measures of Dispersion Chebyshev's Rule Empirical Rule Measures of Relative Standing Box Plots z scores Jan 28 10:48

More information

Section 3. Measures of Variation

Section 3. Measures of Variation Section 3 Measures of Variation Range Range = (maximum value) (minimum value) It is very sensitive to extreme values; therefore not as useful as other measures of variation. Sample Standard Deviation The

More information

Density Curves and the Normal Distributions. Histogram: 10 groups

Density Curves and the Normal Distributions. Histogram: 10 groups Density Curves and the Normal Distributions MATH 2300 Chapter 6 Histogram: 10 groups 1 Histogram: 20 groups Histogram: 40 groups 2 Histogram: 80 groups Histogram: 160 groups 3 Density Curve Density Curves

More information

Example 2. Given the data below, complete the chart:

Example 2. Given the data below, complete the chart: Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Math 140 Introductory Statistics Professor Silvia Fernández Chapter 2 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Visualizing Distributions Recall the definition: The

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Visualizing Distributions Math 140 Introductory Statistics Professor Silvia Fernández Chapter Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Recall the definition: The

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3-2 Measures

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

CHAPTER 2: Describing Distributions with Numbers

CHAPTER 2: Describing Distributions with Numbers CHAPTER 2: Describing Distributions with Numbers The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner Lecture PowerPoint Slides Chapter 2 Concepts 2 Measuring Center: Mean and Median Measuring

More information

STATISTICS 141 Final Review

STATISTICS 141 Final Review STATISTICS 141 Final Review Bin Zou bzou@ualberta.ca Department of Mathematical & Statistical Sciences University of Alberta Winter 2015 Bin Zou (bzou@ualberta.ca) STAT 141 Final Review Winter 2015 1 /

More information

Stat 101 Exam 1 Important Formulas and Concepts 1

Stat 101 Exam 1 Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative

More information

Chapter 1. Looking at Data

Chapter 1. Looking at Data Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,

More information

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE

More information

Performance of fourth-grade students on an agility test

Performance of fourth-grade students on an agility test Starter Ch. 5 2005 #1a CW Ch. 4: Regression L1 L2 87 88 84 86 83 73 81 67 78 83 65 80 50 78 78? 93? 86? Create a scatterplot Find the equation of the regression line Predict the scores Chapter 5: Understanding

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

3.1 Measure of Center

3.1 Measure of Center 3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects

More information

Descriptive Univariate Statistics and Bivariate Correlation

Descriptive Univariate Statistics and Bivariate Correlation ESC 100 Exploring Engineering Descriptive Univariate Statistics and Bivariate Correlation Instructor: Sudhir Khetan, Ph.D. Wednesday/Friday, October 17/19, 2012 The Central Dogma of Statistics used to

More information

Measures of center. The mean The mean of a distribution is the arithmetic average of the observations:

Measures of center. The mean The mean of a distribution is the arithmetic average of the observations: Measures of center The mean The mean of a distribution is the arithmetic average of the observations: x = x 1 + + x n n n = 1 x i n i=1 The median The median is the midpoint of a distribution: the number

More information

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Chapter 6 The Standard Deviation as a Ruler and the Normal Model Chapter 6 The Standard Deviation as a Ruler and the Normal Model Overview Key Concepts Understand how adding (subtracting) a constant or multiplying (dividing) by a constant changes the center and/or spread

More information

In this investigation you will use the statistics skills that you learned the to display and analyze a cup of peanut M&Ms.

In this investigation you will use the statistics skills that you learned the to display and analyze a cup of peanut M&Ms. M&M Madness In this investigation you will use the statistics skills that you learned the to display and analyze a cup of peanut M&Ms. Part I: Categorical Analysis: M&M Color Distribution 1. Record the

More information

Section 5.4. Ken Ueda

Section 5.4. Ken Ueda Section 5.4 Ken Ueda Students seem to think that being graded on a curve is a positive thing. I took lasers 101 at Cornell and got a 92 on the exam. The average was a 93. I ended up with a C on the test.

More information

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population Lecture 5 1 Lecture 3 The Population Variance The population variance, denoted σ 2, is the sum of the squared deviations about the population mean divided by the number of observations in the population,

More information

Statistics I Chapter 2: Univariate data analysis

Statistics I Chapter 2: Univariate data analysis Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,

More information

1 Probability Distributions

1 Probability Distributions 1 Probability Distributions In the chapter about descriptive statistics sample data were discussed, and tools introduced for describing the samples with numbers as well as with graphs. In this chapter

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67 Chapter 6 The Standard Deviation as a Ruler and the Normal Model 1 /67 Homework Read Chpt 6 Complete Reading Notes Do P129 1, 3, 5, 7, 15, 17, 23, 27, 29, 31, 37, 39, 43 2 /67 Objective Students calculate

More information

Describing Distributions with Numbers

Describing Distributions with Numbers Describing Distributions with Numbers Using graphs, we could determine the center, spread, and shape of the distribution of a quantitative variable. We can also use numbers (called summary statistics)

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. Chapter 3 Numerically Summarizing Data Chapter 3.1 Measures of Central Tendency Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. A1. Mean The

More information

Statistics I Chapter 2: Univariate data analysis

Statistics I Chapter 2: Univariate data analysis Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,

More information

1.3: Describing Quantitative Data with Numbers

1.3: Describing Quantitative Data with Numbers 1.3: Describing Quantitative Data with Numbers Section 1.3 Describing Quantitative Data with Numbers After this section, you should be able to MEASURE center with the mean and median MEASURE spread with

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:

More information

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Cengage Learning

More information

1 Measures of the Center of a Distribution

1 Measures of the Center of a Distribution 1 Measures of the Center of a Distribution Qualitative descriptions of the shape of a distribution are important and useful. But we will often desire the precision of numerical summaries as well. Two aspects

More information

The Normal Distribution. Chapter 6

The Normal Distribution. Chapter 6 + The Normal Distribution Chapter 6 + Applications of the Normal Distribution Section 6-2 + The Standard Normal Distribution and Practical Applications! We can convert any variable that in normally distributed

More information

Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution.

Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution. Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution. 1 Histograms p53 The breakfast cereal data Study collected data on nutritional

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Section 1.3 with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1 Exploring Data Introduction: Data Analysis: Making Sense of Data 1.1

More information

Describing Distributions With Numbers Chapter 12

Describing Distributions With Numbers Chapter 12 Describing Distributions With Numbers Chapter 12 May 1, 2013 What Do We Usually Summarize? Measures of Center. Percentiles. Measures of Spread. A Summary. 1.0 What Do We Usually Summarize? source: Prof.

More information

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives F78SC2 Notes 2 RJRC Algebra It is useful to use letters to represent numbers. We can use the rules of arithmetic to manipulate the formula and just substitute in the numbers at the end. Example: 100 invested

More information

Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution.

Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution. Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution. 1 Histograms p53 Spoiled ballots are a real threat to democracy. Below are

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

STT 315 This lecture is based on Chapter 2 of the textbook.

STT 315 This lecture is based on Chapter 2 of the textbook. STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their

More information

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- # Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview 3-2 Measures

More information

MEASURING THE SPREAD OF DATA: 6F

MEASURING THE SPREAD OF DATA: 6F CONTINUING WITH DESCRIPTIVE STATS 6E,6F,6G,6H,6I MEASURING THE SPREAD OF DATA: 6F othink about this example: Suppose you are at a high school football game and you sample 40 people from the student section

More information

The empirical ( ) rule

The empirical ( ) rule The empirical (68-95-99.7) rule With a bell shaped distribution, about 68% of the data fall within a distance of 1 standard deviation from the mean. 95% fall within 2 standard deviations of the mean. 99.7%

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures

More information

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Lecture 2 Quantitative variables There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Stemplot (stem-and-leaf plot) Histogram Dot plot Stemplots

More information

Chapter 4.notebook. August 30, 2017

Chapter 4.notebook. August 30, 2017 Sep 1 7:53 AM Sep 1 8:21 AM Sep 1 8:21 AM 1 Sep 1 8:23 AM Sep 1 8:23 AM Sep 1 8:23 AM SOCS When describing a distribution, make sure to always tell about three things: shape, outliers, center, and spread

More information

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data Finding Quartiles. Use the median to divide the ordered data set into two halves.. If n is odd, do not include the median in either half. If n is even, split this data set exactly in half.. Q1 is the median

More information

Section 3.2 Measures of Central Tendency

Section 3.2 Measures of Central Tendency Section 3.2 Measures of Central Tendency 1 of 149 Section 3.2 Objectives Determine the mean, median, and mode of a population and of a sample Determine the weighted mean of a data set and the mean of a

More information

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Spring 2008

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Spring 2008 DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics QM 120 Chapter 3 Spring 2008 Measures of central tendency for ungrouped data 2 Graphs are very helpful to describe

More information

Section 2.5 Formulas and Additional Applications from Geometry Section 2.6 Solving Linear Inequalities Section 7.

Section 2.5 Formulas and Additional Applications from Geometry Section 2.6 Solving Linear Inequalities Section 7. Section 2.5 Formulas and Additional Applications from Geometry Section 2.6 Solving Linear Inequalities Section 7.1 Evaluating Roots Section 2.5 Formulas and Additional Applications from Geometry Definition

More information

Reminders. Homework due tomorrow Quiz tomorrow

Reminders. Homework due tomorrow Quiz tomorrow Reminders Homework due tomorrow Quiz tomorrow 1 Warm Up - ACT Math Scores Distribution of ACT Math Scores Density 0 5 10 15 20 25 30 35 scores What percent of scores are between 12 and 24? Options: 38%,

More information

Histograms allow a visual interpretation

Histograms allow a visual interpretation Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called

More information

GRE Quantitative Reasoning Practice Questions

GRE Quantitative Reasoning Practice Questions GRE Quantitative Reasoning Practice Questions y O x 7. The figure above shows the graph of the function f in the xy-plane. What is the value of f (f( ))? A B C 0 D E Explanation Note that to find f (f(

More information

P8130: Biostatistical Methods I

P8130: Biostatistical Methods I P8130: Biostatistical Methods I Lecture 2: Descriptive Statistics Cody Chiuzan, PhD Department of Biostatistics Mailman School of Public Health (MSPH) Lecture 1: Recap Intro to Biostatistics Types of Data

More information

Chapter 18. Sampling Distribution Models. Bin Zou STAT 141 University of Alberta Winter / 10

Chapter 18. Sampling Distribution Models. Bin Zou STAT 141 University of Alberta Winter / 10 Chapter 18 Sampling Distribution Models Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 10 Population VS Sample Example 18.1 Suppose a total of 10,000 patients in a hospital and

More information

Chapter 6 The Normal Distribution

Chapter 6 The Normal Distribution Chapter 6 The Normal PSY 395 Oswald Outline s and area The normal distribution The standard normal distribution Setting probable limits on a score/observation Measures related to 2 s and Area The idea

More information

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart ST2001 2. Presenting & Summarising Data Descriptive Statistics Frequency Distribution, Histogram & Bar Chart Summary of Previous Lecture u A study often involves taking a sample from a population that

More information

Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Mean vs.

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Using the z-table: Given an Area, Find z ID1050 Quantitative & Qualitative Reasoning

Using the z-table: Given an Area, Find z ID1050 Quantitative & Qualitative Reasoning Using the -Table: Given an, Find ID1050 Quantitative & Qualitative Reasoning between mean and beyond 0.0 0.000 0.500 0.1 0.040 0.460 0.2 0.079 0.421 0.3 0.118 0.382 0.4 0.155 0.345 0.5 0.192 0.309 0.6

More information

FREQUENCY DISTRIBUTIONS AND PERCENTILES

FREQUENCY DISTRIBUTIONS AND PERCENTILES FREQUENCY DISTRIBUTIONS AND PERCENTILES New Statistical Notation Frequency (f): the number of times a score occurs N: sample size Simple Frequency Distributions Raw Scores The scores that we have directly

More information

MATH4427 Notebook 4 Fall Semester 2017/2018

MATH4427 Notebook 4 Fall Semester 2017/2018 MATH4427 Notebook 4 Fall Semester 2017/2018 prepared by Professor Jenny Baglivo c Copyright 2009-2018 by Jenny A. Baglivo. All Rights Reserved. 4 MATH4427 Notebook 4 3 4.1 K th Order Statistics and Their

More information

Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table

Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table Lesson Plan Answer Questions Summary Statistics Histograms The Normal Distribution Using the Standard Normal Table 1 2. Summary Statistics Given a collection of data, one needs to find representations

More information

Chapter 5: Exploring Data: Distributions Lesson Plan

Chapter 5: Exploring Data: Distributions Lesson Plan Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions: Stemplots Describing Center: Mean and Median Describing Variability: The Quartiles The

More information

Lecture 2 and Lecture 3

Lecture 2 and Lecture 3 Lecture 2 and Lecture 3 1 Lecture 2 and Lecture 3 We can describe distributions using 3 characteristics: shape, center and spread. These characteristics have been discussed since the foundation of statistics.

More information

Units. Exploratory Data Analysis. Variables. Student Data

Units. Exploratory Data Analysis. Variables. Student Data Units Exploratory Data Analysis Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 13th September 2005 A unit is an object that can be measured, such as

More information

BNG 495 Capstone Design. Descriptive Statistics

BNG 495 Capstone Design. Descriptive Statistics BNG 495 Capstone Design Descriptive Statistics Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential statistical methods, with a focus

More information

MgtOp 215 Chapter 3 Dr. Ahn

MgtOp 215 Chapter 3 Dr. Ahn MgtOp 215 Chapter 3 Dr. Ahn Measures of central tendency (center, location): measures the middle point of a distribution or data; these include mean and median. Measures of dispersion (variability, spread):

More information

Sections 6.1 and 6.2: The Normal Distribution and its Applications

Sections 6.1 and 6.2: The Normal Distribution and its Applications Sections 6.1 and 6.2: The Normal Distribution and its Applications Definition: A normal distribution is a continuous, symmetric, bell-shaped distribution of a variable. The equation for the normal distribution

More information

MATH 2560 C F03 Elementary Statistics I Lecture 1: Displaying Distributions with Graphs. Outline.

MATH 2560 C F03 Elementary Statistics I Lecture 1: Displaying Distributions with Graphs. Outline. MATH 2560 C F03 Elementary Statistics I Lecture 1: Displaying Distributions with Graphs. Outline. data; variables: categorical & quantitative; distributions; bar graphs & pie charts: What Is Statistics?

More information

1. Exploratory Data Analysis

1. Exploratory Data Analysis 1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be

More information

MATH 117 Statistical Methods for Management I Chapter Three

MATH 117 Statistical Methods for Management I Chapter Three Jubail University College MATH 117 Statistical Methods for Management I Chapter Three This chapter covers the following topics: I. Measures of Center Tendency. 1. Mean for Ungrouped Data (Raw Data) 2.

More information

+ Check for Understanding

+ Check for Understanding n Measuring Position: Percentiles n One way to describe the location of a value in a distribution is to tell what percent of observations are less than it. Definition: The p th percentile of a distribution

More information

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation Chapter Four Numerical Descriptive Techniques 4.1 Numerical Descriptive Techniques Measures of Central Location Mean, Median, Mode Measures of Variability Range, Standard Deviation, Variance, Coefficient

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Boxplots and standard deviations Suhasini Subba Rao Review of previous lecture In the previous lecture

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing

More information

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are

More information

How spread out is the data? Are all the numbers fairly close to General Education Statistics

How spread out is the data? Are all the numbers fairly close to General Education Statistics How spread out is the data? Are all the numbers fairly close to General Education Statistics each other or not? So what? Class Notes Measures of Dispersion: Range, Standard Deviation, and Variance (Section

More information

Probability Distributions

Probability Distributions CONDENSED LESSON 13.1 Probability Distributions In this lesson, you Sketch the graph of the probability distribution for a continuous random variable Find probabilities by finding or approximating areas

More information

EXPERIMENT: REACTION TIME

EXPERIMENT: REACTION TIME EXPERIMENT: REACTION TIME OBJECTIVES to make a series of measurements of your reaction time to make a histogram, or distribution curve, of your measured reaction times to calculate the "average" or "mean"

More information