Describing distributions with numbers
|
|
- Ralph Sharp
- 5 years ago
- Views:
Transcription
1 Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central tendency of the set of observations it is the tendency of the data to cluster, or center, about certain numerical values. The variability of the set of observation it is the spread of the data. week2 1
2 Measuring Center The Mode is the observation that occurs most frequently. The mode for categorical variable will be the label of the category with the highest number of counts. Measuring center Two common measures of center are the mean and the median. These two measures behave differently. The mean is the average value and the median is the middle value. week2 2
3 Measuring center: the median The median M is the midpoint of the distribution, the number such that half the observations are smaller then it and the other half are larger. To find the median of a distribution: 1. Arrange the observations in order of size, from smallest to largest. 2. If the number of observations n is odd, the median is the center observation in the ordered list. 3. If the number of observations n is even, the median is the average of the two center observations in the ordered list. week2 3
4 Example The annual salaries (in thousands of $) of a random sample of five employees of a company are: 40, 30, 25, 200, 28 Arranging the values in increasing order: median = 30 Excluding 200 median = (28+30)/2=29. week2 4
5 MINITAB commands Stat > Basic Statistics > Display Descriptive Statistics MINITAB output for the data in the example above is given bellow: Variable N Median salary week2 5
6 Measuring center: mean To find the mean of a set of observations, add their values and divide by the number of observations. If the n observations are x 1,x 2, x n, their mean is given by Example mean x x1 + x2 = x = + + x n Find the mean of the following observations: 4, 5, 9, 3, 5. Solution: n = x n i week2 6
7 Example The annual salaries (in thousands of $) of a random sample of five employees of a company are: 40, 30, 25, 200, 28. If we exclude 200 as an outlier, Mean is sensitive to the influence of a few extreme observations. Because the mean cannot resist the influence of extreme values, we say that it is NOT a resistant measure of center. week2 7
8 Mean versus median The median and mean are the most common measures of the center of a distribution. If the distribution is exactly symmetric, the mean and median are exactly the same. Median is less influenced by extreme values. A data set is said to be skewed if one tail of the distribution had more extreme observations than the other tail. If the distribution is skewed to the right, then mode < median < mean If the distribution is skewed to the left, then mean < median < mode. week2 8
9 Questions 1. You are asked to recommend a measure of center to characterize the following data: 0.6, 0.2, 0.1, 0.2, 0.2, 0.3, 0.7, 0.1, 0.0, 22.5, 0.4. What is your recommendation and why? 2. The mean is sensitive to extreme values than the median. (a) more (b) less (c) equally (d) can t say without data 3. Changing the value of a single score in a data set will necessarily cause the mean to change. (T/F) 4. Changing the value of a single score in a data set will necessarily cause the median to change. (T/F) week2 9
10 Percentiles The simplest useful numerical description of a distribution consists of both a measure of center and a measure of spread. We can describe the spread or variability of a distribution by giving several percentiles. The pth percentile of a distribution is the value such that p percent of the observations are smaller or equal to it. The median is the 50 th percentile. If a data set contains n observations, then the pth percentile is th the ( n + 1) p value in the ordered data set. 100 week2 10
11 Example Find the 20th percentile of the data represented by the following stem-and-leaf plot. Stem-and-leaf of Rural N = 29 Leaf Unit = 1.0 N* = (12) week2 11
12 Solution week2 12
13 Quartiles The 25th percentile is called the first quartile (Q 1 ). The first quartile (Q 1 ) is the median of the observations whose position in the ordered list is to the left of the location of the overall median. The 75th percentile is called the third quartile (Q 3 ). The third quartile (Q 3 ) is the median of the observations whose position in the ordered list is to the right of the location of the overall median. NOTE: The median is the second quartile Q 2. week2 13
14 Example The highway mileages of 20 cars, arranged in increasing order are: The median is The first quartile Q 1 is The third quartile Q 3 is Exercise: Find (a) the 10th percentile. (b) the 90th percentile of the above data set. week2 14
15 Measuring Spread The range (max-min) is a measure of spread but it is very sensitive to the influence of extreme values. The distance between the first and third quartiles is called the Interquartile range (IQR) i.e. IQR =Q 3 Q 1. The IQR is another measure of spread that is less sensitive to the influence of extreme values. week2 15
16 The five-number summary The five-number summary of a set of observations consists of the smallest observation, the first quartile, the median, the third quartile and the largest observation. These five numbers give a reasonably complete description of both the center and the spread of the distribution. MINITAB commands: Stat > Basic Statistics > Display Descriptive Statistics week2 16
17 Example The highway mileages of 20 cars, arranged in increasing order are: Give the five number summary. Answer We have, min = 13, first quartile = 18, median = 23, third quartile = 27, max. = 32. The MINITAB output using the above commands is as follows: Variable N Minimum Q1 Median Q3 Maximum mileage week2 17
18 Box-plot A box-plot is a graph of the five-number summary. Example: Make a box-plot for the data in the above example. Boxplot of Mileages 30 Mileages MINITAB commands: Graph > Boxplot week2 18
19 Exercise The stem-and-leaf plot for a set of 50 observations is given below: Draw a box-plot for the data. Stem-and-leaf of Fees N = 50 Leaf Unit = (28) week2 19
20 Exercise The box-plot, histogram and stem-and-leaf plot for a data set are given below. Describe the distribution. Stem-and-leaf of C2 N = 50 Leaf Unit = 1.0 (29) Frequency C C2 week2 20
21 Exercise Consider the following Minitab generated box-plots of coagulation times in seconds for samples of blood drawn from animals receiving three different diets denoted 1, 2, and 3 : 70 coagtimes State whether the following statements are true or false a) The animal that had the longest coagulation time was given diet 3. b) The greatest variability occurs with diet 2. c) Diet 1 shows evidence of right (positive) skewness but diet 2 shows evidence of left (negative) skewness. d) Approximately 25% of animals on diet 2 had coagulation times less then 63. e) The smallest upper (third) quartile is for diet 3. f) We can see that the mean for diet 1 is less than 62 seconds. 1 2 Diet week2 21 3
22 Measuring spread: Standard deviation The variance (s 2 ) of a set of n observations is The standard deviation (s) is the square root of the variance (s 2 ). i.e. It can be shown that, x 2 2 i nx s= n 1 This formula is usually quicker. x x,..., 1, 2 ( ) 2 ( ) 2 ( ) 2 ( ) 2 2 x 1 x + x 2 x + L x n s x x i = = x n 1 n 1 ( x ) 2 ( ) 2 ( ) 2 ( ) 2 1 x + x 2 x + L x n s x x i = = x n 1 n 1 x n week2 22
23 x i x The deviations display the spread of the values x i about their mean. Some of these deviations will be positive and some negative because the observations fall on each side of the mean. The sum of the deviations of the observations from their mean will always be zero. Squaring the deviations makes them all positive, so that observations far from the mean in either direction have large positive squared deviations. The variance is the average of the squared deviations. The variance, s 2, and the standard deviation, s, will be large if the observations are widely spread about their mean, and small if the observations are all close to the mean. week2 23
24 Example Find the standard deviation of the following data set: 4, 8, 2, 9, 7. Solution: n=5, Using the second formula we have week2 24
25 MINITAB commands Stat > Basic Statistics > Display Descriptive Statistics MINITAB output for the above data is given below: Variable N StDev C Exercise: Find the standard deviation of the following data set: 5, 8, 7, 9, 7, 11. week2 25
26 Properties of standard deviation (s) s measures the spread about the mean and should be used only when the mean is chosen as the measure of center. s = 0 only when there is no spread. This happens only when all observations have the same value. Otherwise, s > 0. s, like the mean, is not resistant to extreme values. A few outliers can make s very large. week2 26
27 Ballpark approximation for s The ballpark approximation for the standard deviation s is the Range/4 (divide by 3 if there are less then 10 observations, divide by 5 if there are more then 100 observations). For the data set 4, 8, 2, 9, 7, range = 9 2 = 7 and so s 7 = week2 27
28 The empirical ( ) rule With a bell shaped distribution, about 68% of the data fall within a distance of 1 standard deviation from the mean. 95% fall within 2 standard deviations of the mean. 99.7% fall within 3 standard deviations of the mean. What if the distribution is not bell-shaped? There is another rule, named Chebyshev's Rule, that tells us that there must be at least 75% of the data within 2 standard deviations of the mean, regardless of the shape, and at least 89% within 3 standard deviations. week2 28
29 Outliers An outlier is an observation that is usually large or small relative to the other values in a data set. Outliers are typically attributable to one of the following causes: 1. The observation is observed, recorded, or entered incorrectly. 2. The observation comes from a different population. 3. The observation is correct but represents a rare event. week2 29
30 The 1.5 IQR Criterion for outliers Call an observation a suspected outlier if it falls more than 1.5 IQR above the 3rd quartile or below the 1st quartile. Example Consider the data given in exercise 1.13 on page 43 in IPS (mileage data with an extra observation of 66). Variable N Mean Min Q1 Median Q3 Max Mileages The IQR = = 10 and the largest observation, 66, falls more than 1.5 IQR above Q 3 and therefore is an outlier. week2 30
31 Choosing a summary The five-number summary is usually better than the mean and the standard deviation for describing skewed distributions or distributions with strong outliers. Use mean and standard deviation for reasonably symmetric distributions that are free of outliers. week2 31
32 Questions 1. How do the mean, median, and mode compare, usually, when a distribution is positively skewed? negatively skewed? Draw a picture and try to estimate the locations of these measures. 2. Which type of display is the most useful type for clear direct comparisons of the key characteristics of several data sets (e.g. blood cholesterol changes for several different treatments)? 3. In a frequency table of 300 scores, the mean is reported as 80 and the median as 65. One would expect this distribution to be a. positively skewed. b. negatively skewed. c. symmetrical d. rectangular. week2 32
33 4. Find the median of the following frequency distribution. Score Frequency On sta220 term test, John scored at the 78th percentile, and Jack scored at the 63rd. State whether the following statements are true of false a. John is 15 times better than jack. b. John scored 15 more points than Jack. c. 15% of those taking the test got scores ranging between John's and Jack's scores. d. 62 students scored less than John. week2 33
34 6. Estimate the mean and standard deviation of the distribution represented by the following histogram. 10 Frequency Rate week2 34
Describing distributions with numbers
Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central
More informationLecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:
Lecture 2 Quantitative variables There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Stemplot (stem-and-leaf plot) Histogram Dot plot Stemplots
More informationChapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution.
Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution. 1 Histograms p53 The breakfast cereal data Study collected data on nutritional
More informationChapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution.
Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution. 1 Histograms p53 Spoiled ballots are a real threat to democracy. Below are
More informationChapter 2: Tools for Exploring Univariate Data
Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is
More informationCHAPTER 1. Introduction
CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing
More informationMeasures of center. The mean The mean of a distribution is the arithmetic average of the observations:
Measures of center The mean The mean of a distribution is the arithmetic average of the observations: x = x 1 + + x n n n = 1 x i n i=1 The median The median is the midpoint of a distribution: the number
More informationSTP 420 INTRODUCTION TO APPLIED STATISTICS NOTES
INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make
More informationUnit 2. Describing Data: Numerical
Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient
More informationADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes
We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures
More informationChapter 5: Exploring Data: Distributions Lesson Plan
Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions: Stemplots Describing Center: Mean and Median Describing Variability: The Quartiles The
More informationThe empirical ( ) rule
The empirical (68-95-99.7) rule With a bell shaped distribution, about 68% of the data fall within a distance of 1 standard deviation from the mean. 95% fall within 2 standard deviations of the mean. 99.7%
More informationStatistics I Chapter 2: Univariate data analysis
Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,
More informationChapter 1. Looking at Data
Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,
More informationIntroduction to Statistics
Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,
More informationUnit 2: Numerical Descriptive Measures
Unit 2: Numerical Descriptive Measures Summation Notation Measures of Central Tendency Measures of Dispersion Chebyshev's Rule Empirical Rule Measures of Relative Standing Box Plots z scores Jan 28 10:48
More informationare the objects described by a set of data. They may be people, animals or things.
( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms
More informationStatistics I Chapter 2: Univariate data analysis
Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,
More informationElementary Statistics
Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:
More informationCHAPTER 2: Describing Distributions with Numbers
CHAPTER 2: Describing Distributions with Numbers The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner Lecture PowerPoint Slides Chapter 2 Concepts 2 Measuring Center: Mean and Median Measuring
More informationSTAT 200 Chapter 1 Looking at Data - Distributions
STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the
More informationWhat is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected
What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types
More information2011 Pearson Education, Inc
Statistics for Business and Economics Chapter 2 Methods for Describing Sets of Data Summary of Central Tendency Measures Measure Formula Description Mean x i / n Balance Point Median ( n +1) Middle Value
More informationPerhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest:
1 Chapter 3 - Descriptive stats: Numerical measures 3.1 Measures of Location Mean Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size Example: The number
More informationCHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.
(c) Epstein 2013 Chapter 5: Exploring Data Distributions Page 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms Individuals are the objects described by a set of data. These individuals
More informationChapter 4. Displaying and Summarizing. Quantitative Data
STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range
More informationSection 3. Measures of Variation
Section 3 Measures of Variation Range Range = (maximum value) (minimum value) It is very sensitive to extreme values; therefore not as useful as other measures of variation. Sample Standard Deviation The
More information3.1 Measure of Center
3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects
More informationExample 2. Given the data below, complete the chart:
Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is
More informationChapter 3. Data Description
Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.
More informationHistograms allow a visual interpretation
Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called
More informationChapter 1: Exploring Data
Chapter 1: Exploring Data Section 1.3 with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1 Exploring Data Introduction: Data Analysis: Making Sense of Data 1.1
More informationMath 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore
Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Chapter 3 continued Describing distributions with numbers Measuring spread of data: Quartiles Definition 1: The interquartile
More informationSTOR 155 Introductory Statistics. Lecture 4: Displaying Distributions with Numbers (II)
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STOR 155 Introductory Statistics Lecture 4: Displaying Distributions with Numbers (II) 9/8/09 Lecture 4 1 Numerical Summary for Distributions Center Mean
More informationA is one of the categories into which qualitative data can be classified.
Chapter 2 Methods for Describing Sets of Data 2.1 Describing qualitative data Recall qualitative data: non-numerical or categorical data Basic definitions: A is one of the categories into which qualitative
More informationReview for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data
Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature
More informationFurther Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data
Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)
More informationMATH 1150 Chapter 2 Notation and Terminology
MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the
More informationDEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Spring 2008
DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics QM 120 Chapter 3 Spring 2008 Measures of central tendency for ungrouped data 2 Graphs are very helpful to describe
More informationDescribing Center: Mean and Median Section 5.4
Describing Center: Mean and Median Section 5.4 Look at table 5.2 at the right. We are going to make the dotplot of the city gas mileages of midsize cars. How to describe the center of a distribution: x
More informationSection 2.3: One Quantitative Variable: Measures of Spread
Section 2.3: One Quantitative Variable: Measures of Spread Objectives: 1) Measures of spread, variability a. Range b. Standard deviation i. Formula ii. Notation for samples and population 2) The 95% rule
More informationHomework Example Chapter 1 Similar to Problem #14
Chapter 1 Similar to Problem #14 Given a sample of n = 129 observations of shower-flow-rate, do this: a.) Construct a stem-and-leaf display of the data. b.) What is a typical, or representative flow rate?
More informationDescribing Distributions with Numbers
Describing Distributions with Numbers Using graphs, we could determine the center, spread, and shape of the distribution of a quantitative variable. We can also use numbers (called summary statistics)
More informationP8130: Biostatistical Methods I
P8130: Biostatistical Methods I Lecture 2: Descriptive Statistics Cody Chiuzan, PhD Department of Biostatistics Mailman School of Public Health (MSPH) Lecture 1: Recap Intro to Biostatistics Types of Data
More informationLecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3-2 Measures
More informationDescriptive Statistics
Descriptive Statistics CHAPTER OUTLINE 6-1 Numerical Summaries of Data 6- Stem-and-Leaf Diagrams 6-3 Frequency Distributions and Histograms 6-4 Box Plots 6-5 Time Sequence Plots 6-6 Probability Plots Chapter
More informationLecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)
Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Cengage Learning
More informationF78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives
F78SC2 Notes 2 RJRC Algebra It is useful to use letters to represent numbers. We can use the rules of arithmetic to manipulate the formula and just substitute in the numbers at the end. Example: 100 invested
More information1. Exploratory Data Analysis
1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be
More informationStatistics for Managers using Microsoft Excel 6 th Edition
Statistics for Managers using Microsoft Excel 6 th Edition Chapter 3 Numerical Descriptive Measures 3-1 Learning Objectives In this chapter, you learn: To describe the properties of central tendency, variation,
More informationChapter 3. Measuring data
Chapter 3 Measuring data 1 Measuring data versus presenting data We present data to help us draw meaning from it But pictures of data are subjective They re also not susceptible to rigorous inference Measuring
More informationLecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #
Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview 3-2 Measures
More informationUnits. Exploratory Data Analysis. Variables. Student Data
Units Exploratory Data Analysis Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 13th September 2005 A unit is an object that can be measured, such as
More informationChapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved
Chapter 3 Numerically Summarizing Data Section 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data
More informationLecture 1: Descriptive Statistics
Lecture 1: Descriptive Statistics MSU-STT-351-Sum 15 (P. Vellaisamy: MSU-STT-351-Sum 15) Probability & Statistics for Engineers 1 / 56 Contents 1 Introduction 2 Branches of Statistics Descriptive Statistics
More informationChapters 1 & 2 Exam Review
Problems 1-3 refer to the following five boxplots. 1.) To which of the above boxplots does the following histogram correspond? (A) A (B) B (C) C (D) D (E) E 2.) To which of the above boxplots does the
More informationResistant Measure - A statistic that is not affected very much by extreme observations.
Chapter 1.3 Lecture Notes & Examples Section 1.3 Describing Quantitative Data with Numbers (pp. 50-74) 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar)
More informationMath 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency
Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:
More informationShape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays
Histograms: Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Sep 9 1:13 PM Shape: Skewed left Bell shaped Symmetric Bi modal Symmetric Skewed
More informationDescriptive Univariate Statistics and Bivariate Correlation
ESC 100 Exploring Engineering Descriptive Univariate Statistics and Bivariate Correlation Instructor: Sudhir Khetan, Ph.D. Wednesday/Friday, October 17/19, 2012 The Central Dogma of Statistics used to
More informationChapter 5: Exploring Data: Distributions Lesson Plan
Lesson Plan Exploring Data Displaying Distributions: Histograms For All Practical Purposes Mathematical Literacy in Today s World, 7th ed. Interpreting Histograms Displaying Distributions: Stemplots Describing
More informationThe Empirical Rule, z-scores, and the Rare Event Approach
Overview The Empirical Rule, z-scores, and the Rare Event Approach Look at Chebyshev s Rule and the Empirical Rule Explore some applications of the Empirical Rule How to calculate and use z-scores Introducing
More information1.3.1 Measuring Center: The Mean
1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar) of a set of observations, add their values and divide by the number of observations. If the n observations
More informationDetermining the Spread of a Distribution
Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative
More informationReview: Central Measures
Review: Central Measures Mean, Median and Mode When do we use mean or median? If there is (are) outliers, use Median If there is no outlier, use Mean. Example: For a data 1, 1.2, 1.5, 1.7, 1.8, 1.9, 2.3,
More informationTOPIC: Descriptive Statistics Single Variable
TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency
More information1.3: Describing Quantitative Data with Numbers
1.3: Describing Quantitative Data with Numbers Section 1.3 Describing Quantitative Data with Numbers After this section, you should be able to MEASURE center with the mean and median MEASURE spread with
More informationDetermining the Spread of a Distribution
Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative
More informationChapter 3 Data Description
Chapter 3 Data Description Section 3.1: Measures of Central Tendency Section 3.2: Measures of Variation Section 3.3: Measures of Position Section 3.1: Measures of Central Tendency Definition of Average
More informationObjective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.
Chapter 3 Numerically Summarizing Data Chapter 3.1 Measures of Central Tendency Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. A1. Mean The
More information200 participants [EUR] ( =60) 200 = 30% i.e. nearly a third of the phone bills are greater than 75 EUR
Ana Jerončić 200 participants [EUR] about half (71+37=108) 200 = 54% of the bills are small, i.e. less than 30 EUR (18+28+14=60) 200 = 30% i.e. nearly a third of the phone bills are greater than 75 EUR
More informationExercises from Chapter 3, Section 1
Exercises from Chapter 3, Section 1 1. Consider the following sample consisting of 20 numbers. (a) Find the mode of the data 21 23 24 24 25 26 29 30 32 34 39 41 41 41 42 43 48 51 53 53 (b) Find the median
More informationChapter 1 - Lecture 3 Measures of Location
Chapter 1 - Lecture 3 of Location August 31st, 2009 Chapter 1 - Lecture 3 of Location General Types of measures Median Skewness Chapter 1 - Lecture 3 of Location Outline General Types of measures What
More informationAP Final Review II Exploring Data (20% 30%)
AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure
More informationSTT 315 This lecture is based on Chapter 2 of the textbook.
STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their
More information1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.
1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions
More informationChapter 2 Solutions Page 15 of 28
Chapter Solutions Page 15 of 8.50 a. The median is 55. The mean is about 105. b. The median is a more representative average" than the median here. Notice in the stem-and-leaf plot on p.3 of the text that
More informationMath 140 Introductory Statistics
Math 140 Introductory Statistics Professor Silvia Fernández Chapter 2 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Visualizing Distributions Recall the definition: The
More informationMath 140 Introductory Statistics
Visualizing Distributions Math 140 Introductory Statistics Professor Silvia Fernández Chapter Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Recall the definition: The
More informationStatistics 528: Homework 2 Solutions
Statistics 28: Homework 2 Solutions.4 There are several gaps in the data, as can be seen from the histogram. Minitab Result: Min Q Med Q3 Max 8 3278 22 2368 2624 Manual Result: Min Q Med Q3 Max 8 338 22.
More informationORGANIZATION AND DESCRIPTION OF DATA
Loss 0 40 80 120 Frequency 0 5 10 15 20 Miller and Freunds Probability and Statistics for Engineers 9th Edition Johnson SOLUTIONS MANUAL Full download at: https://testbankreal.com/download/miller-freunds-probability-statisticsengineers-9th-edition-johnson-solutions-manual/
More informationLast Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics
Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different
More informationIntroduction to Statistics
Introduction to Statistics By A.V. Vedpuriswar October 2, 2016 Introduction The word Statistics is derived from the Italian word stato, which means state. Statista refers to a person involved with the
More informationContinuous random variables
Continuous random variables A continuous random variable X takes all values in an interval of numbers. The probability distribution of X is described by a density curve. The total area under a density
More informationQUANTITATIVE DATA. UNIVARIATE DATA data for one variable
QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE
More informationRange The range is the simplest of the three measures and is defined now.
Measures of Variation EXAMPLE A testing lab wishes to test two experimental brands of outdoor paint to see how long each will last before fading. The testing lab makes 6 gallons of each paint to test.
More informationSection 2.4. Measuring Spread. How Can We Describe the Spread of Quantitative Data? Review: Central Measures
mean median mode Review: entral Measures Mean, Median and Mode When do we use mean or median? If there is (are) outliers, use Median If there is no outlier, use Mean. Example: For a data 1, 1., 1.5, 1.7,
More informationLecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)
Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Mean vs.
More informationBNG 495 Capstone Design. Descriptive Statistics
BNG 495 Capstone Design Descriptive Statistics Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential statistical methods, with a focus
More informationStat 101 Exam 1 Important Formulas and Concepts 1
1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative
More informationSTA 218: Statistics for Management
Al Nosedal. University of Toronto. Fall 2017 My momma always said: Life was like a box of chocolates. You never know what you re gonna get. Forrest Gump. Problem How much do people with a bachelor s degree
More informationØ Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.
Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number
More informationExam: practice test 1 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Exam: practice test MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Solve the problem. ) Using the information in the table on home sale prices in
More informationChapter. Numerically Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.
Chapter 3 Numerically Summarizing Data Section 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data
More informationThe Normal Distribution. Chapter 6
+ The Normal Distribution Chapter 6 + Applications of the Normal Distribution Section 6-2 + The Standard Normal Distribution and Practical Applications! We can convert any variable that in normally distributed
More informationGRAPHS AND STATISTICS Central Tendency and Dispersion Common Core Standards
B Graphs and Statistics, Lesson 2, Central Tendency and Dispersion (r. 2018) GRAPHS AND STATISTICS Central Tendency and Dispersion Common Core Standards Next Generation Standards S-ID.A.2 Use statistics
More informationWhat is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.
What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,
More information3 Lecture 3 Notes: Measures of Variation. The Boxplot. Definition of Probability
3 Lecture 3 Notes: Measures of Variation. The Boxplot. Definition of Probability 3.1 Week 1 Review Creativity is more than just being different. Anybody can plan weird; that s easy. What s hard is to be
More informationAfter completing this chapter, you should be able to:
Chapter 2 Descriptive Statistics Chapter Goals After completing this chapter, you should be able to: Compute and interpret the mean, median, and mode for a set of data Find the range, variance, standard
More informationChapter 5. Understanding and Comparing. Distributions
STAT 141 Introduction to Statistics Chapter 5 Understanding and Comparing Distributions Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 27 Boxplots How to create a boxplot? Assume
More information