Chapter 2: Descriptive Analysis and Presentation of Single-Variable Data

1 Chapter 2: Descriptive Analysis and Presentation of Single-Variable Data

[Figure: summary-statistics output. Surviving values: Median 25, Mode 20, Range 38, Minimum 7, Maximum 45, Sum 403, Count 15, Largest(1) 45. The entries for Mean, Standard Error, Standard Deviation, Sample Variance, Kurtosis, Skewness, and Smallest(1) have no recoverable values.]

2 Chapter Goals Learn how to present and describe sets of data. Learn measures of central tendency, measures of dispersion (spread), measures of position, and types of distributions. Learn how to interpret findings so that we know what the data is telling us about the sampled population.

3 2.1: Graphic Presentation of Data Use initial exploratory data-analysis techniques to produce a pictorial representation of the data. Resulting displays reveal patterns of behavior of the variable being studied. The method used is determined by the type of data and the idea to be presented. No single correct answer when constructing a graphic display.

4 Circle Graphs and Bar Graphs: Graphs that are used to summarize attribute data. Circle graphs (pie diagrams) show the amount of data that belongs to each category as a proportional part of a circle. Bar graphs show the amount of data that belongs to each category as proportionally sized rectangular areas.

5 Example: The table below lists the number of automobiles sold last week by day for a local dealership.

Day          Number Sold
Monday       15
Tuesday      23
Wednesday    35
Thursday     11
Friday       12
Saturday     42

Describe the data using a circle graph and a bar graph.

6 [Figure: circle graph, "Automobiles Sold Last Week". Monday 11%, Tuesday 17%, Wednesday 25%, Thursday 8%, Friday 9%, Saturday 30%.]
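The percentages in the circle graph can be checked directly from the sales table. The following is a minimal sketch; the names `sales` and `shares` are illustrative and not part of the original slides.

```python
# Daily automobile sales from the example table.
sales = {
    "Monday": 15,
    "Tuesday": 23,
    "Wednesday": 35,
    "Thursday": 11,
    "Friday": 12,
    "Saturday": 42,
}

total = sum(sales.values())

# Each slice of the circle graph is the category's share of the total,
# rounded here to the nearest whole percent.
shares = {day: round(100 * count / total) for day, count in sales.items()}

for day, pct in shares.items():
    print(f"{day:<10} {sales[day]:>3} {pct:>3}%")
```

Because each share is rounded separately, the rounded percentages need not sum to exactly 100.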

7 [Figure: bar graph, "Automobiles Sold Last Week", with one bar for each day, Monday through Saturday.]

8 Pareto Diagram: A bar graph with the bars arranged from the most numerous category to the least numerous category. It includes a line graph displaying the cumulative percentages and counts for the bars. Note: The Pareto diagram is often used in quality control applications. Used to identify the number and type of defects that happen within a product or service.

9 Example: The final daily inspection defect report for a cabinet manufacturer is given in the table below.

Defect     Number
Dent       5
Stain      12
Blemish    43
Chip       25
Scratch    40
Others     10

Construct a Pareto diagram for this defect report. Management has given the cabinet production line the goal of reducing their defects by 50%. What two defects should they give special attention to in working toward this goal?

10 Solution: [Figure: Pareto diagram, "Daily Defect Inspection Report". Bars in order: Blemish, Scratch, Chip, Stain, Others, Dent, with rows for Count, Percent, and Cum %.] The production line should try to eliminate blemishes and scratches. This would cut defects by more than 50%.
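The count, percent, and cumulative-percent rows of a Pareto diagram can be rebuilt from the defect report. This is a sketch only; the function name `pareto_table` is an assumption made for illustration.

```python
# Defect counts from the daily inspection report.
defects = {
    "Dent": 5,
    "Stain": 12,
    "Blemish": 43,
    "Chip": 25,
    "Scratch": 40,
    "Others": 10,
}


def pareto_table(counts):
    """Return (category, count, percent, cumulative percent) rows,
    ordered from the most numerous category to the least numerous."""
    total = sum(counts.values())
    rows = []
    cumulative = 0
    ordered = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    for name, count in ordered:
        cumulative += count
        rows.append((name, count, 100 * count / total, 100 * cumulative / total))
    return rows


rows = pareto_table(defects)
for name, count, pct, cum_pct in rows:
    print(f"{name:<8} {count:>3} {pct:6.1f} {cum_pct:6.1f}")
```

The cumulative percentage after the second row is the share of all defects accounted for by blemishes and scratches together, which is the quantity the solution relies on.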

11 Quantitative Data: One reason for constructing a graph of quantitative data is to examine the distribution - is the data compact, spread out, skewed, symmetric, etc. Distribution: The pattern of variability displayed by the data of a variable. The distribution displays the frequency of each value of the variable. Dotplot Display: Displays the data of a sample by representing each piece of data with a dot positioned along a scale. This scale can be either horizontal or vertical. The frequency of the values is represented along the other scale.

12 Example: A random sample of the lifetime (in years) of 50 home washing machines is given below. The figure below is a dotplot for the 50 lifetimes. [Figure: dotplot of the 50 washing-machine lifetimes, in years.] Notice how the data is bunched near the lower extreme and more spread out near the higher extreme.

13 Background: The stem-and-leaf display has become very popular for summarizing numerical data. It is a combination of graphing and sorting. The actual data is part of the graph. Well-suited for computers. Stem-and-Leaf Display: Pictures the data of a sample using the actual digits that make up the data values. Each numerical value is divided into two parts: the leading digit(s) become the stem, and the trailing digit(s) become the leaf. The stems are located along the main axis, and a leaf for each piece of data is located so as to display the distribution of the data.

14 Example: A city police officer, using radar, checked the speed of cars as they were traveling down the main street in town. Construct a stem-and-leaf plot for this data. Solution: All the speeds are in the 10s, 20s, 30s, 40s, and 50s. Use the first digit of each speed as the stem and the second digit as the leaf. Draw a vertical line and list the stems, in order, to the left of the line. Place each leaf on its stem: place the trailing digit on the right side of the vertical line opposite its corresponding leading digit.

15 [Figure: stem-and-leaf display, "20 Speeds".] The speeds are centered around the 30s. Note: The display could be constructed so that only five possible values (instead of ten) could fall in each stem. What would the stems look like? Would there be a difference in appearance?

16 Note: 1. It is fairly typical of many variables to display a distribution that is concentrated (mounded) about a central value and then in some manner be dispersed in both directions. (Why?) 2. A display that indicates two mounds may really be two overlapping distributions. 3. A back-to-back stem-and-leaf display makes it possible to compare two distributions graphically. 4. A side-by-side dotplot is also useful for comparing two distributions.

17 2.2: Frequency Distributions and Histograms Stem-and-leaf plots often present adequate summaries, but they can get very big, very fast. Need other techniques for summarizing data. Frequency distributions and histograms are used to summarize large data sets.

18 Frequency Distribution: A listing, often expressed in chart form, that pairs each value of a variable with its frequency. Ungrouped Frequency Distribution: Each value of x in the distribution stands alone. Grouped Frequency Distribution: Group the values into a set of classes. 1. A table that summarizes data by classes, or class intervals. 2. In a typical grouped frequency distribution, there are usually 5-12 classes of equal width. 3. The table may contain columns for class number, class interval, tally (if constructing by hand), frequency, relative frequency, cumulative relative frequency, and class mark. 4. In an ungrouped frequency distribution each class consists of a single value.

19 Guidelines for constructing a frequency distribution: 1. Each class should be of the same width. 2. Classes should be set up so that they do not overlap and so that each piece of data belongs to exactly one class. 3. For problems in the text, 5-12 classes are most desirable. The square root of n is a reasonable guideline for the number of classes if n is less than [value missing]. 4. Use a system that takes advantage of a number pattern, to guarantee accuracy. 5. If possible, an even class width is often advantageous.

20 Procedure for constructing a frequency distribution: 1. Identify the high (H) and low (L) scores. Find the range. Range = H - L. 2. Select a number of classes and a class width so that the product is a bit larger than the range. 3. Pick a starting point a little smaller than L. Count from L by the width to obtain the class boundaries. Observations that fall on class boundaries are placed into the class interval to the right. Note: 1. The class width is the difference between the upper- and lower-class boundaries. 2. There is no best choice for class widths, number of classes, and starting points.
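The procedure above can be sketched in a few lines of code. The function name `grouped_frequency`, its parameters, and the sample data are hypothetical and chosen only for illustration; the rule that an observation on a class boundary goes into the class to its right follows the procedure.

```python
def grouped_frequency(data, start, width, num_classes):
    """Build a grouped frequency distribution with equal-width classes.

    Class k covers start + k*width up to (but not including)
    start + (k+1)*width, so a value that falls on a boundary is
    counted in the class to its right.
    """
    counts = [0] * num_classes
    for x in data:
        k = int((x - start) // width)
        if not 0 <= k < num_classes:
            raise ValueError(f"{x} falls outside the chosen classes")
        counts[k] += 1

    n = len(data)
    rows = []
    cumulative = 0
    for k, f in enumerate(counts):
        lower = start + k * width
        upper = lower + width
        cumulative += f
        rows.append({
            "lower": lower,
            "upper": upper,
            "f": f,
            "rel_f": f / n,
            "cum_rel_f": cumulative / n,
            "class_mark": (lower + upper) / 2,
        })
    return rows


# Hypothetical data: L = 12, H = 40, range = 28.
# Four classes of width 10 give a product (40) a bit larger than the range.
sample = [12, 15, 20, 22, 22, 25, 30, 31, 38, 40]
table = grouped_frequency(sample, start=10, width=10, num_classes=4)
for row in table:
    print(row)
```

Note that 20, 30, and 40 each sit on a class boundary and are counted in the class that begins at that value.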

21 Example: The hemoglobin test, a blood test given to diabetics during their periodic checkups, indicates the level of control of blood sugar during the past two to three months. The data in the table below was obtained for 40 different diabetics at a university clinic that treats diabetic patients. Construct a grouped frequency distribution using the classes <4.7, <5.7, <6.7, etc. Which class has the highest frequency?

22 Solution: [Table: grouped frequency distribution with columns Class Boundaries, Frequency f, Relative Frequency, Cumulative Rel. Frequency, and Class Mark x; the entries did not survive.] The class <6.7 has the highest frequency. The frequency is 16 and the relative frequency is 0.40.

23 Histogram: A bar graph representing a frequency distribution of a quantitative variable. A histogram is made up of the following components: 1. A title, which identifies the population of interest. 2. A vertical scale, which identifies the frequencies in the various classes. 3. A horizontal scale, which identifies the variable x. Values for the class boundaries or class marks may be labeled along the x-axis. Use whichever method of labeling the axis best presents the variable. Note: 1. The relative frequency is sometimes used on the vertical scale. 2. It is possible to create a histogram based on class marks.

24 Example: Construct a histogram for the blood test results given in the previous example. Solution: [Figure: histogram with Frequency on the vertical axis and BloodTest on the horizontal axis.]

25 Example: A recent survey of Roman Catholic nuns summarized their ages in the table below. [Table: columns Age, Frequency, Class Mark; seven age classes of the form "a up to b"; the numeric entries did not survive.] Construct a histogram for this age data.

26 Solution: [Figure: histogram with Frequency on the vertical axis and Age on the horizontal axis.]

27 Terms used to describe histograms: Symmetrical: Both sides of the distribution are identical. There is a line of symmetry. Uniform (rectangular): Every value appears with equal frequency. Skewed: One tail is stretched out longer than the other. The direction of skewness is on the side of the longer tail. (Positively skewed vs. negatively skewed) J-shaped: There is no tail on the side of the class with the highest frequency. Bimodal: The two largest classes are separated by one or more classes. Often implies two populations are sampled. Normal: A symmetrical distribution is mounded about the mean and becomes sparse at the extremes.

28 Note: 1. The mode is the value that occurs with greatest frequency (discussed in Section 2.3). 2. The modal class is the class with the greatest frequency. 3. A bimodal distribution has two high-frequency classes separated by classes with lower frequencies. 4. Graphical representations of data should include a descriptive, meaningful title and proper identification of the vertical and horizontal scales.

29 2.3: Measures of Central Tendency Numerical values used to locate the middle of a set of data, or where the data is clustered. The term average is often associated with all measures of central tendency.

30 Mean: The type of average with which you are probably most familiar. The mean is the sum of all the values divided by the total number of values, n.

$$\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i = \frac{1}{n}(x_1 + x_2 + \cdots + x_n)$$

Note: 1. The population mean, µ (lowercase mu, Greek alphabet), is the mean of all x values for the entire population. 2. We usually cannot measure µ but would like to estimate its value. 3. A physical representation: the mean is the value that balances the weights on the number line.

31 Example: The data below represents the number of accidents in each of the last 8 years at a dangerous intersection. 8, 9, 3, 5, 2, 6, 4, 5 Find the mean number of accidents. Solution: $\bar{x} = \frac{1}{8}(8 + 9 + 3 + 5 + 2 + 6 + 4 + 5) = \frac{42}{8} = 5.25$ Note: If the 6 in the data above is changed to a much larger value, the mean shifts noticeably. The mean can be greatly influenced by outliers.
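As a quick check of the example, the mean can be computed with Python's standard `statistics` module. The replacement value 60 used to illustrate an outlier is hypothetical; the value used on the original slide is not recoverable.

```python
import statistics

# Accident counts from the example.
accidents = [8, 9, 3, 5, 2, 6, 4, 5]
mean_accidents = statistics.mean(accidents)

# Hypothetical outlier: replace the 6 with 60 to see how far the mean moves.
with_outlier = [60 if x == 6 else x for x in accidents]
mean_with_outlier = statistics.mean(with_outlier)

print(mean_accidents)     # mean of the original data
print(mean_with_outlier)  # mean after one value is made extreme
```

A single extreme value is enough to pull the mean well above most of the observations.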

32 Median: The value of the data that occupies the middle position when the data are ranked in order according to size. Note: 1. Denoted by "x tilde": $\tilde{x}$. 2. The population median, M (uppercase mu, Greek alphabet), is the data value in the middle position of the entire population. To find the median: 1. Rank the data. 2. Determine the depth of the median: $d(\tilde{x}) = \frac{n + 1}{2}$. 3. Determine the value of the median.

33 Example: Find the median for the set of data {4, 8, 3, 8, 2, 9, 2, 11, 3}. Solution: 1. Rank the data: 2, 2, 3, 3, 4, 8, 8, 9, 11. 2. Find the depth: $d(\tilde{x}) = (9 + 1)/2 = 5$. 3. The median is the fifth number from either end in the ranked data: $\tilde{x} = 4$.

Suppose the data set is {4, 8, 3, 8, 2, 9, 2, 11, 3, 15}. 1. Rank the data: 2, 2, 3, 3, 4, 8, 8, 9, 11, 15. 2. Find the depth: $d(\tilde{x}) = (10 + 1)/2 = 5.5$. 3. The median is halfway between the fifth and sixth observations: $\tilde{x} = (4 + 8)/2 = 6$.
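The three-step median procedure (rank, find the depth, read off the value) can be sketched as follows. The function name `median_by_depth` is illustrative.

```python
def median_by_depth(data):
    """Find the median by ranking the data and using depth (n + 1) / 2."""
    ranked = sorted(data)
    n = len(ranked)
    depth = (n + 1) / 2

    if depth.is_integer():
        # Odd n: the median is the value in position `depth` (1-based).
        return ranked[int(depth) - 1]

    # Even n: the depth ends in .5, so the median is halfway between
    # the two values on either side of that position.
    lower = ranked[int(depth) - 1]
    upper = ranked[int(depth)]
    return (lower + upper) / 2


odd_case = median_by_depth([4, 8, 3, 8, 2, 9, 2, 11, 3])
even_case = median_by_depth([4, 8, 3, 8, 2, 9, 2, 11, 3, 15])
print(odd_case, even_case)
```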

34 Mode: The mode is the value of x that occurs most frequently. Note: If two or more values in a sample are tied for the highest frequency (number of occurrences), there is no mode. Midrange: The number exactly midway between the lowest data value, L, and the highest data value, H. It is found by averaging the low and the high values.

$$\text{midrange} = \frac{L + H}{2}$$

35 Example: Consider the data set {12.7, 27.1, 35.6, 44.2, 18.0}. The midrange is

$$\text{midrange} = \frac{L + H}{2} = \frac{12.7 + 44.2}{2} = 28.45$$

Note: 1. When rounding off an answer, a common rule-of-thumb is to keep one more decimal place in the answer than was present in the original data. 2. To avoid round-off buildup, round off only the final answer, not intermediate steps.
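The mode and midrange definitions translate directly into code. This sketch follows the convention stated above that a tie for the highest frequency means there is no mode; the function names are illustrative, and non-empty data is assumed.

```python
from collections import Counter


def mode_or_none(data):
    """Return the most frequent value, or None when two or more values
    are tied for the highest frequency (the 'no mode' convention)."""
    ranked_counts = Counter(data).most_common()
    if len(ranked_counts) > 1 and ranked_counts[0][1] == ranked_counts[1][1]:
        return None
    return ranked_counts[0][0]


def midrange(data):
    """Average of the lowest value L and the highest value H."""
    return (min(data) + max(data)) / 2


example = [12.7, 27.1, 35.6, 44.2, 18.0]
print(midrange(example))
print(mode_or_none([1, 2, 2, 3]))
print(mode_or_none([4, 8, 3, 8, 2, 9, 2, 11, 3]))
```

Other texts report every tied value as a mode; the choice here simply mirrors the note on the slide.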

36 2.4: Measures of Dispersion Measures of central tendency alone cannot completely characterize a set of data. Two very different data sets may have similar measures of central tendency. Measures of dispersion are used to describe the spread, or variability, of a distribution. Common measures of dispersion: range, variance, and standard deviation.

37 Range: The difference in value between the highest-valued (H) and the lowest-valued (L) pieces of data: range = H - L. Other measures of dispersion are based on the following quantity. Deviation from the Mean: A deviation from the mean, $x - \bar{x}$, is the difference between the value of x and the mean $\bar{x}$.

38 Example: Consider the sample {12, 23, 17, 15, 18}. Find the range and each deviation from the mean. Solution: $\bar{x} = \frac{1}{5}(12 + 23 + 17 + 15 + 18) = 17$ and range = H - L = 23 - 12 = 11.

Data x    Deviation $x - \bar{x}$
12        -5
23         6
17         0
15        -2
18         1

39 Note: $\sum_{i=1}^{n} (x_i - \bar{x}) = 0$ (Always!) Mean Absolute Deviation: The mean of the absolute values of the deviations from the mean:

$$\text{Mean absolute deviation} = \frac{1}{n}\sum_{i=1}^{n} |x_i - \bar{x}|$$

For the previous example:

$$\frac{1}{n}\sum_{i=1}^{n} |x_i - \bar{x}| = \frac{1}{5}(5 + 6 + 0 + 2 + 1) = \frac{14}{5} = 2.8$$
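The deviations and the mean absolute deviation for the sample {12, 23, 17, 15, 18} can be verified with a short script. The variable names are illustrative.

```python
sample = [12, 23, 17, 15, 18]
n = len(sample)
mean = sum(sample) / n

# Deviation from the mean for each observation.
deviations = [x - mean for x in sample]

# The deviations always sum to zero, so their absolute values
# are averaged instead to measure spread.
mean_absolute_deviation = sum(abs(d) for d in deviations) / n

print(deviations)
print(sum(deviations))
print(mean_absolute_deviation)
```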

40 Sample Variance: The sample variance, $s^2$, is the mean of the squared deviations, calculated using n - 1 as the divisor:

$$s^2 = \frac{1}{n - 1}\sum (x - \bar{x})^2$$

where n is the sample size. Note: The numerator for the sample variance is called the sum of squares for x, denoted SS(x):

$$s^2 = \frac{SS(x)}{n - 1} \quad \text{where} \quad SS(x) = \sum (x - \bar{x})^2 = \sum x^2 - \frac{1}{n}\left(\sum x\right)^2$$

Standard Deviation: The standard deviation of a sample, s, is the positive square root of the variance: $s = \sqrt{s^2}$

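41 Example: Find the variance and standard deviation for the data {5, 7, 1, 3, 8}. $\bar{x} = \frac{1}{5}(5 + 7 + 1 + 3 + 8) = 4.8$

x      $x - \bar{x}$    $(x - \bar{x})^2$
5       0.2              0.04
7       2.2              4.84
1      -3.8             14.44
3      -1.8              3.24
8       3.2             10.24
Sum     0               32.80

$s^2 = \frac{1}{4}(32.8) = 8.2$ and $s = \sqrt{8.2} = 2.86$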

42 Note: 1. The shortcut formula for the sample variance:

$$s^2 = \frac{\sum x^2 - \frac{\left(\sum x\right)^2}{n}}{n - 1}$$

2. The unit of measure for the standard deviation is the same as the unit of measure for the data. The unit of measure for the variance might then be thought of as units squared.
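The definition and the shortcut formula should give the same sample variance. The sketch below checks this on the data {5, 7, 1, 3, 8}; the function names are illustrative, and the standard-library `statistics.variance` is used only as an independent cross-check.

```python
import math
import statistics


def variance_by_definition(data):
    """s^2 = sum((x - mean)^2) / (n - 1)."""
    n = len(data)
    mean = sum(data) / n
    sum_of_squares = sum((x - mean) ** 2 for x in data)
    return sum_of_squares / (n - 1)


def variance_by_shortcut(data):
    """s^2 = (sum(x^2) - (sum(x))^2 / n) / (n - 1)."""
    n = len(data)
    sum_of_squares = sum(x * x for x in data) - sum(data) ** 2 / n
    return sum_of_squares / (n - 1)


data = [5, 7, 1, 3, 8]
s2_definition = variance_by_definition(data)
s2_shortcut = variance_by_shortcut(data)
s = math.sqrt(s2_definition)

print(s2_definition, s2_shortcut, statistics.variance(data))
print(round(s, 2))
```

The shortcut avoids computing each deviation, which is why it is convenient for hand calculation; in floating-point arithmetic the two forms can differ slightly in the last digits.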

43 2.5: Mean and Standard Deviation of Frequency Distribution If the data is given in the form of a frequency distribution, we need to make a few changes to the formulas for the mean, variance, and standard deviation. Complete the extension table in order to find these summary statistics.

44 In order to calculate the mean, variance, and standard deviation for data: 1. In an ungrouped frequency distribution, use the frequency of occurrence, f, of each observation. 2. In a grouped frequency distribution, we use the frequency of occurrence associated with each class mark.

$$\bar{x} = \frac{\sum xf}{\sum f} \qquad s^2 = \frac{\sum x^2 f - \frac{\left(\sum xf\right)^2}{\sum f}}{\sum f - 1}$$
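The frequency-distribution formulas can be sketched as below. The frequency table used here is hypothetical (it is not the table from the slides), and the function name `frequency_table_stats` is illustrative. The result is cross-checked by expanding the table back into raw observations.

```python
import math
import statistics


def frequency_table_stats(table):
    """Mean, sample variance, and standard deviation from (x, f) pairs."""
    n = sum(f for _, f in table)                 # sum of f
    sum_xf = sum(x * f for x, f in table)        # sum of x*f
    sum_x2f = sum(x * x * f for x, f in table)   # sum of x^2*f
    mean = sum_xf / n
    variance = (sum_x2f - sum_xf ** 2 / n) / (n - 1)
    return mean, variance, math.sqrt(variance)


# Hypothetical ungrouped frequency distribution: value x occurs f times.
table = [(0, 2), (1, 4), (2, 3), (3, 1)]
mean, variance, std_dev = frequency_table_stats(table)

# Cross-check: expand the table into the raw observations it summarizes.
raw = [x for x, f in table for _ in range(f)]
print(mean, statistics.mean(raw))
print(variance, statistics.variance(raw))
```

For a grouped distribution the same function applies with each class mark in place of x, in which case the results are approximations to the statistics of the raw data.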

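45 Example: A survey of students in the first grade at a local school asked for the number of brothers and/or sisters for each child. The results are summarized in the table below. Find the mean, variance, and standard deviation. [Table: extension table with columns x, f, xf, and $x^2 f$, plus a Sum row; the entries did not survive.]

$\bar{x} = 93/62 = 1.5$

$s^2 = \frac{\sum x^2 f - \frac{(93)^2}{62}}{62 - 1} = 1.63$ and $s = \sqrt{1.63} = 1.28$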

46 2.6: Measures of Position Measures of position are used to describe the relative location of an observation. Quartiles and percentiles are two of the most popular measures of position. An additional measure of central tendency, the midquartile, is defined using quartiles. Quartiles are part of the 5-number summary.

47 Quartiles: Values of the variable that divide the ranked data into quarters; each set of data has three quartiles. 1. The first quartile, $Q_1$, is a number such that at most 25% of the data are smaller in value than $Q_1$ and at most 75% are larger. 2. The second quartile is the median. 3. The third quartile, $Q_3$, is a number such that at most 75% of the data are smaller in value than $Q_3$ and at most 25% are larger.

[Figure: ranked data in increasing order, divided into four segments of 25% each by the points L, $Q_1$, $Q_2$, $Q_3$, H.]

48 Percentiles: Values of the variable that divide a set of ranked data into 100 equal subsets; each set of data has 99 percentiles. The kth percentile, $P_k$, is a value such that at most k% of the data is smaller in value than $P_k$ and at most (100 - k)% of the data is larger.

[Figure: ranked data from L to H, with at most k% below $P_k$ and at most (100 - k)% above it.]

Note: 1. The 1st quartile and the 25th percentile are the same: $Q_1 = P_{25}$. 2. The median, the 2nd quartile, and the 50th percentile are all the same: $\tilde{x} = Q_2 = P_{50}$

49 Procedure for finding $P_k$ (and quartiles): 1. Rank the n observations, lowest to highest. 2. Compute A = (nk)/100. 3. If A is an integer: $d(P_k)$ = A.5 (depth), and $P_k$ is halfway between the value of the data in the Ath position and the value of the next data. If A is a fraction: $d(P_k)$ = B, the next largest integer, and $P_k$ is the value of the data in the Bth position.
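The percentile procedure above can be sketched as a small function. The name `percentile` and the sample data are hypothetical; the sketch assumes k is a whole number strictly between 0 and 100, and uses integer arithmetic to decide whether A = nk/100 is an integer.

```python
def percentile(data, k):
    """Find P_k using the depth rule: A = n*k/100.

    If A is an integer, P_k is halfway between the values in
    positions A and A + 1. Otherwise P_k is the value in position B,
    the next integer above A. Positions are 1-based.
    """
    if not (isinstance(k, int) and 0 < k < 100):
        raise ValueError("k must be an integer with 0 < k < 100")

    ranked = sorted(data)
    n = len(ranked)
    a, remainder = divmod(n * k, 100)

    if remainder == 0:
        # Depth is A.5: average the Ath and (A + 1)th ranked values.
        return (ranked[a - 1] + ranked[a]) / 2

    # A is a fraction: B = a + 1, and position B is index a (0-based).
    return ranked[a]


# Hypothetical sample of n = 20 values.
sample = list(range(1, 21))
q1 = percentile(sample, 25)   # A = 5, depth 5.5
q3 = percentile(sample, 75)   # A = 15, depth 15.5
p33 = percentile(sample, 33)  # A = 6.6, so B = 7
print(q1, q3, p33)
```

Statistical software often uses other interpolation rules for percentiles, so results from library functions may differ slightly from this textbook rule.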

50 Example: The following data represents the pH levels of a random sample of swimming pools in a California town. Find the first and third quartile, and the 35th percentile.

k = 25: (20)(25)/100 = 5, depth = 5.5, $Q_1$ = 6
k = 75: (20)(75)/100 = 15, depth = 15.5, $Q_3$ = 6.95
k = 35: (20)(35)/100 = 7, depth = 7.5, $P_{35}$ = 6.15

51 Midquartile: The numerical value midway between the first and the third quartile.

$$\text{midquartile} = \frac{Q_1 + Q_3}{2}$$

Example: Find the midquartile for the 20 pH values in the previous example:

$$\text{midquartile} = \frac{Q_1 + Q_3}{2} = \frac{6 + 6.95}{2} = 6.475$$

Note: The mean, median, midrange, and midquartile are all measures of central tendency. They are not necessarily equal. Can you think of an example when they would be the same value?

52 5-Number Summary: The 5-number summary is composed of: 1. L, the smallest value in the data set. 2. $Q_1$, the first quartile (also $P_{25}$). 3. $\tilde{x}$, the median. 4. $Q_3$, the third quartile (also $P_{75}$). 5. H, the largest value in the data set. Note: 1. The 5-number summary indicates how much the data is spread out in each quarter. 2. The interquartile range is the difference between the first and third quartiles. It is the range of the middle 50% of the data.

53 Box-and-Whisker Display: A graphic representation of the 5-number summary. The five numerical values (smallest, first quartile, median, third quartile, and largest) are located on a scale, either vertical or horizontal. The box is used to depict the middle half of the data that lies between the two quartiles. The whiskers are line segments used to depict the other half of the data. One line segment represents the quarter of the data that is smaller in value than the first quartile. The second line segment represents the quarter of the data that is larger in value than the third quartile.

54 Example: A random sample of students in a sixth grade class was selected. Their weights are given in the table below. Find the 5-number summary for this data and construct a boxplot. [Table: weight data and the resulting values of L, $Q_1$, $\tilde{x}$, $Q_3$, H; the entries did not survive.]

55 Boxplot for weight data: [Figure: boxplot, "Weights from Sixth Grade Class", with Weight on the scale and L, $Q_1$, $\tilde{x}$, $Q_3$, H marked.]

56 z-score: The position a particular value of x has relative to the mean, measured in standard deviations. The z-score is found by the formula

$$z = \frac{\text{value} - \text{mean}}{\text{st. dev.}} = \frac{x - \bar{x}}{s}$$

Note: 1. Typically, the calculated value of z is rounded to the nearest hundredth. 2. The z-score measures the number of standard deviations above/below, or away from, the mean. 3. z-scores typically range from [lower value missing] to [upper value missing]. 4. z-scores may be used to make comparisons of raw scores.

57 Example: A certain data set has mean 35.6 and standard deviation 7.1. Find the z-scores for 46 and 33. Solution:

$z = \frac{x - \bar{x}}{s} = \frac{46 - 35.6}{7.1} = 1.46$, so 46 is 1.46 standard deviations above the mean.

$z = \frac{x - \bar{x}}{s} = \frac{33 - 35.6}{7.1} = -0.37$, so 33 is 0.37 standard deviations below the mean.
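The two z-scores in this example can be reproduced with a one-line function. The name `z_score` is illustrative; rounding to the nearest hundredth follows the note on the previous slide.

```python
def z_score(x, mean, std_dev):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    return (x - mean) / std_dev


mean, std_dev = 35.6, 7.1
z_46 = round(z_score(46, mean, std_dev), 2)
z_33 = round(z_score(33, mean, std_dev), 2)
print(z_46, z_33)
```

The sign carries the direction: a positive z-score is above the mean and a negative one is below it.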

58 2.7: Interpreting and Understanding Standard Deviation Standard deviation is a measure of variability, or spread. Two rules for describing data rely on the standard deviation. Chebyshev's theorem: applies to any distribution. Empirical rule: applies to a variable that is normally distributed.

59 Chebyshev's Theorem: The proportion of any distribution that lies within k standard deviations of the mean is at least $1 - \frac{1}{k^2}$, where k is any positive number larger than 1. This theorem applies to all distributions of data. [Illustration: at least $1 - \frac{1}{k^2}$ of the data lies between $\bar{x} - ks$ and $\bar{x} + ks$.]

60 Note: 1. Chebyshev's theorem is very conservative. It holds for any distribution of data. 2. Chebyshev's theorem also applies to any population. 3. The two most common values used to describe a distribution of data are k = 2, 3. 4. The table below lists some values for k and $1 - \frac{1}{k^2}$. [Table: columns k and $1 - (1/k^2)$; the entries did not survive.]
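A table of Chebyshev lower bounds is easy to regenerate. The values of k chosen below are illustrative and are not necessarily the ones listed on the original slide.

```python
def chebyshev_bound(k):
    """Minimum proportion of any distribution within k standard
    deviations of the mean, valid for k > 1."""
    if k <= 1:
        raise ValueError("Chebyshev's theorem requires k > 1")
    return 1 - 1 / k ** 2


# Illustrative choices of k.
bounds = {k: chebyshev_bound(k) for k in (1.5, 2, 2.5, 3)}
for k, bound in bounds.items():
    print(f"k = {k:<4} at least {bound:.2%}")
```

For k = 2 and k = 3 this gives the "at least 75%" and "at least 89%" figures commonly quoted with the theorem.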

61 Example: At the close of trading, a random sample of 35 technology stocks was selected. The mean selling price was 67.75 and the standard deviation was 12.3. Use Chebyshev's theorem (with k = 2, 3) to describe the distribution. Solution: At least 75% of the observations lie within 2 standard deviations of the mean:

$(\bar{x} - 2s, \bar{x} + 2s) = (67.75 - 2(12.3), 67.75 + 2(12.3)) = (43.15, 92.35)$

At least 89% of the observations lie within 3 standard deviations of the mean:

$(\bar{x} - 3s, \bar{x} + 3s) = (67.75 - 3(12.3), 67.75 + 3(12.3)) = (30.85, 104.65)$

62 Empirical Rule: If a variable is normally distributed: 1. Approximately 68% of the observations lie within 1 standard deviation of the mean. 2. Approximately 95% of the observations lie within 2 standard deviations of the mean. 3. Approximately 99.7% of the observations lie within 3 standard deviations of the mean. Note: 1. The empirical rule is more accurate than Chebyshev's theorem since we know more about the distribution (normally distributed). 2. Also applies to populations. 3. Can be used to determine if a distribution is normally distributed.

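63 Illustration of the empirical rule: [Figure: normal curve with the horizontal axis marked at $\bar{x} - 3s$, $\bar{x} - 2s$, $\bar{x} - s$, $\bar{x}$, $\bar{x} + s$, $\bar{x} + 2s$, $\bar{x} + 3s$; the central regions are labeled 68%, 95%, and 99.7%.]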

64 Example: A random sample of plum tomatoes was selected from a local grocery store and their weights recorded. The mean weight was 6.5 ounces with a standard deviation of 0.4 ounces. If the weights are normally distributed: 1. What percentage of weights fall between 5.7 and 7.3? 2. What percentage of weights fall above 7.7? Solution:

$(\bar{x} - 2s, \bar{x} + 2s) = (6.5 - 2(0.4), 6.5 + 2(0.4)) = (5.7, 7.3)$
Approximately 95% of the weights fall between 5.7 and 7.3.

$(\bar{x} - 3s, \bar{x} + 3s) = (6.5 - 3(0.4), 6.5 + 3(0.4)) = (5.3, 7.7)$
Approximately 99.7% of the weights fall between 5.3 and 7.7.
Approximately 0.3% of the weights fall outside (5.3, 7.7).
Approximately 0.3/2 = 0.15% of the weights fall above 7.7.

65 Note: The empirical rule may be used to determine whether or not a set of data is approximately normally distributed. 1. Find the mean and standard deviation for the data. 2. Compute the actual proportion of data within 1, 2, and 3 standard deviations from the mean. 3. Compare these actual proportions with those given by the empirical rule. 4. If the proportions found are reasonably close to those of the empirical rule, then the data is approximately normally distributed.
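The four-step check above can be sketched as follows. The function name `proportions_within` and the small data set are illustrative; a sample this small is used only so the result is easy to verify by hand, and deciding what counts as "reasonably close" to 68%, 95%, and 99.7% remains a judgment call.

```python
import statistics


def proportions_within(data, ks=(1, 2, 3)):
    """Proportion of observations within k sample standard deviations
    of the sample mean, for each k in ks."""
    mean = statistics.mean(data)
    s = statistics.stdev(data)
    n = len(data)
    return {
        k: sum(1 for x in data if abs(x - mean) <= k * s) / n
        for k in ks
    }


# Small illustrative data set (mean 4.8, s about 2.86).
data = [5, 7, 1, 3, 8]
observed = proportions_within(data)

# Reference proportions from the empirical rule.
empirical = {1: 0.68, 2: 0.95, 3: 0.997}
for k in (1, 2, 3):
    print(f"within {k}s: observed {observed[k]:.3f}, empirical rule {empirical[k]:.3f}")
```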

66 Note: 1. Graphic method to test for normality: Draw a relative frequency ogive of grouped data on probability paper. a. Draw a straight line from the lower-left corner to the upper-right corner of the graph connecting the next-to-end points of the ogive. b. If the ogive lies close to this straight line, the distribution is said to be approximately normal. 2. The ogive may be used to find percentiles. a. Draw a horizontal line through the graph at k. b. At the point where the line intersects the ogive, draw a vertical line to the bottom of the graph. c. Read the value of x from the horizontal scale. d. This value of x is the kth percentile.

67 2.8: The Art of Statistical Deception Good arithmetic, bad statistics Misleading graphs Insufficient information

68 Good Arithmetic, Bad Statistics: The mean can be greatly influenced by outliers. Example: The mean salary for all NBA players is $15.5 million. Misleading graphs: 1. The frequency scale should start at zero to present a complete picture. Graphs that do not start at zero are used to save space. 2. Graphs that start at zero emphasize the size of the numbers involved. 3. Graphs that are chopped off emphasize variation.

69 This graph presents the total picture. [Figure: Sum of Delays (vertical axis) by Year (horizontal axis).]

70 This graph emphasizes the variation. [Figure: Sum of Delays (vertical axis) by Year (horizontal axis).]

71 Insufficient Information: Example: An admissions officer from a state school explains that the average tuition at a nearby private university is $13,000 and only $4500 at his school. This makes the state school look more attractive. If most students pay the full tuition, then the state school appears to be a better choice. However, if most students at the private university receive substantial financial aid, then the actual tuition cost could be considerably lower!


AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

MATH 117 Statistical Methods for Management I Chapter Three

MATH 117 Statistical Methods for Management I Chapter Three Jubail University College MATH 117 Statistical Methods for Management I Chapter Three This chapter covers the following topics: I. Measures of Center Tendency. 1. Mean for Ungrouped Data (Raw Data) 2.

More information

The science of learning from data.

The science of learning from data. STATISTICS (PART 1) The science of learning from data. Numerical facts Collection of methods for planning experiments, obtaining data and organizing, analyzing, interpreting and drawing the conclusions

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 3 Numerical Descriptive Measures 3-1 Learning Objectives In this chapter, you learn: To describe the properties of central tendency, variation,

More information

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation Chapter Four Numerical Descriptive Techniques 4.1 Numerical Descriptive Techniques Measures of Central Location Mean, Median, Mode Measures of Variability Range, Standard Deviation, Variance, Coefficient

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Spring 2008

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Spring 2008 DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics QM 120 Chapter 3 Spring 2008 Measures of central tendency for ungrouped data 2 Graphs are very helpful to describe

More information

Chapter 1. Looking at Data

Chapter 1. Looking at Data Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics CHAPTER OUTLINE 6-1 Numerical Summaries of Data 6- Stem-and-Leaf Diagrams 6-3 Frequency Distributions and Histograms 6-4 Box Plots 6-5 Time Sequence Plots 6-6 Probability Plots Chapter

More information

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature

More information

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives F78SC2 Notes 2 RJRC Algebra It is useful to use letters to represent numbers. We can use the rules of arithmetic to manipulate the formula and just substitute in the numbers at the end. Example: 100 invested

More information

Exam: practice test 1 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Exam: practice test 1 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Exam: practice test MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Solve the problem. ) Using the information in the table on home sale prices in

More information

3.1 Measures of Central Tendency: Mode, Median and Mean. Average a single number that is used to describe the entire sample or population

3.1 Measures of Central Tendency: Mode, Median and Mean. Average a single number that is used to describe the entire sample or population . Measures of Central Tendency: Mode, Median and Mean Average a single number that is used to describe the entire sample or population. Mode a. Easiest to compute, but not too stable i. Changing just one

More information

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511 Topic 2 - Descriptive Statistics STAT 511 Professor Bruce Craig Types of Information Variables classified as Categorical (qualitative) - variable classifies individual into one of several groups or categories

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

Range The range is the simplest of the three measures and is defined now.

Range The range is the simplest of the three measures and is defined now. Measures of Variation EXAMPLE A testing lab wishes to test two experimental brands of outdoor paint to see how long each will last before fading. The testing lab makes 6 gallons of each paint to test.

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

DESCRIPTIVE STATISTICS

DESCRIPTIVE STATISTICS DESCRIPTIVE STATISTICS Statistics deals with the theories and methods used in the collection, organization, interpretation and presentation of data. Data raw material used in statistical investigation

More information

Sampling, Frequency Distributions, and Graphs (12.1)

Sampling, Frequency Distributions, and Graphs (12.1) 1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which

More information

Lecture 1: Descriptive Statistics

Lecture 1: Descriptive Statistics Lecture 1: Descriptive Statistics MSU-STT-351-Sum 15 (P. Vellaisamy: MSU-STT-351-Sum 15) Probability & Statistics for Engineers 1 / 56 Contents 1 Introduction 2 Branches of Statistics Descriptive Statistics

More information

additionalmathematicsstatisticsadditi onalmathematicsstatisticsadditionalm athematicsstatisticsadditionalmathem aticsstatisticsadditionalmathematicsst

additionalmathematicsstatisticsadditi onalmathematicsstatisticsadditionalm athematicsstatisticsadditionalmathem aticsstatisticsadditionalmathematicsst additionalmathematicsstatisticsadditi onalmathematicsstatisticsadditionalm athematicsstatisticsadditionalmathem aticsstatisticsadditionalmathematicsst STATISTICS atisticsadditionalmathematicsstatistic

More information

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Spring 2015: Lembo GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Descriptive statistics concise and easily understood summary of data set characteristics

More information

Determining the Spread of a Distribution

Determining the Spread of a Distribution Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:

More information

Determining the Spread of a Distribution

Determining the Spread of a Distribution Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative

More information

Example 2. Given the data below, complete the chart:

Example 2. Given the data below, complete the chart: Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is

More information

CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS

CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS LEARNING OBJECTIVES: After studying this chapter, a student should understand: notation used in statistics; how to represent variables in a mathematical form

More information

Chapter 3 Data Description

Chapter 3 Data Description Chapter 3 Data Description Section 3.1: Measures of Central Tendency Section 3.2: Measures of Variation Section 3.3: Measures of Position Section 3.1: Measures of Central Tendency Definition of Average

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Section 1.3 with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1 Exploring Data Introduction: Data Analysis: Making Sense of Data 1.1

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved Chapter 3 Numerically Summarizing Data Section 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are

More information

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest:

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest: 1 Chapter 3 - Descriptive stats: Numerical measures 3.1 Measures of Location Mean Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size Example: The number

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

CHAPTER 2: Describing Distributions with Numbers

CHAPTER 2: Describing Distributions with Numbers CHAPTER 2: Describing Distributions with Numbers The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner Lecture PowerPoint Slides Chapter 2 Concepts 2 Measuring Center: Mean and Median Measuring

More information

Exercises from Chapter 3, Section 1

Exercises from Chapter 3, Section 1 Exercises from Chapter 3, Section 1 1. Consider the following sample consisting of 20 numbers. (a) Find the mode of the data 21 23 24 24 25 26 29 30 32 34 39 41 41 41 42 43 48 51 53 53 (b) Find the median

More information

Percentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included:

Percentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included: AP Statistics Chapter 2 Notes 2.1 Describing Location in a Distribution Percentile: The pth percentile of a distribution is the value with p percent of the observations (If your test score places you in

More information

1.3.1 Measuring Center: The Mean

1.3.1 Measuring Center: The Mean 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar) of a set of observations, add their values and divide by the number of observations. If the n observations

More information

Math 082 Final Examination Review

Math 082 Final Examination Review Math 08 Final Examination Review 1) Write the equation of the line that passes through the points (4, 6) and (0, 3). Write your answer in slope-intercept form. ) Write the equation of the line that passes

More information

Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept.

Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept. Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3- Measures of Center 3-3 Measures of Variation Section 3-1: Overview Descriptive Statistics summarize or describe the important

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information

Chapter 5: Exploring Data: Distributions Lesson Plan

Chapter 5: Exploring Data: Distributions Lesson Plan Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions: Stemplots Describing Center: Mean and Median Describing Variability: The Quartiles The

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

Mean, Median, Mode, and Range

Mean, Median, Mode, and Range Mean, Median, Mode, and Range Mean, median, and mode are measures of central tendency; they measure the center of data. Range is a measure of dispersion; it measures the spread of data. The mean of a data

More information

After completing this chapter, you should be able to:

After completing this chapter, you should be able to: Chapter 2 Descriptive Statistics Chapter Goals After completing this chapter, you should be able to: Compute and interpret the mean, median, and mode for a set of data Find the range, variance, standard

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Math 140 Introductory Statistics Professor Silvia Fernández Chapter 2 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Visualizing Distributions Recall the definition: The

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Visualizing Distributions Math 140 Introductory Statistics Professor Silvia Fernández Chapter Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Recall the definition: The

More information

Granite School District Parent Guides Utah Core State Standards for Mathematics Grades K-6

Granite School District Parent Guides Utah Core State Standards for Mathematics Grades K-6 Granite School District Parent Guides Grades K-6 GSD Parents Guide for Kindergarten The addresses Standards for Mathematical Practice and Standards for Mathematical Content. The standards stress not only

More information

MgtOp 215 Chapter 3 Dr. Ahn

MgtOp 215 Chapter 3 Dr. Ahn MgtOp 215 Chapter 3 Dr. Ahn Measures of central tendency (center, location): measures the middle point of a distribution or data; these include mean and median. Measures of dispersion (variability, spread):

More information

LC OL - Statistics. Types of Data

LC OL - Statistics. Types of Data LC OL - Statistics Types of Data Question 1 Characterise each of the following variables as numerical or categorical. In each case, list any three possible values for the variable. (i) Eye colours in a

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics By A.V. Vedpuriswar October 2, 2016 Introduction The word Statistics is derived from the Italian word stato, which means state. Statista refers to a person involved with the

More information

Statistics I Chapter 2: Univariate data analysis

Statistics I Chapter 2: Univariate data analysis Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,

More information

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures

More information

Chapter 6 Group Activity - SOLUTIONS

Chapter 6 Group Activity - SOLUTIONS Chapter 6 Group Activity - SOLUTIONS Group Activity Summarizing a Distribution 1. The following data are the number of credit hours taken by Math 105 students during a summer term. You will be analyzing

More information

Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.2 Business Applications (III)

Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.2 Business Applications (III) Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.1.1 Simple Interest 0.2 Business Applications (III) 0.2.1 Expenses Involved in Buying a Car 0.2.2 Expenses Involved

More information

Section 3. Measures of Variation

Section 3. Measures of Variation Section 3 Measures of Variation Range Range = (maximum value) (minimum value) It is very sensitive to extreme values; therefore not as useful as other measures of variation. Sample Standard Deviation The

More information

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the CHAPTER 4 VARIABILITY ANALYSES Chapter 3 introduced the mode, median, and mean as tools for summarizing the information provided in an distribution of data. Measures of central tendency are often useful

More information

Lecture 2. Descriptive Statistics: Measures of Center

Lecture 2. Descriptive Statistics: Measures of Center Lecture 2. Descriptive Statistics: Measures of Center Descriptive Statistics summarize or describe the important characteristics of a known set of data Inferential Statistics use sample data to make inferences

More information

Performance of fourth-grade students on an agility test

Performance of fourth-grade students on an agility test Starter Ch. 5 2005 #1a CW Ch. 4: Regression L1 L2 87 88 84 86 83 73 81 67 78 83 65 80 50 78 78? 93? 86? Create a scatterplot Find the equation of the regression line Predict the scores Chapter 5: Understanding

More information

Resistant Measure - A statistic that is not affected very much by extreme observations.

Resistant Measure - A statistic that is not affected very much by extreme observations. Chapter 1.3 Lecture Notes & Examples Section 1.3 Describing Quantitative Data with Numbers (pp. 50-74) 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar)

More information

Statistics I Chapter 2: Univariate data analysis

Statistics I Chapter 2: Univariate data analysis Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,

More information

The Empirical Rule, z-scores, and the Rare Event Approach

The Empirical Rule, z-scores, and the Rare Event Approach Overview The Empirical Rule, z-scores, and the Rare Event Approach Look at Chebyshev s Rule and the Empirical Rule Explore some applications of the Empirical Rule How to calculate and use z-scores Introducing

More information

3.1 Measure of Center

3.1 Measure of Center 3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects

More information

Mathematics. Thomas Whitham Sixth Form S J Cooper

Mathematics. Thomas Whitham Sixth Form S J Cooper Mathematics Handling Data Revision Notes For Year 8 Thomas Whitham Sixth Form S J Cooper. Probability of a single event. Probability of two events 3. Statistics Qualitative data 4. Statistics Time series

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Sets and Set notation. Algebra 2 Unit 8 Notes

Sets and Set notation. Algebra 2 Unit 8 Notes Sets and Set notation Section 11-2 Probability Experimental Probability experimental probability of an event: Theoretical Probability number of time the event occurs P(event) = number of trials Sample

More information

University of Jordan Fall 2009/2010 Department of Mathematics

University of Jordan Fall 2009/2010 Department of Mathematics handouts Part 1 (Chapter 1 - Chapter 5) University of Jordan Fall 009/010 Department of Mathematics Chapter 1 Introduction to Introduction; Some Basic Concepts Statistics is a science related to making

More information

Describing Distributions with Numbers

Describing Distributions with Numbers Describing Distributions with Numbers Using graphs, we could determine the center, spread, and shape of the distribution of a quantitative variable. We can also use numbers (called summary statistics)

More information

Measures of center. The mean The mean of a distribution is the arithmetic average of the observations:

Measures of center. The mean The mean of a distribution is the arithmetic average of the observations: Measures of center The mean The mean of a distribution is the arithmetic average of the observations: x = x 1 + + x n n n = 1 x i n i=1 The median The median is the midpoint of a distribution: the number

More information

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. Chapter 3 Numerically Summarizing Data Chapter 3.1 Measures of Central Tendency Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. A1. Mean The

More information

Week 1: Intro to R and EDA

Week 1: Intro to R and EDA Statistical Methods APPM 4570/5570, STAT 4000/5000 Populations and Samples 1 Week 1: Intro to R and EDA Introduction to EDA Objective: study of a characteristic (measurable quantity, random variable) for

More information

Section 3.2 Measures of Central Tendency

Section 3.2 Measures of Central Tendency Section 3.2 Measures of Central Tendency 1 of 149 Section 3.2 Objectives Determine the mean, median, and mode of a population and of a sample Determine the weighted mean of a data set and the mean of a

More information

21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS

21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS 21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS Table of Contents Functions... 2 Polynomials and Rational Functions... 3 Exponential Functions... 4 Logarithmic

More information