Statistics, continued

Size: px
Start display at page:

Download "Statistics, continued"

Transcription

1 Statistics, continued

2 Visual Displays of Data Since numbers often do not resonate with people, giving visual representations of data is often uses to make the data more meaningful. We will talk about a few ways to view data. 2

3 Histograms A histogram, or bar chart, is a common way to represent numerical data. We illustrate a histogram with weather data for high temperatures on January 1 in San Francisco. 3

4 High Temperatures on January 1 in San Francisco Year Temperature Year Temperature

5 By counting how many days had a given high temperature, we get the following chart. Temperature Number of Days Temperature Number of Days

6 Here is a histogram of the last chart. The numbers on the vertical axis are the number of days of the given temperature, and the values on the horizontal axis are the various temperatures. This was created by excel. (#$" (" '#$" '" &#$" &" %#$" %"!#$"!" 6

7 Here is the same chart with the legend put in the vertical and horizontal axes. 7

8 Each value, in this case temperature, is drawn with a vertical bar. The height of the bar represents how many times that value occurs. The values are listed on the horizonal axis in increasing value from left to right. 8

9 Let s look at some more weather data. We have the high temperatures on January 1 from 1977 to 2007 for both San Francisco and Las Cruces. We will also calculate the mean and median for both data sets. 9

10 High Temps on January 1 in San Francisco and Las Cruces Year SF LC Year SF LC

11 One interesting point of this data is the following calculation of central tendency. mean median San Francisco Las Cruces The mean and median are virtually identical for the two cities. We will now plot the data in the same way as we did earlier. 11

12 San Francisco and Las Cruces Weather Data (#$" (" '#$" '" &#$" &" )*" +," %#$" %"!#$"!" 12

13 However, the graphical representation makes the data look much different. The data for Las Cruces is spread out much more than that of San Francisco. A calculation of the middle of the data only presents part of the story. The dispersion or deviation of the data is also an important part of the data. While there are several measures of deviation, the most common one is called standard deviation. 13

14 The most basic property of standard deviation is: the larger the standard deviation, the more spread out the data. That is, the larger the deviation, the more the data is away from the middle, or the average. 14

15 The point of measuring deviation is to give a sense of how far data is from the middle, or the average. Standard deviation approximately measures the average of how far data is from the middle. This is not exactly true, but is roughly true. We will say more about the standard deviation in a little while. 15

16 Box and Whiskers Plot A box and whisker plot is another way to plot data, and it focuses attention on other aspects of the data than in a histogram. 16

17 One of the main pieces of information this plot shows is the quartiles. The idea of quartiles is to divide the data into quarters. The median breaks the data into two halves. If we break each half into halves, we will have broken the data into 4 quartiles. 17

18 The first quartile represents a point where 1/4 of the data is below and 3/4 is above. The second quartile, which is the median, represents a point where 2/4 of the data is below and 2/4 is above. The third quartile represents a point where 3/4 of the data is below and 1/4 is above. 18

19 We will illustrate constructing a box and whisker plot with the following data. Suppose your data set has the following 10 numbers: 60, 62, 64, 64, 65, 67, 70, 75, 80, 82 We first find the median; this is the average of 65 and 67, so is

20 We next find the first and third quartile. To do this we split the data 60, 62, 64, 64, 65, 67, 70, 75, 80, 82 in half: 60, 62, 64, 64, 65 67, 70, 75, 80, 82 The first quartile is the median of the small half. That is 64. The third quartile is the median of the big half. That is

21 The median is the second quartile value. So, we have: first quartile: 64 median: 66 third quartile: 75 low: 60 high: 82 We then make the following plot, marking it next to a number line starting and ending at the high and low, respectively. 21

22 The two boxes reflect the two quartiles, one goes from the first quartile to the median and the other from the median to the third quartile. Then we have the whiskers, which are lines to the extremes (high and low) of the data. The significance of the boxes is that half the data lies inside the two of them. The other half of the data is represented by the the whiskers. 22

23 For a second example, let s use the Laker salary data..32m,.43m,.77m, 1M, 1.76M, 2.17M, 2.2M, 2.7M, 4M, 4.4M, 5.6M, 13.5M, 13.7M, 19.5M. Since there are 14 data points, each half has 7 points. The median of each half has three numbers on each side. These, the first and third quartiles, are listed in red. As we saw earlier, the median is 2.45M. The high and low salaries are.23m and 19.5M 23

24 Box and Whisker plot for the Laker salary data 24

25 There are other sorts of charts to represent data. A pie chart is a commonly occurring chart. Its purpose is to visually show percentages between different items. 25

26 The Normal Distribution Let s look at the experiment of flipping a coin repeatedly. We will simulate this with a computer program. It represents flipping 100 coins and counting the number of heads. It will do this as many times as we want. 26

27 Recall the coin flipping experiment we did early in the semester. Number of Students with a given number of Heads Number of Students Number of Heads 27

28 Simulation of flipping 100 coins 1000 times 28

29 Simulation of flipping 100 coins times 29

30 Simulation of flipping 100 coins times 30

31 Simulation of flipping 100 coins times 31

32 As the number of flips gets larger and larger, the graph looks more and more regular. In fact, the shape looks more like a shape of a curve you may have seen before. 32

33 The Normal (Bell) Curve 33

34 The importance of the bell curve is that as the number of trials gets larger and larger, histograms generally look more and more like a bell curve. The particular shape of the bell curve reflects the standard deviation. 34

35 Bell Curves with Different Standard Deviation St. Dev. = 2 St. Dev. = 5 35

36 The higher the standard deviation, the wider the curve is. In terms of the normal curve, standard deviation can be interpreted as follows: 68% of all data is within 1 standard deviation of the average. 95% of all data is within 2 st. deviations of the average. 99.7% of all data is within 3 st. devs. of the average. 36

37 Sampling The following quotes were taken from Gallup.com on April 27: 37

38 The latest Gallup Poll tracking report shows that 86% of Americans say the U.S. economy is getting worse, while 44% rate the current economy as poor, and only 15% rate it as excellent or good. National Democratic voters preferences for their party s nomination remain evenly split, with the latest Gallup Poll Daily tracking results showing Barack Obama and Hillary Clinton each receiving 47% support. 38

39 The following two quotes were taken from bush.kerry.poll/index.html December 21, 2004: As for Bush, 49 percent of respondents said they approved of the job the president is doing.... The question had a margin of error of plus or minus 3 percentage points. 39

40 April 20, 2004: A broader survey of registered voters gave the president a 50 percent to 46 percent lead over Kerry in a two-man race. And among all adults, Bush led Kerry 49 percent to 46 percent, with a margin of error of plus or minus 3 percentage points. 40

41 How are these statistics measured? Also, in the first quote, what is the meaning of the statement: The question had a margin of error of plus or minus 3 percentage points. Does it mean Bush had an percentage approval rating between 49-3=46 and 49+3=52? 41

42 Polls are conducted by taking a sample of the population and asking them their opinion. The CNN poll surveyed 1003 people. The reported percentage of Bush s approval rating is the percentage of the 1003 people who approved of his performance. Since only a small fraction were surveyed, is the actual approval rating really in this range? 42

43 What is almost always missing from statements about polls is that, due to the fact that not everybody is polled, the poll lists an estimate of the actual data. In fact, statements like the CNN poll data which list a percentage and a margin of error are really probabilistic statements. 43

44 The statement As for Bush, 49 percent of respondents said they approved of the job the president is doing.... The question had a margin of error of plus or minus 3 percentage points. really means that the actual percentage of Americans who approved of the president s job performance has a certain probability of being within 3 points of 49%. 44

45 Most poll data calculate the margin of error based on having a 95% probability that the actual value is within the margin of error. Unfortunately, it is rare, if ever, that a poll lists the actual probabiliy that the true value is within the margin of error. 45

46 If it is important to be more than 95% sure of the accuracy of the results, one can guarantee this by making the margin of error in the poll larger. However, if you said that candidate A has 51% support and candidate B has 49% support, and the margin of error was plus or minus 10%, then the 2% difference between the candidates is much less than the 10% margin of error. Thus, we cannot have any feel for which candidate has the greater support. It is then necessary to have a fairly small margin of error. 46

47 We want, therefore, to be confident of the data but have a fairly small margin of error. Over the years, people working with statistics have found 95% confidence in the data a good balance between being sure of the data and not having too large of a margin of error. 47

48 It turns out that actual number of people polled, rather than the fraction of the population, is what matters to do these calculations. For example, polling 1000 people is plenty to get good results, even when polling about a national issue. 48

Mean/Average Median Mode Range

Mean/Average Median Mode Range Normal Curves Today s Goals Normal curves! Before this we need a basic review of statistical terms. I mean basic as in underlying, not easy. We will learn how to retrieve statistical data from normal curves.

More information

Section 7.1 How Likely are the Possible Values of a Statistic? The Sampling Distribution of the Proportion

Section 7.1 How Likely are the Possible Values of a Statistic? The Sampling Distribution of the Proportion Section 7.1 How Likely are the Possible Values of a Statistic? The Sampling Distribution of the Proportion CNN / USA Today / Gallup Poll September 22-24, 2008 www.poll.gallup.com 12% of Americans describe

More information

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27 Chapter 7: Statistics Describing Data Chapter 7: Statistics Describing Data 1 / 27 Categorical Data Four ways to display categorical data: 1 Frequency and Relative Frequency Table 2 Bar graph (Pareto chart)

More information

Data set B is 2, 3, 3, 3, 5, 8, 9, 9, 9, 15. a) Determine the mean of the data sets. b) Determine the median of the data sets.

Data set B is 2, 3, 3, 3, 5, 8, 9, 9, 9, 15. a) Determine the mean of the data sets. b) Determine the median of the data sets. FOUNDATIONS OF MATH 11 Ch. 5 Day 1: EXPLORING DATA VOCABULARY A measure of central tendency is a value that is representative of a set of numerical data. These values tend to lie near the middle of a set

More information

AP Stats MOCK Chapter 7 Test MC

AP Stats MOCK Chapter 7 Test MC Name: Class: Date: AP Stats MOCK Chapter 7 Test MC Multiple Choice-13 questions Identify the choice that best completes the statement or answers the question. 1. A survey conducted by Black Flag asked

More information

Understanding Inference: Confidence Intervals I. Questions about the Assignment. The Big Picture. Statistic vs. Parameter. Statistic vs.

Understanding Inference: Confidence Intervals I. Questions about the Assignment. The Big Picture. Statistic vs. Parameter. Statistic vs. Questions about the Assignment If your answer is wrong, but you show your work you can get more partial credit. Understanding Inference: Confidence Intervals I parameter versus sample statistic Uncertainty

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets

Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis

More information

Introduction to Statistics for Traffic Crash Reconstruction

Introduction to Statistics for Traffic Crash Reconstruction Introduction to Statistics for Traffic Crash Reconstruction Jeremy Daily Jackson Hole Scientific Investigations, Inc. c 2003 www.jhscientific.com Why Use and Learn Statistics? 1. We already do when ranging

More information

Chapter 1. Looking at Data

Chapter 1. Looking at Data Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,

More information

Last few slides from last time

Last few slides from last time Last few slides from last time Example 3: What is the probability that p will fall in a certain range, given p? Flip a coin 50 times. If the coin is fair (p=0.5), what is the probability of getting an

More information

Chapter. Objectives. Sampling Distributions

Chapter. Objectives. Sampling Distributions Chapter Sampling Distributions 8 Section 8.1 Distribution of the Sample Mean Objectives 1. Describe the distribution of the sample mean: samples from normal populations 2. Describe the distribution of

More information

Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table

Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table Lesson Plan Answer Questions Summary Statistics Histograms The Normal Distribution Using the Standard Normal Table 1 2. Summary Statistics Given a collection of data, one needs to find representations

More information

Men. Women. Men. Men. Women. Women

Men. Women. Men. Men. Women. Women Math 203 Topics for second exam Statistics: the science of data Chapter 5: Producing data Statistics is all about drawing conclusions about the opinions/behavior/structure of large populations based on

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things. (c) Epstein 2013 Chapter 5: Exploring Data Distributions Page 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms Individuals are the objects described by a set of data. These individuals

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

Lecture 8 Sampling Theory

Lecture 8 Sampling Theory Lecture 8 Sampling Theory Thais Paiva STA 111 - Summer 2013 Term II July 11, 2013 1 / 25 Thais Paiva STA 111 - Summer 2013 Term II Lecture 8, 07/11/2013 Lecture Plan 1 Sampling Distributions 2 Law of Large

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

STT 315 This lecture is based on Chapter 2 of the textbook.

STT 315 This lecture is based on Chapter 2 of the textbook. STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their

More information

Exam #2 Results (as percentages)

Exam #2 Results (as percentages) Oct. 30 Assignment: Read Chapter 19 Try exercises 1, 2, and 4 on p. 424 Exam #2 Results (as percentages) Mean: 71.4 Median: 73.3 Soda attitudes 2015 In a Gallup poll conducted Jul. 8 12, 2015, 1009 adult

More information

Chapter 18: Sampling Distributions

Chapter 18: Sampling Distributions Chapter 18: Sampling Distributions All random variables have probability distributions, and as statistics are random variables, they too have distributions. The random phenomenon that produces the statistics

More information

IDAHO EXTENDED CONTENT STANDARDS MATHEMATICS

IDAHO EXTENDED CONTENT STANDARDS MATHEMATICS Standard 1: Number and Operation Goal 1.1: Understand and use numbers. K.M.1.1.1A 1.M.1.1.1A Recognize symbolic Indicate recognition of expressions as numbers various # s in environments K.M.1.1.2A Demonstrate

More information

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE

More information

LC OL - Statistics. Types of Data

LC OL - Statistics. Types of Data LC OL - Statistics Types of Data Question 1 Characterise each of the following variables as numerical or categorical. In each case, list any three possible values for the variable. (i) Eye colours in a

More information

Essentials of Statistics and Probability

Essentials of Statistics and Probability May 22, 2007 Department of Statistics, NC State University dbsharma@ncsu.edu SAMSI Undergrad Workshop Overview Practical Statistical Thinking Introduction Data and Distributions Variables and Distributions

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching/ Suhasini Subba Rao Review In the previous lecture we looked at the statistics of M&Ms. This example illustrates

More information

Marquette University MATH 1700 Class 5 Copyright 2017 by D.B. Rowe

Marquette University MATH 1700 Class 5 Copyright 2017 by D.B. Rowe Class 5 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 2017 by D.B. Rowe 1 Agenda: Recap Chapter 3.2-3.3 Lecture Chapter 4.1-4.2 Review Chapter 1 3.1 (Exam

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

3.1 Measure of Center

3.1 Measure of Center 3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects

More information

Bag RED ORANGE GREEN YELLOW PURPLE Candies per Bag

Bag RED ORANGE GREEN YELLOW PURPLE Candies per Bag Skittles Project For this project our entire class when out and bought a standard 2.17 ounce bag of skittles. Before we ate them, we recorded all of our data, the amount of skittles in our bag and the

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

A is one of the categories into which qualitative data can be classified.

A is one of the categories into which qualitative data can be classified. Chapter 2 Methods for Describing Sets of Data 2.1 Describing qualitative data Recall qualitative data: non-numerical or categorical data Basic definitions: A is one of the categories into which qualitative

More information

Ch. 7: Estimates and Sample Sizes

Ch. 7: Estimates and Sample Sizes Ch. 7: Estimates and Sample Sizes Section Title Notes Pages Introduction to the Chapter 2 2 Estimating p in the Binomial Distribution 2 5 3 Estimating a Population Mean: Sigma Known 6 9 4 Estimating a

More information

Unit 4 Probability. Dr Mahmoud Alhussami

Unit 4 Probability. Dr Mahmoud Alhussami Unit 4 Probability Dr Mahmoud Alhussami Probability Probability theory developed from the study of games of chance like dice and cards. A process like flipping a coin, rolling a die or drawing a card from

More information

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?!

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?! Topic 3: Introduction to Statistics Collecting Data We collect data through observation, surveys and experiments. We can collect two different types of data: Categorical Quantitative Algebra 1 Table of

More information

Graphing Data. Example:

Graphing Data. Example: Graphing Data Bar graphs and line graphs are great for looking at data over time intervals, or showing the rise and fall of a quantity over the passage of time. Example: Auto Sales by Year Year Number

More information

NOWCASTING THE OBAMA VOTE: PROXY MODELS FOR 2012

NOWCASTING THE OBAMA VOTE: PROXY MODELS FOR 2012 JANUARY 4, 2012 NOWCASTING THE OBAMA VOTE: PROXY MODELS FOR 2012 Michael S. Lewis-Beck University of Iowa Charles Tien Hunter College, CUNY IF THE US PRESIDENTIAL ELECTION WERE HELD NOW, OBAMA WOULD WIN.

More information

GALLUP NEWS SERVICE JUNE WAVE 1

GALLUP NEWS SERVICE JUNE WAVE 1 GALLUP NEWS SERVICE JUNE WAVE 1 -- FINAL TOPLINE -- Timberline: 937008 IS: 392 Princeton Job #: 15-06-006 Jeff Jones, Lydia Saad June 2-7, 2015 Results are based on telephone interviews conducted June

More information

Chapter 5: Exploring Data: Distributions Lesson Plan

Chapter 5: Exploring Data: Distributions Lesson Plan Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions: Stemplots Describing Center: Mean and Median Describing Variability: The Quartiles The

More information

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- # Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview 3-2 Measures

More information

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc. Notes on regression analysis 1. Basics in regression analysis key concepts (actual implementation is more complicated) A. Collect data B. Plot data on graph, draw a line through the middle of the scatter

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

1. Descriptive stats methods for organizing and summarizing information

1. Descriptive stats methods for organizing and summarizing information Two basic types of statistics: 1. Descriptive stats methods for organizing and summarizing information Stats in sports are a great example Usually we use graphs, charts, and tables showing averages and

More information

Section 3.2 Measures of Central Tendency

Section 3.2 Measures of Central Tendency Section 3.2 Measures of Central Tendency 1 of 149 Section 3.2 Objectives Determine the mean, median, and mode of a population and of a sample Determine the weighted mean of a data set and the mean of a

More information

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O Halloran Sampling A. Basics 1. Ways to Describe Data Histograms Frequency Tables, etc. 2. Ways to Characterize

More information

AMS 5 NUMERICAL DESCRIPTIVE METHODS

AMS 5 NUMERICAL DESCRIPTIVE METHODS AMS 5 NUMERICAL DESCRIPTIVE METHODS Introduction A histogram provides a graphical description of the distribution of a sample of data. If we want to summarize the properties of such a distribution we can

More information

Bayesian Inference for Binomial Proportion

Bayesian Inference for Binomial Proportion 8 Bayesian Inference for Binomial Proportion Frequently there is a large population where π, a proportion of the population, has some attribute. For instance, the population could be registered voters

More information

CS 147: Computer Systems Performance Analysis

CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis Summarizing Variability and Determining Distributions CS 147: Computer Systems Performance Analysis Summarizing Variability and Determining Distributions 1

More information

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data Finding Quartiles. Use the median to divide the ordered data set into two halves.. If n is odd, do not include the median in either half. If n is even, split this data set exactly in half.. Q1 is the median

More information

Describing Distributions with Numbers

Describing Distributions with Numbers Describing Distributions with Numbers Using graphs, we could determine the center, spread, and shape of the distribution of a quantitative variable. We can also use numbers (called summary statistics)

More information

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart ST2001 2. Presenting & Summarising Data Descriptive Statistics Frequency Distribution, Histogram & Bar Chart Summary of Previous Lecture u A study often involves taking a sample from a population that

More information

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different

More information

1. Exploratory Data Analysis

1. Exploratory Data Analysis 1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be

More information

Chapter 8: Confidence Intervals

Chapter 8: Confidence Intervals Chapter 8: Confidence Intervals Introduction Suppose you are trying to determine the mean rent of a two-bedroom apartment in your town. You might look in the classified section of the newspaper, write

More information

Lab 5 for Math 17: Sampling Distributions and Applications

Lab 5 for Math 17: Sampling Distributions and Applications Lab 5 for Math 17: Sampling Distributions and Applications Recall: The distribution formed by considering the value of a statistic for every possible sample of a given size n from the population is called

More information

Are data normally normally distributed?

Are data normally normally distributed? Standard Normal Image source Are data normally normally distributed? Sample mean: 66.78 Sample standard deviation: 3.37 (66.78-1 x 3.37, 66.78 + 1 x 3.37) (66.78-2 x 3.37, 66.78 + 2 x 3.37) (66.78-3 x

More information

Collecting and Reporting Data

Collecting and Reporting Data Types of Data Data can be classified as qualitative or quantitative: Qualitative data Are observed rather than measured Include written descriptions, videos, photographs, or live observations Examples

More information

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511 Topic 2 - Descriptive Statistics STAT 511 Professor Bruce Craig Types of Information Variables classified as Categorical (qualitative) - variable classifies individual into one of several groups or categories

More information

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67 Chapter 6 The Standard Deviation as a Ruler and the Normal Model 1 /67 Homework Read Chpt 6 Complete Reading Notes Do P129 1, 3, 5, 7, 15, 17, 23, 27, 29, 31, 37, 39, 43 2 /67 Objective Students calculate

More information

What are the mean, median, and mode for the data set below? Step 1

What are the mean, median, and mode for the data set below? Step 1 Unit 11 Review Analyzing Data Name Per The mean is the average of the values. The median is the middle value(s) when the values are listed in order. The mode is the most common value(s). What are the mean,

More information

Do students sleep the recommended 8 hours a night on average?

Do students sleep the recommended 8 hours a night on average? BIEB100. Professor Rifkin. Notes on Section 2.2, lecture of 27 January 2014. Do students sleep the recommended 8 hours a night on average? We first set up our null and alternative hypotheses: H0: μ= 8

More information

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias

Announcements. Lecture 5: Probability. Dangling threads from last week: Mean vs. median. Dangling threads from last week: Sampling bias Recap Announcements Lecture 5: Statistics 101 Mine Çetinkaya-Rundel September 13, 2011 HW1 due TA hours Thursday - Sunday 4pm - 9pm at Old Chem 211A If you added the class last week please make sure to

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Problems Pages 1-4 Answers Page 5 Solutions Pages 6-11

Problems Pages 1-4 Answers Page 5 Solutions Pages 6-11 Part III Practice Problems Problems Pages 1-4 Answers Page 5 Solutions Pages 6-11 1. In estimating population mean or proportion what is the width of an interval? 2. If 25 college students out of 80 graduate

More information

Describing Data 247. Color Frequency Blue 25 Green 52 Red 41 White 36 Black 39 Grey 23

Describing Data 247. Color Frequency Blue 25 Green 52 Red 41 White 36 Black 39 Grey 23 Describing Data 247 Describing Data Once we have collected data from surveys or experiments, we need to summarize and present the data in a way that will be meaningful to the reader. We will begin with

More information

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Spring 2015: Lembo GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Descriptive statistics concise and easily understood summary of data set characteristics

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:

More information

CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS

CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS CHAPTER 8 INTRODUCTION TO STATISTICAL ANALYSIS LEARNING OBJECTIVES: After studying this chapter, a student should understand: notation used in statistics; how to represent variables in a mathematical form

More information

Probability Distributions

Probability Distributions CONDENSED LESSON 13.1 Probability Distributions In this lesson, you Sketch the graph of the probability distribution for a continuous random variable Find probabilities by finding or approximating areas

More information

What Is a Sampling Distribution? DISTINGUISH between a parameter and a statistic

What Is a Sampling Distribution? DISTINGUISH between a parameter and a statistic Section 8.1A What Is a Sampling Distribution? Learning Objectives After this section, you should be able to DISTINGUISH between a parameter and a statistic DEFINE sampling distribution DISTINGUISH between

More information

GRACEY/STATISTICS CH. 3. CHAPTER PROBLEM Do women really talk more than men? Science, Vol. 317, No. 5834). The study

GRACEY/STATISTICS CH. 3. CHAPTER PROBLEM Do women really talk more than men? Science, Vol. 317, No. 5834). The study CHAPTER PROBLEM Do women really talk more than men? A common belief is that women talk more than men. Is that belief founded in fact, or is it a myth? Do men actually talk more than women? Or do men and

More information

MATH 117 Statistical Methods for Management I Chapter Three

MATH 117 Statistical Methods for Management I Chapter Three Jubail University College MATH 117 Statistical Methods for Management I Chapter Three This chapter covers the following topics: I. Measures of Center Tendency. 1. Mean for Ungrouped Data (Raw Data) 2.

More information

Chapter 1:Descriptive statistics

Chapter 1:Descriptive statistics Slide 1.1 Chapter 1:Descriptive statistics Descriptive statistics summarises a mass of information. We may use graphical and/or numerical methods Examples of the former are the bar chart and XY chart,

More information

Probability Distributions

Probability Distributions Probability Distributions Probability This is not a math class, or an applied math class, or a statistics class; but it is a computer science course! Still, probability, which is a math-y concept underlies

More information

Descriptive Univariate Statistics and Bivariate Correlation

Descriptive Univariate Statistics and Bivariate Correlation ESC 100 Exploring Engineering Descriptive Univariate Statistics and Bivariate Correlation Instructor: Sudhir Khetan, Ph.D. Wednesday/Friday, October 17/19, 2012 The Central Dogma of Statistics used to

More information

After completing this chapter, you should be able to:

After completing this chapter, you should be able to: Chapter 2 Descriptive Statistics Chapter Goals After completing this chapter, you should be able to: Compute and interpret the mean, median, and mode for a set of data Find the range, variance, standard

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing

More information

PETERS TOWNSHIP HIGH SCHOOL

PETERS TOWNSHIP HIGH SCHOOL PETERS TOWNSHIP HIGH SCHOOL COURSE SYLLABUS: ALGEBRA 1 ACADEMIC Course Overview and Essential Skills This course is a study of the language, concepts, and techniques of Algebra that will prepare students

More information

2. Graphing Practice. Warm Up

2. Graphing Practice. Warm Up 2. Graphing Practice In this worksheet you will practice graphing and use your graphs to interpret and analyze data. The first three questions are warm-up questions; complete this section before moving

More information

Sets and Set notation. Algebra 2 Unit 8 Notes

Sets and Set notation. Algebra 2 Unit 8 Notes Sets and Set notation Section 11-2 Probability Experimental Probability experimental probability of an event: Theoretical Probability number of time the event occurs P(event) = number of trials Sample

More information

Preliminary Statistics course. Lecture 1: Descriptive Statistics

Preliminary Statistics course. Lecture 1: Descriptive Statistics Preliminary Statistics course Lecture 1: Descriptive Statistics Rory Macqueen (rm43@soas.ac.uk), September 2015 Organisational Sessions: 16-21 Sep. 10.00-13.00, V111 22-23 Sep. 15.00-18.00, V111 24 Sep.

More information

QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 FALL 2012 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS

QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 FALL 2012 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 FALL 2012 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100% Show all work, simplify as appropriate, and use good form and procedure (as in class). Box in your final

More information

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11.

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11. Chapter 4 Statistics 45 CHAPTER 4 BASIC QUALITY CONCEPTS 1.0 Continuous Distributions.0 Measures of Central Tendency 3.0 Measures of Spread or Dispersion 4.0 Histograms and Frequency Distributions 5.0

More information

Discrete Distributions

Discrete Distributions Discrete Distributions STA 281 Fall 2011 1 Introduction Previously we defined a random variable to be an experiment with numerical outcomes. Often different random variables are related in that they have

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

Statistics lecture 3. Bell-Shaped Curves and Other Shapes

Statistics lecture 3. Bell-Shaped Curves and Other Shapes Statistics lecture 3 Bell-Shaped Curves and Other Shapes Goals for lecture 3 Realize many measurements in nature follow a bell-shaped ( normal ) curve Understand and learn to compute a standardized score

More information

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above King Abdul Aziz University Faculty of Sciences Statistics Department Final Exam STAT 0 First Term 49-430 A 40 Name No ID: Section: You have 40 questions in 9 pages. You have 90 minutes to solve the exam.

More information

Point Estimation and Confidence Interval

Point Estimation and Confidence Interval Chapter 8 Point Estimation and Confidence Interval 8.1 Point estimator The purpose of point estimation is to use a function of the sample data to estimate the unknown parameter. Definition 8.1 A parameter

More information

Units. Exploratory Data Analysis. Variables. Student Data

Units. Exploratory Data Analysis. Variables. Student Data Units Exploratory Data Analysis Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 13th September 2005 A unit is an object that can be measured, such as

More information

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation Chapter Four Numerical Descriptive Techniques 4.1 Numerical Descriptive Techniques Measures of Central Location Mean, Median, Mode Measures of Variability Range, Standard Deviation, Variance, Coefficient

More information

ECLT 5810 Data Preprocessing. Prof. Wai Lam

ECLT 5810 Data Preprocessing. Prof. Wai Lam ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate

More information

MgtOp 215 Chapter 3 Dr. Ahn

MgtOp 215 Chapter 3 Dr. Ahn MgtOp 215 Chapter 3 Dr. Ahn Measures of central tendency (center, location): measures the middle point of a distribution or data; these include mean and median. Measures of dispersion (variability, spread):

More information

MEASURING THE SPREAD OF DATA: 6F

MEASURING THE SPREAD OF DATA: 6F CONTINUING WITH DESCRIPTIVE STATS 6E,6F,6G,6H,6I MEASURING THE SPREAD OF DATA: 6F othink about this example: Suppose you are at a high school football game and you sample 40 people from the student section

More information

Lecture: Sampling and Standard Error LECTURE 8 1

Lecture: Sampling and Standard Error LECTURE 8 1 Lecture: Sampling and Standard Error 6.0002 LECTURE 8 1 Announcements Relevant reading: Chapter 17 No lecture Wednesday of next week! 6.0002 LECTURE 8 2 Recall Inferential Statistics Inferential statistics:

More information

Sampling, Frequency Distributions, and Graphs (12.1)

Sampling, Frequency Distributions, and Graphs (12.1) 1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which

More information

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the CHAPTER 4 VARIABILITY ANALYSES Chapter 3 introduced the mode, median, and mean as tools for summarizing the information provided in an distribution of data. Measures of central tendency are often useful

More information

Stat 101 Exam 1 Important Formulas and Concepts 1

Stat 101 Exam 1 Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative

More information

4/19/2009. Probability Distributions. Inference. Example 1. Example 2. Parameter versus statistic. Normal Probability Distribution N

4/19/2009. Probability Distributions. Inference. Example 1. Example 2. Parameter versus statistic. Normal Probability Distribution N Probability Distributions Normal Probability Distribution N Chapter 6 Inference It was reported that the 2008 Super Bowl was watched by 97.5 million people. But how does anyone know that? They certainly

More information