Exploring Data. How to Explore Data

Size: px
Start display at page:

Download "Exploring Data. How to Explore Data"

Transcription

1 Exploring Data Statistics is the art and science of learning from data. This may include: Designing appropriate tools to collect data. Organizing data in a meaningful way. Displaying data with appropriate graphs. Summarizing data with numbers. Using data to make predictions. Data are numbers or other information in context. Individuals are the objects described by a set of data. They may be people, animals, or things. A variable is any characteristic of an individual. It can take different values for different individuals. A categorical (or qualitative) variable places an individual into one of several groups or categories. A quantitative variable takes numerical values for which it makes sense to find an average. Not every variable with a number value is quantitative! Examples: The distribution of a variable shows what values the variable takes and how often it takes each value. How to Explore Data Begin by examining each variable by itself. Then move on to study relationships among the data. Start with a graph or graphs. Then add numerical summaries. Example: The following table shows information about several popular cell phone models. Phone Operating System Screen Size (inches) Internal Storage (GB) Expandable Storage Rear Camera (megapixels) Battery Life (Talk Time) (hours) Apple iphone 6S Plus ios No Apple iphone 6s ios No Apple iphone 6 ios No 8 14 BlackBerry DTEK 50 Android Yes BlackBerry Priv Android Yes BlackBerry Leap BlackBerry Yes 8 25 LG X Skin Android Yes 8 7 LG G5 SE Android Yes LG G5 Android Yes Microsoft Lumia 650 Windows Yes 8 13 Microsoft Lumia 950 Windows Yes Microsoft Lumia 950 XL Windows Yes Samsung Galaxy Note 7 Android Yes Samsung Galaxy On 7 Pro Android Yes Samsung Galaxy S7 Edge Android Yes 12 33

2 a) Who/what are the individuals in this data set? b) What variables are measured? Identify each as categorical or quantitative. In what units were the quantitative variables measured? c) Give the distributions of the following for the data set: screen size, internal storage, and presence of expandable memory. Analyzing Categorical Data The values of a categorical variable are labels for the categories, such as male or female. The distribution of a categorical variable gives the categories and either the count or proportion of individuals who fall into each category. Proportion: The fraction of the total that possesses a certain attribute. Proportions can be expressed as fractions, decimals, or percentages. Frequency: The number (count) of individuals in each category. Relative Frequency: The proportion of individuals in each category. Often, we organize categorical data into either a frequency table or a relative frequency table. Example: The following is a frequency table showing the distribution of responses to the question, How do you eat corn on the cob? Find the relative frequency distribution. How do you eat corn on the cob? Frequency Relative Frequency In rows 22 In circles 13 Bite wherever 6 I don t eat corn on the cob 5 Cut the corn off the cobb 4 Total Categorical data is often displayed using bar graphs and pie charts.

3 Bar Graph Procedure: 1. Draw a horizontal line. Write the category names or labels below the line at regularly spaced intervals. 2. Draw a vertical line. Label the scale using either frequency or relative frequency. 3. Place a rectangular bar above each category label with the height corresponding to the frequency or relative frequency. Make sure all bars have the same width. Bars are historically drawn with gaps between them to indicate that the data is not continuous. Pie Chart Procedure: 1. Draw a circle to represent the entire data set. 2. For each category, calculate the size of the central angle of the slice : slice size= 360 relative frequency of category 3. Use a protractor to divide the circle into slices with the appropriate central angles. 4. Label the slices appropriately! An alternative to a pie chart is a segmented bar graph, where there is a single bar with segments corresponding to the different categories. Segmented bar graphs use relative frequencies on the vertical axis. The height of each segment corresponds to the relative frequency of that category. Example: Draw a well-labeled bar graph, a well-labeled segmented bar graph, and a well-labeled pie chart of the corn data from the previous example. Bar graphs are more flexible than pie charts! Bar graphs can compare any set of quantities that are measured in the same units, but pie charts can only compare all parts of a single whole. For example, you couldn t compare the percentages of sophomores, juniors, and seniors who approve of the school s parking policies with a pie chart. These are not parts of the same whole. The percentages wouldn t add to 100%. You could use a bar graph for this data, however.

4 Deceptive Graphs: Watch out for graphs in which the width changes in addition to the height. The eye responds to area, so this makes the graph misleading. This happens a lot in pictographs. Watch out for graphs where the axes don t start at zero (and/or are missing).

5 Watch out for unequally-spaced intervals. Watch out for pie charts or segmented bar graphs where the percentages don t add to 100%. Pie charts and segmented bar graphs represent all the parts of a single whole. Bar graphs, on the other hand, can represent proportions of different groups, so it is okay if the percentages on a bar graph don t add to 100%. Watch out for 3D graphs or graphs set at an angle. This distorts the data. Perception of 3D Pie Charts 20% 30% 20% 30% Cool Confusing Misleading Unreadable

6 Two-Way Tables (also called Contingency Tables) Often, we are interested in the relationship between two categorical variables. In this case, we can organize the data into a table, with the rows representing possible values of one variable and the columns representing possible values of the other. The marginal distribution of one of the categorical variables in a two-way table of counts is the distribution of values of that variable among all the individuals described by the table. (Use the totals in the margins to calculate this). Example: A sample of 200 children from the United Kingdom aged 9-17 was selected from the CensusAtSchool website. The gender of each child was recorded along with which superpower they would most like to have. Here are the results. Calculate the marginal distribution of superpower preferences. Make a graph to display the marginal distribution. Describe what you see. Gender Superpower Female Male Total Invisibility Superstrength Telepathy Fly Freeze Time Total To examine the relationships between variables, we need to calculate some well-chosen percentages from the counts in the table. A conditional distribution of a variable describes the values of that variable among individuals with one specific value of another variable. There is a separate conditional distribution for each value of the other variable. For example, the conditional distribution of superpower preferences for females would tell us what percentage of females prefers each superpower. Example: Using the data above, calculate the conditional distribution of superpower preference for each gender. (This means figure out what percentage of girls want each superpower and what percentage of boys want each superpower.)

7 To compare the conditional distributions of a categorical variable, we use side-by-side bar graphs (or comparative bar graphs). These use the same horizontal and vertical axes for the bar graphs of two of more groups. The bar charts are usually color-coded to indicate which bars correspond to each group. Relative frequency should be used for the vertical axis. If specific values of one variable tend to occur in common with specific values of the other, we say there is an association (or relationship) between the two variables. For example, if males prefer certain superpowers more than females (or vice-versa), we would say that there is an association between gender and superpower choice. Do not use the word correlation when you mean association. Correlation has a very specific meaning in statistics, which we will talk about later in the year. Example: Draw a side-by-side bar graph comparing the superpower choices of males and females. Use percentages for the vertical axis. Then draw a segmented bar graph for each gender. Describe what you see. Does there appear to be an association between gender and superpower choice?

8 Displaying Quantitative Data with Graphs One of the most common parts of a statistical problem is finding an appropriate way to display data. Quantitative data can t be displayed the same way as categorical data (bar graphs and pie charts don t work). The most common ways to display quantitative data are dotplots, stemplots, histograms, and boxplots. How to Examine the Distribution of a Quantitative Variable Describe the overall pattern of a distribution by describing its shape, center, and spread. Point out any outliers (unusually small or unusually large data values). Always put your descriptions in context! Describing Shape: How many peaks does the distribution have? Don t count minor ups and downs, only major peaks. o Unimodal: One peak. o Bimodal: Two peaks. o Multimodal: Three or more peaks. Is the distribution symmetric or skewed? o If the right and left sides of the graph are close to mirror images of each other, describe the distribution as approximately symmetric. Always use the words approximately or roughly, because in real life, distributions of data are almost never perfectly symmetric. o If the right side of the graph is much longer than the left side (tail to the right), describe the distribution as skewed to the right or skewed to positive values or positively skewed. o If the left side of the graph is much longer than the right side (tail to the left), describe the distribution as skewed to the left or skewed to negative values or negatively skewed. Describing Center: Use the median or the mean. Describing Spread: Use the range, interquartile range, or standard deviation, or say something like, The data vary from a low of to a high of.

9 Dotplots: 1. Draw a horizontal line and mark it with an appropriate measurement scale. 2. Locate each value in the data set along the measurement scale and represent it by a dot above the line. If there are two or more observations with the same value, stack the dots vertically. To compare two distributions, stack the dotplots on top of each other, using the same scales. Make sure to label the two groups being compared. Example: Here are the hair lengths of 26 AP Statistics students, in inches. Make a dotplot. Describe the distribution Here are the lengths sorted by gender. Make parallel dotplots of the hair lengths for males and females. Comment on what you see. Male: Female: Stemplots (or Stem-and-Leaf Plots): Each number in the data set is broken into two pieces a stem and a leaf. The stem is the first part of the number and consists of the beginning digits. The leaf is the last part of the number and consists of the final digit(s). 1. Choose stems (one or more of the leading digits) that divide the data into a reasonable number of groups (at least 5, but not too many). List possible stem values (not just those that actually appear in the data set don t skip stems) in a vertical column. Draw a vertical line to the right of the stems. 2. The next digit(s) after the stem become(s) the leaf. List the leaf for every observation to the right of the corresponding stem. 3. Include a key explaining what the stems and leaves represent, e.g., 2 5 represents 2.5 seconds

10 It is common to round and/or truncate (leave off) the remaining digits. For example, in a stemplot of annual salary, we might represent $35,360 as 35 3, 35 4, or as 3 5, depending on our data set. If necessary, consider using split stems. Write each stem more than once, and assign the lower group of leaves to the first stem and the higher group of leaves to the next. For example, put the leaves 0-4 with the first stem and the leaves 5-9 with the second. If you do this, be sure that each stem is assigned an equal number of possible leaf digits (two stems, with five possible leaves each; or five stems, with two possible leaves each). To compare two groups, make a back-to-back stemplot. Use the same set of stems and write the leaves for one group to the right and for the other group to the left. Be sure to label each side to indicate which group is being represented. Example: The data below shows the number of pairs of shoes owned for male and female AP Statistics students. Make a back-to-back stemplot of the data using split stems. Comment on the main differences between the two data sets. Female Male

11 Histograms: 1. Divide the range of the data into intervals of equal width. The intervals are called bins. The low value in each bin is included in the bin, but the high value is not. For example, the bins might be 0 to < 3, 3 to < 6, 6 to < 9, etc. If the data are discrete (the observations take only whole number values) and are tightly packed, the bins are usually centered at the integer values with a width of one unit, so the rectangle for 1 is centered at 1 (0.5 to < 1.5), the rectangle for 2 is centered at 2 (1.5 to < 2.5), etc. There are no set-in-stone rules for how many bins to use (5 to 10 is a common number), but it may be a good idea to see what the graph looks like with different width bins. It can change quite a bit! 2. Find the count (frequency) or percent (relative frequency) of individuals in each class. 3. Label and scale your axes. Mark the boundaries of each interval on a horizontal axis and use either frequency or relative frequency on the vertical axis. 4. Draw a rectangle for each interval. The edges of the rectangle are at the interval boundaries, and the height of the rectangle represents the frequency or relative frequency of the individuals in each interval. Histograms and bar graphs are different! o Bar graphs are used for categorical data. Histograms are used for quantitative data. o With histograms, you have one variable, and you sort the values of the variable by placing them into bins. The histogram shows how many measurements are in each bin. The rectangles can t be rearranged. o With bar graphs, the bars represent different categories. The height compares some characteristic of each category. The bars can be rearranged. o The bars in bar graphs are generally unconnected. The bars in histograms are connected. Example: The following data gives the average points scored per game (PTSG) for the 30 NBA teams in the regular season. Draw two relative frequency histograms using different bin widths. Describe the distribution

12 Describing Quantitative Data with Numbers Population: The entire collection of individuals or objects that you want to learn about. Sample: A part of the population that is selected for study. Resistant Measure: A measure that is not influenced very much by strong skewness or extreme values. Measures of Center: The most common measures of center are the mean and the median. Mean: The sum of the values divided by the number of observations. x1+ x xn xi If the n observations in a sample are x1, x2,..., x n, the mean is x= =. n n The mean can be thought of as the average value, the fair share value, or the balance point of a distribution. The mean is not a resistant measure. It is very sensitive to outliers and skewness. The mean of a sample is abbreviated x (pronounced x-bar ) and the mean of a population is abbreviated μ x (the Greek letter mu, pronounced myoo ). They are both calculated the same way. The distinction will be important later in the year. If the problem doesn t specify whether the data represent a population or a sample, assume you are dealing with a sample and use x. Median (M): The midpoint of a distribution. Half of the observations are smaller than the median and half of the values are larger than the median. To find the median: 1. Put the n observations in order from smallest to largest. 2. If the number of observations, n, is odd, the median is the middle observation of the ordered list. 3. If the number of observations, n, is even, the median is the average (mean) of the two middle observations in the ordered list. The median can be thought of as the typical value of a variable. The median is a resistant measure. It is not changed greatly by strong skewness or outliers. Comparing the Mean and the Median: The mean and median of a roughly symmetric distribution are close together. If the distribution is exactly symmetric, they are equal. However, outliers and other extreme values drag the mean toward them without having much effect on the median. As a result, in skewed distributions, the mean will be further out in the long tail than is the median.

13 Example: Here are the amounts of fat (in grams) in McDonald s beef sandwiches. Make a stemplot of the distribution and comment on its shape. Then calculate the mean and the median amount of fat. Sandwich Fat (g) Sandwich Fat (g) Hamburger 9 Big N Tasty 24 Cheeseburger 12 Big N Tasty with Cheese 28 Double Cheeseburger 23 McRib 26 McDouble 19 Mac Snack Wrap 19 Quarter Pounder 19 Angus Bacon & Cheese 39 Quarter Pounder with Cheese 26 Angus Deluxe 39 Double Quarter Pounder with Cheese 42 Angus Mushroom & Swiss 40 Big Mac 29 Example: Forty students were enrolled in a statistical reasoning course at a California college. The instructor made course materials, grades, and lecture notes available to students on a class web site, and course management software kept track of how often each student accessed any of these web pages. One month after the course began, the instructor requested a report of how many times each student had accessed a class web page. The 40 observations are below. Wasn t it nice of me to put them in order? (not a typo) Here is a dotplot of the data. Describe the distribution. Based on the graph, do you expect the mean or the median to be higher? Calculate the mean and the median to see if you were right. Which measure would be the best choice to describe center in this situation? Number of Visits to Class Website

14 Measures of Spread: Numbers that describe how spread out the data are. The most common are the range, the interquartile range, and the standard deviation. Range: The difference between the maximum and minimum values. Interquartile Range (IQR): First, calculate the quartiles: 1. Arrange the data in increasing order and locate the median, M. (The median is sometimes called the second quartile, or Q2). 2. The first quartile (Q1) is the median of all the observations lower than the median. 3. The third quartile (Q3) is the median of all the observations higher than the median. The interquartile range is calculated as follows: IQR = Q3 Q1 The IQR is the range of the middle 50% of the data. The range and interquartile range are numbers! Don t say The range is 5 to 30. In that case, the range would be IQR Rule for Outliers: Any observation that falls more than 1.5 IQR above the third quartile or below the first quartile. Always check for outliers and examine them closely! They may be errors, or they may tell you something important about your data that you need to pay attention to. Don t ignore them. Boxplots (or Box and Whisker Plots): 1. Find the Five-Number Summary: Minimum Q1 M Q3 Maximum 2. Check for outliers. You must always show this step. Calculate the IQR. Find Q1 ( 1.5 IQR) and Q3+ ( 1.5 IQR). If you have any data points outside these thresholds, they are outliers. 3. Draw the boxplot: Draw a central box from Q1 to Q3. Draw a vertical line in the box to mark the median. Draw the whiskers : lines extending from the box out to the smallest and largest observations that are not outliers. Mark outliers with dots in the appropriate locations. Each section of a boxplot contains 25% of the data. The lower quartile is higher than 25% of the data. The median (or second quartile) is higher than 50% of the data. The upper quartile is higher than 75% of the data. Boxplots are useful for comparing distributions, but you have to be careful with them. They sometimes mask important features of a distribution. For instance, you can t tell from a boxplot whether the distribution is unimodal or bimodal.

15 Example: The data below shows the number of text messages sent by a random sample of students in a day. Draw parallel boxplots of the number of texts sent for male and female students. You must show how you determined whether there are outliers. Compare the distributions. What conclusions can you draw about the texting habits of males and females? Male Female Standard Deviation: The most common measure of spread is the standard deviation. It measures the average or typical distance of the observations from the mean. The formula for standard deviation is slightly different depending on whether you have all the data for the entire population or are dealing with a sample from the population. For a Sample: If the n observations in a sample are x1, x2,..., x n, and the mean is x, the standard deviation is given by: s The sample standard deviation is abbreviated x. variance, abbreviated s 2 x. x ( x x ) + ( x x ) + + ( x x ) ( x x ) n i = = n 1 n 1 s The square of the standard deviation is called the

16 For the Population: The standard deviation of a population of size N with mean μ and observations x1, x2,..., x n is given by: ( x μ) + ( x μ) + + ( x μ) ( x μ) n i σ x = = N N The population standard deviation is abbreviated σ x (the Greek letter sigma). The population variance is abbreviated σ 2 x. The reason that we divide by n 1 in a sample is complicated. We ll discuss it later in the year. Always use s x rather than σ x unless you know that the data represent the entire population, which is rare! Calculating the standard deviation by hand: 1. Calculate the mean, x. 2. Find the distance of each observation from the mean (the deviations). 3. Square each of these distances to eliminate negative numbers. 4. Average the squared distances by adding them together and dividing by n 1. This gives the variance, s 2. x 5. Take the square root of the variance to get the standard deviation, s x. 6. Interpret your result. The standard deviation is the average or typical distance of the observations from the mean. Example: The table below shows the sugar content in several types of candy bar. Find the mean and standard deviation of the data. Interpret your result in context. Candy Bar Sugar (grams) Hershey s Milk Chocolate 31 Krackel 24 Kit Kat 22 York Peppermint Pattie 25 Reese s Peanut Butter Cups 25 Snickers 30 Milky Way 35 Twix 27 3 Musketeers 40 Baby Ruth 33 Nestle Crunch 24 Butterfinger 29 Mounds 21 Almond Joy 20 Mr. Goodbar 22 Payday 21 Heath 23

17 Example: Each of these distributions has a mean of 5. Rank the standard deviations from lowest to highest Properties of the Standard Deviation The standard deviation measures the spread about the mean and should only be used when the mean is chosen as the measure of center. The standard deviation is always greater than or equal to zero. If there is no variability (all observations have the same value), the standard deviation is zero. Otherwise, it is greater than zero. The standard deviation has the same units of measurement as the original observations. This is one reason we often prefer the standard deviation over the variance. The standard deviation is not a resistant measure. A few outliers can change its value dramatically. Choosing Measures of Center and Spread: Use the median and IQR for describing a skewed distribution or a distribution with strong outliers. Use the mean and standard deviation for describing reasonably symmetric distributions without outliers. ALWAYS GRAPH YOUR DATA! Numerical measures of center and spread report specific facts about a distribution, but don t give information about its entire shape. You may miss something important if you don t graph the data.

Exploring Data. How to Explore Data

Exploring Data. How to Explore Data Eploring Data Statistics is the art and science of learning from data. This may include: Designing appropriate tools to collect data. Organizing data in a meaningful way. o Displaying data with appropriate

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Chapter 4. Displaying and Summarizing. Quantitative Data

Chapter 4. Displaying and Summarizing. Quantitative Data STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range

More information

Resistant Measure - A statistic that is not affected very much by extreme observations.

Resistant Measure - A statistic that is not affected very much by extreme observations. Chapter 1.3 Lecture Notes & Examples Section 1.3 Describing Quantitative Data with Numbers (pp. 50-74) 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar)

More information

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?!

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?! Topic 3: Introduction to Statistics Collecting Data We collect data through observation, surveys and experiments. We can collect two different types of data: Categorical Quantitative Algebra 1 Table of

More information

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things. (c) Epstein 2013 Chapter 5: Exploring Data Distributions Page 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms Individuals are the objects described by a set of data. These individuals

More information

1.3.1 Measuring Center: The Mean

1.3.1 Measuring Center: The Mean 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar) of a set of observations, add their values and divide by the number of observations. If the n observations

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Section 1.2 with Graphs The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1 Exploring Data Introduction: Data Analysis: Making Sense of Data 1.1

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Histograms allow a visual interpretation

Histograms allow a visual interpretation Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called

More information

Chapter 1. Looking at Data

Chapter 1. Looking at Data Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Section 1.3 with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1 Exploring Data Introduction: Data Analysis: Making Sense of Data 1.1

More information

CHAPTER 1 Exploring Data

CHAPTER 1 Exploring Data CHAPTER 1 Exploring Data 1.2 Displaying Quantitative Data with Graphs The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Displaying Quantitative Data

More information

STT 315 This lecture is based on Chapter 2 of the textbook.

STT 315 This lecture is based on Chapter 2 of the textbook. STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

Stat 101 Exam 1 Important Formulas and Concepts 1

Stat 101 Exam 1 Important Formulas and Concepts 1 1 Chapter 1 1.1 Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative

More information

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- # Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview 3-2 Measures

More information

MATH 2560 C F03 Elementary Statistics I Lecture 1: Displaying Distributions with Graphs. Outline.

MATH 2560 C F03 Elementary Statistics I Lecture 1: Displaying Distributions with Graphs. Outline. MATH 2560 C F03 Elementary Statistics I Lecture 1: Displaying Distributions with Graphs. Outline. data; variables: categorical & quantitative; distributions; bar graphs & pie charts: What Is Statistics?

More information

Units. Exploratory Data Analysis. Variables. Student Data

Units. Exploratory Data Analysis. Variables. Student Data Units Exploratory Data Analysis Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 13th September 2005 A unit is an object that can be measured, such as

More information

Chapter2 Description of samples and populations. 2.1 Introduction.

Chapter2 Description of samples and populations. 2.1 Introduction. Chapter2 Description of samples and populations. 2.1 Introduction. Statistics=science of analyzing data. Information collected (data) is gathered in terms of variables (characteristics of a subject that

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

CHAPTER 2: Describing Distributions with Numbers

CHAPTER 2: Describing Distributions with Numbers CHAPTER 2: Describing Distributions with Numbers The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner Lecture PowerPoint Slides Chapter 2 Concepts 2 Measuring Center: Mean and Median Measuring

More information

Chapter 3. Data Description

Chapter 3. Data Description Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Graphical Summaries Consider the following data x: 78, 24, 57, 39, 28, 30, 29, 18, 102, 34, 52, 54, 57, 82, 90, 94, 38, 59, 27, 68, 61, 39, 81, 43, 90, 40, 39, 33, 42, 15, 88, 94, 50, 66, 75, 79, 83, 34,31,36,

More information

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives F78SC2 Notes 2 RJRC Algebra It is useful to use letters to represent numbers. We can use the rules of arithmetic to manipulate the formula and just substitute in the numbers at the end. Example: 100 invested

More information

8/4/2009. Describing Data with Graphs

8/4/2009. Describing Data with Graphs Describing Data with Graphs 1 A variable is a characteristic that changes or varies over time and/or for different individuals or objects under consideration. Examples: Hair color, white blood cell count,

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics CHAPTER OUTLINE 6-1 Numerical Summaries of Data 6- Stem-and-Leaf Diagrams 6-3 Frequency Distributions and Histograms 6-4 Box Plots 6-5 Time Sequence Plots 6-6 Probability Plots Chapter

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Lecture 2 Quantitative variables There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Stemplot (stem-and-leaf plot) Histogram Dot plot Stemplots

More information

3.1 Measure of Center

3.1 Measure of Center 3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects

More information

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Cengage Learning

More information

Chapter 2: Descriptive Analysis and Presentation of Single- Variable Data

Chapter 2: Descriptive Analysis and Presentation of Single- Variable Data Chapter 2: Descriptive Analysis and Presentation of Single- Variable Data Mean 26.86667 Standard Error 2.816392 Median 25 Mode 20 Standard Deviation 10.90784 Sample Variance 118.981 Kurtosis -0.61717 Skewness

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Math 140 Introductory Statistics Professor Silvia Fernández Chapter 2 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Visualizing Distributions Recall the definition: The

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Visualizing Distributions Math 140 Introductory Statistics Professor Silvia Fernández Chapter Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Recall the definition: The

More information

Performance of fourth-grade students on an agility test

Performance of fourth-grade students on an agility test Starter Ch. 5 2005 #1a CW Ch. 4: Regression L1 L2 87 88 84 86 83 73 81 67 78 83 65 80 50 78 78? 93? 86? Create a scatterplot Find the equation of the regression line Predict the scores Chapter 5: Understanding

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing

More information

1. Exploratory Data Analysis

1. Exploratory Data Analysis 1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be

More information

Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Mean vs.

More information

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Histograms: Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Sep 9 1:13 PM Shape: Skewed left Bell shaped Symmetric Bi modal Symmetric Skewed

More information

Lecture 1 : Basic Statistical Measures

Lecture 1 : Basic Statistical Measures Lecture 1 : Basic Statistical Measures Jonathan Marchini October 11, 2004 In this lecture we will learn about different types of data encountered in practice different ways of plotting data to explore

More information

A graph for a quantitative variable that divides a distribution into 25% segments.

A graph for a quantitative variable that divides a distribution into 25% segments. STATISTICS Unit 2 STUDY GUIDE Topics 6-10 Part 1: Vocabulary For each word, be sure you know the definition, the formula, or what the graph looks like. Name Block A. association M. mean absolute deviation

More information

CHAPTER 1 Exploring Data

CHAPTER 1 Exploring Data CHAPTER 1 Exploring Data 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers 1.3 Reading Quiz True or false?

More information

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Chapter 3 continued Describing distributions with numbers Measuring spread of data: Quartiles Definition 1: The interquartile

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

Descriptive Statistics Solutions COR1-GB.1305 Statistics and Data Analysis

Descriptive Statistics Solutions COR1-GB.1305 Statistics and Data Analysis Descriptive Statistics Solutions COR-GB.0 Statistics and Data Analysis Types of Data. The class survey asked each respondent to report the following information: gender; birth date; GMAT score; undergraduate

More information

3.1 Measures of Central Tendency: Mode, Median and Mean. Average a single number that is used to describe the entire sample or population

3.1 Measures of Central Tendency: Mode, Median and Mean. Average a single number that is used to describe the entire sample or population . Measures of Central Tendency: Mode, Median and Mean Average a single number that is used to describe the entire sample or population. Mode a. Easiest to compute, but not too stable i. Changing just one

More information

Describing Distributions with Numbers

Describing Distributions with Numbers Describing Distributions with Numbers Using graphs, we could determine the center, spread, and shape of the distribution of a quantitative variable. We can also use numbers (called summary statistics)

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Lecture 1: Descriptive Statistics

Lecture 1: Descriptive Statistics Lecture 1: Descriptive Statistics MSU-STT-351-Sum 15 (P. Vellaisamy: MSU-STT-351-Sum 15) Probability & Statistics for Engineers 1 / 56 Contents 1 Introduction 2 Branches of Statistics Descriptive Statistics

More information

Lecture Notes 2: Variables and graphics

Lecture Notes 2: Variables and graphics Highlights: Lecture Notes 2: Variables and graphics Quantitative vs. qualitative variables Continuous vs. discrete and ordinal vs. nominal variables Frequency distributions Pie charts Bar charts Histograms

More information

Lecture 1: Description of Data. Readings: Sections 1.2,

Lecture 1: Description of Data. Readings: Sections 1.2, Lecture 1: Description of Data Readings: Sections 1.,.1-.3 1 Variable Example 1 a. Write two complete and grammatically correct sentences, explaining your primary reason for taking this course and then

More information

Example 2. Given the data below, complete the chart:

Example 2. Given the data below, complete the chart: Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3-2 Measures

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

Determining the Spread of a Distribution

Determining the Spread of a Distribution Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative

More information

Determining the Spread of a Distribution

Determining the Spread of a Distribution Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative

More information

Chapters 1 & 2 Exam Review

Chapters 1 & 2 Exam Review Problems 1-3 refer to the following five boxplots. 1.) To which of the above boxplots does the following histogram correspond? (A) A (B) B (C) C (D) D (E) E 2.) To which of the above boxplots does the

More information

M 140 Test 1 B Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

M 140 Test 1 B Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75 M 140 est 1 B Name (1 point) SHOW YOUR WORK FOR FULL CREDI! Problem Max. Points Your Points 1-10 10 11 10 12 3 13 4 14 18 15 8 16 7 17 14 otal 75 Multiple choice questions (1 point each) For questions

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 3 Numerical Descriptive Measures 3-1 Learning Objectives In this chapter, you learn: To describe the properties of central tendency, variation,

More information

Chapter 4.notebook. August 30, 2017

Chapter 4.notebook. August 30, 2017 Sep 1 7:53 AM Sep 1 8:21 AM Sep 1 8:21 AM 1 Sep 1 8:23 AM Sep 1 8:23 AM Sep 1 8:23 AM SOCS When describing a distribution, make sure to always tell about three things: shape, outliers, center, and spread

More information

1.3: Describing Quantitative Data with Numbers

1.3: Describing Quantitative Data with Numbers 1.3: Describing Quantitative Data with Numbers Section 1.3 Describing Quantitative Data with Numbers After this section, you should be able to MEASURE center with the mean and median MEASURE spread with

More information

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data

Review for Exam #1. Chapter 1. The Nature of Data. Definitions. Population. Sample. Quantitative data. Qualitative (attribute) data Review for Exam #1 1 Chapter 1 Population the complete collection of elements (scores, people, measurements, etc.) to be studied Sample a subcollection of elements drawn from a population 11 The Nature

More information

Percentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included:

Percentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included: AP Statistics Chapter 2 Notes 2.1 Describing Location in a Distribution Percentile: The pth percentile of a distribution is the value with p percent of the observations (If your test score places you in

More information

A is one of the categories into which qualitative data can be classified.

A is one of the categories into which qualitative data can be classified. Chapter 2 Methods for Describing Sets of Data 2.1 Describing qualitative data Recall qualitative data: non-numerical or categorical data Basic definitions: A is one of the categories into which qualitative

More information

Descriptive Statistics Methods of organizing and summarizing any data/information.

Descriptive Statistics Methods of organizing and summarizing any data/information. Introductory Statistics, 10 th ed. by Neil A. Weiss Chapter 1 The Nature of Statistics 1.1 Statistics Basics There are lies, damn lies, and statistics - Mark Twain Descriptive Statistics Methods of organizing

More information

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511 Topic 2 - Descriptive Statistics STAT 511 Professor Bruce Craig Types of Information Variables classified as Categorical (qualitative) - variable classifies individual into one of several groups or categories

More information

Chapter 5: Exploring Data: Distributions Lesson Plan

Chapter 5: Exploring Data: Distributions Lesson Plan Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions: Stemplots Describing Center: Mean and Median Describing Variability: The Quartiles The

More information

Description of Samples and Populations

Description of Samples and Populations Description of Samples and Populations Random Variables Data are generated by some underlying random process or phenomenon. Any datum (data point) represents the outcome of a random variable. We represent

More information

A C E. Answers Investigation 4. Applications

A C E. Answers Investigation 4. Applications Answers Applications 1. 1 student 2. You can use the histogram with 5-minute intervals to determine the number of students that spend at least 15 minutes traveling to school. To find the number of students,

More information

Statistics and parameters

Statistics and parameters Statistics and parameters Tables, histograms and other charts are used to summarize large amounts of data. Often, an even more extreme summary is desirable. Statistics and parameters are numbers that characterize

More information

Math 082 Final Examination Review

Math 082 Final Examination Review Math 08 Final Examination Review 1) Write the equation of the line that passes through the points (4, 6) and (0, 3). Write your answer in slope-intercept form. ) Write the equation of the line that passes

More information

Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution.

Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution. Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution. 1 Histograms p53 The breakfast cereal data Study collected data on nutritional

More information

Comparing Measures of Central Tendency *

Comparing Measures of Central Tendency * OpenStax-CNX module: m11011 1 Comparing Measures of Central Tendency * David Lane This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 1.0 1 Comparing Measures

More information

Vocabulary: Data About Us

Vocabulary: Data About Us Vocabulary: Data About Us Two Types of Data Concept Numerical data: is data about some attribute that must be organized by numerical order to show how the data varies. For example: Number of pets Measure

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:

More information

download instant at

download instant at Chapter 2 Test B Multiple Choice Section 2.1 (Visualizing Variation in Numerical Data) 1. [Objective: Interpret visual displays of numerical data] For twenty days a record store owner counts the number

More information

M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75 M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-13 13 14 3 15 8 16 4 17 10 18 9 19 7 20 3 21 16 22 2 Total 75 1 Multiple choice questions (1 point each) 1. Look at

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

MATH 10 INTRODUCTORY STATISTICS

MATH 10 INTRODUCTORY STATISTICS MATH 10 INTRODUCTORY STATISTICS Tommy Khoo Your friendly neighbourhood graduate student. Week 1 Chapter 1 Introduction What is Statistics? Why do you need to know Statistics? Technical lingo and concepts:

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

Statistics I Chapter 2: Univariate data analysis

Statistics I Chapter 2: Univariate data analysis Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,

More information

Sets and Set notation. Algebra 2 Unit 8 Notes

Sets and Set notation. Algebra 2 Unit 8 Notes Sets and Set notation Section 11-2 Probability Experimental Probability experimental probability of an event: Theoretical Probability number of time the event occurs P(event) = number of trials Sample

More information

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are

More information

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22 Announcements Announcements Lecture 1 - Data and Data Summaries Statistics 102 Colin Rundel January 13, 2013 Homework 1 - Out 1/15, due 1/22 Lab 1 - Tomorrow RStudio accounts created this evening Try logging

More information

Chapter 6 Group Activity - SOLUTIONS

Chapter 6 Group Activity - SOLUTIONS Chapter 6 Group Activity - SOLUTIONS Group Activity Summarizing a Distribution 1. The following data are the number of credit hours taken by Math 105 students during a summer term. You will be analyzing

More information

Statistics I Chapter 2: Univariate data analysis

Statistics I Chapter 2: Univariate data analysis Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,

More information

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. Chapter 3 Numerically Summarizing Data Chapter 3.1 Measures of Central Tendency Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. A1. Mean The

More information

Measures of center. The mean The mean of a distribution is the arithmetic average of the observations:

Measures of center. The mean The mean of a distribution is the arithmetic average of the observations: Measures of center The mean The mean of a distribution is the arithmetic average of the observations: x = x 1 + + x n n n = 1 x i n i=1 The median The median is the midpoint of a distribution: the number

More information

Chapter 1 Introduction & 1.1: Analyzing Categorical Data

Chapter 1 Introduction & 1.1: Analyzing Categorical Data Chapter 1 Chapter 1 Introduction & 1.1: Analyzing Categorical Data Population Sample Make an inference about the population. Collect data from a representative sample... Perform Data Analysis, keeping

More information

Exercises from Chapter 3, Section 1

Exercises from Chapter 3, Section 1 Exercises from Chapter 3, Section 1 1. Consider the following sample consisting of 20 numbers. (a) Find the mode of the data 21 23 24 24 25 26 29 30 32 34 39 41 41 41 42 43 48 51 53 53 (b) Find the median

More information