Stat 101 Exam 1 Important Formulas and Concepts 1

Size: px
Start display at page:

Download "Stat 101 Exam 1 Important Formulas and Concepts 1"

Transcription

1 1 Chapter Definitions Stat 101 Exam 1 Important Formulas and Concepts 1 1. Data Any collection of numbers, characters, images, or other items that provide information about something. 2. Categorical/Qualitative Variables Name categories for grouping. 3. Quantitative Variables When a variable contains measured numerical values with measurement units. 4. Identifier Variable Each record has a unique value like Student ID or SSN. 2 Chapter Definitions 1. Frequency Table Records to totals and uses the category names to label each row. 2. Relative Frequency Table Displays percentages of the values in each category. 3. Bar Chart Displays the distribution of a categorical variable, showing counts for each category next to each other for easy comparison. 4. Relative Frequency Bar Chart Same as a bar chart but displays the percentage of people in each category rather than the counts. 5. Pie Charts Shows a whole group of cases as a circle. The circle is sliced into pieces whose size is proportional to the fraction of the whole in each category. 1 This version: February 6, 2018, by Dale Embers. May not include all things that could possibly be tested on. To be used as an additional reference to studying all Chapters 1-8. All definitions, formulas, and selected problems come from Intro Stats by De Veaux, Velleman and Bock, 4th edition, published by Pearson.

2 6. Distribution Slices up all the possible values of the variable into equal width bins and gives the number of values (or counts) falling into each bin. 7. Histogram Uses adjacent bars to show the distribution of a quantitative variable. Each bar shows the frequency of values falling into each bin. 8. Unimodal Histogram with one peak. 9. Bi-modal Histogram with two peaks. 10. Uniform Histogram that doesn t appear to have any mode. Bars are approximately the same height for each bin. 11. Symmetric Histogram in which the two halves on either side of the center look approximately like mirror images. 12. Skew Histogram that is not symmetric. 13. Skew Left Histogram with a long tail on the left. 14. Skew Right Histogram with a long tail on the right. 2.2 Formulas 1. Median = Once the data is ordered from smallest to largest, it is the middle value in the data. Divides the histogram into 2 equal pieces. y 2. Mean = Average of all of the values = ȳ = n 3. Range = Max - Min 4. Q1 = Median of the lower half of the data 5. Q3 = Median of the upper half of the data 6. IQR = Q3 Q1 7. Variance: s 2 = (y ȳ) 2 n 1

3 8. Standard Deviation: s = s 2 (y ȳ) 2 = n 1 3 Chapter Definitions 1. Contingency Table Table which shows how the individuals are distributed along each variable. 2. Marginal Distribution Row total or column total in contingency tables. 3. Conditional Distribution Show distribution of one variable for just those cases that satisfy a condition on another variable. Example: Event B given Event A occurs first. 4 Chapter Definitions 1. Boxplot Displays the 5 number summary as a central box with whiskers that extend to the nonoutlying data values. 4.2 Formulas 1. Upper Fence for Boxplot = Q IQR 2. Lower Fence for Boxplot = Q1 1.5IQR 5 Chapter Definitions 1. z-score Tells how many standard deviations a value is from the mean. Regardless of direction, the farther a data vlaue is from the mean the more unusual it is. 2. Standard Normal Model A Normal Model N(µ, σ) with mean 0 and standard deviation Rule In a normal model, about 68% of values fall within 1 standard deviation of the mean,

4 about 95% fall within 2 standard deviations of the mean; about 99.7% of values fall within 3 standard deviations of the mean. 4. Normal Percentile The normal percentile corresponding to a z-score gives the percentage of values in a standard normal distribution found at that z-score or below. Compared to area under the curve. See Table Z in Appendix D of the textbook. 5. Normal Probability Plot A graph that tells whether or not it is ok to assume that data is normal. If the graph is a straight line, one may assume that it is normal. 5.2 Formulas 1. z-score: z = y ȳ s = y µ σ 5.3 Properties about the area under a Normal Curve 1. The total area is 100% 2. The mean is the center of a normal curve 3. A Standard Normal Curve has mean = 0 and standard deviation = 1 4. When a normal curve is split in half from the mean, each side contains 50% of the area 5. The normal curve is symmetric 6. If a normal curve is split with 30% of the area on one side, the other side of the curve is 70% of the area 7. If a normal curve has 60% of the area in the middle, the remaining portions are a total of 40%. This 40% is allocated half to each side. So the far left has 20% of the area, the middle is 60% of the area, and the far right side has 20% of the area. Textbook Table Z Appendix D Note: These tables give the percentage to the left of the z value. 6 Chapter Definitions 1. Scatterplot Shows the relationship between 2 quantitative variables.

5 2. Direction of Scatterplot Positive direction means as one variable increases so does the other. Decreasing direction means the association is negative. 3. Form of Scatterplot Is it in a straight line or some other form? 4. Strength of Scatterplot Strong association if there is little scatter around the underlying relationship. 5. Outlier A point that does not fit the overall pattern seen in the scatterplot. 6.2 Formulas 1. Z-Scores for a Scatterplot z x = x x s x z y = y ȳ s y 2. Correlation Coefficient r = = zxzy n 1 [(x x)(y ȳ)] (n 1)s xs y 6.3 Properties of Correlation Coefficient 1. Always between -1 and 1 2. Does not matter which variable you consider as x and y. 3. Treats x and y symmetrically 4. No units 5. Not affected by changes in scale or if the variables are standardized (changing to z- score) 6. Depends only on z-scores 7. Does NOT prove causation. Only provides a relationship between 2 variables. 7 Chapter Definitions 1. Linear Model Equation of the form ŷ = b 0 + b 1 X. ŷ means estimated values for y. 2. Predicted Values Value of ŷ found for a given x-value in the data.

6 3. Fitted Linear Model ŷ = b 0 + b 1 X 4. Residual Differences between data values and the corresponding values predicted by the model (observed - expected) 5. R 2 Gives the fraction of variability of y accounted for by the least squares linear regression on x. It is an overall measure of how successful the regression is in linearly relating y to x. 6. Least Squares Criterion Specifies the unique line that minimizes the variance of the residuals or the sum of squared residuals. 7.2 Formulas 1. Residual = Observed value - Predicted value = y ŷ 2. b 1 = r (s y /s x ) 3. b 0 = ȳ b 1 x 8 Chapter Definitions 1. Extrapolation In any regression situation it is unsafe. Predictions from extrapolation should not be trusted. 2. Outlier Any data point that stands away from the others. 3. Leverage Data points whose x-values are far from the mean of x. 4. Influential Point A point that,if omitted from the data, results in a very different regression model. 8.2 Residual Plots A residual plot is a scatterplot that shows the residual versus x values. There are 3 separate conditions that must be met in order to have an appropriate linear model. These conditions are 1) Linearity Condition, 2) Outlier Condition, and 3) Equal Spread Condition. The Linearly Condition is violated when there are bends in a residual plot. The Outlier Condition

7 is violated if any point is far from the rest of the points in the residual plot. The Equal Spread Condition is violated when the spread changes from one part of the plot to another. 9 Extra Information Review any and all notes and supplementary materials. It may be the case that something was accidentally omitted from this study guide. Also, review any problems that may have been discussed in class as not all example problems may have been provided here.

8 10 Example Problems 1. Use the below table to answer the following questions. Gender Eye Color Blue Green Brown Total Male Female Total (a) Construct a frequency table for Eye Color based on the data above. (b) Find the marginal distribution of gender. (c) What percentage of females have blue eyes? (d) What percentage of green eyed people are male? (e) What percentage of people are females and have green eyes? (f) What percentage of blue eyed people are female? (g) What percentage of males have brown eyes? (h) What percentage of people have brown eyes? (i) What percentage of people are males and have blue eyes? Q40 pg 40 We are investigating whether people taking antidepressants (SSRIs) might be at greater risk of bone fractures. We are given the below contingency table. SSRI no SSRI Total Fractures No Fractures Total (a) What percent of people taking SSRIs have fractures? (b) What percent of people not taking SSRIs have fractures? (c) Is the risk of bone fractures the same among people who were taking SSRIs versus those who were not? 2. In a histogram that is skewed to the right, which is larger, the mean or the median? 3. In a histogram that is skewed or has outliers, which should be reported, the mean or the median? 4. In a histogram that is skewed or has outliers, which should be reported, the IQR or the Standard Deviation? 5. In a histogram that is symmetric with no outliers, which pair of things should be reported, mean with the standard deviation, mean with the IQR, median with the standard deviation, or the median with the IQR?

9 Q23 pg 75 Here are costs of nine compact refrigerators rated very good or excellent by Consumer Reports on their website. 150, 150, 160, 180, 150, 140, 120, 130, 120 Find (a) Mean (b) Median and Quartiles (c) Range and IQR (d) What are the values of the upper fence and the lower fence? (e) Are there any outliers in this data? Why? (f) Find the variance and standard deviation. Q22 pg 103 Shown below are the histogram and summary statistics for the number of camp sites at public parks in Vermont. (a) Which statistics would you use to identify the center and spread of this distribution? Why? (b) How many parks would you classify as outliers? Explain. (c) Create a boxplot for this data.

10 6. Given mean 16 and standard deviation 3. (a) Standardize y = 9 (b) Standardize y = 21 (c) Which of the two above is most unusual? 7. Use the model N(8, 2). (a) What is the mean? (b) What is the standard deviation? (c) What is the variance? (d) Standardize y = Use a normal model with a mean of 50 and standard deviation of 5. (a) Using the model described above, draw the model showing what the Rule predicts. (b) In what interval would you expect the central 99.7% of values to be found? (c) What percent of values are above 50? (d) What percent of values are between 40 and 60? (e) What percent of values are between 40 and 50? (f) What percent of values are between 50 and 60? (g) What percent of values are between 45 and 50? (h) What percent of values are between 50 and 65? (i) What percent of values are above 60? (j) What percent of values are below 45? (k) What percent of values are between 40 and 65? (l) What percent of values are between 45 and 60? 9. Based on the Normal Model N(50, 5), answer the following questions. (a) What percent of values are above 50? (b) What percent of values are above 62? (c) What percent of values are below 39? (d) What percent of values are above 43? (e) What percent of values are below 58? (f) What percent of values are between 37 and 52? (g) What percent of values are betwen and 66?

11 10. Based on the Normal Model N(50, 5), answer the following questions. (a) What cutoff value bounds the highest 5% of values? (b) What cutoff value bounds the lowest 25% of values? (c) What cutoff value bounds the middle 70% of values? 11. Use the given data to answer the following questions. Energy Used (KWH) Price ($) mean = mean = std dev = 6.22 std dev = r = (a) Draw a scatterplot of the above data to study the association between the energy used and the price it cost. (b) What is the direction of the association? (c) What is the form of the relationship? (d) What is the strength of the relationship? (e) Are there any outliers? (f) What is the correlation coefficient? Now consider the regression line for the above data. (a) What is the slope of the regression line? (b) What is the intercept of the regression line? (c) What percent of variation is explained by this model? (d) What does the slope mean in this context? (e) What does the intercept mean in this context? (f) Write down the overall model. (g) What would you predict for the price in the case where there are 18 KWH of energy used? Is this prediction reasonable? (h) The energy company actually charges you $27.50 for 18 KWH of energy used. Is this good? How much would you save or lose compared to what you expected to pay?

12 (i) What would you predict for the price in the case where there are 35 KWH of energy used? Is this prediction reasonable? (j) If the energy used is 3 standard deviations above the mean energy used, what is the predicted standardized value for price? 12. For the below residual plots, decide if a linear model is appropriate. In the case that the linear model is not appropriate, decide which condition is violated (linearity, outlier, or equal spread). Figure 1: Residual Plots (a) Plot 1 (b) Plot 2 (c) Plot 3 (d) Plot 4 (e) Plot 6 (f) Plot 7

13 11 Example Solutions 1. Use the table to answer the questions. (a) Frequency Table: (b) Male: (c) 6 18 = 33.3%. (d) 7 9 = 77.8%. (e) 2 45 = 4.4%. (f) 6 11 = 54.5%. (g) = 55.6%. (h) = 55.6%. (i) 5 45 = 11.1%. Q40 pg 40 (a) 14/137 = 10.2% (b) 244/4871 = 5.01% = 0.6 = 60%. Female: Eye Color Count Blue 11 Green 9 Brown 25 = 0.4 = 40%. (c) No, the risk of bone fractures is about twice as high among the people who were taking SSRIs than among those who were not. 2. The mean is larger. 3. The median. 4. IQR. 5. Mean with the standard deviation. Q23 pg 75 (a) Mean = ȳ = 1300/9 = (b) First, order the numbers from lowest to highest. Median = middle value = 150 Q3 = 150 Q1 = 130 (c) IQR = = 20 Range = Max - Min = = 60 (d) Upper Fence = Q IQR = (20) = 180, Lower Fence = Q1 1.5IQR = (20) = 100.

14 (e) There are not outliers because Max Upper Fence and Min Lower Fence. (f) Variance = (y ȳ) 2 n 1 = ( )2 + ( ) 2 + ( ) 2 + ( ) 2 + ( ) 2 + ( = = , Std Dev = V ariance = = Q22 pg 103 (a) Median and IQR because the histogram is skewed right. (b) IQR = = 50 Upper Fence = Q IQR = (50) = = 153 Lower Fence = Q1-1.5IQR = (50) = = 47 The number of parks cannot be negative so the lower fence should be placed at 0. Since max > upper fence, there are outliers. There are probably 4-5 based on the histogram. (c) Note that the histogram below is rounded so if you use exact numbers, it may look slightly different.

15 6. Given mean 16 and standard deviation 3. (a) z = = 7 3 = (b) z = = 5 3 = (c) y = 9 is more unusual. 7. Use the model N(8, 2). (a) Mean = 8. (b) Standard deviation = 2. (c) Variance = 2 2 = 4. (d) z = = 2 2 = 1.

16 8. Use a normal model with a mean of 50 and standard deviation of 5. (a) For the Rule, we will have the following points on the graph (not shown here). µ 3σ = 50 3(5) = 35 µ 2σ = 50 2(5) = 40 µ σ = 50 5 = 45 µ = 50 µ + σ = = 55 µ + 2σ = (5) = 60 µ + 3σ = (5) = 65 (b) Between 35 and 65. (c) 50%. (d) 95%. (e) 47.5%. (f) 47.5%. (g) 34%. (h) 49.85%. (i) 2.5%. (j) 16%. (k) Between 40 and 50 is 47.5%. Between 50 and 65 is 49.85%. Between 40 and 65 is 97.35%. (l) Between 45 and 50 is 34%. Between 50 and 60 is 47.5%. Between 45 and 60 is 81.5%. 9. Based on the Normal Model N(50, 5), answer the following questions. Draw pictures to help you see what is going on. (a) 50%. (b) Step 1: Standardize. z = = 12 = 2.4. Step 2: Calculate value from a 5 5 calculator or a table. From calculator: normalcdf(2.4, 999) = Solution =0.82%. From table: We want area(z > 2.4). The table gives area(z < 2.4) = Our answer is = , which is 0.82%. (c) Step 1: Standardize. z = = 11 = 2.2. Step 2: Calculate value from a 5 5 calculator or a table. From calculator: normalcdf( 999, 2.2) = Solution = 1.39%. From table: We want area(z < 2.2). The table gives us this directly and the value is Solution = 1.39%. (d) Step 1: Standardize. z = = 7 = 1.4. Step 2: Calculate value from a 5 5 calculator or a table. From calculator: normalcdf( 1.4, 999) = Solution = 91.92$. From table: We want area(z > 1.4). The table gives area(z < 1.4) = Our answer is = , which is 91.92%.

17 (e) Step 1: Standardize. z = = 8 = 1.6. Step 2: Calculate value from a 5 5 calculator or a table. From calculator: normalcdf( 999, 1.6) = Solution = 94.52$. From table: We want area(z < 1.6). The table gives us this directly and the value is Solution = 94.52%. (f) Step 1: Standardize both values. z 1 = = 2.6. z 5 2 = = 0.4. Step 2: 5 Calculate value from a calculator or a table. From calculator: normalcdf( 2.6, 0.4) = Solution = 65.08%. From table: We want area( 2.6 < z < 0.4). The table gives area(z < 2.6) = and area(z < 0.4) = Our answer is = , which is 65.07%. The difference between the calculator answer and the table answer is because of rounding. Show your work! (g) Step 1: Standardize both values. z 1 = = z 5 2 = = 3.2. Step 2: 5 Calculate value from a calculator or a table. From calculator: normalcdf(1.45, 3.2) = Solution = 7.28%. From table: We want area(1.45 < z < 3.2). The table gives area(z < 1.45) = and area(z < 3.2) = Our answer is = , which is 7.28%. 10. Based on the Normal Model N(50, 5), answer the following questions. Draw pictures to help you see what is going on. (a) Highest 5% of values corresponds to a z-value of Find this using invnorm(0.95) on your calculator, or looking for the value on a table. Use the z-score formula to solve for the value you are looking for. z = = x = x 50 5 x = (b) Lower 25% of values corresponds to a z-value of Find this using invnorm(0.25) on your calculator, or looking for the value on a table. Use the z-score formula to solve for the value you are looking for. z = 0.67 = x = x 50 5 x = (c) We need 2 values for z in this case. Let the lower value of z be z L and the upper value of z be z R. Find these values by typing invnorm(0.15) and invnorm(0.85) on your calculator. We have z L = z R = 1.04 respectively. Solve for the two values of x. z L = 1.04 = x = x 50 x L = 44.8, z R = 1.04 = x = x 50 x R = Use the data to answer the following. (a) The scatterplot would be as below.

18 Price ($) Price ($) vs Energy Used (KWH) Energy Used (KWH) (b) Direction is positive. (c) The form of the relationship is linear. (d) The strength of the relationship is strong. (e) There are no outliers. (f) The correlation coefficient is Now consider the regression line for the above data. (a) The slope of the regression line is b 1 = r sy s x = ( ) = (b) The intercept of the regression line is b 0 = ȳ b 1 x = (9.125) = (c) The percent of variation explained by this model is R 2 = (0.9687) 2 = = 93.84%. (d) The slope means that as the energy used increases by 1 KWH, the price increases by $ (e) The intercept means that the base price for services, with no energy used, costs $ (f) The overall model is: ŷ = x. (g) You would predict the price to be (18) = $ This prediction is reasonable because we are not extrapolating from the data.

19 (h) The energy company actually charges you $27.50 for 18 KWH of energy used. This is good because we would expect to pay more than this for the energy used. You would save $2.63. Note that when you take = 2.63 you are calculating the residual. (i) You would predict the price to be (35) = $ This prediction is not reasonable and cannot be trusted because we are extrapolating. (j) The predicted standardized value for price is r(sd) = (3) = Classify each of the residual plots. (a) Linear model is not appropriate. The linearity condition is violated. (b) Linear model is appropriate. (c) Linear model is not appropriate. The outlier condition is violated. (d) Linear model is not appropriate. The equal spread condition is violated. (e) Linear model is not appropriate. The equal spread condition is violated. (f) Linear model is not appropriate. The linearity condition is violated.

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information

Chapter 1. Looking at Data

Chapter 1. Looking at Data Chapter 1 Looking at Data Types of variables Looking at Data Be sure that each variable really does measure what you want it to. A poor choice of variables can lead to misleading conclusions!! For example,

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

Sociology 6Z03 Review I

Sociology 6Z03 Review I Sociology 6Z03 Review I John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review I Fall 2016 1 / 19 Outline: Review I Introduction Displaying Distributions Describing

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Chapter 4. Displaying and Summarizing. Quantitative Data

Chapter 4. Displaying and Summarizing. Quantitative Data STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

STT 315 This lecture is based on Chapter 2 of the textbook.

STT 315 This lecture is based on Chapter 2 of the textbook. STT 315 This lecture is based on Chapter 2 of the textbook. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing him to use/edit some of their

More information

Example 2. Given the data below, complete the chart:

Example 2. Given the data below, complete the chart: Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

Chapter 3. Measuring data

Chapter 3. Measuring data Chapter 3 Measuring data 1 Measuring data versus presenting data We present data to help us draw meaning from it But pictures of data are subjective They re also not susceptible to rigorous inference Measuring

More information

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?!

Topic 3: Introduction to Statistics. Algebra 1. Collecting Data. Table of Contents. Categorical or Quantitative? What is the Study of Statistics?! Topic 3: Introduction to Statistics Collecting Data We collect data through observation, surveys and experiments. We can collect two different types of data: Categorical Quantitative Algebra 1 Table of

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data:

Lecture 2. Quantitative variables. There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Lecture 2 Quantitative variables There are three main graphical methods for describing, summarizing, and detecting patterns in quantitative data: Stemplot (stem-and-leaf plot) Histogram Dot plot Stemplots

More information

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67 Chapter 6 The Standard Deviation as a Ruler and the Normal Model 1 /67 Homework Read Chpt 6 Complete Reading Notes Do P129 1, 3, 5, 7, 15, 17, 23, 27, 29, 31, 37, 39, 43 2 /67 Objective Students calculate

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

Chapter 5. Understanding and Comparing. Distributions

Chapter 5. Understanding and Comparing. Distributions STAT 141 Introduction to Statistics Chapter 5 Understanding and Comparing Distributions Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 27 Boxplots How to create a boxplot? Assume

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

Unit 6 - Simple linear regression

Unit 6 - Simple linear regression Sta 101: Data Analysis and Statistical Inference Dr. Çetinkaya-Rundel Unit 6 - Simple linear regression LO 1. Define the explanatory variable as the independent variable (predictor), and the response variable

More information

Descriptive Univariate Statistics and Bivariate Correlation

Descriptive Univariate Statistics and Bivariate Correlation ESC 100 Exploring Engineering Descriptive Univariate Statistics and Bivariate Correlation Instructor: Sudhir Khetan, Ph.D. Wednesday/Friday, October 17/19, 2012 The Central Dogma of Statistics used to

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable QUANTITATIVE DATA Recall that quantitative (numeric) data values are numbers where data take numerical values for which it is sensible to find averages, such as height, hourly pay, and pulse rates. UNIVARIATE

More information

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Cengage Learning

More information

Units. Exploratory Data Analysis. Variables. Student Data

Units. Exploratory Data Analysis. Variables. Student Data Units Exploratory Data Analysis Bret Larget Departments of Botany and of Statistics University of Wisconsin Madison Statistics 371 13th September 2005 A unit is an object that can be measured, such as

More information

A is one of the categories into which qualitative data can be classified.

A is one of the categories into which qualitative data can be classified. Chapter 2 Methods for Describing Sets of Data 2.1 Describing qualitative data Recall qualitative data: non-numerical or categorical data Basic definitions: A is one of the categories into which qualitative

More information

Practice Questions for Exam 1

Practice Questions for Exam 1 Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon

More information

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things. (c) Epstein 2013 Chapter 5: Exploring Data Distributions Page 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms Individuals are the objects described by a set of data. These individuals

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22

Announcements. Lecture 1 - Data and Data Summaries. Data. Numerical Data. all variables. continuous discrete. Homework 1 - Out 1/15, due 1/22 Announcements Announcements Lecture 1 - Data and Data Summaries Statistics 102 Colin Rundel January 13, 2013 Homework 1 - Out 1/15, due 1/22 Lab 1 - Tomorrow RStudio accounts created this evening Try logging

More information

Sampling, Frequency Distributions, and Graphs (12.1)

Sampling, Frequency Distributions, and Graphs (12.1) 1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which

More information

appstats8.notebook October 11, 2016

appstats8.notebook October 11, 2016 Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus

More information

P8130: Biostatistical Methods I

P8130: Biostatistical Methods I P8130: Biostatistical Methods I Lecture 2: Descriptive Statistics Cody Chiuzan, PhD Department of Biostatistics Mailman School of Public Health (MSPH) Lecture 1: Recap Intro to Biostatistics Types of Data

More information

Chapter2 Description of samples and populations. 2.1 Introduction.

Chapter2 Description of samples and populations. 2.1 Introduction. Chapter2 Description of samples and populations. 2.1 Introduction. Statistics=science of analyzing data. Information collected (data) is gathered in terms of variables (characteristics of a subject that

More information

Chapter 5: Exploring Data: Distributions Lesson Plan

Chapter 5: Exploring Data: Distributions Lesson Plan Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions: Stemplots Describing Center: Mean and Median Describing Variability: The Quartiles The

More information

Chapter 3. Data Description

Chapter 3. Data Description Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

Lecture 1: Description of Data. Readings: Sections 1.2,

Lecture 1: Description of Data. Readings: Sections 1.2, Lecture 1: Description of Data Readings: Sections 1.,.1-.3 1 Variable Example 1 a. Write two complete and grammatically correct sentences, explaining your primary reason for taking this course and then

More information

Averages How difficult is QM1? What is the average mark? Week 1b, Lecture 2

Averages How difficult is QM1? What is the average mark? Week 1b, Lecture 2 Averages How difficult is QM1? What is the average mark? Week 1b, Lecture 2 Topics: 1. Mean 2. Mode 3. Median 4. Order Statistics 5. Minimum, Maximum, Range 6. Percentiles, Quartiles, Interquartile Range

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Lecture Notes 2: Variables and graphics

Lecture Notes 2: Variables and graphics Highlights: Lecture Notes 2: Variables and graphics Quantitative vs. qualitative variables Continuous vs. discrete and ordinal vs. nominal variables Frequency distributions Pie charts Bar charts Histograms

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

Chapter 6 Group Activity - SOLUTIONS

Chapter 6 Group Activity - SOLUTIONS Chapter 6 Group Activity - SOLUTIONS Group Activity Summarizing a Distribution 1. The following data are the number of credit hours taken by Math 105 students during a summer term. You will be analyzing

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to:

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to: STA 2023 Module 5 Regression and Correlation Learning Objectives Upon completing this module, you should be able to: 1. Define and apply the concepts related to linear equations with one independent variable.

More information

Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution.

Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution. Chapter 3: Displaying and summarizing quantitative data p52 The pattern of variation of a variable is called its distribution. 1 Histograms p53 The breakfast cereal data Study collected data on nutritional

More information

Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Lecture 3B: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots Mean vs.

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 2 Methods for Describing Sets of Data Summary of Central Tendency Measures Measure Formula Description Mean x i / n Balance Point Median ( n +1) Middle Value

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Section 1.3 with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1 Exploring Data Introduction: Data Analysis: Making Sense of Data 1.1

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Math 140 Introductory Statistics Professor Silvia Fernández Chapter 2 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Visualizing Distributions Recall the definition: The

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Visualizing Distributions Math 140 Introductory Statistics Professor Silvia Fernández Chapter Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Recall the definition: The

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

Chapter 7. Scatterplots, Association, and Correlation

Chapter 7. Scatterplots, Association, and Correlation Chapter 7 Scatterplots, Association, and Correlation Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 29 Objective In this chapter, we study relationships! Instead, we investigate

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing

More information

CHAPTER 2: Describing Distributions with Numbers

CHAPTER 2: Describing Distributions with Numbers CHAPTER 2: Describing Distributions with Numbers The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner Lecture PowerPoint Slides Chapter 2 Concepts 2 Measuring Center: Mean and Median Measuring

More information

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data Finding Quartiles. Use the median to divide the ordered data set into two halves.. If n is odd, do not include the median in either half. If n is even, split this data set exactly in half.. Q1 is the median

More information

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Chapter 6 The Standard Deviation as a Ruler and the Normal Model Chapter 6 The Standard Deviation as a Ruler and the Normal Model Overview Key Concepts Understand how adding (subtracting) a constant or multiplying (dividing) by a constant changes the center and/or spread

More information

The empirical ( ) rule

The empirical ( ) rule The empirical (68-95-99.7) rule With a bell shaped distribution, about 68% of the data fall within a distance of 1 standard deviation from the mean. 95% fall within 2 standard deviations of the mean. 99.7%

More information

1. Descriptive stats methods for organizing and summarizing information

1. Descriptive stats methods for organizing and summarizing information Two basic types of statistics: 1. Descriptive stats methods for organizing and summarizing information Stats in sports are a great example Usually we use graphs, charts, and tables showing averages and

More information

M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75 M 225 Test 1 B Name SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-13 13 14 3 15 8 16 4 17 10 18 9 19 7 20 3 21 16 22 2 Total 75 1 Multiple choice questions (1 point each) 1. Look at

More information

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27 Chapter 7: Statistics Describing Data Chapter 7: Statistics Describing Data 1 / 27 Categorical Data Four ways to display categorical data: 1 Frequency and Relative Frequency Table 2 Bar graph (Pareto chart)

More information

Histograms allow a visual interpretation

Histograms allow a visual interpretation Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called

More information

1.3: Describing Quantitative Data with Numbers

1.3: Describing Quantitative Data with Numbers 1.3: Describing Quantitative Data with Numbers Section 1.3 Describing Quantitative Data with Numbers After this section, you should be able to MEASURE center with the mean and median MEASURE spread with

More information

Chapter 3 Examining Data

Chapter 3 Examining Data Chapter 3 Examining Data This chapter discusses methods of displaying quantitative data with the objective of understanding the distribution of the data. Example During childhood and adolescence, bone

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 3 Numerical Descriptive Measures 3-1 Learning Objectives In this chapter, you learn: To describe the properties of central tendency, variation,

More information

MATH 2560 C F03 Elementary Statistics I Lecture 1: Displaying Distributions with Graphs. Outline.

MATH 2560 C F03 Elementary Statistics I Lecture 1: Displaying Distributions with Graphs. Outline. MATH 2560 C F03 Elementary Statistics I Lecture 1: Displaying Distributions with Graphs. Outline. data; variables: categorical & quantitative; distributions; bar graphs & pie charts: What Is Statistics?

More information

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart

ST Presenting & Summarising Data Descriptive Statistics. Frequency Distribution, Histogram & Bar Chart ST2001 2. Presenting & Summarising Data Descriptive Statistics Frequency Distribution, Histogram & Bar Chart Summary of Previous Lecture u A study often involves taking a sample from a population that

More information

Describing Distributions

Describing Distributions Describing Distributions With Numbers April 18, 2012 Summary Statistics. Measures of Center. Percentiles. Measures of Spread. A Summary Statement. Choosing Numerical Summaries. 1.0 What Are Summary Statistics?

More information

Unit Six Information. EOCT Domain & Weight: Algebra Connections to Statistics and Probability - 15%

Unit Six Information. EOCT Domain & Weight: Algebra Connections to Statistics and Probability - 15% GSE Algebra I Unit Six Information EOCT Domain & Weight: Algebra Connections to Statistics and Probability - 15% Curriculum Map: Describing Data Content Descriptors: Concept 1: Summarize, represent, and

More information

Descriptive Data Summarization

Descriptive Data Summarization Descriptive Data Summarization Descriptive data summarization gives the general characteristics of the data and identify the presence of noise or outliers, which is useful for successful data cleaning

More information

PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics

PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics LT1: Basics of Correlation LT2: Measuring Correlation and Line of best fit by eye Univariate (one variable) Displays Frequency tables Bar graphs

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Exploring Data: Distributions Look for overall pattern (shape, center, spread) and deviations (outliers). Mean (use a calculator): x = x 1 + x

More information

FREQUENCY DISTRIBUTIONS AND PERCENTILES

FREQUENCY DISTRIBUTIONS AND PERCENTILES FREQUENCY DISTRIBUTIONS AND PERCENTILES New Statistical Notation Frequency (f): the number of times a score occurs N: sample size Simple Frequency Distributions Raw Scores The scores that we have directly

More information

UCLA STAT 10 Statistical Reasoning - Midterm Review Solutions Observational Studies, Designed Experiments & Surveys

UCLA STAT 10 Statistical Reasoning - Midterm Review Solutions Observational Studies, Designed Experiments & Surveys UCLA STAT 10 Statistical Reasoning - Midterm Review Solutions Observational Studies, Designed Experiments & Surveys.. 1. (i) The treatment being compared is: (ii). (5) 3. (3) 4. (4) Study 1: the number

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information

STA 218: Statistics for Management

STA 218: Statistics for Management Al Nosedal. University of Toronto. Fall 2017 My momma always said: Life was like a box of chocolates. You never know what you re gonna get. Forrest Gump. Problem How much do people with a bachelor s degree

More information

Performance of fourth-grade students on an agility test

Performance of fourth-grade students on an agility test Starter Ch. 5 2005 #1a CW Ch. 4: Regression L1 L2 87 88 84 86 83 73 81 67 78 83 65 80 50 78 78? 93? 86? Create a scatterplot Find the equation of the regression line Predict the scores Chapter 5: Understanding

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variable In this lecture: We shall look at two quantitative variables.

More information

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Chapter 3 continued Describing distributions with numbers Measuring spread of data: Quartiles Definition 1: The interquartile

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3: Examining Relationships Most statistical studies involve more than one variable. Often in the AP Statistics exam, you will be asked to compare two data sets by using side by side boxplots or

More information

Comparing Measures of Central Tendency *

Comparing Measures of Central Tendency * OpenStax-CNX module: m11011 1 Comparing Measures of Central Tendency * David Lane This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 1.0 1 Comparing Measures

More information

Stat 101: Lecture 6. Summer 2006

Stat 101: Lecture 6. Summer 2006 Stat 101: Lecture 6 Summer 2006 Outline Review and Questions Example for regression Transformations, Extrapolations, and Residual Review Mathematical model for regression Each point (X i, Y i ) in the

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

8/4/2009. Describing Data with Graphs

8/4/2009. Describing Data with Graphs Describing Data with Graphs 1 A variable is a characteristic that changes or varies over time and/or for different individuals or objects under consideration. Examples: Hair color, white blood cell count,

More information

Chapter 1 Introduction & 1.1: Analyzing Categorical Data

Chapter 1 Introduction & 1.1: Analyzing Categorical Data Chapter 1 Chapter 1 Introduction & 1.1: Analyzing Categorical Data Population Sample Make an inference about the population. Collect data from a representative sample... Perform Data Analysis, keeping

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

1. Exploratory Data Analysis

1. Exploratory Data Analysis 1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be

More information

Math 138 Summer Section 412- Unit Test 1 Green Form, page 1 of 7

Math 138 Summer Section 412- Unit Test 1 Green Form, page 1 of 7 Math 138 Summer 1 2013 Section 412- Unit Test 1 Green Form page 1 of 7 1. Multiple Choice. Please circle your answer. Each question is worth 3 points. (a) Social Security Numbers are illustrations of which

More information

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives F78SC2 Notes 2 RJRC Algebra It is useful to use letters to represent numbers. We can use the rules of arithmetic to manipulate the formula and just substitute in the numbers at the end. Example: 100 invested

More information

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories.

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories. Chapter Goals To understand the methods for displaying and describing relationship among variables. Formulate Theories Interpret Results/Make Decisions Collect Data Summarize Results Chapter 7: Is There

More information