All the men living in Turkey can be a population. The average height of these men can be a population parameter

Size: px
Start display at page:

Download "All the men living in Turkey can be a population. The average height of these men can be a population parameter"

Transcription

1 CHAPTER 1: WHY STUDY STATISTICS? Why Study Statistics? Population is a large (or in nite) set of elements that are in the interest of a research question. A parameter is a speci c characteristic of a population All the men living in Turkey can be a population. The average height of these men can be a population parameter Sample is a subset of population that we use to withdraw conclusions or predictions on the parameters of the population (for inferences to be valid, sampling should be random). Statistics is a characteristic of the sample Instead of measuring the height of every man in Turkey, we can randomly select 5000 men from di erent locations of the country. This would be our sample. Then we can nd the average height of these people to estimate average height of the men in Turkey. This would be our sample statistics 1

2 Types of Statistics Inferential Statistics: This is what explained above; i.e. using sample data to make estimation and hypothesis testing (the tools that helps us to make statements and decisions under uncertainty, incomplete information) Descriptive Statistics: Graphical and numerical procedures that are used to present and summarize data. We can use descriptive statistics on either population, or sample data Chapter Summary Terms reviewed in this chapter: Population (Populasyon) Parameter (Parametre) Sample (Örneklem) Inferential Statistics (Ç kar msal Istatistik) Estimation (Tahmin) Descriptive Statistics (Betimleyici Istatistik) 2

3 CHAPTER 2: USING GRAPHS TO DESCRIBE DATA Data, Variable, and Constant Data are usually just a set of numbers representing the same kind of thing, such as body weight. That "thing" is called a variable (it is variable because the numbers vary from subject to subject). If the numbers are the same, the thing is called a constant Classi cation of Variables Categorical (sometimes called Nominal) or Numerical Categorical: (Yes or No), (Like, Dislike or Indi erent),... Numerical: (Discrete: Outcome of a dice,...), (Continuous: Height, time,...) Qualitative or Quantitative Qualitative: These variables are measured on an ordinal, interval, or ratio scale to describe variables. Numerical identi cation is only given to make variables categorized (Yes and No can be labeled as 0 and1). Ordered data indicate the rank of ordering items as well (Like, Dislike and Indi erent can be labelled as 2, 1,0). This thpe of data can be either categorical or numerical Quantitative: They are measured on a nominal scale. Hence, numeric values matter 3

4 Independent or Dependent Independent: A variable that stands alone and isn t changed by the other variables (ex. someone s age) Dependent: A variable that is explained by independent variables 4

5 Tables And Graphs to Describe Categorical Variables The Frequency Distribution Table reveals the number of occurrence (frequency) of each possible outcome A probability distribution is a frequency distribution with each frequency divided by the total number of observations Bar Chart, Pie Chart and Pareto Diagram are the graphics that present the same information with the Frequency Distribution Table Example: Hospital Patients by Unit Frequency Distribution Table Bar Chart Pie Chart Hospital Unit Number of Patients Cardiac Care 1,052 Emergency 2,245 Intensive Care 340 Maternity 552 Surgery 4,630 Number of patients per year Cardiac Care Hospital Patients by Unit Emergency Intensive Care Maternity Surgery Surgery 53% Hospital Patients by Unit Cardiac Care 12% Emergency 25% Intensive Care 4% Maternity 6% 5

6 Pareto Diagram: It is a special Bar Chart. But unlike Bar and Pie Charts, Pareto diagram presents the information in an order (descending or ascending), and the cumulative total is represented by the line Ex: 400 defective items are examined for cause of defect Frequency Distribution Table Source of Manufacturing Error Bad Weld Poor Alignment Missing Part Paint Flaw Electrical Short Cracked case Total Pareto Diagram Number of defects Source of Manufacturing Error Poor Alignment Paint Flaw Bad Weld Missing Part Cracked case Electrical Short Total Arranging Data Number of defects % of Total Defects % Pareto Diagram: Cause of Manufacturing Defect 60% 100% 90% % of defects in each category (bar graph) 50% 40% 30% 20% 10% 0% Poor Alignment Paint Flaw Bad Weld Missing Part Cracked case Electrical Short 80% 70% 60% 50% 40% 30% 20% 10% 0% cumulative %(line graph) 6

7 Tables And Graphs to Describe Numerical Variables We have frequency distribution just like the case with categorical variables. However, since this time the data is not categorized into groups, it is better to form arti cial groups instead of revealing frequency of each data point Ex: A manufacturer of insulation randomly selects 20 winter days and records the daily high temperature: 24, 35, 17, 21, 24, 37, 26, 46, 58, 30, 32, 13, 12, 38, 41, 43, 44, 27, 53, 27 The Ordered Data 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 The Frequency Distribution Table Interval Frequency Relative Frequency Percentage more than 10 but less than more than 20 but less than more than 30 but less than more than 40 but less than more than 50 but less than Total

8 Note: In this example to classify the data we used intervals of 10. However, there is no rule for that. The decision should be case speci c Note: The best graph is always the one that displays the information in the most clear and apprehensible way. There is no restriction for the type of the graph that you would use. However, remember that it may also be risky not to use standard graphs as it may lead confusion for the readers Histogram It is a graph of the (numerical) data in a frequency distribution Interval Frequency 10 but less than but less than but less than but less than but less than 60 2 Frequency Histogram: Daily High Temperature

9 The Cumulative Frequency Distribution & Ogive (graphing cumulative frequencies) Interval Frequency Percentage Cumulative Cumulative Frequency Percentage more than 10 but less than more than 20 but less than more than 30 but less than more than 40 but less than more than 50 but less than Total Ogive: Daily High Temperature 100 Cumulative Percentage

10 A line chart (time-series plot) It is used to show the values of a variable over time (time series data) Time is measured on the horizontal axis The variable of interest is measured on the vertical axis An Example: Magazine Subscriptions by Year Thousands of subscribers Cross-Sectional Data: Refers to data collected by observing many subjects at the same point of time. It is collected usually for the purpose of comparison Time series-cross-sectional Data: Refers to data collected by observing many subjects at the successive points in time 10

11 The shape of the distribution The shape of the distribution is said to be symmetric if the observations are balanced, or evenly distributed, about the center Symmetric Distribution Frequency The shape of the distribution is said to be skewed if the observations are not symmetrically distributed around the center Negatively Skewed Distribution Positively Skewed Distribution Frequency Frequency

12 Tables and Graphs to Describe Relationship Between Variables Graphs illustrated so far have involved only a single variable When two variables exist other techniques are used: Categorical (Qualitative) Variables: Cross tables (or contingency tables) Numerical (Quantitative) Variables : Scatter plots Cross Tables If there are r categories for the rst variable (rows) and c categories for the second variable (columns), the table is called an r x c cross table Ex: 4 x 3 Cross Table for Investment Choices by Investor Investment Investor A Investor B Investor C Total Category Stocks Bonds CD Savings Total

13 Comparing Investors Savings CD Side by side bar chart Bonds Stocks Investor A Investor B Investor C Scatter Diagrams They are used for paired observations taken from two numerical variables. One variable is measured on the vertical axis and the other variable is measured on the horizontal axis Volume per day Cost per day Cost per Day Cost per Day vs. Production Volume Volume per Day 13

14 Chapter Summary Data (veri) in raw form are usually not easy to use for decision making. Some type of organization in the form of table or graphs is needed Terms reviewed in this chapter: Variables (De¼gişkenler): Categorical (Kategorik) Qualitative (Niteliksel) Independent (Ba¼g ms z) Numerical (Say sal) Quantitative (Niceliksel) Dependent (Ba¼g ml ) Ordinal scale (S rasal Ölçek) Ratio scale(oransal Ölçek) Interval scale(aral ksal Ölçek) Nominal scale (Say sal Ölçek) Line chart (Çizgisel gra k) Bar chart (Çubuk gra k) Pie chart (Dairesel Gra k) Pareto diagram (Pareto Diyagram ) Histogram (Histogram) Ogive (A cumulative line graph) The Cumulative Frequency distribution (Kümülatif Frekans Da¼g l m ) Time Series (Zaman Serisi) Time Series (Zaman Serisi) Skewed (Çarp k Da¼g l m) Scatter plot (Saç l m Gra ¼gi) 14

15 15

16 CHAPTER 3: USING NUMERICAL MEASURES TO DESCRIBE DATA Measures of Central Tendency Mean: Arithmetic average of values (sum of values divided by the number of them) Median: Midpoint of ranked values Mode: Most frequently observed value in the data Ex: Suppose the following bicycle prices: 2.000, 100, 300, 100, 500 The mean is: ( )/5=600 The median can be found after ranking: 2.000, 500, 300, 100, 100; which is 300 The mode is 100 Even though the mean is the most generally used measure of central tendency, it is seen that it is subject to outliers that is, it is highly a ected from high or low values in the data even though these values may not be very informative Then median is often used, since the median is not sensitive to extreme values 16

17 Note: the location of the median is n position in the ordered data If the number of values is odd, the median is the middle number If the number of values is even, the median is the average of the two middle numbers Formally, the mean (also called arithmetic mean) is If calculated from population of N values, the mean is denoted by and calculated as: = NP x i i=1 N = x 1 + x 1 + ::: + x N N If calculated from sample size of n values, the mean is denoted by x and calculated as: x = np x i i=1 n = x 1 + x 1 + ::: + x n n 17

18 Mean and Median Depending on Shape of a Distribution Left Skewed Mean < Median Symmetric Mean = Median Right Skewed Median < Mean Measures of Variability Measures of variation give information on the spread or variability of the data values Ex: Same center, di erent variation 18

19 There are di erent measures of variability. The ones we are going to discuss Range: Di erence between the largest and the smallest observations Interquartile Range: Eliminate high- and low-valued observations and calculate the range of the middle 50% of the data Variance: Average of squared deviations of values from the mean Standard Deviation: Square Root of Variance Coe cient of Variation: Standard Deviation divided by mean (shows relative variation) Range: Di erence between the largest and the smallest observations Ex: Range = 14 1 = 13 However, it ignores the way in which data are distributed and sensitive to outliers Range = 12 7 = Range = 12 7 = 5 19

20 Interquartile Range: Eliminate high- and low-valued observations and calculate the range of the middle 50% of the data The rst quartile, Q1, is the value for which 25% of the observations are smaller and 75% are larger Q2 is the same as the median (50% are smaller, 50% are larger) Only 25% of the observations are greater than the third quartile Ex: Median X X minimum Q1 (Q2) Q3 maximum 25% 25% 25% 25% Interquartile range = = 27 20

21 Variance: Average of squared deviations of values from the mean Population mean and variance = NP x i i=1 N 2 = NP (x i ) 2 i=1 N Sample mean and variance x = NP x i i=1 n s 2 = NP (x i x) 2 i=1 n 1 Standard Deviation: It is square root of variance. is the population standard deviation, and s is the sample standard deviation 21

22 Ex: Sample Data (x i ): 10, 12, 14, 15, 17, 18, 18, 24 The sample size, n = 8. The mean can be found by x = = 16 The standard deviation can be found by s s (10 x) 2 + (12 x) 2 + ::: + (24 x) 2 (10 16) 2 + (12 16) 2 + ::: + (24 16) 2 s = = n r 126 s = = 4:2426 (a measure of the average scatter around the mean) 7 You don t have to rank the data to nd variance or standard deviation Both measure is used for hypothesis testing for a single distribution, but cannot be used to compare variability of di erent distributions 22

23 Coe cient of Variation: Shows variation relative to mean, so that it measures relative variation and can be used to compare two or more sets of data measured in di erent units Ex: CV = ( s x )100% Stock A: Average price last year = $50 Standard deviation = $5 CV = ( 5 )100% = 10% 50 Stock B: Average price last year = $100 Standard deviation = $5 CV = ( 5 )100% = 5% 100 Both stocks have the same standard deviation, but stock B is less variable relative to its price 23

24 More About Standard Deviation of a Distribution Chebyshev s Theorem: For any distribution (not necessarily normal) with mean and standard deviation, and k > 1, the part of the observations that fall within the interval k (i.e. k standard deviations of the mean) includes at least this much of the data 100[1 (1=k 2 )]% Ex: At least Within (1 1=1:5 2 ) = 55:5 % k = 1:5 ( 1:5) (1 1=2 2 ) = 75 % k = 2 ( 2) (1 1=3 2 ) = 88:9 % k = 3 ( 3) 24

25 If the data distribution is bell-shaped (normally distributed), then the interval 1 contains about 68 % of the values in the population or the sample 2 contains about 95 % of the values in the population or the sample 3 contains about 99:7 % of the values in the population or the sample 68% μ μ ±1σ 95% μ ± 2σ 99.7% μ ± 3σ 25

26 Weighted Mean and Measures of Grouped Data np w i x i i=1 The weighted mean of a set of data is x = P wi = w 1x 1 + w 2 x 2 + ::: + w n x P n wi where w i is the weight of the i th observation Can be used when data is already grouped into n classes, with w i values in the i th class Suppose a data set contains values m 1 ; m 2 ; :::; m k, occurring with frequencies f 1 ; f 2 ; :::f K Population mean and variance = KP f i m i i=1 N where P N = K f i, and 2 = i=1 KP f i (m i ) 2 i=1 N Sample mean and variance x = NP f i m i i=1 n where P n = K f i, and s 2 = i=1 KP f i (m i x) 2 i=1 n 1 26

27 Measures of Relationships Between Variables The covariance measures the strength of the linear relationship between two variables Population covariance: Cov(x; y) = 2 xy = NP (x i x )(y i y ) i=1 N Sample covariance: Cov(x; y) = s 2 xy = np (x i x)(y i y) i=1 n 1 Only concerned with the strength of the relationship No causal e ect is implied Interpreting Covariance: Cov(x; y) > 0 ) x and y tend to move in the same direction Cov(x; y) < 0 ) x and y tend to move in opposite directions Cov(x; y) = 0 ) there is no linear relation between x and y 27

28 Coe cient of Correlation measures the relative strength of the linear relationship between two variables. It is relative because, unlike covariance, this measure is not a ected from the magnitude of data Population correlation coe cient: = Cov(x; y) x y Sample correlation coe cient: r = Cov(x; y) s x s y It is unit free and ranges between 1 and 1. The closer to 1, the stronger the negative linear relationship. 0 indicates no relationship between the variables of interest Y X X X r = 1 r =.6 r = 0 Y Y r = +1 X r = +.3 X r = 0 X 28

29 29

30 Chapter Summary Terms reviewed in this chapter: Mean (Ortalama) Mode (Mod, Tepe De¼geri) Range (De¼gişim Aral ¼g ) Interquartile Range (Yar -çeyreklik De¼gişim Aral ¼g ) Standard Deviation (Standart Sapma) Covariance (Covaryasyon) Median (Medyan, Ortanca De¼ger) Measure (Ölçü) Variance (Varyasyon) Coe cient of Variation (Varyasyon Katsay s ) Weighted Mean (A¼g rl kl ortalama) Correlation (Corelasyon) 30

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 3 Numerical Descriptive Measures 3-1 Learning Objectives In this chapter, you learn: To describe the properties of central tendency, variation,

More information

A is one of the categories into which qualitative data can be classified.

A is one of the categories into which qualitative data can be classified. Chapter 2 Methods for Describing Sets of Data 2.1 Describing qualitative data Recall qualitative data: non-numerical or categorical data Basic definitions: A is one of the categories into which qualitative

More information

Statistics I Chapter 2: Univariate data analysis

Statistics I Chapter 2: Univariate data analysis Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,

More information

After completing this chapter, you should be able to:

After completing this chapter, you should be able to: Chapter 2 Descriptive Statistics Chapter Goals After completing this chapter, you should be able to: Compute and interpret the mean, median, and mode for a set of data Find the range, variance, standard

More information

Statistics I Chapter 2: Univariate data analysis

Statistics I Chapter 2: Univariate data analysis Statistics I Chapter 2: Univariate data analysis Chapter 2: Univariate data analysis Contents Graphical displays for categorical data (barchart, piechart) Graphical displays for numerical data data (histogram,

More information

Class 11 Maths Chapter 15. Statistics

Class 11 Maths Chapter 15. Statistics 1 P a g e Class 11 Maths Chapter 15. Statistics Statistics is the Science of collection, organization, presentation, analysis and interpretation of the numerical data. Useful Terms 1. Limit of the Class

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics Data and Statistics Data consists of information coming from observations, counts, measurements, or responses. Statistics is the science of collecting, organizing, analyzing,

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

The science of learning from data.

The science of learning from data. STATISTICS (PART 1) The science of learning from data. Numerical facts Collection of methods for planning experiments, obtaining data and organizing, analyzing, interpreting and drawing the conclusions

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics By A.V. Vedpuriswar October 2, 2016 Introduction The word Statistics is derived from the Italian word stato, which means state. Statista refers to a person involved with the

More information

P8130: Biostatistical Methods I

P8130: Biostatistical Methods I P8130: Biostatistical Methods I Lecture 2: Descriptive Statistics Cody Chiuzan, PhD Department of Biostatistics Mailman School of Public Health (MSPH) Lecture 1: Recap Intro to Biostatistics Types of Data

More information

Sets and Set notation. Algebra 2 Unit 8 Notes

Sets and Set notation. Algebra 2 Unit 8 Notes Sets and Set notation Section 11-2 Probability Experimental Probability experimental probability of an event: Theoretical Probability number of time the event occurs P(event) = number of trials Sample

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number

More information

Tastitsticsss? What s that? Principles of Biostatistics and Informatics. Variables, outcomes. Tastitsticsss? What s that?

Tastitsticsss? What s that? Principles of Biostatistics and Informatics. Variables, outcomes. Tastitsticsss? What s that? Tastitsticsss? What s that? Statistics describes random mass phanomenons. Principles of Biostatistics and Informatics nd Lecture: Descriptive Statistics 3 th September Dániel VERES Data Collecting (Sampling)

More information

Chapter 1:Descriptive statistics

Chapter 1:Descriptive statistics Slide 1.1 Chapter 1:Descriptive statistics Descriptive statistics summarises a mass of information. We may use graphical and/or numerical methods Examples of the former are the bar chart and XY chart,

More information

additionalmathematicsstatisticsadditi onalmathematicsstatisticsadditionalm athematicsstatisticsadditionalmathem aticsstatisticsadditionalmathematicsst

additionalmathematicsstatisticsadditi onalmathematicsstatisticsadditionalm athematicsstatisticsadditionalmathem aticsstatisticsadditionalmathematicsst additionalmathematicsstatisticsadditi onalmathematicsstatisticsadditionalm athematicsstatisticsadditionalmathem aticsstatisticsadditionalmathematicsst STATISTICS atisticsadditionalmathematicsstatistic

More information

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES VARIABLE Studying the behavior of random variables, and more importantly functions of random variables is essential for both the

More information

Measures of center. The mean The mean of a distribution is the arithmetic average of the observations:

Measures of center. The mean The mean of a distribution is the arithmetic average of the observations: Measures of center The mean The mean of a distribution is the arithmetic average of the observations: x = x 1 + + x n n n = 1 x i n i=1 The median The median is the midpoint of a distribution: the number

More information

Lecture 1: Descriptive Statistics

Lecture 1: Descriptive Statistics Lecture 1: Descriptive Statistics MSU-STT-351-Sum 15 (P. Vellaisamy: MSU-STT-351-Sum 15) Probability & Statistics for Engineers 1 / 56 Contents 1 Introduction 2 Branches of Statistics Descriptive Statistics

More information

Descriptive Univariate Statistics and Bivariate Correlation

Descriptive Univariate Statistics and Bivariate Correlation ESC 100 Exploring Engineering Descriptive Univariate Statistics and Bivariate Correlation Instructor: Sudhir Khetan, Ph.D. Wednesday/Friday, October 17/19, 2012 The Central Dogma of Statistics used to

More information

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11.

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11. Chapter 4 Statistics 45 CHAPTER 4 BASIC QUALITY CONCEPTS 1.0 Continuous Distributions.0 Measures of Central Tendency 3.0 Measures of Spread or Dispersion 4.0 Histograms and Frequency Distributions 5.0

More information

Chapter 3. Data Description

Chapter 3. Data Description Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.

More information

Descriptive Data Summarization

Descriptive Data Summarization Descriptive Data Summarization Descriptive data summarization gives the general characteristics of the data and identify the presence of noise or outliers, which is useful for successful data cleaning

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Section 1.3 with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1 Exploring Data Introduction: Data Analysis: Making Sense of Data 1.1

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information

Frequency Distribution Cross-Tabulation

Frequency Distribution Cross-Tabulation Frequency Distribution Cross-Tabulation 1) Overview 2) Frequency Distribution 3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability iii. Measures of Shape

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected What is statistics? Statistics is the science of: Collecting information Organizing and summarizing the information collected Analyzing the information collected in order to draw conclusions Two types

More information

ECLT 5810 Data Preprocessing. Prof. Wai Lam

ECLT 5810 Data Preprocessing. Prof. Wai Lam ECLT 5810 Data Preprocessing Prof. Wai Lam Why Data Preprocessing? Data in the real world is imperfect incomplete: lacking attribute values, lacking certain attributes of interest, or containing only aggregate

More information

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability STA301- Statistics and Probability Solved MCQS From Midterm Papers March 19,2012 MC100401285 Moaaz.pk@gmail.com Mc100401285@gmail.com PSMD01 MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

More information

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data

Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Chapter 2: Summarising numerical data Further Mathematics 2018 CORE: Data analysis Chapter 2 Summarising numerical data Extract from Study Design Key knowledge Types of data: categorical (nominal and ordinal)

More information

Section 1.1. Data - Collections of observations (such as measurements, genders, survey responses, etc.)

Section 1.1. Data - Collections of observations (such as measurements, genders, survey responses, etc.) Section 1.1 Statistics - The science of planning studies and experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting, and drawing conclusions based on the data.

More information

TOPIC: Descriptive Statistics Single Variable

TOPIC: Descriptive Statistics Single Variable TOPIC: Descriptive Statistics Single Variable I. Numerical data summary measurements A. Measures of Location. Measures of central tendency Mean; Median; Mode. Quantiles - measures of noncentral tendency

More information

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Spring 2008

DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS QM 120. Spring 2008 DEPARTMENT OF QUANTITATIVE METHODS & INFORMATION SYSTEMS Introduction to Business Statistics QM 120 Chapter 3 Spring 2008 Measures of central tendency for ungrouped data 2 Graphs are very helpful to describe

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3-2 Measures

More information

Example 2. Given the data below, complete the chart:

Example 2. Given the data below, complete the chart: Statistics 2035 Quiz 1 Solutions Example 1. 2 64 150 150 2 128 150 2 256 150 8 8 Example 2. Given the data below, complete the chart: 52.4, 68.1, 66.5, 75.0, 60.5, 78.8, 63.5, 48.9, 81.3 n=9 The data is

More information

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511 Topic 2 - Descriptive Statistics STAT 511 Professor Bruce Craig Types of Information Variables classified as Categorical (qualitative) - variable classifies individual into one of several groups or categories

More information

Histograms allow a visual interpretation

Histograms allow a visual interpretation Chapter 4: Displaying and Summarizing i Quantitative Data s allow a visual interpretation of quantitative (numerical) data by indicating the number of data points that lie within a range of values, called

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics CHAPTER OUTLINE 6-1 Numerical Summaries of Data 6- Stem-and-Leaf Diagrams 6-3 Frequency Distributions and Histograms 6-4 Box Plots 6-5 Time Sequence Plots 6-6 Probability Plots Chapter

More information

University of Jordan Fall 2009/2010 Department of Mathematics

University of Jordan Fall 2009/2010 Department of Mathematics handouts Part 1 (Chapter 1 - Chapter 5) University of Jordan Fall 009/010 Department of Mathematics Chapter 1 Introduction to Introduction; Some Basic Concepts Statistics is a science related to making

More information

Preliminary Statistics course. Lecture 1: Descriptive Statistics

Preliminary Statistics course. Lecture 1: Descriptive Statistics Preliminary Statistics course Lecture 1: Descriptive Statistics Rory Macqueen (rm43@soas.ac.uk), September 2015 Organisational Sessions: 16-21 Sep. 10.00-13.00, V111 22-23 Sep. 15.00-18.00, V111 24 Sep.

More information

Higher Secondary - First year STATISTICS Practical Book

Higher Secondary - First year STATISTICS Practical Book Higher Secondary - First year STATISTICS Practical Book th_statistics_practicals.indd 07-09-08 8:00:9 Introduction Statistical tools are important for us in daily life. They are used in the analysis of

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

1. Exploratory Data Analysis

1. Exploratory Data Analysis 1. Exploratory Data Analysis 1.1 Methods of Displaying Data A visual display aids understanding and can highlight features which may be worth exploring more formally. Displays should have impact and be

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27

Chapter 7: Statistics Describing Data. Chapter 7: Statistics Describing Data 1 / 27 Chapter 7: Statistics Describing Data Chapter 7: Statistics Describing Data 1 / 27 Categorical Data Four ways to display categorical data: 1 Frequency and Relative Frequency Table 2 Bar graph (Pareto chart)

More information

Essentials of Statistics and Probability

Essentials of Statistics and Probability May 22, 2007 Department of Statistics, NC State University dbsharma@ncsu.edu SAMSI Undergrad Workshop Overview Practical Statistical Thinking Introduction Data and Distributions Variables and Distributions

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:

More information

Chapter 2: Descriptive Analysis and Presentation of Single- Variable Data

Chapter 2: Descriptive Analysis and Presentation of Single- Variable Data Chapter 2: Descriptive Analysis and Presentation of Single- Variable Data Mean 26.86667 Standard Error 2.816392 Median 25 Mode 20 Standard Deviation 10.90784 Sample Variance 118.981 Kurtosis -0.61717 Skewness

More information

BNG 495 Capstone Design. Descriptive Statistics

BNG 495 Capstone Design. Descriptive Statistics BNG 495 Capstone Design Descriptive Statistics Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential statistical methods, with a focus

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

REVIEW: Midterm Exam. Spring 2012

REVIEW: Midterm Exam. Spring 2012 REVIEW: Midterm Exam Spring 2012 Introduction Important Definitions: - Data - Statistics - A Population - A census - A sample Types of Data Parameter (Describing a characteristic of the Population) Statistic

More information

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES INTRODUCTION TO APPLIED STATISTICS NOTES PART - DATA CHAPTER LOOKING AT DATA - DISTRIBUTIONS Individuals objects described by a set of data (people, animals, things) - all the data for one individual make

More information

Introduction to Statistics

Introduction to Statistics Why Statistics? Introduction to Statistics To develop an appreciation for variability and how it effects products and processes. Study methods that can be used to help solve problems, build knowledge and

More information

Chapter 01 : What is Statistics?

Chapter 01 : What is Statistics? Chapter 01 : What is Statistics? Feras Awad Data: The information coming from observations, counts, measurements, and responses. Statistics: The science of collecting, organizing, analyzing, and interpreting

More information

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation

Chapter Four. Numerical Descriptive Techniques. Range, Standard Deviation, Variance, Coefficient of Variation Chapter Four Numerical Descriptive Techniques 4.1 Numerical Descriptive Techniques Measures of Central Location Mean, Median, Mode Measures of Variability Range, Standard Deviation, Variance, Coefficient

More information

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke BIOL 51A - Biostatistics 1 1 Lecture 1: Intro to Biostatistics Smoking: hazardous? FEV (l) 1 2 3 4 5 No Yes Smoke BIOL 51A - Biostatistics 1 2 Box Plot a.k.a box-and-whisker diagram or candlestick chart

More information

Practice problems from chapters 2 and 3

Practice problems from chapters 2 and 3 Practice problems from chapters and 3 Question-1. For each of the following variables, indicate whether it is quantitative or qualitative and specify which of the four levels of measurement (nominal, ordinal,

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept.

Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept. Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3- Measures of Center 3-3 Measures of Variation Section 3-1: Overview Descriptive Statistics summarize or describe the important

More information

Statistics I Chapter 1: Introduction

Statistics I Chapter 1: Introduction Statistics I Chapter 1: Introduction Chapter 1: Introduction Contents What is Statistics? - definition Key-words: population, parameter, sample, statistic, population size, sample size, individuals, objects

More information

Statistics I Chapter 1: Introduction

Statistics I Chapter 1: Introduction Statistics I Chapter 1: Introduction Chapter 1: Introduction Contents What is Statistics? - definition Key-words: population, parameter, sample, statistic, population size, sample size, individuals, objects

More information

Overview of Dispersion. Standard. Deviation

Overview of Dispersion. Standard. Deviation 15.30 STATISTICS UNIT II: DISPERSION After reading this chapter, students will be able to understand: LEARNING OBJECTIVES To understand different measures of Dispersion i.e Range, Quartile Deviation, Mean

More information

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are

More information

Notation Measures of Location Measures of Dispersion Standardization Proportions for Categorical Variables Measures of Association Outliers

Notation Measures of Location Measures of Dispersion Standardization Proportions for Categorical Variables Measures of Association Outliers Notation Measures of Location Measures of Dispersion Standardization Proportions for Categorical Variables Measures of Association Outliers Population - all items of interest for a particular decision

More information

CHAPTER 2: Describing Distributions with Numbers

CHAPTER 2: Describing Distributions with Numbers CHAPTER 2: Describing Distributions with Numbers The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner Lecture PowerPoint Slides Chapter 2 Concepts 2 Measuring Center: Mean and Median Measuring

More information

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the

CHAPTER 4 VARIABILITY ANALYSES. Chapter 3 introduced the mode, median, and mean as tools for summarizing the CHAPTER 4 VARIABILITY ANALYSES Chapter 3 introduced the mode, median, and mean as tools for summarizing the information provided in an distribution of data. Measures of central tendency are often useful

More information

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above King Abdul Aziz University Faculty of Sciences Statistics Department Final Exam STAT 0 First Term 49-430 A 40 Name No ID: Section: You have 40 questions in 9 pages. You have 90 minutes to solve the exam.

More information

Chapter. Numerically Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.

Chapter. Numerically Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc. Chapter 3 Numerically Summarizing Data Section 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data

More information

Chapter 5: Exploring Data: Distributions Lesson Plan

Chapter 5: Exploring Data: Distributions Lesson Plan Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions: Stemplots Describing Center: Mean and Median Describing Variability: The Quartiles The

More information

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not? Hypothesis testing Question Very frequently: what is the possible value of μ? Sample: we know only the average! μ average. Random deviation or not? Standard error: the measure of the random deviation.

More information

Statistics and parameters

Statistics and parameters Statistics and parameters Tables, histograms and other charts are used to summarize large amounts of data. Often, an even more extreme summary is desirable. Statistics and parameters are numbers that characterize

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 2 Methods for Describing Sets of Data Summary of Central Tendency Measures Measure Formula Description Mean x i / n Balance Point Median ( n +1) Middle Value

More information

Math Section SR MW 1-2:30pm. Bekki George: University of Houston. Sections

Math Section SR MW 1-2:30pm. Bekki George: University of Houston. Sections Math 3339 Section 21155 - SR 117 - MW 1-2:30pm Bekki George: bekki@math.uh.edu University of Houston Sections 2.1-2.3 Bekki George (UH) Math 3339 Sections 2.1-2.3 1 / 28 Office Hours: Mondays 11am - 12:30pm,

More information

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS

2/2/2015 GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY MEASURES OF CENTRAL TENDENCY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Spring 2015: Lembo GEOGRAPHY 204: STATISTICAL PROBLEM SOLVING IN GEOGRAPHY CHAPTER 3: DESCRIPTIVE STATISTICS AND GRAPHICS Descriptive statistics concise and easily understood summary of data set characteristics

More information

Chapter 2 Descriptive Statistics

Chapter 2 Descriptive Statistics Chapter 2 Descriptive Statistics Lecture 1: Measures of Central Tendency and Dispersion Donald E. Mercante, PhD Biostatistics May 2010 Biostatistics (LSUHSC) Chapter 2 05/10 1 / 34 Lecture 1: Descriptive

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing

More information

An Introduction to Descriptive Statistics. 2. Manually create a dot plot for small and modest sample sizes

An Introduction to Descriptive Statistics. 2. Manually create a dot plot for small and modest sample sizes Living with the Lab Winter 2013 An Introduction to Descriptive Statistics Gerald Recktenwald v: January 25, 2013 gerry@me.pdx.edu Learning Objectives By reading and studying these notes you should be able

More information

Unit 2: Numerical Descriptive Measures

Unit 2: Numerical Descriptive Measures Unit 2: Numerical Descriptive Measures Summation Notation Measures of Central Tendency Measures of Dispersion Chebyshev's Rule Empirical Rule Measures of Relative Standing Box Plots z scores Jan 28 10:48

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

Chapter 2 Solutions Page 15 of 28

Chapter 2 Solutions Page 15 of 28 Chapter Solutions Page 15 of 8.50 a. The median is 55. The mean is about 105. b. The median is a more representative average" than the median here. Notice in the stem-and-leaf plot on p.3 of the text that

More information

21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS

21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS 21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS Table of Contents Functions... 2 Polynomials and Rational Functions... 3 Exponential Functions... 4 Logarithmic

More information

CH 1: Data and Statistics

CH 1: Data and Statistics CH 1: Data and Statistics 1. Key Definitions: a. Elements: The entities on which data are collected. b. Population (universe): The set of all elements of interest in a particular study. ( size N) c. Sample:

More information

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures

More information

Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010

Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010 Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010 Review Recording observations - Must extract that which is to be analyzed: coding systems,

More information

Statistical Methods: Introduction, Applications, Histograms, Ch

Statistical Methods: Introduction, Applications, Histograms, Ch Outlines Statistical Methods: Introduction, Applications, Histograms, Characteristics November 4, 2004 Outlines Part I: Statistical Methods: Introduction and Applications Part II: Statistical Methods:

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Histograms: Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Sep 9 1:13 PM Shape: Skewed left Bell shaped Symmetric Bi modal Symmetric Skewed

More information

Math 221, REVIEW, Instructor: Susan Sun Nunamaker

Math 221, REVIEW, Instructor: Susan Sun Nunamaker Math 221, REVIEW, Instructor: Susan Sun Nunamaker Good Luck & Contact me through through e-mail if you have any questions. 1. Bar graphs can only be vertical. a. true b. false 2.

More information

Statistics lecture 3. Bell-Shaped Curves and Other Shapes

Statistics lecture 3. Bell-Shaped Curves and Other Shapes Statistics lecture 3 Bell-Shaped Curves and Other Shapes Goals for lecture 3 Realize many measurements in nature follow a bell-shaped ( normal ) curve Understand and learn to compute a standardized score

More information