Sampling (Statistics)

Size: px
Start display at page:

Download "Sampling (Statistics)"

Transcription

1 Systems & Biomedical Engineering Department SBE 304: Bio-Statistics Random Sampling and Sampling Distributions Dr. Ayman Eldeib Fall 2018 Sampling (Statistics) Sampling is that part of statistical practice concerned with the selection of a subset of individual observations within a population of individuals intended to yield some knowledge about the population of concern, especially for the purposes of making predictions based on statistical inference. Sampling is an important aspect of data collection. 1

2 Sampling (Statistics) Random Sampling Random sampling is a sampling technique where we select a group of subjects (a sample) for study from a larger group (a population). Each individual is chosen entirely by chance and each member of the population has a known, but possibly non-equal, chance of being included in the sample. Simple Random Sampling Simple random sampling is the basic sampling technique where we select a group of subjects (a sample) for study from a larger group (a population). Each individual is chosen entirely by chance and each member of the population has an equal chance of being included in the sample; i.e. each member of the population is equally likely to be chosen at any stage in the sampling process. Sampling (Statistics) Simple Random Sampling Simple random sampling refers to a sampling method that has the following properties: * The population consists of N objects. * The sample consists of n objects. * All possible samples of n objects are equally likely to occur. The main benefit of simple random sampling is that it guarantees that the sample chosen is representative of the population. This ensures that the statistical conclusions will be valid. 2

3 Sampling (Statistics) To understand sampling, you need to first understand a few basic definitions: The total set of observations that can be made is called the population. A sample is a subset of a population A parameter is a measurable characteristic of a population, such as a mean or standard deviation. A statistic is a measurable characteristic of a sample, such as a mean or standard deviation; i.e., a statistic is any function of the observations in a random sample. A sampling method is a procedure for selecting sample elements from a population. The Mode Measure of Central Tendency The mode is the value that occurs the most frequently in the population or sample. Suppose we draw a sample of five women and measure their weights. They weigh 100, 100, 130, 140, and 150. Since more women weigh 100 than any other weight, the mode would equal 100 pounds. The mode is not necessarily unique, since the same maximum frequency may be attained at different values. Given the list of data [1, 1, 2, 4, 4] the mode is not unique - the dataset may be said to be bimodal, while a set with more than two modes may be described as multimodal. 3

4 Measure of Central Tendency The Median To find the median, we arrange the observations in order from smallest to largest value. If there is an odd number of observations, the median is the middle value. If there is an even number of observations, the median is the average of the two middle values. Thus, in the sample of five women, the median value would be 130 pounds; since 130 pounds is the middle weight. Measure of Central Tendency The Mean The mean of a sample or a population is computed by adding all of the observations and dividing by the number of observations. Returning to the example of the five women, the mean weight would equal ( )/5 = 620/5 = 124 pounds. 4

5 Measure of Central Tendency Proportions and Percentages When the focus is on the degree to which a population possesses a particular attribute, the measure of interest is a percentage or a proportion. A proportion refers to the fraction of the total that possesses a certain attribute. For example, we might ask what proportion of women in our sample weigh less than 135 pounds. Since 3 women weigh less than 135 pounds, the proportion would be 3/5 or A percentage is another way of expressing a proportion. A percentage is equal to the proportion times 100. In our example of the five women, the percent of the total who weigh less than 135 pounds would be 100 * (3/5) or 60 percent. Measure of Central Tendency Notation Of the various measures, the mean and the proportion are most important. The notation used to describe these measures appears below: - - x: Refers to a sample mean. n: Number of observations in the sample p: The proportion of elements in the sample that has a particular attribute. q: The proportion of elements in the sample that does not have a specified attribute. Note that q = 1 - p. X: µ: Refers to a population mean. N: Number of observations in the population P: The proportion of elements in the population that has a particular attribute. Q: The proportion of elements in the population that does not have a specified attribute. Note that Q = 1 - P. Note that capital letters refer to population parameters, and lower-case letters refer to sample statistics. 5

6 Measure of Variability Some parameters attempt to describe the amount of variation between random variables. For example, consider a population of four random variables {5, 5,5, 5}. Here, each of the random variables are equal, so there is no variation. The set {3, 5, 5, 7}, on the other hand, has some variation since some random variables are different. Measure of Variability The Range The range is the simplest measure of variation. It is difference between the biggest and smallest random variable. Range = r = Maximum value - Minimum value Therefore, the range of the four random variables (3, 5, 5, 7} would be 7 3 = 4. 6

7 Measure of Variability Variance It is important to distinguish between the variance of a population and the variance of a sample. They have different notation, and they are computed differently. s 2 : The variance of the sample. s: The standard deviation of the sample. σ 2 : The variance of the population. σ: The standard deviation of the population. Variance Measure of Variability ou may have noticed that the denominator in the calculation of sample variance, unlike the denominator in the calculation of population variance, is n and not (n-1). Why? This is because to calculate the sample variance, deviations with respect to the sample mean are used. Sample observations, tend to be closer to the sample mean than to µ. Thus, the calculated deviations are smaller. As a result, the sample variance obtained is smaller than the population variance. To compensate for this, (n -1) is used as the denominator in place of n in the calculation of sample variance. 7

8 Measure of Variability Variance : Example 1 A population consists of four observations: {1, 3, 5, 7}. What is the variance? Solution: First, we need to compute the population mean. μ = ( ) / 4 = 4, Then, σ 2 = Σ ( X i -μ ) 2 / N = [ ] / 4 = 5 Variance : Example 2 A sample consists of four observations: {1, 3, 5, 7}. What is the variance? Solution: This problem is handled exactly like the previous problem, except that we use the formula for calculating sample variance, rather than the formula for calculating population variance. s 2 = Σ ( x i - x ) 2 / ( n - 1 ) = [ ] / 3 = 20 / 3 = Measure of Variability Variance A more efficient computational formula for the sample variance is obtained as follows: = 8

9 Sampling Distributions For any given population of size N it is possible to get different samples of size n. Each sample may well have a different mean. In fact it is possible to get an entire distribution of different sample means from the various possible samples. The list of all possible values for a statistic and the probability associated with each value is known as a sampling distribution. Sampling Distributions Theorem When the population we are sampling from has, itself, roughly the shape of a normal curve, the sampling distribution of the mean can be approximated closely with a normal distribution regardless of the size of n. 9

10 For large samples ( n 30) the sampling distribution of the sample mean can be approximated closely with a normal distribution with mean equal to the population mean and standard error equal to where σ µ µ σ is the standard deviation of the population. The sampling distribution of the mean will be approximately normally distributed no matter what the population distribution looks like. σ n The standard error of the mean is defined as the standard deviation of the sampling distribution of the mean. Keep in mind that this theorem applies only to the mean and not other statistics. 10

11 Based on this theorem, if is the mean of random sample of size n from a population with mean σ and standard deviation. If n is large, then µ has approximately the standard normal distribution (using the conversion formula). Z = µ σ n Example I In this example, the population distribution shows five levels of a characteristic/variable that were measured in the population. Each level of the variable had the same probability of occurring. The population distribution is presented below. The scores in the population range from 1 to 5. The mean of the population is 3.0 and the standard deviation of the population is 1.41 [μ=3.0; σ =1.41]. 11

12 Example I Frequency distributions of sample means quickly approach the shape of a normal distribution, even if we are taking relatively few, small samples from a population that is not normally distributed (such as the uniform distribution). Example II Let's randomly select an infinite number of samples of the same size from a population that follows a Poisson distribution and calculate the mean of scores in each sample. The sampling distributions are presented as shown on the right side. 12

13 Example III Let's randomly select an infinite number of samples of the same size from a population that follows a Normal distribution and calculate the mean of scores in each sample. The sampling distributions are presented as shown on the right side. Example IV A synthetic fiber used in manufacturing carpet has tensile strength that is normally distributed with mean 75.5 psi and standard deviation 3.5 psi. Find the probability that a random sample of n = 49 fiber specimens will have sample mean tensile strength that exceeds psi. σ = μ = 75.5 psi σ = = 0.5 n µ Z = µ σ n P( 75.75) = P( Z 0.5) 13

MEASURES OF LOCATION AND SPREAD

MEASURES OF LOCATION AND SPREAD MEASURES OF LOCATION AND SPREAD Frequency distributions and other methods of data summarization and presentation explained in the previous lectures provide a fairly detailed description of the data and

More information

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different

More information

Lecture 2. Descriptive Statistics: Measures of Center

Lecture 2. Descriptive Statistics: Measures of Center Lecture 2. Descriptive Statistics: Measures of Center Descriptive Statistics summarize or describe the important characteristics of a known set of data Inferential Statistics use sample data to make inferences

More information

Sampling, Frequency Distributions, and Graphs (12.1)

Sampling, Frequency Distributions, and Graphs (12.1) 1 Sampling, Frequency Distributions, and Graphs (1.1) Design: Plan how to obtain the data. What are typical Statistical Methods? Collect the data, which is then subjected to statistical analysis, which

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11.

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11. Chapter 4 Statistics 45 CHAPTER 4 BASIC QUALITY CONCEPTS 1.0 Continuous Distributions.0 Measures of Central Tendency 3.0 Measures of Spread or Dispersion 4.0 Histograms and Frequency Distributions 5.0

More information

Ch. 17. DETERMINATION OF SAMPLE SIZE

Ch. 17. DETERMINATION OF SAMPLE SIZE LOGO Ch. 17. DETERMINATION OF SAMPLE SIZE Dr. Werner R. Murhadi www.wernermurhadi.wordpress.com Descriptive and Inferential Statistics descriptive statistics is Statistics which summarize and describe

More information

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing 1. Purpose of statistical inference Statistical inference provides a means of generalizing

More information

Descriptive Univariate Statistics and Bivariate Correlation

Descriptive Univariate Statistics and Bivariate Correlation ESC 100 Exploring Engineering Descriptive Univariate Statistics and Bivariate Correlation Instructor: Sudhir Khetan, Ph.D. Wednesday/Friday, October 17/19, 2012 The Central Dogma of Statistics used to

More information

Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept.

Chapter 3 Statistics for Describing, Exploring, and Comparing Data. Section 3-1: Overview. 3-2 Measures of Center. Definition. Key Concept. Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3- Measures of Center 3-3 Measures of Variation Section 3-1: Overview Descriptive Statistics summarize or describe the important

More information

The Empirical Rule, z-scores, and the Rare Event Approach

The Empirical Rule, z-scores, and the Rare Event Approach Overview The Empirical Rule, z-scores, and the Rare Event Approach Look at Chebyshev s Rule and the Empirical Rule Explore some applications of the Empirical Rule How to calculate and use z-scores Introducing

More information

Probability and Samples. Sampling. Point Estimates

Probability and Samples. Sampling. Point Estimates Probability and Samples Sampling We want the results from our sample to be true for the population and not just the sample But our sample may or may not be representative of the population Sampling error

More information

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved.

1-1. Chapter 1. Sampling and Descriptive Statistics by The McGraw-Hill Companies, Inc. All rights reserved. 1-1 Chapter 1 Sampling and Descriptive Statistics 1-2 Why Statistics? Deal with uncertainty in repeated scientific measurements Draw conclusions from data Design valid experiments and draw reliable conclusions

More information

Lecture 3: Chapter 3

Lecture 3: Chapter 3 Lecture 3: Chapter 3 C C Moxley UAB Mathematics 26 January 16 3.2 Measurements of Center Statistics involves describing data sets and inferring things about them. The first step in understanding a set

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

STAT 200 Chapter 1 Looking at Data - Distributions

STAT 200 Chapter 1 Looking at Data - Distributions STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the

More information

Chapter 3. Data Description

Chapter 3. Data Description Chapter 3. Data Description Graphical Methods Pie chart It is used to display the percentage of the total number of measurements falling into each of the categories of the variable by partition a circle.

More information

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest:

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest: 1 Chapter 3 - Descriptive stats: Numerical measures 3.1 Measures of Location Mean Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size Example: The number

More information

Chapter. Numerically Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc.

Chapter. Numerically Summarizing Data. Copyright 2013, 2010 and 2007 Pearson Education, Inc. Chapter 3 Numerically Summarizing Data Section 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data

More information

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com 1 Chapter 1: Introduction Material from Devore s book (Ed 8), and Cengagebrain.com Populations and Samples An investigation of some characteristic of a population of interest. Example: Say you want to

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 6 Patrick Breheny University of Iowa to Biostatistics (BIOS 4120) 1 / 36 Our next several lectures will deal with two-sample inference for continuous

More information

- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size.

- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size. Descriptive Statistics: One of the most important things we can do is to describe our data. Some of this can be done graphically (you should be familiar with histograms, boxplots, scatter plots and so

More information

Math 120 Chapter 3 additional notes

Math 120 Chapter 3 additional notes Math 120 Chapter 3 additional notes MEASURES OF DISPERSION AND NORMAL DISTRIBUTION MEASURES OF DISPERSION In addition to knowing the measures of central tendency, it is often important to know how widely

More information

The Normal Distribution. The Gaussian Curve. Advantages of using Z-score. Importance of normal or Gaussian distribution (ND)

The Normal Distribution. The Gaussian Curve. Advantages of using Z-score. Importance of normal or Gaussian distribution (ND) Importance of normal or Gaussian distribution (ND) The Normal It is the most used distribution Most method are based on the assumption of ND Sum of many independent, random contributions variables (grain

More information

Statistical Inference for Means

Statistical Inference for Means Statistical Inference for Means Jamie Monogan University of Georgia February 18, 2011 Jamie Monogan (UGA) Statistical Inference for Means February 18, 2011 1 / 19 Objectives By the end of this meeting,

More information

Chapter 8: Sampling Distributions. A survey conducted by the U.S. Census Bureau on a continual basis. Sample

Chapter 8: Sampling Distributions. A survey conducted by the U.S. Census Bureau on a continual basis. Sample Chapter 8: Sampling Distributions Section 8.1 Distribution of the Sample Mean Frequently, samples are taken from a large population. Example: American Community Survey (ACS) A survey conducted by the U.S.

More information

Measures of Central Tendency

Measures of Central Tendency Measures of Central Tendency Summary Measures Summary Measures Central Tendency Mean Median Mode Quartile Range Variance Variation Coefficient of Variation Standard Deviation Measures of Central Tendency

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction Engineers and scientists are constantly exposed to collections of facts, or data. The discipline of statistics provides methods for organizing and summarizing data, and for drawing

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

Complete Solutions to Examination Questions Complete Solutions to Examination Questions 16

Complete Solutions to Examination Questions Complete Solutions to Examination Questions 16 Complete Solutions to Examination Questions 16 1 Complete Solutions to Examination Questions 16 1. The simplest way to evaluate the standard deviation and mean is to use these functions on your calculator.

More information

Figure Figure

Figure Figure Figure 4-12. Equal probability of selection with simple random sampling of equal-sized clusters at first stage and simple random sampling of equal number at second stage. The next sampling approach, shown

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number

More information

Business Statistics: A Decision-Making Approach 6 th Edition. Chapter Goals

Business Statistics: A Decision-Making Approach 6 th Edition. Chapter Goals Chapter 6 Student Lecture Notes 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 6 Introduction to Sampling Distributions Chap 6-1 Chapter Goals To use information from the sample

More information

Part 13: The Central Limit Theorem

Part 13: The Central Limit Theorem Part 13: The Central Limit Theorem As discussed in Part 12, the normal distribution is a very good model for a wide variety of real world data. And in this section we will give even further evidence of

More information

Descriptive Statistics

Descriptive Statistics Sherif Khalifa Sherif Khalifa () Descriptive Statistics 1 / 34 Definition Measures of central tendency yield information about the center, or middle part, of a group of numbers. Mode Median Mean Percentiles

More information

3.1 Measure of Center

3.1 Measure of Center 3.1 Measure of Center Calculate the mean for a given data set Find the median, and describe why the median is sometimes preferable to the mean Find the mode of a data set Describe how skewness affects

More information

UNIVERSITY OF TORONTO MISSISSAUGA. SOC222 Measuring Society In-Class Test. November 11, 2011 Duration 11:15a.m. 13 :00p.m.

UNIVERSITY OF TORONTO MISSISSAUGA. SOC222 Measuring Society In-Class Test. November 11, 2011 Duration 11:15a.m. 13 :00p.m. UNIVERSITY OF TORONTO MISSISSAUGA SOC222 Measuring Society In-Class Test November 11, 2011 Duration 11:15a.m. 13 :00p.m. Location: DV2074 Aids Allowed You may be charged with an academic offence for possessing

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Describing distributions with numbers

Describing distributions with numbers Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Section 1.3 with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE Chapter 1 Exploring Data Introduction: Data Analysis: Making Sense of Data 1.1

More information

Midrange: mean of highest and lowest scores. easy to compute, rough estimate, rarely used

Midrange: mean of highest and lowest scores. easy to compute, rough estimate, rarely used Measures of Central Tendency Mode: most frequent score. best average for nominal data sometimes none or multiple modes in a sample bimodal or multimodal distributions indicate several groups included in

More information

CHAPTER 3. YAKUP ARI,Ph.D.(C)

CHAPTER 3. YAKUP ARI,Ph.D.(C) CHAPTER 3 YAKUP ARI,Ph.D.(C) math.stat.yeditepe@gmail.com REMEMBER!!! The purpose of descriptive statistics is to summarize and organize a set of scores. One of methods of descriptive statistics is to

More information

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Overview 3-2 Measures

More information

BNG 495 Capstone Design. Descriptive Statistics

BNG 495 Capstone Design. Descriptive Statistics BNG 495 Capstone Design Descriptive Statistics Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential statistical methods, with a focus

More information

Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010

Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010 Variables, distributions, and samples (cont.) Phil 12: Logic and Decision Making Fall 2010 UC San Diego 10/18/2010 Review Recording observations - Must extract that which is to be analyzed: coding systems,

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny November 11 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data

More information

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. Chapter 3 Numerically Summarizing Data Chapter 3.1 Measures of Central Tendency Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode. A1. Mean The

More information

Chapter 15 Sampling Distribution Models

Chapter 15 Sampling Distribution Models Chapter 15 Sampling Distribution Models 1 15.1 Sampling Distribution of a Proportion 2 Sampling About Evolution According to a Gallup poll, 43% believe in evolution. Assume this is true of all Americans.

More information

Sections 6.1 and 6.2: The Normal Distribution and its Applications

Sections 6.1 and 6.2: The Normal Distribution and its Applications Sections 6.1 and 6.2: The Normal Distribution and its Applications Definition: A normal distribution is a continuous, symmetric, bell-shaped distribution of a variable. The equation for the normal distribution

More information

Chapter 3 Data Description

Chapter 3 Data Description Chapter 3 Data Description Section 3.1: Measures of Central Tendency Section 3.2: Measures of Variation Section 3.3: Measures of Position Section 3.1: Measures of Central Tendency Definition of Average

More information

Math 10 - Compilation of Sample Exam Questions + Answers

Math 10 - Compilation of Sample Exam Questions + Answers Math 10 - Compilation of Sample Exam Questions + Sample Exam Question 1 We have a population of size N. Let p be the independent probability of a person in the population developing a disease. Answer the

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Sampling Distributions Parameter and Statistic A is a numerical descriptive measure of a population. Since it is based on the observations in the population, its value is almost always unknown.

More information

Chapter (3) Describing Data Numerical Measures Examples

Chapter (3) Describing Data Numerical Measures Examples Chapter (3) Describing Data Numerical Measures Examples Numeric Measurers Measures of Central Tendency Measures of Dispersion Arithmetic mean Mode Median Geometric Mean Range Variance &Standard deviation

More information

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami

Unit Two Descriptive Biostatistics. Dr Mahmoud Alhussami Unit Two Descriptive Biostatistics Dr Mahmoud Alhussami Descriptive Biostatistics The best way to work with data is to summarize and organize them. Numbers that have not been summarized and organized are

More information

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved

Chapter. Numerically Summarizing Data Pearson Prentice Hall. All rights reserved Chapter 3 Numerically Summarizing Data Section 3.1 Measures of Central Tendency Objectives 1. Determine the arithmetic mean of a variable from raw data 2. Determine the median of a variable from raw data

More information

Chapter 6: SAMPLING DISTRIBUTIONS

Chapter 6: SAMPLING DISTRIBUTIONS Chapter 6: SAMPLING DISTRIBUTIONS Read Section 1.5 Graphical methods may not always be sufficient for describing data. Numerical measures can be created for both populations and samples. Definition A numerical

More information

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES VARIABLE Studying the behavior of random variables, and more importantly functions of random variables is essential for both the

More information

Unit 1: Statistics. Mrs. Valentine Math III

Unit 1: Statistics. Mrs. Valentine Math III Unit 1: Statistics Mrs. Valentine Math III 1.1 Analyzing Data Statistics Study, analysis, and interpretation of data Find measure of central tendency Mean average of the data Median Odd # data pts: middle

More information

Lecture 8 Sampling Theory

Lecture 8 Sampling Theory Lecture 8 Sampling Theory Thais Paiva STA 111 - Summer 2013 Term II July 11, 2013 1 / 25 Thais Paiva STA 111 - Summer 2013 Term II Lecture 8, 07/11/2013 Lecture Plan 1 Sampling Distributions 2 Law of Large

More information

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes

ADMS2320.com. We Make Stats Easy. Chapter 4. ADMS2320.com Tutorials Past Tests. Tutorial Length 1 Hour 45 Minutes We Make Stats Easy. Chapter 4 Tutorial Length 1 Hour 45 Minutes Tutorials Past Tests Chapter 4 Page 1 Chapter 4 Note The following topics will be covered in this chapter: Measures of central location Measures

More information

Chapter 1 - Lecture 3 Measures of Location

Chapter 1 - Lecture 3 Measures of Location Chapter 1 - Lecture 3 of Location August 31st, 2009 Chapter 1 - Lecture 3 of Location General Types of measures Median Skewness Chapter 1 - Lecture 3 of Location Outline General Types of measures What

More information

Quiz 2 covered materials in Chapter 5 Chapter 6 Chapter 7. Normal Probability Distribution. Continuous Probability. distribution 11/9/2010.

Quiz 2 covered materials in Chapter 5 Chapter 6 Chapter 7. Normal Probability Distribution. Continuous Probability. distribution 11/9/2010. Quiz covered materials in Chapter 5 Chapter 6 Chapter 7 Normal Probability Distribution Chapter 6 Continuous Probability Distribution Normal Distribution Normal Distribution is symmetrical and bell shaped,

More information

1.3: Describing Quantitative Data with Numbers

1.3: Describing Quantitative Data with Numbers 1.3: Describing Quantitative Data with Numbers Section 1.3 Describing Quantitative Data with Numbers After this section, you should be able to MEASURE center with the mean and median MEASURE spread with

More information

Chapter 7 Sampling Distributions

Chapter 7 Sampling Distributions Statistical inference looks at how often would this method give a correct answer if it was used many many times. Statistical inference works best when we produce data by random sampling or randomized comparative

More information

BIOS 2041: Introduction to Statistical Methods

BIOS 2041: Introduction to Statistical Methods BIOS 2041: Introduction to Statistical Methods Abdus S Wahed* *Some of the materials in this chapter has been adapted from Dr. John Wilson s lecture notes for the same course. Chapter 0 2 Chapter 1 Introduction

More information

2.1 Measures of Location (P.9-11)

2.1 Measures of Location (P.9-11) MATH1015 Biostatistics Week.1 Measures of Location (P.9-11).1.1 Summation Notation Suppose that we observe n values from an experiment. This collection (or set) of n values is called a sample. Let x 1

More information

3.1 Measures of Central Tendency: Mode, Median and Mean. Average a single number that is used to describe the entire sample or population

3.1 Measures of Central Tendency: Mode, Median and Mean. Average a single number that is used to describe the entire sample or population . Measures of Central Tendency: Mode, Median and Mean Average a single number that is used to describe the entire sample or population. Mode a. Easiest to compute, but not too stable i. Changing just one

More information

Lecture 2 and Lecture 3

Lecture 2 and Lecture 3 Lecture 2 and Lecture 3 1 Lecture 2 and Lecture 3 We can describe distributions using 3 characteristics: shape, center and spread. These characteristics have been discussed since the foundation of statistics.

More information

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- # Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola Chapter 3 Statistics for Describing, Exploring, and Comparing Data 3-1 Review and Preview 3-2 Measures

More information

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1 9.1 Scatter Plots and Linear Correlation Answers 1. A high school psychologist wants to conduct a survey to answer the question: Is there a relationship between a student s athletic ability and his/her

More information

Def 1 A population consists of the totality of the observations with which we are concerned.

Def 1 A population consists of the totality of the observations with which we are concerned. Chapter 6 Sampling Distributions 6.1 Random Sampling Def 1 A population consists of the totality of the observations with which we are concerned. Remark 1. The size of a populations may be finite or infinite.

More information

Inferences Based on Two Samples

Inferences Based on Two Samples Chapter 6 Inferences Based on Two Samples Frequently we want to use statistical techniques to compare two populations. For example, one might wish to compare the proportions of families with incomes below

More information

Introduction to Statistics

Introduction to Statistics Introduction to Statistics By A.V. Vedpuriswar October 2, 2016 Introduction The word Statistics is derived from the Italian word stato, which means state. Statista refers to a person involved with the

More information

Resistant Measure - A statistic that is not affected very much by extreme observations.

Resistant Measure - A statistic that is not affected very much by extreme observations. Chapter 1.3 Lecture Notes & Examples Section 1.3 Describing Quantitative Data with Numbers (pp. 50-74) 1.3.1 Measuring Center: The Mean Mean - The arithmetic average. To find the mean (pronounced x bar)

More information

Algebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves)

Algebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves) Algebra 2 Outliers Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves) Algebra 2 Notes #1 Chp 12 Outliers In a set of numbers, sometimes there will be

More information

Lecture Topic 4: Chapter 7 Sampling and Sampling Distributions

Lecture Topic 4: Chapter 7 Sampling and Sampling Distributions Lecture Topic 4: Chapter 7 Sampling and Sampling Distributions Statistical Inference: The aim is to obtain information about a population from information contained in a sample. A population is the set

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

Math May 13, Final Exam

Math May 13, Final Exam Math 447 - May 13, 2013 - Final Exam Name: Read these instructions carefully: The points assigned are not meant to be a guide to the difficulty of the problems. If the question is multiple choice, there

More information

Unit 4 Probability. Dr Mahmoud Alhussami

Unit 4 Probability. Dr Mahmoud Alhussami Unit 4 Probability Dr Mahmoud Alhussami Probability Probability theory developed from the study of games of chance like dice and cards. A process like flipping a coin, rolling a die or drawing a card from

More information

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency

Math 120 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency Math 1 Introduction to Statistics Mr. Toner s Lecture Notes 3.1 Measures of Central Tendency The word average: is very ambiguous and can actually refer to the mean, median, mode or midrange. Notation:

More information

- a value calculated or derived from the data.

- a value calculated or derived from the data. Descriptive statistics: Note: I'm assuming you know some basics. If you don't, please read chapter 1 on your own. It's pretty easy material, and it gives you a good background as to why we need statistics.

More information

Dr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)

Dr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines) Dr. Maddah ENMG 617 EM Statistics 10/12/12 Nonparametric Statistics (Chapter 16, Hines) Introduction Most of the hypothesis testing presented so far assumes normally distributed data. These approaches

More information

CHAPTER 5 Probabilistic Features of the Distributions of Certain Sample Statistics

CHAPTER 5 Probabilistic Features of the Distributions of Certain Sample Statistics CHAPTER 5 Probabilistic Features of the Distributions of Certain Sample Statistics Key Words Sampling Distributions Distribution of the Sample Mean Distribution of the difference between Two Sample Means

More information

SECOND UNIVERSITY EXAMINATION

SECOND UNIVERSITY EXAMINATION OLLSCOIL NA héireann, GAILLIMH NATIONAL UNIVERSITY OF IRELAND, GALWAY AUTUMN EXAMINATIONS, 2000 2001 SECOND UNIVERSITY EXAMINATION STATISTICS [MA237] Dr. D. Harrington, Dr. J.N. Sheahan, Paul Wilson, M.A.,

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 2 Methods for Describing Sets of Data Summary of Central Tendency Measures Measure Formula Description Mean x i / n Balance Point Median ( n +1) Middle Value

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific

More information

Chapter 6 Continuous Probability Distributions

Chapter 6 Continuous Probability Distributions Math 3 Chapter 6 Continuous Probability Distributions The observations generated by different statistical experiments have the same general type of behavior. The followings are the probability distributions

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 3 Numerical Descriptive Measures 3-1 Learning Objectives In this chapter, you learn: To describe the properties of central tendency, variation,

More information

CHAPTER 2: Describing Distributions with Numbers

CHAPTER 2: Describing Distributions with Numbers CHAPTER 2: Describing Distributions with Numbers The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner Lecture PowerPoint Slides Chapter 2 Concepts 2 Measuring Center: Mean and Median Measuring

More information

Week 1: Intro to R and EDA

Week 1: Intro to R and EDA Statistical Methods APPM 4570/5570, STAT 4000/5000 Populations and Samples 1 Week 1: Intro to R and EDA Introduction to EDA Objective: study of a characteristic (measurable quantity, random variable) for

More information

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com

Chapter 1: Introduction. Material from Devore s book (Ed 8), and Cengagebrain.com 1 Chapter 1: Introduction Material from Devore s book (Ed 8), and Cengagebrain.com Populations and Samples An investigation of some characteristic of a population of interest. Example: Say you want to

More information

The t-statistic. Student s t Test

The t-statistic. Student s t Test The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very

More information

MAT2377. Ali Karimnezhad. Version December 13, Ali Karimnezhad

MAT2377. Ali Karimnezhad. Version December 13, Ali Karimnezhad MAT2377 Ali Karimnezhad Version December 13, 2016 Ali Karimnezhad Comments These slides cover material from Chapter 4. In class, I may use a blackboard. I recommend reading these slides before you come

More information

Topic Page: Central tendency

Topic Page: Central tendency Topic Page: Central tendency Definition: measures of central tendency from Dictionary of Psychological Testing, Assessment and Treatment summary statistics which divide the data into two halves (i.e. half

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching/ Suhasini Subba Rao Review In the previous lecture we looked at the statistics of M&Ms. This example illustrates

More information

Measures of Central Tendency:

Measures of Central Tendency: Measures of Central Tendency: Mean, Median, Mode CasperWendy Measures of Central Tendency Measure of central tendency provides a very convenient way of describing a set of scores with a single number that

More information

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6. Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized

More information