ST 371 (IX): Theories of Sampling Distributions

Size: px
Start display at page:

Download "ST 371 (IX): Theories of Sampling Distributions"

Transcription

1 ST 371 (IX): Theories of Sampling Distributions 1 Sample, Population, Parameter and Statistic The major use of inferential statistics is to use information from a sample to infer characteristics about a population. A population is the complete collection of subjects to be studied; it contains all subjects of interest. A sample is a part of the population of interest, a sub-collection selected from a population. A parameter describes a characteristic of a population, while a statistic describes a characteristic of a sample. In general, we will use a statistic to infer the value of a parameter. Unbiased Sample: A sample is unbiased if every individual or the element in the population has an equal chance of being selected. Next we discuss several examples occurred in survey sampling. 1. Survey in presidential election. (a) Option I: Call all registered voters on the phone and ask them who they will vote for. Although this would provide a very accurate result, it would be a very tedious and time consuming project. (b) Option II: Call 4 registered voters,1 in each time zone, and ask them who they will vote for. Although this is a very easy task, the results would not be very reliable. (c) Option III: Randomly select 20,000 registered voters and poll them. The population of interest here is all registered voters, and the parameter is the percentage of them that will vote for a candidate. The sample is the 20,000 registered voters that were polled, and the statistic is the percentage of them that will vote for a candidate. 2. Kathy wants to know how many students in her city use the internet for learning purposes. She used an poll. Based on the replies 1

2 to her poll, she found that 83% of those surveyed used the internet. Kathys sample is biased as she surveyed only the students those who use the internet. She should have randomly selected a few schools and colleges in the city to conduct the survey. 3. Another classic example of a biased sample and the misleading results it produced occurred in In the early days of opinion polling, the American Literary Digest magazine collected over two million postal surveys and predicted that the Republican candidate in the U.S. presidential election, Alf Landon, would beat the incumbent president, Franklin Roosevelt by a large margin. The result was the exact opposite. The Literary Digest survey represented a sample collected from readers of the magazine, supplemented by records of registered automobile owners and telephone users. This sample included an overrepresentation of individuals who were rich, who, as a group, were more likely to vote for the Republican candidate. In contrast, a poll of only 50 thousand citizens selected by George Gallup s organization successfully predicted the result, leading to the popularity of the Gallup poll. Conclusion: To use a sample to make inferences about a population, the sample should be representative of the population (unbiased). 2 Statistics and their Distributions A statistic is a random variable, denoted by an upper case letter whose value can be computed from sample data. We often use a statistic to infer the value of a parameter. Examples include Measures of location: Suppose we observe n realizations of random variable X: x 1,, x n, the sample mean is x = 1 n n i=1 x i. In contrast, the population mean is E(X) = µ. 2

3 the sample median: let x (1),, x (n) denote the ordered values. If n is odd, then x = x ( n+1 2 ). If n is even, x = 1/2[x ( n 2 ) + x ( n 2 +1) ]. In contrast, the population median is µ = FX 1 (0.5). Measure of variability: the sample variance S 2 = 1 n 1 Note that the population variance is n (x i x) 2. i=1 σ 2 = V (X) = E(X µ) 2. Measure of contrasts: Consider random samples from two populations {x 1,, x n } and {y 1,, y m }, for example, in a randomized clinical trial, the difference of the quality of life (QOL) between the patients (or survival time, or cure rate) on two treatment arms T = x ȳ. The contrast between two populations is µ X µ Y = E(X) E(Y ). Each statistic is a random variable and has a probability distribution. The probability distribution of a statistic is referred to as its sampling distribution. The sampling distribution depends not only on the population distribution but also on the method of sampling. The most widely used sampling method is random sampling with replacement. The random variables X 1,, X n are said to form a random sample of size n, or be independently identically distributed (i.i.d.), if 1. The X i s are independent rv s. 2. Every X i has the same probability distribution. Denote by µ and σ 2 the mean and variance of the random variable X. The next theorem follows from the results on the distribution of a linear combination that we shall discuss in Section 4. 3

4 Theorem on the distribution of the sample mean X. 1. E( X) = µ X = µ. 2. V( X) = σ 2 X = σ 2 /n. 3. σ X = σ/ n. Example 1 Let X 1,, X 5 be a random sample from a normal distribution with µ = 1.5 and σ = Find the probability that P ( X 2.0). Find the variance of 5 i=1 X i. 4

5 Example 2 Service time for a certain bank transaction is a random variable having an exponential distribution with parameter λ. Suppose X 1 and X 2 are service times for two independent customers. Consider the average service time X = (X 1 + X 2 )/2. Find the cdf of X. Find the pdf of X. Find the mean and variance of X. 5

6 3 Limit Theorems 3.1 Weak law of large numbers Consider a sample of independent and identically distributed random variables X 1,, X n. The relationship between the sample mean X n = X X n n and true mean of the X i s, E(X i ) = µ, is a problem of pivotal importance in statistics. Typically, µ is unknown and we would like to estimate µ based on X n. The weak law of large numbers says that the sample mean converges in probability to µ. This means that for a large enough sample size n, Xn will be close to µ with high probability. The weak law of large numbers. Let X 1, X 2, be a sequence of independent and identically distributed random variables, each having finite mean E(X i ) = µ. Then, for any ɛ > 0, (3.1) P { Xn µ ε } 0 as n. Example 3 A numerical study of the law of large numbers. We first simulate normal random variables from N(5, 1) with different sample sizes, then calculate the difference between the sample mean and the population mean. n Bias: Xn µ We can see that X n based on a large n tends to be closer to µ than does X n based on a small n. Example 4 (optional) Application of Weak Law of Large Numbers: Monte Carlo Integration. Suppose that we wish to calculate I(f) = f(x)dx,

7 where the integration cannot be done by elementary means or evaluated using tables of integrals. The most common approach is to use a numerical method in which the integral is approximated by a sum; various schemes and computer packages exist for doing this. Another method, called the Monte Carlo method, works in the following way. Generate independent uniform random variables on (0,1), that is, X 1,, X n, and compute I( ˆf) = 1 n n f(x i ). By the law of large numbers, for large n, this should be close to E[f(X)], which is simply E[f(X)] = 1 0 i=1 f(x)dx = I(f). This simple scheme can easily be modified in order to change the range of integration and in other ways. Compared to the standard numerical methods, it is not especially efficient in one dimension, but becomes increasingly efficient as the dimensionality of the integral grows. 3.2 Strong law of large numbers (optional) The strong law of large numbers states that for a sequence of independent and identically distributed random variables X 1, X 2,, the sample mean converges almost surely to the mean of the random variables E(X i ) = µ. Let be a sequence of independent and identically distributed random variables, each having a finite mean µ = E(X i ). Then, with probability 1, X µ as n. The weak law of large numbers states that for any specified large value n, Xn is likely to be near µ. However, it does not say that X n is bound to stay near µ for all values of n larger than n. Thus, it leaves open the possibly that large values of X n µ can occur infinitely often (thought at infrequent intervals). The strong law shows that this cannot occur. In particular, it implies that with probability 1, for any positive value ɛ, X n µ will be greater than ɛ only a finite number of times. 7

8 The strong law of large numbers is of enormous importance, because it provides a direct link between the axioms of probability and the frequency interpretation of probability. If we accept the interpretation that with probability 1 means with certainty, then we can say that P (E) is the limit of the long-run relative frequency of times E would occur in repeated, independent trials of the experiment. 3.3 Central limit theorem The weak law of large numbers says that for X 1,, X n, iid, the sample mean X n is close to E(X i ) = µ when n is large. The Central Limit Theorem provides a more precise approximation by showing that a magnification of the distribution of Xn around µ has approximately a standard normal distribution: The Central Limit Theorem (CLT): Let X 1,, X n be a sequence of independent and identically distributed random variables, each having finite mean E(X i ) = µ and finite variance Var(X i ) = σ 2. Then the distribution X of n µ tends to the standard normal distribution as n. That is, for σ/ n < a <, ( X1 + + X n nµ P σ n ) a 1 a e x2 /2 dx 2π as n. The theorem can be thought of as roughly saying that the sum of a large number of iid random variables has a distribution that is approximately normal. By writing X X n nµ σ n = n ( Xn µ ) (σ n) = X n µ σ/ n, we see that the CLT says that the sample mean X n has a approximately a normal distribution with mean µ and variance σ/ n. The CLT is a remarkable result - only assuming that a sequence of iid random variables have a finite mean and variance, the central limit theorem shows that the mean of the sequence, suitably standardized, always converges to having a 8

9 standard normal distribution. The normal approximation to the binomial distribution is a special case of the central limit theorem. Consider a skewed distribution (lognormal). Consider the histogram of the sample mean X n for n = 1, 5, 20, 50. n=1 n=5 Frequency Frequency x1.bar x2.bar n=10 n=30 Frequency Frequency x3.bar x4.bar We can see from the histograms that the sampling distributions become progressively less skewed as the sample size n increases, therefore the distribution can be better approximated by a normal distribution. This interesting result shows that the central limit theorem can be successfully applied when n is large. In general, the rule of thumb is n > 30. 9

10 Example 5 An Airline overbooks a flight because it expects that there will be no-shows. Assume that (i) There are 200 seats available on the flight. (ii) Seats are occupied only by individuals who made reservations (no standbys). (iii) The probability that a person who made a reservation shows up for the flight is (iv) Reservations show up for the flight independently of each other. 1. If the airline accepts 220 reservations, write an expression for the exact probability that the plane will be full (i.e., at least 200 reservations show up). Use the central limit theorem to approximate this probability. 2. Suppose the airline wants to choose a number n of reservations so that the probability that at least 200 of the n reservations show up is Find the (approximate) minimum value of n. 10

11 Example 6 The number of parking tickets issued in Raleigh on any given weekday has a Poisson distribution with parameter λ = 50. What is the approximate probability that (a) Between 35 and 70 tickets are given out on a particular day? (b) The total number of tickets given out during a 5-day week is between 225 and 275? 11

12 4 Distribution of a Linear Combination Given a collection of n random variables X 1,, X n and n numerical constants a 1,, a n, the rv Y = a 1 X a n X n = n a i X i i=1 is called a linear combination of the X i s. Let X 1,, X n have means µ 1,, µ n, respectively, and variances σ 2 1,, σ 2 n, respectively. Then 1. E(a 1 X 1 + a 2 X a n X n ) = a 1 E(X 1 ) + a 2 E(X 2 ) + + a n E(X n ) = a 1 µ a n µ n. 2. If X 1, X 2,, X n are independent, then Var(a 1 X 1 +, a n X n ) = a 2 1Var(X 1 )+ +a 2 nvar(x n ) = a 2 1σ a 2 nσ 2 n. 3. For any (possibly dependent) random variables X 1,, X n, Var(a 1 X a n X n ) = n i=1 n a i a j Cov(X i, X j ). j=1 The case of normal random variables: If X 1,, X n are independent, normally distributed rv s, then any particular linear combination of the X i s are also normally distributed. Special cases: 1. E( X) = µ X = µ. 2. If all X i are independent, V( X) = σ 2 X = σ 2 /n. 3. E(X 1 X 2 ) = E(X 1 ) E(X 2 ). 4. If X i are independent, then V (X 1 X 2 ) = V (X 1 ) + V (X 2 ). Otherwise V (X 1 X 2 ) = V (X 1 ) + V (X 2 ) 2Cov(X 1, X 2 ). 12

13 Example 7 The total revenue from the sale of the three grades of gasoline on a particular day was Y = 21.2X X X 3. Assume that X 1, X 2 and X 3 are independent with µ 1 = 1000, µ 2 = 500, µ 3 = 300, σ 1 = 100, σ 2 = 80 and σ 3 = 50. What is the probability that the revenue exceeds 45000? 13

14 Example 8 A student has a class that is supposed to end at 9am and another class that is supposed to begin at 9:10am. Suppose that the actual ending time (after 9 in minutes) X 1 N(2, ) and the starting time of the next class X 2 N(10, 1 2 ). Suppose also that the time to get from one location to next location X 3 N(6, 1 2 ). What is the probability that a student makes it to the second class before the lecture starts. 14

15 Example 9 Three different roads feed into a particular freeway entrance. Suppose that during a fixed time period, the number of cars coming from each road onto the freeway X i is normally distributed, with X 1 N(750, 16 2 ), X 2 N(1000, 24 2 ) and X 3 N(550, 18 2 ). (a). What is the expected total number of cars entering the freeway at this point during the period? (b). Suppose X 1, X 2 and X 3 are independent. Find the probability P (X 1 + X 2 + X 3 > 2500). (c). Now suppose that the three streams of traffic are not independent, and Cov(X 1, X 2 ) = 80, Cov(X 1, X 3 ) = 90 and Cov(X 2, X 3 ) = 100. Compute the expected value and variance of the total number of entering cars. 15

Chapter Goals. To introduce you to data collection

Chapter Goals. To introduce you to data collection Chapter Goals To introduce you to data collection You will learn to think critically about the data collected or presented learn various methods for selecting a sample Formulate Theories Interpret Results/Make

More information

Lecture 8 Sampling Theory

Lecture 8 Sampling Theory Lecture 8 Sampling Theory Thais Paiva STA 111 - Summer 2013 Term II July 11, 2013 1 / 25 Thais Paiva STA 111 - Summer 2013 Term II Lecture 8, 07/11/2013 Lecture Plan 1 Sampling Distributions 2 Law of Large

More information

AP Statistics Review Ch. 7

AP Statistics Review Ch. 7 AP Statistics Review Ch. 7 Name 1. Which of the following best describes what is meant by the term sampling variability? A. There are many different methods for selecting a sample. B. Two different samples

More information

STAT Chapter 5 Continuous Distributions

STAT Chapter 5 Continuous Distributions STAT 270 - Chapter 5 Continuous Distributions June 27, 2012 Shirin Golchi () STAT270 June 27, 2012 1 / 59 Continuous rv s Definition: X is a continuous rv if it takes values in an interval, i.e., range

More information

1 Exercises for lecture 1

1 Exercises for lecture 1 1 Exercises for lecture 1 Exercise 1 a) Show that if F is symmetric with respect to µ, and E( X )

More information

Chapter 18: Sampling Distributions

Chapter 18: Sampling Distributions Chapter 18: Sampling Distributions All random variables have probability distributions, and as statistics are random variables, they too have distributions. The random phenomenon that produces the statistics

More information

Probability and Statistics Notes

Probability and Statistics Notes Probability and Statistics Notes Chapter Five Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Five Notes Spring 2011 1 / 37 Outline 1

More information

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example continued : Coin tossing Math 425 Intro to Probability Lecture 37 Kenneth Harris kaharri@umich.edu Department of Mathematics University of Michigan April 8, 2009 Consider a Bernoulli trials process with

More information

STAT 516 Midterm Exam 2 Friday, March 7, 2008

STAT 516 Midterm Exam 2 Friday, March 7, 2008 STAT 516 Midterm Exam 2 Friday, March 7, 2008 Name Purdue student ID (10 digits) 1. The testing booklet contains 8 questions. 2. Permitted Texas Instruments calculators: BA-35 BA II Plus BA II Plus Professional

More information

Statistics 427: Sample Final Exam

Statistics 427: Sample Final Exam Statistics 427: Sample Final Exam Instructions: The following sample exam was given several quarters ago in Stat 427. The same topics were covered in the class that year. This sample exam is meant to be

More information

Lecture 8 Continuous Random Variables

Lecture 8 Continuous Random Variables Lecture 8 Continuous Random Variables Example: The random number generator will spread its output uniformly across the entire interval from 0 to 1 as we allow it to generate a long sequence of numbers.

More information

Lecture 7: Confidence interval and Normal approximation

Lecture 7: Confidence interval and Normal approximation Lecture 7: Confidence interval and Normal approximation 26th of November 2015 Confidence interval 26th of November 2015 1 / 23 Random sample and uncertainty Example: we aim at estimating the average height

More information

Discrete Distributions

Discrete Distributions Discrete Distributions STA 281 Fall 2011 1 Introduction Previously we defined a random variable to be an experiment with numerical outcomes. Often different random variables are related in that they have

More information

Introduction to Statistical Data Analysis Lecture 4: Sampling

Introduction to Statistical Data Analysis Lecture 4: Sampling Introduction to Statistical Data Analysis Lecture 4: Sampling James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis 1 / 30 Introduction

More information

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li)

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Department of Educational Psychology University of Illinois at Urbana-Champaign 1 Inferential

More information

December 2010 Mathematics 302 Name Page 2 of 11 pages

December 2010 Mathematics 302 Name Page 2 of 11 pages December 2010 Mathematics 302 Name Page 2 of 11 pages [9] 1. An urn contains red balls, 10 green balls and 1 yellow balls. You randomly select balls, without replacement. (a What ( is( the probability

More information

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).

More information

HT Introduction. P(X i = x i ) = e λ λ x i

HT Introduction. P(X i = x i ) = e λ λ x i MODS STATISTICS Introduction. HT 2012 Simon Myers, Department of Statistics (and The Wellcome Trust Centre for Human Genetics) myers@stats.ox.ac.uk We will be concerned with the mathematical framework

More information

Introduction to Probability

Introduction to Probability LECTURE NOTES Course 6.041-6.431 M.I.T. FALL 2000 Introduction to Probability Dimitri P. Bertsekas and John N. Tsitsiklis Professors of Electrical Engineering and Computer Science Massachusetts Institute

More information

Chapter 4: Continuous Random Variables and Probability Distributions

Chapter 4: Continuous Random Variables and Probability Distributions Chapter 4: and Probability Distributions Walid Sharabati Purdue University February 14, 2014 Professor Sharabati (Purdue University) Spring 2014 (Slide 1 of 37) Chapter Overview Continuous random variables

More information

Math 494: Mathematical Statistics

Math 494: Mathematical Statistics Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/

More information

Applied Statistics I

Applied Statistics I Applied Statistics I Liang Zhang Department of Mathematics, University of Utah July 8, 2008 Liang Zhang (UofU) Applied Statistics I July 8, 2008 1 / 15 Distribution for Sample Mean Liang Zhang (UofU) Applied

More information

STAT 430/510 Probability Lecture 12: Central Limit Theorem and Exponential Distribution

STAT 430/510 Probability Lecture 12: Central Limit Theorem and Exponential Distribution STAT 430/510 Probability Lecture 12: Central Limit Theorem and Exponential Distribution Pengyuan (Penelope) Wang June 15, 2011 Review Discussed Uniform Distribution and Normal Distribution Normal Approximation

More information

Limiting Distributions

Limiting Distributions We introduce the mode of convergence for a sequence of random variables, and discuss the convergence in probability and in distribution. The concept of convergence leads us to the two fundamental results

More information

σ. We further know that if the sample is from a normal distribution then the sampling STAT 2507 Assignment # 3 (Chapters 7 & 8)

σ. We further know that if the sample is from a normal distribution then the sampling STAT 2507 Assignment # 3 (Chapters 7 & 8) STAT 2507 Assignment # 3 (Chapters 7 & 8) DUE: Sections E, F Section G Section H Monday, March 16, in class Tuesday, March 17, in class Wednesday, March 18, in class Last Name Student # First Name Your

More information

MTH135/STA104: Probability

MTH135/STA104: Probability MTH35/STA04: Probability Homework # 3 Due: Tuesday, Sep 0, 005 Prof. Robert Wolpert. from prob 7 p. 9 You roll a fair, six-sided die and I roll a die. You win if the number showing on your die is strictly

More information

The Central Limit Theorem

The Central Limit Theorem The Central Limit Theorem Patrick Breheny September 27 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 31 Kerrich s experiment Introduction 10,000 coin flips Expectation and

More information

The variable θ is called the parameter of the model, and the set Ω is called the parameter space.

The variable θ is called the parameter of the model, and the set Ω is called the parameter space. Lecture 8 What is a statistical model? A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced the data. The variable θ is called

More information

Chris Piech CS109 CS109 Final Exam. Fall Quarter Dec 14 th, 2017

Chris Piech CS109 CS109 Final Exam. Fall Quarter Dec 14 th, 2017 Chris Piech CS109 CS109 Final Exam Fall Quarter Dec 14 th, 2017 This is a closed calculator/computer exam. You are, however, allowed to use notes in the exam. The last page of the exam is a Standard Normal

More information

Mathematics 375 Probability and Statistics I Final Examination Solutions December 14, 2009

Mathematics 375 Probability and Statistics I Final Examination Solutions December 14, 2009 Mathematics 375 Probability and Statistics I Final Examination Solutions December 4, 9 Directions Do all work in the blue exam booklet. There are possible regular points and possible Extra Credit points.

More information

1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay

1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay 1 MA421 Introduction Ashis Gangopadhyay Department of Mathematics and Statistics Boston University c Ashis Gangopadhyay 1.1 Introduction 1.1.1 Some key statistical concepts 1. Statistics: Art of data analysis,

More information

Queueing Theory and Simulation. Introduction

Queueing Theory and Simulation. Introduction Queueing Theory and Simulation Based on the slides of Dr. Dharma P. Agrawal, University of Cincinnati and Dr. Hiroyuki Ohsaki Graduate School of Information Science & Technology, Osaka University, Japan

More information

Probability Distributions Columns (a) through (d)

Probability Distributions Columns (a) through (d) Discrete Probability Distributions Columns (a) through (d) Probability Mass Distribution Description Notes Notation or Density Function --------------------(PMF or PDF)-------------------- (a) (b) (c)

More information

Lecture 7: Chapter 7. Sums of Random Variables and Long-Term Averages

Lecture 7: Chapter 7. Sums of Random Variables and Long-Term Averages Lecture 7: Chapter 7. Sums of Random Variables and Long-Term Averages ELEC206 Probability and Random Processes, Fall 2014 Gil-Jin Jang gjang@knu.ac.kr School of EE, KNU page 1 / 15 Chapter 7. Sums of Random

More information

Solutions - Final Exam

Solutions - Final Exam Solutions - Final Exam Instructors: Dr. A. Grine and Dr. A. Ben Ghorbal Sections: 170, 171, 172, 173 Total Marks Exercise 1 7 Exercise 2 6 Exercise 3 6 Exercise 4 6 Exercise 5 6 Exercise 6 9 Total 40 Score

More information

Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets

Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets Introduction to Statistical Data Analysis Lecture 1: Working with Data Sets James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis

More information

Chapter 8: Confidence Interval Estimation: Further Topics

Chapter 8: Confidence Interval Estimation: Further Topics Chapter 8: Confidence Interval Estimation: Further Topics Department of Mathematics Izmir University of Economics Week 11 2014-2015 Introduction In this chapter we will focus on inferential statements

More information

7.1: What is a Sampling Distribution?!?!

7.1: What is a Sampling Distribution?!?! 7.1: What is a Sampling Distribution?!?! Section 7.1 What Is a Sampling Distribution? After this section, you should be able to DISTINGUISH between a parameter and a statistic DEFINE sampling distribution

More information

Introduction and Overview STAT 421, SP Course Instructor

Introduction and Overview STAT 421, SP Course Instructor Introduction and Overview STAT 421, SP 212 Prof. Prem K. Goel Mon, Wed, Fri 3:3PM 4:48PM Postle Hall 118 Course Instructor Prof. Goel, Prem E mail: goel.1@osu.edu Office: CH 24C (Cockins Hall) Phone: 614

More information

Statistics 135 Fall 2007 Midterm Exam

Statistics 135 Fall 2007 Midterm Exam Name: Student ID Number: Statistics 135 Fall 007 Midterm Exam Ignore the finite population correction in all relevant problems. The exam is closed book, but some possibly useful facts about probability

More information

Chapter 7 Sampling Distributions

Chapter 7 Sampling Distributions Statistical inference looks at how often would this method give a correct answer if it was used many many times. Statistical inference works best when we produce data by random sampling or randomized comparative

More information

Probability Distributions for Continuous Variables. Probability Distributions for Continuous Variables

Probability Distributions for Continuous Variables. Probability Distributions for Continuous Variables Probability Distributions for Continuous Variables Probability Distributions for Continuous Variables Let X = lake depth at a randomly chosen point on lake surface If we draw the histogram so that the

More information

Week 2: Review of probability and statistics

Week 2: Review of probability and statistics Week 2: Review of probability and statistics Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ALL RIGHTS RESERVED

More information

Statistics and Sampling distributions

Statistics and Sampling distributions Statistics and Sampling distributions a statistic is a numerical summary of sample data. It is a rv. The distribution of a statistic is called its sampling distribution. The rv s X 1, X 2,, X n are said

More information

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1 Math 66/566 - Midterm Solutions NOTE: These solutions are for both the 66 and 566 exam. The problems are the same until questions and 5. 1. The moment generating function of a random variable X is M(t)

More information

If we want to analyze experimental or simulated data we might encounter the following tasks:

If we want to analyze experimental or simulated data we might encounter the following tasks: Chapter 1 Introduction If we want to analyze experimental or simulated data we might encounter the following tasks: Characterization of the source of the signal and diagnosis Studying dependencies Prediction

More information

Chapter 2 Queueing Theory and Simulation

Chapter 2 Queueing Theory and Simulation Chapter 2 Queueing Theory and Simulation Based on the slides of Dr. Dharma P. Agrawal, University of Cincinnati and Dr. Hiroyuki Ohsaki Graduate School of Information Science & Technology, Osaka University,

More information

Chapter 1: Revie of Calculus and Probability

Chapter 1: Revie of Calculus and Probability Chapter 1: Revie of Calculus and Probability Refer to Text Book: Operations Research: Applications and Algorithms By Wayne L. Winston,Ch. 12 Operations Research: An Introduction By Hamdi Taha, Ch. 12 OR441-Dr.Khalid

More information

Statistics 1B. Statistics 1B 1 (1 1)

Statistics 1B. Statistics 1B 1 (1 1) 0. Statistics 1B Statistics 1B 1 (1 1) 0. Lecture 1. Introduction and probability review Lecture 1. Introduction and probability review 2 (1 1) 1. Introduction and probability review 1.1. What is Statistics?

More information

Definition: A random variable X is a real valued function that maps a sample space S into the space of real numbers R. X : S R

Definition: A random variable X is a real valued function that maps a sample space S into the space of real numbers R. X : S R Random Variables Definition: A random variable X is a real valued function that maps a sample space S into the space of real numbers R. X : S R As such, a random variable summarizes the outcome of an experiment

More information

Chapter 6 Continuous Probability Distributions

Chapter 6 Continuous Probability Distributions Continuous Probability Distributions Learning Objectives 1. Understand the difference between how probabilities are computed for discrete and continuous random variables. 2. Know how to compute probability

More information

Exponential, Gamma and Normal Distribuions

Exponential, Gamma and Normal Distribuions Exponential, Gamma and Normal Distribuions Sections 5.4, 5.5 & 6.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 9-3339 Cathy Poliak,

More information

Continuous Random Variables. What continuous random variables are and how to use them. I can give a definition of a continuous random variable.

Continuous Random Variables. What continuous random variables are and how to use them. I can give a definition of a continuous random variable. Continuous Random Variables Today we are learning... What continuous random variables are and how to use them. I will know if I have been successful if... I can give a definition of a continuous random

More information

Common Discrete Distributions

Common Discrete Distributions Common Discrete Distributions Statistics 104 Autumn 2004 Taken from Statistics 110 Lecture Notes Copyright c 2004 by Mark E. Irwin Common Discrete Distributions There are a wide range of popular discrete

More information

STA 584 Supplementary Examples (not to be graded) Fall, 2003

STA 584 Supplementary Examples (not to be graded) Fall, 2003 Page 1 of 8 Central Michigan University Department of Mathematics STA 584 Supplementary Examples (not to be graded) Fall, 003 1. (a) If A and B are independent events, P(A) =.40 and P(B) =.70, find (i)

More information

Chapter 5 continued. Chapter 5 sections

Chapter 5 continued. Chapter 5 sections Chapter 5 sections Discrete univariate distributions: 5.2 Bernoulli and Binomial distributions Just skim 5.3 Hypergeometric distributions 5.4 Poisson distributions Just skim 5.5 Negative Binomial distributions

More information

What Is a Sampling Distribution? DISTINGUISH between a parameter and a statistic

What Is a Sampling Distribution? DISTINGUISH between a parameter and a statistic Section 8.1A What Is a Sampling Distribution? Learning Objectives After this section, you should be able to DISTINGUISH between a parameter and a statistic DEFINE sampling distribution DISTINGUISH between

More information

Chapter 8: Confidence Intervals

Chapter 8: Confidence Intervals Chapter 8: Confidence Intervals Introduction Suppose you are trying to determine the mean rent of a two-bedroom apartment in your town. You might look in the classified section of the newspaper, write

More information

Econ 250 Winter 2009 Assignment 2 - Solutions

Econ 250 Winter 2009 Assignment 2 - Solutions Eco50 Winter 2009 Assignment 2 - Solutions. For a restaurant, the time it takes to deliver pizza (in minutes) is uniform over the interval (25, 37). Determine the proportion of deliveries that are made

More information

Confidence Intervals for the Mean of Non-normal Data Class 23, Jeremy Orloff and Jonathan Bloom

Confidence Intervals for the Mean of Non-normal Data Class 23, Jeremy Orloff and Jonathan Bloom Confidence Intervals for the Mean of Non-normal Data Class 23, 8.05 Jeremy Orloff and Jonathan Bloom Learning Goals. Be able to derive the formula for conservative normal confidence intervals for the proportion

More information

Continuous Probability Distributions

Continuous Probability Distributions 1 Chapter 5 Continuous Probability Distributions 5.1 Probability density function Example 5.1.1. Revisit Example 3.1.1. 11 12 13 14 15 16 21 22 23 24 25 26 S = 31 32 33 34 35 36 41 42 43 44 45 46 (5.1.1)

More information

Let X be a continuous random variable, < X < f(x) is the so called probability density function (pdf) if

Let X be a continuous random variable, < X < f(x) is the so called probability density function (pdf) if University of California, Los Angeles Department of Statistics Statistics 1A Instructor: Nicolas Christou Continuous probability distributions Let X be a continuous random variable, < X < f(x) is the so

More information

Limiting Distributions

Limiting Distributions Limiting Distributions We introduce the mode of convergence for a sequence of random variables, and discuss the convergence in probability and in distribution. The concept of convergence leads us to the

More information

SS257a Midterm Exam Monday Oct 27 th 2008, 6:30-9:30 PM Talbot College 342 and 343. You may use simple, non-programmable scientific calculators.

SS257a Midterm Exam Monday Oct 27 th 2008, 6:30-9:30 PM Talbot College 342 and 343. You may use simple, non-programmable scientific calculators. SS657a Midterm Exam, October 7 th 008 pg. SS57a Midterm Exam Monday Oct 7 th 008, 6:30-9:30 PM Talbot College 34 and 343 You may use simple, non-programmable scientific calculators. This exam has 5 questions

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

December 2010 Mathematics 302 Name Page 2 of 11 pages

December 2010 Mathematics 302 Name Page 2 of 11 pages December 2010 Mathematics 302 Name Page 2 of 11 pages [9] 1. An urn contains 5 red balls, 10 green balls and 15 yellow balls. You randomly select 5 balls, without replacement. What is the probability that

More information

COMPSCI 240: Reasoning Under Uncertainty

COMPSCI 240: Reasoning Under Uncertainty COMPSCI 240: Reasoning Under Uncertainty Andrew Lan and Nic Herndon University of Massachusetts at Amherst Spring 2019 Lecture 20: Central limit theorem & The strong law of large numbers Markov and Chebyshev

More information

Discrete probability distributions

Discrete probability distributions Discrete probability s BSAD 30 Dave Novak Fall 08 Source: Anderson et al., 05 Quantitative Methods for Business th edition some slides are directly from J. Loucks 03 Cengage Learning Covered so far Chapter

More information

Lecture 13. Poisson Distribution. Text: A Course in Probability by Weiss 5.5. STAT 225 Introduction to Probability Models February 16, 2014

Lecture 13. Poisson Distribution. Text: A Course in Probability by Weiss 5.5. STAT 225 Introduction to Probability Models February 16, 2014 Lecture 13 Text: A Course in Probability by Weiss 5.5 STAT 225 Introduction to Probability Models February 16, 2014 Whitney Huang Purdue University 13.1 Agenda 1 2 3 13.2 Review So far, we have seen discrete

More information

Business Statistics:

Business Statistics: Chapter 7 Student Lecture Notes 7-1 Department of Quantitative Methods & Information Systems Business Statistics: Chapter 7 Introduction to Sampling Distributions QMIS 220 Dr. Mohammad Zainal Chapter Goals

More information

STAT100 Elementary Statistics and Probability

STAT100 Elementary Statistics and Probability STAT100 Elementary Statistics and Probability Exam, Sample Test, Summer 014 Solution Show all work clearly and in order, and circle your final answers. Justify your answers algebraically whenever possible.

More information

Sampling Distribution Models. Chapter 17

Sampling Distribution Models. Chapter 17 Sampling Distribution Models Chapter 17 Objectives: 1. Sampling Distribution Model 2. Sampling Variability (sampling error) 3. Sampling Distribution Model for a Proportion 4. Central Limit Theorem 5. Sampling

More information

Asymptotic Statistics-III. Changliang Zou

Asymptotic Statistics-III. Changliang Zou Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (

More information

Review. A Bernoulli Trial is a very simple experiment:

Review. A Bernoulli Trial is a very simple experiment: Review A Bernoulli Trial is a very simple experiment: Review A Bernoulli Trial is a very simple experiment: two possible outcomes (success or failure) probability of success is always the same (p) the

More information

Chapter 6: Functions of Random Variables

Chapter 6: Functions of Random Variables Chapter 6: Functions of Random Variables We are often interested in a function of one or several random variables, U(Y 1,..., Y n ). We will study three methods for determining the distribution of a function

More information

STAT 418: Probability and Stochastic Processes

STAT 418: Probability and Stochastic Processes STAT 418: Probability and Stochastic Processes Spring 2016; Homework Assignments Latest updated on April 29, 2016 HW1 (Due on Jan. 21) Chapter 1 Problems 1, 8, 9, 10, 11, 18, 19, 26, 28, 30 Theoretical

More information

Using Dice to Introduce Sampling Distributions Written by: Mary Richardson Grand Valley State University

Using Dice to Introduce Sampling Distributions Written by: Mary Richardson Grand Valley State University Using Dice to Introduce Sampling Distributions Written by: Mary Richardson Grand Valley State University richamar@gvsu.edu Overview of Lesson In this activity students explore the properties of the distribution

More information

Week 1 Basic Statistical Concepts, Part I

Week 1 Basic Statistical Concepts, Part I Week 1 Basic Statistical Concepts, Part I Week 1 Objectives This week we will give an introduction to the statistical package R, and to statistics. R will be used throughout this class and new features

More information

1 Basic continuous random variable problems

1 Basic continuous random variable problems Name M362K Final Here are problems concerning material from Chapters 5 and 6. To review the other chapters, look over previous practice sheets for the two exams, previous quizzes, previous homeworks and

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

5.2 Continuous random variables

5.2 Continuous random variables 5.2 Continuous random variables It is often convenient to think of a random variable as having a whole (continuous) interval for its set of possible values. The devices used to describe continuous probability

More information

Continuous random variables

Continuous random variables Continuous random variables Can take on an uncountably infinite number of values Any value within an interval over which the variable is definied has some probability of occuring This is different from

More information

6 The normal distribution, the central limit theorem and random samples

6 The normal distribution, the central limit theorem and random samples 6 The normal distribution, the central limit theorem and random samples 6.1 The normal distribution We mentioned the normal (or Gaussian) distribution in Chapter 4. It has density f X (x) = 1 σ 1 2π e

More information

Course information: Instructor: Tim Hanson, Leconte 219C, phone Office hours: Tuesday/Thursday 11-12, Wednesday 10-12, and by appointment.

Course information: Instructor: Tim Hanson, Leconte 219C, phone Office hours: Tuesday/Thursday 11-12, Wednesday 10-12, and by appointment. Course information: Instructor: Tim Hanson, Leconte 219C, phone 777-3859. Office hours: Tuesday/Thursday 11-12, Wednesday 10-12, and by appointment. Text: Applied Linear Statistical Models (5th Edition),

More information

AP Statistics - Chapter 7 notes

AP Statistics - Chapter 7 notes AP Statistics - Chapter 7 notes Day 1: 7.1 Sampling Distributions Read 416 417 What is a parameter? What is a statistic? How is one related to the other? Alternate Example: Identify the population, the

More information

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017 CPSC 531: System Modeling and Simulation Carey Williamson Department of Computer Science University of Calgary Fall 2017 Quote of the Day A person with one watch knows what time it is. A person with two

More information

STAT515, Review Worksheet for Midterm 2 Spring 2019

STAT515, Review Worksheet for Midterm 2 Spring 2019 STAT55, Review Worksheet for Midterm 2 Spring 29. During a week, the proportion of time X that a machine is down for maintenance or repair has the following probability density function: 2( x, x, f(x The

More information

Sampling Distributions. Introduction to Inference

Sampling Distributions. Introduction to Inference Sampling Distributions Introduction to Inference Parameter A parameter is a number that describes the population. A parameter always exists but in practice we rarely know it s value because we cannot examine

More information

Confidence Intervals for the Sample Mean

Confidence Intervals for the Sample Mean Confidence Intervals for the Sample Mean As we saw before, parameter estimators are themselves random variables. If we are going to make decisions based on these uncertain estimators, we would benefit

More information

Northwestern University Department of Electrical Engineering and Computer Science

Northwestern University Department of Electrical Engineering and Computer Science Northwestern University Department of Electrical Engineering and Computer Science EECS 454: Modeling and Analysis of Communication Networks Spring 2008 Probability Review As discussed in Lecture 1, probability

More information

Chapter Learning Objectives. Probability Distributions and Probability Density Functions. Continuous Random Variables

Chapter Learning Objectives. Probability Distributions and Probability Density Functions. Continuous Random Variables Chapter 4: Continuous Random Variables and Probability s 4-1 Continuous Random Variables 4-2 Probability s and Probability Density Functions 4-3 Cumulative Functions 4-4 Mean and Variance of a Continuous

More information

Part 3: Parametric Models

Part 3: Parametric Models Part 3: Parametric Models Matthew Sperrin and Juhyun Park August 19, 2008 1 Introduction There are three main objectives to this section: 1. To introduce the concepts of probability and random variables.

More information

Business Statistics:

Business Statistics: Department of Quantitative Methods & Information Systems Business Statistics: Chapter 7 Introduction to Sampling Distributions QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing this chapter,

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Chapter 6 Continuous Probability Distributions

Chapter 6 Continuous Probability Distributions Math 3 Chapter 6 Continuous Probability Distributions The observations generated by different statistical experiments have the same general type of behavior. The followings are the probability distributions

More information

Estimation and Confidence Intervals

Estimation and Confidence Intervals Estimation and Confidence Intervals Sections 7.1-7.3 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 17-3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Exam: practice test 1 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Exam: practice test 1 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Exam: practice test MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Solve the problem. ) Using the information in the table on home sale prices in

More information

15 Discrete Distributions

15 Discrete Distributions Lecture Note 6 Special Distributions (Discrete and Continuous) MIT 4.30 Spring 006 Herman Bennett 5 Discrete Distributions We have already seen the binomial distribution and the uniform distribution. 5.

More information

ax, 0 < x < 1 0, otherwise. f(x)dx = 1).

ax, 0 < x < 1 0, otherwise. f(x)dx = 1). HW 3 (Due Oct. 3, 2017) Name: HW 3.1 Suppose that X has the pdf f(x) = { ax, 0 < x < 1 0, otherwise. (a) Find the value of a (use the requirement f(x)dx = 1). (b) Calculate P (X < 0.3) (c) Calculate P

More information