Statistics for Economists. Lectures 3 & 4

Similar documents
Chapter 2: Discrete Distributions. 2.1 Random Variables of the Discrete Type

Discrete Distributions

Introduction to Statistics. By: Ewa Paszek

Relationship between probability set function and random variable - 2 -

Mathematical Statistics 1 Math A 6330

Statistics for Economists Lectures 6 & 7. Asrat Temesgen Stockholm University

Random Variables Example:

Chapter 3. Discrete Random Variables and Their Probability Distributions

Random Variables. Definition: A random variable (r.v.) X on the probability space (Ω, F, P) is a mapping

Discrete Distributions

Introduction to Statistical Data Analysis Lecture 3: Probability Distributions

Random variables, Expectation, Mean and Variance. Slides are adapted from STAT414 course at PennState

STAT2201. Analysis of Engineering & Scientific Data. Unit 3

Suppose that you have three coins. Coin A is fair, coin B shows heads with probability 0.6 and coin C shows heads with probability 0.8.

Discrete Random Variables

Statistics for Engineers Lecture 2 Discrete Distributions

Lecture 6. Probability events. Definition 1. The sample space, S, of a. probability experiment is the collection of all

Discrete Random Variables

Week 2. Review of Probability, Random Variables and Univariate Distributions

Chapter 3: Discrete Random Variable

Discrete Random Variables

ELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables

Geometric Distribution The characteristics of a geometric experiment are: 1. There are one or more Bernoulli trials with all failures except the last

Chapter 2 Random Variables

Chapter 3 Discrete Random Variables

Lecture 13. Poisson Distribution. Text: A Course in Probability by Weiss 5.5. STAT 225 Introduction to Probability Models February 16, 2014

Binomial and Poisson Probability Distributions

Discrete Random Variable

Topic 3: The Expectation of a Random Variable

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Probability, Random Processes and Inference

Bivariate distributions

Notes for Math 324, Part 17

Math/Stat 352 Lecture 8

Conditional Probability

Introduction to Probability and Statistics Slides 3 Chapter 3

p. 4-1 Random Variables

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

b. ( ) ( ) ( ) ( ) ( ) 5. Independence: Two events (A & B) are independent if one of the conditions listed below is satisfied; ( ) ( ) ( )

Part (A): Review of Probability [Statistics I revision]

Expectations. Definition Let X be a discrete rv with set of possible values D and pmf p(x). The expected value or mean value of X, denoted by E(X ) or

Continuous Probability Spaces

To find the median, find the 40 th quartile and the 70 th quartile (which are easily found at y=1 and y=2, respectively). Then we interpolate:

Lecture 3. Discrete Random Variables

Review of Probability. CS1538: Introduction to Simulations

Statistics for Managers Using Microsoft Excel/SPSS Chapter 4 Basic Probability And Discrete Probability Distributions

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019

Discrete random variables and probability distributions

Random Variables. Statistics 110. Summer Copyright c 2006 by Mark E. Irwin

Chapter 4 : Discrete Random Variables

(Ch 3.4.1, 3.4.2, 4.1, 4.2, 4.3)

Chapter 3. Discrete Random Variables and Their Probability Distributions

It can be shown that if X 1 ;X 2 ;:::;X n are independent r.v. s with

Chapter 6 Expectation and Conditional Expectation. Lectures Definition 6.1. Two random variables defined on a probability space are said to be

MAT X (Spring 2012) Random Variables - Part I

MATH Notebook 5 Fall 2018/2019

Known probability distributions

Northwestern University Department of Electrical Engineering and Computer Science

Topic 3: The Expectation of a Random Variable

IEOR 3106: Introduction to Operations Research: Stochastic Models. Professor Whitt. SOLUTIONS to Homework Assignment 1

MATH 250 / SPRING 2011 SAMPLE QUESTIONS / SET 3

Discrete Probability Distributions

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES

STAT 3610: Review of Probability Distributions

Learning Objectives for Stat 225

2.6 Tools for Counting sample points

Fundamental Tools - Probability Theory II

A Probability Primer. A random walk down a probabilistic path leading to some stochastic thoughts on chance events and uncertain outcomes.

SDS 321: Introduction to Probability and Statistics

STAT/MATH 395 PROBABILITY II

Conditional distributions (discrete case)

STAT/MA 416 Midterm Exam 2 Thursday, October 18, Circle the section you are enrolled in:

Lecture Notes 2 Random Variables. Discrete Random Variables: Probability mass function (pmf)

Outline PMF, CDF and PDF Mean, Variance and Percentiles Some Common Distributions. Week 5 Random Variables and Their Distributions

Common Discrete Distributions

An-Najah National University Faculty of Engineering Industrial Engineering Department. Course : Quantitative Methods (65211)

STAT509: Discrete Random Variable

Things to remember when learning probability distributions:

p. 6-1 Continuous Random Variables p. 6-2

Unit II. Page 1 of 12

II. The Binomial Distribution

3 Multiple Discrete Random Variables

Chapter 2. Random Variable. Define single random variables in terms of their PDF and CDF, and calculate moments such as the mean and variance.

Chapter 4a Probability Models

Lectures on Elementary Probability. William G. Faris

CMPSCI 240: Reasoning Under Uncertainty

Chapter 1: Revie of Calculus and Probability

STAT 430/510 Probability Lecture 7: Random Variable and Expectation

Brief Review of Probability

What is Probability? Probability. Sample Spaces and Events. Simple Event

Probability Theory and Random Variables

1. Sample spaces, events and conditional probabilities. A sample space is a finite or countable set S together with a function. P (x) = 1.

3.4. The Binomial Probability Distribution

2. AXIOMATIC PROBABILITY

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008

Probability Distributions Columns (a) through (d)

Math 151. Rumbos Fall Solutions to Review Problems for Exam 2. Pr(X = 1) = ) = Pr(X = 2) = Pr(X = 3) = p X. (k) =

MgtOp 215 Chapter 5 Dr. Ahn

What is a random variable

STAT/MATH 395 A - PROBABILITY II UW Winter Quarter Moment functions. x r p X (x) (1) E[X r ] = x r f X (x) dx (2) (x E[X]) r p X (x) (3)

Transcription:

Statistics for Economists Lectures 3 & 4 Asrat Temesgen Stockholm University 1

CHAPTER 2- Discrete Distributions 2.1. Random variables of the Discrete Type Definition 2.1.1: Given a random experiment with an outcome space S, a function X that assigns one and only real number X(s) =x to each elements s in S is called a random variable. The space of X is the set of real numbers {x:x(s)=x,s S}, where s S means that the element s belongs to the set S. Definition 2.1-2: The probability mass function (p.m.f) f(x) of a discrete random variable X is a function that satisfies the following properties: a) f(x) >0, X S; b) c) where ACS. Note: Let X denote a random variable (r.v) with one-dimensional space S, a subset of the real numbers. Suppose that the space S contains a countable number of points; that is, either S contains a finite number of points, or the points of S can be put into a one to one correspondence with the positive integers. Such a set S is called a set of discrete points or simply a discrete outcome space. Moreover, the r.v. X is called the discrete type, and X is said to have a distribution of the discrete type. For a r.v. X of the discrete type, the probability P(X=x) is frequently denoted by f(x), and this function f(x) is called the probability mass function (p.m.f). Note that some authors refer to f(x) as the probability function, the frequency function, or the probability density function. Example when a p.m.f is constant on the space or support, we say that the distribution is uniform over that space. As an illustration, suppose X has a discrete uniform distribution on S= {1,2, 6} and its p.m.f. is f(x)= 2

We can generalize this result by letting X have a discrete uniform distribution over the first m positive integers, so that its p.m.f. is f(x)=, x = 1,2, m. Example: Roll a four-sided die twice, and let X equal the larger of the two outcomes if they are different and the common value if they are the same. The outcome space for this experiment is S0=, where we assume that each of these 16 points has probability 1/16. Then P(X=1)=P[(1,1)]= 1/16, P(X=2) = P, and similarly P(X=3) = 5/16 and P(X=4) = 7/16. That is, the p.m.f. of X can be written simple as f(x) = P(X=x)=, x = 1,2,3,4 and f(x) =0 elsewhere (i.e. when x S={1,2,3,4}). (The bar graph and the probability histogram are given in figure 2.1.1 of your text book) Example: A lot (collection) consisting of 100 fuses is inspected by the following procedure: Five fuses are chosen at random and tested; if all 5 blow at the correct amperage, the lot is accepted. Suppose that the lot contains 20 defective fuses. If X is a r.v. equal to the number of defective fuses in the sample of 5, the probability of accepting the lot is: Note: If X has a distribution f(x) = P(X=x) =, Where the space S is the collection of nonnegative integers x that satisfies the inequalities x then we say that the r.v. X has a hypergeometric distribution. 3

2.2. Mathematical Expectation Definition 2.2-1. If f(x) is the p.m.f of the r.v X of the discrete type with space S, and if the summation exists, then the sum is called the mathematical expectation or the expected value of the function u(x), and it is denoted by E That is E Note: We can think of E as a weighted mean of u(x), x S, where the weights are the probabilities f(x) = P(X=x), x. Example: Let the r.v X have the p.m.f. f(x) = 1/3, x S, Where S = {-1,0,1} Let u(x) = X 2 Then However, the support of the r.v Y= X 2 is S1 = {0,1} and P(Y=0)= P(X=0) = 1/3 P(Y=1) = P(X=-1) + P(X=1) =. That is, g(y) = 1/3, Y=0 2/3, Y=1; and S1={0,1}. Hence, Theorem 2.2-1 when it exists, the mathematical expectation E satisfies the following properties: a) If C is a constant, the E(c) = c b) If C is a constant and u is a function, then E 4

c) If C1 and C2 are constants and U1 and U2 are functions, then E Proof: a) E(c) = b) E (x) = C c) E = By applying (b), we obtain E Note: Property (C) can be extended to more than two terms by mathematical induction; that is, we have: C ) E. Because of property (C ), E is often called a linear or distributive operator. Exercise 1. Let X have the p.m.f f(x) = x = 1,2,3,4. Find (a) E(X) (b) E(X 2 ) (C) E [X(5-X)] 2. Let u(x)=(x-b) 2, where b is not a function of X, and suppose exists. Find the value of b for which is a minimum. 2.3. The Mean, Variance, and Standard Deviation Definitions 1. The mean of the r.v. X (or of its distribution), denoted by the Greek letter is given by where X has the space S= and f(x) is the p.m.f (i.e. 2. The variance of the r.v X (or of its distribution) is given by: 5

3. The positive square root of the variance is called the standard deviation of X and is denoted by the Greek letter (sigma) Remark: The variance can be computed in another way Proof: That is, equals the difference of the second moment about the origin and the square of the mean. Example Let the p.m.f. of X be defined by f(x)=, x=1,2,3. Then a) The mean of X is b) The second moment about the origin is 2226+3236=366=6 c) The variance of X is d) The standard deviation of X is = Note: is a measure of the middle of the distirbutino of X, and the standard deviation is a measure of the dispersion. Example: The mean of X, which has a uniform distribution on first m positive integers, is given by To find the variance of X, we first find 6

Thus, the variance of X is Example: We find that if X equals the outcome when rolling a fair six-sided die, the p.m.f of X is f(x)=1/6, x=1,2,,6. The respective mean and variance of X are which agrees with Example: Let X be a r.v. with mean Let Y= ax+b, is a r.v too, where a & b are constants. Then the mean of Y is Moreover, the variance of Y is 2= 2 2 Thus, Note: Var(x-1)= Var(x) i.e. adding or subtracting a constant from X does not change the variance. Definitions 1. Let r be a positive integer. If E is finite it is called the r th moment of the distribution about the origin. 2. The expectation E is called the r th moment of the distribution about b. 3. For a given positive integer r, is called the r th factorial moment. 4. The sample mean (or mean of the sample X 1, X 2,, X n), denoted by, which is, in some sense, an estimate of if the latter is unknown. 7

5. The sample variance denoted by s 2,which is in some sense, a better estimate of an unknown 2, is given by:, where the right-hand expression makes the computation easier. 6. The sample standard deviation, S= is a measure of how dispersed the data are from the sample mean. Example: Rolling a fair six-sided die 5 times could result in the following sample of n=5 observations: X 1=3, X 2=1, X 3=2, X 4=6, X 5=3 In this case, and S = 2.4. Bernoulli Trials and the Binomial Distribution A Bernoulli experiment is a random experiment, the outcome of which can be classified in one of two mutually exclusive and exhaustive ways say, success or failure (e.g, female or male, life or death, non-defective or defective). A sequence of Bernoulli trials occurs when a Bernoulli experiment is performed several independent times. So that the probability of success, say P, remains the same from trial to trial. In such a sequence we let P denote the probability of success on each trial, and we let q=1-p denote the probability of failure. Example Suppose that the probability of germination of a beet seed is 0.8 and the germination of a seed is called a success. If we plant 10 seeds and can assume that the germination of one seed is independent of the germination of another seed, this would correspond to 10 Bernoulli trials with P= 0.8 Note: Let X be a r.v. associated with a Bernoulli trial by defining it as follows: X(success)=1 and X (failure)=0. 8

The p.m.f of X can be written as f(x)=p x (1-P) 1-x, X=0,1, and we say that X has a Bernoulli distribution. The expected value of X is 0 +1 =, and the variance of X is: The standard deviation of X is S= Binomial Distribution f(x) = These probabilities are called binomial probabilities, and the r.v X is said to have a binomial distribution. A binomial experiment satisfies the following properties : 1. A Bernoulli (success-failure) experiment is performed n times. 2. The trials are independent. 3. P(success) = P on each trial, and P(failure) = q=1-p 4. The r.v. x equals the number of successes in the n trials. A binomial distribution will be denoted by the symbol b(n,p) and we say that the distribution of X is b(n,p). The constants n and p are called the parameters of the binomial distribution. Example: In the instant lottery with 20% winning tickets, if x equals to the number of winning tickets among n=8 that are purchased, then the probability of purchasing two winning tickets is f(2) = P(x=2)= Example: In the example for Bernoulli trials, the number X of seeds that germinate in n=10 independent trials is b(10,0.8); that is f(x) = In particular P 9

Also, we could compute Such cumulative probabilities are often of interest. We call the function defined by F(x)=P the cumulative distribution function or more simply, the distribution function of the r.v. X values of the distribution function of a r.v. X that is b(n,p) are given in Table II in the appendix for selected values of n ad p. Remark: Recall that if n is a positive integer, then Thus, if we use this binomial expansion with b=p and a=1-p, then the sum of the binomial probabilities is: since f(x) is a p.m.f We use the binomial expansion to find the mean and the variance of a binomial r.v. X that is b(n,p). The mean is given by: Let K= x-1 or x= K+1 in the latter sum. Then To find the variance, we first find the value of E the second factorial moment.using the second factorial moment, we find that the variance of X is given by: Var(X) =. Now, 10

Letting K = x-2 or x = k+2, we obtain Thus, If X is b(n,p), then We will find the mean and variance with the use of the moment generating function in section 2.5 Remark: Suppose that an urn contains N 1, success balls and N 2 failure balls. Now, let P, and let X equal the number of success balls in a random sample of size n that is taken from this urn. If the sampling is done one at a time with replacement, then the distribution of X is b(n,p); if the sampling is done without replacement, then X has a hypergeometric distribution with p.m.f Where x is a non-negative integer such that. When N 1 +N 2 is large and n is relatively small, it makes little difference if the sampling is done with or without replacement. 11

2.5. The moment-generating Function Definition 2.5.1. Let X be a r.v. of the discrete type with p.m.f f(x) and space S. If there is a positive number h such that exists and is finite for h < t < h, then the function of t defined by M(t) = E is called the moment generating function of X (or of the distribution of X), and often abbreviated as m.g.f Note: First, if we set t=0, we have M(0)=1. Moreover, it S is is given by:, then the m.g.f Thus, the coefficient of Accordingly, if two r.v.s (or two distributions of probability) have the same m.g.f, they must have the same distribution of probability. That is, if the two r.v.s. had the two probability mass functions f(x) and g(y), and the same space S={b 1, b 2, } and if for all t, -h<t< h, then mathematical transform theory requires that So if the m.g.f. exists, there is one and only one distribution of probability associated with that m.g.f Example: If X has the m.g.f. then the probabilities are Note: It can be shown that the existence of M(t), for h<t<h, implies that the derivatives of M(t) of all orders exists at t=0; moreover, it is permissible to interchange differentiation and summation as the series converges uniformly. Thus, and for each positive integer r, 12

In particular, if the m.g.f. exists, then Example: The p.m.f. of the binomial distribution is: E, x= 0,1,2,,n. Thus, the m.g.f. is M(t) = Remark: It is interesting to note that here and elsewhere the m.g.f. is usually rather easy to compute if the p.m.f has a factor involving an exponential, like P x in the binomial p.m.f. Thus, the first two derivatives of M(t) are:, and and = np(1-p) In the special case when n=1, X has a Bernoulli distribution and M(t)=(1-P)+Pe t for all real values of t, Negative Binomial Distribution Suppose we observe a sequence of Bernoulli trials until exactly r successes occur, where r is a fixed positive integer. Let the r.v. X denote the number of trials needed to observe the r th success. By the multiplication rule of probabilities, the p.m.f of X, say g(x), equals the product of the probability of obtaining exactly r-1 successes in the first (x-1) trials and the probability P of a success on the r th tried. Thus, the p.m.f of X is We say that X has a negative binomial distribution. If r=1 in the negative binomial distribution, we say that X has a geometric distribution, since the p.m.f consists of 13

terms of a geometric series, namely, geometric series, the sum is given by Recall that for a. Thus, for the geometric distribution, so that g(x) does satisfy the properties of a p.m.f. From the sum of a geometric series, we also note that when K is an integer, Thus, the value of the distribution function at a positive integer, K is Example: Some biology students were checking eye color in a large number of fruit flies. For the individual fly, suppose that the probability of white eyes is ¼ and the probability of red eyes is ¾, and that we may treat these observations as having independent Bernoulli trials. The probability that at least four flies have to be checked for eye color to observe a white-eyed fly is given by The probability that at most four flies have to be checked for eye color to observe a white-eyed fly is given by: The probability that the first fly with write eyes is the fourth fly considered is: 4= =34314=27256=0.1055 Remark: The mean and the variance of a negative binomial random variable X are, respectively, In particular, if r=1, so that X has a geometric distribution then 14

Example: Suppose that during practice a basketball player can make a free throw 80% of the time. Moreover, assume that a sequence of free-throw shooting can be thought of as independent Bernoulli trials. Let X equal the minimum number of free throws that this player must attempt to make a total of 10 shots. The p.m.f. of X is And we have, for example, P(X=12)= g(12)=. The mean variance, & standard deviation of X are, respectively, 2.6. The Poisson Distribution Definition 2.6-1: Let the number of changes that occur in a given continuous interval be counted. Then we have an approximate Poisson process with parameter >0 if the following conditions are satisfied: a) The numbers of changes occurring in non over lapping intervals are independent. b) The probability of exactly one change occurring in a sufficiently short interval of length h is approximately h. c) The probability of two or more changes occurring in a sufficiently short interval is essentially zero. We say that the r.v. X has a Poisson distribution if its p.m.f is of the form, x= 0,1,2,., where > 0. It is easy to see that f(x) has the properties of a p.m.f because, clearly, f(x) and from the maclaurin series expansion of Note: The m.g.f. of X is 15

Now, M (t)= =. The values of the mean & variance of X are, respectively, = That is, for the Poisson distribution, Remark: It is also possible to find the mean & the variance for the Poisson distribution directly, without using the m.g.f (the proof is given on page 101 of your textbook). Table III in the appendix gives values of the F(x) of a Poisson r.v. Example: let X have a Poisson distribution with a mean of =5 Then, using Table III, we obtain Note: If events in a Poisson process occur at a mean rate of per unit, then the expected number of occurrences in an interval of length t is t. for example, if phone calls arrive at a switch board following a Poisson process at a mean rate of 3 per minute, then the expected number of phone calls in a 5-minute period is (3)(5)=15 or if calls arrive at a mean rate of 22 in a 5-minute period, then the expected number of calls per minute is =22(1/5) = 4.4. Moreover, the number of occurrences, say X in the interval of length t has the Poisson p.m.f Example Telephone calls enter a college switch board on the average of two every 3 minutes. If one assumes an approximate Poisson process, what is the probability of 5 or more calls arriving in a 9-minute period? 16

Solution: let X denote the number of calls in a 9-minute period. We see that E(x)=6, that is, on the average, 6 calls will arrive during a 9-minute period. Thus, by Table III. Note: Not only is the Poisson distribution important in its own right, but it can also be used to approximate probabilities for a binomial distribution. If x has a Poisson distribution with parameter, then, with n large, Where P= /n, so that =np. That is, if X has the binomial distribution b(n,p) with large n and small P, then This approximation is reasonably good if n is large. But since was a fixed constant, p should be small, because np=. In particular, the approximation is quite accurate if but it is not bad in other situations violating these bounds somewhat, such as n=50 and P=0.12 Example: A manufacturer of Christmas tree light bulbs knows that 2% of its bulbs are defective. Assuming independence, we have a binomial distribution with parameters P=0.02 and n=100. To approximate the probability that a box of 100 of these bulbs contains at most 3 defective bulbs, we use the Poisson distribution with =(100)(0.02)=2, which gives from table III in the appendix. Using the binomial distribution, we obtain after some tedious calculations, Hence, in this case, the Poisson approximation is extremely close to the true value, but much easier to find. 17