Intro to Probability Instructor: Alexandre Bouchard

Similar documents
Intro to Probability Instructor: Alexandre Bouchard

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

Lecture 10: Normal RV. Lisa Yan July 18, 2018

STAT Chapter 5 Continuous Distributions

STAT 430/510: Lecture 10

STAT 430/510 Probability Lecture 12: Central Limit Theorem and Exponential Distribution

Things to remember when learning probability distributions:

Statistics 253/317 Introduction to Probability Models. Winter Midterm Exam Friday, Feb 8, 2013

Chapter 5. Chapter 5 sections

Continuous Distributions

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued

Lecture 13. Poisson Distribution. Text: A Course in Probability by Weiss 5.5. STAT 225 Introduction to Probability Models February 16, 2014

Probability Midterm Exam 2:15-3:30 pm Thursday, 21 October 1999

Discrete Distributions

Math Review Sheet, Fall 2008

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

HW7 Solutions. f(x) = 0 otherwise. 0 otherwise. The density function looks like this: = 20 if x [10, 90) if x [90, 100]

Experimental Design and Statistics - AGA47A

Random Variables and Their Distributions

Intro to Probability Instructor: Alexandre Bouchard

Moments. Raw moment: February 25, 2014 Normalized / Standardized moment:

Introduction and Overview STAT 421, SP Course Instructor

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued

Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama

Chapter 5 continued. Chapter 5 sections

1 Probability and Random Variables

Disjointness and Additivity

1 Random Variable: Topics

Course information: Instructor: Tim Hanson, Leconte 219C, phone Office hours: Tuesday/Thursday 11-12, Wednesday 10-12, and by appointment.

Midterm 2 Review. CS70 Summer Lecture 6D. David Dinh 28 July UC Berkeley

Chapter 4: CONTINUOUS RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS

STAT 516 Midterm Exam 2 Friday, March 7, 2008

Probability Models. 4. What is the definition of the expectation of a discrete random variable?

Let X be a continuous random variable, < X < f(x) is the so called probability density function (pdf) if

UC Berkeley, CS 174: Combinatorics and Discrete Probability (Fall 2008) Midterm 1. October 7, 2008

Non-parametric Inference and Resampling

Closed book and notes. 60 minutes. Cover page and four pages of exam. No calculators.

Exponential Distribution and Poisson Process

STAT/MA 416 Midterm Exam 2 Thursday, October 18, Circle the section you are enrolled in:

Lecture 2: Repetition of probability theory and statistics

Exam 3, Math Fall 2016 October 19, 2016

SDS 321: Introduction to Probability and Statistics

t x 1 e t dt, and simplify the answer when possible (for example, when r is a positive even number). In particular, confirm that EX 4 = 3.

ECE 302, Final 3:20-5:20pm Mon. May 1, WTHR 160 or WTHR 172.

3 Continuous Random Variables

Chapter 6: Large Random Samples Sections

EECS 126 Probability and Random Processes University of California, Berkeley: Spring 2017 Kannan Ramchandran March 21, 2017.

4. Distributions of Functions of Random Variables

Properties of Continuous Probability Distributions The graph of a continuous probability distribution is a curve. Probability is represented by area

Lecture 1: August 28

ISyE 3044 Fall 2017 Test #1a Solutions

15 Discrete Distributions

We introduce methods that are useful in:

Decision making and problem solving Lecture 1. Review of basic probability Monte Carlo simulation

Northwestern University Department of Electrical Engineering and Computer Science

Chapter 4. Continuous Random Variables

Probability Distributions

IEOR 4703: Homework 2 Solutions

Topic 4: Continuous random variables

MATH 3510: PROBABILITY AND STATS July 1, 2011 FINAL EXAM

Formulas for probability theory and linear models SF2941

Midterm Examination. STA 205: Probability and Measure Theory. Thursday, 2010 Oct 21, 11:40-12:55 pm

Continuous Probability Distributions. Uniform Distribution

Chapter 4 : Discrete Random Variables

EECS 126 Probability and Random Processes University of California, Berkeley: Spring 2015 Abhay Parekh February 17, 2015.

MATH 407 FINAL EXAM May 6, 2011 Prof. Alexander

DO NOT OPEN THIS QUESTION BOOKLET UNTIL YOU ARE TOLD TO DO SO

This exam contains 6 questions. The questions are of equal weight. Print your name at the top of this page in the upper right hand corner.

S n = x + X 1 + X X n.

Quick review on Discrete Random Variables

Continuous Distributions

STAT 305 Introduction to Statistical Inference

Class 26: review for final exam 18.05, Spring 2014

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example

Prof. Thistleton MAT 505 Introduction to Probability Lecture 13

ECE302 Exam 2 Version A April 21, You must show ALL of your work for full credit. Please leave fractions as fractions, but simplify them, etc.

Fundamental Tools - Probability Theory II

First Year Examination Department of Statistics, University of Florida

Applied Statistics I

STAT 430/510: Lecture 15

Joint Probability Distributions and Random Samples (Devore Chapter Five)

1 Acceptance-Rejection Method

1 Presessional Probability

2 Random Variable Generation

Topic 3 - Discrete distributions

1 Some basic renewal theory: The Renewal Reward Theorem

BMIR Lecture Series on Probability and Statistics Fall, 2015 Uniform Distribution

Midterm Examination. STA 205: Probability and Measure Theory. Wednesday, 2009 Mar 18, 2:50-4:05 pm

MATH : EXAM 2 INFO/LOGISTICS/ADVICE

Week 9 The Central Limit Theorem and Estimation Concepts

Lecture 7: Sections 2.3 and 2.4 Rational and Exponential Functions. Recall that a power function has the form f(x) = x r where r is a real number.

Delta Method. Example : Method of Moments for Exponential Distribution. f(x; λ) = λe λx I(x > 0)

INSTRUCTIONS USEFUL FORMULAS

Intelligent Systems (AI-2)

Topic 4: Continuous random variables

STAT 311 Practice Exam 2 Key Spring 2016 INSTRUCTIONS

System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models

Extra Topic: DISTRIBUTIONS OF FUNCTIONS OF RANDOM VARIABLES

Math 407: Probability Theory 5/10/ Final exam (11am - 1pm)

CDA5530: Performance Models of Computers and Networks. Chapter 2: Review of Practical Random Variables

Transcription:

www.stat.ubc.ca/~bouchard/courses/stat302-sp2017-18/ Intro to Probability Instructor: Alexandre Bouchard

Info on midterm CALCULATOR: only NON-programmable, NON-scientific, NON-graphing (and of course, no phone). closed book, but we provide a formula sheet Topics examined: up to and not including the Poisson process (but including Poisson approximation!) See Midterm study resources under Tools tab of website: Formula sheet provided; posted on website Also on website: previous year (slightly shorter) midterm

www.stat.ubc.ca/~bouchard/courses/stat302-sp2017-18/ Logistics What else is new/recent on the website: Written assignment 2 released, due after break (but start now if you can!) Webwork: next one will be posted today, due in one week. Clicker grading rule: participation-based (but might give a few extra clicker marks based on correctness)

Plan for today: Poisson process, continued Normal approximations

Review

Def 16 The uniform distribution Density CDF

Def 18 The exponential distribution Density CDF f(x) Mean (i.e. E[X]): 1/λ Variance: 1/λ 2

Space of outcomes (S) Space of labels ( realizations ) Before s X Each value X(s) is a number Example: X(s) = 5

Space of outcomes (S) Space of labels ( realizations ) New r.v.: PP X Each value X(s) is a set Examples: X(s) = X(s ) = x xx x x

X(s) = 0 1 [ x xx ] X N[0,1] 2 1 0 X(s ) = 0 1 [ x ] x

Poisson process: two equivalent descriptions Spacings: D1 D2 D3 D4 D5 xx x x Counts: A

Def 19 Description based on counts G xx x x B A X is called a Poisson process with intensity i if: For all A G, N A ~ Poi(i * length(a)) For all disjoint A, B G, NA is independent of NB Notation: X ~ PP( i ) NA = # points of X in A NB = # points of X in B

Def 19b Description based on spacings D1 D2 D3 D4 D5 xx x x Sets of points where the spacings (distances between consecutive points) are given by: D1 ~ Exp( i ), D2 ~ Exp( i ), D3 ~ Exp( i ), etc All independent and identically distributed (iid)

Application to earthquake distribution

Ex 56 Earthquake example Data: 68 earthquake around the Strait of Messina in the 1700-1980 period Only magnitude > 4.5 kept Foreshocks and aftershocks removed

Ex 56 Data:... Available under Files in case you are curious: earthquakes.xls 1/average = intensity =

Ex 56

Useful facts Exponential (rate λ) Density: λ exp( -λx ) x [0, ) Mean: 1/λ Variance: 1/λ 2 CDF: 1 - exp( -λx) Poisson Process (intensity λ) Let A and B be two disjoint intervals Let their length be LA and LB, Then NA and NB are: independent Poisson r.v.s, with rates λla and λlb Poisson Distribution (mean λ) PMF: λ x e - λ / x!, x {0, 1, 2,...} Mean: λ, Variance: λ

Ex 56a Clicker question The probability that a recurrence interval is shorter than 10 months is given by the expression: A. P(T < 10) where T~Exp( =0.2403) B. P(T < 0.83) where T~Exp( =0.2403) C. P(T < 10) where T~Poisson( l =0.2403*10) D. P(T < 0.83) where T~Poisson( l =0.2403*10)

Ex 56a Clicker question The probability that a recurrence interval is shorter than 10 months is given by the expression: A. P(T < 10) where T~Exp( =0.2403) B. P(T < 0.83) where T~Exp( =0.2403) C. P(T < 10) where T~Poisson( l =0.2403*10) D. P(T < 0.83) where T~Poisson( l =0.2403*10)

Ex 56b Clicker question What is the probability that a recurrence interval is shorter than 10 months? A. 0.1815 B. 0.8185 C. 0.0904 D. 0.9096

Ex 56b Clicker question What is the probability that a recurrence interval is shorter than 10 months? A. 0.1815 B. 0.8185 C. 0.0904 D. 0.9096

Ex 56c Clicker question What proportion of the observed recurrence intervals are greater than 5 years? A. 0.10 B. 0.16 C. 0.28 D. 0.38

Ex 56c Clicker question What proportion of the observed recurrence intervals are greater than 5 years? A. 0.10 B. 0.16 C. 0.28 D. 0.38

Ex 56c Clicker question a. What proportion of the observed recurrence intervals are greater than 5 years (hint: find the areas of the bars where area = density x bar width)? How does it compare to the theoretical value assuming an Exponential distribution for the recurrence intervals? Proportion = 0.06 + 0.06 + 0.02 + 0.03 + 0.02 + 0.05 + 0.02 + 0.02 = 0.28 The observed proportion is close to the theoretical probability P(T > 5) = 1 F(5) = e 0.2403 5 = 0.301

Ex 56d Clicker question What is the probability that a recurrence interval is greater than 5 years (theoretical value)? A. 0.095 B. 0.905 C. 0.301 D. 0.699

Ex 56d Clicker question What is the probability that a recurrence interval is greater than 5 years (theoretical value)? A. 0.095 B. 0.905 C. 0.301 D. 0.699

Ex 56e Clicker question The probability that there are no earthquakes in 5 years is given by the expression: A. P(T < 5) where T~Exp( =0.2403) B. P(T > 5) where T~Exp( =0.2403) C. P(T = 0) where T~Exp( =0.2403) D. P(X = 0) where X~Poisson( l =0.2403*5) E. P(X = 0) where X~Poisson( =0.2403)

Ex 56e Clicker question The probability that there are no earthquakes in 5 years is given by the expression: A. P(T < 5) where T~Exp( =0.2403) B. P(T > 5) where T~Exp( =0.2403) C. P(T = 0) where T~Exp( =0.2403) Both correct D. P(X = 0) where X~Poisson( l =0.2403*5) E. P(X = 0) where X~Poisson( =0.2403)

Ex 56f Clicker question An earthquake just occurred. What is the probability that one has to wait for more than 30 years to observe the 10th earthquake? A. P(T > 30) where T~Exp( =0.2403) B. P(X = 9) where X~Poisson( l =0.2403*30) C. P(X = 10) where X~Poisson( l =0.2403*30) D. P(X l =0.2403*30) E. P(G > 30) where G~Gamma( =10, =0.2403)

Ex 56f Clicker question An earthquake just occurred. What is the probability that one has to wait for more than 30 years to observe the 10th earthquake? A. P(T > 30) where T~Exp( =0.2403) B. P(X = 9) where X~Poisson( l =0.2403*30) C. P(X = 10) where X~Poisson( l =0.2403*30) D. P(X l =0.2403*30) E. P(G > 30) where G~Gamma( =10, =0.2403)

Another way of answering last question: the Gamma distribution

Examples of Gamma random variables 3-Spacings: ~ Gamma(rate = i, shape = 3) 2-Spacings: ~ Gamma(rate = i, shape = 2) 1-Spacings: ~ Exponential(rate = i) xx x x

Def 20 The Gamma distribution Density CDF f(x) / x shape 1 e rate x No convenient form f(x) / g(x) f(x) =Cg(x) for x positive means: for some constant C,

Def 20 The Gamma distribution In other words: the sum of n Exponential with rate λ is a Gamma with shape n and rate λ

Def 20 The Gamma distribution Density CDF f(x) / x shape 1 e rate x No convenient form for x positive Special case: if shape = 1, this is just an exponential 1-Spacings: ~ Exponential(rate = i) ~ Gamma(rate = i, shape = 1) xx x x

A first glance to the Normal approximation

Situations where the Normal approximation is useful Sums of (pretty much*) any iid random variables Sn = X1 + X2 +... + Xn * precise conditions will be described later

Example: random walks Sn = Sn-1 + Xn So far, we know how to deal with cases where Xn = 0 or 1: Suppose Xi is more complicated: Sn = $ n = year Sn = $ Here for example, Xn can be 0, 1, or 2 n = year

Prop 15 Normal approximation: intuition If: S = X1 + X2 +... + Xn Then: d S a + b Z Where: (more detail in next slides) a and b are easy to compute numbers Z is the standard Normal distribution d mean approximately equal in distribution

Normal approximation: intuition If: S = X1 + X2 +... + Xn Then: d S a + b Z Where: (more detail in next slides) a = nµ b = p n µ = E[X i ] = p Var(X i ) = standard deviation (SD)

Def 21 Standard normal distribution 1 Density: f(z) / exp 2 z2

Def 22 d d = and d X = Y means: P(X t) = P(Y t) for all t d X Y means: P(X t) P(Y t) for all t

Ex 57 Example: gambling when you can borrow as much money as you want Suppose a roulette has: 2 green squares, 18 black squares, 18 red squares You bed $1 each round You can borrow as much as you want and will not stop until you play 100 rounds What is the probability you are ahead (do not owe money) after 100 rounds?

Using standard normal CDF tables

Ex 58 Another example - Ideal class size: 150 students - 30% accept the admission offer - Proposed policy: send 450 offers.. - probab no more than 150 attend? A.51% B.79% C.94% D. 99%