Lecture 1 : The Mathematical Theory of Probability

Similar documents
STAT 430/510 Probability

SDS 321: Introduction to Probability and Statistics

Notes 1 Autumn Sample space, events. S is the number of elements in the set S.)

Mathematics. ( : Focus on free Education) (Chapter 16) (Probability) (Class XI) Exercise 16.2

Deep Learning for Computer Vision

Introduction Probability. Math 141. Introduction to Probability and Statistics. Albyn Jones

MODULE 2 RANDOM VARIABLE AND ITS DISTRIBUTION LECTURES DISTRIBUTION FUNCTION AND ITS PROPERTIES

Fundamentals of Probability CE 311S

Lecture Lecture 5


Probability: Terminology and Examples Class 2, Jeremy Orloff and Jonathan Bloom

Chapter 2 PROBABILITY SAMPLE SPACE

What is Probability? Probability. Sample Spaces and Events. Simple Event

Events A and B are said to be independent if the occurrence of A does not affect the probability of B.

Axioms of Probability

324 Stat Lecture Notes (1) Probability

Introduction to Probability and Sample Spaces

Lecture Notes 1 Basic Probability. Elements of Probability. Conditional probability. Sequential Calculation of Probability

Probability: Sets, Sample Spaces, Events

Lecture 3 - Axioms of Probability

Conditional Probability

Statistics for Financial Engineering Session 2: Basic Set Theory March 19 th, 2006

Mean, Median and Mode. Lecture 3 - Axioms of Probability. Where do they come from? Graphically. We start with a set of 21 numbers, Sta102 / BME102

Statistical Inference

Probability Pearson Education, Inc. Slide

Why should you care?? Intellectual curiosity. Gambling. Mathematically the same as the ESP decision problem we discussed in Week 4.

ECE 450 Lecture 2. Recall: Pr(A B) = Pr(A) + Pr(B) Pr(A B) in general = Pr(A) + Pr(B) if A and B are m.e. Lecture Overview

(i) Given that a student is female, what is the probability of having a GPA of at least 3.0?

ELEG 3143 Probability & Stochastic Process Ch. 1 Experiments, Models, and Probabilities

Outline. 1. Define likelihood 2. Interpretations of likelihoods 3. Likelihood plots 4. Maximum likelihood 5. Likelihood ratio benchmarks

CS4705. Probability Review and Naïve Bayes. Slides from Dragomir Radev

Conditional Probability and Bayes Theorem (2.4) Independence (2.5)

Problems from Probability and Statistical Inference (9th ed.) by Hogg, Tanis and Zimmerman.

I - Probability. What is Probability? the chance of an event occuring. 1classical probability. 2empirical probability. 3subjective probability

Probabilistic models

Independence. P(A) = P(B) = 3 6 = 1 2, and P(C) = 4 6 = 2 3.

Chapter 2. Conditional Probability and Independence. 2.1 Conditional Probability

Statistical Theory 1

Chapter 2. Conditional Probability and Independence. 2.1 Conditional Probability

Notation: X = random variable; x = particular value; P(X = x) denotes probability that X equals the value x.

6.2 Introduction to Probability. The Deal. Possible outcomes: STAT1010 Intro to probability. Definitions. Terms: What are the chances of?

Venn Diagrams; Probability Laws. Notes. Set Operations and Relations. Venn Diagram 2.1. Venn Diagrams; Probability Laws. Notes

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary)

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events

Recap. The study of randomness and uncertainty Chances, odds, likelihood, expected, probably, on average,... PROBABILITY INFERENTIAL STATISTICS

Math 3338: Probability (Fall 2006)

Chapter 13, Probability from Applied Finite Mathematics by Rupinder Sekhon was developed by OpenStax College, licensed by Rice University, and is

k P (X = k)

CMPSCI 240: Reasoning about Uncertainty

Compound Events. The event E = E c (the complement of E) is the event consisting of those outcomes which are not in E.

Statistics Statistical Process Control & Control Charting

PERMUTATIONS, COMBINATIONS AND DISCRETE PROBABILITY

Axioms of Probability. Set Theory. M. Bremer. Math Spring 2018

First Digit Tally Marks Final Count

Applied Statistics I

Conditional Probability

General Info. Grading

Probability Theory Review

Notes Week 2 Chapter 3 Probability WEEK 2 page 1

Topic 5 Basics of Probability

4/17/2012. NE ( ) # of ways an event can happen NS ( ) # of events in the sample space

Probability and Independence Terri Bittner, Ph.D.

Probability (Devore Chapter Two)

2011 Pearson Education, Inc

2. Conditional Probability

Discrete Probability

CS206 Review Sheet 3 October 24, 2018

Quantitative Methods for Decision Making

Lecture 4: Probability, Proof Techniques, Method of Induction Lecturer: Lale Özkahya

2. AXIOMATIC PROBABILITY

Week 2: Probability: Counting, Sets, and Bayes

The probability of an event is viewed as a numerical measure of the chance that the event will occur.

4th IIA-Penn State Astrostatistics School July, 2013 Vainu Bappu Observatory, Kavalur

Probability Distributions for Discrete RV

The enumeration of all possible outcomes of an experiment is called the sample space, denoted S. E.g.: S={head, tail}

n How to represent uncertainty in knowledge? n Which action to choose under uncertainty? q Assume the car does not have a flat tire

Chapter 14. From Randomness to Probability. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Term Definition Example Random Phenomena

ICS141: Discrete Mathematics for Computer Science I

ECE 340 Probabilistic Methods in Engineering M/W 3-4:15. Lecture 2: Random Experiments. Prof. Vince Calhoun

3.2 Probability Rules

Econ 325: Introduction to Empirical Economics

Outline. 1. Define likelihood 2. Interpretations of likelihoods 3. Likelihood plots 4. Maximum likelihood 5. Likelihood ratio benchmarks

ECE353: Probability and Random Processes. Lecture 2 - Set Theory

Probability. VCE Maths Methods - Unit 2 - Probability

TA Qinru Shi: Based on poll result, the first Programming Boot Camp will be: this Sunday 5 Feb, 7-8pm Gates 114.

More on Distribution Function

STAT Chapter 3: Probability

EE 178 Lecture Notes 0 Course Introduction. About EE178. About Probability. Course Goals. Course Topics. Lecture Notes EE 178

Business Statistics MBA Pokhara University

11. Probability Sample Spaces and Probability

CISC 1100/1400 Structures of Comp. Sci./Discrete Structures Chapter 7 Probability. Outline. Terminology and background. Arthur G.

Introduction to Probability

MATH MW Elementary Probability Course Notes Part I: Models and Counting

(6, 1), (5, 2), (4, 3), (3, 4), (2, 5), (1, 6)

What is a random variable

Lecture 2: Probability. Readings: Sections Statistical Inference: drawing conclusions about the population based on a sample

3rd IIA-Penn State Astrostatistics School July, 2010 Vainu Bappu Observatory, Kavalur

Lecture 8: Probability

9/6/2016. Section 5.1 Probability. Equally Likely Model. The Division Rule: P(A)=#(A)/#(S) Some Popular Randomizers.

Transcription:

Lecture 1 : The Mathematical Theory of Probability 0/ 30

1. Introduction Today we will do 2.1 and 2.2. We will skip Chapter 1. We all have an intuitive notion of probability. Let s see. What is the probability P of tossing two heads in a row with a fair coin? 1/ 30

Method 1 List all possible outcomes { } HH, HT, TH, TT so P =?. Question What did we just assume to arrive at that answer? 2/ 30

Another way 1 st toss 2 nd toss However it is important to put probability into a formal mathematic framework for many reasons. 3/ 30

1. Even elementary Problems become too hard unless we can break them down into simpler problems using the rules of Set Theory. Examples Let s see how you can deal with these now and later. (there is another reason which we will run into later - we often have infinite sets and need calculus e.g. financial math) 4/ 30

Problems 1 What is the probability of getting one head in one hundred tosses of a fair coin? 2 What is the probability of getting 27 heads in one hundred tosses of a fair coin? 5/ 30

2. Transition from the naive theory to the formal mathematical theory To make the transition we introduce the word experiment which will be taken to mean any action or process whose outcome is subject to uncertainty Devore, Ninth Edition- pg. 53. Examples Tossing a fair coin 100 times. Dealing 5 cards from a 52 card deck - a poker hand. Dealing 13 cards from a 52 card deck - a bridge hand. 6/ 30

Definition The set of all possible outcomes of on experiment will be called the sample space of that experiment and denoted S. Experiment 3 tosses of a fair coin. HHH, HHT, HTH, HTT, S = THH, THT, TTH, TTT 7/ 30

Definition A subset A of S is called an event. Problem Find P (at least one head in 3 tosses of a fair coin) We are looking for P(A) where A is a subset of the previous S. 8/ 30

S = {HHH, HHT, HTH, HTT, THH, THT, TTH, TTT} We will call this our favorite sample space from now on. 9/ 30

3. The Formal Mathematical Theory Let S be a set (the sample space). A probability measure P on S is a rule (function) which assigns a real number P(A) to any subset A of S (i.e., to any event) such that the following axioms are satisfied 1 For any event A S we have P(A) 0 2 P(S) = 1 10/ 30

3 If A 1, A 2,, A n,... is a possibly infinite collection of pairwise disjoint (mutually exclusive) events then P (A, A 2... A n...) = P(A n ) n=1 } {{ } sum of an infinite series not just ordinary sum. mutually exclusive means A i A j = for any pair i, j with i j. 11/ 30

Special cases 1 Two mutually-exclusive events A 1 and A 2 (so A 1 A 2 = ) P(A 1 A 2 ) = P(A 1 ) + P(A 2 ) 2 n mutually-exclusive events A 1, A 2,..., A n P(A 1 A 2... A n ) = P(A 1 ) + P(A 2 ) + + P(A n ) 12/ 30

A Class of Examples Let S be a set with n elements. Let A S be any subset. Define P(A) = (A) (S) = (A) n Then P satisfies the axioms 1., 2. and 3. Here (A) means the number elements in A. This is called the equally likely probability measure. 13/ 30

An example in the above class Take our favorite sample space HHH, HHT, HTH, HTT S = THH, THT, TTH, TTT Let A be the subset (event) of outcomes with at least one head and one tail. All the outcomes are equally likely (because the coin is fair) so P(A) = (A) (s) = 6 8 14/ 30

A continuous Example 15 Consider the unit square s in the plane Let A S be any subset. Define P(A) = Area of A Then P satisfies the axioms 1., 2. and 3. 15/ 30

Let A be the subset of points in the square below the diagonal. What is P(A)? Can you find A so that P(A) = 1 π? 16/ 30

4. A Quick Trip Through Set-Theory (pg. 49-50) Let s be a set and A and B be subsets. Then we have A B (union), A B (intersection) and A (complement). Venn diagrams 17/ 30

union intersection A B = everything in S that is in either A or B A B = everything in S that is in A and B 18/ 30

The formulas linking, and To help you remember the formulas that follow use the analogy s set of numbers + The commutative laws A B = B A (analogue a + b = b + a) A B = B A (analogue a b = b a) 19/ 30

The associative laws (A B) C = A (B C) (analogue (a + b) + c = a + (b + c)) (A B) C = A (B C) (analogue (a b) c = a (b c)) Now we have laws that relate two or more of, and. The distributive laws A (B C) = (A B) (A C) (analogue a (b + c) = (a b) + (a c)) A (B C) = (A B) (A C) no analogue Problem What would the analogue of the second distributive law say. It isn t true. 20/ 30

De Morgan s Laws (no analogy with +, ) (A B) = A B (A B) = A B C D C D if and only if (so complement reverses, and ) One way to think of the first formula not in A or B = not in A and not in B 21/ 30

The best way to see it is by a Venn diagram shaded shaded shaded Top square = intersection of bottom two squares 22/ 30

Consequences of the axioms of probability theory pg. 54-56. We will prove two propositions which will be extremely useful to you. Proposition 1 (Complement law) P(A ) = 1 P(A). Proof. A A = S so P(A A ) = P(S) = 1 (axiom 2) ( ) But A A = so by 23/ 30

Proof (Cont.) axiom 3, special case 1 P(A A ) = P(A) + P(A ) ( ) Putting ( ) and ( ) together we get 1 = P(A) + P(A ) Corollary 1 P(φ) = 0. Proof. φ = S so P(φ) = 1 P(S) = 1 1 = 0. 24/ 30

Remark is not the Greek letter phi, it is a Norwegian letter. The symbol was chosen by André Weil. For example the English word beer translates into Norwegian as l. Corollary 2 P(A) 1. Proof. P(A) = 1 P(A ) 1 because P(A ) 0. Hence all probabilities are between zero and one: 0 P(A) 1 25/ 30

To illustrate the use of Proposition 1, let us go back to computing P (at least one head in three tosses) Put S = our favorite sample space. A = at least one head so A = no heads = all tails = TTT so Now we can do 100 tosses P (at least one head) = 1 1 2 100 P(A) = 1 P(TTT) = 1 1 8 = 7 8 26/ 30

Recall that two events A and B are mutually exclusive if A B = and axiom 3 says in this case P(A B) = P(A) + P(B) ( ) The following proposition is absolutely critical for computations Proposition 2 (Additive Law) P(A B) = P(A) + P(B) P(A B) Note that this is consistent with ( ) above because if A B = then P(A B) = P( ) = 0 27/ 30

Proof. The proof is hard. It depends on the following Venn diagram. We see that A B is the union of three mutually exclusive sets. A B = (A B ) (A B) (B A ) so by axiom 3 with n = 3 P(A B) = P(A B ) + P(A B) + P(B A ) ( ) 28/ 30

Proof (Cont.) How do we compute the first and third terms? We have a disjoint union (i.e., union of mutually exclusive sets) A = (A B) (A B ) so by axiom 3 P(A) = P(A B) + P(A B ) whence P(A B ) = P(A) P(A B) (1) Similarly P(B A ) = P(B) P(A B) (3) Plug (1) and (3) into ( ). 29/ 30

What about the intersection of three terms? Proposition 3 P(A B C) = P(A) + P(B) + P(C) P(A B) P(A C) P(B C) +P(A B C) This is (more or less) the principle of exclusion and inclusion 1 include the singletons A, B, C 2 exclude the pairs A B, A C, B C 3 include the triple A B C 30/ 30