Probability Theory. Probability and Statistics for Data Science CSE594 - Spring 2016

Similar documents
Statistical Preliminaries. Stony Brook University CSE545, Fall 2016

Linear Models: Comparing Variables. Stony Brook University CSE545, Fall 2017

Lecture 1. ABC of Probability

Probability Theory Review

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events

Appendix A : Introduction to Probability and stochastic processes

MA : Introductory Probability

Dept. of Linguistics, Indiana University Fall 2015

Discrete Probability. Mark Huiskes, LIACS Probability and Statistics, Mark Huiskes, LIACS, Lecture 2

A PECULIAR COIN-TOSSING MODEL

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019

Chapter 7 Wednesday, May 26th

Lecture 2: Repetition of probability theory and statistics

Probability Theory for Machine Learning. Chris Cremer September 2015

Recap of Basic Probability Theory

Bayesian Inference. Introduction

Recap of Basic Probability Theory

Probability Theory and Simulation Methods

Chapter 14. From Randomness to Probability. Copyright 2012, 2008, 2005 Pearson Education, Inc.

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 3 9/10/2008 CONDITIONING AND INDEPENDENCE

Basic Probability. Introduction

Introduction to Stochastic Processes

1 What are probabilities? 2 Sample Spaces. 3 Events and probability spaces

Probability theory basics

Algorithms for Uncertainty Quantification

Chapter 2.5 Random Variables and Probability The Modern View (cont.)

Lecture 1: Probability Fundamentals

MATH MW Elementary Probability Course Notes Part I: Models and Counting

Great Theoretical Ideas in Computer Science

STAT:5100 (22S:193) Statistical Inference I

The probability of an event is viewed as a numerical measure of the chance that the event will occur.

Lecture 3: Random variables, distributions, and transformations

Introduction to Probability. Ariel Yadin. Lecture 1. We begin with an example [this is known as Bertrand s paradox]. *** Nov.

Origins of Probability Theory

Introduction to Probability

Some Basic Concepts of Probability and Information Theory: Pt. 1

Probability Review. Yutian Li. January 18, Stanford University. Yutian Li (Stanford University) Probability Review January 18, / 27

Probability: Part 1 Naima Hammoud

Lecture 4: Probability, Proof Techniques, Method of Induction Lecturer: Lale Özkahya

Machine Learning. Bayes Basics. Marc Toussaint U Stuttgart. Bayes, probabilities, Bayes theorem & examples

Conditional Probability

1 Sequences of events and their limits

Introduction Probability. Math 141. Introduction to Probability and Statistics. Albyn Jones

SDS 321: Introduction to Probability and Statistics

4th IIA-Penn State Astrostatistics School July, 2013 Vainu Bappu Observatory, Kavalur

Independence. P(A) = P(B) = 3 6 = 1 2, and P(C) = 4 6 = 2 3.

Midterm #1 - Solutions

the time it takes until a radioactive substance undergoes a decay

Lecture 3 Probability Basics

Statistical Methods for the Social Sciences, Autumn 2012

Chapter 1: Introduction to Probability Theory

Lecture 9: Conditional Probability and Independence

ECE 302: Chapter 02 Probability Model

Lecture Notes 1 Basic Probability. Elements of Probability. Conditional probability. Sequential Calculation of Probability

P (A) = P (B) = P (C) = P (D) =

University of California, Berkeley, Statistics 134: Concepts of Probability. Michael Lugo, Spring Exam 1

ECE 340 Probabilistic Methods in Engineering M/W 3-4:15. Lecture 2: Random Experiments. Prof. Vince Calhoun

Statistical Inference

Probability: Terminology and Examples Class 2, Jeremy Orloff and Jonathan Bloom

3rd IIA-Penn State Astrostatistics School July, 2010 Vainu Bappu Observatory, Kavalur

Statistics and Econometrics I

Econ 325: Introduction to Empirical Economics

Probability: Axioms, Properties, Interpretations

Statistics 1 - Lecture Notes Chapter 1

CMPSCI 240: Reasoning Under Uncertainty

Example: Suppose we toss a quarter and observe whether it falls heads or tails, recording the result as 1 for heads and 0 for tails.

CPSC340. Probability. Nando de Freitas September, 2012 University of British Columbia

Conditional Probability

Probability. Paul Schrimpf. January 23, Definitions 2. 2 Properties 3

Conditional Probability

1: PROBABILITY REVIEW

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

STAT Chapter 3: Probability

Advanced Herd Management Probabilities and distributions

Lecture 16. Lectures 1-15 Review

Probability and Independence Terri Bittner, Ph.D.

Venn Diagrams; Probability Laws. Notes. Set Operations and Relations. Venn Diagram 2.1. Venn Diagrams; Probability Laws. Notes

M378K In-Class Assignment #1

Conditional Probability & Independence. Conditional Probabilities

Deep Learning for Computer Vision

Chapter 2 Class Notes

Lecture 3 - Axioms of Probability

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008

Why study probability? Set theory. ECE 6010 Lecture 1 Introduction; Review of Random Variables

1 INFO Sep 05

Week 2. Section Texas A& M University. Department of Mathematics Texas A& M University, College Station 22 January-24 January 2019

Probability, Random Processes and Inference

Lecture 11: Random Variables

COMPSCI 240: Reasoning Under Uncertainty

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

Statistics for Engineers

Exam 1. Problem 1: True or false

Probability Year 9. Terminology

Notes 1 Autumn Sample space, events. S is the number of elements in the set S.)

Topic 5 Basics of Probability

Probability Theory and Applications

Methods of Mathematics

Bayes Formula. MATH 107: Finite Mathematics University of Louisville. March 26, 2014

Lecture 2: Probability, conditional probability, and independence

Probability Theory. Introduction to Probability Theory. Principles of Counting Examples. Principles of Counting. Probability spaces.

Transcription:

Probability Theory Probability and Statistics for Data Science CSE594 - Spring 2016

What is Probability? 2

What is Probability? Examples outcome of flipping a coin (seminal example) amount of snowfall mentioning a word mentioning a word a lot 3

What is Probability? The chance that something will happen. Given infinite observations of an event, the proportion of observations where a given outcome happens. Strength of belief that something is true. Mathematical language for quantifying uncertainty - Wasserman 4

Probability (review) Ω : Sample Space, set of all outcomes of a random experiment A : Event (A Ω), collection of possible outcomes of an experiment P(A): Probability of event A, P is a function: events R 5

Probability (review) Ω : Sample Space, set of all outcomes of a random experiment A : Event (A Ω), collection of possible outcomes of an experiment P(A): Probability of event A, P is a function: events R P(Ω) = 1 P(A) 0, for all A If A 1, A 2, are disjoint events then: 6

Probability (review) Ω : Sample Space, set of all outcomes of a random experiment A : Event (A Ω), collection of possible outcomes of an experiment P(A): Probability of event A, P is a function: events R P is a probability measure, if and only if P(Ω) = 1 P(A) 0, for all A If A 1, A 2, are disjoint events then: 7

Probability Examples outcome of flipping a coin (seminal example) amount of snowfall mentioning a word mentioning a word a lot 8

Probability (review) Some Properties: If B A then P(A) P(B) P(A B) P(A) + P(B) P(A B) min(p(a), P(B)) P( A) = P(Ω / A) = 1 - P(A) / is set difference P(A B) will be notated as P(A, B) 9

Probability (Review) Independence Two Events: A and B Does knowing something about A tell us whether B happens (and vice versa)? 10

Probability (Review) Independence Two Events: A and B Does knowing something about A tell us whether B happens (and vice versa)? A: first flip of a fair coin; B: second flip of the same fair coin A: mention or not of the word happy B: mention or not of the word birthday 11

Probability (Review) Independence Two Events: A and B Does knowing something about A tell us whether B happens (and vice versa)? A: first flip of a fair coin; B: second flip of the same fair coin A: mention or not of the word happy B: mention or not of the word birthday Two events, A and B, are independent iff P(A, B) = P(A)P(B) 12

Probability (Review) Conditional Probability P(A, B) P(A B) = ------------- P(B) 13

Probability (Review) Conditional Probability P(A, B) P(A B) = ------------- P(B) H: mention happy in message, m B: mention birthday in message, m P(H) =.01 P(B) =.001 P(H, B) =.0005 P(H B) =?? 14

Probability (Review) Conditional Probability P(A, B) P(A B) = ------------- P(B) H: mention happy in message, m B: mention birthday in message, m P(H) =.01 P(B) =.001 P(H, B) =.0005 P(H B) =.50 H1: first flip of a fair coin is heads H2: second flip of the same coin is heads P(H2) = 0.5 P(H1) = 0.5 P(H2, H1) = 0.25 P(H2 H1) = 0.5 15

Probability (Review) Conditional Probability P(A, B) P(A B) = ------------- P(B) H1: first flip of a fair coin is heads H2: second flip of the same coin is heads P(H2) = 0.5 P(H1) = 0.5 P(H2, H1) = 0.25 P(H2 H1) = 0.5 Two events, A and B, are independent iff P(A, B) = P(A)P(B) P(A, B) = P(A)P(B) iff P(A B) = P(A) 16

Probability (Review) Conditional Probability P(A, B) P(A B) = ------------- P(B) H1: first flip of a fair coin is heads H2: second flip of the same coin is heads P(H2) = 0.5 P(H1) = 0.5 P(H2, H1) = 0.25 P(H2 H1) = 0.5 Two events, A and B, are independent iff P(A, B) = P(A)P(B) P(A, B) = P(A)P(B) iff P(A B) = P(A) Interpretation of Independence: Observing B has no effect on probability of A. 17

Why Probability? 18

Why Probability? A formality to make sense of the world. To quantify uncertainty Should we believe something or not? Is it a meaningful difference? To be able to generalize from one situation or point in time to another. Can we rely on some information? What is the chance Y happens? To organize data into meaningful groups or dimensions Where does X belong? What words are similar to X? 19

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. 20

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = 5 coin tosses = {<HHHHH>, <HHHHT>, <HHHTH>, <HHHTH> } 21

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = 5 coin tosses = {<HHHHH>, <HHHHT>, <HHHTH>, <HHHTH> } We may just care about how many tails? Thus, X(<HHHHH>) = 0 X(<HHHTH>) = 1 X(<TTTHT>) = 4 X(<HTTTT>) = 4 22

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = 5 coin tosses = {<HHHHH>, <HHHHT>, <HHHTH>, <HHHTH> } We may just care about how many tails? Thus, X(<HHHHH>) = 0 X(<HHHTH>) = 1 X(<TTTHT>) = 4 X(<HTTTT>) = 4 X only has 6 possible values: 0, 1, 2, 3, 4, 5 23

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = 5 coin tosses = {<HHHHH>, <HHHHT>, <HHHTH>, <HHHTH> } We may just care about how many tails? Thus, X(<HHHHH>) = 0 X(<HHHTH>) = 1 X(<TTTHT>) = 4 X(<HTTTT>) = 4 X only has 6 possible values: 0, 1, 2, 3, 4, 5 What is the probability that we end up with k = 4 tails? P(X(ω) = k) where ω Ω 24

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = 5 coin tosses = {<HHHHH>, <HHHHT>, <HHHTH>, <HHHTH> } We may just care about how many tails? Thus, X(<HHHHH>) = 0 X(<HHHTH>) = 1 X(<TTTHT>) = 4 X(<HTTTT>) = 4 X only has 6 possible values: 0, 1, 2, 3, 4, 5 What is the probability that we end up with k = 4 tails? P(X = k) := P( {ω : X(ω) = k} ) where ω Ω 25

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = 5 coin tosses = {<HHHHH>, <HHHHT>, <HHHTH>, <HHHTH> } We may just care about how many tails? Thus, X(<HHHHH>) = 0 X(<HHHTH>) = 1 X(<TTTHT>) = 4 X(<HTTTT>) = 4 X only has 6 possible values: 0, 1, 2, 3, 4, 5 What is the probability that we end up with k = 4 tails? P(X = k) := P( {ω : X(ω) = k} ) where ω Ω X(ω) = 4 for 5 out of 32 sets in Ω. Thus, assuming a fair coin, P(X = 4) = 5/32 26

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = 5 coin tosses = {<HHHHH>, <HHHHT>, <HHHTH>, <HHHTH> } We may just care about how many tails? Thus, X(<HHHHH>) = 0 X(<HHHTH>) = 1 X(<TTTHT>) = 4 X(<HTTTT>) = 4 X only has 6 possible values: 0, 1, 2, 3, 4, 5 What is the probability that we end up with k = 4 tails? P(X = k) := P( {ω : X(ω) = k} ) where ω Ω X(ω) = 4 for 5 out of 32 sets in Ω. Thus, assuming a fair coin, P(X = 4) = 5/32 (Not a variable, but a function that we end up notating a lot like a variable) 27

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = 5 coin tosses = {<HHHHH>, <HHHHT>, <HHHTH>, <HHHTH> } We may just care about how many tails? Thus, X(<HHHHH>) = 0 X is a discrete random variable X(<HHHTH>) = 1 if it takes only a countable X(<TTTHT>) = 4 number of values. X(<HTTTT>) = 4 X only has 6 possible values: 0, 1, 2, 3, 4, 5 What is the probability that we end up with k = 4 tails? P(X = k) := P( {ω : X(ω) = k} ) where ω Ω X(ω) = 4 for 5 out of 32 sets in Ω. Thus, assuming a fair coin, P(X = 4) = 5/32 (Not a variable, but a function that we end up notating a lot like a variable) 28

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. X is a continuous random variable if it can take on an infinite number of values between any two given values. X is a discrete random variable if it takes only a countable number of values. 29

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = inches of snowfall = [0, ) R X is a continuous random variable if it can take on an infinite number of values between any two given values. 30

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = inches of snowfall = [0, ) R X is a continuous random variable if it can take on an infinite number of values between any two given values. X amount of inches in a snowstorm X(ω) = ω 31

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = inches of snowfall = [0, ) R X is a continuous random variable if it can take on an infinite number of values between any two given values. X amount of inches in a snowstorm X(ω) = ω What is the probability we receive (at least) a inches? P(X a) := P( {ω : X(ω) a} ) What is the probability we receive between a and b inches? P(a X b) := P( {ω : a X(ω) b} ) 32

Random Variables X: A mapping from Ω to R that describes the question we care about in practice. Example: Ω = inches of snowfall = [0, ) R X is a continuous random variable if it can take on an infinite number of values between any two given values. X amount of inches in a snowstorm X(ω) = ω P(X = i) := 0, for all i Ω What is the probability we receive (at least) a inches? P(X a) := P( {ω : X(ω) a} ) (probability of receiving exactly i inches of snowfall is zero) What is the probability we receive between a and b inches? P(a X b) := P( {ω : a X(ω) b} ) 33

Probability Review what constitutes a probability measure? independence conditional probability random variables discrete continuous 34