UVA CS 6316/4501 Fall 2016 Machine Learning. Lecture 11: Probability Review. Dr. Yanjun Qi. University of Virginia. Department of Computer Science

Size: px
Start display at page:

Download "UVA CS 6316/4501 Fall 2016 Machine Learning. Lecture 11: Probability Review. Dr. Yanjun Qi. University of Virginia. Department of Computer Science"

Transcription

1 UVA CS 6316/4501 Fall 2016 Machine Learning Lecture 11: Probability Review 10/17/16 Dr. Yanjun Qi University of Virginia Department of Computer Science 1

2 Announcements: Schedule Midterm Nov. 26 Wed / 3:30pm 4:45pm / open notes HW4 includes sample midterm quesoons Grading of HW1 is available on Collab SoluOon of HW1 is available on Collab Grading of HW2 will be available next week SoluOon of HW2 will be available next week 10/17/16 2

3 Where are we? è Five major secoons of this course q Regression (supervised) q ClassificaOon (supervised) q Unsupervised models q Learning theory q Graphical models 10/17/16 3

4 Where are we? è Three major secoons for classificaoon We can divide the large variety of classification approaches into roughly three major types 1. Discriminative - directly estimate a decision rule/boundary - e.g., support vector machine, decision tree 2. Generative: - build a generative statistical model - e.g., naïve bayes classifier, Bayesian networks 3. Instance based classifiers - Use observation directly (no models) - e.g. K nearest neighbors 10/17/16 4

5 Today : Probability Review Dr. Yanjun Qi / UVA CS 6316 / f16 The big picture Events and Event spaces Random variables Joint probability, MarginalizaOon, condiooning, chain rule, Bayes Rule, law of total probability, etc. Structural properoes Independence, condioonal independence 10/17/16 5

6 The Big Picture Probability Model i.e. Data generaong process Observed Data EsOmaOon / learning / Inference / Data mining But how to specify a model? 10/17/16 6

7 Probability as frequency Consider the following quesoons: 1. What is the probability that when I flip a coin it is heads? 2. why? We can count è ~1/2 3. What is the probability of Blue Ridge Mountains to have an erupong volcano in the near future? è could not count Message: The frequen*st view is very useful, but it seems that we also use domain knowledge to come up with probabili*es. 10/17/16 7 Adapt from Prof. Nando de Freitas s review slides

8 Probability as a measure of uncertainty Imagine we are throwing darts at a wall of size 1x1 and that all darts are guaranteed to fall within this 1x1 wall. What is the probability that a dart will hit the shaded area? Dr. Yanjun Qi / UVA CS 6316 / f16 10/17/16 8 Adapt from Prof. Nando de Freitas s review slides

9 Probability as a measure of uncertainty Probability is a measure of certainty of an event taking place. Dr. Yanjun Qi / UVA CS 6316 / f16 i.e. in the example, we were measuring the chances of hi?ng the shaded area. prob = # RedBoxes # Boxes 10/17/16 9 Adapt from Prof. Nando de Freitas s review slides

10 Today : Probability Review Dr. Yanjun Qi / UVA CS 6316 / f16 The big picture Sample space, Event and Event spaces Random variables Joint probability, MarginalizaOon, condiooning, chain rule, Bayes Rule, law of total probability, etc. Structural properoes Independence, condioonal independence 10/17/16 10

11 Probability O die = {1,2,3,4,5,6} O: Elementary Event Throw a 2 The elements of W are called elementary events. 10/17/16 11

12 Probability Probability allows us to measure many events. The events are subsets of the sample space O. For example, for a die we may consider the following events: e.g., GREATER = {5, 6} EVEN = {2, 4, 6} Assign probabili7es to these events: e.g., P(EVEN) = 1/2 10/17/16 12 Adapt from Prof. Nando de Freitas s review slides

13 Sample space and Events Dr. Yanjun Qi / UVA CS 6316 / f16 O : Sample Space, result of an experiment / set of all outcomes If you toss a coin twice O = {HH,HT,TH,TT} Event: a subset of O First toss is head = {HH,HT} S: event space, a set of events: 10/17/16 13 Contains the empty event and O

14 Axioms for Probability Dr. Yanjun Qi / UVA CS 6316 / f16 Defined over (O,S) s.t. 1 >= P(a) >= 0 for all a in S P(O) = 1 If A, B are disjoint, then P(A U B) = p(a) + p(b) 10/17/16 14

15 Axioms for Probability P(O) = P( B ) i B 5 B 3 B 2 B 4 B 7 B 6 B 1 10/17/16 15

16 OR operaoon for Probability We can deduce other axioms from the above ones Ex: P(A U B) for non-disjoint events P( Union of A and B) 10/17/16 16

17 A 10/17/16 17

18 P( IntersecOon of A and B) B A 10/17/16 18

19 B A 10/17/16 19

20 A CondiOonal Probability Dr. Yanjun Qi / UVA CS 6316 / f16 B from Prof. Nando de Freitas s review 10/17/16 20

21 CondiOonal Probability / Chain Rule More ways to write out chain rule P(A,B) = p(b A)p(A) P(A,B) = p(a B)p(B) 10/17/16 21

22 Rule of total probability => MarginalizaOon Dr. Yanjun Qi / UVA CS 6316 / f16 B 5 B 3 B 2 B 4 A B 7 B 6 B 1 p A ( )P( A B ) WHY??? i ( ) = P B i 10/17/16 22

23 Today : Probability Review Dr. Yanjun Qi / UVA CS 6316 / f16 The big picture Events and Event spaces Random variables Joint probability, MarginalizaOon, condiooning, chain rule, Bayes Rule, law of total probability, etc. Structural properoes Independence, condioonal independence 10/17/16 23

24 From Events to Random Variable Concise way of specifying arributes of outcomes Modeling students (Grade and Intelligence): O = all possible students (sample space) What are events (subset of sample space) Grade_A = all students with grade A Grade_B = all students with grade B HardWorking_Yes = who works hard Very cumbersome Need funcoons that maps from O to an arribute space T. P(H = YES) = P({student ϵ O : H(student) = YES}) 10/17/16 24

25 O Random Variables (RV) H: hardworking Yes No Dr. Yanjun Qi / UVA CS 6316 / f16 G:Grade A B A+ P(H = Yes) = P( {all students who is working hard on the course}) funcoons that maps from O to an arribute space T. 10/17/16 25

26 NotaOon Digression P(A) is shorthand for P(A=true) P(~A) is shorthand for P(A=false) Same notaoon applies to other binary RVs: P(Gender=M), P(Gender=F) Same notaoon applies to mul*valued RVs: P(Major=history), P(Age=19), P(Q=c) Note: upper case lerers/names for variables, lower case lerers/names for values 10/17/16 26

27 Discrete Random Variables Random variables (RVs) which may take on only a countable number of disinct values X is a RV with arity k if it can take on exactly one value out of {x 1,, x k } 10/17/16 27

28 Probability of Discrete RV Probability mass funcoon (pmf): P(X = x i ) Easy facts about pmf Σ i P(X = x i ) = 1 P(X = x i X = x j ) = 0 if i j P(X = x i U X = x j ) = P(X = x i ) + P(X = x j ) if i j P(X = x 1 U X = x 2 U U X = x k ) = 1 10/17/16 28

29 e.g. Coin Flips You flip a coin Head with probability 0.5 You flip 100 coins How many heads would you expect 10/17/16 29

30 e.g. Coin Flips cont. You flip a coin Head with probability p Binary random variable Bernoulli trial with success probability p You flip k coins How many heads would you expect Number of heads X: discrete random variable Binomial distribuoon with parameters k and p 10/17/16 30

31 Discrete Random Variables Random variables (RVs) which may take on only a countable number of disinct values E.g. the total number of heads X you get if you flip 100 coins X is a RV with arity k if it can take on exactly one value out of { x 1,, x } k E.g. the possible values that X can take on are 0, 1, 2,, /17/16 31

32 e.g., two Common DistribuOons Uniform X U 1,..., N X takes values 1, 2,, N PX ( = i) = 1N E.g. picking balls of different colors from a box Binomial X takes values 0, 1,, k 10/17/16 P( X = i) = X Bin k, p k i E.g. coin flips k Omes ( ) pi 1 p ( ) k i 32

33 Today : Probability Review Dr. Yanjun Qi / UVA CS 6316 / f16 The big picture Events and Event spaces Random variables Joint probability, MarginalizaOon, condiooning, chain rule, Bayes Rule, law of total probability, etc. Structural properoes Independence, condioonal independence 10/17/16 33

34 e.g., Coin Flips by Two Persons Your friend and you both flip coins Head with probability 0.5 You flip 50 Omes; your friend flip 100 Omes How many heads will both of you get 10/17/16 34

35 Joint DistribuOon Given two discrete RVs X and Y, their joint distribuion is the distribuoon of X and Y together E.g. P(You get 21 heads AND you friend get 70 heads) E.g. sum ( x y) /17/16 i= 0 j= 0 x y PX= Y= = 1 ( i j ) PYou get heads AND your friend get heads = 1 35

36 Joint DistribuOon Given two discrete RVs X and Y, their joint distribuion is the distribuoon of X and Y together E.g. P(You get 21 heads AND you friend get 70 heads) E.g. sum ( x y) /17/16 i= 0 j= 0 x y PX= Y= = 1 ( i j ) PYou get heads AND your friend get heads = 1 36

37 CondiOonal Probability ( ) P X = x Y = y is the probability of X = x, given the occurrence of Y = y E.g. you get 0 heads, given that your friend gets 61 heads P X ( x Y y) = = = ( = x Y= y) P X ( = y) P Y 10/17/16 37

38 Law of Total Probability Given two discrete RVs X and Y, which take values in { x and, We have 1,, x } { m y 1,, y } n ( = x ) ( ) i = = xi = yj PX PX Y j ( x ) ( ) i yj yj = PX= Y= PY= j 10/17/16 38

39 MarginalizaOon B 4 B 5 B 3 B 2 A B 7 B 6 B 1 Marginal Probability Law of Total Probability ( = x ) ( ) i = = xi = yj PX PX Y j ( x ) ( ) i yj yj = PX= Y= PY= j CondiOonal Probability Marginal Probability 10/17/16 39

40 Simplify NotaOon: Dr. Yanjun Qi / UVA CS 6316 / f16 CondiOonal Probability events ( x Y y) P X = = = ( = x Y= y) P X ( = y) P Y But we will always write it this way: ( x y) P = p( x, y) p( y) X=x Y=y P(X=x true) -> P(X=x) -> P(x) 10/17/16 40

41 Simplify NotaOon: MarginalizaOon We know p(x, Y), what is P(X=x)? We can use the law of total probability, why? Dr. Yanjun Qi / UVA CS 6316 / f16 p ( x) = P( x, y) = y y P ( y) P( x y) B 4 B 7 B 5 B 6 A B 3 B 1 B 2 10/17/16 41

42 MarginalizaOon Cont. Another example ( ) ( ) ( ) ( ) = = y z z y z y x P z y P z y x P x p,,,,,, 10/17/16 42 Dr. Yanjun Qi / UVA CS 6316 / f16

43 Bayes Rule We know that P(rain) = 0.5 If we also know that the grass is wet, then how this affects our belief about whether it rains or not? P( rain wet) = P(rain)P(wet rain) P(wet) 10/17/16 43

44 Bayes Rule We know that P(rain) = 0.5 If we also know that the grass is wet, then how this affects our belief about whether it rains or not? P( rain wet) = P(rain)P(wet rain) P(wet) P( x y) = P(x)P(y x) P(y) 10/17/16 44

45 Bayes Rule X and Y are discrete RVs ( x Y y) P X = = = ( = x Y= y) P X ( = y) P Y ( x Y ) i yj P X 10/17/16 = = = ( = y ) ( ) j = xi = xi P( Y = y X ) P( X ) j = xk = xk P Y X P X k 45

46 10/17/16 46

47 P(A = a 1 B) = P(B A = a 1 )P(A = a 1 ) P(B A = a i )P(A = a i ) i 10/17/16 47

48 Bayes Rule cont. You can condioon on more variables ( ) ) ( ), ( ) (, z y P z x y P z x P z y x P = 10/17/16 48 Dr. Yanjun Qi / UVA CS 6316 / f16

49 CondiOonal Probability Example 10/17/16 49 Adapt from Prof. Nando de Freitas s review slides

50 CondiOonal Probability Example 10/17/16 50 Adapt from Prof. Nando de Freitas s review slides

51 CondiOonal Probability Example 10/17/16 51

52 CondiOonal Probability Example 10/17/16 52

53 è Matrix NotaOon 10/17/16 53

54 Today : Probability Review Dr. Yanjun Qi / UVA CS 6316 / f16 The big picture Events and Event spaces Random variables Joint probability, MarginalizaOon, condiooning, chain rule, Bayes Rule, law of total probability, etc. Structural properoes Independence, condioonal independence 10/17/16 54

55 Independent RVs IntuiOon: X and Y are independent means that X = x that Y neither makes it more or less probable = y DefiniOon: X and Y are independent iff PX= x Y= y = PX= x PY= y ( ) ( ) ( ) 10/17/16 55

56 More on Independence ( = x = y) = ( = x) ( = y) PX Y PX PY ( = x = y) = ( = x) P( Y = y X = x) = P( Y = y) P X Y P X E.g. no marer how many heads you get, your friend will not be affected, and vice versa 10/17/16 56

57 More on Independence Dr. Yanjun Qi / UVA CS 6316 / f16 X is independent of Y means that knowing Y does not change our belief about X. P(X Y=y) = P(X) P(X=x, Y=y) = P(X=x) P(Y=y) The above should hold for all x i, y j It is symmetric and wriren as!x Y 10/17/16 57

58 CondiOonally Independent RVs IntuiOon: X and Y are condioonally independent given Z means that once Z is known, the value of X does not add any addiional informaoon about Y DefiniOon: X and Y are condioonally independent given Z iff ( = x = y = z) = ( = x = z) ( = y = z) P X Y Z P X Z P Y Z 10/17/16 If holding for all x i, y j, z k! X Y Z 58

59 CondiOonally Independent RVs! X Y Z!X Y 10/17/16 59

60 More on CondiOonal Independence ( = x = y = z) = ( = x = z) ( = y = z) P X Y Z P X Z P Y Z ( = x = y = z) = ( = x = z) P X Y,Z P X Z ( = y = x = z) = ( = y = z) P Y X,Z P Y Z 10/17/16 60

61 Today Recap : Probability Review The big picture : data <-> probabilisoc model Sample space, Events and Event spaces Random variables Joint probability, Marginal probability, condioonal probability, Chain rule, Bayes Rule, Law of total probability, etc. Independence, condioonal independence 10/17/16 61

62 References q Prof. Andrew Moore s review tutorial q Prof. Nando de Freitas s review slides q Prof. Carlos Guestrin recitaoon slides 10/17/16 62

Where are we? è Five major sec3ons of this course

Where are we? è Five major sec3ons of this course UVA CS 4501-001 / 6501 007 Introduc8on to Machine Learning and Data Mining Lecture 12: Probability and Sta3s3cs Review Yanjun Qi / Jane University of Virginia Department of Computer Science 10/02/14 1

More information

UVA CS / Introduc8on to Machine Learning and Data Mining

UVA CS / Introduc8on to Machine Learning and Data Mining UVA CS 4501-001 / 6501 007 Introduc8on to Machine Learning and Data Mining Lecture 13: Probability and Sta3s3cs Review (cont.) + Naïve Bayes Classifier Yanjun Qi / Jane, PhD University of Virginia Department

More information

STAT 598L Probabilistic Graphical Models. Instructor: Sergey Kirshner. Probability Review

STAT 598L Probabilistic Graphical Models. Instructor: Sergey Kirshner. Probability Review STAT 598L Probabilistic Graphical Models Instructor: Sergey Kirshner Probability Review Some slides are taken (or modified) from Carlos Guestrin s 10-708 Probabilistic Graphical Models Fall 2008 at CMU

More information

Basics on Probability. Jingrui He 09/11/2007

Basics on Probability. Jingrui He 09/11/2007 Basics on Probability Jingrui He 09/11/2007 Coin Flips You flip a coin Head with probability 0.5 You flip 100 coins How many heads would you expect Coin Flips cont. You flip a coin Head with probability

More information

CPSC340. Probability. Nando de Freitas September, 2012 University of British Columbia

CPSC340. Probability. Nando de Freitas September, 2012 University of British Columbia CPSC340 Probability Nando de Freitas September, 2012 University of British Columbia Outline of the lecture This lecture is intended at revising probabilistic concepts that play an important role in the

More information

Probability Theory for Machine Learning. Chris Cremer September 2015

Probability Theory for Machine Learning. Chris Cremer September 2015 Probability Theory for Machine Learning Chris Cremer September 2015 Outline Motivation Probability Definitions and Rules Probability Distributions MLE for Gaussian Parameter Estimation MLE and Least Squares

More information

Machine Learning

Machine Learning Machine Learning 10-601 Tom M. Mitchell Machine Learning Department Carnegie Mellon University August 30, 2017 Today: Decision trees Overfitting The Big Picture Coming soon Probabilistic learning MLE,

More information

Computational Genomics

Computational Genomics Computational Genomics http://www.cs.cmu.edu/~02710 Introduction to probability, statistics and algorithms (brief) intro to probability Basic notations Random variable - referring to an element / event

More information

UVA CS 4501: Machine Learning

UVA CS 4501: Machine Learning UVA CS 4501: Machine Learning Lecture 21: Decision Tree / Random Forest / Ensemble Dr. Yanjun Qi University of Virginia Department of Computer Science Where are we? è Five major sections of this course

More information

Data Mining Techniques. Lecture 3: Probability

Data Mining Techniques. Lecture 3: Probability Data Mining Techniques CS 6220 - Section 3 - Fall 2016 Lecture 3: Probability Jan-Willem van de Meent (credit: Zhao, CS 229, Bishop) Project Vote 1. Freeform: Develop your own project proposals 30% of

More information

COMP 328: Machine Learning

COMP 328: Machine Learning COMP 328: Machine Learning Lecture 2: Naive Bayes Classifiers Nevin L. Zhang Department of Computer Science and Engineering The Hong Kong University of Science and Technology Spring 2010 Nevin L. Zhang

More information

Machine Learning Algorithm. Heejun Kim

Machine Learning Algorithm. Heejun Kim Machine Learning Algorithm Heejun Kim June 12, 2018 Machine Learning Algorithms Machine Learning algorithm: a procedure in developing computer programs that improve their performance with experience. Types

More information

Chapter 6: Probability The Study of Randomness

Chapter 6: Probability The Study of Randomness Chapter 6: Probability The Study of Randomness 6.1 The Idea of Probability 6.2 Probability Models 6.3 General Probability Rules 1 Simple Question: If tossing a coin, what is the probability of the coin

More information

CS 361: Probability & Statistics

CS 361: Probability & Statistics February 12, 2018 CS 361: Probability & Statistics Random Variables Monty hall problem Recall the setup, there are 3 doors, behind two of them are indistinguishable goats, behind one is a car. You pick

More information

Machine Learning

Machine Learning Machine Learning 10-701 Tom M. Mitchell Machine Learning Department Carnegie Mellon University January 13, 2011 Today: The Big Picture Overfitting Review: probability Readings: Decision trees, overfiting

More information

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019 Lecture 10: Probability distributions DANIEL WELLER TUESDAY, FEBRUARY 19, 2019 Agenda What is probability? (again) Describing probabilities (distributions) Understanding probabilities (expectation) Partial

More information

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events LECTURE 1 1 Introduction The first part of our adventure is a highly selective review of probability theory, focusing especially on things that are most useful in statistics. 1.1 Sample spaces and events

More information

Dept. of Linguistics, Indiana University Fall 2015

Dept. of Linguistics, Indiana University Fall 2015 L645 Dept. of Linguistics, Indiana University Fall 2015 1 / 34 To start out the course, we need to know something about statistics and This is only an introduction; for a fuller understanding, you would

More information

Recitation 2: Probability

Recitation 2: Probability Recitation 2: Probability Colin White, Kenny Marino January 23, 2018 Outline Facts about sets Definitions and facts about probability Random Variables and Joint Distributions Characteristics of distributions

More information

Introduction to Probability and Statistics (Continued)

Introduction to Probability and Statistics (Continued) Introduction to Probability and Statistics (Continued) Prof. icholas Zabaras Center for Informatics and Computational Science https://cics.nd.edu/ University of otre Dame otre Dame, Indiana, USA Email:

More information

Lecture 1: Basics of Probability

Lecture 1: Basics of Probability Lecture 1: Basics of Probability (Luise-Vitetta, Chapter 8) Why probability in data science? Data acquisition is noisy Sampling/quantization external factors: If you record your voice saying machine learning

More information

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag

Decision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag Decision Trees Nicholas Ruozzi University of Texas at Dallas Based on the slides of Vibhav Gogate and David Sontag Supervised Learning Input: labelled training data i.e., data plus desired output Assumption:

More information

Lecture 10: Bayes' Theorem, Expected Value and Variance Lecturer: Lale Özkahya

Lecture 10: Bayes' Theorem, Expected Value and Variance Lecturer: Lale Özkahya BBM 205 Discrete Mathematics Hacettepe University http://web.cs.hacettepe.edu.tr/ bbm205 Lecture 10: Bayes' Theorem, Expected Value and Variance Lecturer: Lale Özkahya Resources: Kenneth Rosen, Discrete

More information

Probability. Machine Learning and Pattern Recognition. Chris Williams. School of Informatics, University of Edinburgh. August 2014

Probability. Machine Learning and Pattern Recognition. Chris Williams. School of Informatics, University of Edinburgh. August 2014 Probability Machine Learning and Pattern Recognition Chris Williams School of Informatics, University of Edinburgh August 2014 (All of the slides in this course have been adapted from previous versions

More information

Machine Learning. Bayes Basics. Marc Toussaint U Stuttgart. Bayes, probabilities, Bayes theorem & examples

Machine Learning. Bayes Basics. Marc Toussaint U Stuttgart. Bayes, probabilities, Bayes theorem & examples Machine Learning Bayes Basics Bayes, probabilities, Bayes theorem & examples Marc Toussaint U Stuttgart So far: Basic regression & classification methods: Features + Loss + Regularization & CV All kinds

More information

Introduction to Stochastic Processes

Introduction to Stochastic Processes Stat251/551 (Spring 2017) Stochastic Processes Lecture: 1 Introduction to Stochastic Processes Lecturer: Sahand Negahban Scribe: Sahand Negahban 1 Organization Issues We will use canvas as the course webpage.

More information

PAC-learning, VC Dimension and Margin-based Bounds

PAC-learning, VC Dimension and Margin-based Bounds More details: General: http://www.learning-with-kernels.org/ Example of more complex bounds: http://www.research.ibm.com/people/t/tzhang/papers/jmlr02_cover.ps.gz PAC-learning, VC Dimension and Margin-based

More information

Machine Learning: Probability Theory

Machine Learning: Probability Theory Machine Learning: Probability Theory Prof. Dr. Martin Riedmiller Albert-Ludwigs-University Freiburg AG Maschinelles Lernen Theories p.1/28 Probabilities probabilistic statements subsume different effects

More information

Review of probability

Review of probability Review of probability Computer Sciences 760 Spring 2014 http://pages.cs.wisc.edu/~dpage/cs760/ Goals for the lecture you should understand the following concepts definition of probability random variables

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Lecture 2 Probability Copyright 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 3-1 3.1 Definition Random Experiment a process leading to an uncertain

More information

Review of Probability. CS1538: Introduction to Simulations

Review of Probability. CS1538: Introduction to Simulations Review of Probability CS1538: Introduction to Simulations Probability and Statistics in Simulation Why do we need probability and statistics in simulation? Needed to validate the simulation model Needed

More information

Probability Review Lecturer: Ji Liu Thank Jerry Zhu for sharing his slides

Probability Review Lecturer: Ji Liu Thank Jerry Zhu for sharing his slides Probability Review Lecturer: Ji Liu Thank Jerry Zhu for sharing his slides slide 1 Inference with Bayes rule: Example In a bag there are two envelopes one has a red ball (worth $100) and a black ball one

More information

ELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables

ELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables Department of Electrical Engineering University of Arkansas ELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables Dr. Jingxian Wu wuj@uark.edu OUTLINE 2 Random Variable Discrete Random

More information

SDS 321: Introduction to Probability and Statistics

SDS 321: Introduction to Probability and Statistics SDS 321: Introduction to Probability and Statistics Lecture 13: Expectation and Variance and joint distributions Purnamrita Sarkar Department of Statistics and Data Science The University of Texas at Austin

More information

COS402- Artificial Intelligence Fall Lecture 10: Bayesian Networks & Exact Inference

COS402- Artificial Intelligence Fall Lecture 10: Bayesian Networks & Exact Inference COS402- Artificial Intelligence Fall 2015 Lecture 10: Bayesian Networks & Exact Inference Outline Logical inference and probabilistic inference Independence and conditional independence Bayes Nets Semantics

More information

Brief Review of Probability

Brief Review of Probability Brief Review of Probability Nuno Vasconcelos (Ken Kreutz-Delgado) ECE Department, UCSD Probability Probability theory is a mathematical language to deal with processes or experiments that are non-deterministic

More information

Naïve Bayes classification

Naïve Bayes classification Naïve Bayes classification 1 Probability theory Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. Examples: A person s height, the outcome of a coin toss

More information

2/3/04. Syllabus. Probability Lecture #2. Grading. Probability Theory. Events and Event Spaces. Experiments and Sample Spaces

2/3/04. Syllabus. Probability Lecture #2. Grading. Probability Theory. Events and Event Spaces. Experiments and Sample Spaces Probability Lecture #2 Introduction to Natural Language Processing CMPSCI 585, Spring 2004 University of Massachusetts Amherst Andrew McCallum Syllabus Probability and Information Theory Spam filtering

More information

STAT Chapter 3: Probability

STAT Chapter 3: Probability Basic Definitions STAT 515 --- Chapter 3: Probability Experiment: A process which leads to a single outcome (called a sample point) that cannot be predicted with certainty. Sample Space (of an experiment):

More information

Grundlagen der Künstlichen Intelligenz

Grundlagen der Künstlichen Intelligenz Grundlagen der Künstlichen Intelligenz Uncertainty & Probabilities & Bandits Daniel Hennes 16.11.2017 (WS 2017/18) University Stuttgart - IPVS - Machine Learning & Robotics 1 Today Uncertainty Probability

More information

CS 4649/7649 Robot Intelligence: Planning

CS 4649/7649 Robot Intelligence: Planning CS 4649/7649 Robot Intelligence: Planning Probability Primer Sungmoon Joo School of Interactive Computing College of Computing Georgia Institute of Technology S. Joo (sungmoon.joo@cc.gatech.edu) 1 *Slides

More information

Naïve Bayes. Jia-Bin Huang. Virginia Tech Spring 2019 ECE-5424G / CS-5824

Naïve Bayes. Jia-Bin Huang. Virginia Tech Spring 2019 ECE-5424G / CS-5824 Naïve Bayes Jia-Bin Huang ECE-5424G / CS-5824 Virginia Tech Spring 2019 Administrative HW 1 out today. Please start early! Office hours Chen: Wed 4pm-5pm Shih-Yang: Fri 3pm-4pm Location: Whittemore 266

More information

Review of Basic Probability

Review of Basic Probability Review of Basic Probability Erik G. Learned-Miller Department of Computer Science University of Massachusetts, Amherst Amherst, MA 01003 September 16, 2009 Abstract This document reviews basic discrete

More information

Introduction to Probability

Introduction to Probability Introduction to Probability Gambling at its core 16th century Cardano: Books on Games of Chance First systematic treatment of probability 17th century Chevalier de Mere posed a problem to his friend Pascal.

More information

CIVL 7012/8012. Basic Laws and Axioms of Probability

CIVL 7012/8012. Basic Laws and Axioms of Probability CIVL 7012/8012 Basic Laws and Axioms of Probability Why are we studying probability and statistics? How can we quantify risks of decisions based on samples from a population? How should samples be selected

More information

Intelligent Systems (AI-2)

Intelligent Systems (AI-2) Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 19 Oct, 23, 2015 Slide Sources Raymond J. Mooney University of Texas at Austin D. Koller, Stanford CS - Probabilistic Graphical Models D. Page,

More information

Conditional Probability

Conditional Probability Conditional Probability Idea have performed a chance experiment but don t know the outcome (ω), but have some partial information (event A) about ω. Question: given this partial information what s the

More information

Lecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable

Lecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable Lecture Notes 1 Probability and Random Variables Probability Spaces Conditional Probability and Independence Random Variables Functions of a Random Variable Generation of a Random Variable Jointly Distributed

More information

Lecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable

Lecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable Lecture Notes 1 Probability and Random Variables Probability Spaces Conditional Probability and Independence Random Variables Functions of a Random Variable Generation of a Random Variable Jointly Distributed

More information

CS626 Data Analysis and Simulation

CS626 Data Analysis and Simulation CS626 Data Analysis and Simulation Instructor: Peter Kemper R 104A, phone 221-3462, email:kemper@cs.wm.edu Today: Probability Primer Quick Reference: Sheldon Ross: Introduction to Probability Models 9th

More information

Introduction to Machine Learning

Introduction to Machine Learning Introduction to Machine Learning Introduction to Probabilistic Methods Varun Chandola Computer Science & Engineering State University of New York at Buffalo Buffalo, NY, USA chandola@buffalo.edu Chandola@UB

More information

Decision making and problem solving Lecture 1. Review of basic probability Monte Carlo simulation

Decision making and problem solving Lecture 1. Review of basic probability Monte Carlo simulation Decision making and problem solving Lecture 1 Review of basic probability Monte Carlo simulation Why probabilities? Most decisions involve uncertainties Probability theory provides a rigorous framework

More information

Midterm Review CS 6375: Machine Learning. Vibhav Gogate The University of Texas at Dallas

Midterm Review CS 6375: Machine Learning. Vibhav Gogate The University of Texas at Dallas Midterm Review CS 6375: Machine Learning Vibhav Gogate The University of Texas at Dallas Machine Learning Supervised Learning Unsupervised Learning Reinforcement Learning Parametric Y Continuous Non-parametric

More information

Be able to define the following terms and answer basic questions about them:

Be able to define the following terms and answer basic questions about them: CS440/ECE448 Section Q Fall 2017 Final Review Be able to define the following terms and answer basic questions about them: Probability o Random variables, axioms of probability o Joint, marginal, conditional

More information

Intelligent Systems (AI-2)

Intelligent Systems (AI-2) Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 11 Oct, 3, 2016 CPSC 422, Lecture 11 Slide 1 422 big picture: Where are we? Query Planning Deterministic Logics First Order Logics Ontologies

More information

Probability Theory. Prof. Dr. Martin Riedmiller AG Maschinelles Lernen Albert-Ludwigs-Universität Freiburg.

Probability Theory. Prof. Dr. Martin Riedmiller AG Maschinelles Lernen Albert-Ludwigs-Universität Freiburg. Probability Theory Prof. Dr. Martin Riedmiller AG Maschinelles Lernen Albert-Ludwigs-Universität Freiburg riedmiller@informatik.uni-freiburg.de Prof. Dr. Martin Riedmiller Machine Learning Lab, University

More information

Introduction to Bayesian Learning

Introduction to Bayesian Learning Course Information Introduction Introduction to Bayesian Learning Davide Bacciu Dipartimento di Informatica Università di Pisa bacciu@di.unipi.it Apprendimento Automatico: Fondamenti - A.A. 2016/2017 Outline

More information

Naïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability

Naïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability Probability theory Naïve Bayes classification Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. s: A person s height, the outcome of a coin toss Distinguish

More information

Introduction to probability

Introduction to probability Introduction to probability 4.1 The Basics of Probability Probability The chance that a particular event will occur The probability value will be in the range 0 to 1 Experiment A process that produces

More information

Probability Review. Chao Lan

Probability Review. Chao Lan Probability Review Chao Lan Let s start with a single random variable Random Experiment A random experiment has three elements 1. sample space Ω: set of all possible outcomes e.g.,ω={1,2,3,4,5,6} 2. event

More information

UVA CS / Introduc8on to Machine Learning and Data Mining. Lecture 9: Classifica8on with Support Vector Machine (cont.

UVA CS / Introduc8on to Machine Learning and Data Mining. Lecture 9: Classifica8on with Support Vector Machine (cont. UVA CS 4501-001 / 6501 007 Introduc8on to Machine Learning and Data Mining Lecture 9: Classifica8on with Support Vector Machine (cont.) Yanjun Qi / Jane University of Virginia Department of Computer Science

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Chapter 8&9: Classification: Part 3 Instructor: Yizhou Sun yzsun@ccs.neu.edu March 12, 2013 Midterm Report Grade Distribution 90-100 10 80-89 16 70-79 8 60-69 4

More information

Lecture 9: Naive Bayes, SVM, Kernels. Saravanan Thirumuruganathan

Lecture 9: Naive Bayes, SVM, Kernels. Saravanan Thirumuruganathan Lecture 9: Naive Bayes, SVM, Kernels Instructor: Outline 1 Probability basics 2 Probabilistic Interpretation of Classification 3 Bayesian Classifiers, Naive Bayes 4 Support Vector Machines Probability

More information

COS 424: Interacting with Data. Lecturer: Dave Blei Lecture #2 Scribe: Ellen Kim February 7, 2008

COS 424: Interacting with Data. Lecturer: Dave Blei Lecture #2 Scribe: Ellen Kim February 7, 2008 COS 424: Interacting with Data Lecturer: Dave Blei Lecture #2 Scribe: Ellen Kim February 7, 2008 1 Probability Review 1.1 Random Variables A random variable can be used to model any probabilistic outcome.

More information

Probability Theory. Introduction to Probability Theory. Principles of Counting Examples. Principles of Counting. Probability spaces.

Probability Theory. Introduction to Probability Theory. Principles of Counting Examples. Principles of Counting. Probability spaces. Probability Theory To start out the course, we need to know something about statistics and probability Introduction to Probability Theory L645 Advanced NLP Autumn 2009 This is only an introduction; for

More information

Lecture Notes 1 Basic Probability. Elements of Probability. Conditional probability. Sequential Calculation of Probability

Lecture Notes 1 Basic Probability. Elements of Probability. Conditional probability. Sequential Calculation of Probability Lecture Notes 1 Basic Probability Set Theory Elements of Probability Conditional probability Sequential Calculation of Probability Total Probability and Bayes Rule Independence Counting EE 178/278A: Basic

More information

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14 CS 70 Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14 Introduction One of the key properties of coin flips is independence: if you flip a fair coin ten times and get ten

More information

Randomized Algorithms

Randomized Algorithms Randomized Algorithms Prof. Tapio Elomaa tapio.elomaa@tut.fi Course Basics A new 4 credit unit course Part of Theoretical Computer Science courses at the Department of Mathematics There will be 4 hours

More information

Midterm Review CS 7301: Advanced Machine Learning. Vibhav Gogate The University of Texas at Dallas

Midterm Review CS 7301: Advanced Machine Learning. Vibhav Gogate The University of Texas at Dallas Midterm Review CS 7301: Advanced Machine Learning Vibhav Gogate The University of Texas at Dallas Supervised Learning Issues in supervised learning What makes learning hard Point Estimation: MLE vs Bayesian

More information

Announcements. CS 188: Artificial Intelligence Spring Probability recap. Outline. Bayes Nets: Big Picture. Graphical Model Notation

Announcements. CS 188: Artificial Intelligence Spring Probability recap. Outline. Bayes Nets: Big Picture. Graphical Model Notation CS 188: Artificial Intelligence Spring 2010 Lecture 15: Bayes Nets II Independence 3/9/2010 Pieter Abbeel UC Berkeley Many slides over the course adapted from Dan Klein, Stuart Russell, Andrew Moore Current

More information

Basic Probability and Information Theory: quick revision

Basic Probability and Information Theory: quick revision Basic Probability and Information Theory: quick revision ML for NLP Lecturer: S Luz http://www.scss.tcd.ie/~luzs/t/cs4ll4/ February 17, 2015 In these notes we review the basics of probability theory and

More information

Review of Probability Mark Craven and David Page Computer Sciences 760.

Review of Probability Mark Craven and David Page Computer Sciences 760. Review of Probability Mark Craven and David Page Computer Sciences 760 www.biostat.wisc.edu/~craven/cs760/ Goals for the lecture you should understand the following concepts definition of probability random

More information

Discrete Random Variable

Discrete Random Variable Discrete Random Variable Outcome of a random experiment need not to be a number. We are generally interested in some measurement or numerical attribute of the outcome, rather than the outcome itself. n

More information

Generative Learning. INFO-4604, Applied Machine Learning University of Colorado Boulder. November 29, 2018 Prof. Michael Paul

Generative Learning. INFO-4604, Applied Machine Learning University of Colorado Boulder. November 29, 2018 Prof. Michael Paul Generative Learning INFO-4604, Applied Machine Learning University of Colorado Boulder November 29, 2018 Prof. Michael Paul Generative vs Discriminative The classification algorithms we have seen so far

More information

STAT:5100 (22S:193) Statistical Inference I

STAT:5100 (22S:193) Statistical Inference I STAT:5100 (22S:193) Statistical Inference I Week 3 Luke Tierney University of Iowa Fall 2015 Luke Tierney (U Iowa) STAT:5100 (22S:193) Statistical Inference I Fall 2015 1 Recap Matching problem Generalized

More information

Lecture Lecture 5

Lecture Lecture 5 Lecture 4 --- Lecture 5 A. Basic Concepts (4.1-4.2) 1. Experiment: A process of observing a phenomenon that has variation in its outcome. Examples: (E1). Rolling a die, (E2). Drawing a card form a shuffled

More information

Course Introduction. Probabilistic Modelling and Reasoning. Relationships between courses. Dealing with Uncertainty. Chris Williams.

Course Introduction. Probabilistic Modelling and Reasoning. Relationships between courses. Dealing with Uncertainty. Chris Williams. Course Introduction Probabilistic Modelling and Reasoning Chris Williams School of Informatics, University of Edinburgh September 2008 Welcome Administration Handout Books Assignments Tutorials Course

More information

STAT2201. Analysis of Engineering & Scientific Data. Unit 3

STAT2201. Analysis of Engineering & Scientific Data. Unit 3 STAT2201 Analysis of Engineering & Scientific Data Unit 3 Slava Vaisman The University of Queensland School of Mathematics and Physics What we learned in Unit 2 (1) We defined a sample space of a random

More information

Review of Probabilities and Basic Statistics

Review of Probabilities and Basic Statistics Alex Smola Barnabas Poczos TA: Ina Fiterau 4 th year PhD student MLD Review of Probabilities and Basic Statistics 10-701 Recitations 1/25/2013 Recitation 1: Statistics Intro 1 Overview Introduction to

More information

Consider an experiment that may have different outcomes. We are interested to know what is the probability of a particular set of outcomes.

Consider an experiment that may have different outcomes. We are interested to know what is the probability of a particular set of outcomes. CMSC 310 Artificial Intelligence Probabilistic Reasoning and Bayesian Belief Networks Probabilities, Random Variables, Probability Distribution, Conditional Probability, Joint Distributions, Bayes Theorem

More information

Fundamentals. CS 281A: Statistical Learning Theory. Yangqing Jia. August, Based on tutorial slides by Lester Mackey and Ariel Kleiner

Fundamentals. CS 281A: Statistical Learning Theory. Yangqing Jia. August, Based on tutorial slides by Lester Mackey and Ariel Kleiner Fundamentals CS 281A: Statistical Learning Theory Yangqing Jia Based on tutorial slides by Lester Mackey and Ariel Kleiner August, 2011 Outline 1 Probability 2 Statistics 3 Linear Algebra 4 Optimization

More information

Machine Learning

Machine Learning Machine Learning 10-601 Tom M. Mitchell Machine Learning Department Carnegie Mellon University September 22, 2011 Today: MLE and MAP Bayes Classifiers Naïve Bayes Readings: Mitchell: Naïve Bayes and Logistic

More information

I - Probability. What is Probability? the chance of an event occuring. 1classical probability. 2empirical probability. 3subjective probability

I - Probability. What is Probability? the chance of an event occuring. 1classical probability. 2empirical probability. 3subjective probability What is Probability? the chance of an event occuring eg 1classical probability 2empirical probability 3subjective probability Section 2 - Probability (1) Probability - Terminology random (probability)

More information

Bayes Classifiers. CAP5610 Machine Learning Instructor: Guo-Jun QI

Bayes Classifiers. CAP5610 Machine Learning Instructor: Guo-Jun QI Bayes Classifiers CAP5610 Machine Learning Instructor: Guo-Jun QI Recap: Joint distributions Joint distribution over Input vector X = (X 1, X 2 ) X 1 =B or B (drinking beer or not) X 2 = H or H (headache

More information

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008 Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008 1 Review We saw some basic metrics that helped us characterize

More information

CIVL Why are we studying probability and statistics? Learning Objectives. Basic Laws and Axioms of Probability

CIVL Why are we studying probability and statistics? Learning Objectives. Basic Laws and Axioms of Probability CIVL 3103 Basic Laws and Axioms of Probability Why are we studying probability and statistics? How can we quantify risks of decisions based on samples from a population? How should samples be selected

More information

Probability & statistics for linguists Class 2: more probability. D. Lassiter (h/t: R. Levy)

Probability & statistics for linguists Class 2: more probability. D. Lassiter (h/t: R. Levy) Probability & statistics for linguists Class 2: more probability D. Lassiter (h/t: R. Levy) conditional probability P (A B) = when in doubt about meaning: draw pictures. P (A \ B) P (B) keep B- consistent

More information

Be able to define the following terms and answer basic questions about them:

Be able to define the following terms and answer basic questions about them: CS440/ECE448 Fall 2016 Final Review Be able to define the following terms and answer basic questions about them: Probability o Random variables o Axioms of probability o Joint, marginal, conditional probability

More information

Lecture 2: Repetition of probability theory and statistics

Lecture 2: Repetition of probability theory and statistics Algorithms for Uncertainty Quantification SS8, IN2345 Tobias Neckel Scientific Computing in Computer Science TUM Lecture 2: Repetition of probability theory and statistics Concept of Building Block: Prerequisites:

More information

Introduc)on to Ar)ficial Intelligence

Introduc)on to Ar)ficial Intelligence Introduc)on to Ar)ficial Intelligence Lecture 10 Probability CS/CNS/EE 154 Andreas Krause Announcements! Milestone due Nov 3. Please submit code to TAs! Grading: PacMan! Compiles?! Correct? (Will clear

More information

Lecture 1: Probability Fundamentals

Lecture 1: Probability Fundamentals Lecture 1: Probability Fundamentals IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge January 22nd, 2008 Rasmussen (CUED) Lecture 1: Probability

More information

V7 Foundations of Probability Theory

V7 Foundations of Probability Theory V7 Foundations of Probability Theory Probability : degree of confidence that an event of an uncertain nature will occur. Events : we will assume that there is an agreed upon space of possible outcomes

More information

Intelligent Systems (AI-2)

Intelligent Systems (AI-2) Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 19 Oct, 24, 2016 Slide Sources Raymond J. Mooney University of Texas at Austin D. Koller, Stanford CS - Probabilistic Graphical Models D. Page,

More information

Terminology. Experiment = Prior = Posterior =

Terminology. Experiment = Prior = Posterior = Review: probability RVs, events, sample space! Measures, distributions disjoint union property (law of total probability book calls this sum rule ) Sample v. population Law of large numbers Marginals,

More information

Bayesian Networks Representation

Bayesian Networks Representation Bayesian Networks Representation Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University March 19 th, 2007 Handwriting recognition Character recognition, e.g., kernel SVMs a c z rr r r

More information

EE 178 Lecture Notes 0 Course Introduction. About EE178. About Probability. Course Goals. Course Topics. Lecture Notes EE 178

EE 178 Lecture Notes 0 Course Introduction. About EE178. About Probability. Course Goals. Course Topics. Lecture Notes EE 178 EE 178 Lecture Notes 0 Course Introduction About EE178 About Probability Course Goals Course Topics Lecture Notes EE 178: Course Introduction Page 0 1 EE 178 EE 178 provides an introduction to probabilistic

More information

Probability Lecture #7

Probability Lecture #7 Probability Lecture #7 Introduction to Natural Language Processing CMPSCI 585, Fall 2007 University of Massachusetts Amherst Andrew McCallum Today s Main Points Remember (or learn) about probability theory

More information

CS 630 Basic Probability and Information Theory. Tim Campbell

CS 630 Basic Probability and Information Theory. Tim Campbell CS 630 Basic Probability and Information Theory Tim Campbell 21 January 2003 Probability Theory Probability Theory is the study of how best to predict outcomes of events. An experiment (or trial or event)

More information

Machine Learning

Machine Learning Machine Learning 10-601 Tom M. Mitchell Machine Learning Department Carnegie Mellon University January 14, 2015 Today: The Big Picture Overfitting Review: probability Readings: Decision trees, overfiting

More information

UCSD CSE 21, Spring 2014 [Section B00] Mathematics for Algorithm and System Analysis

UCSD CSE 21, Spring 2014 [Section B00] Mathematics for Algorithm and System Analysis UCSD CSE 21, Spring 2014 [Section B00] Mathematics for Algorithm and System Analysis Lecture 8 Class URL: http://vlsicad.ucsd.edu/courses/cse21-s14/ Lecture 8 Notes Goals for Today Counting Partitions

More information