UVA CS 6316/4501 Fall 2016 Machine Learning. Lecture 11: Probability Review. Dr. Yanjun Qi. University of Virginia. Department of Computer Science
|
|
- Jonah Conley
- 5 years ago
- Views:
Transcription
1 UVA CS 6316/4501 Fall 2016 Machine Learning Lecture 11: Probability Review 10/17/16 Dr. Yanjun Qi University of Virginia Department of Computer Science 1
2 Announcements: Schedule Midterm Nov. 26 Wed / 3:30pm 4:45pm / open notes HW4 includes sample midterm quesoons Grading of HW1 is available on Collab SoluOon of HW1 is available on Collab Grading of HW2 will be available next week SoluOon of HW2 will be available next week 10/17/16 2
3 Where are we? è Five major secoons of this course q Regression (supervised) q ClassificaOon (supervised) q Unsupervised models q Learning theory q Graphical models 10/17/16 3
4 Where are we? è Three major secoons for classificaoon We can divide the large variety of classification approaches into roughly three major types 1. Discriminative - directly estimate a decision rule/boundary - e.g., support vector machine, decision tree 2. Generative: - build a generative statistical model - e.g., naïve bayes classifier, Bayesian networks 3. Instance based classifiers - Use observation directly (no models) - e.g. K nearest neighbors 10/17/16 4
5 Today : Probability Review Dr. Yanjun Qi / UVA CS 6316 / f16 The big picture Events and Event spaces Random variables Joint probability, MarginalizaOon, condiooning, chain rule, Bayes Rule, law of total probability, etc. Structural properoes Independence, condioonal independence 10/17/16 5
6 The Big Picture Probability Model i.e. Data generaong process Observed Data EsOmaOon / learning / Inference / Data mining But how to specify a model? 10/17/16 6
7 Probability as frequency Consider the following quesoons: 1. What is the probability that when I flip a coin it is heads? 2. why? We can count è ~1/2 3. What is the probability of Blue Ridge Mountains to have an erupong volcano in the near future? è could not count Message: The frequen*st view is very useful, but it seems that we also use domain knowledge to come up with probabili*es. 10/17/16 7 Adapt from Prof. Nando de Freitas s review slides
8 Probability as a measure of uncertainty Imagine we are throwing darts at a wall of size 1x1 and that all darts are guaranteed to fall within this 1x1 wall. What is the probability that a dart will hit the shaded area? Dr. Yanjun Qi / UVA CS 6316 / f16 10/17/16 8 Adapt from Prof. Nando de Freitas s review slides
9 Probability as a measure of uncertainty Probability is a measure of certainty of an event taking place. Dr. Yanjun Qi / UVA CS 6316 / f16 i.e. in the example, we were measuring the chances of hi?ng the shaded area. prob = # RedBoxes # Boxes 10/17/16 9 Adapt from Prof. Nando de Freitas s review slides
10 Today : Probability Review Dr. Yanjun Qi / UVA CS 6316 / f16 The big picture Sample space, Event and Event spaces Random variables Joint probability, MarginalizaOon, condiooning, chain rule, Bayes Rule, law of total probability, etc. Structural properoes Independence, condioonal independence 10/17/16 10
11 Probability O die = {1,2,3,4,5,6} O: Elementary Event Throw a 2 The elements of W are called elementary events. 10/17/16 11
12 Probability Probability allows us to measure many events. The events are subsets of the sample space O. For example, for a die we may consider the following events: e.g., GREATER = {5, 6} EVEN = {2, 4, 6} Assign probabili7es to these events: e.g., P(EVEN) = 1/2 10/17/16 12 Adapt from Prof. Nando de Freitas s review slides
13 Sample space and Events Dr. Yanjun Qi / UVA CS 6316 / f16 O : Sample Space, result of an experiment / set of all outcomes If you toss a coin twice O = {HH,HT,TH,TT} Event: a subset of O First toss is head = {HH,HT} S: event space, a set of events: 10/17/16 13 Contains the empty event and O
14 Axioms for Probability Dr. Yanjun Qi / UVA CS 6316 / f16 Defined over (O,S) s.t. 1 >= P(a) >= 0 for all a in S P(O) = 1 If A, B are disjoint, then P(A U B) = p(a) + p(b) 10/17/16 14
15 Axioms for Probability P(O) = P( B ) i B 5 B 3 B 2 B 4 B 7 B 6 B 1 10/17/16 15
16 OR operaoon for Probability We can deduce other axioms from the above ones Ex: P(A U B) for non-disjoint events P( Union of A and B) 10/17/16 16
17 A 10/17/16 17
18 P( IntersecOon of A and B) B A 10/17/16 18
19 B A 10/17/16 19
20 A CondiOonal Probability Dr. Yanjun Qi / UVA CS 6316 / f16 B from Prof. Nando de Freitas s review 10/17/16 20
21 CondiOonal Probability / Chain Rule More ways to write out chain rule P(A,B) = p(b A)p(A) P(A,B) = p(a B)p(B) 10/17/16 21
22 Rule of total probability => MarginalizaOon Dr. Yanjun Qi / UVA CS 6316 / f16 B 5 B 3 B 2 B 4 A B 7 B 6 B 1 p A ( )P( A B ) WHY??? i ( ) = P B i 10/17/16 22
23 Today : Probability Review Dr. Yanjun Qi / UVA CS 6316 / f16 The big picture Events and Event spaces Random variables Joint probability, MarginalizaOon, condiooning, chain rule, Bayes Rule, law of total probability, etc. Structural properoes Independence, condioonal independence 10/17/16 23
24 From Events to Random Variable Concise way of specifying arributes of outcomes Modeling students (Grade and Intelligence): O = all possible students (sample space) What are events (subset of sample space) Grade_A = all students with grade A Grade_B = all students with grade B HardWorking_Yes = who works hard Very cumbersome Need funcoons that maps from O to an arribute space T. P(H = YES) = P({student ϵ O : H(student) = YES}) 10/17/16 24
25 O Random Variables (RV) H: hardworking Yes No Dr. Yanjun Qi / UVA CS 6316 / f16 G:Grade A B A+ P(H = Yes) = P( {all students who is working hard on the course}) funcoons that maps from O to an arribute space T. 10/17/16 25
26 NotaOon Digression P(A) is shorthand for P(A=true) P(~A) is shorthand for P(A=false) Same notaoon applies to other binary RVs: P(Gender=M), P(Gender=F) Same notaoon applies to mul*valued RVs: P(Major=history), P(Age=19), P(Q=c) Note: upper case lerers/names for variables, lower case lerers/names for values 10/17/16 26
27 Discrete Random Variables Random variables (RVs) which may take on only a countable number of disinct values X is a RV with arity k if it can take on exactly one value out of {x 1,, x k } 10/17/16 27
28 Probability of Discrete RV Probability mass funcoon (pmf): P(X = x i ) Easy facts about pmf Σ i P(X = x i ) = 1 P(X = x i X = x j ) = 0 if i j P(X = x i U X = x j ) = P(X = x i ) + P(X = x j ) if i j P(X = x 1 U X = x 2 U U X = x k ) = 1 10/17/16 28
29 e.g. Coin Flips You flip a coin Head with probability 0.5 You flip 100 coins How many heads would you expect 10/17/16 29
30 e.g. Coin Flips cont. You flip a coin Head with probability p Binary random variable Bernoulli trial with success probability p You flip k coins How many heads would you expect Number of heads X: discrete random variable Binomial distribuoon with parameters k and p 10/17/16 30
31 Discrete Random Variables Random variables (RVs) which may take on only a countable number of disinct values E.g. the total number of heads X you get if you flip 100 coins X is a RV with arity k if it can take on exactly one value out of { x 1,, x } k E.g. the possible values that X can take on are 0, 1, 2,, /17/16 31
32 e.g., two Common DistribuOons Uniform X U 1,..., N X takes values 1, 2,, N PX ( = i) = 1N E.g. picking balls of different colors from a box Binomial X takes values 0, 1,, k 10/17/16 P( X = i) = X Bin k, p k i E.g. coin flips k Omes ( ) pi 1 p ( ) k i 32
33 Today : Probability Review Dr. Yanjun Qi / UVA CS 6316 / f16 The big picture Events and Event spaces Random variables Joint probability, MarginalizaOon, condiooning, chain rule, Bayes Rule, law of total probability, etc. Structural properoes Independence, condioonal independence 10/17/16 33
34 e.g., Coin Flips by Two Persons Your friend and you both flip coins Head with probability 0.5 You flip 50 Omes; your friend flip 100 Omes How many heads will both of you get 10/17/16 34
35 Joint DistribuOon Given two discrete RVs X and Y, their joint distribuion is the distribuoon of X and Y together E.g. P(You get 21 heads AND you friend get 70 heads) E.g. sum ( x y) /17/16 i= 0 j= 0 x y PX= Y= = 1 ( i j ) PYou get heads AND your friend get heads = 1 35
36 Joint DistribuOon Given two discrete RVs X and Y, their joint distribuion is the distribuoon of X and Y together E.g. P(You get 21 heads AND you friend get 70 heads) E.g. sum ( x y) /17/16 i= 0 j= 0 x y PX= Y= = 1 ( i j ) PYou get heads AND your friend get heads = 1 36
37 CondiOonal Probability ( ) P X = x Y = y is the probability of X = x, given the occurrence of Y = y E.g. you get 0 heads, given that your friend gets 61 heads P X ( x Y y) = = = ( = x Y= y) P X ( = y) P Y 10/17/16 37
38 Law of Total Probability Given two discrete RVs X and Y, which take values in { x and, We have 1,, x } { m y 1,, y } n ( = x ) ( ) i = = xi = yj PX PX Y j ( x ) ( ) i yj yj = PX= Y= PY= j 10/17/16 38
39 MarginalizaOon B 4 B 5 B 3 B 2 A B 7 B 6 B 1 Marginal Probability Law of Total Probability ( = x ) ( ) i = = xi = yj PX PX Y j ( x ) ( ) i yj yj = PX= Y= PY= j CondiOonal Probability Marginal Probability 10/17/16 39
40 Simplify NotaOon: Dr. Yanjun Qi / UVA CS 6316 / f16 CondiOonal Probability events ( x Y y) P X = = = ( = x Y= y) P X ( = y) P Y But we will always write it this way: ( x y) P = p( x, y) p( y) X=x Y=y P(X=x true) -> P(X=x) -> P(x) 10/17/16 40
41 Simplify NotaOon: MarginalizaOon We know p(x, Y), what is P(X=x)? We can use the law of total probability, why? Dr. Yanjun Qi / UVA CS 6316 / f16 p ( x) = P( x, y) = y y P ( y) P( x y) B 4 B 7 B 5 B 6 A B 3 B 1 B 2 10/17/16 41
42 MarginalizaOon Cont. Another example ( ) ( ) ( ) ( ) = = y z z y z y x P z y P z y x P x p,,,,,, 10/17/16 42 Dr. Yanjun Qi / UVA CS 6316 / f16
43 Bayes Rule We know that P(rain) = 0.5 If we also know that the grass is wet, then how this affects our belief about whether it rains or not? P( rain wet) = P(rain)P(wet rain) P(wet) 10/17/16 43
44 Bayes Rule We know that P(rain) = 0.5 If we also know that the grass is wet, then how this affects our belief about whether it rains or not? P( rain wet) = P(rain)P(wet rain) P(wet) P( x y) = P(x)P(y x) P(y) 10/17/16 44
45 Bayes Rule X and Y are discrete RVs ( x Y y) P X = = = ( = x Y= y) P X ( = y) P Y ( x Y ) i yj P X 10/17/16 = = = ( = y ) ( ) j = xi = xi P( Y = y X ) P( X ) j = xk = xk P Y X P X k 45
46 10/17/16 46
47 P(A = a 1 B) = P(B A = a 1 )P(A = a 1 ) P(B A = a i )P(A = a i ) i 10/17/16 47
48 Bayes Rule cont. You can condioon on more variables ( ) ) ( ), ( ) (, z y P z x y P z x P z y x P = 10/17/16 48 Dr. Yanjun Qi / UVA CS 6316 / f16
49 CondiOonal Probability Example 10/17/16 49 Adapt from Prof. Nando de Freitas s review slides
50 CondiOonal Probability Example 10/17/16 50 Adapt from Prof. Nando de Freitas s review slides
51 CondiOonal Probability Example 10/17/16 51
52 CondiOonal Probability Example 10/17/16 52
53 è Matrix NotaOon 10/17/16 53
54 Today : Probability Review Dr. Yanjun Qi / UVA CS 6316 / f16 The big picture Events and Event spaces Random variables Joint probability, MarginalizaOon, condiooning, chain rule, Bayes Rule, law of total probability, etc. Structural properoes Independence, condioonal independence 10/17/16 54
55 Independent RVs IntuiOon: X and Y are independent means that X = x that Y neither makes it more or less probable = y DefiniOon: X and Y are independent iff PX= x Y= y = PX= x PY= y ( ) ( ) ( ) 10/17/16 55
56 More on Independence ( = x = y) = ( = x) ( = y) PX Y PX PY ( = x = y) = ( = x) P( Y = y X = x) = P( Y = y) P X Y P X E.g. no marer how many heads you get, your friend will not be affected, and vice versa 10/17/16 56
57 More on Independence Dr. Yanjun Qi / UVA CS 6316 / f16 X is independent of Y means that knowing Y does not change our belief about X. P(X Y=y) = P(X) P(X=x, Y=y) = P(X=x) P(Y=y) The above should hold for all x i, y j It is symmetric and wriren as!x Y 10/17/16 57
58 CondiOonally Independent RVs IntuiOon: X and Y are condioonally independent given Z means that once Z is known, the value of X does not add any addiional informaoon about Y DefiniOon: X and Y are condioonally independent given Z iff ( = x = y = z) = ( = x = z) ( = y = z) P X Y Z P X Z P Y Z 10/17/16 If holding for all x i, y j, z k! X Y Z 58
59 CondiOonally Independent RVs! X Y Z!X Y 10/17/16 59
60 More on CondiOonal Independence ( = x = y = z) = ( = x = z) ( = y = z) P X Y Z P X Z P Y Z ( = x = y = z) = ( = x = z) P X Y,Z P X Z ( = y = x = z) = ( = y = z) P Y X,Z P Y Z 10/17/16 60
61 Today Recap : Probability Review The big picture : data <-> probabilisoc model Sample space, Events and Event spaces Random variables Joint probability, Marginal probability, condioonal probability, Chain rule, Bayes Rule, Law of total probability, etc. Independence, condioonal independence 10/17/16 61
62 References q Prof. Andrew Moore s review tutorial q Prof. Nando de Freitas s review slides q Prof. Carlos Guestrin recitaoon slides 10/17/16 62
Where are we? è Five major sec3ons of this course
UVA CS 4501-001 / 6501 007 Introduc8on to Machine Learning and Data Mining Lecture 12: Probability and Sta3s3cs Review Yanjun Qi / Jane University of Virginia Department of Computer Science 10/02/14 1
More informationUVA CS / Introduc8on to Machine Learning and Data Mining
UVA CS 4501-001 / 6501 007 Introduc8on to Machine Learning and Data Mining Lecture 13: Probability and Sta3s3cs Review (cont.) + Naïve Bayes Classifier Yanjun Qi / Jane, PhD University of Virginia Department
More informationSTAT 598L Probabilistic Graphical Models. Instructor: Sergey Kirshner. Probability Review
STAT 598L Probabilistic Graphical Models Instructor: Sergey Kirshner Probability Review Some slides are taken (or modified) from Carlos Guestrin s 10-708 Probabilistic Graphical Models Fall 2008 at CMU
More informationBasics on Probability. Jingrui He 09/11/2007
Basics on Probability Jingrui He 09/11/2007 Coin Flips You flip a coin Head with probability 0.5 You flip 100 coins How many heads would you expect Coin Flips cont. You flip a coin Head with probability
More informationCPSC340. Probability. Nando de Freitas September, 2012 University of British Columbia
CPSC340 Probability Nando de Freitas September, 2012 University of British Columbia Outline of the lecture This lecture is intended at revising probabilistic concepts that play an important role in the
More informationProbability Theory for Machine Learning. Chris Cremer September 2015
Probability Theory for Machine Learning Chris Cremer September 2015 Outline Motivation Probability Definitions and Rules Probability Distributions MLE for Gaussian Parameter Estimation MLE and Least Squares
More informationMachine Learning
Machine Learning 10-601 Tom M. Mitchell Machine Learning Department Carnegie Mellon University August 30, 2017 Today: Decision trees Overfitting The Big Picture Coming soon Probabilistic learning MLE,
More informationComputational Genomics
Computational Genomics http://www.cs.cmu.edu/~02710 Introduction to probability, statistics and algorithms (brief) intro to probability Basic notations Random variable - referring to an element / event
More informationUVA CS 4501: Machine Learning
UVA CS 4501: Machine Learning Lecture 21: Decision Tree / Random Forest / Ensemble Dr. Yanjun Qi University of Virginia Department of Computer Science Where are we? è Five major sections of this course
More informationData Mining Techniques. Lecture 3: Probability
Data Mining Techniques CS 6220 - Section 3 - Fall 2016 Lecture 3: Probability Jan-Willem van de Meent (credit: Zhao, CS 229, Bishop) Project Vote 1. Freeform: Develop your own project proposals 30% of
More informationCOMP 328: Machine Learning
COMP 328: Machine Learning Lecture 2: Naive Bayes Classifiers Nevin L. Zhang Department of Computer Science and Engineering The Hong Kong University of Science and Technology Spring 2010 Nevin L. Zhang
More informationMachine Learning Algorithm. Heejun Kim
Machine Learning Algorithm Heejun Kim June 12, 2018 Machine Learning Algorithms Machine Learning algorithm: a procedure in developing computer programs that improve their performance with experience. Types
More informationChapter 6: Probability The Study of Randomness
Chapter 6: Probability The Study of Randomness 6.1 The Idea of Probability 6.2 Probability Models 6.3 General Probability Rules 1 Simple Question: If tossing a coin, what is the probability of the coin
More informationCS 361: Probability & Statistics
February 12, 2018 CS 361: Probability & Statistics Random Variables Monty hall problem Recall the setup, there are 3 doors, behind two of them are indistinguishable goats, behind one is a car. You pick
More informationMachine Learning
Machine Learning 10-701 Tom M. Mitchell Machine Learning Department Carnegie Mellon University January 13, 2011 Today: The Big Picture Overfitting Review: probability Readings: Decision trees, overfiting
More informationLecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019
Lecture 10: Probability distributions DANIEL WELLER TUESDAY, FEBRUARY 19, 2019 Agenda What is probability? (again) Describing probabilities (distributions) Understanding probabilities (expectation) Partial
More informationLECTURE 1. 1 Introduction. 1.1 Sample spaces and events
LECTURE 1 1 Introduction The first part of our adventure is a highly selective review of probability theory, focusing especially on things that are most useful in statistics. 1.1 Sample spaces and events
More informationDept. of Linguistics, Indiana University Fall 2015
L645 Dept. of Linguistics, Indiana University Fall 2015 1 / 34 To start out the course, we need to know something about statistics and This is only an introduction; for a fuller understanding, you would
More informationRecitation 2: Probability
Recitation 2: Probability Colin White, Kenny Marino January 23, 2018 Outline Facts about sets Definitions and facts about probability Random Variables and Joint Distributions Characteristics of distributions
More informationIntroduction to Probability and Statistics (Continued)
Introduction to Probability and Statistics (Continued) Prof. icholas Zabaras Center for Informatics and Computational Science https://cics.nd.edu/ University of otre Dame otre Dame, Indiana, USA Email:
More informationLecture 1: Basics of Probability
Lecture 1: Basics of Probability (Luise-Vitetta, Chapter 8) Why probability in data science? Data acquisition is noisy Sampling/quantization external factors: If you record your voice saying machine learning
More informationDecision Trees. Nicholas Ruozzi University of Texas at Dallas. Based on the slides of Vibhav Gogate and David Sontag
Decision Trees Nicholas Ruozzi University of Texas at Dallas Based on the slides of Vibhav Gogate and David Sontag Supervised Learning Input: labelled training data i.e., data plus desired output Assumption:
More informationLecture 10: Bayes' Theorem, Expected Value and Variance Lecturer: Lale Özkahya
BBM 205 Discrete Mathematics Hacettepe University http://web.cs.hacettepe.edu.tr/ bbm205 Lecture 10: Bayes' Theorem, Expected Value and Variance Lecturer: Lale Özkahya Resources: Kenneth Rosen, Discrete
More informationProbability. Machine Learning and Pattern Recognition. Chris Williams. School of Informatics, University of Edinburgh. August 2014
Probability Machine Learning and Pattern Recognition Chris Williams School of Informatics, University of Edinburgh August 2014 (All of the slides in this course have been adapted from previous versions
More informationMachine Learning. Bayes Basics. Marc Toussaint U Stuttgart. Bayes, probabilities, Bayes theorem & examples
Machine Learning Bayes Basics Bayes, probabilities, Bayes theorem & examples Marc Toussaint U Stuttgart So far: Basic regression & classification methods: Features + Loss + Regularization & CV All kinds
More informationIntroduction to Stochastic Processes
Stat251/551 (Spring 2017) Stochastic Processes Lecture: 1 Introduction to Stochastic Processes Lecturer: Sahand Negahban Scribe: Sahand Negahban 1 Organization Issues We will use canvas as the course webpage.
More informationPAC-learning, VC Dimension and Margin-based Bounds
More details: General: http://www.learning-with-kernels.org/ Example of more complex bounds: http://www.research.ibm.com/people/t/tzhang/papers/jmlr02_cover.ps.gz PAC-learning, VC Dimension and Margin-based
More informationMachine Learning: Probability Theory
Machine Learning: Probability Theory Prof. Dr. Martin Riedmiller Albert-Ludwigs-University Freiburg AG Maschinelles Lernen Theories p.1/28 Probabilities probabilistic statements subsume different effects
More informationReview of probability
Review of probability Computer Sciences 760 Spring 2014 http://pages.cs.wisc.edu/~dpage/cs760/ Goals for the lecture you should understand the following concepts definition of probability random variables
More informationEcon 325: Introduction to Empirical Economics
Econ 325: Introduction to Empirical Economics Lecture 2 Probability Copyright 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 3-1 3.1 Definition Random Experiment a process leading to an uncertain
More informationReview of Probability. CS1538: Introduction to Simulations
Review of Probability CS1538: Introduction to Simulations Probability and Statistics in Simulation Why do we need probability and statistics in simulation? Needed to validate the simulation model Needed
More informationProbability Review Lecturer: Ji Liu Thank Jerry Zhu for sharing his slides
Probability Review Lecturer: Ji Liu Thank Jerry Zhu for sharing his slides slide 1 Inference with Bayes rule: Example In a bag there are two envelopes one has a red ball (worth $100) and a black ball one
More informationELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables
Department of Electrical Engineering University of Arkansas ELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables Dr. Jingxian Wu wuj@uark.edu OUTLINE 2 Random Variable Discrete Random
More informationSDS 321: Introduction to Probability and Statistics
SDS 321: Introduction to Probability and Statistics Lecture 13: Expectation and Variance and joint distributions Purnamrita Sarkar Department of Statistics and Data Science The University of Texas at Austin
More informationCOS402- Artificial Intelligence Fall Lecture 10: Bayesian Networks & Exact Inference
COS402- Artificial Intelligence Fall 2015 Lecture 10: Bayesian Networks & Exact Inference Outline Logical inference and probabilistic inference Independence and conditional independence Bayes Nets Semantics
More informationBrief Review of Probability
Brief Review of Probability Nuno Vasconcelos (Ken Kreutz-Delgado) ECE Department, UCSD Probability Probability theory is a mathematical language to deal with processes or experiments that are non-deterministic
More informationNaïve Bayes classification
Naïve Bayes classification 1 Probability theory Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. Examples: A person s height, the outcome of a coin toss
More information2/3/04. Syllabus. Probability Lecture #2. Grading. Probability Theory. Events and Event Spaces. Experiments and Sample Spaces
Probability Lecture #2 Introduction to Natural Language Processing CMPSCI 585, Spring 2004 University of Massachusetts Amherst Andrew McCallum Syllabus Probability and Information Theory Spam filtering
More informationSTAT Chapter 3: Probability
Basic Definitions STAT 515 --- Chapter 3: Probability Experiment: A process which leads to a single outcome (called a sample point) that cannot be predicted with certainty. Sample Space (of an experiment):
More informationGrundlagen der Künstlichen Intelligenz
Grundlagen der Künstlichen Intelligenz Uncertainty & Probabilities & Bandits Daniel Hennes 16.11.2017 (WS 2017/18) University Stuttgart - IPVS - Machine Learning & Robotics 1 Today Uncertainty Probability
More informationCS 4649/7649 Robot Intelligence: Planning
CS 4649/7649 Robot Intelligence: Planning Probability Primer Sungmoon Joo School of Interactive Computing College of Computing Georgia Institute of Technology S. Joo (sungmoon.joo@cc.gatech.edu) 1 *Slides
More informationNaïve Bayes. Jia-Bin Huang. Virginia Tech Spring 2019 ECE-5424G / CS-5824
Naïve Bayes Jia-Bin Huang ECE-5424G / CS-5824 Virginia Tech Spring 2019 Administrative HW 1 out today. Please start early! Office hours Chen: Wed 4pm-5pm Shih-Yang: Fri 3pm-4pm Location: Whittemore 266
More informationReview of Basic Probability
Review of Basic Probability Erik G. Learned-Miller Department of Computer Science University of Massachusetts, Amherst Amherst, MA 01003 September 16, 2009 Abstract This document reviews basic discrete
More informationIntroduction to Probability
Introduction to Probability Gambling at its core 16th century Cardano: Books on Games of Chance First systematic treatment of probability 17th century Chevalier de Mere posed a problem to his friend Pascal.
More informationCIVL 7012/8012. Basic Laws and Axioms of Probability
CIVL 7012/8012 Basic Laws and Axioms of Probability Why are we studying probability and statistics? How can we quantify risks of decisions based on samples from a population? How should samples be selected
More informationIntelligent Systems (AI-2)
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 19 Oct, 23, 2015 Slide Sources Raymond J. Mooney University of Texas at Austin D. Koller, Stanford CS - Probabilistic Graphical Models D. Page,
More informationConditional Probability
Conditional Probability Idea have performed a chance experiment but don t know the outcome (ω), but have some partial information (event A) about ω. Question: given this partial information what s the
More informationLecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable
Lecture Notes 1 Probability and Random Variables Probability Spaces Conditional Probability and Independence Random Variables Functions of a Random Variable Generation of a Random Variable Jointly Distributed
More informationLecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable
Lecture Notes 1 Probability and Random Variables Probability Spaces Conditional Probability and Independence Random Variables Functions of a Random Variable Generation of a Random Variable Jointly Distributed
More informationCS626 Data Analysis and Simulation
CS626 Data Analysis and Simulation Instructor: Peter Kemper R 104A, phone 221-3462, email:kemper@cs.wm.edu Today: Probability Primer Quick Reference: Sheldon Ross: Introduction to Probability Models 9th
More informationIntroduction to Machine Learning
Introduction to Machine Learning Introduction to Probabilistic Methods Varun Chandola Computer Science & Engineering State University of New York at Buffalo Buffalo, NY, USA chandola@buffalo.edu Chandola@UB
More informationDecision making and problem solving Lecture 1. Review of basic probability Monte Carlo simulation
Decision making and problem solving Lecture 1 Review of basic probability Monte Carlo simulation Why probabilities? Most decisions involve uncertainties Probability theory provides a rigorous framework
More informationMidterm Review CS 6375: Machine Learning. Vibhav Gogate The University of Texas at Dallas
Midterm Review CS 6375: Machine Learning Vibhav Gogate The University of Texas at Dallas Machine Learning Supervised Learning Unsupervised Learning Reinforcement Learning Parametric Y Continuous Non-parametric
More informationBe able to define the following terms and answer basic questions about them:
CS440/ECE448 Section Q Fall 2017 Final Review Be able to define the following terms and answer basic questions about them: Probability o Random variables, axioms of probability o Joint, marginal, conditional
More informationIntelligent Systems (AI-2)
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 11 Oct, 3, 2016 CPSC 422, Lecture 11 Slide 1 422 big picture: Where are we? Query Planning Deterministic Logics First Order Logics Ontologies
More informationProbability Theory. Prof. Dr. Martin Riedmiller AG Maschinelles Lernen Albert-Ludwigs-Universität Freiburg.
Probability Theory Prof. Dr. Martin Riedmiller AG Maschinelles Lernen Albert-Ludwigs-Universität Freiburg riedmiller@informatik.uni-freiburg.de Prof. Dr. Martin Riedmiller Machine Learning Lab, University
More informationIntroduction to Bayesian Learning
Course Information Introduction Introduction to Bayesian Learning Davide Bacciu Dipartimento di Informatica Università di Pisa bacciu@di.unipi.it Apprendimento Automatico: Fondamenti - A.A. 2016/2017 Outline
More informationNaïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability
Probability theory Naïve Bayes classification Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. s: A person s height, the outcome of a coin toss Distinguish
More informationIntroduction to probability
Introduction to probability 4.1 The Basics of Probability Probability The chance that a particular event will occur The probability value will be in the range 0 to 1 Experiment A process that produces
More informationProbability Review. Chao Lan
Probability Review Chao Lan Let s start with a single random variable Random Experiment A random experiment has three elements 1. sample space Ω: set of all possible outcomes e.g.,ω={1,2,3,4,5,6} 2. event
More informationUVA CS / Introduc8on to Machine Learning and Data Mining. Lecture 9: Classifica8on with Support Vector Machine (cont.
UVA CS 4501-001 / 6501 007 Introduc8on to Machine Learning and Data Mining Lecture 9: Classifica8on with Support Vector Machine (cont.) Yanjun Qi / Jane University of Virginia Department of Computer Science
More informationCS6220: DATA MINING TECHNIQUES
CS6220: DATA MINING TECHNIQUES Chapter 8&9: Classification: Part 3 Instructor: Yizhou Sun yzsun@ccs.neu.edu March 12, 2013 Midterm Report Grade Distribution 90-100 10 80-89 16 70-79 8 60-69 4
More informationLecture 9: Naive Bayes, SVM, Kernels. Saravanan Thirumuruganathan
Lecture 9: Naive Bayes, SVM, Kernels Instructor: Outline 1 Probability basics 2 Probabilistic Interpretation of Classification 3 Bayesian Classifiers, Naive Bayes 4 Support Vector Machines Probability
More informationCOS 424: Interacting with Data. Lecturer: Dave Blei Lecture #2 Scribe: Ellen Kim February 7, 2008
COS 424: Interacting with Data Lecturer: Dave Blei Lecture #2 Scribe: Ellen Kim February 7, 2008 1 Probability Review 1.1 Random Variables A random variable can be used to model any probabilistic outcome.
More informationProbability Theory. Introduction to Probability Theory. Principles of Counting Examples. Principles of Counting. Probability spaces.
Probability Theory To start out the course, we need to know something about statistics and probability Introduction to Probability Theory L645 Advanced NLP Autumn 2009 This is only an introduction; for
More informationLecture Notes 1 Basic Probability. Elements of Probability. Conditional probability. Sequential Calculation of Probability
Lecture Notes 1 Basic Probability Set Theory Elements of Probability Conditional probability Sequential Calculation of Probability Total Probability and Bayes Rule Independence Counting EE 178/278A: Basic
More informationDiscrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14
CS 70 Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14 Introduction One of the key properties of coin flips is independence: if you flip a fair coin ten times and get ten
More informationRandomized Algorithms
Randomized Algorithms Prof. Tapio Elomaa tapio.elomaa@tut.fi Course Basics A new 4 credit unit course Part of Theoretical Computer Science courses at the Department of Mathematics There will be 4 hours
More informationMidterm Review CS 7301: Advanced Machine Learning. Vibhav Gogate The University of Texas at Dallas
Midterm Review CS 7301: Advanced Machine Learning Vibhav Gogate The University of Texas at Dallas Supervised Learning Issues in supervised learning What makes learning hard Point Estimation: MLE vs Bayesian
More informationAnnouncements. CS 188: Artificial Intelligence Spring Probability recap. Outline. Bayes Nets: Big Picture. Graphical Model Notation
CS 188: Artificial Intelligence Spring 2010 Lecture 15: Bayes Nets II Independence 3/9/2010 Pieter Abbeel UC Berkeley Many slides over the course adapted from Dan Klein, Stuart Russell, Andrew Moore Current
More informationBasic Probability and Information Theory: quick revision
Basic Probability and Information Theory: quick revision ML for NLP Lecturer: S Luz http://www.scss.tcd.ie/~luzs/t/cs4ll4/ February 17, 2015 In these notes we review the basics of probability theory and
More informationReview of Probability Mark Craven and David Page Computer Sciences 760.
Review of Probability Mark Craven and David Page Computer Sciences 760 www.biostat.wisc.edu/~craven/cs760/ Goals for the lecture you should understand the following concepts definition of probability random
More informationDiscrete Random Variable
Discrete Random Variable Outcome of a random experiment need not to be a number. We are generally interested in some measurement or numerical attribute of the outcome, rather than the outcome itself. n
More informationGenerative Learning. INFO-4604, Applied Machine Learning University of Colorado Boulder. November 29, 2018 Prof. Michael Paul
Generative Learning INFO-4604, Applied Machine Learning University of Colorado Boulder November 29, 2018 Prof. Michael Paul Generative vs Discriminative The classification algorithms we have seen so far
More informationSTAT:5100 (22S:193) Statistical Inference I
STAT:5100 (22S:193) Statistical Inference I Week 3 Luke Tierney University of Iowa Fall 2015 Luke Tierney (U Iowa) STAT:5100 (22S:193) Statistical Inference I Fall 2015 1 Recap Matching problem Generalized
More informationLecture Lecture 5
Lecture 4 --- Lecture 5 A. Basic Concepts (4.1-4.2) 1. Experiment: A process of observing a phenomenon that has variation in its outcome. Examples: (E1). Rolling a die, (E2). Drawing a card form a shuffled
More informationCourse Introduction. Probabilistic Modelling and Reasoning. Relationships between courses. Dealing with Uncertainty. Chris Williams.
Course Introduction Probabilistic Modelling and Reasoning Chris Williams School of Informatics, University of Edinburgh September 2008 Welcome Administration Handout Books Assignments Tutorials Course
More informationSTAT2201. Analysis of Engineering & Scientific Data. Unit 3
STAT2201 Analysis of Engineering & Scientific Data Unit 3 Slava Vaisman The University of Queensland School of Mathematics and Physics What we learned in Unit 2 (1) We defined a sample space of a random
More informationReview of Probabilities and Basic Statistics
Alex Smola Barnabas Poczos TA: Ina Fiterau 4 th year PhD student MLD Review of Probabilities and Basic Statistics 10-701 Recitations 1/25/2013 Recitation 1: Statistics Intro 1 Overview Introduction to
More informationConsider an experiment that may have different outcomes. We are interested to know what is the probability of a particular set of outcomes.
CMSC 310 Artificial Intelligence Probabilistic Reasoning and Bayesian Belief Networks Probabilities, Random Variables, Probability Distribution, Conditional Probability, Joint Distributions, Bayes Theorem
More informationFundamentals. CS 281A: Statistical Learning Theory. Yangqing Jia. August, Based on tutorial slides by Lester Mackey and Ariel Kleiner
Fundamentals CS 281A: Statistical Learning Theory Yangqing Jia Based on tutorial slides by Lester Mackey and Ariel Kleiner August, 2011 Outline 1 Probability 2 Statistics 3 Linear Algebra 4 Optimization
More informationMachine Learning
Machine Learning 10-601 Tom M. Mitchell Machine Learning Department Carnegie Mellon University September 22, 2011 Today: MLE and MAP Bayes Classifiers Naïve Bayes Readings: Mitchell: Naïve Bayes and Logistic
More informationI - Probability. What is Probability? the chance of an event occuring. 1classical probability. 2empirical probability. 3subjective probability
What is Probability? the chance of an event occuring eg 1classical probability 2empirical probability 3subjective probability Section 2 - Probability (1) Probability - Terminology random (probability)
More informationBayes Classifiers. CAP5610 Machine Learning Instructor: Guo-Jun QI
Bayes Classifiers CAP5610 Machine Learning Instructor: Guo-Jun QI Recap: Joint distributions Joint distribution over Input vector X = (X 1, X 2 ) X 1 =B or B (drinking beer or not) X 2 = H or H (headache
More informationProbability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008
Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008 1 Review We saw some basic metrics that helped us characterize
More informationCIVL Why are we studying probability and statistics? Learning Objectives. Basic Laws and Axioms of Probability
CIVL 3103 Basic Laws and Axioms of Probability Why are we studying probability and statistics? How can we quantify risks of decisions based on samples from a population? How should samples be selected
More informationProbability & statistics for linguists Class 2: more probability. D. Lassiter (h/t: R. Levy)
Probability & statistics for linguists Class 2: more probability D. Lassiter (h/t: R. Levy) conditional probability P (A B) = when in doubt about meaning: draw pictures. P (A \ B) P (B) keep B- consistent
More informationBe able to define the following terms and answer basic questions about them:
CS440/ECE448 Fall 2016 Final Review Be able to define the following terms and answer basic questions about them: Probability o Random variables o Axioms of probability o Joint, marginal, conditional probability
More informationLecture 2: Repetition of probability theory and statistics
Algorithms for Uncertainty Quantification SS8, IN2345 Tobias Neckel Scientific Computing in Computer Science TUM Lecture 2: Repetition of probability theory and statistics Concept of Building Block: Prerequisites:
More informationIntroduc)on to Ar)ficial Intelligence
Introduc)on to Ar)ficial Intelligence Lecture 10 Probability CS/CNS/EE 154 Andreas Krause Announcements! Milestone due Nov 3. Please submit code to TAs! Grading: PacMan! Compiles?! Correct? (Will clear
More informationLecture 1: Probability Fundamentals
Lecture 1: Probability Fundamentals IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge January 22nd, 2008 Rasmussen (CUED) Lecture 1: Probability
More informationV7 Foundations of Probability Theory
V7 Foundations of Probability Theory Probability : degree of confidence that an event of an uncertain nature will occur. Events : we will assume that there is an agreed upon space of possible outcomes
More informationIntelligent Systems (AI-2)
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 19 Oct, 24, 2016 Slide Sources Raymond J. Mooney University of Texas at Austin D. Koller, Stanford CS - Probabilistic Graphical Models D. Page,
More informationTerminology. Experiment = Prior = Posterior =
Review: probability RVs, events, sample space! Measures, distributions disjoint union property (law of total probability book calls this sum rule ) Sample v. population Law of large numbers Marginals,
More informationBayesian Networks Representation
Bayesian Networks Representation Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University March 19 th, 2007 Handwriting recognition Character recognition, e.g., kernel SVMs a c z rr r r
More informationEE 178 Lecture Notes 0 Course Introduction. About EE178. About Probability. Course Goals. Course Topics. Lecture Notes EE 178
EE 178 Lecture Notes 0 Course Introduction About EE178 About Probability Course Goals Course Topics Lecture Notes EE 178: Course Introduction Page 0 1 EE 178 EE 178 provides an introduction to probabilistic
More informationProbability Lecture #7
Probability Lecture #7 Introduction to Natural Language Processing CMPSCI 585, Fall 2007 University of Massachusetts Amherst Andrew McCallum Today s Main Points Remember (or learn) about probability theory
More informationCS 630 Basic Probability and Information Theory. Tim Campbell
CS 630 Basic Probability and Information Theory Tim Campbell 21 January 2003 Probability Theory Probability Theory is the study of how best to predict outcomes of events. An experiment (or trial or event)
More informationMachine Learning
Machine Learning 10-601 Tom M. Mitchell Machine Learning Department Carnegie Mellon University January 14, 2015 Today: The Big Picture Overfitting Review: probability Readings: Decision trees, overfiting
More informationUCSD CSE 21, Spring 2014 [Section B00] Mathematics for Algorithm and System Analysis
UCSD CSE 21, Spring 2014 [Section B00] Mathematics for Algorithm and System Analysis Lecture 8 Class URL: http://vlsicad.ucsd.edu/courses/cse21-s14/ Lecture 8 Notes Goals for Today Counting Partitions
More information