Data Mining. Practical Machine Learning Tools and Techniques. Slides for Chapter 4 of Data Mining by I. H. Witten, E. Frank and M. A.

1 Data Mining: Practical Machine Learning Tools and Techniques. Slides for Chapter 4 of Data Mining by I. H. Witten, E. Frank and M. A. Hall

2 Statistical modeling
- "Opposite" of 1R: use all the attributes
- Two assumptions: attributes are
  - equally important
  - statistically independent (given the class value), i.e., knowing the value of one attribute says nothing about the value of another (if the class is known)
- Independence assumption is never correct! But this scheme works well in practice

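3 Probabilities for weather data

Counts and relative frequencies of each attribute value, per class:

| Attribute | Value | Count (yes) | Count (no) | Pr[value \| yes] | Pr[value \| no] |
|---|---|---|---|---|---|
| Outlook | Sunny | 2 | 3 | 2/9 | 3/5 |
| Outlook | Overcast | 4 | 0 | 4/9 | 0/5 |
| Outlook | Rainy | 3 | 2 | 3/9 | 2/5 |
| Temperature | Hot | 2 | 2 | 2/9 | 2/5 |
| Temperature | Mild | 4 | 2 | 4/9 | 2/5 |
| Temperature | Cool | 3 | 1 | 3/9 | 1/5 |
| Humidity | High | 3 | 4 | 3/9 | 4/5 |
| Humidity | Normal | 6 | 1 | 6/9 | 1/5 |
| Windy | False | 6 | 2 | 6/9 | 2/5 |
| Windy | True | 3 | 3 | 3/9 | 3/5 |
| Play | (class total) | 9 | 5 | 9/14 | 5/14 |

Training instances:

| Outlook | Temp | Humidity | Windy | Play |
|---|---|---|---|---|
| Sunny | Hot | High | False | No |
| Sunny | Hot | High | True | No |
| Overcast | Hot | High | False | Yes |
| Rainy | Mild | High | False | Yes |
| Rainy | Cool | Normal | False | Yes |
| Rainy | Cool | Normal | True | No |
| Overcast | Cool | Normal | True | Yes |
| Sunny | Mild | High | False | No |
| Sunny | Cool | Normal | False | Yes |
| Rainy | Mild | Normal | False | Yes |
| Sunny | Mild | Normal | True | Yes |
| Overcast | Mild | High | True | Yes |
| Overcast | Hot | Normal | False | Yes |
| Rainy | Mild | High | True | No |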

4 Probabilities for weather data

A new day:

| Outlook | Temp. | Humidity | Windy | Play |
|---|---|---|---|---|
| Sunny | Cool | High | True | ? |

Likelihood of the two classes:
- For yes = 2/9 × 3/9 × 3/9 × 3/9 × 9/14 = 0.0053
- For no = 3/5 × 1/5 × 4/5 × 3/5 × 5/14 = 0.0206

Conversion into a probability by normalization:
- P(yes) = 0.0053 / (0.0053 + 0.0206) = 0.205
- P(no) = 0.0206 / (0.0053 + 0.0206) = 0.795
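The likelihood and normalization steps for the new day can be sketched in a few lines of Python. This is a minimal illustration, not code from the slides: the names `cond`, `prior`, `likelihoods` and `normalize` are made up for the example, and the fractions are the relative frequencies of the nominal weather data.

```python
from fractions import Fraction as F

# Pr[attribute=value | class], taken from the weather-data frequency table.
# (Illustrative structure; only the values needed for the new day are listed.)
cond = {
    "yes": {"outlook=sunny": F(2, 9), "temperature=cool": F(3, 9),
            "humidity=high": F(3, 9), "windy=true": F(3, 9)},
    "no":  {"outlook=sunny": F(3, 5), "temperature=cool": F(1, 5),
            "humidity=high": F(4, 5), "windy=true": F(3, 5)},
}
prior = {"yes": F(9, 14), "no": F(5, 14)}


def likelihoods(evidence):
    """Multiply the class prior by one conditional probability per attribute."""
    scores = {}
    for cls, score in prior.items():
        for item in evidence:
            score *= cond[cls][item]
        scores[cls] = score
    return scores


def normalize(scores):
    """Rescale the class likelihoods so that they sum to 1."""
    total = sum(scores.values())
    return {cls: s / total for cls, s in scores.items()}


new_day = ["outlook=sunny", "temperature=cool", "humidity=high", "windy=true"]
scores = likelihoods(new_day)
posterior = normalize(scores)
for cls in ("yes", "no"):
    print(cls, round(float(scores[cls]), 4), round(float(posterior[cls]), 3))
```

Exact fractions are used here only to keep rounding out of the intermediate steps; floats would serve equally well for a table this small.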

5 Bayes's rule
- Probability of event H given evidence E:

  $$\Pr[H \mid E] = \frac{\Pr[E \mid H]\,\Pr[H]}{\Pr[E]}$$

- A priori probability of H, $\Pr[H]$: probability of the event before evidence is seen
- A posteriori probability of H, $\Pr[H \mid E]$: probability of the event after evidence is seen
- Thomas Bayes. Born: 1702 in London, England. Died: 1761 in Tunbridge Wells, Kent, England

6 Naïve Bayes for classification
- Classification learning: what's the probability of the class given an instance?
  - Evidence E = instance
  - Event H = class value for instance
- Naïve assumption: evidence splits into parts (i.e. attributes) that are independent given the class:

  $$\Pr[H \mid E] = \frac{\Pr[E_1 \mid H]\,\Pr[E_2 \mid H] \cdots \Pr[E_n \mid H]\,\Pr[H]}{\Pr[E]}$$

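7 Weather data example

Evidence E:

| Outlook | Temp. | Humidity | Windy | Play |
|---|---|---|---|---|
| Sunny | Cool | High | True | ? |

Probability of class yes:

$$\Pr[\text{yes} \mid E] = \frac{\Pr[\text{Outlook}=\text{Sunny} \mid \text{yes}] \times \Pr[\text{Temperature}=\text{Cool} \mid \text{yes}] \times \Pr[\text{Humidity}=\text{High} \mid \text{yes}] \times \Pr[\text{Windy}=\text{True} \mid \text{yes}] \times \Pr[\text{yes}]}{\Pr[E]} = \frac{\frac{2}{9} \times \frac{3}{9} \times \frac{3}{9} \times \frac{3}{9} \times \frac{9}{14}}{\Pr[E]}$$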

8 The zero-frequency problem
- What if an attribute value doesn't occur with every class value? (e.g. suppose Humidity = High never occurred for class yes)
  - Probability will be zero: $\Pr[\text{Humidity}=\text{High} \mid \text{yes}] = 0$
  - A posteriori probability will also be zero: $\Pr[\text{yes} \mid E] = 0$ (no matter how likely the other values are!)
- Remedy: add 1 to the count for every attribute value-class combination (Laplace estimator)
- Result: probabilities will never be zero! (also: stabilizes probability estimates)
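As a rough sketch of the Laplace estimator, the snippet below adds 1 to every value-class count for one attribute. The function name `laplace_estimate` and the parameter `k` are illustrative choices, not taken from the slides; the counts are those of Outlook for class no in the weather data (Sunny 3, Overcast 0, Rainy 2), where Overcast would otherwise get probability 0/5.

```python
def laplace_estimate(count, class_total, num_values, k=1):
    """Estimate Pr[value | class] after adding k to each value-class count.

    The denominator grows by k * num_values so the estimates still sum to 1.
    """
    return (count + k) / (class_total + k * num_values)


# Outlook counts for class "no" in the weather data.
counts_no = {"sunny": 3, "overcast": 0, "rainy": 2}
class_total = sum(counts_no.values())

smoothed = {
    value: laplace_estimate(count, class_total, len(counts_no))
    for value, count in counts_no.items()
}
print(smoothed)
```

With k = 1 the three estimates become 4/8, 1/8 and 3/8, so the unseen combination no longer forces the whole product to zero.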

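9 Missing values
- Training: instance is not included in the frequency count for the attribute value-class combination
- Classification: attribute will be omitted from the calculation
- Example:

| Outlook | Temp. | Humidity | Windy | Play |
|---|---|---|---|---|
| ? | Cool | High | True | ? |

  - Likelihood of yes = 3/9 × 3/9 × 3/9 × 9/14 = 0.0238
  - Likelihood of no = 1/5 × 4/5 × 3/5 × 5/14 = 0.0343
  - P(yes) = 0.0238 / (0.0238 + 0.0343) = 41%
  - P(no) = 0.0343 / (0.0238 + 0.0343) = 59%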
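Omitting a missing attribute at classification time amounts to skipping its factor in the product. The sketch below assumes a missing value is represented as `None`; that convention, and the names `prior`, `cond` and `posterior`, are illustrative only.

```python
# Class priors and Pr[attribute=value | class] from the weather-data table.
prior = {"yes": 9 / 14, "no": 5 / 14}
cond = {
    "yes": {("outlook", "sunny"): 2 / 9, ("temperature", "cool"): 3 / 9,
            ("humidity", "high"): 3 / 9, ("windy", "true"): 3 / 9},
    "no":  {("outlook", "sunny"): 3 / 5, ("temperature", "cool"): 1 / 5,
            ("humidity", "high"): 4 / 5, ("windy", "true"): 3 / 5},
}


def posterior(instance):
    """Return normalized class probabilities, skipping attributes set to None."""
    scores = {}
    for cls, score in prior.items():
        for attribute, value in instance.items():
            if value is None:
                continue  # missing value: leave this factor out of the product
            score *= cond[cls][(attribute, value)]
        scores[cls] = score
    total = sum(scores.values())
    return {cls: s / total for cls, s in scores.items()}


day = {"outlook": None, "temperature": "cool", "humidity": "high", "windy": "true"}
result = posterior(day)
print({cls: f"{p:.0%}" for cls, p in result.items()})
```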

10 Numeric attributes
- Usual assumption: attributes have a normal or Gaussian probability distribution (given the class)
- The probability density function for the normal distribution is defined by two parameters:
  - Sample mean µ:

    $$\mu = \frac{1}{n} \sum_{i=1}^{n} x_i$$

  - Standard deviation σ:

    $$\sigma = \sqrt{\frac{1}{n-1} \sum_{i=1}^{n} (x_i - \mu)^2}$$

- Then the density function f(x) is

  $$f(x) = \frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-\frac{(x-\mu)^2}{2\sigma^2}}$$
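The two parameter estimates can be sketched as follows. The helper names are illustrative, and the nine temperature readings for class yes are an assumption: they are the values of the commonly distributed numeric weather dataset, which the slides show only in part.

```python
import math


def sample_mean(xs):
    """Arithmetic mean of the observed values."""
    return sum(xs) / len(xs)


def sample_std(xs):
    """Standard deviation with the n - 1 denominator."""
    mu = sample_mean(xs)
    return math.sqrt(sum((x - mu) ** 2 for x in xs) / (len(xs) - 1))


# Assumed temperature readings for the nine "yes" days of the numeric weather data.
temps_yes = [83, 70, 68, 64, 69, 75, 75, 72, 81]
mu = sample_mean(temps_yes)
sigma = sample_std(temps_yes)
print(round(mu, 1), round(sigma, 1))
```

Under that assumption the estimates come out at roughly µ = 73 and σ = 6.2.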

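11 Statistics for weather data

Nominal attributes and class counts:

| Attribute | Value | Count (yes) | Count (no) | Pr[value \| yes] | Pr[value \| no] |
|---|---|---|---|---|---|
| Outlook | Sunny | 2 | 3 | 2/9 | 3/5 |
| Outlook | Overcast | 4 | 0 | 4/9 | 0/5 |
| Outlook | Rainy | 3 | 2 | 3/9 | 2/5 |
| Windy | False | 6 | 2 | 6/9 | 2/5 |
| Windy | True | 3 | 3 | 3/9 | 3/5 |
| Play | (class total) | 9 | 5 | 9/14 | 5/14 |

Numeric attributes:

| Attribute | Class | Values | µ | σ |
|---|---|---|---|---|
| Temperature | yes | 64, 68, 69, 70, 72, … | 73 | 6.2 |
| Temperature | no | 65, 71, 72, 80, 85, … | 75 | 7.9 |
| Humidity | yes | 65, 70, 70, 75, 80, … | 79 | 10.2 |
| Humidity | no | 70, 85, 90, 91, 95, … | 86 | 9.7 |

Example density value:

$$f(\text{temperature}=66 \mid \text{yes}) = \frac{1}{\sqrt{2\pi} \cdot 6.2}\, e^{-\frac{(66-73)^2}{2 \cdot 6.2^2}} = 0.0340$$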
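The example density value can be checked numerically. This is a small sketch using only the standard library; `normal_density` is an illustrative name, and µ = 73 and σ = 6.2 are the rounded temperature statistics for class yes.

```python
import math


def normal_density(x, mu, sigma):
    """Gaussian probability density with mean mu and standard deviation sigma."""
    exponent = -((x - mu) ** 2) / (2 * sigma ** 2)
    return math.exp(exponent) / (math.sqrt(2 * math.pi) * sigma)


# Density of temperature = 66 for class "yes" (mu = 73, sigma = 6.2).
f_temp_yes = normal_density(66, mu=73, sigma=6.2)
print(f"{f_temp_yes:.4f}")
```

Note that this is a density, not a probability: it replaces the relative-frequency factor in the likelihood product, and any constant scaling cancels out in the normalization step.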

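12 Classifying a new day

A new day:

| Outlook | Temp. | Humidity | Windy | Play |
|---|---|---|---|---|
| Sunny | 66 | 90 | true | ? |

- Likelihood of yes = 2/9 × f(temperature=66 | yes) × f(humidity=90 | yes) × 3/9 × 9/14
- Likelihood of no = 3/5 × f(temperature=66 | no) × f(humidity=90 | no) × 3/5 × 5/14
- P(yes) = likelihood of yes / (likelihood of yes + likelihood of no) = 25%
- P(no) = likelihood of no / (likelihood of yes + likelihood of no) = 75%
- Missing values during training are not included in the calculation of mean and standard deviation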
