Course 395: Machine Learning - Lectures

Size: px

Start display at page:

Download "Course 395: Machine Learning - Lectures"

Aron Rose
5 years ago
Views:

1 Course 395: Machne Learnng - Lectures Lecture 1-2: Concept Learnng (M. Pantc Lecture 3-4: Decson Trees & CC Intro (M. Pantc Lecture 5-6: Artfcal Neural Networks (S.Zaferou Lecture 7-8: Instance ased Learnng (M. Pantc Lecture 9-10: Genetc Algorthms (M. Pantc Lecture 11-12: Evaluatng Hypotheses (THs Lecture 13-14: ayesan Learnng-ML Estmaton (S. Zaferou Lecture 15-16: Expectaton Maxmaton (S. Zaferou Lecture 17-18: Inductve Logc Programmng (S. Muggleton

2 ayesan Learnng Expectaton Maxmaton Readng: Sldes

3 ML estmaton Consder a set and a model f D {( x, y1,...,( x n, y 1 n } Maxmum lkelhood estmaton s gven * by: f arg max f p( D f

4 ML estmaton Assumng that the samples are condtonal ndependent gven f f * arg max f n 1 p( y f Further assumng y f ( x + e y f ~ N( f ( x, σ f * arg max f n πσ e 1 2σ 2 ( y f ( x 2

5 ML estmaton

6 ML estmaton Choosng to maxme ts logarthm we get f * n 1 1 arg max f (ln πσ 2πσ ( y f ( x removng the constant terms we get 2 * f argmn ( f ( x y f n 1 2

7 ML: smple example Consder a con flppng experment. Par of cons A and of unknown bases θα and A lands on tal wth P 1 θα We want to estmate: Α θ ( θ, θ θ A

8 ML: smple example A Randomly choose one (wth equal probablty. Perform 10 ndependent tosses (50 con tosses n total.

9 ML: smple example # (number heads observed durng the -th set of tosses x x1, x2,..., x ( 5 x {0,1,...,10} dentty of the con 1, 2,..., ( 5 { A, }

10 ML: smple example : # of heads usng con A HA FA : total # of flps usng con A θ Α H F A A θ H F Maxmum Lkelhood estmaton maxmes log P(, x θ

11 ML: smple example P ( D θ θ H A (1 TA H Α θα θ (1 θ T log P( D θ logθa + Alog(1 θα HA T + H logθ + T log(1 θ log P θα 0 θ Α Η ΗΑ + Τ Α Α

12 ML: smple example

13 Expectaton Maxmaton Consder a more challengng setup: We are gven x ( x1, x2,..., x5 latent (or hdden varables. but not Computng proportons of heads for each con s no longer possble

14 Expectaton Maxmaton t ( t Start wth some ntal parameters θ ( θ A and determne for each of the fve sets whether con A or con was more lkely to have generated the observed flp. Assumng ths data completon s correct apply regular maxmum lkelhood to get Do t untl convergence. ˆ ( ˆ, ˆ ( t θ θˆ ( t+ 1

15 Expectaton Maxmaton Compute probabltes of all possble completons gven θˆ ( t (not just the most probable one These probabltes are used to create a weghted tranng set consstng of all completons. A modfed ML that deals wth weghted tranng data s appled n order to get the new estmate θˆ ( t+ 1

16 Expectaton Maxmaton y usng weghted tranng examples rather than choosng the sngle best completon the EM algorthm accounts for the confdence of the model n each completon of the data

17 Expectaton Maxmaton P( D, θ P( P( D, θ A A P( D A P( D, θ 0.6 H A 0.4 T A A P( A 0.5

18 Expectaton Maxmaton P( D, θ P( D, θ P( D P( P( D, θ 0.5 H 0.5 T P( 0.5

19 0.45, (, (, ( + θ θ θ A A A D P D P D P w 0.55, (, (, ( + θ θ θ A D P D P D P w Expectaton Maxmaton

20 Expectaton Maxmaton EM Algorthm: E-step: Guessng a probablty dstrbuton over completons of mssng data gven the current model M-step: Re-estmate the parameters model gven these completons.

21 Expectaton Maxmaton

22 EM: Mathematcs Startng from an ntal parameters the E-step constructs a functon that lower-bounds. In the M-step, s computed as the maxmum of. In the next E-step, a new lower-bound s constructed; maxmaton of gves and so on

23 EM: Mathematcs The EM algorthm derves from the fact that for all pd (1 where the nequalty s tght when. Jensen s nequalty for all concave functon (e.g., The above holds by lettng

24 EM: Mathematcs Now, consderng the update rule, Where Applyng the tghtness condtons (1. Moreover, and snce (. guarantees that s a lower bound of. Update monotonc mprovement of ML for ncomplete data

25 Course 395: Machne Learnng - Lectures Lecture 1-2: Concept Learnng (M. Pantc Lecture 3-4: Decson Trees & CC Intro (M. Pantc Lecture 5-6: Artfcal Neural Networks (S.Zaferou Lecture 7-8: Instance ased Learnng (M. Pantc Lecture 9-10: Genetc Algorthms (M. Pantc Lecture 11-12: Evaluatng Hypotheses (THs Lecture 13-14: ayesan Learnng ML Estmaton (S. Zaferou Lecture 15-16: Expectaton Maxmaton (S. Zaferou Lecture 17-18: Inductve Logc Programmng (S. Muggleton

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2) 1/16 MATH 829: Introducton to Data Mnng and Analyss The EM algorthm (part 2) Domnque Gullot Departments of Mathematcal Scences Unversty of Delaware Aprl 20, 2016 Recall 2/16 We are gven ndependent observatons