Crowdsourcing Mul/- Label Classifica/on. Jonathan Bragg University of Washington

Size: px

Start display at page:

Download "Crowdsourcing Mul/- Label Classifica/on. Jonathan Bragg University of Washington"

Horace McKinney
5 years ago
Views:

1 Crowdsourcing Mul/- Label Classifica/on Jonathan Bragg University of Washington

2 Collaborators Dan Weld University of Washington Mausam University of Washington à IIT Delhi

3 Overview What is mul?- label classifica?on? Simple threshold approaches Probabilis?c approaches Choosing which ques?ons to ask

4 Overview What is mul?- label classifica?on? Simple threshold approaches Probabilis?c approaches Choosing which ques?ons to ask

5 Standard classifica?on Data Classes A B C Dai et al. 2013; Kamar, Hacker, and Horvitz 2012; Raykar et al. 2010; Sheng, Provost, and Ipeiro?s 2008; Wauthier and Jordan 2011; Welinder et al. 2010; Whitehill et al. 2009

6 Mul?- label classifica?on Data Classes label label label label label label label label label label

7 Social tagging

8 Overview What is mul?- label classifica?on? Simple threshold approaches Probabilis?c approaches Choosing which ques?ons to ask

9 Overview What is mul?- label classifica?on? Simple threshold approaches Lossless stopping One- away heuris?c Probabilis?c approaches Choosing which ques?ons to ask

10 Sample problem location building person car artist architect animal tiger fish athlete

11 A naïve approach tiger? animal? True True False False person? True False

12 A naïve approach threshold T animal? True False k votes 3? Yes No accept reject Majority vo?ng: T=k/2

13 Lossless stopping k=5 T=3 animal? True False T true? accept

14 Lossless stopping k=5 T=3 animal? True False T true? k- T+1 false? accept reject Ask two people to vote and only ask a third if the first two disagree - TurKit [Lidle et al. 2009]

15 One- away heuris?c k=5 T=3 animal? True False T- 1 true 0 false 0 true k- T false accept reject

16 An applica?on: taxonomies

17 Cascade [Chilton et al. 2013] the crowd Generate Labels Assign Labels to Data Infer Global Taxonomy mul?- label classifica?on naïve cost: data x labels x votes

18 An experiment 100 En??es Brad Pi. Kenny G Washington DC Martha Stewart Whidbey Island The Boston Globe Honda Accord Shanghai 33 Labels person actor director vehicle architect car city location island country Fine- grained en?ty tags [Ling and Weld 2012]

19 Labor reduc?on One- away saves 58% Lossless saves 56%

20 Summary Threshold approaches Lossless stopping (no error) One- away heuris?c (lidle error) Reduc?ons in labor over 50%

21 Overview What is mul?- label classifica?on? Simple threshold approaches Lossless stopping One- away heuris?c Probabilis?c approaches Choosing which ques?ons to ask

22 Overview What is mul?- label classifica?on? Simple threshold approaches Lossless stopping One- away heuris?c Probabilis?c approaches Independent Mul?- label naïve Bayes (MLNB) Choosing which ques?ons to ask

23 A simple probabilis?c model Independent animal? P(animal) True False F T T F T P(animal = True) = 0.04 P(animal = True animal? F ) = P(animal = True animal? F T ) =

24 Are labels independent? animal? person? tiger? P(animal = True) = 0.04 P(animal = True person = True) = P(animal = True?ger = True) =

25 Modeling label co- occurrence P(person) P(animal) P(tiger)

26 Modeling label co- occurrence Mul/- label naïve Bayes (MLNB) P(animal) P(person animal) P(tiger animal) labels trees P(animal = True person? T ) P(animal = True)

27 Model comparison Independent MLNB Inference speed (per item) O( labels ) O( labels 2 ) # of parameters O( labels ) O( labels 2 )

28 Overview What is mul?- label classifica?on? Simple threshold approaches Lossless stopping One- away heuris?c Probabilis?c approaches Independent Mul?- label naïve Bayes Choosing which ques?ons to ask

29 Overview What is mul?- label classifica?on? Simple threshold approaches Lossless stopping One- away heuris?c Probabilis?c approaches Independent Mul?- label naïve Bayes Choosing which ques?ons to ask

30 Choosing which ques?ons to ask Decision policy: How do we choose the next label? Compute heuris?c Select best label Observe vote

31 Choosing which ques?ons to ask Round- robin policy (e.g., Cascade) Votes animal person tiger

32 A probabilis?c approach Greedy policy Most uncertain heuris?c Votes animal person tiger P = 0.04 P = 0.09 P = 0.02

33 A probabilis?c approach Greedy policy Most uncertain heuris?c Votes False animal person tiger P = 0.04 P = 0.09 P = 0.02

34 A probabilis?c approach Greedy policy Most uncertain heuris?c Votes False animal person tiger P = 0.04 P = 0.01 P = 0.02

35 A probabilis?c approach Greedy policy Most uncertain heuris?c Votes False animal person tiger P = 0.05 P = 0.01 P = 0.03

36 A probabilis?c approach Greedy policy Most uncertain heuris?c Votes True False animal person tiger P = 0.05 P = 0.01 P = 0.03

37 A probabilis?c approach Greedy policy Most uncertain heuris?c Votes True False animal person tiger P = 0.7 P = 0.01 P = 0.03

38 A probabilis?c approach Greedy policy Most uncertain heuris?c Votes True False animal person tiger P = 0.7 P = P = 0.1

39 A probabilis?c approach Votes Greedy policy Most uncertain heuris?c Informa/on gain heuris?c True False Performance guarantees animal person tiger P = 0.7 P = P = 0.1

40 Informa?on gain heuris?c Entropy = uncertainty of labels 1 High 1 Low

41 Informa?on gain heuris?c Informa/on gain = expected reduc?on in entropy (uncertainty) tiger? Higher (beder)

42 Informa?on gain heuris?c Informa/on gain = expected reduc?on in entropy (uncertainty) In our models, these computa?ons are local and inexpensive Guaranteed (1-1/e) 63% of op?mal Sensor selec?on [Krause and Guestrin 2005]

43 Choosing which ques?ons to ask Greedy policy Informa?on gain! Compute heuris?c Select best label Observe vote Compute posterior beliefs

44 Probabilis?c results Mean F-score! MLNB" Independent" Cascade (Threshold)" Number of votes per item!

45 Probabilis?c results Mean F-score! MLNB" Independent" Cascade (Threshold)" Number of votes per item!

46 Probabilis?c results Mean F-score! Over 90% reduc?on in labor MLNB" Independent" Cascade (Threshold)" Number of votes per item!

47 Batching

48 A simple approxima?on Single- label 1. Rank labels by informa?on gain 2. Select best label 3. Observe vote Batched k- best 1. Rank labels by informa?on gain 2. Select top k labels 3. Observe k votes

49 Single- label Batching results Batches of size k=7

50 Summary Threshold approaches Lossless stopping (no error) One- away heuris?c (lidle error) Reduc?ons in labor over 50% Probabilis?c approaches Reduc?ons in labor over 90% Theore?cal guarantees Batching (lidle addi?onal error) For details, see [Bragg et al. 2013] to appear at HCOMP

CS 6140: Machine Learning Spring What We Learned Last Week 2/26/16

Logis@cs CS 6140: Machine Learning Spring 2016 Instructor: Lu Wang College of Computer and Informa@on Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang Email: luwang@ccs.neu.edu Sign