Rule Extraction for Machine Translation


Andreas Maletti
Institute of Computer Science, Universität Leipzig, Germany
on leave from: Institute for Natural Language Processing, Universität Stuttgart, Germany
Darmstadt, September 10, 2014

Main notions

Machine translation (MT): automatic natural language translation (by a computer), as opposed to manual translation and computer-aided translation (e.g., translation memory).

Statistical machine translation (SMT): MT using systems automatically obtained from translations, as opposed to rule-based machine translation (old; e.g., SYSTRAN) and example-based machine translation (translation by analogy).

Rule Extraction for MT, A. Maletti

Short history

Timeline:
1. Dark age (60s-90s): rule-based systems (e.g., SYSTRAN), CHOMSKYan approach; perfect translation, poor coverage.
2. Reformation (1991-present): phrase-based and syntax-based systems, statistical approach; cheap, automatically trained.
3. Potential future: semantics-based systems (e.g., FRAMENET-based), semi-supervised statistical approach; basic understanding of the translated text.

Examples

Applications: technical manuals.

Example (an mp3 player):
"The synchronous manifestation of lyrics is a procedure for can broadcasting the music, waiting the mp3 file at the same time showing the lyrics."

Example (an mp3 player):
"With the this kind method that the equipments that synchronous function of support up broadcast to make use of document create setup, you can pass the LCD window way the check at the document contents that broadcast."

Example (an mp3 player):
"That procedure returns offerings to have to modify, and delete, and stick top, keep etc. edit function."

Example (Hotel Uppsala, Sweden):
"Wir hatten die Zimmer eingestuft wird als Superior weil sie renoviert wurde im letzten Jahr oder zwei. Unsere Zimmer hatten Parkettboden und waren sehr geräumig. Man musste allerdings nicht musste seitwärts bewegen."

Original English review:
"We stayed in rooms classified as superior because they had been renovated in the last year or two. Our rooms had wood floors and were roomy. You didn't have to walk sideways to move around."

Example (JONES, SHEN, HERZOG 2009), US military:
Soldier: Okay, what is your name?
Local: Abdul.
Soldier: And your last name?
Local: Al Farran.

The same dialogue through speech-to-text machine translation:
Soldier: Okay, what's your name?
Local: milk a mechanic and I am here I mean yes
Soldier: What is your last name?
Local: every two weeks my son's name is ismail

Examples

Applications: technical manuals, US military, MSDN, Knowledge Base, ...

Standard pipeline

Schema: Input → Translation model → Language model → Output
(the models are often integrated in practice)

Required resources:
- bilingual text (sentences in both languages): 1.5M sent.
- monolingual text (in the target language): 44M sent.
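As a toy illustration of how the two models combine in this pipeline, the sketch below scores translation candidates by adding translation-model and language-model log-probabilities. The word tables and numbers are invented for illustration, not real estimates.

```python
import math

# Toy translation model: log P(source | target) for a single word.
TM = {("haus", "house"): math.log(0.7), ("haus", "home"): math.log(0.3)}
# Toy language model: unigram log P(target word) as a stand-in.
LM = {"house": math.log(0.6), "home": math.log(0.4)}

def decode(source_word):
    """Pick the target word maximizing the combined log-score."""
    candidates = [tgt for (src, tgt) in TM if src == source_word]
    return max(candidates, key=lambda t: TM[(source_word, t)] + LM[t])

print(decode("haus"))  # -> house
```

In a real system both models contribute weighted feature scores and the maximization runs over whole sentences, but the additive combination of log-scores is the same.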

Word Alignment

English-German example:
We can help countries catch up, but not by putting their neighbours on hold
Wir können Ländern beim Aufholen helfen, aber nicht, indem wir ihre Nachbarn in den Wartesaal schicken

English-Russian example:
I was the one who sat down and copied them
Я был единственным, кто занялся копированием демо на кассете

Parallel Corpus

EUROPARL German-English parallel corpus:
- 1,920,209 parallel sentences
- 44,548,491 words in German
- 47,818,827 words in English
- sentence-aligned, but not word-aligned
- from parliament proceedings

Phrase-based Models

Word positions (the alignment A links source position l to target position l′):
Könnten 1 Sie 2 mir 3 eine 4 Auskunft 5 zu 6 Artikel im 9 Zusammenhang 10 mit 11 der 12 Unzulässigkeit 13 geben 14
I 1 would 2 like 3 your 4 advice 5 about 6 Rule concerning 9 inadmissibility 10

Algorithm:
1. A phrase pair ([j, j′], [i, i′]) is consistently aligned if
   - l′ ∈ [i, i′] for all l ∈ [j, j′] with (l, l′) ∈ A, and
   - l ∈ [j, j′] for all l′ ∈ [i, i′] with (l, l′) ∈ A.
2. Extract all consistently aligned phrase pairs.
3. (Restrict the length of the phrases based on corpus size.)
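A minimal sketch of this extraction, assuming the alignment A is given as a set of 1-indexed (source, target) link pairs; function and variable names are my own:

```python
def consistent(alignment, j1, j2, i1, i2):
    """A phrase pair ([j1,j2],[i1,i2]) is consistently aligned iff every
    link touching the source span lands inside the target span and vice
    versa, and at least one link lies inside."""
    inside = False
    for (j, i) in alignment:
        src_in = j1 <= j <= j2
        tgt_in = i1 <= i <= i2
        if src_in != tgt_in:          # link crosses the phrase boundary
            return False
        inside = inside or (src_in and tgt_in)
    return inside

def extract_phrases(alignment, src_len, tgt_len, max_len=7):
    """Enumerate all consistently aligned phrase pairs up to max_len."""
    pairs = []
    for j1 in range(1, src_len + 1):
        for j2 in range(j1, min(j1 + max_len, src_len + 1)):
            for i1 in range(1, tgt_len + 1):
                for i2 in range(i1, min(i1 + max_len, tgt_len + 1)):
                    if consistent(alignment, j1, j2, i1, i2):
                        pairs.append(((j1, j2), (i1, i2)))
    return pairs

# First links of the slide's example: Könnten–would/like, Sie–your.
A = {(1, 2), (1, 3), (2, 4)}
print(consistent(A, 1, 1, 2, 3))  # True: ([1,1],[2,3]) is consistent
```

The length restriction from step 3 appears here as `max_len`.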

Formally, the minimal consistently aligned phrase pairs are:
([1,1],[2,3]) ([2,2],[4,4]) ([3,3],[1,1]) ([4,5],[5,5]) ([6,6],[6,6]) ([7,7],[7,7]) ([8,8],[8,8]) ([9,11],[9,9]) ([12,13],[10,10])

For better readability:
Könnten → would like
Sie → your
mir → I
eine Auskunft → advice
zu → about
Artikel → Rule
im Zusammenhang mit → concerning
der Unzulässigkeit → inadmissibility

Notes:
- these were only the minimal phrase pairs
- extract all (sensible) combinations of these; e.g., ([1,1],[2,3]) and ([2,2],[4,4]) yield ([1,2],[2,4])
- unaligned words can be added to neighboring phrases; e.g., ([12,13],[10,10]) extends to ([12,14],[10,10])

Resulting phrases:
Könnten Sie → would like your
der Unzulässigkeit geben → inadmissibility
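The combination step can be sketched as follows. This is a simplified rule that only joins pairs adjacent on both sides in the same order; the general procedure also allows reordered neighbors:

```python
def combine(p, q):
    """Join two phrase pairs that are adjacent on both the source and
    the target side into one larger phrase pair, or return None."""
    (j1, j2), (i1, i2) = p
    (k1, k2), (l1, l2) = q
    if k1 == j2 + 1 and l1 == i2 + 1:
        return ((j1, k2), (i1, l2))
    return None

# The slide's example: ([1,1],[2,3]) and ([2,2],[4,4]) yield ([1,2],[2,4]).
print(combine(((1, 1), (2, 3)), ((2, 2), (4, 4))))  # ((1, 2), (2, 4))
```

Extending a phrase over neighboring unaligned words works analogously: the enlarged span stays consistent because unaligned words contribute no alignment links.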

Phrase-based Models

Alternative representation: each extracted phrase pair corresponds to a rectangle in the word-alignment matrix (figure not reproduced here).

Phrase-based Models

Rule weights:
- simple relative frequencies during extraction
- normally, different normalizations serve as features

Ideally:
- weights should be set to the utility of the rule for explaining the training data
- this would require reprocessing the training data
- EM or similar algorithms are available, but impractical; EM is not used
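The relative-frequency estimate mentioned above, sketched on invented counts:

```python
from collections import Counter

def relative_frequencies(extracted_pairs):
    """phi(tgt | src) = count(src, tgt) / count(src): the simple
    relative-frequency estimate computed at extraction time."""
    pair_counts = Counter(extracted_pairs)
    src_counts = Counter(src for (src, tgt) in extracted_pairs)
    return {(src, tgt): c / src_counts[src]
            for (src, tgt), c in pair_counts.items()}

# Hypothetical extracted instances, for illustration only.
probs = relative_frequencies([("Auskunft", "advice"),
                              ("Auskunft", "advice"),
                              ("Auskunft", "information")])
print(probs[("Auskunft", "advice")])  # 2/3
```

In practice the same counts are normalized in several directions (source given target, target given source, lexical weights) and used as separate features.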

Syntax-based Machine Translation

Syntax-based systems: Input → Parser → Machine translation system → Language model → Output

Syntax-based Machine Translation

CHARNIAK parser [CHARNIAK, JOHNSON, 2005]: parse tree of the English sentence "We must bear in mind the Community as a whole" (tree not reproduced here).

BitPar parser [SCHMID, 2006]: parse tree of the German sentence "Wir müssen uns davor hüten, alles vergemeinschaften zu wollen" (tree not reproduced here).

Syntax-based Machine Translation

Arabic-English:
Yugoslav President Voislav signed for Serbia.
Translit.: w twly AltwqyE En Srbya Alr}ys AlywgwslAfy fwyslaf.

And then the matter was decided, and everything was put in place.
Translit.: f kan An tm AlHsm w wdet Al>mwr fy nab ha.

Below are the male and female winners in the different categories.
Translit.: w hna Al>wA}l w Al>wlyAt fy mxtlf Alf}At.

Syntax-based Machine Translation

Alignment:
Yugoslav President Voislav signed for Serbia
w twly AltwqyE En Srbya Alr}ys AlywgwslAfy fwyslaf

Syntax-based Machine Translation [GALLEY et al., 2004]

Word-aligned tree pair: the English parse of "Yugoslav President Voislav signed for Serbia" (a subject NP containing the NML "Yugoslav President" and "Voislav", a VP with "signed" and the PP "for Serbia") aligned to the Arabic parse of "w twly AltwqyE En Srbya Alr}ys AlywgwslAfy fwyslaf" (trees not reproduced here).

Syntax-based Machine Translation

Rule extraction on the aligned tree pair:
- Select the next node bottom-up
- Identify the maximal subtree of aligned nodes
- Identify the subtree of nodes aligned to the aligned nodes, etc.
- Extract the rule and leave a state
- Repeat

Extracted lexical rules so far (each extracted subtree is replaced by a state):
Yugoslav -q_Y-> AlywgwslAfy
President -q_P-> Alr}ys
Voislav -q_V-> fwyslaf
for -q_f-> En
Serbia -q_S-> Srbya

Continuing upward, the NML node yields the rule
NML(q_Y, q_P) -q_NML-> (q_P, q_Y)
(the two children swap order on the Arabic side).

Further extracted rules:
S(q_S) -q_S-> S(q_S)
PP(q_f, q_S) -q_PP-> PP(q_f, q_S)
SBJ(q_NML, q_V) -q_SBJ-> SBJ(q_NML, S(q_V))

Syntax-based Machine Translation

Extracted rules:
Yugoslav -q_Y-> AlywgwslAfy
President -q_P-> Alr}ys
Voislav -q_V-> fwyslaf
for -q_f-> En
Serbia -q_S-> Srbya
NML(q_Y, q_P) -q_NML-> (q_P, q_Y)
S(q_S) -q_S-> S(q_S)
PP(q_f, q_S) -q_PP-> PP(q_f, q_S)
SBJ(q_NML, q_V) -q_SBJ-> SBJ(q_NML, S(q_V))

These are rules of an Extended Top-down Tree Transducer.
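A compact sketch of the frontier-node computation behind this procedure, assuming the span/closure idea of GALLEY et al., 2004: a node may be cut (and replaced by a state) when the closure of its target span does not overlap the spans of nodes outside its subtree. The Node class and the alignment positions are illustrative stand-ins for a real parser and word aligner.

```python
class Node:
    """Hypothetical parse-tree node; `align` holds the target
    positions a leaf word is aligned to."""
    def __init__(self, label, children=None, align=None):
        self.label = label
        self.children = children or []
        self.align = set(align or [])

    def span(self):
        # Target positions aligned to any word in this node's yield.
        if not self.children:
            return set(self.align)
        return set().union(*(c.span() for c in self.children))

def frontier_set(root):
    """Collect nodes whose span closure is disjoint from the spans of
    all nodes outside their subtree: the cut points for rules."""
    frontier = []

    def visit(node, outside):
        s = node.span()
        if s:
            closure = set(range(min(s), max(s) + 1))
            if not closure & outside:
                frontier.append(node)
        for i, child in enumerate(node.children):
            siblings = set().union(set(),
                *(c.span() for j, c in enumerate(node.children) if j != i))
            visit(child, outside | siblings)

    visit(root, set())
    return frontier

# The NML subtree of the example: Yugoslav and President are aligned
# to adjacent Arabic positions, so every node here is a frontier node.
example = Node("NML", [
    Node("JJ", [Node("Yugoslav", align=[7])]),
    Node("N", [Node("President", align=[6])]),
])
print([n.label for n in frontier_set(example)])
```

A node whose span interleaves with a sibling's span would fail the closure test and could not become a rule boundary.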

Extended Top-down Tree Transducer

Advantages:
- simple and natural model
- easy to train [GRAEHL et al., 2008]
- symmetric (obtained from linguistic resources)

Disadvantages:
- weights are set in the same manner and do not capture utility
- EM is available, but not used in practice

From Automata to Transducers

Graphical representation: a state q_SBJ with the two trees SBJ(q_NML, q_V) and SBJ(q_NML, S(q_V)) corresponds to the two productions
q_SBJ → SBJ(q_NML, q_V)
q_SBJ → SBJ(q_NML, S(q_V))

From Automata to Transducers

General idea: synchronous grammars are essentially two grammars over the same nonterminals whose productions are paired.

Convention: the same nonterminals are synchronized (or linked) and develop at the same time.

From Automata to Transducers

Approach:
- join two productions q1 → r1 and q2 → r2 into (q1, q2) → (r1, r2)
- demand q1 = q = q2 for simplicity and write r1 -q-> r2
- paired productions develop the input and the output tree at the same time
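In code, the pairing step is just a join on the shared state (a sketch; the function name and tree encoding as strings are mine):

```python
def pair_productions(input_prods, output_prods):
    """Synchronize two grammars: productions q -> r1 and q -> r2 that
    share the state q are joined into one synchronous production,
    written r1 <-q-> r2 on the slides."""
    return [(q1, r1, r2)
            for (q1, r1) in input_prods
            for (q2, r2) in output_prods
            if q1 == q2]

sync = pair_productions(
    [("q_SBJ", "SBJ(q_NML, q_V)")],
    [("q_SBJ", "SBJ(q_NML, S(q_V))")])
print(sync)
```

Every state shared by both grammars thus yields the cross product of its input and output right-hand sides.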

From Automata to Transducers

Example derivation (English "the boy saw the door" paired with Arabic "wa ra'aa atefl albab"):
- start with the linked state q
- apply q -> ⟨q, S(CONJ(wa), q)⟩: the Arabic side gains the conjunction wa
- apply q -> ⟨S(q1, VP(p, q2)), S(p, q1, q2)⟩: English is subject-verb-object, Arabic is verb-initial
- apply p -> ⟨V(saw), V(ra'aa)⟩
- apply the rules for the nominals: DT(the) and N(boy) yield N(atefl) (English "the boy" corresponds to the single Arabic word atefl), and DT(the) and N(door) yield N(albab)

Result: the English tree for "the boy saw the door" paired with the Arabic tree for "wa ra'aa atefl albab".
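The simultaneous development of linked nonterminals can be mimicked on strings. The toy rules below use hypothetical nonterminal names NP1 and NP2 and the transliterations from the slides; each rule rewrites its nonterminal on both sides at once:

```python
def derive(src, tgt, rules):
    """Rewrite linked nonterminals in both strings simultaneously
    until only terminal words remain (toy synchronous derivation)."""
    changed = True
    while changed:
        changed = False
        for nt, (s_rhs, t_rhs) in rules.items():
            if nt in src.split() and nt in tgt.split():
                src = " ".join(s_rhs if w == nt else w for w in src.split())
                tgt = " ".join(t_rhs if w == nt else w for w in tgt.split())
                changed = True
    return src, tgt

rules = {
    "NP1": ("the boy", "atefl"),
    "NP2": ("the door", "albab"),
}
print(derive("NP1 saw NP2", "wa ra'aa NP1 NP2", rules))
# ('the boy saw the door', "wa ra'aa atefl albab")
```

On trees the mechanism is the same, except that the linked nonterminals are states sitting at tree nodes rather than words in a string.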

From Automata to Transducers

Remarks:
- synchronization breaks almost all existing constructions (e.g., the normalization construction)
- the basic grammar model is therefore very important

Other Syntax-based Models

Important relations:
- SCFG = synchronous context-free grammar [CHIANG, 2007]: LTG-LTG (synchronous local tree grammar); ln-TOP is a special top-down tree transducer
- STSG = synchronous tree substitution grammar [EISNER, 2003]: TSG-TSG; ln-XTOP is a special extended top-down tree transducer
- STAG = synchronous tree adjunction grammar [SHIEBER, SCHABES, 1990]: TAG-TAG
- SCFTG = synchronous context-free tree grammar [NEDERHOF, VOGLER, 2012]: CFTG-CFTG

Other Syntax-based Models

Towards asymmetric relations:
- STSSG = synchronous tree-sequence substitution grammar [ZHANG et al., 2008]: TSSG-TSSG
- lMBOT = local shallow multi bottom-up tree transducer [BRAUNE et al., 2013]: LTG-TSSG
- ln-XMBOT corresponds roughly to RTG-TSSG

Where is Machine Learning?

Examples:
- unsupervised learning of translation models for STSG [BLUNSOM et al., 2008]
- uses GIBBS sampling, a hierarchical DIRICHLET process, ...

But...
- can only be used on small data
- does not scale well
- currently not used in practice

Machine Learning & Grammatical Inference

Quo vadis?

Literature

Selected references:
- ARNOLD, DAUCHET: Morphismes et Bimorphismes d'Arbres. Theoret. Comput. Sci. 20, 1982
- BRAUNE, SEEMANN, QUERNHEIM, MALETTI: Shallow local multi bottom-up tree transducers in statistical machine translation. Proc. ACL, 2013
- GALLEY, HOPKINS, KNIGHT, MARCU: What's in a Translation Rule? Proc. NAACL, 2004
- KOEHN: Statistical Machine Translation. Cambridge University Press, 2010
- GRAEHL, HOPKINS, KNIGHT: The Power of Extended Top-down Tree Transducers. SIAM J. Comput. 39, 2009
- OCH, NEY: The Alignment Template Approach to Statistical Machine Translation. Comput. Linguist. 30, 2004


More information

Hierarchies of Tree Series TransducersRevisited 1

Hierarchies of Tree Series TransducersRevisited 1 Hierarchies of Tree Series TransducersRevisited 1 Andreas Maletti 2 Technische Universität Dresden Fakultät Informatik June 27, 2006 1 Financially supported by the Friends and Benefactors of TU Dresden

More information

The Power of Weighted Regularity-Preserving Multi Bottom-up Tree Transducers

The Power of Weighted Regularity-Preserving Multi Bottom-up Tree Transducers International Journal of Foundations of Computer cience c World cientific Publishing Company The Power of Weighted Regularity-Preserving Multi Bottom-up Tree Transducers ANDREA MALETTI Universität Leipzig,

More information

Maschinelle Sprachverarbeitung

Maschinelle Sprachverarbeitung Maschinelle Sprachverarbeitung Parsing with Probabilistic Context-Free Grammar Ulf Leser Content of this Lecture Phrase-Structure Parse Trees Probabilistic Context-Free Grammars Parsing with PCFG Other

More information

Natural Language Processing CS Lecture 06. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Natural Language Processing CS Lecture 06. Razvan C. Bunescu School of Electrical Engineering and Computer Science Natural Language Processing CS 6840 Lecture 06 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Statistical Parsing Define a probabilistic model of syntax P(T S):

More information

Tuning as Linear Regression

Tuning as Linear Regression Tuning as Linear Regression Marzieh Bazrafshan, Tagyoung Chung and Daniel Gildea Department of Computer Science University of Rochester Rochester, NY 14627 Abstract We propose a tuning method for statistical

More information

LECTURER: BURCU CAN Spring

LECTURER: BURCU CAN Spring LECTURER: BURCU CAN 2017-2018 Spring Regular Language Hidden Markov Model (HMM) Context Free Language Context Sensitive Language Probabilistic Context Free Grammar (PCFG) Unrestricted Language PCFGs can

More information

The Power of Tree Series Transducers

The Power of Tree Series Transducers The Power of Tree Series Transducers Andreas Maletti 1 Technische Universität Dresden Fakultät Informatik June 15, 2006 1 Research funded by German Research Foundation (DFG GK 334) Andreas Maletti (TU

More information

Latent Variable Models in NLP

Latent Variable Models in NLP Latent Variable Models in NLP Aria Haghighi with Slav Petrov, John DeNero, and Dan Klein UC Berkeley, CS Division Latent Variable Models Latent Variable Models Latent Variable Models Observed Latent Variable

More information

Aspects of Tree-Based Statistical Machine Translation

Aspects of Tree-Based Statistical Machine Translation Aspects of Tree-Based Statistical Machine Translation Marcello Federico Human Language Technology FBK 2014 Outline Tree-based translation models: Synchronous context free grammars Hierarchical phrase-based

More information

Syntax-based Statistical Machine Translation

Syntax-based Statistical Machine Translation Syntax-based Statistical Machine Translation Philip Williams and Philipp Koehn 29 October 2014 Part I Part II - Introduction - Rule Extraction Part III - Decoding Part IV - Extensions Syntax-based Statistical

More information

Natural Language Processing (CSEP 517): Machine Translation

Natural Language Processing (CSEP 517): Machine Translation Natural Language Processing (CSEP 57): Machine Translation Noah Smith c 207 University of Washington nasmith@cs.washington.edu May 5, 207 / 59 To-Do List Online quiz: due Sunday (Jurafsky and Martin, 2008,

More information
