Lecture 7: Introduction to syntax-based MT

Size: px
Start display at page:

Download "Lecture 7: Introduction to syntax-based MT"

Transcription

1 Lecture 7: Introduction to syntax-based MT Andreas Maletti Statistical Machine Translation Stuttgart December 16, 2011 SMT VII A. Maletti 1

2 Lecture 7 Goals Overview Tree substitution grammars (tree automata) Synchronous grammars (tree transducers) SMT VII A. Maletti 2

3 Contents 1 Overview 2 Tree reresentations 3 Bar-Hillel Construction 4 Variants 5 Tree Transducers SMT VII A. Maletti 3

4 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: And then the matter was decided, and everything was ut in lace Outut: SMT VII A. Maletti 4

5 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: then the matter was decided, and everything was ut in lace Outut: SMT VII A. Maletti 4

6 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter was decided, and everything was ut in lace Outut: SMT VII A. Maletti 4

7 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter was decided, and everything was ut in lace Outut: f SMT VII A. Maletti 4

8 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter was decided, and everything was ut in lace Outut: f kan SMT VII A. Maletti 4

9 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter was decided, and everything was ut in lace Outut: f kan SMT VII A. Maletti 4

10 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter was decided, and everything was ut in lace Outut: f kan SMT VII A. Maletti 4

11 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter was decided, and everything was ut in lace Outut: f kan SMT VII A. Maletti 4

12 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter, and everything was ut in lace Outut: f kan An tm AlHsm SMT VII A. Maletti 4

13 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter and everything was ut in lace Outut: f kan An tm AlHsm SMT VII A. Maletti 4

14 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter everything was ut in lace Outut: f kan An tm AlHsm w SMT VII A. Maletti 4

15 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter was ut in lace Outut: f kan An tm AlHsm w SMT VII A. Maletti 4

16 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter was ut in lace Outut: f kan An tm AlHsm w SMT VII A. Maletti 4

17 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: the matter in lace Outut: f kan An tm AlHsm w wdet SMT VII A. Maletti 4

18 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: in lace Outut: f kan An tm AlHsm w wdet Al>mwr SMT VII A. Maletti 4

19 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: lace Outut: f kan An tm AlHsm w wdet Al>mwr fy SMT VII A. Maletti 4

20 Word-based system (FST) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: Outut: f kan An tm AlHsm w wdet Al>mwr fy nsab ha SMT VII A. Maletti 4

21 Phrase-based machine translation Schema Inut Machine translation system Language model Outut Phrase-based systems Inut Segmenter Machine translation system Language model Outut SMT VII A. Maletti 5

22 Phrase-based machine translation Schema Inut Machine translation system Language model Outut Phrase-based systems Inut Segmenter Machine translation system Language model Outut SMT VII A. Maletti 5

23 Phrase-based system (FST+Perm) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: And then the matter was decided, and everything was ut in lace Outut: SMT VII A. Maletti 6

24 Phrase-based system (FST+Perm) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: And then 1 the matter 5 was decided 2, and everything 3 was ut 4 in lace 6 Outut: SMT VII A. Maletti 6

25 Phrase-based system (FST+Perm) And then the matter was decided, and everything was ut in lace f kan An tm AlHsm w wdet Al>mwr fy nsab ha Derivation Inut: And then 1 the matter 5 was decided 2, and everything 3 was ut 4 in lace 6 Outut: f kan 1 An tm AlHsm 2 w 3 wdet 4 Almwr 5 fy nsab ha 6 SMT VII A. Maletti 6

26 Machine translation (cont d) Phrase-based systems Inut Segmenter Machine translation system Language model Outut Syntax-based systems Inut Parser Machine translation system Language model Outut SMT VII A. Maletti 7

27 Machine translation (cont d) Phrase-based systems Inut Segmenter Machine translation system Language model Outut Syntax-based systems Inut Parser Machine translation system Language model Outut SMT VII A. Maletti 7

28 Syntax-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 8

29 Syntax-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 8

30 Syntax-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 8

31 Syntax-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 8

32 Syntax-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 8

33 Parser S S CC S CC ADVP NP-SBJ-9 VP and NP-SBJ-1 VP And RB DT NN VBD VP NN VBD VP then the matter was VBN NP-9 everything was VBN NP-1 PP decided, ut * IN NP in NN lace And then the matter was decided, and everything was ut in lace (thanks to KEVIN KNIGHT for the data) SMT VII A. Maletti 9

34 Semantics-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 10

35 Semantics-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 10

36 Semantics-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 10

37 Semantics-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 10

38 Semantics-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 10

39 Semantics-based Aroach Overview English German Semantics Syntax Phrase SMT VII A. Maletti 10

40 Contents 1 Overview 2 Tree reresentations 3 Bar-Hillel Construction 4 Variants 5 Tree Transducers SMT VII A. Maletti 11

41 Parsing and CFG Examle (Context-free grammar) S NP VP VP VBP ADVP JJ Colorless NNS ideas RB furiously NP JJ JJ NNS ADVP RB JJ green VBP slee Derivation S Colorless green ideas slee furiously SMT VII A. Maletti 12

42 Parse tree Examle S NP VP JJ JJ NNS VBP ADVP Colorless green ideas slee RB furiously Remark We are interested in the arse tree, not just whether S w! SMT VII A. Maletti 13

43 Parse tree Examle S NP VP JJ JJ NNS VBP ADVP Colorless green ideas slee RB furiously Remark We are interested in the arse tree, not just whether S w! SMT VII A. Maletti 13

44 Parse tree Examle S NP VP JJ JJ NNS VBP ADVP Colorless green ideas slee RB furiously Remark We are interested in the arse tree, not just whether S w! But there can be exonentially many arse trees for a sentence. SMT VII A. Maletti 13

45 Packed tree language Remark A tree language is often called forest in NLP. Examle S S NP VP NP VP JJ JJ NNS VBP ADVP JJ JJ NNS VBP ADVP Colorless green ideas slee RB green green ideas slee RB furiously furiously SMT VII A. Maletti 14

46 Packed tree language Remark A tree language is often called forest in NLP. Examle S S NP VP NP VP JJ JJ NNS VBP ADVP JJ JJ NNS VBP ADVP Colorless green ideas slee RB green green ideas slee RB S furiously S furiously NP NP VP JJ green JJ Colorless JJ green NNS ideas VBP slee ADVP RB furiously SMT VII A. Maletti 14

47 Local tree language Definition A local tree grammar is a grammar with rules of the form S S N 1... N k SMT VII A. Maletti 15

48 Local tree language Examle S NP S VP NP NP JJ JJ NNS VP VBP VP ADVP ADVP ADVP RB JJ JJ Colorless JJ JJ green NNS NNS ideas VBP VBP slee RB RB furiously Derivation S NP VP JJ JJ NNS VBP ADVP Colorless green ideas slee RB furiously SMT VII A. Maletti 16

49 Local tree language Definition The tree languages generated by local tree grammars are the local tree languages. Theorem The set of derivations of a context-free grammar forms a local tree language. SMT VII A. Maletti 17

50 Local tree language Definition The tree languages generated by local tree grammars are the local tree languages. Theorem The set of derivations of a context-free grammar forms a local tree language. SMT VII A. Maletti 17

51 Local tree language Question Is the tree language consisting of only S NP VP JJ JJ NNS VBP ADVP Colorless green ideas slee RB furiously a local tree language? SMT VII A. Maletti 18

52 Local tree language Question Is the tree language consisting of only S NP VP JJ JJ NNS VBP ADVP Colorless green ideas slee RB furiously a local tree language? Answer NO! SMT VII A. Maletti 18

53 Local tree language Notes Local tree languages have undesirable roerties: not closed under union cannot reresent all finite languages... SMT VII A. Maletti 19

54 Regular tree language Definition A regular tree grammar is a grammar with rules of the form q S q 1... q k The such generated languages are the regular tree languages. Remark Regular tree grammars are local tree grammars with hidden states. SMT VII A. Maletti 20

55 Regular tree language Definition A regular tree grammar is a grammar with rules of the form q S q 1... q k The such generated languages are the regular tree languages. Remark Regular tree grammars are local tree grammars with hidden states. SMT VII A. Maletti 20

56 Regular tree language Examle q S S q NP q NP q VP NP q VP JJ q JJ q NNS VP JJ JJ q VBP q ADVP q c q JJ JJ q NNS NNS q VBD VBD q g q i q s q ADVP ADVP q RB q RB RB q f Derivation q S SMT VII A. Maletti 21

57 Regular tree language Examle q S S q NP q NP q VP NP q VP JJ q JJ q NNS VP JJ JJ q VBP q ADVP q c q JJ JJ q NNS NNS q VBD VBD q g q i q s q ADVP ADVP q RB q RB RB q f Derivation q S q NP S q VP SMT VII A. Maletti 21

58 Regular tree language Examle q S S q NP q NP q VP NP q VP JJ q JJ q NNS VP JJ JJ q VBP q ADVP q c q JJ JJ q NNS NNS q VBD VBD q g q i q s q ADVP ADVP q RB q RB RB q f Derivation q S S q NP q VP NP S q VP JJ q JJ q NNS SMT VII A. Maletti 21

59 Regular tree languages Princial roerties Finite languages are regular Closed under all Boolean oerations Closed under relabelings, linear homomorhisms, inverse homomorhisms Can be determinized and minimized (bottom-u) Summary They are basically the tree version of finite-state automata with the same nice roerties. SMT VII A. Maletti 22

60 Trees and their yield Definition The yield of a tree is the string of its leaves (in natural order). Theorem The yield language of a regular tree language is context-free. Each context-free language is the yield of a regular tree language. SMT VII A. Maletti 23

61 Questions Question Is {σ(t, t) t arbitrary tree} regular? SMT VII A. Maletti 24

62 Questions Question Is {σ(t, t) t arbitrary tree} regular? Answer NO! SMT VII A. Maletti 24

63 Questions Question Is {σ(t, t) t arbitrary tree} regular? Answer NO! Question Is {σ(γ n (α), γ n (α)) n N} regular? SMT VII A. Maletti 24

64 Questions Question Is {σ(t, t) t arbitrary tree} regular? Answer NO! Question Is {σ(γ n (α), γ n (α)) n N} regular? Answer NO! Remark Not every tree language with context-free yield language is regular! SMT VII A. Maletti 24

65 Back to arsing Observation Most CFG-arsers are regular tree grammars (+ control) because they are based on a CFG ( local tree grammar) and have hidden states (or features) Alternative The features can be made exlicit in the arse tree structure. SMT VII A. Maletti 25

66 Back to arsing Observation Most CFG-arsers are regular tree grammars (+ control) because they are based on a CFG ( local tree grammar) and have hidden states (or features) Alternative The features can be made exlicit in the arse tree structure. SMT VII A. Maletti 25

67 Contents 1 Overview 2 Tree reresentations 3 Bar-Hillel Construction 4 Variants 5 Tree Transducers SMT VII A. Maletti 26

68 Binarization Inut tree S Binarized tree S NP VP NP VP JJ JJ NNS VBP VBP ADVP Colorless green ideas slee RB slee RB furiously Colorless JJ NNS furiously green ideas Theorem A tree language is regular if and only if its binarization is regular SMT VII A. Maletti 27

69 Binarization Inut tree S Binarized tree S NP VP NP VP JJ JJ NNS VBP VBP ADVP Colorless green ideas slee RB slee RB furiously Colorless JJ NNS furiously green ideas Theorem A tree language is regular if and only if its binarization is regular SMT VII A. Maletti 27

70 Individual runs Run on the yield () Colorless ( 1 ) ideas ( 2 ) slee ( 3 ) furiously ( ) Run on the inut tree S, q NP, q VP, q 1 JJ, q NNS, q VBP, q 1 ADVP, q 2 Colorless, w ideas, w slee, w RB, q 2 furiously, w SMT VII A. Maletti 28

71 Bar-Hillel construction Run on the yield () Colorless ( 1 ) ideas ( 2 ) slee ( 3 ) furiously ( ) Comosite run, S, q,, NP, q, 2 2, VP, q 1,, JJ, q, 1 1, NNS, q, 2 2, VBP, q 1, 3 3, ADVP, q 2,, Colorless, w, 1 1, ideas, w, 2 2, slee, w, 3 3, RB, q 2, 3, furiously, w, SMT VII A. Maletti 29

72 Bar-Hillel construction (cont d) Illustration σ q σ q = = = = = 1 q 1 1 q 1 2 q 2 ν(, α, q) α q = e q SMT VII A. Maletti 30

73 Bar-Hillel construction (cont d) Theorem The regular restriction of a regular tree language is regular Remark Comlexity: O(mn 3 ) m: size of the regular tree grammar n: size of the regular grammar (or inut string) Conclusion We can arse with regular tree grammars in O(mn 3 ). SMT VII A. Maletti 31

74 Contents 1 Overview 2 Tree reresentations 3 Bar-Hillel Construction 4 Variants 5 Tree Transducers SMT VII A. Maletti 32

75 To-down Bar-Hillel construction Run on the yield (1) Colorless (2) ideas (3) slee (4) furiously (5) Comosite run 1, S, q, 5 1, NP, q, 3 3, VP, q 1, 5 SMT VII A. Maletti 33

76 To-down Bar-Hillel construction Run on the yield (1) Colorless (2) ideas (3) slee (4) furiously (5) Comosite run 1, S, q, 5 1, NP, q, 3 3, VP, q 1, 5 1, JJ, q, 2 2, NNS, q, 3 SMT VII A. Maletti 33

77 To-down Bar-Hillel construction Run on the yield (1) Colorless (2) ideas (3) slee (4) furiously (5) Comosite run 1, S, q, 5 1, NP, q, 3 3, VP, q 1, 5 1, JJ, q, 2 2, NNS, q, 3 3, VBP, q 1, 4 4, ADVP, q 2, 5 1, Colorless, w, 2 2, ideas, w, 3 3, slee, w, 4 4, RB, q 2, 5 4, furiously, w, 5 SMT VII A. Maletti 33

78 Bottom-u Bar-Hillel construction Run on the yield (1) Colorless (2) ideas (3) slee (4) furiously (5) Comosite run 1, Colorless, w, 2 SMT VII A. Maletti 34

79 Bottom-u Bar-Hillel construction Run on the yield (1) Colorless (2) ideas (3) slee (4) furiously (5) Comosite run 1, JJ, q, 2 1, Colorless, w, 2 SMT VII A. Maletti 34

80 Bottom-u Bar-Hillel construction Run on the yield (1) Colorless (2) ideas (3) slee (4) furiously (5) Comosite run 1, JJ, q, 2 1, Colorless, w, 2 2, NNS, q, 3 2, ideas, w, 3 SMT VII A. Maletti 34

81 Bottom-u Bar-Hillel construction Run on the yield (1) Colorless (2) ideas (3) slee (4) furiously (5) Comosite run 1, NP, q, 3 1, JJ, q, 2 1, Colorless, w, 2 2, NNS, q, 3 2, ideas, w, 3 SMT VII A. Maletti 34

82 Bottom-u Bar-Hillel construction Run on the yield (1) Colorless (2) ideas (3) slee (4) furiously (5) Comosite run 1, S, q, 5 1, NP, q, 3 3, VP, q 1, 5 1, JJ, q, 2 2, NNS, q, 3 3, VBP, q 1, 4 4, ADVP, q 2, 5 1, Colorless, w, 2 2, ideas, w, 3 3, slee, w, 4 4, RB, q 2, 5 4, furiously, w, 5 SMT VII A. Maletti 34

83 Summary Key oints regular tree grammar as efficient tree data structure context-free behavior basis for most syntax-based translation models Further models weaker models are generally inadequate tree adjoining grammars (more exressive, but worse comutational roerties) Automata on directed acyclic grahs (see Daniel s lecture) SMT VII A. Maletti 35

84 Contents 1 Overview 2 Tree reresentations 3 Bar-Hillel Construction 4 Variants 5 Tree Transducers SMT VII A. Maletti 36

85 Extended To-down Tree Transducer Definition Each rule now has an inut and an outut side, which are both full trees (not just a single symbol followed by states) Examle σ q σ q r q q r α x x α x x SMT VII A. Maletti 37

86 Link Structure σ q q SMT VII A. Maletti 38

87 Link Structure σ q q SMT VII A. Maletti 38

88 Derivation Rules σ q q σ q r q r α x x α x x SMT VII A. Maletti 39

89 Derivation Rules σ q q σ q r q r α x x α x x SMT VII A. Maletti 39

90 Derivation Rules σ q q σ q r q r α x x α x x σ q q SMT VII A. Maletti 39

91 Derivation Rules σ q q σ q r q r α x x α x x σ q q σ q q SMT VII A. Maletti 39

92 Derivation Rules σ q q σ q r q r α x x α x x σ q q σ α q α q SMT VII A. Maletti 39

93 Derivation Rules σ q q σ q r q r α x x α x x σ α q α q σ α q α q SMT VII A. Maletti 39

94 Derivation Rules σ q q σ q r q r α x x α x x σ α q α q σ α q σ q r α q q r SMT VII A. Maletti 39

95 Derivation Rules σ q q σ q r q r α x x α x x σ α q σ q r α q q r σ α σ q r α q r SMT VII A. Maletti 39

96 Derivation Rules σ q q σ q r q r α x x α x x σ α σ q r α q r σ α σ α SMT VII A. Maletti 39

97 Semantics of XTOP σ α σ α SMT VII A. Maletti 40

98 Semantics of XTOP α σ σ α SMT VII A. Maletti 40

99 References Bar-Hillel, Perles, Shamir: On formal roerties of simle hrase structure grammars. Language and Information, 1964 Berstel, Reutenauer: Recognizable formal ower series on trees. Theoret. Comut. Sci. 18, , 1982 Borchardt: The theory of recognizable tree series. Ph.D. thesis TU Dresden, 2005 Gécseg, Steinby: Tree Automata. Akadémiai Kiadó, Budaest 1984 Nederhof, Satta: Probabilistic arsing as intersection. In IWPT 2003 Thank you for your attention! SMT VII A. Maletti 41

Statistical Machine Translation of Natural Languages

Statistical Machine Translation of Natural Languages 1/26 Statistical Machine Translation of Natural Languages Heiko Vogler Technische Universität Dresden Germany Graduiertenkolleg Quantitative Logics and Automata Dresden, November, 2012 1/26 Weighted Tree

More information

Lecture 9: Decoding. Andreas Maletti. Stuttgart January 20, Statistical Machine Translation. SMT VIII A. Maletti 1

Lecture 9: Decoding. Andreas Maletti. Stuttgart January 20, Statistical Machine Translation. SMT VIII A. Maletti 1 Lecture 9: Decoding Andreas Maletti Statistical Machine Translation Stuttgart January 20, 2012 SMT VIII A. Maletti 1 Lecture 9 Last time Synchronous grammars (tree transducers) Rule extraction Weight training

More information

Applications of Tree Automata Theory Lecture VI: Back to Machine Translation

Applications of Tree Automata Theory Lecture VI: Back to Machine Translation Applications of Tree Automata Theory Lecture VI: Back to Machine Translation Andreas Maletti Institute of Computer Science Universität Leipzig, Germany on leave from: Institute for Natural Language Processing

More information

Tree transformations and dependencies

Tree transformations and dependencies Tree transformations and deendencies Andreas Maletti Universität Stuttgart Institute for Natural Language Processing Azenbergstraße 12, 70174 Stuttgart, Germany andreasmaletti@imsuni-stuttgartde Abstract

More information

Tree Transducers in Machine Translation

Tree Transducers in Machine Translation Tree Transducers in Machine Translation Andreas Maletti Universitat Rovira i Virgili Tarragona, pain andreas.maletti@urv.cat Jena August 23, 2010 Tree Transducers in MT A. Maletti 1 Machine translation

More information

Probabilistic Context-Free Grammars. Michael Collins, Columbia University

Probabilistic Context-Free Grammars. Michael Collins, Columbia University Probabilistic Context-Free Grammars Michael Collins, Columbia University Overview Probabilistic Context-Free Grammars (PCFGs) The CKY Algorithm for parsing with PCFGs A Probabilistic Context-Free Grammar

More information

Chapter 14 (Partially) Unsupervised Parsing

Chapter 14 (Partially) Unsupervised Parsing Chapter 14 (Partially) Unsupervised Parsing The linguistically-motivated tree transformations we discussed previously are very effective, but when we move to a new language, we may have to come up with

More information

Hierarchies of Tree Series TransducersRevisited 1

Hierarchies of Tree Series TransducersRevisited 1 Hierarchies of Tree Series TransducersRevisited 1 Andreas Maletti 2 Technische Universität Dresden Fakultät Informatik June 27, 2006 1 Financially supported by the Friends and Benefactors of TU Dresden

More information

Parsing. Based on presentations from Chris Manning s course on Statistical Parsing (Stanford)

Parsing. Based on presentations from Chris Manning s course on Statistical Parsing (Stanford) Parsing Based on presentations from Chris Manning s course on Statistical Parsing (Stanford) S N VP V NP D N John hit the ball Levels of analysis Level Morphology/Lexical POS (morpho-synactic), WSD Elements

More information

Structure and Complexity of Grammar-Based Machine Translation

Structure and Complexity of Grammar-Based Machine Translation Structure and of Grammar-Based Machine Translation University of Padua, Italy New York, June 9th, 2006 1 2 Synchronous context-free grammars Definitions Computational problems 3 problem SCFG projection

More information

How to train your multi bottom-up tree transducer

How to train your multi bottom-up tree transducer How to train your multi bottom-up tree transducer Andreas Maletti Universität tuttgart Institute for Natural Language Processing tuttgart, Germany andreas.maletti@ims.uni-stuttgart.de Portland, OR June

More information

Random Generation of Nondeterministic Tree Automata

Random Generation of Nondeterministic Tree Automata Random Generation of Nondeterministic Tree Automata Thomas Hanneforth 1 and Andreas Maletti 2 and Daniel Quernheim 2 1 Department of Linguistics University of Potsdam, Germany 2 Institute for Natural Language

More information

Probabilistic Context Free Grammars. Many slides from Michael Collins

Probabilistic Context Free Grammars. Many slides from Michael Collins Probabilistic Context Free Grammars Many slides from Michael Collins Overview I Probabilistic Context-Free Grammars (PCFGs) I The CKY Algorithm for parsing with PCFGs A Probabilistic Context-Free Grammar

More information

Probabilistic Context Free Grammars. Many slides from Michael Collins and Chris Manning

Probabilistic Context Free Grammars. Many slides from Michael Collins and Chris Manning Probabilistic Context Free Grammars Many slides from Michael Collins and Chris Manning Overview I Probabilistic Context-Free Grammars (PCFGs) I The CKY Algorithm for parsing with PCFGs A Probabilistic

More information

Probabilistic Context-free Grammars

Probabilistic Context-free Grammars Probabilistic Context-free Grammars Computational Linguistics Alexander Koller 24 November 2017 The CKY Recognizer S NP VP NP Det N VP V NP V ate NP John Det a N sandwich i = 1 2 3 4 k = 2 3 4 5 S NP John

More information

Applications of Tree Automata Theory Lecture V: Theory of Tree Transducers

Applications of Tree Automata Theory Lecture V: Theory of Tree Transducers Applications of Tree Automata Theory Lecture V: Theory of Tree Transducers Andreas Maletti Institute of Computer Science Universität Leipzig, Germany on leave from: Institute for Natural Language Processing

More information

Preservation of Recognizability for Weighted Linear Extended Top-Down Tree Transducers

Preservation of Recognizability for Weighted Linear Extended Top-Down Tree Transducers Preservation of Recognizability for Weighted Linear Extended Top-Down Tree Transducers Nina eemann and Daniel Quernheim and Fabienne Braune and Andreas Maletti University of tuttgart, Institute for Natural

More information

The Power of Tree Series Transducers

The Power of Tree Series Transducers The Power of Tree Series Transducers Andreas Maletti 1 Technische Universität Dresden Fakultät Informatik June 15, 2006 1 Research funded by German Research Foundation (DFG GK 334) Andreas Maletti (TU

More information

CKY & Earley Parsing. Ling 571 Deep Processing Techniques for NLP January 13, 2016

CKY & Earley Parsing. Ling 571 Deep Processing Techniques for NLP January 13, 2016 CKY & Earley Parsing Ling 571 Deep Processing Techniques for NLP January 13, 2016 No Class Monday: Martin Luther King Jr. Day CKY Parsing: Finish the parse Recognizer à Parser Roadmap Earley parsing Motivation:

More information

Compositions of Bottom-Up Tree Series Transformations

Compositions of Bottom-Up Tree Series Transformations Compositions of Bottom-Up Tree Series Transformations Andreas Maletti a Technische Universität Dresden Fakultät Informatik D 01062 Dresden, Germany maletti@tcs.inf.tu-dresden.de May 17, 2005 1. Motivation

More information

This kind of reordering is beyond the power of finite transducers, but a synchronous CFG can do this.

This kind of reordering is beyond the power of finite transducers, but a synchronous CFG can do this. Chapter 12 Synchronous CFGs Synchronous context-free grammars are a generalization of CFGs that generate pairs of related strings instead of single strings. They are useful in many situations where one

More information

A Syntax-based Statistical Machine Translation Model. Alexander Friedl, Georg Teichtmeister

A Syntax-based Statistical Machine Translation Model. Alexander Friedl, Georg Teichtmeister A Syntax-based Statistical Machine Translation Model Alexander Friedl, Georg Teichtmeister 4.12.2006 Introduction The model Experiment Conclusion Statistical Translation Model (STM): - mathematical model

More information

Harvard CS 121 and CSCI E-207 Lecture 10: Ambiguity, Pushdown Automata

Harvard CS 121 and CSCI E-207 Lecture 10: Ambiguity, Pushdown Automata Harvard CS 121 and CSCI E-207 Lecture 10: Ambiguity, Pushdown Automata Salil Vadhan October 4, 2012 Reading: Sipser, 2.2. Another example of a CFG (with proof) L = {x {a, b} : x has the same # of a s and

More information

LECTURER: BURCU CAN Spring

LECTURER: BURCU CAN Spring LECTURER: BURCU CAN 2017-2018 Spring Regular Language Hidden Markov Model (HMM) Context Free Language Context Sensitive Language Probabilistic Context Free Grammar (PCFG) Unrestricted Language PCFGs can

More information

Compositions of Tree Series Transformations

Compositions of Tree Series Transformations Compositions of Tree Series Transformations Andreas Maletti a Technische Universität Dresden Fakultät Informatik D 01062 Dresden, Germany maletti@tcs.inf.tu-dresden.de December 03, 2004 1. Motivation 2.

More information

Decoding and Inference with Syntactic Translation Models

Decoding and Inference with Syntactic Translation Models Decoding and Inference with Syntactic Translation Models March 5, 2013 CFGs S NP VP VP NP V V NP NP CFGs S NP VP S VP NP V V NP NP CFGs S NP VP S VP NP V NP VP V NP NP CFGs S NP VP S VP NP V NP VP V NP

More information

In this chapter, we explore the parsing problem, which encompasses several questions, including:

In this chapter, we explore the parsing problem, which encompasses several questions, including: Chapter 12 Parsing Algorithms 12.1 Introduction In this chapter, we explore the parsing problem, which encompasses several questions, including: Does L(G) contain w? What is the highest-weight derivation

More information

Aspects of Tree-Based Statistical Machine Translation

Aspects of Tree-Based Statistical Machine Translation Aspects of Tree-Based Statistical Machine Translation Marcello Federico Human Language Technology FBK 2014 Outline Tree-based translation models: Synchronous context free grammars Hierarchical phrase-based

More information

Cryptanalysis of Pseudorandom Generators

Cryptanalysis of Pseudorandom Generators CSE 206A: Lattice Algorithms and Alications Fall 2017 Crytanalysis of Pseudorandom Generators Instructor: Daniele Micciancio UCSD CSE As a motivating alication for the study of lattice in crytograhy we

More information

Outline. CS21 Decidability and Tractability. Regular expressions and FA. Regular expressions and FA. Regular expressions and FA

Outline. CS21 Decidability and Tractability. Regular expressions and FA. Regular expressions and FA. Regular expressions and FA Outline CS21 Decidability and Tractability Lecture 4 January 14, 2019 FA and Regular Exressions Non-regular languages: Puming Lemma Pushdown Automata Context-Free Grammars and Languages January 14, 2019

More information

Harvard CS 121 and CSCI E-207 Lecture 9: Regular Languages Wrap-Up, Context-Free Grammars

Harvard CS 121 and CSCI E-207 Lecture 9: Regular Languages Wrap-Up, Context-Free Grammars Harvard CS 121 and CSCI E-207 Lecture 9: Regular Languages Wrap-Up, Context-Free Grammars Salil Vadhan October 2, 2012 Reading: Sipser, 2.1 (except Chomsky Normal Form). Algorithmic questions about regular

More information

Proof Nets and Boolean Circuits

Proof Nets and Boolean Circuits Proof Nets and Boolean Circuits Kazushige Terui terui@nii.ac.j National Institute of Informatics, Tokyo 14/07/04, Turku.1/44 Motivation (1) Proofs-as-Programs (Curry-Howard) corresondence: Proofs = Programs

More information

Minimization of Weighted Automata

Minimization of Weighted Automata Minimization of Weighted Automata Andreas Maletti Universitat Rovira i Virgili Tarragona, Spain Wrocław May 19, 2010 Minimization of Weighted Automata Andreas Maletti 1 In Collaboration with ZOLTÁN ÉSIK,

More information

Computational Models - Lecture 4 1

Computational Models - Lecture 4 1 Computational Models - Lecture 4 1 Handout Mode Iftach Haitner. Tel Aviv University. November 21, 2016 1 Based on frames by Benny Chor, Tel Aviv University, modifying frames by Maurice Herlihy, Brown University.

More information

Statistical Machine Translation

Statistical Machine Translation Statistical Machine Translation -tree-based models (cont.)- Artem Sokolov Computerlinguistik Universität Heidelberg Sommersemester 2015 material from P. Koehn, S. Riezler, D. Altshuler Bottom-Up Decoding

More information

Foundations of Informatics: a Bridging Course

Foundations of Informatics: a Bridging Course Foundations of Informatics: a Bridging Course Week 3: Formal Languages and Semantics Thomas Noll Lehrstuhl für Informatik 2 RWTH Aachen University noll@cs.rwth-aachen.de http://www.b-it-center.de/wob/en/view/class211_id948.html

More information

Analysing Soft Syntax Features and Heuristics for Hierarchical Phrase Based Machine Translation

Analysing Soft Syntax Features and Heuristics for Hierarchical Phrase Based Machine Translation Analysing Soft Syntax Features and Heuristics for Hierarchical Phrase Based Machine Translation David Vilar, Daniel Stein, Hermann Ney IWSLT 2008, Honolulu, Hawaii 20. October 2008 Human Language Technology

More information

CS 6120/CS4120: Natural Language Processing

CS 6120/CS4120: Natural Language Processing CS 6120/CS4120: Natural Language Processing Instructor: Prof. Lu Wang College of Computer and Information Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang Assignment/report submission

More information

Processing/Speech, NLP and the Web

Processing/Speech, NLP and the Web CS460/626 : Natural Language Processing/Speech, NLP and the Web (Lecture 25 Probabilistic Parsing) Pushpak Bhattacharyya CSE Dept., IIT Bombay 14 th March, 2011 Bracketed Structure: Treebank Corpus [ S1[

More information

Syntax-Directed Translations and Quasi-alphabetic Tree Bimorphisms Revisited

Syntax-Directed Translations and Quasi-alphabetic Tree Bimorphisms Revisited Syntax-Directed Translations and Quasi-alphabetic Tree Bimorphisms Revisited Andreas Maletti and C t lin Ionuµ Tîrn uc Universitat Rovira i Virgili Departament de Filologies Romàniques Av. Catalunya 35,

More information

CS626: NLP, Speech and the Web. Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 14: Parsing Algorithms 30 th August, 2012

CS626: NLP, Speech and the Web. Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 14: Parsing Algorithms 30 th August, 2012 CS626: NLP, Speech and the Web Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 14: Parsing Algorithms 30 th August, 2012 Parsing Problem Semantics Part of Speech Tagging NLP Trinity Morph Analysis

More information

CS460/626 : Natural Language

CS460/626 : Natural Language CS460/626 : Natural Language Processing/Speech, NLP and the Web (Lecture 23, 24 Parsing Algorithms; Parsing in case of Ambiguity; Probabilistic Parsing) Pushpak Bhattacharyya CSE Dept., IIT Bombay 8 th,

More information

Bisimulation Minimisation for Weighted Tree Automata

Bisimulation Minimisation for Weighted Tree Automata Bisimulation Minimisation for Weighted Tree Automata Johanna Högberg 1, Andreas Maletti 2, and Jonathan May 3 1 Department of Computing Science, Umeå University S 90187 Umeå, Sweden johanna@cs.umu.se 2

More information

Parsing with Context-Free Grammars

Parsing with Context-Free Grammars Parsing with Context-Free Grammars CS 585, Fall 2017 Introduction to Natural Language Processing http://people.cs.umass.edu/~brenocon/inlp2017 Brendan O Connor College of Information and Computer Sciences

More information

Tree Adjoining Grammars

Tree Adjoining Grammars Tree Adjoining Grammars TAG: Parsing and formal properties Laura Kallmeyer & Benjamin Burkhardt HHU Düsseldorf WS 2017/2018 1 / 36 Outline 1 Parsing as deduction 2 CYK for TAG 3 Closure properties of TALs

More information

Exam Computability and Complexity

Exam Computability and Complexity Total number of points:... Number of extra sheets of paper:... Exam Computability and Complexity by Jiri Srba, January 2009 Student s full name CPR number Study number Before you start, fill in the three

More information

Computational Linguistics

Computational Linguistics Computational Linguistics Dependency-based Parsing Clayton Greenberg Stefan Thater FR 4.7 Allgemeine Linguistik (Computerlinguistik) Universität des Saarlandes Summer 2016 Acknowledgements These slides

More information

CS : Speech, NLP and the Web/Topics in AI

CS : Speech, NLP and the Web/Topics in AI CS626-449: Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-17: Probabilistic parsing; insideoutside probabilities Probability of a parse tree (cont.) S 1,l NP 1,2

More information

Automata Theory. CS F-10 Non-Context-Free Langauges Closure Properties of Context-Free Languages. David Galles

Automata Theory. CS F-10 Non-Context-Free Langauges Closure Properties of Context-Free Languages. David Galles Automata Theory CS411-2015F-10 Non-Context-Free Langauges Closure Properties of Context-Free Languages David Galles Department of Computer Science University of San Francisco 10-0: Fun with CFGs Create

More information

Compositions of Bottom-Up Tree Series Transformations

Compositions of Bottom-Up Tree Series Transformations Compositions of Bottom-Up Tree Series Transformations Andreas Maletti a Technische Universität Dresden Fakultät Informatik D 01062 Dresden, Germany maletti@tcs.inf.tu-dresden.de May 27, 2005 1. Motivation

More information

CS460/626 : Natural Language

CS460/626 : Natural Language CS460/626 : Natural Language Processing/Speech, NLP and the Web (Lecture 27 SMT Assignment; HMM recap; Probabilistic Parsing cntd) Pushpak Bhattacharyya CSE Dept., IIT Bombay 17 th March, 2011 CMU Pronunciation

More information

Computational Linguistics. Acknowledgements. Phrase-Structure Trees. Dependency-based Parsing

Computational Linguistics. Acknowledgements. Phrase-Structure Trees. Dependency-based Parsing Computational Linguistics Dependency-based Parsing Dietrich Klakow & Stefan Thater FR 4.7 Allgemeine Linguistik (Computerlinguistik) Universität des Saarlandes Summer 2013 Acknowledgements These slides

More information

Introduction to Automata

Introduction to Automata Introduction to Automata Seungjin Choi Department of Computer Science and Engineering Pohang University of Science and Technology 77 Cheongam-ro, Nam-gu, Pohang 37673, Korea seungjin@postech.ac.kr 1 /

More information

Linking Theorems for Tree Transducers

Linking Theorems for Tree Transducers Linking Theorems for Tree Transducers Andreas Maletti maletti@ims.uni-stuttgart.de peyer October 1, 2015 Andreas Maletti Linking Theorems for MBOT Theorietag 2015 1 / 32 tatistical Machine Translation

More information

COMP/MATH 300 Topics for Spring 2017 June 5, Review and Regular Languages

COMP/MATH 300 Topics for Spring 2017 June 5, Review and Regular Languages COMP/MATH 300 Topics for Spring 2017 June 5, 2017 Review and Regular Languages Exam I I. Introductory and review information from Chapter 0 II. Problems and Languages A. Computable problems can be expressed

More information

Syntax-based Statistical Machine Translation

Syntax-based Statistical Machine Translation Syntax-based Statistical Machine Translation Philip Williams and Philipp Koehn 29 October 2014 Part I Part II - Introduction - Rule Extraction Part III - Decoding Part IV - Extensions Syntax-based Statistical

More information

6.891: Lecture 24 (December 8th, 2003) Kernel Methods

6.891: Lecture 24 (December 8th, 2003) Kernel Methods 6.891: Lecture 24 (December 8th, 2003) Kernel Methods Overview ffl Recap: global linear models ffl New representations from old representations ffl computational trick ffl Kernels for NLP structures ffl

More information

16. Binary Search Trees

16. Binary Search Trees Dictionary imlementation 16. Binary Search Trees [Ottman/Widmayer, Ka..1, Cormen et al, Ka. 12.1-12.] Hashing: imlementation of dictionaries with exected very fast access times. Disadvantages of hashing:

More information

CS 545 Lecture XVI: Parsing

CS 545 Lecture XVI: Parsing CS 545 Lecture XVI: Parsing brownies_choco81@yahoo.com brownies_choco81@yahoo.com Benjamin Snyder Parsing Given a grammar G and a sentence x = (x1, x2,..., xn), find the best parse tree. We re not going

More information

Bimorphism Machine Translation

Bimorphism Machine Translation Bimorphism Machine Translation Von der Fakultät für Mathematik und Informatik der Universität Leipzig angenommene D I S S E R T A T I O N zur Erlangung des akademischen Grades DOCTOR PHILOSOPHIAE (Dr.

More information

Bringing machine learning & compositional semantics together: central concepts

Bringing machine learning & compositional semantics together: central concepts Bringing machine learning & compositional semantics together: central concepts https://githubcom/cgpotts/annualreview-complearning Chris Potts Stanford Linguistics CS 244U: Natural language understanding

More information

Computability Theory

Computability Theory CS:4330 Theory of Computation Spring 2018 Computability Theory Decidable Problems of CFLs and beyond Haniel Barbosa Readings for this lecture Chapter 4 of [Sipser 1996], 3rd edition. Section 4.1. Decidable

More information

Roger Levy Probabilistic Models in the Study of Language draft, October 2,

Roger Levy Probabilistic Models in the Study of Language draft, October 2, Roger Levy Probabilistic Models in the Study of Language draft, October 2, 2012 224 Chapter 10 Probabilistic Grammars 10.1 Outline HMMs PCFGs ptsgs and ptags Highlight: Zuidema et al., 2008, CogSci; Cohn

More information

Model checking, verification of CTL. One must verify or expel... doubts, and convert them into the certainty of YES [Thomas Carlyle]

Model checking, verification of CTL. One must verify or expel... doubts, and convert them into the certainty of YES [Thomas Carlyle] Chater 5 Model checking, verification of CTL One must verify or exel... doubts, and convert them into the certainty of YES or NO. [Thomas Carlyle] 5. The verification setting Page 66 We introduce linear

More information

Formal Modeling in Cognitive Science Lecture 29: Noisy Channel Model and Applications;

Formal Modeling in Cognitive Science Lecture 29: Noisy Channel Model and Applications; Formal Modeling in Cognitive Science Lecture 9: and ; ; Frank Keller School of Informatics University of Edinburgh keller@inf.ed.ac.uk Proerties of 3 March, 6 Frank Keller Formal Modeling in Cognitive

More information

Lecture 24: Randomized Complexity, Course Summary

Lecture 24: Randomized Complexity, Course Summary 6.045 Lecture 24: Randomized Complexity, Course Summary 1 1/4 1/16 1/4 1/4 1/32 1/16 1/32 Probabilistic TMs 1/16 A probabilistic TM M is a nondeterministic TM where: Each nondeterministic step is called

More information

The Power of Weighted Regularity-Preserving Multi Bottom-up Tree Transducers

The Power of Weighted Regularity-Preserving Multi Bottom-up Tree Transducers International Journal of Foundations of Computer cience c World cientific Publishing Company The Power of Weighted Regularity-Preserving Multi Bottom-up Tree Transducers ANDREA MALETTI Universität Leipzig,

More information

16. Binary Search Trees

16. Binary Search Trees Dictionary imlementation 16. Binary Search Trees [Ottman/Widmayer, Ka..1, Cormen et al, Ka. 1.1-1.] Hashing: imlementation of dictionaries with exected very fast access times. Disadvantages of hashing:

More information

Pure and O-Substitution

Pure and O-Substitution International Journal of Foundations of Computer Science c World Scientific Publishing Company Pure and O-Substitution Andreas Maletti Department of Computer Science, Technische Universität Dresden 01062

More information

Part I - Introduction Part II - Rule Extraction Part III - Decoding Part IV - Extensions

Part I - Introduction Part II - Rule Extraction Part III - Decoding Part IV - Extensions Syntax-based Statistical Machine Translation Philip Williams and Philipp Koehn 29 October 2014 Part I - Introduction Part II - Rule Extraction Part III - Decoding Part IV - Extensions Syntax-based Statistical

More information

A Probabilistic Forest-to-String Model for Language Generation from Typed Lambda Calculus Expressions

A Probabilistic Forest-to-String Model for Language Generation from Typed Lambda Calculus Expressions A Probabilistic Forest-to-String Model for Language Generation from Typed Lambda Calculus Expressions Wei Lu and Hwee Tou Ng National University of Singapore 1/26 The Task (Logical Form) λx 0.state(x 0

More information

Decidability (intro.)

Decidability (intro.) CHAPTER 4 Decidability Contents Decidable Languages decidable problems concerning regular languages decidable problems concerning context-free languages The Halting Problem The diagonalization method The

More information

SYLLABUS. Introduction to Finite Automata, Central Concepts of Automata Theory. CHAPTER - 3 : REGULAR EXPRESSIONS AND LANGUAGES

SYLLABUS. Introduction to Finite Automata, Central Concepts of Automata Theory. CHAPTER - 3 : REGULAR EXPRESSIONS AND LANGUAGES Contents i SYLLABUS UNIT - I CHAPTER - 1 : AUT UTOMA OMATA Introduction to Finite Automata, Central Concepts of Automata Theory. CHAPTER - 2 : FINITE AUT UTOMA OMATA An Informal Picture of Finite Automata,

More information

Natural Language Processing CS Lecture 06. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Natural Language Processing CS Lecture 06. Razvan C. Bunescu School of Electrical Engineering and Computer Science Natural Language Processing CS 6840 Lecture 06 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Statistical Parsing Define a probabilistic model of syntax P(T S):

More information

An introduction to forest-regular languages

An introduction to forest-regular languages An introduction to forest-regular languages Mika Raento Basic Research Unit, Helsinki Institute for Information Technology Deartment of Comuter Science, University of Helsinki Mika.Raento@cs.Helsinki.FI

More information

Pretest (Optional) Use as an additional pacing tool to guide instruction. August 21

Pretest (Optional) Use as an additional pacing tool to guide instruction. August 21 Trimester 1 Pretest (Otional) Use as an additional acing tool to guide instruction. August 21 Beyond the Basic Facts In Trimester 1, Grade 7 focus on multilication. Daily Unit 1: The Number System Part

More information

Statistical Methods for NLP

Statistical Methods for NLP Statistical Methods for NLP Stochastic Grammars Joakim Nivre Uppsala University Department of Linguistics and Philology joakim.nivre@lingfil.uu.se Statistical Methods for NLP 1(22) Structured Classification

More information

Peter Wood. Department of Computer Science and Information Systems Birkbeck, University of London Automata and Formal Languages

Peter Wood. Department of Computer Science and Information Systems Birkbeck, University of London Automata and Formal Languages and and Department of Computer Science and Information Systems Birkbeck, University of London ptw@dcs.bbk.ac.uk Outline and Doing and analysing problems/languages computability/solvability/decidability

More information

Approximating min-max k-clustering

Approximating min-max k-clustering Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost

More information

RELATING TREE SERIES TRANSDUCERS AND WEIGHTED TREE AUTOMATA

RELATING TREE SERIES TRANSDUCERS AND WEIGHTED TREE AUTOMATA International Journal of Foundations of Computer Science Vol. 16, No. 4 (2005) 723 741 c World Scientific Publishing Company RELATING TREE SERIES TRANSDUCERS AND WEIGHTED TREE AUTOMATA ANDREAS MALETTI

More information

Decidable and undecidable languages

Decidable and undecidable languages The Chinese University of Hong Kong Fall 2011 CSCI 3130: Formal languages and automata theory Decidable and undecidable languages Andrej Bogdanov http://www.cse.cuhk.edu.hk/~andrejb/csc3130 Problems about

More information

CSE 311 Lecture 02: Logic, Equivalence, and Circuits. Emina Torlak and Kevin Zatloukal

CSE 311 Lecture 02: Logic, Equivalence, and Circuits. Emina Torlak and Kevin Zatloukal CSE 311 Lecture 02: Logic, Equivalence, and Circuits Emina Torlak and Kevin Zatloukal 1 Toics Proositional logic A brief review of Lecture 01. Classifying comound roositions Converse, contraositive, and

More information

10/17/04. Today s Main Points

10/17/04. Today s Main Points Part-of-speech Tagging & Hidden Markov Model Intro Lecture #10 Introduction to Natural Language Processing CMPSCI 585, Fall 2004 University of Massachusetts Amherst Andrew McCallum Today s Main Points

More information

The SUBTLE NL Parsing Pipeline: A Complete Parser for English Mitch Marcus University of Pennsylvania

The SUBTLE NL Parsing Pipeline: A Complete Parser for English Mitch Marcus University of Pennsylvania The SUBTLE NL Parsing Pipeline: A Complete Parser for English Mitch Marcus University of Pennsylvania 1 PICTURE OF ANALYSIS PIPELINE Tokenize Maximum Entropy POS tagger MXPOST Ratnaparkhi Core Parser Collins

More information

Finite-state Machines: Theory and Applications

Finite-state Machines: Theory and Applications Finite-state Machines: Theory and Applications Unweighted Finite-state Automata Thomas Hanneforth Institut für Linguistik Universität Potsdam December 10, 2008 Thomas Hanneforth (Universität Potsdam) Finite-state

More information

Named Entity Recognition using Maximum Entropy Model SEEM5680

Named Entity Recognition using Maximum Entropy Model SEEM5680 Named Entity Recognition using Maximum Entroy Model SEEM5680 Named Entity Recognition System Named Entity Recognition (NER): Identifying certain hrases/word sequences in a free text. Generally it involves

More information

Intro to Theory of Computation

Intro to Theory of Computation Intro to Theory of Computation LECTURE 7 Last time: Proving a language is not regular Pushdown automata (PDAs) Today: Context-free grammars (CFG) Equivalence of CFGs and PDAs Sofya Raskhodnikova 1/31/2016

More information

Computational Models - Lecture 4 1

Computational Models - Lecture 4 1 Computational Models - Lecture 4 1 Handout Mode Iftach Haitner and Yishay Mansour. Tel Aviv University. April 3/8, 2013 1 Based on frames by Benny Chor, Tel Aviv University, modifying frames by Maurice

More information

Synchronous Grammars

Synchronous Grammars ynchronous Grammars ynchronous grammars are a way of simultaneously generating pairs of recursively related strings ynchronous grammar w wʹ ynchronous grammars were originally invented for programming

More information

} Some languages are Turing-decidable A Turing Machine will halt on all inputs (either accepting or rejecting). No infinite loops.

} Some languages are Turing-decidable A Turing Machine will halt on all inputs (either accepting or rejecting). No infinite loops. and their languages } Some languages are Turing-decidable A Turing Machine will halt on all inputs (either accepting or rejecting). No infinite loops. } Some languages are Turing-recognizable, but not

More information

An Overview of Witt Vectors

An Overview of Witt Vectors An Overview of Witt Vectors Daniel Finkel December 7, 2007 Abstract This aer offers a brief overview of the basics of Witt vectors. As an alication, we summarize work of Bartolo and Falcone to rove that

More information

CISC4090: Theory of Computation

CISC4090: Theory of Computation CISC4090: Theory of Computation Chapter 2 Context-Free Languages Courtesy of Prof. Arthur G. Werschulz Fordham University Department of Computer and Information Sciences Spring, 2014 Overview In Chapter

More information

S NP VP 0.9 S VP 0.1 VP V NP 0.5 VP V 0.1 VP V PP 0.1 NP NP NP 0.1 NP NP PP 0.2 NP N 0.7 PP P NP 1.0 VP NP PP 1.0. N people 0.

S NP VP 0.9 S VP 0.1 VP V NP 0.5 VP V 0.1 VP V PP 0.1 NP NP NP 0.1 NP NP PP 0.2 NP N 0.7 PP P NP 1.0 VP  NP PP 1.0. N people 0. /6/7 CS 6/CS: Natural Language Processing Instructor: Prof. Lu Wang College of Computer and Information Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang The grammar: Binary, no epsilons,.9..5

More information

Examples from Elements of Theory of Computation. Abstract. Introduction

Examples from Elements of Theory of Computation. Abstract. Introduction Examles from Elements of Theory of Comutation Mostafa Ghandehari Samee Ullah Khan Deartment of Comuter Science and Engineering University of Texas at Arlington, TX-7609, USA Tel: +(87)7-5688, Fax: +(87)7-784

More information

Parsing with Context-Free Grammars

Parsing with Context-Free Grammars Parsing with Context-Free Grammars Berlin Chen 2005 References: 1. Natural Language Understanding, chapter 3 (3.1~3.4, 3.6) 2. Speech and Language Processing, chapters 9, 10 NLP-Berlin Chen 1 Grammars

More information

CFLs and Regular Languages. CFLs and Regular Languages. CFLs and Regular Languages. Will show that all Regular Languages are CFLs. Union.

CFLs and Regular Languages. CFLs and Regular Languages. CFLs and Regular Languages. Will show that all Regular Languages are CFLs. Union. We can show that every RL is also a CFL Since a regular grammar is certainly context free. We can also show by only using Regular Expressions and Context Free Grammars That is what we will do in this half.

More information

Unit 2: Tree Models. CS 562: Empirical Methods in Natural Language Processing. Lectures 19-23: Context-Free Grammars and Parsing

Unit 2: Tree Models. CS 562: Empirical Methods in Natural Language Processing. Lectures 19-23: Context-Free Grammars and Parsing CS 562: Empirical Methods in Natural Language Processing Unit 2: Tree Models Lectures 19-23: Context-Free Grammars and Parsing Oct-Nov 2009 Liang Huang (lhuang@isi.edu) Big Picture we have already covered...

More information

Pumping Lemma for CFLs

Pumping Lemma for CFLs Pumping Lemma for CFLs v y s Here we go again! Intuition: If L is CF, then some CFG G produces strings in L If some string in L is very long, it will have a very tall parse tree If a parse tree is taller

More information

Grammar formalisms Tree Adjoining Grammar: Formal Properties, Parsing. Part I. Formal Properties of TAG. Outline: Formal Properties of TAG

Grammar formalisms Tree Adjoining Grammar: Formal Properties, Parsing. Part I. Formal Properties of TAG. Outline: Formal Properties of TAG Grammar formalisms Tree Adjoining Grammar: Formal Properties, Parsing Laura Kallmeyer, Timm Lichte, Wolfgang Maier Universität Tübingen Part I Formal Properties of TAG 16.05.2007 und 21.05.2007 TAG Parsing

More information

FORMAL LANGUAGES, AUTOMATA AND COMPUTATION

FORMAL LANGUAGES, AUTOMATA AND COMPUTATION FORMAL LANGUAGES, AUTOMATA AND COMPUTATION DECIDABILITY ( LECTURE 15) SLIDES FOR 15-453 SPRING 2011 1 / 34 TURING MACHINES-SYNOPSIS The most general model of computation Computations of a TM are described

More information