Movement-Generalized Minimalist Grammars

Similar documents
An Additional Observation on Strict Derivational Minimalism

Locality and the Complexity of Minimalist Derivation Tree Languages

Models of Adjunction in Minimalist Grammars

Sharpening the empirical claims of generative syntax through formalization

Towards a Factorization of String-Based Phonology

Person Case Constraints and Feature Complexity in Syntax

Reference-Set Computation = Minimalism + Transformational Rules?

Model-Theory of Property Grammars with Features

Well-Nestedness Properly Subsumes Strict Derivational Minimalism

Insertion Minimalist Grammars: Eliminating redundancies between merge and move

Towards an Algebraic Morphosyntax

On the Logic of LGB Type Structures. Part III: Minimalism

Transforming Linear Context Free Rewriting Systems into Minimalist Grammars

Some Remarks on Locality Conditions and Minimalist Grammars

On rigid NL Lambek grammars inference from generalized functor-argument data

A Polynomial Time Algorithm for Parsing with the Bounded Order Lambek Calculus

Tree Adjoining Grammars

Automata theory. An algorithmic approach. Lecture Notes. Javier Esparza

Sharpening the empirical claims of generative syntax through formalization

Derivational order and ACGs

Parsing with Context-Free Grammars

Exam: Synchronous Grammars

Parsing. Context-Free Grammars (CFG) Laura Kallmeyer. Winter 2017/18. Heinrich-Heine-Universität Düsseldorf 1 / 26

Compositionality and Syntactic Structure Marcus Kracht Department of Linguistics UCLA 3125 Campbell Hall 405 Hilgard Avenue Los Angeles, CA 90095

Parsing Beyond Context-Free Grammars: Tree Adjoining Grammars

Well-Nestedness Properly Subsumes Strict Derivational Minimalism

Finite and Algorithmic Model Theory II: Automata-Based Methods

Automata, Complexity, and Optimality Theory

FLAC Context-Free Grammars

FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY

A Computational Approach to Minimalism

Finite Presentations of Pregroups and the Identity Problem

Foundations of Informatics: a Bridging Course

COM364 Automata Theory Lecture Note 2 - Nondeterminism

Unifying Adjunct Islands and Freezing Effects in Minimalist Grammars

Computational complexity of commutative grammars

Context Free Languages. Automata Theory and Formal Grammars: Lecture 6. Languages That Are Not Regular. Non-Regular Languages

A Tree Transducer Model of Reference-Set Computation

The word problem in Z 2 and formal language theory

Chapter 0 Introduction. Fourth Academic Year/ Elective Course Electrical Engineering Department College of Engineering University of Salahaddin

What You Must Remember When Processing Data Words

cse303 ELEMENTS OF THE THEORY OF COMPUTATION Professor Anita Wasilewska

From Monadic Second-Order Definable String Transformations to Transducers

Context Free Grammars

Multiple Context-free Grammars

Parikh s theorem. Håkan Lindqvist

Multiple Context Free Grammars

An introduction to mildly context sensitive grammar formalisms. Combinatory Categorial Grammar

Mathematical Preliminaries. Sipser pages 1-28

Tree Automata and Rewriting

Surface Reasoning Lecture 2: Logic and Grammar

Theory of Computation

ON MINIMAL CONTEXT-FREE INSERTION-DELETION SYSTEMS

Formal Languages, Automata and Models of Computation

Weak Cost Monadic Logic over Infinite Trees

Weak vs. Strong Finite Context and Kernel Properties

Theoretical Computer Science

COMP/MATH 300 Topics for Spring 2017 June 5, Review and Regular Languages

Sharpening the empirical claims of generative syntax through formalization

Introduction to Kleene Algebras

Computational Models - Lecture 4

CMSC 330: Organization of Programming Languages. Pushdown Automata Parsing

ACLT: Algebra, Categories, Logic in Topology - Grothendieck's generalized topological spaces (toposes)

Finite-state Machines: Theory and Applications

Formal Epistemology: Lecture Notes. Horacio Arló-Costa Carnegie Mellon University

A* Search. 1 Dijkstra Shortest Path

Opleiding Informatica

This lecture covers Chapter 7 of HMU: Properties of CFLs

SOME NOTES ON LOGIC AND QUANTIFIER RAISING

What is this course about?

Pattern Logics and Auxiliary Relations

All psychiatrists are doctors All doctors are college graduates All psychiatrists are college graduates

X-bar theory. X-bar :

Counter Automata and Classical Logics for Data Words

Ling 5801: Lecture Notes 7 From Programs to Context-Free Grammars

Closure Properties of Regular Languages. Union, Intersection, Difference, Concatenation, Kleene Closure, Reversal, Homomorphism, Inverse Homomorphism

Sri vidya college of engineering and technology

Theory of Computation

Chapter 4: Computation tree logic

Propositional and Predicate Logic - IV

Computational Models - Lecture 5 1

FORMAL LANGUAGES, AUTOMATA AND COMPUTABILITY

Displacement Logic for Grammar

CSE 311: Foundations of Computing I Autumn 2014 Practice Final: Section X. Closed book, closed notes, no cell phones, no calculators.

BASIC MATHEMATICAL TECHNIQUES

Lecture 17: Language Recognition

COMP-330 Theory of Computation. Fall Prof. Claude Crépeau. Lec. 10 : Context-Free Grammars

About the relationship between formal logic and complexity classes

Sharpening the empirical claims of generative syntax through formalization

Chapter Five: Nondeterministic Finite Automata

Automata, Logic and Games: Theory and Application

Introduction to Theory of Computing

Epsilon-Free Grammars and Lexicalized Grammars that Generate the Class of the Mildly Context-Sensitive Languages

A spatial model of derivations

First-Order Logic. 1 Syntax. Domain of Discourse. FO Vocabulary. Terms

Abstract model theory for extensions of modal logic

Complete Partial Orders, PCF, and Control

Languages. Languages. An Example Grammar. Grammars. Suppose we have an alphabet V. Then we can write:

Modular Grammar Design with Typed Parametric Principles

Bringing machine learning & compositional semantics together: central concepts

Transcription:

Movet-Generalized Minimalist Grammars Thomas Graf tgraf@ucla.edu tgraf.bol.ucla.edu University of California, Los Angeles LACL 2012 July 2, 2012

Outline 1 Monadic Second-Order Logic (MSO) Talking About Graphs MSO Graph Transductions 2 Minimalist Grammars (MGs) 3 Model-Theoretic Perspective on MGs 4 Generalizing Move

MSO as a Graph Description Language MSO = extension of FO with quantification over sets A set L of graphs is MSO-definable iff there is an MSO-formula φ such that L is the set of models of φ. A tree language is regular iff it is MSO-definable. Common Predicates Symbol + σ Relation immediate dominance proper dominance equivalence label σ

MSO Graph Transductions Basic Idea Interpret output graph as (sub)structure of the input graph Definition (MSO graph transduction) A (non-copying) MSO graph transduction τ consists of an MSO formula φ defining the domain of the input graph, an MSO formula ψ defining the domain of the output graph, a family of MSO formulas defining the relations in the output graph in terms of the relations in the input graph. A graph transduction from trees to trees is called an MSO tree transduction (MSO-TT).

Examples Two Easy Transductions a b c τ a b c Edge relation: Edge relation: τ : x y x + y x y τ a b c d b d Edge relation: Edge relation: τ : x y z[x z y z] d(x) a(x) c(x) b(x) b(x)

Examples Two Easy Transductions a b c τ a b c Edge relation: Edge relation: τ : x y x + y x y τ a b c d b d Edge relation: Edge relation: τ : x y z[x z y z] d(x) a(x) c(x) b(x) b(x)

MSO and MCFLs Regular tree languages yield CFLs. However, the class of MCFLs is identical to the string yield of the image of the regular tree languages under MSO tree transductions (cf. Mönnich 2006). REG CFL string yield MSO-TT string yield MCFL MCFL str(mso-tt(reg))

Minimalist Grammars Originally formulated in Stabler (1997) Highly lexicalized formalism inspired by Minimalist syntax (Chomsky 1995). Weakly equivalent to MCFGs (Harkema 2001; Michaelis 1998, 2001) More succinct than MCFGs (Stabler 2012) Derivation trees provide compact finite-state representation (Michaelis 2001; Kobele et al. 2007) Attractive closure properties (Graf 2011; Kobele 2011) Very extensible while preserving weak generative capacity

The Atoms of a Minimalist Grammar Minimalist Grammars (Stabler 1997) An MG is a 5-tuple G := Σ, Feat, F, Lex, Op, where Σ is an alphabet, Feat is a non-empty finite set of category features f, selector features = f, movet licensee features f, movet licensor features +f, F Feat is a set of final category features, the lexicon Lex is a finite subset of Σ Feat +, Op := {merge, move} is the set of structure-building operations. For every MGs it suffices to specify Lex and F.

Operation 1: < = c γ c δ γ δ > = c γ c δ δ γ

Operation 2: Move > +f γ Move δ γ f δ Shortest Move Constraint (SMC) o two lexical items may have the same licensee feature as their first unchecked feature. f γ f δ

Operation 2: Move > +f γ Move δ γ f δ Shortest Move Constraint (SMC) o two lexical items may have the same licensee feature as their first unchecked feature. f γ f δ

A Toy Example (Without Recursion) MG with F = {C} :: the :: = D which :: = D wh like :: = D = D V ε :: = V C do :: = V + wh C

A Toy Example (Without Recursion) MG with F = {C} :: the :: = D which :: = D wh like :: = D = D V ε :: = V C do :: = V + wh C the = D like = D = D V which = D wh

A Toy Example (Without Recursion) MG with F = {C} :: the :: = D which :: = D wh like :: = D = D V ε :: = V C do :: = V + wh C < the = D like = D = D V which = D wh

A Toy Example (Without Recursion) MG with F = {C} :: the :: = D which :: = D wh like :: = D = D V ε :: = V C do :: = V + wh C < < the = D like = D = D V which = D wh

A Toy Example (Without Recursion) MG with F = {C} :: the :: = D which :: = D wh like :: = D = D V ε :: = V C do :: = V + wh C < < < the = D like = D = D V which = D wh

A Toy Example (Without Recursion) MG with F = {C} :: the :: = D which :: = D wh like :: = D = D V ε :: = V C do :: = V + wh C > < < < the = D like = D = D V which = D wh

A Toy Example (Without Recursion) MG with F = {C} :: the :: = D which :: = D wh like :: = D = D V ε :: = V C do :: = V + wh C > do = V +wh C < < < the = D like = D = D V which = D wh

A Toy Example (Without Recursion) MG with F = {C} :: the :: = D which :: = D wh like :: = D = D V ε :: = V C do :: = V + wh C < > do = V +wh C < < < the = D like = D = D V which = D wh

A Toy Example (Without Recursion) MG with F = {C} :: the :: = D which :: = D wh like :: = D = D V ε :: = V C do :: = V + wh C < > which = D wh do = V +wh C < < > < the = D like = D = D V

A Model-Theoretic View of MGs Every MG is uniquely specified by its set of derivations. Derivations are trees can be defined model-theoretically Each node in a derivation belongs to some lexical item lexicon is a finite set of treelets called slices A derivation tree is a combination of slices that respects the constraints of the feature calculus. A given MG s derivation tree language is the largest set of such well-formed slice combinations. Crucially, the constraints of the feature calculus can be expressed in tree-geometric terms.

Derivation Trees < > Move do = V +wh C which = D wh do = V +wh C the = D < < > like = D = D V < the = D like = D = D V which = D wh

Slices Intuitively, slices are the derivation tree equivalent of phrasal projection: Each slice marks the subpart of the derivation that a lexical item has control over by virtue of its selector and licensor features. Slices Given a derivation tree t and lexical item l occurring in t, slice(l) is defined as follows: l slice(l), if node n of t immediately dominates a node s slice(l), then n slice(l) iff the operation denoted by the label of n erased a selector or licensor feature of l. The unique n slice(l) that isn t (properly) dominated by any n slice(l) is called the slice root of l.

Example of Slices Move do = V + wh C the = D like = D = D V what D wh

Example of Slices the = D do = V + wh C = = V +wh Move = D like = D = D V = D what D wh

Regulating If n is a node associated to selector feature = f, it immediately dominates the slice root of some lexical item with category feature f.

Regulating Move: Finding Occurrences A Move node that checks a licensee feature of lexical item l is called an occurrence of l. Definition (Occurrences) For every combination t of slices and lexical item l in t with licensee features f 1 f n, the occurrences of l in t are: occ 0 (l) is the mother of the slice root of l in t (if it exists). occ i (l) is the unique node m of t labeled Move such that m is associated to +f i, and m properly dominates occ i 1, and there is no node n in t such that n is associated to +f i, and n properly dominates occ i 1, and n is properly dominated by m.

Exercise: Find the Occurrences! Example Move +h Move +g d Move Move +f +g c f g a b g h

Regulating Move: Resource Sensitivity & SMC Move For every lexical item l with licensee features f 1 f n, there exist nodes m 1,..., m n such that m i (and no other node) is the ith occurrence of l. SMC For every Move node m there is exactly one lexical item l such that m is an occurrence of l.

Why the SMC Works Example 1 Move do = V +wh C which = D wh like = D = D V which = D wh

Why the SMC Works [cont.] Example 2 Move +h Move +g d Move Move +g +f c f g a b g h

Mapping to Derived Trees Derivation trees can be mapped to derived trees by a multi bottom-up transducer (Kobele et al. 2007) However, an MSO tree transduction is more intuitive. The transduction adds a dominance branch from an LI s highest occurrence to the root of its slice. The original dominance branch is removed. Linear order and interior node labels also need to be changed. See the paper for details. The Transduction Formula x y x y l[f-occ(x, l) sliceroot(y, l)] f-occ(x, l) occurrence(x, l) z[z + x occurrence(z, l)]

Example Transduction Example Move do = V +wh C the = D like = D = D V which = D wh

Example Transduction Example Move do = V +wh C the = D like = D = D V which = D wh

Example Transduction Example Move do = V +wh C the = D like = D = D V which = D wh

The Basic Idea We now have a FO-definable theory of Minimalist derivations and a simple MSO transduction to derived trees. But there is wiggle room: MGs generate MCFLs, and MCFL = str(mso-tt(reg)). So we can change certain parameters in the definitions, as long as MSO-definability is preserved parametric template to create new variants of Move

Parameters of Move The variants of Move discussed in the syntactic literature vary along several disions: Possible landing sites: raising, lowering, sidewards Size of moved constituent: phrase, head, pied-piped phrase Visibility: overt, covert Directionality: left, right Loci of Parameters Parameter landing site size visibility directionality Locus definition of occurrence transduction transduction transduction

Replacing Proper Dominance Reminder: Definition of Occurrence occ i (l) is the unique node m of t labeled Move such that m is associated to +f i, and m properly dominates occ i 1, and there is no node n in t such that n is associated to +f i, and n properly dominates occ i 1, and n is properly dominated by m. Alternative Instantiations Relation inverse proper dominance proper dominance across at most 1 slice slice containt slice containt restricted to wh-phrases Movet Type lowering antilocal raising sidewards wh-clustering

Size of Moved Constituent Reminder: The Transduction Formula x y x y l[f-occ(x, l) sliceroot(y, l)] Alternative Instantiations Formula y l l [sliceroot(y, l ) selects(l, l)] Movet Type head movet pied-piping

Further Observations With respect to directionality, all movet types can be replaced by a combination of raising and lowering (but new lexical items might be needed to furnish extra landing sites). In most cases, strong generative capacity is increased (e.g. lowering makes it possible to generate any given TAL with X -like trees). Many MSO-definable relations do not yield useful movet types (e.g. all reflexive and all symmetric relations).

Conclusion What was Accomplished? MSO definition of MGs as derivation tree languages with a mapping to derived trees. Definitions provide a parameterized template that can easily be tuned to create new movet types without increasing weak generative capacity. Why Should You Care? Combined with the results of Graf (2011) and Kobele (2011), we now have a general system for adding constraints and movet types to MGs extremely flexible framework (just about anything from the literature can be incorporated) ew movet types make it possible to emulate other formalisms resource sharing, new perspectives

References I Chomsky, oam. 1995. The minimalist program. Cambridge, Mass.: MIT Press. Graf, Thomas. 2011. Closure properties of minimalist derivation tree languages. In LACL 2011, ed. Sylvain Pogodalla and Jean-Philippe Prost, volume 6736 of Lecture otes in Artificial Intelligence, 96 111. Harkema, Henk. 2001. A characterization of minimalist languages. In Logical aspects of computational linguistics (lacl 01), ed. Philippe de Groote, Glyn Morrill, and Christian Retoré, volume 2099 of Lecture otes in Artificial Intelligence, 193 211. Berlin: Springer. Kobele, Gregory M. 2011. Minimalist tree languages are closed under intersection with recognizable tree languages. In LACL 2011, ed. Sylvain Pogodalla and Jean-Philippe Prost, volume 6736 of Lecture otes in Artificial Intelligence, 129 144. Kobele, Gregory M., Christian Retoré, and Sylvain Salvati. 2007. An automata-theoretic approach to minimalism. In Model Theoretic Syntax at 10, ed. James Rogers and Stephan Kepser, 71 80. Michaelis, Jens. 1998. Derivational minimalism is mildly context-sensitive. Lecture otes in Artificial Intelligence 2014:179 198. Michaelis, Jens. 2001. Transforming linear context-free rewriting systems into minimalist grammars. Lecture otes in Artificial Intelligence 2099:228 244. Mönnich, Uwe. 2006. Grammar morphisms. Ms. University of Tübingen.

References II Stabler, Edward P. 1997. Derivational minimalism. In Logical aspects of computational linguistics, ed. Christian Retoré, volume 1328 of Lecture otes in Computer Science, 68 95. Berlin: Springer. Stabler, Edward P. 2012. Bayesian, minimalist, incretal syntactic analysis. Ms., UCLA.