Learning Dependency-Based Compositional Semantics

Similar documents
Semantic Parsing with Combinatory Categorial Grammars

Driving Semantic Parsing from the World s Response

Cross-lingual Semantic Parsing

Introduction to Semantic Parsing with CCG

Outline. Learning. Overview Details Example Lexicon learning Supervision signals

Lecture 5: Semantic Parsing, CCGs, and Structured Classification

Sequence labeling. Taking collective a set of interrelated instances x 1,, x T and jointly labeling them

Parsing with Context-Free Grammars

The Infinite PCFG using Hierarchical Dirichlet Processes

, District of Columbia

JOINT PROBABILISTIC INFERENCE OF CAUSAL STRUCTURE

Hidden Markov Models. x 1 x 2 x 3 x N

Preview: Making a Mental Map of the Region

Natural Language Processing. Slides from Andreas Vlachos, Chris Manning, Mihai Surdeanu

Sequence Modelling with Features: Linear-Chain Conditional Random Fields. COMP-599 Oct 6, 2015

North American Geography. Lesson 2: My Country tis of Thee

Structured Prediction Models via the Matrix-Tree Theorem

Bringing machine learning & compositional semantics together: central concepts

Statistical Methods for NLP

Location Option Details. 1. Florida

Scale Drawings and Maps

A Probabilistic Forest-to-String Model for Language Generation from Typed Lambda Calculus Expressions

Part of Speech Tagging: Viterbi, Forward, Backward, Forward- Backward, Baum-Welch. COMP-599 Oct 1, 2015

To make a grammar probabilistic, we need to assign a probability to each context-free rewrite

Investigation 11.3 Weather Maps

Outline. Learning. Overview Details

CSCE 222 Discrete Structures for Computing. Propositional Logic. Dr. Hyunyoung Lee. !!!!!! Based on slides by Andreas Klappenecker

Probabilistic Graphical Models

A. Geography Students know the location of places, geographic features, and patterns of the environment.

Semantics and Pragmatics of NLP

Quasi-Second-Order Parsing for 1-Endpoint-Crossing, Pagenumber-2 Graphs

Natural Language Processing CS Lecture 06. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Expectation Maximization (EM)

Standard Indicator That s the Latitude! Students will use latitude and longitude to locate places in Indiana and other parts of the world.

California: Land and People Lesson 1: Locating California

Statistical Methods for NLP

State Geography Due: Tuesday, October 24, 2017

Learning Goals of CS245 Logic and Computation

Natural Language Processing : Probabilistic Context Free Grammars. Updated 5/09

Meteorology 110. Lab 1. Geography and Map Skills

The Noisy Channel Model and Markov Models

High School World History Cycle 2 Week 2 Lifework

LEARNING COMPOSITIONALITY

Lecture 15. Probabilistic Models on Graph

HI SUMMER WORK

Introduction to Semantics. The Formalization of Meaning 1

Lecture 5: Ecological distance metrics; Principal Coordinates Analysis. Univariate testing vs. community analysis

A proof theoretical account of polarity items and monotonic inference.

Probabilistic Reasoning. (Mostly using Bayesian Networks)

Expectation Maximization Algorithm

AI Programming CS S-09 Knowledge Representation

Linear Dynamical Systems

Algorithms for Syntax-Aware Statistical Machine Translation

Multivariate Statistics

Challenge 1: Learning About the Physical Geography of Canada and the United States

Integrated Science I: Locating earthquake epicenters. Name Per Date

Rank University AMJ AMR ASQ JAP OBHDP OS PPSYCH SMJ SUM 1 University of Pennsylvania (T) Michigan State University

CSE446: Clustering and EM Spring 2017

Lecture 5: Ecological distance metrics; Principal Coordinates Analysis. Univariate testing vs. community analysis

Logic and machine learning review. CS 540 Yingyu Liang

Generative Models for Sentences

Intelligent Systems (AI-2)

PAC Generalization Bounds for Co-training

Machine Learning for natural language processing

American Tour: Climate Objective To introduce contour maps as data displays.

The Semantics of Definite DPs 1. b. Argument Position: (i) [ A politician ] arrived from Washington. (ii) Joe likes [ the politician ].

Summary of Natural Hazard Statistics for 2008 in the United States

Pima Community College Students who Enrolled at Top 200 Ranked Universities

New Educators Campaign Weekly Report

Natural Language Processing

Advanced Topics in LP and FP

Summary of Terminal Master s Degree Programs in Philosophy

Language to Logical Form with Neural Attention

CS 331: Artificial Intelligence Propositional Logic I. Knowledge-based Agents

Knowledge-based Agents. CS 331: Artificial Intelligence Propositional Logic I. Knowledge-based Agents. Outline. Knowledge-based Agents

Guided Reading Activity

Jakarta International School 6 th Grade Formative Assessment Graphing and Statistics -Black

Generative and Discriminative Approaches to Graphical Models CMSC Topics in AI

Kathryn Robinson. Grades 3-5. From the Just Turn & Share Centers Series VOLUME 12

The exam is closed book, closed calculator, and closed notes except your one-page crib sheet.

12. Dezember

Predicate Calculus. CS 270 Math Foundations of Computer Science Jeremy Johnson

Movement-Generalized Minimalist Grammars

UNIT II REGULAR LANGUAGES

Tuning as Linear Regression

2005 Mortgage Broker Regulation Matrix

Club Convergence and Clustering of U.S. State-Level CO 2 Emissions

MINERALS THROUGH GEOGRAPHY

Intelligent Systems (AI-2)

Conditional Random Field

Predicate Calculus. Formal Methods in Verification of Computer Systems Jeremy Johnson

NLU: Semantic parsing

Logic and Modelling. Introduction to Predicate Logic. Jörg Endrullis. VU University Amsterdam

Outline. Parsing. Approximations Some tricks Learning agenda-based parsers

CPSC 540: Machine Learning

A gentle introduction to Hidden Markov Models

Propositional Logic: Logical Agents (Part I)

Graph-Based Lexicon Expansion with Sparsity-Inducing Penalties

Information theoretical approach for domain ontology exploration in large Earth observation image archives

Ling 130 Notes: Syntax and Semantics of Propositional Logic

Transcription:

Learning Dependency-Based Compositional Semantics Semantic Representations for Textual Inference Workshop Mar. 0, 0 Percy Liang Google/Stanford joint work with Michael Jordan and Dan Klein

Motivating Problem: Question Answering

Motivating Problem: Question Answering What is the largest city in California?

Motivating Problem: Question Answering What is the largest city in California? What is the largest city in a state bordering California?

Semantic Interpretation 4

Semantic Interpretation What is the largest city in a state bordering California? Phoenix 4

Semantic Interpretation What is the largest city in a state bordering California?? Phoenix 4

Semantic Interpretation What is the largest city in a state bordering California? argmax({c :city(c)^9s.state(s) ^ loc(c, s) ^ border(s,ca)}, population) Phoenix 4

Semantic Interpretation What is the largest city in a state bordering California? argmax({c :city(c) ^9s.state(s) ^ loc(c, s)^border(s,ca)}, population) Phoenix 4

Semantic Interpretation What is the largest city in a state bordering California? argmax({c :city(c) ^9s.state(s) ^ loc(c, s) ^ border(s,ca)}, population) Phoenix 4

Semantic Interpretation What is the largest city in a state bordering California? argmax({c : city(c) ^9s.state(s) ^ loc(c, s) ^ border(s,ca)}, population) Phoenix 4

Semantic Interpretation What is the largest city in a state bordering California? argmax({c : city(c) ^9s.state(s) ^ loc(c, s) ^ border(s,ca)}, population) computation Phoenix 4

Semantic Interpretation What is the largest city in a state bordering California?? computation Phoenix 4

Supervision for Semantic Interpretation 6

Supervision for Semantic Interpretation Detailed Supervision (current) What is the largest city in California? argmax({c : city(c) ^ loc(c,ca)}, population) 6

Supervision for Semantic Interpretation Detailed Supervision (current) What is the largest city in California? expert argmax({c : city(c) ^ loc(c,ca)}, population) 6

Supervision for Semantic Interpretation Detailed Supervision (current) -doesn tscaleup What is the largest city in California? expert argmax({c : city(c) ^ loc(c,ca)}, population) 6

Supervision for Semantic Interpretation Detailed Supervision (current) -doesn tscaleup What is the largest city in California? expert argmax({c : city(c) ^ loc(c,ca)}, population) Natural Supervision (new) What is the largest city in California? Los Angeles 6

Supervision for Semantic Interpretation Detailed Supervision (current) -doesn tscaleup What is the largest city in California? expert argmax({c : city(c) ^ loc(c,ca)}, population) Natural Supervision (new) What is the largest city in California? Los Angeles non-expert 6

Supervision for Semantic Interpretation Detailed Supervision (current) -doesn tscaleup What is the largest city in California? expert argmax({c : city(c) ^ loc(c,ca)}, population) Natural Supervision (new) -scalesup What is the largest city in California? non-expert Los Angeles 6

Supervision for Semantic Interpretation Detailed Supervision (current) -doesn tscaleup - representation-dependent What is the largest city in California? expert argmax({c : city(c) ^ loc(c,ca)}, population) Natural Supervision (new) -scalesup What is the largest city in California? non-expert Los Angeles 6

Supervision for Semantic Interpretation Detailed Supervision (current) -doesn tscaleup - representation-dependent What is the largest city in California? expert argmax({c : city(c) ^ loc(c,ca)}, population) Natural Supervision (new) -scalesup - representation-independent What is the largest city in California? Los Angeles non-expert 6

Outline Representation loc state border CA city major traverse river traverse AZ x Learning z w y Experiments 9

Considerations Computational: how to e ciently search exponential space? 0

Considerations Computational: how to e ciently search exponential space? What is the most populous city in California? Los Angeles 0

Considerations Computational: how to e ciently search exponential space? What is the most populous city in California? x.state(x) Los Angeles 0

Considerations Computational: how to e ciently search exponential space? What is the most populous city in California? x.city(x) Los Angeles 0

Considerations Computational: how to e ciently search exponential space? What is the most populous city in California? x.city(x) ^ loc(x, CA) Los Angeles 0

Considerations Computational: how to e ciently search exponential space? What is the most populous city in California? x.state(x) ^ border(x, CA) Los Angeles 0

Considerations Computational: how to e ciently search exponential space? What is the most populous city in California? population(ca) Los Angeles 0

Considerations Computational: how to e ciently search exponential space? What is the most populous city in California? argmax( x.city(x) ^ loc(x, CA), x.population(x)) Los Angeles 0

Considerations Computational: how to e ciently search exponential space? What is the most populous city in California? LF LF LF LF LF LF LF LF LF LF LF LF LF LF LF LF LF Los Angeles 0

Considerations Computational: how to e ciently search exponential space? What is the most populous city in California? LF LF LF LF LF LF LF LF LF LF LF LF LF LF LF LF LF Los Angeles Statistical: how to parametrize mapping from sentence to logical form? What is the most populous city in California? argmax( x.city(x) ^ loc(x, CA), x.population(x)) 0

Dependency-Based Compositional Semantics (DCS) What is the most populous city in California?

Dependency-Based Compositional Semantics (DCS) What is the most populous city in California? c argmax population city loc CA Los Angeles

Dependency-Based Compositional Semantics (DCS) What is the most populous city in California? c argmax population city loc CA Los Angeles Advantages of DCS: nice computational, statistical, linguistic properties

Where do the answers come from? What is the most populous city in California? c argmax population city loc CA Los Angeles

Where do the answers come from? What is the most populous city in California? c argmax population city loc CA Database Los Angeles

Database city San Francisco Chicago Boston loc Mount Shasta California San Francisco California Boston Massachusetts state Alabama Alaska Arizona border Washington Oregon Washington Idaho Oregon Washington 3

Basic DCS Trees DCS tree Database city loc CA 4

Basic DCS Trees DCS tree Constraints Database city loc CA ADCStreeencodesaconstraint satisfaction problem (CSP) 4

Basic DCS Trees DCS tree Constraints Database city loc c city city San Francisco Chicago Boston CA ADCStreeencodesaconstraint satisfaction problem (CSP) 4

DCS tree Basic DCS Trees Constraints Database city loc CA c city ` loc city San Francisco Chicago Boston loc Mount Shasta California San Francisco California Boston Massachusetts ADCStreeencodesaconstraint satisfaction problem (CSP) 4

DCS tree Basic DCS Trees Constraints Database city loc CA c city ` loc s CA city San Francisco Chicago Boston loc Mount Shasta California San Francisco California Boston Massachusetts CA California ADCStreeencodesaconstraint satisfaction problem (CSP) 4

DCS tree Basic DCS Trees Constraints Database city loc CA c city c = ` ` loc s CA city San Francisco Chicago Boston loc Mount Shasta California San Francisco California Boston Massachusetts CA California ADCStreeencodesaconstraint satisfaction problem (CSP) 4

DCS tree Basic DCS Trees Constraints Database city loc CA c city c = ` ` loc ` = s s CA city San Francisco Chicago Boston loc Mount Shasta California San Francisco California Boston Massachusetts CA California ADCStreeencodesaconstraint satisfaction problem (CSP) 4

DCS tree Basic DCS Trees Constraints Database city loc CA c city c = ` ` loc ` = s s CA city San Francisco Chicago Boston loc Mount Shasta California San Francisco California Boston Massachusetts CA California ADCStreeencodesaconstraint satisfaction problem (CSP) 4

DCS tree Basic DCS Trees Constraints Database city loc CA c city c = ` ` loc ` = s s CA city San Francisco Chicago Boston loc Mount Shasta California San Francisco California Boston Massachusetts CA California ADCStreeencodesaconstraint satisfaction problem (CSP) 4

DCS tree Basic DCS Trees Constraints Database city loc CA c city c = ` ` loc ` = s s CA city San Francisco Chicago Boston loc Mount Shasta California San Francisco California Boston Massachusetts CA California ADCStreeencodesaconstraint satisfaction problem (CSP) 4

DCS tree Basic DCS Trees Constraints Database city loc CA c city c = ` ` loc ` = s s CA city San Francisco Chicago Boston loc Mount Shasta California San Francisco California Boston Massachusetts CA California ADCStreeencodesaconstraint satisfaction problem (CSP) Computation: dynamic programming ) time = O(# nodes) 4

Properties of DCS Trees loc state border CA city traverse river major traverse AZ 5

Properties of DCS Trees loc state border CA city traverse river major traverse AZ Trees 5

Properties of DCS Trees loc state border CA city traverse river major traverse AZ Linguistics Trees syntactic locality 5

Properties of DCS Trees loc state border CA city traverse river major traverse AZ Linguistics Trees Computation syntactic locality e cient interpretation 5

Divergence between Syntactic and Semantic Scope most populous city in California 6

Divergence between Syntactic and Semantic Scope most populous city in California Syntax city populous in most California 6

Divergence between Syntactic and Semantic Scope most populous city in California Syntax Semantics city populous in argmax( x.city(x) ^ loc(x, CA), x.population(x)) most California 6

Divergence between Syntactic and Semantic Scope most populous city in California Syntax Semantics city populous in argmax( x.city(x) ^ loc(x, CA), x.population(x)) most California 6

Divergence between Syntactic and Semantic Scope most populous city in California Syntax Semantics city populous in argmax( x.city(x) ^ loc(x, CA), x.population(x)) most California Problem: syntactic scope is lower than semantic scope 6

Divergence between Syntactic and Semantic Scope most populous city in California Syntax Semantics city populous in argmax( x.city(x) ^ loc(x, CA), x.population(x)) most California Problem: syntactic scope is lower than semantic scope If DCS trees look like syntax, how do we get correct semantics? 6

Solution: Mark-Execute most populous city in California Superlatives x c argmax population city loc CA 7

Solution: Mark-Execute most populous city in California Superlatives x Mark at syntactic scope c argmax population city loc CA 7

most populous city in California Solution: Mark-Execute Superlatives Execute at semantic scope Mark at syntactic scope population c argmax x city loc CA 7

Alaska borders no states. Solution: Mark-Execute Negation Execute at semantic scope Mark at syntactic scope AK x border state q no 7

Some river traverses every city. Solution: Mark-Execute Quantification (narrow) Execute at semantic scope Mark at syntactic scope river q x traverse city q some every 7

Some river traverses every city. Solution: Mark-Execute Quantification (wide) Execute at semantic scope Mark at syntactic scope river q x traverse city q some every 7

Some river traverses every city. Solution: Mark-Execute Quantification (wide) Execute at semantic scope Mark at syntactic scope river q x traverse city q some every Analogy: Montague s quantifying in, Carpenter s scoping constructor 7

Outline Representation loc state border CA city major traverse river traverse AZ x Learning z w y Experiments 8

Graphical Model z capital CA database w 9

Graphical Model z capital CA database w y Sacramento 9

Graphical Model z database capital CA w y Sacramento Interpretation: p(y z, w) (deterministic) 9

Graphical Model x capital of California? z database capital CA w y Sacramento Interpretation: p(y z, w) (deterministic) 9

Graphical Model x capital of California? parameters z database capital CA w y Sacramento Interpretation: p(y z, w) (deterministic) 9

Graphical Model parameters database x z capital of California? capital CA w y Sacramento Semantic Parsing: p(z x, ) (probabilistic) Interpretation: p(y z, w) (deterministic) 9

Plan x capital of California? parameters z capital What s possible? z Z(x) What s probable? p(z x, ) CA Learning from (x, y) data database w y Sacramento 0

Words to Predicates (Lexical Semantics) What is the most populous city in CA?

Words to Predicates (Lexical Semantics) CA What is the most populous city in CA? Lexical Triggers:. String match CA ) CA

Words to Predicates (Lexical Semantics) argmax What is the most populous city in CA? CA Lexical Triggers:. String match CA ) CA. Function words (0 words) most ) argmax

Words to Predicates (Lexical Semantics) city state river city state river argmax population population CA What is the most populous city in CA? Lexical Triggers:. String match CA ) CA. Function words (0 words) most ) argmax 3. Nouns/adjectives city ) city state river population

Predicates to DCS Trees (Compositional Semantics) C i,j = set of DCS trees for span [i, j] i most populous city in California j

Predicates to DCS Trees (Compositional Semantics) C i,j = set of DCS trees for span [i, j] i most populous k city in California j

Predicates to DCS Trees (Compositional Semantics) C i,j = set of DCS trees for span [i, j] C i,k C k,j i most populous k city in California j

Predicates to DCS Trees (Compositional Semantics) C i,j = set of DCS trees for span [i, j] population city c argmax loc CA C i,k C k,j i most populous k city in California j

Predicates to DCS Trees (Compositional Semantics) C i,j = set of DCS trees for span [i, j] city population loc population c c argmax CA city argmax loc CA C i,k C k,j i most populous k city in California j

Predicates to DCS Trees (Compositional Semantics) C i,j = set of DCS trees for span [i, j] city population loc population c c argmax CA city argmax loc CA C i,k C k,j i most populous k city in California j

Predicates to DCS Trees (Compositional Semantics) C i,j = set of DCS trees for span [i, j] city loc loc population c population CA city argmax c argmax loc CA C i,k C k,j i most populous k city in California j

Predicates to DCS Trees (Compositional Semantics) C i,j = set of DCS trees for span [i, j] city loc loc population c population CA city argmax c argmax loc CA C i,k C k,j i most populous k city in California j

Predicates to DCS Trees (Compositional Semantics) C i,j = set of DCS trees for span [i, j] city border loc population c population CA city argmax c argmax loc CA C i,k C k,j i most populous k city in California j

Predicates to DCS Trees (Compositional Semantics) C i,j = set of DCS trees for span [i, j] population c argmax city population c argmax loc CA city loc CA C i,k C k,j i most populous k city in California j

Plan x capital of California? parameters z capital What s possible? z Z(x) What s probable? p(z x, ) CA Learning from (x, y) data database w y Sacramento 3

Log-linear Model z: city loc CA x: city in California 4

Log-linear Model z: city loc CA x: city in California features(x, z) =( ) R d 4

Log-linear Model z: city loc CA x: city in California in features(x, z) =( loc :) R d 4

Log-linear Model z: city loc CA x: city in California in loc : city loc features(x, z) =( :) R d 4

Log-linear Model z: city loc CA x: city in California in loc : city loc : features(x, z) =( ) R d 4

Log-linear Model z: city loc CA x: city in California in loc : city loc : features(x, z) =( R d ) score(x, z) =features(x, z) 4

Log-linear Model z: city loc CA x: city in California in loc : city loc : features(x, z) =( ) R d score(x, z) =features(x, z) p(z x, ) = P e score(x,z) z 0 Z(x) escore(x,z0 ) 4

Plan x capital of California? parameters z capital What s possible? z Z(x) What s probable? p(z x, ) CA Learning from (x, y) data database w y Sacramento 5

Learning Objective Function: p(y z,w) p(z x, ) Interpretation Semantic parsing 6

Learning Objective Function: max p(y z,w) p(z x, ) Interpretation Semantic parsing 6

Learning Objective Function: max Pz p(y z,w) p(z x, ) Interpretation Semantic parsing 6

Learning Objective Function: max Pz p(y z,w) p(z x, ) Interpretation Semantic parsing EM-like Algorithm: parameters (0, 0,...,0) 6

Learning Objective Function: max Pz p(y z,w) p(z x, ) Interpretation Semantic parsing EM-like Algorithm: parameters (0, 0,...,0) enumerate/score DCS trees 6

Learning Objective Function: max Pz p(y z,w) p(z x, ) Interpretation Semantic parsing EM-like Algorithm: parameters (0, 0,...,0) enumerate/score DCS trees k-best list tree tree tree3 tree4 tree5 6

Learning Objective Function: max Pz p(y z,w) p(z x, ) Interpretation Semantic parsing EM-like Algorithm: parameters (0.,.3,...,0.7) enumerate/score DCS trees numerical optimization (L-BFGS) k-best list tree tree tree3 tree4 tree5 6

Learning Objective Function: max Pz p(y z,w) p(z x, ) Interpretation Semantic parsing EM-like Algorithm: parameters (0.,.3,...,0.7) enumerate/score DCS trees numerical optimization (L-BFGS) k-best list tree3 tree8 tree6 tree tree4 6

Learning Objective Function: max Pz p(y z,w) p(z x, ) Interpretation Semantic parsing EM-like Algorithm: parameters (0.3,.4,...,0.6) enumerate/score DCS trees numerical optimization (L-BFGS) k-best list tree3 tree8 tree6 tree tree4 6

Learning Objective Function: max Pz p(y z,w) p(z x, ) Interpretation Semantic parsing EM-like Algorithm: parameters (0.3,.4,...,0.6) enumerate/score DCS trees numerical optimization (L-BFGS) k-best list tree3 tree8 tree tree4 tree9 6

Outline Representation loc state border CA city major traverse river traverse AZ x Learning z w y Experiments 7

US Geography Benchmark Standard semantic parsing benchmark since 990s 600 training examples, 80 test examples 8

US Geography Benchmark Standard semantic parsing benchmark since 990s 600 training examples, 80 test examples What is the highest point in Florida? How many states have a city called Rochester? What is the longest river that runs through a state that borders Tennessee? Of the states washed by the Mississippi river which has the lowest point? 8

US Geography Benchmark Standard semantic parsing benchmark since 990s 600 training examples, 80 test examples What is the highest point in Florida? ) answer(a,highest(a,(place(a),loc(a,b),const(b,stateid(florida))))) How many states have a city called Rochester? ) answer(a,count(b,(state(b),loc(c,b),const(c,cityid(rochester, ))),A)) What is the longest river that runs through a state that borders Tennessee? ) answer(a,longest(a,(river(a),traverse(a,b),state(b),next to(b,c),const(c,stateid(tennessee))))) Of the states washed by the Mississippi river which has the lowest point? ) answer(a,lowest(b,(state(a),traverse(c,a),const(c,riverid(mississippi)),loc(b,a),place(b)))) Supervision in past work: question + program 8

US Geography Benchmark Standard semantic parsing benchmark since 990s 600 training examples, 80 test examples What is the highest point in Florida? ) Walton County How many states have a city called Rochester? ) What is the longest river that runs through a state that borders Tennessee? ) Missouri Of the states washed by the Mississippi river which has the lowest point? ) Louisiana Supervision in past work: question + program Supervision in this work: question + answer 8

Input to Learning Algorithm Training data (600 examples) What is the highest point in Florida? ) Walton County How many states have a city called Rochester? ) What is the longest river that runs through a state that borders Tennessee? ) Missouri Of the states washed by the Mississippi river which has the lowest point? ) Louisiana 9

Input to Learning Algorithm Training data (600 examples) What is the highest point in Florida? ) Walton County How many states have a city called Rochester? ) What is the longest river that runs through a state that borders Tennessee? ) Missouri Of the states washed by the Mississippi river which has the lowest point? ) Louisiana Lexicon (75 words) city ) city state ) state mountain ) mountain, peak 9

Input to Learning Algorithm Training data (600 examples) What is the highest point in Florida? ) Walton County How many states have a city called Rochester? ) What is the longest river that runs through a state that borders Tennessee? ) Missouri Of the states washed by the Mississippi river which has the lowest point? ) Louisiana Lexicon (75 words) Database city ) city state ) state mountain ) mountain, peak city San Francisco Chicago Boston loc Mount Shasta California San Francisco California Boston Massachusetts state Alabama Alaska Arizona border Washington Oregon Washington Idaho Oregon Washington 9

Experiment On Geo, 50 training examples, 50 test examples 00 95 test accuracy 90 85 80 75 30

Experiment On Geo, 50 training examples, 50 test examples System Description Lexicon (gen./spec.) Logical forms cgcr0 FunQL [Clarke et al., 00] 00 95 test accuracy 90 85 80 75 73.% cgcr0 30

Experiment On Geo, 50 training examples, 50 test examples System Description Lexicon (gen./spec.) Logical forms cgcr0 FunQL [Clarke et al., 00] dcs our system 00 95 test accuracy 90 85 80 75 73.% 78.9% cgcr0 dcs 30

Experiment On Geo, 50 training examples, 50 test examples System Description Lexicon (gen./spec.) Logical forms cgcr0 FunQL [Clarke et al., 00] dcs our system dcs + our system 00 95 test accuracy 90 85 80 75 73.% 78.9% 87.% cgcr0 dcs dcs + 30

Experiment On Geo, 600 training examples, 80 test examples 3

Experiment On Geo, 600 training examples, 80 test examples System Description Lexicon Logical forms 00 95 test accuracy 90 85 80 75 3

Experiment On Geo, 600 training examples, 80 test examples System Description Lexicon Logical forms zc05 CCG [Zettlemoyer & Collins, 005] 00 95 test accuracy 90 85 80 75 79.3% zc05 3

Experiment On Geo, 600 training examples, 80 test examples System Description zc05 CCG [Zettlemoyer & Collins, 005] zc07 relaxed CCG [Zettlemoyer & Collins, 007] Lexicon Logical forms 00 95 test accuracy 90 85 80 79.3% 86.% 75 zc05 zc07 3

Experiment On Geo, 600 training examples, 80 test examples System Description zc05 CCG [Zettlemoyer & Collins, 005] zc07 relaxed CCG [Zettlemoyer & Collins, 007] kzgs0 CCG w/unification [Kwiatkowski et al., 00] Lexicon Logical forms 00 test accuracy 95 90 85 80 79.3% 86.% 88.9% 75 zc05 zc07 kzgs0 3

Experiment On Geo, 600 training examples, 80 test examples System Description zc05 CCG [Zettlemoyer & Collins, 005] zc07 relaxed CCG [Zettlemoyer & Collins, 007] kzgs0 CCG w/unification [Kwiatkowski et al., 00] dcs our system Lexicon Logical forms 00 test accuracy 95 90 85 80 79.3% 86.% 88.9% 88.6% 75 zc05 zc07 kzgs0 dcs 3

Experiment On Geo, 600 training examples, 80 test examples System Description zc05 CCG [Zettlemoyer & Collins, 005] zc07 relaxed CCG [Zettlemoyer & Collins, 007] kzgs0 CCG w/unification [Kwiatkowski et al., 00] dcs our system dcs + our system 00 Lexicon Logical forms test accuracy 95 90 85 80 79.3% 86.% 88.9% 88.6% 9.% 75 zc05 zc07 kzgs0 dcs dcs + 3

Some Intuition on Learning 3

Some Intuition on Learning parameters () search DCS trees (hard!) () numerical optimization k-best lists 3

Some Intuition on Learning () search DCS trees (hard!) parameters k-best lists () numerical optimization If no DCS tree on k-best list is correct, skip example in () 3

Some Intuition on Learning () search DCS trees (hard!) parameters k-best lists () numerical optimization If no DCS tree on k-best list is correct, skip example in () % examples trained on 00 80 60 40 0 3 4 iteration 3

Some Intuition on Learning () search DCS trees (hard!) parameters k-best lists () numerical optimization If no DCS tree on k-best list is correct, skip example in () % examples trained on 00 80 60 40 0 3 4 iteration E ect: automatic curriculum learning, learning improves search 3

Current Limitations 9

Only using forward information Current Limitations Execute program to get answer, but want to invert 9

Current Limitations Only using forward information Execute program to get answer, but want to invert Non-identifiability of program If all cities in database are in US, then can t distinguish {c : city(c)} and {c : city(c) ^ loc(c, US)} 9

Current Limitations Only using forward information Execute program to get answer, but want to invert Non-identifiability of program If all cities in database are in US, then can t distinguish {c : city(c)} and {c : city(c) ^ loc(c, US)} Unknown facts: How far is Los Angeles from Boston? Database has no distance information 9

Current Limitations Only using forward information Execute program to get answer, but want to invert Non-identifiability of program If all cities in database are in US, then can t distinguish {c : city(c)} and {c : city(c) ^ loc(c, US)} Unknown facts: How far is Los Angeles from Boston? Database has no distance information Unknown concepts: What states are landlocked? Need to induce database view for landlocked(x) = border(x, ocean) 9

Conclusion Goal: learn to answer questions from question/answer pairs 30

Conclusion Goal: learn to answer questions from question/answer pairs Empirical result: DCS (no logical forms) u existing systems (with logical forms) 30

Conclusion Goal: learn to answer questions from question/answer pairs Empirical result: DCS (no logical forms) u existing systems (with logical forms) Conceptual contribution: DCS trees Trees: connects dependency syntax with e cient evaluation 30

Conclusion Goal: learn to answer questions from question/answer pairs Empirical result: DCS (no logical forms) u existing systems (with logical forms) Conceptual contribution: DCStrees Trees: connects dependency syntax with e cient evaluation Mark-Execute: unifying framework for handling scope 30

thank you 35