frequencies that matter in sentence processing

1 frequencies that matter in sentence processing Marten van Schijndel 1 September 7, Department of Linguistics, The Ohio State University van Schijndel Frequencies that matter September 7, / 69

2 frequencies are important van Schijndel Frequencies that matter September 7, / 69

3 frequencies are important Occurrence frequencies describe languages well Zipf's law Statistical NLP (esp. vector spaces) van Schijndel Frequencies that matter September 7, / 69

4 frequencies are important Occurrence frequencies describe languages well Zipf's law Statistical NLP (esp. vector spaces) Occurrence frequencies have major influence on sentence processing Behavioral measures (e.g., reading times) Processing measures (e.g., ERPs) Uniform Information Density Saarland SFB-1102 van Schijndel Frequencies that matter September 7, / 69

5 null hypothesis demands control van Schijndel Frequencies that matter September 7, / 69

6 null hypothesis demands control Linguists must control for frequencies to do research van Schijndel Frequencies that matter September 7, / 69

7 null hypothesis demands control Linguists must control for frequencies to do research How do people try to account for frequencies? van Schijndel Frequencies that matter September 7, / 69

8 Case Study 1: Cloze Probabilities van Schijndel, Culicover, & Schuler (2014) Pertains to: Pickering & Traxler (2003), inter alia van Schijndel Frequencies that matter September 7, / 69

9 generation/cloze probabilities Ask subjects to generate distribution van Schijndel Frequencies that matter September 7, / 69

10 generation/cloze probabilities Ask subjects to generate distribution Sentence generation norming: Write sentences with these words landed, sneezed, laughed, van Schijndel Frequencies that matter September 7, / 69

11 generation/cloze probabilities Ask subjects to generate distribution Sentence generation norming: Write sentences with these words landed, sneezed, laughed, Cloze norming: Complete this sentence The pilot landed van Schijndel Frequencies that matter September 7, / 69

12 generation/cloze probabilities Ask subjects to generate distribution Sentence generation norming: Write sentences with these words landed, sneezed, laughed, Cloze norming: Complete this sentence The pilot landed the plane van Schijndel Frequencies that matter September 7, / 69

13 generation/cloze probabilities Ask subjects to generate distribution Sentence generation norming: Write sentences with these words landed, sneezed, laughed, Cloze norming: Complete this sentence The pilot landed the plane The pilot landed in the field van Schijndel Frequencies that matter September 7, / 69

14 generation/cloze probabilities Ask subjects to generate distribution Sentence generation norming: Write sentences with these words landed, sneezed, laughed, Cloze norming: Complete this sentence NP: The pilot landed the plane PP: The pilot landed in the field Pickering & Traxler (2003) used 6 cloze tasks to determine frequencies van Schijndel Frequencies that matter September 7, / 69

15 generation/cloze probabilities Ask subjects to generate distribution Sentence generation norming: Write sentences with these words landed, sneezed, laughed, Cloze norming: Complete this sentence NP: The pilot landed the plane (25%) PP: The pilot landed in the field (40%) Pickering & Traxler (2003) used 6 cloze tasks to determine frequencies van Schijndel Frequencies that matter September 7, / 69
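Scoring such a norm amounts to taking the relative frequency of each completion type. A minimal sketch of that computation; the completions and their structural labels below are hypothetical, not Pickering & Traxler's actual norming data:

```python
from collections import Counter

def subcat_bias(completions):
    """Relative frequency of each completion type in a cloze norm.

    `completions` is a list of (continuation, label) pairs, where the label
    marks the structure of the continuation (e.g. "NP" or "PP").
    """
    counts = Counter(label for _, label in completions)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}

# Hypothetical completions of "The pilot landed ..."
completions = [
    ("the plane", "NP"),
    ("in the field", "PP"),
    ("on the runway", "PP"),
    ("safely", "OTHER"),
]
print(subcat_bias(completions))  # {'NP': 0.25, 'PP': 0.5, 'OTHER': 0.25}
```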

16 pickering & traxler (2003) Stimuli (1) That's the plane that the pilot landed behind in the fog (2) That's the truck that the pilot landed behind in the fog Readers slow down at 'landed' in (2) van Schijndel Frequencies that matter September 7, / 69

17 pickering & traxler (2003) Stimuli (1) That's the plane that the pilot landed behind in the fog (2) That's the truck that the pilot landed behind in the fog Readers slow down at 'landed' in (2) Suggests they try to link 'truck' as the object of 'landed' despite 'landed' being biased toward a PP complement: 40% PP complement vs. 25% NP complement van Schijndel Frequencies that matter September 7, / 69

18 pickering & traxler (2003) Readers initially adopt a transitive interpretation despite subcat bias van Schijndel Frequencies that matter September 7, / 69

19 pickering & traxler (2003) Readers initially adopt a transitive interpretation despite subcat bias Early-attachment processing heuristic van Schijndel Frequencies that matter September 7, / 69

20 pickering & traxler (2003) Readers initially adopt a transitive interpretation despite subcat bias Early-attachment processing heuristic But what about syntactic frequencies? van Schijndel Frequencies that matter September 7, / 69

21 generalized categorial grammar (gcg) Nguyen et al. (2012) [GCG derivation tree for "the apple that the girl ate", built from categories D, N, N-aD, A-aN, N-rN, V-gN, V-aN-gN, V-aN-bN] van Schijndel Frequencies that matter September 7, / 69

22 generalized categorial grammar (gcg) Nguyen et al. (2012) [Same GCG derivation, with the gap-passing categories V-gN and V-aN-gN highlighted] van Schijndel Frequencies that matter September 7, / 69

23 what about syntactic frequencies? Pickering & Traxler (2003) (1) That's the plane that the pilot landed behind in the fog (2) That's the truck that the pilot landed behind in the fog [Trees: (a) Transitive: VP-gNP → VP-gNP PP, with TV 'landed' + object trace t_i and P 'behind' + NP; (b) Intransitive: VP-gNP → VP PP-gNP, with IV 'landed' and P 'behind' + trace t_i] van Schijndel Frequencies that matter September 7, / 69

26 what about syntactic frequencies? Pickering & Traxler (2003) (1) That's the plane that the pilot landed behind in the fog (2) That's the truck that the pilot landed behind in the fog van Schijndel et al. (2014) Using syntactic probabilities with cloze data: van Schijndel Frequencies that matter September 7, / 69

27 what about syntactic frequencies? Pickering & Traxler (2003) (1) That's the plane that the pilot landed behind in the fog (2) That's the truck that the pilot landed behind in the fog van Schijndel et al. (2014) Using syntactic probabilities with cloze data: P(Transitive | landed) ≈ 0.016 P(Intransitive | landed) ≈ 0.004 van Schijndel Frequencies that matter September 7, / 69

28 what about syntactic frequencies? Pickering & Traxler (2003) (1) That's the plane that the pilot landed behind in the fog (2) That's the truck that the pilot landed behind in the fog van Schijndel et al. (2014) Using syntactic probabilities with cloze data: P(Transitive | landed) ≈ 0.016 P(Intransitive | landed) ≈ 0.004 Transitive interpretation is 300% more likely! van Schijndel Frequencies that matter September 7, / 69
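Where those two numbers come from is worked through in the appendix slides at the end of this deck: a WSJ rule probability is combined with the verb's cloze subcategorization bias, rescaled by the preterminal prior via Bayes' rule. A reconstruction of that calculation (decimal points restored from the appendix figures):

```latex
P(\mathrm{Trans} \mid \mathit{landed}) \approx
  P(\text{VP-gNP} \rightarrow \text{VP-gNP PP}) \cdot \frac{P(\mathrm{TV} \mid \mathit{landed})}{P(\mathrm{TV})}
  = 0.17 \cdot \frac{0.25}{2.6} \approx 0.016

P(\mathrm{Intrans} \mid \mathit{landed}) \approx
  P(\text{VP-gNP} \rightarrow \text{VP PP-gNP}) \cdot \frac{P(\mathrm{IV} \mid \mathit{landed})}{P(\mathrm{IV})}
  = 0.01 \cdot \frac{0.40}{1.0} = 0.004
```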

29 results: case study 1 Subcat processing accounted for by hierarchic syntactic frequencies Early attachment heuristic unnecessary van Schijndel Frequencies that matter September 7, / 69

30 results: case study 1 Subcat processing accounted for by hierarchic syntactic frequencies Early attachment heuristic unnecessary Also applies to heavy-NP shift heuristics (Staub, 2006), unaccusative processing (Staub et al., 2007), etc. van Schijndel Frequencies that matter September 7, / 69

31 results: case study 1 Subcat processing accounted for by hierarchic syntactic frequencies Early attachment heuristic unnecessary Also applies to heavy-NP shift heuristics (Staub, 2006), unaccusative processing (Staub et al., 2007), etc. Suggests cloze probabilities are insufficient as a frequency control van Schijndel Frequencies that matter September 7, / 69

32 results: case study 1 Subcat processing accounted for by hierarchic syntactic frequencies Early attachment heuristic unnecessary Also applies to heavy-NP shift heuristics (Staub, 2006), unaccusative processing (Staub et al., 2007), etc. Suggests cloze probabilities are insufficient as a frequency control But do people use hierarchic syntactic probabilities? van Schijndel Frequencies that matter September 7, / 69

33 Case Study 2: N-grams and Syntactic Probabilities van Schijndel & Schuler (2015) Pertains to: Frank & Bod (2011), inter alia van Schijndel Frequencies that matter September 7, / 69

34 case study 2: overview Previous studies have debated whether humans use hierarchic syntax van Schijndel Frequencies that matter September 7, / 69

35 case study 2: overview Previous studies have debated whether humans use hierarchic syntax [Phrase-structure tree for "the apple that the girl ate": NP → NP RC; NP → D N; RC → IN VP; VP → NP V] van Schijndel Frequencies that matter September 7, / 69

36 case study 2: overview Previous studies have debated whether humans use hierarchic syntax [Phrase-structure tree for "the apple that the girl ate": NP → NP RC; NP → D N; RC → IN VP; VP → NP V] But how robust were their models? van Schijndel Frequencies that matter September 7, / 69

37 case study 2: overview This work shows that: van Schijndel Frequencies that matter September 7, / 69

38 case study 2: overview This work shows that: N-gram models can be greatly improved (accumulation) van Schijndel Frequencies that matter September 7, / 69

39 case study 2: overview This work shows that: N-gram models can be greatly improved (accumulation) Hierarchic syntax is still predictive over stronger baseline (Long distance dependencies independently improve model) van Schijndel Frequencies that matter September 7, / 69

40 hierarchic syntax in reading? The red apple that the girl ate Frank & Bod (2011) van Schijndel Frequencies that matter September 7, / 69

41 hierarchic syntax in reading? The red apple that the girl ate Frank & Bod (2011) Baseline: Sentence Position Word length N-grams (Unigram, bigram) van Schijndel Frequencies that matter September 7, / 69

42 hierarchic syntax in reading? The(w1) red(w2) apple(w3) that(w4) the(w5) girl(w6) ate Frank & Bod (2011) Baseline: Sentence Position Word length N-grams (Unigram, bigram) van Schijndel Frequencies that matter September 7, / 69

43 hierarchic syntax in reading? The red apple that the girl (4 chars) ate Frank & Bod (2011) Baseline: Sentence Position Word length N-grams (Unigram, bigram) van Schijndel Frequencies that matter September 7, / 69

45 hierarchic syntax in reading? The red apple that the girl ate Frank & Bod (2011) Baseline: Sentence Position Word length N-grams (Unigram, bigram) Test POS Predictors: Echo State Network (ESN) Phrase Structure Grammar (PSG) van Schijndel Frequencies that matter September 7, / 69

46 hierarchic syntax in reading? DT ADJ NN IN DT NN VBD Frank & Bod (2011) Baseline: Sentence Position Word length N-grams (Unigram, bigram) Test POS Predictors: Echo State Network (ESN) Phrase Structure Grammar (PSG) van Schijndel Frequencies that matter September 7, / 69

47 hierarchic syntax in reading? [Phrase-structure tree over the POS sequence DT JJ NN IN DT NN VBD] Frank & Bod (2011) Baseline: Sentence Position Word length N-grams (Unigram, bigram) Test POS Predictors: Echo State Network (ESN) Phrase Structure Grammar (PSG) van Schijndel Frequencies that matter September 7, / 69

48 hierarchic syntax in reading? Frank & Bod (2011) Baseline: Sentence Position Word length N-grams (Unigram, bigram) Test POS Predictors: Echo State Network (ESN) Phrase Structure Grammar (PSG) van Schijndel Frequencies that matter September 7, / 69

49 hierarchic syntax in reading? Frank & Bod (2011) Baseline: Sentence Position Word length N-grams (Unigram, bigram) Test POS Predictors: Echo State Network (ESN) Phrase Structure Grammar (PSG) Outcome: PSG < ESN + PSG ESN = ESN + PSG van Schijndel Frequencies that matter September 7, / 69
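The "<" and "=" notation here can be read as nested model comparisons: a predictor "helps" when adding it to the other model significantly improves the fit to reading times, typically assessed with a likelihood-ratio test. A minimal sketch of such a comparison; the log-likelihood values below are made up for illustration:

```python
from scipy.stats import chi2

def likelihood_ratio_test(loglik_small, loglik_big, df_diff):
    """Chi-square likelihood-ratio test between nested regression models."""
    stat = 2.0 * (loglik_big - loglik_small)
    return stat, chi2.sf(stat, df_diff)

# Hypothetical fits: PSG-only model vs. model with both ESN and PSG predictors
stat, p = likelihood_ratio_test(loglik_small=-10240.0, loglik_big=-10231.5, df_diff=1)
print(f"chi2(1) = {stat:.1f}, p = {p:.2g}")  # small p => the added predictor helps
```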

50 hierarchic syntax in reading? Frank & Bod (2011) Baseline: Sentence Position Word length Test POS Predictors: Echo State Network (ESN) Phrase Structure Grammar (PSG) N-grams (Unigram, bigram) Outcome: PSG < ESN + PSG ESN = ESN + PSG Sequential helps over hierarchic van Schijndel Frequencies that matter September 7, / 69

51 hierarchic syntax in reading? Frank & Bod (2011) Baseline: Sentence Position Word length N-grams (Unigram, bigram) Test POS Predictors: Echo State Network (ESN) Phrase Structure Grammar (PSG) Outcome: PSG < ESN + PSG ESN = ESN + PSG Hierarchic doesn't help over sequential van Schijndel Frequencies that matter September 7, / 69

52 hierarchic syntax in reading? Fossum & Levy (2012) Replicated Frank & Bod (2011): PSG < ESN + PSG ESN = ESN + PSG van Schijndel Frequencies that matter September 7, / 69

53 hierarchic syntax in reading? Fossum & Levy (2012) Replicated Frank & Bod (2011): PSG < ESN + PSG ESN = ESN + PSG Better n-gram baseline (more data) changes result: PSG = ESN + PSG ESN = ESN + PSG van Schijndel Frequencies that matter September 7, / 69

54 hierarchic syntax in reading? Fossum & Levy (2012) Replicated Frank & Bod (2011): PSG < ESN + PSG ESN = ESN + PSG Better n-gram baseline (more data) changes result: PSG = ESN + PSG Sequential doesn't help over hierarchic ESN = ESN + PSG van Schijndel Frequencies that matter September 7, / 69

55 hierarchic syntax in reading? Fossum & Levy (2012) Replicated Frank & Bod (2011): PSG < ESN + PSG ESN = ESN + PSG Better n-gram baseline (more data) changes result: PSG = ESN + PSG Sequential doesn't help over hierarchic ESN = ESN + PSG Also: lexicalized syntax improves PSG fit van Schijndel Frequencies that matter September 7, / 69

56 improved n-gram baseline Most previous reading time studies: N-grams trained on WSJ, Dundee, BNC (or less) van Schijndel Frequencies that matter September 7, / 69

57 improved n-gram baseline Most previous reading time studies: N-grams trained on WSJ, Dundee, BNC (or less) This study: N-grams trained on Gigaword 4.0 van Schijndel Frequencies that matter September 7, / 69

58 improved n-gram baseline Most previous reading time studies: N-grams trained on WSJ, Dundee, BNC (or less) Unigrams/Bigrams This study: N-grams trained on Gigaword 4.0 van Schijndel Frequencies that matter September 7, / 69

59 improved n-gram baseline [Figure: perplexity of the n-gram model as a function of n] van Schijndel Frequencies that matter September 7, / 69

60 improved n-gram baseline Most previous reading time studies: N-grams trained on WSJ, Dundee, BNC Unigrams/Bigrams This study: N-grams trained on Gigaword 4.0 5-grams van Schijndel Frequencies that matter September 7, / 69

61 improved n-gram baseline Most previous reading time studies: N-grams trained on WSJ, Dundee, BNC Unigrams/Bigrams Only from region boundaries This study: N-grams trained on Gigaword 4.0 5-grams van Schijndel Frequencies that matter September 7, / 69

62 improved n-gram baseline Bigram Example Reading time of 'girl' after 'red': The red(1) apple that the(2) girl ate (the words between the two fixations form the region); bigram target: 'girl'; bigram condition: 'the' van Schijndel Frequencies that matter September 7, / 69

63 improved n-gram baseline Bigram Example Reading time of 'girl' after 'red': The red(1) apple that the(2) girl ate (the words between the two fixations form the region); bigram target: 'girl'; bigram condition: 'the' Fails to capture entire sequence; van Schijndel Frequencies that matter September 7, / 69

64 improved n-gram baseline Bigram Example Reading time of 'girl' after 'red': The red(1) apple that the(2) girl ate (the words between the two fixations form the region); bigram target: 'girl'; bigram condition: 'the' Fails to capture entire sequence; Conditions never generated; van Schijndel Frequencies that matter September 7, / 69

65 improved n-gram baseline Bigram Example Reading time of 'girl' after 'red': The red(1) apple that the(2) girl ate (the words between the two fixations form the region); bigram target: 'girl'; bigram condition: 'the' Fails to capture entire sequence; Conditions never generated; Probability of sequence is deficient van Schijndel Frequencies that matter September 7, / 69

66 improved n-gram baseline Cumulative Bigram Example Reading time of 'girl' after 'red': The(1) red apple that the(2) girl ate; bigram targets: every word read up through 'girl'; bigram conditions: each target's preceding word van Schijndel Frequencies that matter September 7, / 69

67 improved n-gram baseline Cumulative Bigram Example Reading time of 'girl' after 'red': The(1) red apple that the(2) girl ate; bigram targets: every word read up through 'girl'; bigram conditions: each target's preceding word Captures entire sequence; Well-formed sequence probability; Reflects processing that must be done by humans van Schijndel Frequencies that matter September 7, / 69
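In other words, the cumulative measure sums the n-gram surprisal of every word generated since the previous fixation rather than scoring only the word at the region boundary. A minimal sketch, assuming some `logprob(word, prev)` function returning a log (base 2) bigram probability; the uniform placeholder model below stands in for the Gigaword model used in the study:

```python
import math

def bigram_surprisal(word, prev, logprob):
    """Surprisal (in bits) of `word` given only the immediately preceding word."""
    return -logprob(word, prev)

def cumulative_bigram_surprisal(region_words, prev, logprob):
    """Sum of bigram surprisals over every word read since the last fixation.

    `region_words` runs from just after the previously fixated word up to and
    including the currently fixated word (e.g. "apple that the girl").
    """
    total = 0.0
    for word in region_words:
        total += -logprob(word, prev)
        prev = word
    return total

# Placeholder: a uniform bigram model over a 10,000-word vocabulary
logprob = lambda word, prev: math.log2(1.0 / 10_000)

region = "apple that the girl".split()
print(bigram_surprisal("girl", "the", logprob))              # boundary-only measure
print(cumulative_bigram_surprisal(region, "red", logprob))   # cumulative measure
```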

68 improved n-gram baseline Most previous reading time studies: N-grams trained on WSJ, Dundee, BNC Unigrams/Bigrams Only from region boundaries This study: N-grams trained on Gigaword 4.0 5-grams van Schijndel Frequencies that matter September 7, / 69

69 improved n-gram baseline Most previous reading time studies: N-grams trained on WSJ, Dundee, BNC Unigrams/Bigrams Only from region boundaries This study: N-grams trained on Gigaword 4.0 5-grams Cumulative and Non-cumulative van Schijndel Frequencies that matter September 7, / 69

70 evaluation Dundee Corpus (Kennedy et al., 2003) 10 subjects 2,388 sentences First pass durations (≈200,000) Go-past durations (≈200,000) Exclusions: Unknown words (<5 tokens) First and last words of each line Regions larger than 4 words (track loss) van Schijndel Frequencies that matter September 7, / 69

71 cumu-n-grams predict reading times Baseline: Fixed Effects Sentence Position Word length Region Length Preceding word fixated? Random Effects Item/Subject Intercepts By Subject Slopes: All Fixed Effects N-grams (5-grams) N-grams (Cumu-5-grams) van Schijndel Frequencies that matter September 7, / 69
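This kind of baseline-plus-predictor regression can be sketched as follows. The sketch is simplified relative to the analysis on this slide: it uses subject grouping only (no crossed item intercepts), one random slope per predictor of interest, and hypothetical column names and file path:

```python
import pandas as pd
import statsmodels.formula.api as smf

# Hypothetical table: one row per first-pass region, with columns
# fp_dur, sentpos, wlen, rlen, prevfix, ngram, cumu_ngram, subject
df = pd.read_csv("dundee_first_pass.csv")

baseline = smf.mixedlm(
    "fp_dur ~ sentpos + wlen + rlen + prevfix + ngram",
    df, groups=df["subject"], re_formula="~ngram",
).fit(reml=False)

with_cumu = smf.mixedlm(
    "fp_dur ~ sentpos + wlen + rlen + prevfix + ngram + cumu_ngram",
    df, groups=df["subject"], re_formula="~ngram + cumu_ngram",
).fit(reml=False)

# Compare log-likelihoods: does the cumulative n-gram predictor improve fit?
print(baseline.llf, with_cumu.llf)
```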

73 cumu-n-grams predict reading times First Pass and Go-Past van Schijndel Frequencies that matter September 7, / 69

74 follow-up questions Is hierarchic surprisal useful over the better baseline? van Schijndel Frequencies that matter September 7, / 69

75 follow-up questions Is hierarchic surprisal useful over the better baseline? If so, can it be similarly improved through accumulation? van Schijndel Frequencies that matter September 7, / 69

76 follow-up questions Is hierarchic surprisal useful over the better baseline? If so, can it be similarly improved through accumulation? van Schijndel & Schuler (2013) found that it was over weaker baselines Grammar: Berkeley parser, WSJ, 5 split-merge cycles (Petrov & Klein 2007) van Schijndel Frequencies that matter September 7, / 69
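For reference, surprisal is the standard negative log conditional probability of a word given the words before it; the hierarchic version obtains that probability from the grammar's prefix probabilities, summing over the partial parses T compatible with each prefix, and the cumulative variant (as with the n-grams above) sums surprisal over all words since the previous fixation:

```latex
\mathrm{surprisal}(w_t)
  = -\log P(w_t \mid w_1 \ldots w_{t-1})
  = -\log \frac{\sum_{T} P(T,\, w_1 \ldots w_t)}{\sum_{T'} P(T',\, w_1 \ldots w_{t-1})}
```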

77 hierarchic surprisal predicts reading times Baseline: Fixed Effects Same as before N-grams (5-grams) N-grams (Cumu-5-grams) van Schijndel Frequencies that matter September 7, / 69

78 hierarchic surprisal predicts reading times Baseline: Fixed Effects Same as before N-grams (5-grams) N-grams (Cumu-5-grams) Random Effects Same as before By Subject Slopes: Hierarchic surprisal Cumu-Hierarchic surprisal van Schijndel Frequencies that matter September 7, / 69

80 hierarchic surprisal predicts reading times First Pass and Go-Past van Schijndel Frequencies that matter September 7, / 69

81 cumulative surprisal doesn't help?! Suggests previous findings were due to weaker n-gram baseline van Schijndel Frequencies that matter September 7, / 69

82 cumulative surprisal doesn't help?! Suggests previous findings were due to weaker n-gram baseline Suggests only local PCFG surprisal affects reading times van Schijndel Frequencies that matter September 7, / 69

83 cumulative surprisal doesn't help?! Suggests previous findings were due to weaker n-gram baseline Suggests only local PCFG surprisal affects reading times Follow-up work shows long distance dependencies independently influence reading times van Schijndel Frequencies that matter September 7, / 69

84 results: case study 2 Hierarchic syntax predicts reading times over strong linear baseline van Schijndel Frequencies that matter September 7, / 69

85 results: case study 2 Hierarchic syntax predicts reading times over strong linear baseline Long-distance dependencies help over hierarchic syntax van Schijndel Frequencies that matter September 7, / 69

86 results: case study 2 Hierarchic syntax predicts reading times over strong linear baseline Long-distance dependencies help over hierarchic syntax Studies should use cumu-n-grams in their baselines van Schijndel Frequencies that matter September 7, / 69

87 what does this mean for our models? We need to carefully control for: Cloze probabilities (Smith 2011) van Schijndel Frequencies that matter September 7, / 69

88 what does this mean for our models? We need to carefully control for: Cloze probabilities (Smith 2011) N-gram frequencies (local and cumulative) van Schijndel Frequencies that matter September 7, / 69

89 what does this mean for our models? We need to carefully control for: Cloze probabilities (Smith 2011) N-gram frequencies (local and cumulative) Hierarchic syntactic frequencies van Schijndel Frequencies that matter September 7, / 69

90 what does this mean for our models? We need to carefully control for: Cloze probabilities (Smith 2011) N-gram frequencies (local and cumulative) Hierarchic syntactic frequencies Long distance dependency frequencies van Schijndel Frequencies that matter September 7, / 69

91 what does this mean for our models? We need to carefully control for: Cloze probabilities (Smith 2011) N-gram frequencies (local and cumulative) Hierarchic syntactic frequencies Long distance dependency frequencies (discourse, etc) Then we can try to interpret experimental results What do we do about convergence? Is there a way to avoid this explosion of control predictors? van Schijndel Frequencies that matter September 7, / 69

92 Case Study 3: Evading Frequency Confounds van Schijndel, Murphy, & Schuler (2015) van Schijndel Frequencies that matter September 7, / 69

93 motivation Can we measure memory load with fewer controls? van Schijndel Frequencies that matter September 7, / 69

94 motivation Can we measure memory load with fewer controls? Why do so many factors influence results? van Schijndel Frequencies that matter September 7, / 69

95 motivation Can we measure memory load with fewer controls? Why do so many factors influence results? Low dimensionality measures van Schijndel Frequencies that matter September 7, / 69

96 motivation Can we measure memory load with fewer controls? Why do so many factors influence results? Low dimensionality measures Do the factors become separable in another space? van Schijndel Frequencies that matter September 7, / 69

97 motivation Can we measure memory load with fewer controls? Why do so many factors influence results? Low dimensionality measures Do the factors become separable in another space? Let's try using MEG van Schijndel Frequencies that matter September 7, / 69

98 what is meg? van Schijndel Frequencies that matter September 7, / 69

99 what is meg? 102 locations van Schijndel Frequencies that matter September 7, / 69

100 how might meg reflect load? Jensen et al. (2012) van Schijndel Frequencies that matter September 7, / 69

102 where to look? Memory is a function of distributed processing van Schijndel Frequencies that matter September 7, / 69

103 where to look? Memory is a function of distributed processing Look for synchronized firing between sensors (brain regions) van Schijndel Frequencies that matter September 7, / 69

104 where to look? van Schijndel Frequencies that matter September 7, / 69

105 where to look? Memory is a function of distributed processing Look for synchronized firing between sensors (brain regions) This study uses spectral coherence measurements van Schijndel Frequencies that matter September 7, / 69

106 spectral coherence coherence(x, y) = |E[S_xy]| / √(E[S_xx] E[S_yy]) van Schijndel Frequencies that matter September 7, / 69

107 spectral coherence coherence(x, y) = |E[S_xy]| / √(E[S_xx] E[S_yy]) numerator: cross-correlation (cross-spectral density S_xy); denominator: autocorrelations (auto-spectral densities S_xx, S_yy) van Schijndel Frequencies that matter September 7, / 69

108 spectral coherence coherence(x, y) = |E[S_xy]| / √(E[S_xx] E[S_yy]) cross-correlation over autocorrelations Amount of connectivity (synchronization) not caused by chance van Schijndel Frequencies that matter September 7, / 69
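This quantity can be estimated from two sensor time series with Welch cross- and auto-spectral density estimates; scipy's helper returns the magnitude-squared version directly. A minimal sketch on simulated data (not the MEG recordings), using the corpus's 125 Hz sampling rate:

```python
import numpy as np
from scipy.signal import coherence

fs = 125.0                       # Hz, matching the downsampled MEG data
rng = np.random.default_rng(0)
t = np.arange(0, 4.0, 1.0 / fs)

# Two simulated "sensors" sharing a 10 Hz component plus independent noise
shared = np.sin(2 * np.pi * 10 * t)
x = shared + rng.normal(scale=0.5, size=t.size)
y = shared + rng.normal(scale=0.5, size=t.size)

# Magnitude-squared coherence |E[S_xy]|^2 / (E[S_xx] E[S_yy]) per frequency bin
freqs, cxy = coherence(x, y, fs=fs, nperseg=128)
print(freqs[np.argmax(cxy)])     # should land near 10 Hz
```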

109 spectral coherence: phase synchrony Fell & Axmacher (2011) van Schijndel Frequencies that matter September 7, / 69

110 audiobook meg corpus Collected 2 years ago at CMU van Schijndel Frequencies that matter September 7, / 69

111 audiobook meg corpus Collected 2 years ago at CMU 3 subjects van Schijndel Frequencies that matter September 7, / 69

112 audiobook meg corpus Collected 2 years ago at CMU 3 subjects Heart of Darkness, ch 2 12,342 words 80 (8 x 10) minutes Synched with parallel audio recording and forced alignment van Schijndel Frequencies that matter September 7, / 69

113 audiobook meg corpus Collected 2 years ago at CMU 3 subjects Heart of Darkness, ch 2 12,342 words 80 (8 x 10) minutes Synched with parallel audio recording and forced alignment 306-channel Elekta Neuromag, CMU Movement/noise correction: SSP, SSS, tSSS Band-pass filtered Hz Downsampled to 125 Hz Visually scanned for muscle artifacts; none found van Schijndel Frequencies that matter September 7, / 69
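These preprocessing steps (noise correction, band-pass filtering, downsampling to 125 Hz) can be approximated in MNE-Python. The file path and the filter edges below are placeholders, since the original band limits are not given on the slide, and the original SSS/tSSS step would have been run with Elekta's MaxFilter rather than the roughly equivalent call shown here:

```python
import mne

raw = mne.io.read_raw_fif("subject1_audiobook_raw.fif", preload=True)  # hypothetical path

# Noise/movement correction roughly analogous to SSS/tSSS (Elekta MaxFilter)
raw = mne.preprocessing.maxwell_filter(raw, st_duration=10.0)

# Band-pass filter; 0.1-40 Hz are placeholder cutoffs, not the original settings
raw.filter(l_freq=0.1, h_freq=40.0)

# Downsample to 125 Hz, matching the corpus description
raw.resample(125.0)
```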

114 memory load via center embedding d1: The cart … broke d2: that the man bought van Schijndel Frequencies that matter September 7, / 69

115 memory load via center embedding d1: The cart … broke d2: that the man bought Depth annotations: van Schijndel et al. (2013) parser Nguyen et al. (2012) Generalized Categorial Grammar (GCG) van Schijndel Frequencies that matter September 7, / 69

116 data filtering Remove words: in short or long sentences (<4 or >50 words) that follow a word at another depth that fail to parse Partition data: Dev set: One third of corpus Test set: Two thirds of corpus van Schijndel Frequencies that matter September 7, / 69
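Read concretely, the filtering and partitioning might look like the following sketch over a word-level table; the column names and file are hypothetical, and the dev set is taken here simply as the first third of the filtered corpus:

```python
import pandas as pd

words = pd.read_csv("audiobook_words.csv")  # hypothetical: one row per word token

# Keep words in 4-50 word sentences that parsed successfully
ok_len = words["sent_len"].between(4, 50)
parsed = words["parse_ok"]
# Drop words whose preceding word sits at a different embedding depth
same_depth = words["depth"] == words["depth"].shift(1)

filtered = words[ok_len & parsed & same_depth].reset_index(drop=True)

# Dev set: one third of the corpus; test set: the remaining two thirds
cut = len(filtered) // 3
dev, test = filtered.iloc[:cut], filtered.iloc[cut:]
```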

117 compute coherence Group by factor Compute coherence over subsets of 4 epochs van Schijndel Frequencies that matter September 7, / 69

118 dev coherence [Figure: coherence (depth 2 vs. depth 1) by frequency (Hz) and time (s)] van Schijndel Frequencies that matter September 7, / 69

119 dev coherence +variance [Figure: coherence (depth 2 vs. depth 1) by frequency (Hz) and time (s), with variance] van Schijndel Frequencies that matter September 7, / 69

120 possible confounds? Sentence position Unigram, Bigram, Trigram: COCA logprobs PCFG surprisal: parser output van Schijndel Frequencies that matter September 7, / 69

121 dev results Factor p-values: Unigram 0.941, Bigram 0.257, Trigram 0.073, PCFG Surprisal 0.482, Sentence Position 0.031, Depth 0.005 Depth 1 (40 items) Depth 2 (1118 items) van Schijndel Frequencies that matter September 7, / 69

122 test results Factor p-value Unigram Bigram Trigram PCFG Surprisal Sentence Position Depth Depth 1 (86 items) Depth 2 (2142 items) van Schijndel Frequencies that matter September 7, / 69

123 test results Factor p-value Unigram Bigram Trigram PCFG Surprisal Sentence Position Depth Bonferroni correction removes trigrams, but van Schijndel Frequencies that matter September 7, / 69

124 compute coherence: increased resolution Group by factor Compute coherence over subsets of 6 epochs van Schijndel Frequencies that matter September 7, / 69

125 test results: increased resolution Factor p-value Trigram Depth Depth 1 (57 items) Depth 2 (1428 items) van Schijndel Frequencies that matter September 7, / 69

126 results: case study 3 Memory load is reflected in MEG connectivity Common confounds do not pose problems for oscillatory measures van Schijndel Frequencies that matter September 7, / 69

127 conclusions Cloze probabilities are insufficient as frequency control Hierarchic syntactic frequencies strongly influence processing Reading time studies need to use local and cumulative n-grams Oscillatory analyses could avoid control predictor explosion van Schijndel Frequencies that matter September 7, / 69

128 acknowledgements Stefan Frank, Matthew Traxler, Shari Speer, Roberto Zamparelli Attendees of CogSci 2014, CUNY 2015, NAACL 2015, CMCL 2015 OSU Linguistics Targeted Investment for Excellence ( ) National Science Foundation (DGE ) University of Pittsburgh Medical Center MEG Seed Fund National Institutes of Health CRCNS (5R01HD ) van Schijndel Frequencies that matter September 7, / 69

129 thanks! questions? Cloze probabilities are insufficient as frequency control Hierarchic syntactic frequencies strongly influence processing Reading time studies need to use local and cumulative n-grams Oscillatory analyses could avoid control predictor explosion van Schijndel Frequencies that matter September 7, / 69

130 improved n-gram baseline [Figure: perplexity of the n-gram model as a function of n] van Schijndel Frequencies that matter September 7, / 69

131 further improved n-gram baseline [Figure: perplexity of the n-gram model as a function of n] van Schijndel Frequencies that matter September 7, / 69

132 further improved n-gram baseline [Figure: perplexity of pruned vs. unpruned n-gram models as a function of n] van Schijndel Frequencies that matter September 7, / 69

133 model How probable is each subtree? Wall Street Journal (WSJ) section of the Penn Treebank: [Trees: (a) Transitive: VP-gNP → VP-gNP PP, with TV 'landed' + object trace t_i, Adv 'carefully', and P 'behind' + NP; (b) Intransitive: VP-gNP → VP PP-gNP, with IV 'landed', Adv 'carefully', and P 'behind' + trace t_i] van Schijndel Frequencies that matter September 7, / 69

134 model How probable is each subtree? Wall Street Journal (WSJ) section of the Penn Treebank: [Same transitive and intransitive trees as above] van Schijndel Frequencies that matter September 7, / 69

135 model What is the probability of each interpretation? P(syntactic configuration) × P(generating the verb from that tree) P(Transitive) = P(VP-gNP → VP-gNP PP) × P(verb | TV) (1) P(Intransitive) = P(VP-gNP → VP PP-gNP) × P(verb | IV) (2) van Schijndel Frequencies that matter September 7, / 69

136 model What is the probability of each interpretation? P(syntactic configuration) × P(generating the verb from that tree), i.e. P(subcat bias)/P(preterminal prior) P(Transitive) = P(VP-gNP → VP-gNP PP) × P(verb | TV) (1) P(Intransitive) = P(VP-gNP → VP PP-gNP) × P(verb | IV) (2) van Schijndel Frequencies that matter September 7, / 69

137 model What is the probability of each interpretation? P(syntactic configuration) × P(subcat bias)/P(preterminal prior) P(Transitive) = P(VP-gNP → VP-gNP PP) × P(verb | TV) ≈ P(VP-gNP → VP-gNP PP) × P(TV | verb)/P(TV) (1) P(Intransitive) = P(VP-gNP → VP PP-gNP) × P(verb | IV) ≈ P(VP-gNP → VP PP-gNP) × P(IV | verb)/P(IV) (2) van Schijndel Frequencies that matter September 7, / 69

138 model What are the preterminal priors? Relative prior probability from the WSJ: P(TV) : P(IV) = 2.6 : 1 van Schijndel Frequencies that matter September 7, / 69

139 model What is the probability of each interpretation? P(syntactic configuration) × P(subcat bias)/P(preterminal prior) P(Transitive) ≈ P(VP-gNP → VP-gNP PP) × P(TV | verb)/P(TV) = 0.17 × P(TV | verb)/2.6 (1) P(Intransitive) ≈ P(VP-gNP → VP PP-gNP) × P(IV | verb)/P(IV) = 0.01 × P(IV | verb)/1.0 (2) van Schijndel Frequencies that matter September 7, / 69

140 model What is the probability of each interpretation? P(syntactic configuration) × P(subcat bias)/P(preterminal prior) P(Transitive) ≈ 0.17 × P(TV | verb)/2.6 = 0.065 × P(TV | verb) (1) P(Intransitive) ≈ 0.01 × P(IV | verb)/1.0 = 0.01 × P(IV | verb) (2) van Schijndel Frequencies that matter September 7, / 69

141 model What is the probability of each interpretation? P(syntactic configuration) × P(subcat bias)/P(preterminal prior) P(Transitive) ≈ 0.17 × P(TV | verb)/2.6 = 0.065 × P(TV | verb) (1) P(Intransitive) ≈ 0.01 × P(IV | verb)/1.0 = 0.01 × P(IV | verb) (2) Pickering & Traxler (2003) experimentally determined subcat biases for a wide variety of verbs van Schijndel Frequencies that matter September 7, / 69

142 evaluation Pickering & Traxler (2003) (1) That's the plane that the pilot landed carefully behind in the fog at the airport (2) That's the truck that the pilot landed carefully behind in the fog at the airport Using Pickering & Traxler's (2003) subcat bias data: van Schijndel Frequencies that matter September 7, / 69

143 evaluation Pickering & Traxler (2003) (1) That's the plane that the pilot landed carefully behind in the fog at the airport (2) That's the truck that the pilot landed carefully behind in the fog at the airport Using Pickering & Traxler's (2003) subcat bias data: P(Transitive | landed) = 0.016 P(Intransitive | landed) = 0.004 van Schijndel Frequencies that matter September 7, / 69
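Putting the appendix figures together, a small sketch that reproduces the interpretation probabilities quoted above (rule probabilities and the TV:IV prior from the WSJ numbers on these slides, subcat biases from the cloze norms):

```python
def interpretation_prob(rule_prob, subcat_given_verb, preterminal_prior):
    """Rule probability times the verb's subcat bias rescaled by the
    preterminal prior (Bayes' rule), as on the preceding slides."""
    return rule_prob * subcat_given_verb / preterminal_prior

p_trans = interpretation_prob(rule_prob=0.17, subcat_given_verb=0.25, preterminal_prior=2.6)
p_intrans = interpretation_prob(rule_prob=0.01, subcat_given_verb=0.40, preterminal_prior=1.0)

print(round(p_trans, 3), round(p_intrans, 3))  # ~0.016 vs 0.004
print(p_trans / p_intrans)                     # transitive about 4x (300% more) likely
```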


More information

INF4820: Algorithms for Artificial Intelligence and Natural Language Processing. Hidden Markov Models

INF4820: Algorithms for Artificial Intelligence and Natural Language Processing. Hidden Markov Models INF4820: Algorithms for Artificial Intelligence and Natural Language Processing Hidden Markov Models Murhaf Fares & Stephan Oepen Language Technology Group (LTG) October 27, 2016 Recap: Probabilistic Language

More information

N-gram Language Modeling

N-gram Language Modeling N-gram Language Modeling Outline: Statistical Language Model (LM) Intro General N-gram models Basic (non-parametric) n-grams Class LMs Mixtures Part I: Statistical Language Model (LM) Intro What is a statistical

More information

Better! Faster! Stronger*!

Better! Faster! Stronger*! Jason Eisner Jiarong Jiang He He Better! Faster! Stronger*! Learning to balance accuracy and efficiency when predicting linguistic structures (*theorems) Hal Daumé III UMD CS, UMIACS, Linguistics me@hal3.name

More information

Natural Language Processing

Natural Language Processing Natural Language Processing Info 159/259 Lecture 12: Features and hypothesis tests (Oct 3, 2017) David Bamman, UC Berkeley Announcements No office hours for DB this Friday (email if you d like to chat)

More information

CS 662 Sample Midterm

CS 662 Sample Midterm Name: 1 True/False, plus corrections CS 662 Sample Midterm 35 pts, 5 pts each Each of the following statements is either true or false. If it is true, mark it true. If it is false, correct the statement

More information

The Language Modeling Problem (Fall 2007) Smoothed Estimation, and Language Modeling. The Language Modeling Problem (Continued) Overview

The Language Modeling Problem (Fall 2007) Smoothed Estimation, and Language Modeling. The Language Modeling Problem (Continued) Overview The Language Modeling Problem We have some (finite) vocabulary, say V = {the, a, man, telescope, Beckham, two, } 6.864 (Fall 2007) Smoothed Estimation, and Language Modeling We have an (infinite) set of

More information

Prenominal Modifier Ordering via MSA. Alignment

Prenominal Modifier Ordering via MSA. Alignment Introduction Prenominal Modifier Ordering via Multiple Sequence Alignment Aaron Dunlop Margaret Mitchell 2 Brian Roark Oregon Health & Science University Portland, OR 2 University of Aberdeen Aberdeen,

More information

10/17/04. Today s Main Points

10/17/04. Today s Main Points Part-of-speech Tagging & Hidden Markov Model Intro Lecture #10 Introduction to Natural Language Processing CMPSCI 585, Fall 2004 University of Massachusetts Amherst Andrew McCallum Today s Main Points

More information

A* Search. 1 Dijkstra Shortest Path

A* Search. 1 Dijkstra Shortest Path A* Search Consider the eight puzzle. There are eight tiles numbered 1 through 8 on a 3 by three grid with nine locations so that one location is left empty. We can move by sliding a tile adjacent to the

More information

Dependency Parsing. Statistical NLP Fall (Non-)Projectivity. CoNLL Format. Lecture 9: Dependency Parsing

Dependency Parsing. Statistical NLP Fall (Non-)Projectivity. CoNLL Format. Lecture 9: Dependency Parsing Dependency Parsing Statistical NLP Fall 2016 Lecture 9: Dependency Parsing Slav Petrov Google prep dobj ROOT nsubj pobj det PRON VERB DET NOUN ADP NOUN They solved the problem with statistics CoNLL Format

More information

National Centre for Language Technology School of Computing Dublin City University

National Centre for Language Technology School of Computing Dublin City University with with National Centre for Language Technology School of Computing Dublin City University Parallel treebanks A parallel treebank comprises: sentence pairs parsed word-aligned tree-aligned (Volk & Samuelsson,

More information

Empirical Methods in Natural Language Processing Lecture 11 Part-of-speech tagging and HMMs

Empirical Methods in Natural Language Processing Lecture 11 Part-of-speech tagging and HMMs Empirical Methods in Natural Language Processing Lecture 11 Part-of-speech tagging and HMMs (based on slides by Sharon Goldwater and Philipp Koehn) 21 February 2018 Nathan Schneider ENLP Lecture 11 21

More information

Transformational Priors Over Grammars

Transformational Priors Over Grammars Transformational Priors Over Grammars Jason Eisner Johns Hopkins University July 6, 2002 EMNLP This talk is called Transformational Priors Over Grammars. It should become clear what I mean by a prior over

More information

Analysing Soft Syntax Features and Heuristics for Hierarchical Phrase Based Machine Translation

Analysing Soft Syntax Features and Heuristics for Hierarchical Phrase Based Machine Translation Analysing Soft Syntax Features and Heuristics for Hierarchical Phrase Based Machine Translation David Vilar, Daniel Stein, Hermann Ney IWSLT 2008, Honolulu, Hawaii 20. October 2008 Human Language Technology

More information

Bringing machine learning & compositional semantics together: central concepts

Bringing machine learning & compositional semantics together: central concepts Bringing machine learning & compositional semantics together: central concepts https://githubcom/cgpotts/annualreview-complearning Chris Potts Stanford Linguistics CS 244U: Natural language understanding

More information

Natural Language Processing

Natural Language Processing Natural Language Processing Info 159/259 Lecture 15: Review (Oct 11, 2018) David Bamman, UC Berkeley Big ideas Classification Naive Bayes, Logistic regression, feedforward neural networks, CNN. Where does

More information

Semantics 2 Part 1: Relative Clauses and Variables

Semantics 2 Part 1: Relative Clauses and Variables Semantics 2 Part 1: Relative Clauses and Variables Sam Alxatib EVELIN 2012 January 17, 2012 Reviewing Adjectives Adjectives are treated as predicates of individuals, i.e. as functions from individuals

More information

SYNTHER A NEW M-GRAM POS TAGGER

SYNTHER A NEW M-GRAM POS TAGGER SYNTHER A NEW M-GRAM POS TAGGER David Sündermann and Hermann Ney RWTH Aachen University of Technology, Computer Science Department Ahornstr. 55, 52056 Aachen, Germany {suendermann,ney}@cs.rwth-aachen.de

More information

Lecture 9: Hidden Markov Model

Lecture 9: Hidden Markov Model Lecture 9: Hidden Markov Model Kai-Wei Chang CS @ University of Virginia kw@kwchang.net Couse webpage: http://kwchang.net/teaching/nlp16 CS6501 Natural Language Processing 1 This lecture v Hidden Markov

More information

Lecture Notes on Inductive Definitions

Lecture Notes on Inductive Definitions Lecture Notes on Inductive Definitions 15-312: Foundations of Programming Languages Frank Pfenning Lecture 2 August 28, 2003 These supplementary notes review the notion of an inductive definition and give

More information

Recap: Language models. Foundations of Natural Language Processing Lecture 4 Language Models: Evaluation and Smoothing. Two types of evaluation in NLP

Recap: Language models. Foundations of Natural Language Processing Lecture 4 Language Models: Evaluation and Smoothing. Two types of evaluation in NLP Recap: Language models Foundations of atural Language Processing Lecture 4 Language Models: Evaluation and Smoothing Alex Lascarides (Slides based on those from Alex Lascarides, Sharon Goldwater and Philipp

More information

Maximum Entropy Models for Natural Language Processing

Maximum Entropy Models for Natural Language Processing Maximum Entropy Models for Natural Language Processing James Curran The University of Sydney james@it.usyd.edu.au 6th December, 2004 Overview a brief probability and statistics refresher statistical modelling

More information

NLP Homework: Dependency Parsing with Feed-Forward Neural Network

NLP Homework: Dependency Parsing with Feed-Forward Neural Network NLP Homework: Dependency Parsing with Feed-Forward Neural Network Submission Deadline: Monday Dec. 11th, 5 pm 1 Background on Dependency Parsing Dependency trees are one of the main representations used

More information