Acquiring Strongly-related Events using Predicate-argument Co-occurring Statistics and Caseframe

Size: px
Start display at page:

Download "Acquiring Strongly-related Events using Predicate-argument Co-occurring Statistics and Caseframe"

Transcription

1 Web 96% 79.1% 2 Acquiring Strongly-related Events using Predicate-argument Co-occurring Statistics and Caseframe Tomohide Shibata 1 and Sadao Kurohashi 1 This paper proposes a method for automatically acquiring strongly-related events from a large corpus using predicate-argument co-occurring statistics and caseframe. The co-occurrence measure is calculated using an association rule mining method, and the importance of an argument for each predicateargument is judged. Then, the argument alignment in the pair of predicatearguments is performed by using a caseframe. We conducted experiments using a Web corpus consisting of 1.6G sentences. The accuracy for the extracted event pairs was 96%, and the accuracy of the argument alignment was 79.1%. The number of acquired event pairs was about 20 thousands. 1 Kyoto University 1. 1) ( ) 2) 3) P A 1 P A 2 A 1 :{,,...} A 1 :{,,...} A 2 :{,...} A 2 :{,...} A 3 :{ } A 1 A 2 P A 1 P A 2 A 3 P A 2 P A 2 Chambers 4),5) () (1) a. b. (1-a) P A 2 (1-b) P A 1 Chambers 2 P A 1 A 2 : {,...} P A 2 A 3 : { } 1 c 2011 Information Processing Society of Japan

2 A 2:{,...} A 3:{ } P A 1 A 2 :{,...} P A 2 P A 1 A 1 :{,,...} P A WordNet 6) WordNet LifeNet 7) 8 41 EventNet Openmind Commonsense Knowledge Base 8) Regneri Amazon Mechanical Turk 9) Lin 10) X is the author of Y X wrote Y X,Y Chambers 4),5) accused XX claimedx argueddismissed X / 12) - 13) 14) 3. 1 Web P A 1 P A 2 P A 1 P A 2 15) 1) c 2011 Information Processing Society of Japan

3 Web コーパス 述語項構造ペアの抽出 PA1 彼ガ財布ヲ拾う 財布ヲ拾う ドライバーガ財布ヲ拾う PA2 警察ニ届ける 警察ニ届ける 届ける 1 P A 1 P A 2 拾う : 10 ガ ヲ 男, 女の子, 財布, 電話, 格フレームに基づく項のアライメント PA1 財布ヲ拾う 警察ニ届ける A1 : { 人, 男, } ガ A2 : { 財布, } ヲ 1 述語項構造ペアの共起度計算 拾う PA2 届ける : 20 ガ ヲ ニ 男, 人, 財布, 金, 警察, 交番, A1 : { 人, 男, } ガ A2 : { 財布, } ヲ 届ける A3 : { 警察 } ニ P A 1 P A 2 P A 1 P A 2 4. (2) a. b. 2 77,, 105,,, 502,, ID, 956,, 1829,, 1901,, 1 (P A 1 P A 2 ) P A 1 P A 2 P A 2 P A 1 ( P A 1 P A 2 ) P A 2 ( P A 2 ) 16) - 2,000 2 n (P (c n)) c 77 P A 1:, P A 2: P A 1: 3 c 2011 Information Processing Society of Japan

4 77, P A 2: 77 P A 1 :, P A 2 : P A 1 : 77, P A 2 : ) ) I = I 1, I 2,, I m t (t I) T (T = t 1, t 2,, t n ) X Y (X, Y I, X Y = φ) X Y X antecedent (left-hand side, lhs)y consequent (right-hand side, rhs) 3 support confidence lift support(x Y ) = C(X Y ) T confidence(x Y ) = C(X Y ) C(X) lift(x Y ) = confidence(x Y ) support(y ) = support(x Y ) support(x) C(X) X support XY confidence X Y lift X Y (1) (2) (3) 3 () P A 1 P A , , - Apriori 17) abc t 1 abcd t 2 t 1 t 2 Apriori support confidence 5.2 Apriori Apriori 4 3 X P A 1 P A 1 0 Y P A 2 P A 2 0 lift lift-min lift-max lift-max Apriori 3 ( 1 ) - - ( 2 ) - 4 c 2011 Information Processing Society of Japan

5 4 () :1 (2), (2), (3513), (80), :10 (4), (2), (580), (136), :1 (164), (144), (103400), (4797), :20 (11), (8), (8), (6), (2587), P A 1 - P A 2 - P A P A 1 P A 2 P A 1P A Web 1) 4 P A 1 cf 1 P A 2 cf 2 P A 1 P A 2 ( 1 ) P A 1 P A 2 5 (2) P A 2 ( 2 ) argmax cf 1,cf 2 max a sim(arg 1, a(arg 1)) (4) a a a P A 1 P A 2 arg 1 P A 1 a(arg 1 ) arg 1 P A 2 a arg1 a(arg 1 ) sim arg 1 a(arg 1) cosine :10:20 sim 2 cosine :10 ( 4, 2, 2, ) :20 ( 11, 8, 0, ) P A 1 P A 2 P A 1 10 P A 2 20 P A 1 P A 2,, c 2011 Information Processing Society of Japan

6 5 96(96.0%) 4(4.0%) 76(79.1%) 20(20.8%) 7 ( 6 ) P A 1 P A 2 6 (5 ) P A 1 P A 2 (1) - (2) - - (3) (4) (5) (6) (7) - (8) - - JUMAN 1 KNP Apriori support confidence lift-min, lift-max 1010, ) 30, (1) (2) A 1 :{,,...} A1 :{,,...} A 2 :{ } A 1 :{,,,...} A 1 :{,,,...} A 2 :{ } A 3 :{ } (3) A 1 :{,,,...} A 1 :{,,,...} A 1 :{,...} (4) A 1:{,...} A 2 :{,,...} A 2 :{,,...} (5) A 1 :{,,...} A 2 :{,,...} A 1 :{,,...} A 2 :{,,...} A 3 :{ } A 1 :{,,...} (6) A 2:{,,...} A 2 :{,,...} A 1 :{,,...} (7) A 1 :{ } A 1 :{ } 5 96% 6 ( 6 (8)) % 7 7 (6) P A 1 A 1 P A 2 A 1 A 1 A 3 A 1 :{,,...} A 2:{,,...} A 3 :{,,...} A 2:{,,...} A 3 :{,,...} 6 c 2011 Information Processing Society of Japan

7 7 (7) P A 2 P A 1 P A 1 P A 2 A 2 :{,,,...} A 2:{,,,...} A 1 :{ } 8 ( ) P A 1 P A (3,768 / 23,180) (549 / 1,944) (474 / 2,689) (753 / 2,764) (7,106 / 14,713) (1,054 / 3,284) (344 / 2,113) (1,042 / 3,086) (549 / 1,944) ) ( F ) ) Web 2 w v e(w, d)e(v, g) w d v g d g e(w, d) e(v, g) pmi(e(w, d), e(v, g)) = log P (e(w, d), e(v, g)) P (e(w, d))p (e(v, g)) k (k 5 ) 8 P A 1 P A 2 P A 1P A (5) 2 ( lift ) Chamber c 2011 Information Processing Society of Japan

8 3 ([, ] ) RTE(Recognizing Textual Entailment) 1) Kawahara, D. and Kurohashi, S.: A Fully-Lexicalized Probabilistic Model for Japanese Syntactic and Case Structure Analysis, Proceedings of the HLT- NAACL2006, pp (2006). 2) Bean, D. and Riloff, E.: Unsupervised Learning of Contextual Role Knowledge for Coreference Resolution, HLT-NAACL 2004: Main Proceedings, pp (2004). 3) Gerber, M. and Chai, J.: Beyond NomBank: A Study of Implicit Arguments for Nominal Predicates, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp (2010). 4) Chambers, N. and Jurafsky, D.: Unsupervised Learning of Narrative Event Chains, Proceedings of ACL-08: HLT, pp (2008). 5) Chambers, N. and Jurafsky, D.: Unsupervised Learning of Narrative Schemas and their Participants, Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pp (2009). 6) Miller, G. A.: Wordnet: A lexical detabase for English, Communications of the ACM (1995). 7) Singh, P. and Williams, W.: LifeNet: A Propositional Model of Ordinary Human Activity, Proceedings of Workshop on Distributed and Collaborative Knowledge Capture (2003). 8) Espinosa, J. and Lieberman, H.: EventNet: Inferring Temporal Relations Between Commonsense Events, Proceedings of the 4th Mexican International Conference on Artificial Intelligence, pp (2005). 9) Regneri, M., Koller, A. and Pinkal, M.: Learning Script Knowledge with Web Experiments, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp (2010). 10) Lin, D. and Pantel, P.: Discovery of Inference Rules for Question Answering, Natural Language Engineering, Vol.7, No.4, pp (2001). 11) Szpektor, I. and Dagan, I.: Learning Entailment Rules for Unary Templates, Proceedings of the 22nd International Conference on Computational Linguistics (COL- ING), pp (2008). 12) Fujiki, T., Nanba, H. and Okumura, M.: Automatic Acquisition of Script Knowledge from a Text Collection, Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2003), pp (2003). 13) Torisawa, K.: Acquiring Inference Rules with Temporal Constraints by using Japanese Coordinated Sentences and Noun-Verb Co-occurrences, Proceedings of Human Language Technology Conference/North American chapter of the Association for Computational Linguistics annual meeting (HLT-NAACL06), pp (2006). 14) Abe, S., Inui, K. and Matsumoto, Y.: Two-phased event relation acquisition: coupling the relation-oriented and argument-oriented approaches, Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pp. 1 8 (2008). 15) Agrawal, R., Imielinski, T. and Swami, A.: Mining association rules between sets of items in large databases, Proceedings of the ACM-SIGMOD 1993 International Conference on Management of Data (1993), pp (1993). 16) Kazama, J. and Torisawa, K.: Inducing Gazetteers for Named Entity Recognition by Large-Scale Clustering of Dependency Relations, Proceedings of ACL-08: HLT, pp (2008). 17) Borgelt, C. and Kruse, R.: Induction of Association Rules: Apriori Implementation, Proceedings of 15th Conference on Computational Statistics, pp (2002). 18) Sasano, R., Kawahara, D. and Kurohashi, S.: Improving Coreference Resolution Using Bridging Reference Resolution and Automatically Acquired Synonyms, Discourse Anaphora and Anaphor Resolution Colloquium, pp (2007). 8 c 2011 Information Processing Society of Japan

Automatically Evaluating Text Coherence using Anaphora and Coreference Resolution

Automatically Evaluating Text Coherence using Anaphora and Coreference Resolution 1 1 Barzilay 1) Automatically Evaluating Text Coherence using Anaphora and Coreference Resolution Ryu Iida 1 and Takenobu Tokunaga 1 We propose a metric for automatically evaluating discourse coherence

More information

A Fully-Lexicalized Probabilistic Model for Japanese Zero Anaphora Resolution

A Fully-Lexicalized Probabilistic Model for Japanese Zero Anaphora Resolution A Fully-Lexicalized Probabilistic Model for Japanese Zero Anaphora Resolution Ryohei Sasano Graduate School of Information Science and Technology, University of Tokyo ryohei@nlp.kuee.kyoto-u.ac.jp Daisuke

More information

A Hierarchical Bayesian Model for Unsupervised Induction of Script Knowledge

A Hierarchical Bayesian Model for Unsupervised Induction of Script Knowledge A Hierarchical Bayesian Model for Unsupervised Induction of Script Knowledge Lea Frermann Ivan Titov Manfred Pinkal April, 28th 2014 1 / 22 Contents 1 Introduction 2 Technical Background 3 The Script Model

More information

Recognizing Implicit Discourse Relations through Abductive Reasoning with Large-scale Lexical Knowledge

Recognizing Implicit Discourse Relations through Abductive Reasoning with Large-scale Lexical Knowledge Recognizing Implicit Discourse Relations through Abductive Reasoning with Large-scale Lexical Knowledge Jun Sugiura, Naoya Inoue, and Kentaro Inui Tohoku University, 6-3-09 Aoba, Aramaki, Aoba-ku, Sendai,

More information

Mining coreference relations between formulas and text using Wikipedia

Mining coreference relations between formulas and text using Wikipedia Mining coreference relations between formulas and text using Wikipedia Minh Nghiem Quoc 1, Keisuke Yokoi 2, Yuichiroh Matsubayashi 3 Akiko Aizawa 1 2 3 1 Department of Informatics, The Graduate University

More information

Removing trivial associations in association rule discovery

Removing trivial associations in association rule discovery Removing trivial associations in association rule discovery Geoffrey I. Webb and Songmao Zhang School of Computing and Mathematics, Deakin University Geelong, Victoria 3217, Australia Abstract Association

More information

Chinese Zero Pronoun Resolution: A Joint Unsupervised Discourse-Aware Model Rivaling State-of-the-Art Resolvers

Chinese Zero Pronoun Resolution: A Joint Unsupervised Discourse-Aware Model Rivaling State-of-the-Art Resolvers Chinese Zero Pronoun Resolution: A Joint Unsupervised Discourse-Aware Model Rivaling State-of-the-Art Resolvers Chen Chen and Vincent Ng Human Language Technology Research Institute University of Texas

More information

Fertilization of Case Frame Dictionary for Robust Japanese Case Analysis

Fertilization of Case Frame Dictionary for Robust Japanese Case Analysis Fertilization of Case Frame Dictionary for Robust Japanese Case Analysis Daisuke Kawahara and Sadao Kurohashi Graduate School of Information Science and Technology, University of Tokyo PRESTO, Japan Science

More information

Excitatory or Inhibitory: A New Semantic Orientation Extracts Contradiction and Causality from the Web

Excitatory or Inhibitory: A New Semantic Orientation Extracts Contradiction and Causality from the Web Excitatory or Inhibitory: A New Semantic Orientation Extracts Contradiction and Causality from the Web Chikara Hashimoto Kentaro Torisawa Stijn De Saeger Jong-Hoon Oh Jun ichi Kazama National Institute

More information

Utilizing Portion of Patent Families with No Parallel Sentences Extracted in Estimating Translation of Technical Terms

Utilizing Portion of Patent Families with No Parallel Sentences Extracted in Estimating Translation of Technical Terms 1 1 1 2 2 30% 70% 70% NTCIR-7 13% 90% 1,000 Utilizing Portion of Patent Families with No Parallel Sentences Extracted in Estimating Translation of Technical Terms Itsuki Toyota 1 Yusuke Takahashi 1 Kensaku

More information

An Empirical Study on Dimensionality Optimization in Text Mining for Linguistic Knowledge Acquisition

An Empirical Study on Dimensionality Optimization in Text Mining for Linguistic Knowledge Acquisition An Empirical Study on Dimensionality Optimization in Text Mining for Linguistic Knowledge Acquisition Yu-Seop Kim 1, Jeong-Ho Chang 2, and Byoung-Tak Zhang 2 1 Division of Information and Telecommunication

More information

P R + RQ P Q: Transliteration Mining Using Bridge Language

P R + RQ P Q: Transliteration Mining Using Bridge Language P R + RQ P Q: Transliteration Mining Using Bridge Language Mitesh M. Khapra Raghavendra Udupa n Institute of Technology Microsoft Research, Bombay, Bangalore, Powai, Mumbai 400076 raghavu@microsoft.com

More information

Determining Word Sense Dominance Using a Thesaurus

Determining Word Sense Dominance Using a Thesaurus Determining Word Sense Dominance Using a Thesaurus Saif Mohammad and Graeme Hirst Department of Computer Science University of Toronto EACL, Trento, Italy (5th April, 2006) Copyright cfl2006, Saif Mohammad

More information

Capturing Salience with a Trainable Cache Model for Zero-anaphora Resolution

Capturing Salience with a Trainable Cache Model for Zero-anaphora Resolution Capturing Salience with a Trainable Cache Model for Zero-anaphora Resolution Ryu Iida Department of Computer Science Tokyo Institute of Technology 2-12-1, Ôokayama, Meguro, Tokyo 152-8552, Japan ryu-i@cl.cs.titech.ac.jp

More information

Toponym Disambiguation using Ontology-based Semantic Similarity

Toponym Disambiguation using Ontology-based Semantic Similarity Toponym Disambiguation using Ontology-based Semantic Similarity David S Batista 1, João D Ferreira 2, Francisco M Couto 2, and Mário J Silva 1 1 IST/INESC-ID Lisbon, Portugal {dsbatista,msilva}@inesc-id.pt

More information

A Surface-Similarity Based Two-Step Classifier for RITE-VAL

A Surface-Similarity Based Two-Step Classifier for RITE-VAL A Surface-Similarity Based Two-Step Classifier for RITE-VAL Shohei Hattori Satoshi Sato Graduate School of Engineering, Nagoya University Furo-cho, Chikusa-ku, Nagoya, 464-8603, JAPAN {syohei_h,ssato}@nuee.nagoya-u.ac.jp

More information

Proposition Knowledge Graphs. Gabriel Stanovsky Omer Levy Ido Dagan Bar-Ilan University Israel

Proposition Knowledge Graphs. Gabriel Stanovsky Omer Levy Ido Dagan Bar-Ilan University Israel Proposition Knowledge Graphs Gabriel Stanovsky Omer Levy Ido Dagan Bar-Ilan University Israel 1 Problem End User 2 Case Study: Curiosity (Mars Rover) Curiosity is a fully equipped lab. Curiosity is a rover.

More information

Assignment 7 (Sol.) Introduction to Data Analytics Prof. Nandan Sudarsanam & Prof. B. Ravindran

Assignment 7 (Sol.) Introduction to Data Analytics Prof. Nandan Sudarsanam & Prof. B. Ravindran Assignment 7 (Sol.) Introduction to Data Analytics Prof. Nandan Sudarsanam & Prof. B. Ravindran 1. Let X, Y be two itemsets, and let denote the support of itemset X. Then the confidence of the rule X Y,

More information

An Introduction to String Re-Writing Kernel

An Introduction to String Re-Writing Kernel An Introduction to String Re-Writing Kernel Fan Bu 1, Hang Li 2 and Xiaoyan Zhu 3 1,3 State Key Laboratory of Intelligent Technology and Systems 1,3 Tsinghua National Laboratory for Information Sci. and

More information

Learning Features from Co-occurrences: A Theoretical Analysis

Learning Features from Co-occurrences: A Theoretical Analysis Learning Features from Co-occurrences: A Theoretical Analysis Yanpeng Li IBM T. J. Watson Research Center Yorktown Heights, New York 10598 liyanpeng.lyp@gmail.com Abstract Representing a word by its co-occurrences

More information

Linking people in videos with their names using coreference resolution (Supplementary Material)

Linking people in videos with their names using coreference resolution (Supplementary Material) Linking people in videos with their names using coreference resolution (Supplementary Material) Vignesh Ramanathan, Armand Joulin, Percy Liang, Li Fei-Fei Department of Electrical Engineering, Stanford

More information

Text Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University

Text Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University Text Mining Dr. Yanjun Li Associate Professor Department of Computer and Information Sciences Fordham University Outline Introduction: Data Mining Part One: Text Mining Part Two: Preprocessing Text Data

More information

Joint Inference for Event Timeline Construction

Joint Inference for Event Timeline Construction Joint Inference for Event Timeline Construction Quang Xuan Do Wei Lu Dan Roth Department of Computer Science University of Illinois at Urbana-Champaign Urbana, IL 61801, USA {quangdo2,luwei,danr}@illinois.edu

More information

A Continuous-Time Model of Topic Co-occurrence Trends

A Continuous-Time Model of Topic Co-occurrence Trends A Continuous-Time Model of Topic Co-occurrence Trends Wei Li, Xuerui Wang and Andrew McCallum Department of Computer Science University of Massachusetts 140 Governors Drive Amherst, MA 01003-9264 Abstract

More information

Latent Dirichlet Allocation Based Multi-Document Summarization

Latent Dirichlet Allocation Based Multi-Document Summarization Latent Dirichlet Allocation Based Multi-Document Summarization Rachit Arora Department of Computer Science and Engineering Indian Institute of Technology Madras Chennai - 600 036, India. rachitar@cse.iitm.ernet.in

More information

Learning Textual Entailment using SVMs and String Similarity Measures

Learning Textual Entailment using SVMs and String Similarity Measures Learning Textual Entailment using SVMs and String Similarity Measures Prodromos Malakasiotis and Ion Androutsopoulos Department of Informatics Athens University of Economics and Business Patision 76, GR-104

More information

A Linguistic Inspection of Textual Entailment

A Linguistic Inspection of Textual Entailment A Linguistic Inspection of Textual Entailment Maria Teresa Pazienza 1, Marco Pennacchiotti 1, and Fabio Massimo Zanzotto 2 1 University of Roma Tor Vergata, Via del Politecnico 1, Roma, Italy, {pazienza,

More information

Learning Scripts as Hidden Markov Models

Learning Scripts as Hidden Markov Models Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence Learning Scripts as Hidden Markov Models J. Walker Orr, Prasad Tadepalli, Janardhan Rao Doppa, Xiaoli Fern, Thomas G. Dietterich

More information

Coreference Resolution with! ILP-based Weighted Abduction

Coreference Resolution with! ILP-based Weighted Abduction Coreference Resolution with! ILP-based Weighted Abduction Naoya Inoue, Ekaterina Ovchinnikova, Kentaro Inui, Jerry Hobbs Tohoku University, Japan! ISI/USC, USA Motivation Long-term goal: unified framework

More information

Annotating Spatial Containment Relations Between Events

Annotating Spatial Containment Relations Between Events Annotating Spatial Containment Relations Between Events Kirk Roberts, Travis Goodwin, Sanda M. Harabagiu Human Language Technology Research Institute University of Texas at Dallas Richardson TX 75080 {kirk,travis,sanda}@hlt.utdallas.edu

More information

CLRG Biocreative V

CLRG Biocreative V CLRG ChemTMiner @ Biocreative V Sobha Lalitha Devi., Sindhuja Gopalan., Vijay Sundar Ram R., Malarkodi C.S., Lakshmi S., Pattabhi RK Rao Computational Linguistics Research Group, AU-KBC Research Centre

More information

Data Mining and Knowledge Discovery. Petra Kralj Novak. 2011/11/29

Data Mining and Knowledge Discovery. Petra Kralj Novak. 2011/11/29 Data Mining and Knowledge Discovery Petra Kralj Novak Petra.Kralj.Novak@ijs.si 2011/11/29 1 Practice plan 2011/11/08: Predictive data mining 1 Decision trees Evaluating classifiers 1: separate test set,

More information

Association Rule. Lecturer: Dr. Bo Yuan. LOGO

Association Rule. Lecturer: Dr. Bo Yuan. LOGO Association Rule Lecturer: Dr. Bo Yuan LOGO E-mail: yuanb@sz.tsinghua.edu.cn Overview Frequent Itemsets Association Rules Sequential Patterns 2 A Real Example 3 Market-Based Problems Finding associations

More information

Statistical Machine Learning Theory. From Multi-class Classification to Structured Output Prediction. Hisashi Kashima.

Statistical Machine Learning Theory. From Multi-class Classification to Structured Output Prediction. Hisashi Kashima. http://goo.gl/jv7vj9 Course website KYOTO UNIVERSITY Statistical Machine Learning Theory From Multi-class Classification to Structured Output Prediction Hisashi Kashima kashima@i.kyoto-u.ac.jp DEPARTMENT

More information

Statistical Machine Learning Theory. From Multi-class Classification to Structured Output Prediction. Hisashi Kashima.

Statistical Machine Learning Theory. From Multi-class Classification to Structured Output Prediction. Hisashi Kashima. http://goo.gl/xilnmn Course website KYOTO UNIVERSITY Statistical Machine Learning Theory From Multi-class Classification to Structured Output Prediction Hisashi Kashima kashima@i.kyoto-u.ac.jp DEPARTMENT

More information

Text Mining. March 3, March 3, / 49

Text Mining. March 3, March 3, / 49 Text Mining March 3, 2017 March 3, 2017 1 / 49 Outline Language Identification Tokenisation Part-Of-Speech (POS) tagging Hidden Markov Models - Sequential Taggers Viterbi Algorithm March 3, 2017 2 / 49

More information

KSU Team s System and Experience at the NTCIR-11 RITE-VAL Task

KSU Team s System and Experience at the NTCIR-11 RITE-VAL Task KSU Team s System and Experience at the NTCIR-11 RITE-VAL Task Tasuku Kimura Kyoto Sangyo University, Japan i1458030@cse.kyoto-su.ac.jp Hisashi Miyamori Kyoto Sangyo University, Japan miya@cse.kyoto-su.ac.jp

More information

(1) [John (i) ] attacked [Bob (j) ]. Police arrested him (i).

(1) [John (i) ] attacked [Bob (j) ]. Police arrested him (i). 1,a) 1,b) 1,c) 1,d) 1,e) e.g., man arrest e.g., man arrest man [Van de Crus 2014]. 1. [1], [2], [3] [4] [5] [6], [7], [8], [9] [10], [11] 1 Tohoku Universit a) masauki.ono@ecei.tohoku.ac.jp b) naoa-i@ecei.tohoku.ac.jp

More information

The Winograd Schema Challenge and Reasoning about Correlation

The Winograd Schema Challenge and Reasoning about Correlation The Winograd Schema Challenge and Reasoning about Correlation Daniel Bailey University of Nebraska at Omaha dbailey@unomahaedu Abstract The Winograd Schema Challenge is an alternative to the Turing Test

More information

Classification & Information Theory Lecture #8

Classification & Information Theory Lecture #8 Classification & Information Theory Lecture #8 Introduction to Natural Language Processing CMPSCI 585, Fall 2007 University of Massachusetts Amherst Andrew McCallum Today s Main Points Automatically categorizing

More information

Sequences and Information

Sequences and Information Sequences and Information Rahul Siddharthan The Institute of Mathematical Sciences, Chennai, India http://www.imsc.res.in/ rsidd/ Facets 16, 04/07/2016 This box says something By looking at the symbols

More information

Chapter 2 Quality Measures in Pattern Mining

Chapter 2 Quality Measures in Pattern Mining Chapter 2 Quality Measures in Pattern Mining Abstract In this chapter different quality measures to evaluate the interest of the patterns discovered in the mining process are described. Patterns represent

More information

Discovery of Frequent Word Sequences in Text. Helena Ahonen-Myka. University of Helsinki

Discovery of Frequent Word Sequences in Text. Helena Ahonen-Myka. University of Helsinki Discovery of Frequent Word Sequences in Text Helena Ahonen-Myka University of Helsinki Department of Computer Science P.O. Box 26 (Teollisuuskatu 23) FIN{00014 University of Helsinki, Finland, helena.ahonen-myka@cs.helsinki.fi

More information

It s a Contradiction No, it s Not: A Case Study using Functional Relations

It s a Contradiction No, it s Not: A Case Study using Functional Relations It s a Contradiction No, it s Not: A Case Study using Functional Relations Alan Ritter, Doug Downey, Stephen Soderland and Oren Etzioni Turing Center Department of Computer Science and Engineering University

More information

An Approach to Classification Based on Fuzzy Association Rules

An Approach to Classification Based on Fuzzy Association Rules An Approach to Classification Based on Fuzzy Association Rules Zuoliang Chen, Guoqing Chen School of Economics and Management, Tsinghua University, Beijing 100084, P. R. China Abstract Classification based

More information

Annotation tasks and solutions in CLARIN-PL

Annotation tasks and solutions in CLARIN-PL Annotation tasks and solutions in CLARIN-PL Marcin Oleksy, Ewa Rudnicka Wrocław University of Technology marcin.oleksy@pwr.edu.pl ewa.rudnicka@pwr.edu.pl CLARIN ERIC Common Language Resources and Technology

More information

Be principled! A Probabilistic Model for Lexical Entailment

Be principled! A Probabilistic Model for Lexical Entailment Be principled! A Probabilistic Model for Lexical Entailment Eyal Shnarch Computer Science Department Bar-Ilan University Ramat-Gan, Israel shey@cs.biu.ac.il Jacob Goldberger School of Engineering Bar-Ilan

More information

First Order Logic Implication (4A) Young W. Lim 4/6/17

First Order Logic Implication (4A) Young W. Lim 4/6/17 First Order Logic (4A) Young W. Lim Copyright (c) 2016-2017 Young W. Lim. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version

More information

Integrating Order Information and Event Relation for Script Event Prediction

Integrating Order Information and Event Relation for Script Event Prediction Integrating Order Information and Event Relation for Script Event Prediction Zhongqing Wang 1,2, Yue Zhang 2 and Ching-Yun Chang 2 1 Soochow University, China 2 Singapore University of Technology and Design

More information

The Noisy Channel Model and Markov Models

The Noisy Channel Model and Markov Models 1/24 The Noisy Channel Model and Markov Models Mark Johnson September 3, 2014 2/24 The big ideas The story so far: machine learning classifiers learn a function that maps a data item X to a label Y handle

More information

D2: For each type 1 quantifier Q, Q acc (R) = {a : Q(aR) = 1}.

D2: For each type 1 quantifier Q, Q acc (R) = {a : Q(aR) = 1}. Some Formal Properties of Higher Order Anaphors R. Zuber Laboratoire de Linguistique Formelle, CNRS and University Paris-Diderot Richard.Zuber@linguist.univ-paris-diderot.fr Abstract Formal properties

More information

CPDA Based Fuzzy Association Rules for Learning Achievement Mining

CPDA Based Fuzzy Association Rules for Learning Achievement Mining 2009 International Conference on Machine Learning and Computing IPCSIT vol.3 (2011) (2011) IACSIT Press, Singapore CPDA Based Fuzzy Association Rules for Learning Achievement Mining Jr-Shian Chen 1, Hung-Lieh

More information

Reflexives and non-fregean quantifiers

Reflexives and non-fregean quantifiers UCLA Working Papers in Linguistics, Theories of Everything Volume 17, Article 49: 439-445, 2012 Reflexives and non-fregean quantifiers Richard Zuber It is shown that depending on the subject noun phrase

More information

EXTRACTION AND VISUALIZATION OF GEOGRAPHICAL NAMES IN TEXT

EXTRACTION AND VISUALIZATION OF GEOGRAPHICAL NAMES IN TEXT Abstract EXTRACTION AND VISUALIZATION OF GEOGRAPHICAL NAMES IN TEXT Xueying Zhang zhangsnowy@163.com Guonian Lv Zhiren Xie Yizhong Sun 210046 Key Laboratory of Virtual Geographical Environment (MOE) Naning

More information

BnO at NTCIR-10 RITE: A Strong Shallow Approach and an Inference-based Textual Entailment Recognition System

BnO at NTCIR-10 RITE: A Strong Shallow Approach and an Inference-based Textual Entailment Recognition System BnO at NTCIR-10 RITE: A Strong Shallow Approach and an Inference-based Textual Entailment Recognition System ABSTRACT Ran Tian National Institute of Informatics, Japan tianran@nii.ac.jp Takuya Matsuzaki

More information

Information Extraction and GATE. Valentin Tablan University of Sheffield Department of Computer Science NLP Group

Information Extraction and GATE. Valentin Tablan University of Sheffield Department of Computer Science NLP Group Information Extraction and GATE Valentin Tablan University of Sheffield Department of Computer Science NLP Group Information Extraction Information Extraction (IE) pulls facts and structured information

More information

Probabilistic Coordination Disambiguation in a Fully-lexicalized Japanese Parser

Probabilistic Coordination Disambiguation in a Fully-lexicalized Japanese Parser Probabilistic Coordination Disambiguation in a Fully-lexicalized Japanese Parser Daisuke Kawahara National Institute of Information and Communications Technology, 3-5 Hikaridai Seika-cho, Soraku-gun, Kyoto,

More information

Natural Language Processing : Probabilistic Context Free Grammars. Updated 5/09

Natural Language Processing : Probabilistic Context Free Grammars. Updated 5/09 Natural Language Processing : Probabilistic Context Free Grammars Updated 5/09 Motivation N-gram models and HMM Tagging only allowed us to process sentences linearly. However, even simple sentences require

More information

Latent Variable Models in NLP

Latent Variable Models in NLP Latent Variable Models in NLP Aria Haghighi with Slav Petrov, John DeNero, and Dan Klein UC Berkeley, CS Division Latent Variable Models Latent Variable Models Latent Variable Models Observed Latent Variable

More information

情報工学実験 4: データマイニング班 (week 6) 機械学習概観の振り返り

情報工学実験 4: データマイニング班 (week 6) 機械学習概観の振り返り 情報工学実験 4: データマイニング班 (week 6) 機械学習概観の振り返り 1. ( 復習 ) 用語 2. ( 復習 ) 機械学習における問題設定 3. ( 復習 )scikit-learnの使い方 4. ( 復習 ) モデルと仮説 5. ( 復習 ) 問題 アルゴリズム モデル 6. ( 復習 ) 過学習 7. ( 復習 ) ペナルティ項 ( 正則化 ) 8. ( 復習 ) 交差検定 実験ページ

More information

Global Machine Learning for Spatial Ontology Population

Global Machine Learning for Spatial Ontology Population Global Machine Learning for Spatial Ontology Population Parisa Kordjamshidi, Marie-Francine Moens KU Leuven, Belgium Abstract Understanding spatial language is important in many applications such as geographical

More information

Towards a Probabilistic Model for Lexical Entailment

Towards a Probabilistic Model for Lexical Entailment Towards a Probabilistic Model for Lexical Entailment Eyal Shnarch Computer Science Department Bar-Ilan University Ramat-Gan, Israel shey@cs.biu.ac.il Jacob Goldberger School of Engineering Bar-Ilan University

More information

Contexts for Quantification

Contexts for Quantification Contexts for Quantification Valeria de Paiva Stanford April, 2011 Valeria de Paiva (Stanford) C4Q April, 2011 1 / 28 Natural logic: what we want Many thanks to Larry, Ulrik for slides! Program Show that

More information

10/17/04. Today s Main Points

10/17/04. Today s Main Points Part-of-speech Tagging & Hidden Markov Model Intro Lecture #10 Introduction to Natural Language Processing CMPSCI 585, Fall 2004 University of Massachusetts Amherst Andrew McCallum Today s Main Points

More information

Semantics and Pragmatics of NLP Pronouns

Semantics and Pragmatics of NLP Pronouns Semantics and Pragmatics of NLP Pronouns School of Informatics University of Edinburgh Outline Observations About Data 1 Observations of what factors influence the way pronouns get resolved 2 Some algorithms

More information

Automatic Generation of Shogi Commentary with a Log-Linear Language Model

Automatic Generation of Shogi Commentary with a Log-Linear Language Model 1,a) 2, 1,b) 1,c) 3,d) 1,e) 2011 11 4, 2011 12 1 2 Automatic Generation of Shogi Commentary with a Log-Linear Language Model Hirotaka Kameko 1,a) Makoto Miwa 2, 1,b) Yoshimasa Tsuruoka 1,c) Shinsuke Mori

More information

Hidden Markov Models, I. Examples. Steven R. Dunbar. Toy Models. Standard Mathematical Models. Realistic Hidden Markov Models.

Hidden Markov Models, I. Examples. Steven R. Dunbar. Toy Models. Standard Mathematical Models. Realistic Hidden Markov Models. , I. Toy Markov, I. February 17, 2017 1 / 39 Outline, I. Toy Markov 1 Toy 2 3 Markov 2 / 39 , I. Toy Markov A good stack of examples, as large as possible, is indispensable for a thorough understanding

More information

Penn Treebank Parsing. Advanced Topics in Language Processing Stephen Clark

Penn Treebank Parsing. Advanced Topics in Language Processing Stephen Clark Penn Treebank Parsing Advanced Topics in Language Processing Stephen Clark 1 The Penn Treebank 40,000 sentences of WSJ newspaper text annotated with phrasestructure trees The trees contain some predicate-argument

More information

Discovering Classes of Strongly Equivalent Logic Programs with Negation as Failure in the Head

Discovering Classes of Strongly Equivalent Logic Programs with Negation as Failure in the Head Discovering Classes of Strongly Equivalent Logic Programs with Negation as Failure in the Head Jianmin Ji School of Computer Science and Technology University of Science and Technology of China Hefei,

More information

Topic #3 Predicate Logic. Predicate Logic

Topic #3 Predicate Logic. Predicate Logic Predicate Logic Predicate Logic Predicate logic is an extension of propositional logic that permits concisely reasoning about whole classes of entities. Propositional logic treats simple propositions (sentences)

More information

Semantic Similarity from Corpora - Latent Semantic Analysis

Semantic Similarity from Corpora - Latent Semantic Analysis Semantic Similarity from Corpora - Latent Semantic Analysis Carlo Strapparava FBK-Irst Istituto per la ricerca scientifica e tecnologica I-385 Povo, Trento, ITALY strappa@fbk.eu Overview Latent Semantic

More information

Encoding Tree Pair-based Graphs in Learning Algorithms: the Textual Entailment Recognition Case

Encoding Tree Pair-based Graphs in Learning Algorithms: the Textual Entailment Recognition Case Encoding Tree Pair-based Graphs in Learning Algorithms: the Textual Entailment Recognition Case Alessandro Moschitti DII University of Trento Via ommarive 14 38100 POVO (T) - Italy moschitti@dit.unitn.it

More information

An Improved Stemming Approach Using HMM for a Highly Inflectional Language

An Improved Stemming Approach Using HMM for a Highly Inflectional Language An Improved Stemming Approach Using HMM for a Highly Inflectional Language Navanath Saharia 1, Kishori M. Konwar 2, Utpal Sharma 1, and Jugal K. Kalita 3 1 Department of CSE, Tezpur University, India {nava

More information

Logical Agents. September 14, 2004

Logical Agents. September 14, 2004 Logical Agents September 14, 2004 The aim of AI is to develop intelligent agents that can reason about actions and their effects and about the environment, create plans to achieve a goal, execute the plans,

More information

Natural Language Processing CS Lecture 06. Razvan C. Bunescu School of Electrical Engineering and Computer Science

Natural Language Processing CS Lecture 06. Razvan C. Bunescu School of Electrical Engineering and Computer Science Natural Language Processing CS 6840 Lecture 06 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Statistical Parsing Define a probabilistic model of syntax P(T S):

More information

Introduction to Semantics. Common Nouns and Adjectives in Predicate Position 1

Introduction to Semantics. Common Nouns and Adjectives in Predicate Position 1 Common Nouns and Adjectives in Predicate Position 1 (1) The Lexicon of Our System at Present a. Proper Names: [[ Barack ]] = Barack b. Intransitive Verbs: [[ smokes ]] = [ λx : x D e. IF x smokes THEN

More information

Machine Learning for Interpretation of Spatial Natural Language in terms of QSR

Machine Learning for Interpretation of Spatial Natural Language in terms of QSR Machine Learning for Interpretation of Spatial Natural Language in terms of QSR Parisa Kordjamshidi 1, Joana Hois 2, Martijn van Otterlo 1, and Marie-Francine Moens 1 1 Katholieke Universiteit Leuven,

More information

Learning Random Walk Models for Inducing Word Dependency Distributions

Learning Random Walk Models for Inducing Word Dependency Distributions Learning Random Walk Models for Inducing Word Dependency Distributions Kristina Toutanova Christopher D. Manning Andrew Y. Ng Computer Science Department, Stanford University, Stanford, CA 94305, USA Abstract

More information

The Benefits of a Model of Annotation

The Benefits of a Model of Annotation The Benefits of a Model of Annotation Rebecca J. Passonneau and Bob Carpenter Columbia University Center for Computational Learning Systems Department of Statistics LAW VII, August 2013 Conventional Approach

More information

Toponym Disambiguation by Arborescent Relationships

Toponym Disambiguation by Arborescent Relationships Journal of Computer Science 6 (6): 653-659, 2010 ISSN 1549-3636 2010 Science Publications Toponym Disambiguation by Arborescent Relationships Imene Bensalem and Mohamed-Khireddine Kholladi Department of

More information

Annotating Spatial Containment Relations Between Events. Kirk Roberts, Travis Goodwin, and Sanda Harabagiu

Annotating Spatial Containment Relations Between Events. Kirk Roberts, Travis Goodwin, and Sanda Harabagiu Annotating Spatial Containment Relations Between Events Kirk Roberts, Travis Goodwin, and Sanda Harabagiu Problem Previous Work Schema Corpus Future Work Problem Natural language documents contain a large

More information

Tuning as Linear Regression

Tuning as Linear Regression Tuning as Linear Regression Marzieh Bazrafshan, Tagyoung Chung and Daniel Gildea Department of Computer Science University of Rochester Rochester, NY 14627 Abstract We propose a tuning method for statistical

More information

Textual Entailment as a Directional Relation

Textual Entailment as a Directional Relation Textual Entailment as a Directional Relation Doina Tătar, Gabriela Şerban, Mihiş Andreea Department of Computer Science University Babes-Bolyai, Cluj-Napoca, Romania dtatar@cs.ubbcluj.ro, gabis@cs.ubbcluj.ro,

More information

A Syntax-based Statistical Machine Translation Model. Alexander Friedl, Georg Teichtmeister

A Syntax-based Statistical Machine Translation Model. Alexander Friedl, Georg Teichtmeister A Syntax-based Statistical Machine Translation Model Alexander Friedl, Georg Teichtmeister 4.12.2006 Introduction The model Experiment Conclusion Statistical Translation Model (STM): - mathematical model

More information

NUL System at NTCIR RITE-VAL tasks

NUL System at NTCIR RITE-VAL tasks NUL System at NTCIR RITE-VAL tasks Ai Ishii Nihon Unisys, Ltd. ai.ishi@unisys.co.jp Hiroshi Miyashita Nihon Unisys, Ltd. hiroshi.miyashita@unisys.co.jp Mio Kobayashi Chikara Hoshino Nihon Unisys, Ltd.

More information

Title 古典中国語 ( 漢文 ) の形態素解析とその応用 安岡, 孝一 ; ウィッテルン, クリスティアン ; 守岡, 知彦 ; 池田, 巧 ; 山崎, 直樹 ; 二階堂, 善弘 ; 鈴木, 慎吾 ; 師, 茂. Citation 情報処理学会論文誌 (2018), 59(2):

Title 古典中国語 ( 漢文 ) の形態素解析とその応用 安岡, 孝一 ; ウィッテルン, クリスティアン ; 守岡, 知彦 ; 池田, 巧 ; 山崎, 直樹 ; 二階堂, 善弘 ; 鈴木, 慎吾 ; 師, 茂. Citation 情報処理学会論文誌 (2018), 59(2): Title 古典中国語 ( 漢文 ) の形態素解析とその応用 Author(s) 安岡, 孝一 ; ウィッテルン, クリスティアン ; 守岡, 知彦 ; 池田, 巧 ; 山崎, 直樹 ; 二階堂, 善弘 ; 鈴木, 慎吾 ; 師, 茂 Citation 情報処理学会論文誌 (2018), 59(2): 323-331 Issue Date 2018-02-15 URL http://hdl.handle.net/2433/229121

More information

Maschinelle Sprachverarbeitung

Maschinelle Sprachverarbeitung Maschinelle Sprachverarbeitung Parsing with Probabilistic Context-Free Grammar Ulf Leser Content of this Lecture Phrase-Structure Parse Trees Probabilistic Context-Free Grammars Parsing with PCFG Other

More information

Location Name Disambiguation Exploiting Spatial Proximity and Temporal Consistency

Location Name Disambiguation Exploiting Spatial Proximity and Temporal Consistency Location Name Disambiguation Exploiting Spatial Proximity and Temporal Consistency Takashi Awamura Eiji Aramaki Daisuke Kawahara Tomohide Shibata Sadao Kurohashi Graduate School of Informatics, Kyoto University

More information

Mining Exceptional Relationships with Grammar-Guided Genetic Programming

Mining Exceptional Relationships with Grammar-Guided Genetic Programming Knowledge and Information Systems manuscript No. (will be inserted by the editor) Mining Exceptional Relationships with Grammar-Guided Genetic Programming J. M. Luna M. Pechenizkiy S. Ventura Received:

More information

2002 Journal of Software, )

2002 Journal of Software, ) 1000-9825/2002/13(04)0732-07 2002 Journal of Software Vol13 No4 ( 310027); ( 310027) E-mail hdstsysgn@cisezjueducn http//wwwzjueducn ; ; ; TP18 A ( ) (anaphora) 3 [1~3] GroszSidnerWalker Grosz Sidner PAL

More information

Maschinelle Sprachverarbeitung

Maschinelle Sprachverarbeitung Maschinelle Sprachverarbeitung Parsing with Probabilistic Context-Free Grammar Ulf Leser Content of this Lecture Phrase-Structure Parse Trees Probabilistic Context-Free Grammars Parsing with PCFG Other

More information

Latent Dirichlet Allocation Introduction/Overview

Latent Dirichlet Allocation Introduction/Overview Latent Dirichlet Allocation Introduction/Overview David Meyer 03.10.2016 David Meyer http://www.1-4-5.net/~dmm/ml/lda_intro.pdf 03.10.2016 Agenda What is Topic Modeling? Parametric vs. Non-Parametric Models

More information

Extraction of Opposite Sentiments in Classified Free Format Text Reviews

Extraction of Opposite Sentiments in Classified Free Format Text Reviews Extraction of Opposite Sentiments in Classified Free Format Text Reviews Dong (Haoyuan) Li 1, Anne Laurent 2, Mathieu Roche 2, and Pascal Poncelet 1 1 LGI2P - École des Mines d Alès, Parc Scientifique

More information

Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics

Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics Chin-Yew Lin and Franz Josef Och Information Sciences Institute University of Southern California

More information

Entropy as an Indicator of Context Boundaries An Experiment Using a Web Search Engine

Entropy as an Indicator of Context Boundaries An Experiment Using a Web Search Engine Entropy as an Indicator of Context Boundaries An Experiment Using a Web Search Engine Kumiko Tanaka-Ishii Graduate School of Information Science and Technology, University of Tokyo kumiko@i.u-tokyo.ac.jp

More information

A Model for Multimodal Reference Resolution

A Model for Multimodal Reference Resolution A Model for Multimodal Reference Resolution Luis Pineda National Autonomous University of Mexico (UNAM) Gabriela Garza An important aspect of the interpretation of multimodal messages is the ability to

More information

Mining Positive and Negative Fuzzy Association Rules

Mining Positive and Negative Fuzzy Association Rules Mining Positive and Negative Fuzzy Association Rules Peng Yan 1, Guoqing Chen 1, Chris Cornelis 2, Martine De Cock 2, and Etienne Kerre 2 1 School of Economics and Management, Tsinghua University, Beijing

More information

Recognizing Spatial Containment Relations between Event Mentions

Recognizing Spatial Containment Relations between Event Mentions Recognizing Spatial Containment Relations between Event Mentions Kirk Roberts Human Language Technology Research Institute University of Texas at Dallas kirk@hlt.utdallas.edu Sanda M. Harabagiu Human Language

More information

Domain Adaptation for Word Sense Disambiguation under the Problem of Covariate Shift

Domain Adaptation for Word Sense Disambiguation under the Problem of Covariate Shift Domain Adaptation for Word Sense Disambiguation under the Problem of Covariate Shift HIRONORI KIKUCHI 1,a) HIROYUKI SHINNOU 1,b) Abstract: Word sense disambiguation(wsd) is the task of identifying the

More information

Francisco M. Couto Mário J. Silva Pedro Coutinho

Francisco M. Couto Mário J. Silva Pedro Coutinho Francisco M. Couto Mário J. Silva Pedro Coutinho DI FCUL TR 03 29 Departamento de Informática Faculdade de Ciências da Universidade de Lisboa Campo Grande, 1749 016 Lisboa Portugal Technical reports are

More information