Acquiring Strongly-related Events using Predicate-argument Co-occurring Statistics and Caseframe
|
|
- Richard Norton
- 5 years ago
- Views:
Transcription
1 Web 96% 79.1% 2 Acquiring Strongly-related Events using Predicate-argument Co-occurring Statistics and Caseframe Tomohide Shibata 1 and Sadao Kurohashi 1 This paper proposes a method for automatically acquiring strongly-related events from a large corpus using predicate-argument co-occurring statistics and caseframe. The co-occurrence measure is calculated using an association rule mining method, and the importance of an argument for each predicateargument is judged. Then, the argument alignment in the pair of predicatearguments is performed by using a caseframe. We conducted experiments using a Web corpus consisting of 1.6G sentences. The accuracy for the extracted event pairs was 96%, and the accuracy of the argument alignment was 79.1%. The number of acquired event pairs was about 20 thousands. 1 Kyoto University 1. 1) ( ) 2) 3) P A 1 P A 2 A 1 :{,,...} A 1 :{,,...} A 2 :{,...} A 2 :{,...} A 3 :{ } A 1 A 2 P A 1 P A 2 A 3 P A 2 P A 2 Chambers 4),5) () (1) a. b. (1-a) P A 2 (1-b) P A 1 Chambers 2 P A 1 A 2 : {,...} P A 2 A 3 : { } 1 c 2011 Information Processing Society of Japan
2 A 2:{,...} A 3:{ } P A 1 A 2 :{,...} P A 2 P A 1 A 1 :{,,...} P A WordNet 6) WordNet LifeNet 7) 8 41 EventNet Openmind Commonsense Knowledge Base 8) Regneri Amazon Mechanical Turk 9) Lin 10) X is the author of Y X wrote Y X,Y Chambers 4),5) accused XX claimedx argueddismissed X / 12) - 13) 14) 3. 1 Web P A 1 P A 2 P A 1 P A 2 15) 1) c 2011 Information Processing Society of Japan
3 Web コーパス 述語項構造ペアの抽出 PA1 彼ガ財布ヲ拾う 財布ヲ拾う ドライバーガ財布ヲ拾う PA2 警察ニ届ける 警察ニ届ける 届ける 1 P A 1 P A 2 拾う : 10 ガ ヲ 男, 女の子, 財布, 電話, 格フレームに基づく項のアライメント PA1 財布ヲ拾う 警察ニ届ける A1 : { 人, 男, } ガ A2 : { 財布, } ヲ 1 述語項構造ペアの共起度計算 拾う PA2 届ける : 20 ガ ヲ ニ 男, 人, 財布, 金, 警察, 交番, A1 : { 人, 男, } ガ A2 : { 財布, } ヲ 届ける A3 : { 警察 } ニ P A 1 P A 2 P A 1 P A 2 4. (2) a. b. 2 77,, 105,,, 502,, ID, 956,, 1829,, 1901,, 1 (P A 1 P A 2 ) P A 1 P A 2 P A 2 P A 1 ( P A 1 P A 2 ) P A 2 ( P A 2 ) 16) - 2,000 2 n (P (c n)) c 77 P A 1:, P A 2: P A 1: 3 c 2011 Information Processing Society of Japan
4 77, P A 2: 77 P A 1 :, P A 2 : P A 1 : 77, P A 2 : ) ) I = I 1, I 2,, I m t (t I) T (T = t 1, t 2,, t n ) X Y (X, Y I, X Y = φ) X Y X antecedent (left-hand side, lhs)y consequent (right-hand side, rhs) 3 support confidence lift support(x Y ) = C(X Y ) T confidence(x Y ) = C(X Y ) C(X) lift(x Y ) = confidence(x Y ) support(y ) = support(x Y ) support(x) C(X) X support XY confidence X Y lift X Y (1) (2) (3) 3 () P A 1 P A , , - Apriori 17) abc t 1 abcd t 2 t 1 t 2 Apriori support confidence 5.2 Apriori Apriori 4 3 X P A 1 P A 1 0 Y P A 2 P A 2 0 lift lift-min lift-max lift-max Apriori 3 ( 1 ) - - ( 2 ) - 4 c 2011 Information Processing Society of Japan
5 4 () :1 (2), (2), (3513), (80), :10 (4), (2), (580), (136), :1 (164), (144), (103400), (4797), :20 (11), (8), (8), (6), (2587), P A 1 - P A 2 - P A P A 1 P A 2 P A 1P A Web 1) 4 P A 1 cf 1 P A 2 cf 2 P A 1 P A 2 ( 1 ) P A 1 P A 2 5 (2) P A 2 ( 2 ) argmax cf 1,cf 2 max a sim(arg 1, a(arg 1)) (4) a a a P A 1 P A 2 arg 1 P A 1 a(arg 1 ) arg 1 P A 2 a arg1 a(arg 1 ) sim arg 1 a(arg 1) cosine :10:20 sim 2 cosine :10 ( 4, 2, 2, ) :20 ( 11, 8, 0, ) P A 1 P A 2 P A 1 10 P A 2 20 P A 1 P A 2,, c 2011 Information Processing Society of Japan
6 5 96(96.0%) 4(4.0%) 76(79.1%) 20(20.8%) 7 ( 6 ) P A 1 P A 2 6 (5 ) P A 1 P A 2 (1) - (2) - - (3) (4) (5) (6) (7) - (8) - - JUMAN 1 KNP Apriori support confidence lift-min, lift-max 1010, ) 30, (1) (2) A 1 :{,,...} A1 :{,,...} A 2 :{ } A 1 :{,,,...} A 1 :{,,,...} A 2 :{ } A 3 :{ } (3) A 1 :{,,,...} A 1 :{,,,...} A 1 :{,...} (4) A 1:{,...} A 2 :{,,...} A 2 :{,,...} (5) A 1 :{,,...} A 2 :{,,...} A 1 :{,,...} A 2 :{,,...} A 3 :{ } A 1 :{,,...} (6) A 2:{,,...} A 2 :{,,...} A 1 :{,,...} (7) A 1 :{ } A 1 :{ } 5 96% 6 ( 6 (8)) % 7 7 (6) P A 1 A 1 P A 2 A 1 A 1 A 3 A 1 :{,,...} A 2:{,,...} A 3 :{,,...} A 2:{,,...} A 3 :{,,...} 6 c 2011 Information Processing Society of Japan
7 7 (7) P A 2 P A 1 P A 1 P A 2 A 2 :{,,,...} A 2:{,,,...} A 1 :{ } 8 ( ) P A 1 P A (3,768 / 23,180) (549 / 1,944) (474 / 2,689) (753 / 2,764) (7,106 / 14,713) (1,054 / 3,284) (344 / 2,113) (1,042 / 3,086) (549 / 1,944) ) ( F ) ) Web 2 w v e(w, d)e(v, g) w d v g d g e(w, d) e(v, g) pmi(e(w, d), e(v, g)) = log P (e(w, d), e(v, g)) P (e(w, d))p (e(v, g)) k (k 5 ) 8 P A 1 P A 2 P A 1P A (5) 2 ( lift ) Chamber c 2011 Information Processing Society of Japan
8 3 ([, ] ) RTE(Recognizing Textual Entailment) 1) Kawahara, D. and Kurohashi, S.: A Fully-Lexicalized Probabilistic Model for Japanese Syntactic and Case Structure Analysis, Proceedings of the HLT- NAACL2006, pp (2006). 2) Bean, D. and Riloff, E.: Unsupervised Learning of Contextual Role Knowledge for Coreference Resolution, HLT-NAACL 2004: Main Proceedings, pp (2004). 3) Gerber, M. and Chai, J.: Beyond NomBank: A Study of Implicit Arguments for Nominal Predicates, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp (2010). 4) Chambers, N. and Jurafsky, D.: Unsupervised Learning of Narrative Event Chains, Proceedings of ACL-08: HLT, pp (2008). 5) Chambers, N. and Jurafsky, D.: Unsupervised Learning of Narrative Schemas and their Participants, Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pp (2009). 6) Miller, G. A.: Wordnet: A lexical detabase for English, Communications of the ACM (1995). 7) Singh, P. and Williams, W.: LifeNet: A Propositional Model of Ordinary Human Activity, Proceedings of Workshop on Distributed and Collaborative Knowledge Capture (2003). 8) Espinosa, J. and Lieberman, H.: EventNet: Inferring Temporal Relations Between Commonsense Events, Proceedings of the 4th Mexican International Conference on Artificial Intelligence, pp (2005). 9) Regneri, M., Koller, A. and Pinkal, M.: Learning Script Knowledge with Web Experiments, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp (2010). 10) Lin, D. and Pantel, P.: Discovery of Inference Rules for Question Answering, Natural Language Engineering, Vol.7, No.4, pp (2001). 11) Szpektor, I. and Dagan, I.: Learning Entailment Rules for Unary Templates, Proceedings of the 22nd International Conference on Computational Linguistics (COL- ING), pp (2008). 12) Fujiki, T., Nanba, H. and Okumura, M.: Automatic Acquisition of Script Knowledge from a Text Collection, Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2003), pp (2003). 13) Torisawa, K.: Acquiring Inference Rules with Temporal Constraints by using Japanese Coordinated Sentences and Noun-Verb Co-occurrences, Proceedings of Human Language Technology Conference/North American chapter of the Association for Computational Linguistics annual meeting (HLT-NAACL06), pp (2006). 14) Abe, S., Inui, K. and Matsumoto, Y.: Two-phased event relation acquisition: coupling the relation-oriented and argument-oriented approaches, Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pp. 1 8 (2008). 15) Agrawal, R., Imielinski, T. and Swami, A.: Mining association rules between sets of items in large databases, Proceedings of the ACM-SIGMOD 1993 International Conference on Management of Data (1993), pp (1993). 16) Kazama, J. and Torisawa, K.: Inducing Gazetteers for Named Entity Recognition by Large-Scale Clustering of Dependency Relations, Proceedings of ACL-08: HLT, pp (2008). 17) Borgelt, C. and Kruse, R.: Induction of Association Rules: Apriori Implementation, Proceedings of 15th Conference on Computational Statistics, pp (2002). 18) Sasano, R., Kawahara, D. and Kurohashi, S.: Improving Coreference Resolution Using Bridging Reference Resolution and Automatically Acquired Synonyms, Discourse Anaphora and Anaphor Resolution Colloquium, pp (2007). 8 c 2011 Information Processing Society of Japan
Automatically Evaluating Text Coherence using Anaphora and Coreference Resolution
1 1 Barzilay 1) Automatically Evaluating Text Coherence using Anaphora and Coreference Resolution Ryu Iida 1 and Takenobu Tokunaga 1 We propose a metric for automatically evaluating discourse coherence
More informationA Fully-Lexicalized Probabilistic Model for Japanese Zero Anaphora Resolution
A Fully-Lexicalized Probabilistic Model for Japanese Zero Anaphora Resolution Ryohei Sasano Graduate School of Information Science and Technology, University of Tokyo ryohei@nlp.kuee.kyoto-u.ac.jp Daisuke
More informationA Hierarchical Bayesian Model for Unsupervised Induction of Script Knowledge
A Hierarchical Bayesian Model for Unsupervised Induction of Script Knowledge Lea Frermann Ivan Titov Manfred Pinkal April, 28th 2014 1 / 22 Contents 1 Introduction 2 Technical Background 3 The Script Model
More informationRecognizing Implicit Discourse Relations through Abductive Reasoning with Large-scale Lexical Knowledge
Recognizing Implicit Discourse Relations through Abductive Reasoning with Large-scale Lexical Knowledge Jun Sugiura, Naoya Inoue, and Kentaro Inui Tohoku University, 6-3-09 Aoba, Aramaki, Aoba-ku, Sendai,
More informationMining coreference relations between formulas and text using Wikipedia
Mining coreference relations between formulas and text using Wikipedia Minh Nghiem Quoc 1, Keisuke Yokoi 2, Yuichiroh Matsubayashi 3 Akiko Aizawa 1 2 3 1 Department of Informatics, The Graduate University
More informationRemoving trivial associations in association rule discovery
Removing trivial associations in association rule discovery Geoffrey I. Webb and Songmao Zhang School of Computing and Mathematics, Deakin University Geelong, Victoria 3217, Australia Abstract Association
More informationChinese Zero Pronoun Resolution: A Joint Unsupervised Discourse-Aware Model Rivaling State-of-the-Art Resolvers
Chinese Zero Pronoun Resolution: A Joint Unsupervised Discourse-Aware Model Rivaling State-of-the-Art Resolvers Chen Chen and Vincent Ng Human Language Technology Research Institute University of Texas
More informationFertilization of Case Frame Dictionary for Robust Japanese Case Analysis
Fertilization of Case Frame Dictionary for Robust Japanese Case Analysis Daisuke Kawahara and Sadao Kurohashi Graduate School of Information Science and Technology, University of Tokyo PRESTO, Japan Science
More informationExcitatory or Inhibitory: A New Semantic Orientation Extracts Contradiction and Causality from the Web
Excitatory or Inhibitory: A New Semantic Orientation Extracts Contradiction and Causality from the Web Chikara Hashimoto Kentaro Torisawa Stijn De Saeger Jong-Hoon Oh Jun ichi Kazama National Institute
More informationUtilizing Portion of Patent Families with No Parallel Sentences Extracted in Estimating Translation of Technical Terms
1 1 1 2 2 30% 70% 70% NTCIR-7 13% 90% 1,000 Utilizing Portion of Patent Families with No Parallel Sentences Extracted in Estimating Translation of Technical Terms Itsuki Toyota 1 Yusuke Takahashi 1 Kensaku
More informationAn Empirical Study on Dimensionality Optimization in Text Mining for Linguistic Knowledge Acquisition
An Empirical Study on Dimensionality Optimization in Text Mining for Linguistic Knowledge Acquisition Yu-Seop Kim 1, Jeong-Ho Chang 2, and Byoung-Tak Zhang 2 1 Division of Information and Telecommunication
More informationP R + RQ P Q: Transliteration Mining Using Bridge Language
P R + RQ P Q: Transliteration Mining Using Bridge Language Mitesh M. Khapra Raghavendra Udupa n Institute of Technology Microsoft Research, Bombay, Bangalore, Powai, Mumbai 400076 raghavu@microsoft.com
More informationDetermining Word Sense Dominance Using a Thesaurus
Determining Word Sense Dominance Using a Thesaurus Saif Mohammad and Graeme Hirst Department of Computer Science University of Toronto EACL, Trento, Italy (5th April, 2006) Copyright cfl2006, Saif Mohammad
More informationCapturing Salience with a Trainable Cache Model for Zero-anaphora Resolution
Capturing Salience with a Trainable Cache Model for Zero-anaphora Resolution Ryu Iida Department of Computer Science Tokyo Institute of Technology 2-12-1, Ôokayama, Meguro, Tokyo 152-8552, Japan ryu-i@cl.cs.titech.ac.jp
More informationToponym Disambiguation using Ontology-based Semantic Similarity
Toponym Disambiguation using Ontology-based Semantic Similarity David S Batista 1, João D Ferreira 2, Francisco M Couto 2, and Mário J Silva 1 1 IST/INESC-ID Lisbon, Portugal {dsbatista,msilva}@inesc-id.pt
More informationA Surface-Similarity Based Two-Step Classifier for RITE-VAL
A Surface-Similarity Based Two-Step Classifier for RITE-VAL Shohei Hattori Satoshi Sato Graduate School of Engineering, Nagoya University Furo-cho, Chikusa-ku, Nagoya, 464-8603, JAPAN {syohei_h,ssato}@nuee.nagoya-u.ac.jp
More informationProposition Knowledge Graphs. Gabriel Stanovsky Omer Levy Ido Dagan Bar-Ilan University Israel
Proposition Knowledge Graphs Gabriel Stanovsky Omer Levy Ido Dagan Bar-Ilan University Israel 1 Problem End User 2 Case Study: Curiosity (Mars Rover) Curiosity is a fully equipped lab. Curiosity is a rover.
More informationAssignment 7 (Sol.) Introduction to Data Analytics Prof. Nandan Sudarsanam & Prof. B. Ravindran
Assignment 7 (Sol.) Introduction to Data Analytics Prof. Nandan Sudarsanam & Prof. B. Ravindran 1. Let X, Y be two itemsets, and let denote the support of itemset X. Then the confidence of the rule X Y,
More informationAn Introduction to String Re-Writing Kernel
An Introduction to String Re-Writing Kernel Fan Bu 1, Hang Li 2 and Xiaoyan Zhu 3 1,3 State Key Laboratory of Intelligent Technology and Systems 1,3 Tsinghua National Laboratory for Information Sci. and
More informationLearning Features from Co-occurrences: A Theoretical Analysis
Learning Features from Co-occurrences: A Theoretical Analysis Yanpeng Li IBM T. J. Watson Research Center Yorktown Heights, New York 10598 liyanpeng.lyp@gmail.com Abstract Representing a word by its co-occurrences
More informationLinking people in videos with their names using coreference resolution (Supplementary Material)
Linking people in videos with their names using coreference resolution (Supplementary Material) Vignesh Ramanathan, Armand Joulin, Percy Liang, Li Fei-Fei Department of Electrical Engineering, Stanford
More informationText Mining. Dr. Yanjun Li. Associate Professor. Department of Computer and Information Sciences Fordham University
Text Mining Dr. Yanjun Li Associate Professor Department of Computer and Information Sciences Fordham University Outline Introduction: Data Mining Part One: Text Mining Part Two: Preprocessing Text Data
More informationJoint Inference for Event Timeline Construction
Joint Inference for Event Timeline Construction Quang Xuan Do Wei Lu Dan Roth Department of Computer Science University of Illinois at Urbana-Champaign Urbana, IL 61801, USA {quangdo2,luwei,danr}@illinois.edu
More informationA Continuous-Time Model of Topic Co-occurrence Trends
A Continuous-Time Model of Topic Co-occurrence Trends Wei Li, Xuerui Wang and Andrew McCallum Department of Computer Science University of Massachusetts 140 Governors Drive Amherst, MA 01003-9264 Abstract
More informationLatent Dirichlet Allocation Based Multi-Document Summarization
Latent Dirichlet Allocation Based Multi-Document Summarization Rachit Arora Department of Computer Science and Engineering Indian Institute of Technology Madras Chennai - 600 036, India. rachitar@cse.iitm.ernet.in
More informationLearning Textual Entailment using SVMs and String Similarity Measures
Learning Textual Entailment using SVMs and String Similarity Measures Prodromos Malakasiotis and Ion Androutsopoulos Department of Informatics Athens University of Economics and Business Patision 76, GR-104
More informationA Linguistic Inspection of Textual Entailment
A Linguistic Inspection of Textual Entailment Maria Teresa Pazienza 1, Marco Pennacchiotti 1, and Fabio Massimo Zanzotto 2 1 University of Roma Tor Vergata, Via del Politecnico 1, Roma, Italy, {pazienza,
More informationLearning Scripts as Hidden Markov Models
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence Learning Scripts as Hidden Markov Models J. Walker Orr, Prasad Tadepalli, Janardhan Rao Doppa, Xiaoli Fern, Thomas G. Dietterich
More informationCoreference Resolution with! ILP-based Weighted Abduction
Coreference Resolution with! ILP-based Weighted Abduction Naoya Inoue, Ekaterina Ovchinnikova, Kentaro Inui, Jerry Hobbs Tohoku University, Japan! ISI/USC, USA Motivation Long-term goal: unified framework
More informationAnnotating Spatial Containment Relations Between Events
Annotating Spatial Containment Relations Between Events Kirk Roberts, Travis Goodwin, Sanda M. Harabagiu Human Language Technology Research Institute University of Texas at Dallas Richardson TX 75080 {kirk,travis,sanda}@hlt.utdallas.edu
More informationCLRG Biocreative V
CLRG ChemTMiner @ Biocreative V Sobha Lalitha Devi., Sindhuja Gopalan., Vijay Sundar Ram R., Malarkodi C.S., Lakshmi S., Pattabhi RK Rao Computational Linguistics Research Group, AU-KBC Research Centre
More informationData Mining and Knowledge Discovery. Petra Kralj Novak. 2011/11/29
Data Mining and Knowledge Discovery Petra Kralj Novak Petra.Kralj.Novak@ijs.si 2011/11/29 1 Practice plan 2011/11/08: Predictive data mining 1 Decision trees Evaluating classifiers 1: separate test set,
More informationAssociation Rule. Lecturer: Dr. Bo Yuan. LOGO
Association Rule Lecturer: Dr. Bo Yuan LOGO E-mail: yuanb@sz.tsinghua.edu.cn Overview Frequent Itemsets Association Rules Sequential Patterns 2 A Real Example 3 Market-Based Problems Finding associations
More informationStatistical Machine Learning Theory. From Multi-class Classification to Structured Output Prediction. Hisashi Kashima.
http://goo.gl/jv7vj9 Course website KYOTO UNIVERSITY Statistical Machine Learning Theory From Multi-class Classification to Structured Output Prediction Hisashi Kashima kashima@i.kyoto-u.ac.jp DEPARTMENT
More informationStatistical Machine Learning Theory. From Multi-class Classification to Structured Output Prediction. Hisashi Kashima.
http://goo.gl/xilnmn Course website KYOTO UNIVERSITY Statistical Machine Learning Theory From Multi-class Classification to Structured Output Prediction Hisashi Kashima kashima@i.kyoto-u.ac.jp DEPARTMENT
More informationText Mining. March 3, March 3, / 49
Text Mining March 3, 2017 March 3, 2017 1 / 49 Outline Language Identification Tokenisation Part-Of-Speech (POS) tagging Hidden Markov Models - Sequential Taggers Viterbi Algorithm March 3, 2017 2 / 49
More informationKSU Team s System and Experience at the NTCIR-11 RITE-VAL Task
KSU Team s System and Experience at the NTCIR-11 RITE-VAL Task Tasuku Kimura Kyoto Sangyo University, Japan i1458030@cse.kyoto-su.ac.jp Hisashi Miyamori Kyoto Sangyo University, Japan miya@cse.kyoto-su.ac.jp
More information(1) [John (i) ] attacked [Bob (j) ]. Police arrested him (i).
1,a) 1,b) 1,c) 1,d) 1,e) e.g., man arrest e.g., man arrest man [Van de Crus 2014]. 1. [1], [2], [3] [4] [5] [6], [7], [8], [9] [10], [11] 1 Tohoku Universit a) masauki.ono@ecei.tohoku.ac.jp b) naoa-i@ecei.tohoku.ac.jp
More informationThe Winograd Schema Challenge and Reasoning about Correlation
The Winograd Schema Challenge and Reasoning about Correlation Daniel Bailey University of Nebraska at Omaha dbailey@unomahaedu Abstract The Winograd Schema Challenge is an alternative to the Turing Test
More informationClassification & Information Theory Lecture #8
Classification & Information Theory Lecture #8 Introduction to Natural Language Processing CMPSCI 585, Fall 2007 University of Massachusetts Amherst Andrew McCallum Today s Main Points Automatically categorizing
More informationSequences and Information
Sequences and Information Rahul Siddharthan The Institute of Mathematical Sciences, Chennai, India http://www.imsc.res.in/ rsidd/ Facets 16, 04/07/2016 This box says something By looking at the symbols
More informationChapter 2 Quality Measures in Pattern Mining
Chapter 2 Quality Measures in Pattern Mining Abstract In this chapter different quality measures to evaluate the interest of the patterns discovered in the mining process are described. Patterns represent
More informationDiscovery of Frequent Word Sequences in Text. Helena Ahonen-Myka. University of Helsinki
Discovery of Frequent Word Sequences in Text Helena Ahonen-Myka University of Helsinki Department of Computer Science P.O. Box 26 (Teollisuuskatu 23) FIN{00014 University of Helsinki, Finland, helena.ahonen-myka@cs.helsinki.fi
More informationIt s a Contradiction No, it s Not: A Case Study using Functional Relations
It s a Contradiction No, it s Not: A Case Study using Functional Relations Alan Ritter, Doug Downey, Stephen Soderland and Oren Etzioni Turing Center Department of Computer Science and Engineering University
More informationAn Approach to Classification Based on Fuzzy Association Rules
An Approach to Classification Based on Fuzzy Association Rules Zuoliang Chen, Guoqing Chen School of Economics and Management, Tsinghua University, Beijing 100084, P. R. China Abstract Classification based
More informationAnnotation tasks and solutions in CLARIN-PL
Annotation tasks and solutions in CLARIN-PL Marcin Oleksy, Ewa Rudnicka Wrocław University of Technology marcin.oleksy@pwr.edu.pl ewa.rudnicka@pwr.edu.pl CLARIN ERIC Common Language Resources and Technology
More informationBe principled! A Probabilistic Model for Lexical Entailment
Be principled! A Probabilistic Model for Lexical Entailment Eyal Shnarch Computer Science Department Bar-Ilan University Ramat-Gan, Israel shey@cs.biu.ac.il Jacob Goldberger School of Engineering Bar-Ilan
More informationFirst Order Logic Implication (4A) Young W. Lim 4/6/17
First Order Logic (4A) Young W. Lim Copyright (c) 2016-2017 Young W. Lim. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version
More informationIntegrating Order Information and Event Relation for Script Event Prediction
Integrating Order Information and Event Relation for Script Event Prediction Zhongqing Wang 1,2, Yue Zhang 2 and Ching-Yun Chang 2 1 Soochow University, China 2 Singapore University of Technology and Design
More informationThe Noisy Channel Model and Markov Models
1/24 The Noisy Channel Model and Markov Models Mark Johnson September 3, 2014 2/24 The big ideas The story so far: machine learning classifiers learn a function that maps a data item X to a label Y handle
More informationD2: For each type 1 quantifier Q, Q acc (R) = {a : Q(aR) = 1}.
Some Formal Properties of Higher Order Anaphors R. Zuber Laboratoire de Linguistique Formelle, CNRS and University Paris-Diderot Richard.Zuber@linguist.univ-paris-diderot.fr Abstract Formal properties
More informationCPDA Based Fuzzy Association Rules for Learning Achievement Mining
2009 International Conference on Machine Learning and Computing IPCSIT vol.3 (2011) (2011) IACSIT Press, Singapore CPDA Based Fuzzy Association Rules for Learning Achievement Mining Jr-Shian Chen 1, Hung-Lieh
More informationReflexives and non-fregean quantifiers
UCLA Working Papers in Linguistics, Theories of Everything Volume 17, Article 49: 439-445, 2012 Reflexives and non-fregean quantifiers Richard Zuber It is shown that depending on the subject noun phrase
More informationEXTRACTION AND VISUALIZATION OF GEOGRAPHICAL NAMES IN TEXT
Abstract EXTRACTION AND VISUALIZATION OF GEOGRAPHICAL NAMES IN TEXT Xueying Zhang zhangsnowy@163.com Guonian Lv Zhiren Xie Yizhong Sun 210046 Key Laboratory of Virtual Geographical Environment (MOE) Naning
More informationBnO at NTCIR-10 RITE: A Strong Shallow Approach and an Inference-based Textual Entailment Recognition System
BnO at NTCIR-10 RITE: A Strong Shallow Approach and an Inference-based Textual Entailment Recognition System ABSTRACT Ran Tian National Institute of Informatics, Japan tianran@nii.ac.jp Takuya Matsuzaki
More informationInformation Extraction and GATE. Valentin Tablan University of Sheffield Department of Computer Science NLP Group
Information Extraction and GATE Valentin Tablan University of Sheffield Department of Computer Science NLP Group Information Extraction Information Extraction (IE) pulls facts and structured information
More informationProbabilistic Coordination Disambiguation in a Fully-lexicalized Japanese Parser
Probabilistic Coordination Disambiguation in a Fully-lexicalized Japanese Parser Daisuke Kawahara National Institute of Information and Communications Technology, 3-5 Hikaridai Seika-cho, Soraku-gun, Kyoto,
More informationNatural Language Processing : Probabilistic Context Free Grammars. Updated 5/09
Natural Language Processing : Probabilistic Context Free Grammars Updated 5/09 Motivation N-gram models and HMM Tagging only allowed us to process sentences linearly. However, even simple sentences require
More informationLatent Variable Models in NLP
Latent Variable Models in NLP Aria Haghighi with Slav Petrov, John DeNero, and Dan Klein UC Berkeley, CS Division Latent Variable Models Latent Variable Models Latent Variable Models Observed Latent Variable
More information情報工学実験 4: データマイニング班 (week 6) 機械学習概観の振り返り
情報工学実験 4: データマイニング班 (week 6) 機械学習概観の振り返り 1. ( 復習 ) 用語 2. ( 復習 ) 機械学習における問題設定 3. ( 復習 )scikit-learnの使い方 4. ( 復習 ) モデルと仮説 5. ( 復習 ) 問題 アルゴリズム モデル 6. ( 復習 ) 過学習 7. ( 復習 ) ペナルティ項 ( 正則化 ) 8. ( 復習 ) 交差検定 実験ページ
More informationGlobal Machine Learning for Spatial Ontology Population
Global Machine Learning for Spatial Ontology Population Parisa Kordjamshidi, Marie-Francine Moens KU Leuven, Belgium Abstract Understanding spatial language is important in many applications such as geographical
More informationTowards a Probabilistic Model for Lexical Entailment
Towards a Probabilistic Model for Lexical Entailment Eyal Shnarch Computer Science Department Bar-Ilan University Ramat-Gan, Israel shey@cs.biu.ac.il Jacob Goldberger School of Engineering Bar-Ilan University
More informationContexts for Quantification
Contexts for Quantification Valeria de Paiva Stanford April, 2011 Valeria de Paiva (Stanford) C4Q April, 2011 1 / 28 Natural logic: what we want Many thanks to Larry, Ulrik for slides! Program Show that
More information10/17/04. Today s Main Points
Part-of-speech Tagging & Hidden Markov Model Intro Lecture #10 Introduction to Natural Language Processing CMPSCI 585, Fall 2004 University of Massachusetts Amherst Andrew McCallum Today s Main Points
More informationSemantics and Pragmatics of NLP Pronouns
Semantics and Pragmatics of NLP Pronouns School of Informatics University of Edinburgh Outline Observations About Data 1 Observations of what factors influence the way pronouns get resolved 2 Some algorithms
More informationAutomatic Generation of Shogi Commentary with a Log-Linear Language Model
1,a) 2, 1,b) 1,c) 3,d) 1,e) 2011 11 4, 2011 12 1 2 Automatic Generation of Shogi Commentary with a Log-Linear Language Model Hirotaka Kameko 1,a) Makoto Miwa 2, 1,b) Yoshimasa Tsuruoka 1,c) Shinsuke Mori
More informationHidden Markov Models, I. Examples. Steven R. Dunbar. Toy Models. Standard Mathematical Models. Realistic Hidden Markov Models.
, I. Toy Markov, I. February 17, 2017 1 / 39 Outline, I. Toy Markov 1 Toy 2 3 Markov 2 / 39 , I. Toy Markov A good stack of examples, as large as possible, is indispensable for a thorough understanding
More informationPenn Treebank Parsing. Advanced Topics in Language Processing Stephen Clark
Penn Treebank Parsing Advanced Topics in Language Processing Stephen Clark 1 The Penn Treebank 40,000 sentences of WSJ newspaper text annotated with phrasestructure trees The trees contain some predicate-argument
More informationDiscovering Classes of Strongly Equivalent Logic Programs with Negation as Failure in the Head
Discovering Classes of Strongly Equivalent Logic Programs with Negation as Failure in the Head Jianmin Ji School of Computer Science and Technology University of Science and Technology of China Hefei,
More informationTopic #3 Predicate Logic. Predicate Logic
Predicate Logic Predicate Logic Predicate logic is an extension of propositional logic that permits concisely reasoning about whole classes of entities. Propositional logic treats simple propositions (sentences)
More informationSemantic Similarity from Corpora - Latent Semantic Analysis
Semantic Similarity from Corpora - Latent Semantic Analysis Carlo Strapparava FBK-Irst Istituto per la ricerca scientifica e tecnologica I-385 Povo, Trento, ITALY strappa@fbk.eu Overview Latent Semantic
More informationEncoding Tree Pair-based Graphs in Learning Algorithms: the Textual Entailment Recognition Case
Encoding Tree Pair-based Graphs in Learning Algorithms: the Textual Entailment Recognition Case Alessandro Moschitti DII University of Trento Via ommarive 14 38100 POVO (T) - Italy moschitti@dit.unitn.it
More informationAn Improved Stemming Approach Using HMM for a Highly Inflectional Language
An Improved Stemming Approach Using HMM for a Highly Inflectional Language Navanath Saharia 1, Kishori M. Konwar 2, Utpal Sharma 1, and Jugal K. Kalita 3 1 Department of CSE, Tezpur University, India {nava
More informationLogical Agents. September 14, 2004
Logical Agents September 14, 2004 The aim of AI is to develop intelligent agents that can reason about actions and their effects and about the environment, create plans to achieve a goal, execute the plans,
More informationNatural Language Processing CS Lecture 06. Razvan C. Bunescu School of Electrical Engineering and Computer Science
Natural Language Processing CS 6840 Lecture 06 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Statistical Parsing Define a probabilistic model of syntax P(T S):
More informationIntroduction to Semantics. Common Nouns and Adjectives in Predicate Position 1
Common Nouns and Adjectives in Predicate Position 1 (1) The Lexicon of Our System at Present a. Proper Names: [[ Barack ]] = Barack b. Intransitive Verbs: [[ smokes ]] = [ λx : x D e. IF x smokes THEN
More informationMachine Learning for Interpretation of Spatial Natural Language in terms of QSR
Machine Learning for Interpretation of Spatial Natural Language in terms of QSR Parisa Kordjamshidi 1, Joana Hois 2, Martijn van Otterlo 1, and Marie-Francine Moens 1 1 Katholieke Universiteit Leuven,
More informationLearning Random Walk Models for Inducing Word Dependency Distributions
Learning Random Walk Models for Inducing Word Dependency Distributions Kristina Toutanova Christopher D. Manning Andrew Y. Ng Computer Science Department, Stanford University, Stanford, CA 94305, USA Abstract
More informationThe Benefits of a Model of Annotation
The Benefits of a Model of Annotation Rebecca J. Passonneau and Bob Carpenter Columbia University Center for Computational Learning Systems Department of Statistics LAW VII, August 2013 Conventional Approach
More informationToponym Disambiguation by Arborescent Relationships
Journal of Computer Science 6 (6): 653-659, 2010 ISSN 1549-3636 2010 Science Publications Toponym Disambiguation by Arborescent Relationships Imene Bensalem and Mohamed-Khireddine Kholladi Department of
More informationAnnotating Spatial Containment Relations Between Events. Kirk Roberts, Travis Goodwin, and Sanda Harabagiu
Annotating Spatial Containment Relations Between Events Kirk Roberts, Travis Goodwin, and Sanda Harabagiu Problem Previous Work Schema Corpus Future Work Problem Natural language documents contain a large
More informationTuning as Linear Regression
Tuning as Linear Regression Marzieh Bazrafshan, Tagyoung Chung and Daniel Gildea Department of Computer Science University of Rochester Rochester, NY 14627 Abstract We propose a tuning method for statistical
More informationTextual Entailment as a Directional Relation
Textual Entailment as a Directional Relation Doina Tătar, Gabriela Şerban, Mihiş Andreea Department of Computer Science University Babes-Bolyai, Cluj-Napoca, Romania dtatar@cs.ubbcluj.ro, gabis@cs.ubbcluj.ro,
More informationA Syntax-based Statistical Machine Translation Model. Alexander Friedl, Georg Teichtmeister
A Syntax-based Statistical Machine Translation Model Alexander Friedl, Georg Teichtmeister 4.12.2006 Introduction The model Experiment Conclusion Statistical Translation Model (STM): - mathematical model
More informationNUL System at NTCIR RITE-VAL tasks
NUL System at NTCIR RITE-VAL tasks Ai Ishii Nihon Unisys, Ltd. ai.ishi@unisys.co.jp Hiroshi Miyashita Nihon Unisys, Ltd. hiroshi.miyashita@unisys.co.jp Mio Kobayashi Chikara Hoshino Nihon Unisys, Ltd.
More informationTitle 古典中国語 ( 漢文 ) の形態素解析とその応用 安岡, 孝一 ; ウィッテルン, クリスティアン ; 守岡, 知彦 ; 池田, 巧 ; 山崎, 直樹 ; 二階堂, 善弘 ; 鈴木, 慎吾 ; 師, 茂. Citation 情報処理学会論文誌 (2018), 59(2):
Title 古典中国語 ( 漢文 ) の形態素解析とその応用 Author(s) 安岡, 孝一 ; ウィッテルン, クリスティアン ; 守岡, 知彦 ; 池田, 巧 ; 山崎, 直樹 ; 二階堂, 善弘 ; 鈴木, 慎吾 ; 師, 茂 Citation 情報処理学会論文誌 (2018), 59(2): 323-331 Issue Date 2018-02-15 URL http://hdl.handle.net/2433/229121
More informationMaschinelle Sprachverarbeitung
Maschinelle Sprachverarbeitung Parsing with Probabilistic Context-Free Grammar Ulf Leser Content of this Lecture Phrase-Structure Parse Trees Probabilistic Context-Free Grammars Parsing with PCFG Other
More informationLocation Name Disambiguation Exploiting Spatial Proximity and Temporal Consistency
Location Name Disambiguation Exploiting Spatial Proximity and Temporal Consistency Takashi Awamura Eiji Aramaki Daisuke Kawahara Tomohide Shibata Sadao Kurohashi Graduate School of Informatics, Kyoto University
More informationMining Exceptional Relationships with Grammar-Guided Genetic Programming
Knowledge and Information Systems manuscript No. (will be inserted by the editor) Mining Exceptional Relationships with Grammar-Guided Genetic Programming J. M. Luna M. Pechenizkiy S. Ventura Received:
More information2002 Journal of Software, )
1000-9825/2002/13(04)0732-07 2002 Journal of Software Vol13 No4 ( 310027); ( 310027) E-mail hdstsysgn@cisezjueducn http//wwwzjueducn ; ; ; TP18 A ( ) (anaphora) 3 [1~3] GroszSidnerWalker Grosz Sidner PAL
More informationMaschinelle Sprachverarbeitung
Maschinelle Sprachverarbeitung Parsing with Probabilistic Context-Free Grammar Ulf Leser Content of this Lecture Phrase-Structure Parse Trees Probabilistic Context-Free Grammars Parsing with PCFG Other
More informationLatent Dirichlet Allocation Introduction/Overview
Latent Dirichlet Allocation Introduction/Overview David Meyer 03.10.2016 David Meyer http://www.1-4-5.net/~dmm/ml/lda_intro.pdf 03.10.2016 Agenda What is Topic Modeling? Parametric vs. Non-Parametric Models
More informationExtraction of Opposite Sentiments in Classified Free Format Text Reviews
Extraction of Opposite Sentiments in Classified Free Format Text Reviews Dong (Haoyuan) Li 1, Anne Laurent 2, Mathieu Roche 2, and Pascal Poncelet 1 1 LGI2P - École des Mines d Alès, Parc Scientifique
More informationAutomatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics
Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics Chin-Yew Lin and Franz Josef Och Information Sciences Institute University of Southern California
More informationEntropy as an Indicator of Context Boundaries An Experiment Using a Web Search Engine
Entropy as an Indicator of Context Boundaries An Experiment Using a Web Search Engine Kumiko Tanaka-Ishii Graduate School of Information Science and Technology, University of Tokyo kumiko@i.u-tokyo.ac.jp
More informationA Model for Multimodal Reference Resolution
A Model for Multimodal Reference Resolution Luis Pineda National Autonomous University of Mexico (UNAM) Gabriela Garza An important aspect of the interpretation of multimodal messages is the ability to
More informationMining Positive and Negative Fuzzy Association Rules
Mining Positive and Negative Fuzzy Association Rules Peng Yan 1, Guoqing Chen 1, Chris Cornelis 2, Martine De Cock 2, and Etienne Kerre 2 1 School of Economics and Management, Tsinghua University, Beijing
More informationRecognizing Spatial Containment Relations between Event Mentions
Recognizing Spatial Containment Relations between Event Mentions Kirk Roberts Human Language Technology Research Institute University of Texas at Dallas kirk@hlt.utdallas.edu Sanda M. Harabagiu Human Language
More informationDomain Adaptation for Word Sense Disambiguation under the Problem of Covariate Shift
Domain Adaptation for Word Sense Disambiguation under the Problem of Covariate Shift HIRONORI KIKUCHI 1,a) HIROYUKI SHINNOU 1,b) Abstract: Word sense disambiguation(wsd) is the task of identifying the
More informationFrancisco M. Couto Mário J. Silva Pedro Coutinho
Francisco M. Couto Mário J. Silva Pedro Coutinho DI FCUL TR 03 29 Departamento de Informática Faculdade de Ciências da Universidade de Lisboa Campo Grande, 1749 016 Lisboa Portugal Technical reports are
More information