Bayesian Poisson Tensor Factorization for Inferring Multilateral Relations from Sparse Dyadic Event Counts

Similar documents
Canadian Imports of Honey

2017 Source of Foreign Income Earned By Fund

Big Data at BBVA Research using BigQuery

Supplementary Appendix for. Version: February 3, 2014

1. Impacts of Natural Disasters by Region, 2008

International Student Enrollment Fall 2018 By CIP Code, Country of Citizenship, and Education Level Harpur College of Arts and Sciences

USDA Dairy Import License Circular for 2018

ia PU BLi s g C o M Pa K T Wa i n CD-1576

04 June Dim A W V Total. Total Laser Met

GAMINGRE 8/1/ of 7

USDA Dairy Import License Circular for 2018 Commodity/

Asia. JigsawGeo. Free Printable Maps for Geography Education. Try our geography games for the ipod Touch or iphone.

Grand Total Baccalaureate Post-Baccalaureate Masters Doctorate Professional Post-Professional

DISTILLED SPIRITS - EXPORTS BY VALUE DECEMBER 2017

About the Authors Geography and Tourism: The Attraction of Place p. 1 The Elements of Geography p. 2 Themes of Geography p. 4 Location: The Where of

Florida's Refugee Population Statistical Report

Discovering the World of Geography

Natural Resource Management Indicators for the Least Developed Countries

21st Century Global Learning

How Well Are Recessions and Recoveries Forecast? Prakash Loungani, Herman Stekler and Natalia Tamirisa

READY TO SCRAP: HOW MANY VESSELS AT DEMOLITION VALUE?

AD HOC DRAFTING GROUP ON TRANSNATIONAL ORGANISED CRIME (PC-GR-COT) STATUS OF RATIFICATIONS BY COUNCIL OF EUROPE MEMBER STATES

Additional VEX Worlds 2019 Spot Allocations

2001 Environmental Sustainability Index

Most Recent Periodic Report Initial State Report. Next Periodic Accession/Ratification. Report Publication Publication. Report Due

Theme 4. Disaster Mitigation and Risk Management

Multimedia on Nuclear Reactor Physics In order to improve education and training quality, a Multimedia on Nuclear Reactor Physics has been developed.

Forecast Million Lbs. % Change 1. Carryin August 1, ,677, ,001, % 45.0

Country of Citizenship, College-Wide - All Students, Fall 2014

International and regional network status

2,152,283. Japan 6,350,859 32,301 6,383,160 58,239, ,790 58,464,425 6,091, ,091,085 52,565, ,420 52,768,905

Grand Total Baccalaureate Post-Baccalaureate Masters Doctorate Professional Post-Professional

IEEE Transactions on Image Processing EiC Report

Nigerian Capital Importation QUARTER THREE 2016

USDA Dairy Import License Circular for 2018

Appendix B: Detailed tables showing overall figures by country and measure

THIS REPORT CONTAINS ASSESSMENTS OF COMMODITY AND TRADE ISSUES MADE BY USDA STAFF AND NOT NECESSARILY STATEMENTS OF OFFICIAL U.S.

Does socio-economic indicator influent ICT variable? II. Method of data collection, Objective and data gathered

Scaling Seed Kits Through Household Gardens

Do Policy-Related Shocks Affect Real Exchange Rates? An Empirical Analysis Using Sign Restrictions and a Penalty-Function Approach

University of Oklahoma, Norman Campus International Student Report Fall 2014

Governments that have requested pre-export notifications pursuant to article 12, paragraph 10 (a), of the 1988 Convention

TIGER: Tracking Indexes for the Global Economic Recovery By Eswar Prasad, Karim Foda, and Ethan Wu

Grain: World Markets and Trade

Uranium and thorium resources: Evaluation and reporting issues

ˆ GDP t = GDP t SCAN t (1) t stat : (3.71) (5.53) (3.27) AdjustedR 2 : 0.652

TIGER: Tracking Indexes for the Global Economic Recovery By Eswar Prasad and Karim Foda

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the WHO European Region

A long-term global forecast for the extraction of oil and gas from shale formations

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

The Chemical Weapons Convention, Biological and Toxin Weapons Convention, Geneva Protocol

DESKTOP STUDY ON GLOBAL IMPORTS OF HANDMADE CARPETS & FLOOR COVERINGS AT A GLANCE

W o r l d O i l a n d G a s R e v i e w

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the WHO European Region

OCTOBER Almond Industry Position Report Crop Year /01-10/31 Kernel Wt /01-10/31 Kernel Wt.

CONTINENT WISE ANALYSIS OF ZOOLOGICAL SCIENCE PERIODICALS: A SCIENTOMETRIC STUDY

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

Florida's Refugee Population Statistical Report

Kernel Wt. 593,190,150 1,218,046,237 1,811,236, ,364, ,826, ,191, Crop Year

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

MOCK EXAMINATION 1. Name Class Date INSTRUCTIONS

World Agricultural Production

Radiation Protection Procedures

WHO EpiData. A monthly summary of the epidemiological data on selected Vaccine preventable diseases in the European Region

Forecast Million Lbs. % Change 1. Carryin August 1, ,677, ,001, % 45.0

Bahrain, Israel, Jordan, Kuwait, Saudi Arabia, United Arab Emirates, Azerbaijan, Iraq, Qatar and Sudan.

A COMPREHENSIVE WORLDWIDE WEB-BASED WEATHER RADAR DATABASE

World Agricultural Production

TMM UPDATE TRANS DAY OF REMEMBRANCE 2017

Proposed AKS for 6 th Grade Social Studies

Global Data Catalog initiative Christophe Charpentier ArcGIS Content Product Manager

ICC Rev August 2010 Original: English. Agreement. International Coffee Council 105 th Session September 2010 London, England

Solow model: Convergence

DISTILLED SPIRITS - IMPORTS BY VALUE DECEMBER 2017

DISTILLED SPIRITS - IMPORTS BY VOLUME DECEMBER 2017

WHO EpiData. A monthly summary of the epidemiological data on selected vaccine preventable diseases in the European Region

Situation on the death penalty in the world. UNGA Vote 2012 Resolutio n 67/176. UNGA Vote 2010 Resolutio n 65/206. UNGA Vote 2008 Resolutio n 63/168

World Blueberry Acreage & Production

World Agricultural Production

The Outer Space Legal Regime and UN Register of Space Objects

AP Human Geography Summer Assignment

natural gas World Oil and Gas Review

SDG s and the role of Earth Observation

Janet L. Klein Dickinson High School AP Human Geography Dear Advanced Placement Human Geography Student,

INTERNATIONAL S T U D E N T E N R O L L M E N T

World Agricultural Production

JRC MARS Bulletin Crop monitoring in Europe. January 2017 Minor frost damages so far. Improved hardening of winter cereals in central Europe

Pre-Processing of Dynamic Networks * Its Impact on Observed Communities and Other Analytics

Unit of Study: Physical Geography & Settlement Patterns; Culture & Civilizations; and The Spread of Ideas

Stochastic Analysis and Forecasts of the Patterns of Speed, Acceleration, and Levels of Material Stock Accumulation in Society

PRECURSORS. Pseudoephedrine preparations 3,4-MDP-2-P a P-2-P b. Ephedrine

Fall International Student Enrollment Statistics

1 st Six Weeks # of Days. Unit # and Title Unit 1 Geography Overview

Office of Budget & Planning 311 Thomas Boyd Hall Baton Rouge, LA Telephone 225/ Fax 225/

Florida's Refugee Population Statistical Report

Bilateral Labour Agreements, 2004

Learning Objectives. Math Chapter 3. Chapter 3. Association. Response and Explanatory Variables

Appendices. Please note that Internet resources are of a time-sensitive nature and URL addresses may often change or be deleted.

COUNCIL. Hundred and Fifty-eighth Session. Rome, 4-8 December Status of Current Assessments and Arrears as at 27 November 2017

Fall International Student Enrollment & Scholar Statistics

Transcription:

Bayesian Poisson Tensor Factorization for Inferring Multilateral Relations from Sparse Dyadic Event Counts KDD 2015 Aaron Schein UMass Amherst Joint work with: John Paisley, Dave Blei & Hanna Wallach Columbia University Microsoft Research

International relations are multilateral

From Schrodt (1993) Event Data in Foreign Policy Analysis:

From Schrodt (1993) Event Data in Foreign Policy Analysis:

From Schrodt (1993) Event Data in Foreign Policy Analysis:

From Schrodt (1993) Event Data in Foreign Policy Analysis:

From Schrodt (1993) Event Data in Foreign Policy Analysis:

From Schrodt (1993) Event Data in Foreign Policy Analysis:

From Schrodt (1993) Event Data in Foreign Policy Analysis:

From Schrodt (1993) Event Data in Foreign Policy Analysis:

From Schrodt (1993) Event Data in Foreign Policy Analysis:

Multilateral relation A coherent thread of international events.

Multilateral relation A coherent thread of international events. Characterized by: 1. sender countries 2. receiver countries 3. action types 4. time steps

Multilateral relation A coherent thread of international events. Characterized by: 1. sender countries 2. receiver countries 3. action types 4. time steps

Multilateral relation A coherent thread of international events. Characterized by: 1. sender countries 2. receiver countries 3. action types 4. time steps Who is doing what to whom, when?

Goal: Infer multilateral relations Tease apart the coherent threads of:

Goal: Infer multilateral relations Tease apart the coherent threads of: (millions of unstructured dyadic events)

Sample result: time steps senders receivers action types

Sample result: Afghanistan War time steps senders receivers action types

Sparse event counts Express intent to cooperate Threaten United States Russia Israel Iraq China United Kingdom Iran Palestine France Japan Europe Pakistan Germany Turkey Afghanistan Egypt Africa Syria Lebanon North Korea South Korea Australia Jordan Italy Saudi Arabia Canada Indonesia Nigeria Sudan Spain Philippines Ukraine Kenya NATO Poland July 2002 United States Russia Israel Iraq China United Kingdom Iran Palestine France Japan Europe Pakistan Germany Turkey Afghanistan Egypt Africa Syria Lebanon North Korea South Korea Australia Jordan Italy Saudi Arabia Canada Indonesia Nigeria Sudan Spain Philippines Ukraine Kenya NATO Poland United States Russia Israel Iraq China United Kingdom Iran Palestine France Japan Europe Pakistan Germany Turkey Afghanistan Egypt Africa Syria Lebanon North Korea South Korea Australia Jordan Italy Saudi Arabia Canada Indonesia Nigeria Sudan Spain Philippines Ukraine Kenya NATO Poland

Sparse event counts Express intent to cooperate Threaten United States Russia Israel Iraq China United Kingdom Iran Palestine France Japan Europe Pakistan Germany Turkey Afghanistan Egypt Africa Syria Lebanon North Korea South Korea Australia Jordan Italy Saudi Arabia Canada Indonesia Nigeria Sudan Spain Philippines Ukraine Kenya NATO Poland July 2002 United States Russia Israel Iraq China United Kingdom Iran Palestine France Japan Europe Pakistan Germany Turkey Afghanistan Egypt Africa Syria Lebanon North Korea South Korea Australia Jordan Italy Saudi Arabia Canada Indonesia Nigeria Sudan Spain Philippines Ukraine Kenya NATO Poland United States Russia Israel Iraq China United Kingdom Iran Palestine France Japan Europe Pakistan Germany Turkey Afghanistan Egypt Africa Syria Lebanon North Korea South Korea Australia Jordan Italy Saudi Arabia Canada Indonesia Nigeria Sudan Spain Philippines Ukraine Kenya NATO Poland

Poisson matrix factorization Australia New Zealand China Vietnam Cambodia Laos Australia New Zealand China Vietnam Cambodia Laos

Poisson matrix factorization Australia New Zealand China Vietnam Cambodia Laos Australia New Zealand China Vietnam Cambodia Laos

Poisson matrix factorization Australia New Zealand China Vietnam Cambodia Laos Australia New Zealand China Vietnam Cambodia Laos

Poisson matrix factorization Australia New Zealand China Vietnam Cambodia Laos Australia New Zealand China Vietnam Cambodia Laos

Poisson matrix factorization Australia New Zealand China Vietnam Cambodia Laos Australia New Zealand China Vietnam Cambodia Laos

Poisson matrix factorization Y = N N

Poisson matrix factorization Y = N y ij Poisson X k N (1) (2) ik jk

Poisson matrix factorization Y = N y ij Poisson X k N (1) (2) ik jk how active country i is as a sender in community k

Poisson matrix factorization Y = N y ij Poisson X k N (1) (2) ik jk how active country i is as a sender in community k how active country j is as a receiver in community k

Poisson matrix factorization Y Pois (1) (2)

Poisson matrix factorization Y Pois (1) (2) Fitting this model is a form of nonnegative matrix factorization: Y (1) N K 2 R + (2) K N 2 R +

Poisson matrix factorization Australia New Zealand Australia New Zealand China Vietnam Cambodia Laos China Vietnam (1) 1 (2) 1 Cambodia Laos component 1

Poisson tensor factorization y ijat Poisson X k! (1) (2) (3) (4) ik jk ak tk

Poisson tensor factorization y ijat Poisson X k! (1) (2) (3) (4) ik jk ak tk how active country i is as a sender in multilateral relation k j how active country is as a receiver in multilateral relation k

Poisson tensor factorization how relevant is action type to multilateral relation k y ijat Poisson a X k! (1) (2) (3) (4) ik jk ak tk how active country i is as a sender in multilateral relation k j how active country is as a receiver in multilateral relation k

Poisson tensor factorization how relevant is action type to multilateral relation k y ijat Poisson a X k how active at time step t is multilateral relation k! (1) (2) (3) (4) ik jk ak tk how active country i is j how active country is as a sender in multilateral relation k as a receiver in multilateral relation k

Poisson tensor factorization y ijat Poisson X k! (1) (2) (3) (4) ik jk ak tk

Poisson tensor factorization y ijat Poisson X k! (1) (2) (3) (4) ik jk ak tk Fitting this model is a form of nonnegative tensor factorization:

Poisson tensor factorization y ijat Poisson X k! (1) (2) (3) (4) ik jk ak tk Fitting this model is a form of nonnegative tensor factorization: (1) N K 2 R + sender factors Y (2) (3) 2 R + N K receiver factors 2 R + A K action type factors (4) 2 R + T K time step factors

Sample component time-steps senders receivers action types

Sample component: Yugoslav Wars Components correspond to multilateral relations

Parameter estimation: Bayesian inference y ijat Poisson X k (1) ik (2) jk (3) ak (4) tk!

Parameter estimation: Bayesian inference y ijat Poisson X k (1) ik (2) jk (3) ak (4) tk! (1) ik (2) jk Gamma, (1) Gamma, (2) (3) ak Gamma, (3) (4) tk Gamma, (4)

Parameter estimation: Bayesian inference y ijat Poisson X k (1) ik (2) jk (3) ak (4) tk! (1) ik (2) jk Gamma, (1) Gamma, (2) (3) ak Gamma, (3) (4) tk Gamma, (4) } Sparsity-inducing Gamma priors < 1

Parameter estimation: Bayesian inference P (1), (2), (3), (4) Y

Parameter estimation: Bayesian inference P (1), (2), (3), (4) Y cannot compute this analytically

Parameter estimation: Variational inference P (1), (2), (3), (4) Y

Parameter estimation: Variational inference P (1), (2), (3), (4) Y cannot compute this analytically

Parameter estimation: Variational inference P (1), (2), (3), (4) Y cannot compute this analytically Define a convenient family of distributions: Q (1), (2), (3), (4)

Parameter estimation: Variational inference P (1), (2), (3), (4) Y cannot compute this analytically Define a convenient family of distributions: Q (1), (2), (3), (4) Optimize the parameters of Q to minimize: KL(Q P )

Parameter estimation: Variational inference (m)

Parameter estimation: Variational inference ˆ (m) ik

Parameter estimation: Variational inference Q( (m) ik )

Parameter estimation: Variational inference Q( (m) ik ) = Gamma (m) ik ; (m) ik, (m) ik

Parameter estimation: Variational inference Q( (m) ik ) = Gamma (m) ik ; (m) ik, (m) ik { variational parameters

Parameter estimation: Variational inference Q( (m) ik ) = Gamma (m) ik ; (m) ik, (m) ik After inference, if we need a point estimate: ˆ (m) ik := E Q [ (m) ik ]= ik (m) (m) ik

Parameter estimation: Variational inference Q( (m) ik ) = Gamma (m) ik ; (m) ik, (m) ik After inference, if we need a point estimate: ˆ (m) ik := G Q [ (m) ik ik )) (m) ik ]=exp( ( (m)

Parameter estimation: Variational inference

Parameter estimation: Variational inference Bayesian PTF generalizes much better than maximum likelihood PTF when the data is very sparse!

Parameter estimation: Variational inference Bayesian PTF generalizes much better than maximum likelihood PTF when the data is very sparse! No sacrifice in efficiency!

Code and more sample results available: https://github.com/aschein/bptf

Our favorite component 0.40 0.35 0.30 0.25 time steps senders receivers action types 0.20 0.15 0.10 0.05 0.00 Jan 2011 Mar 2011 May 2011 Jun 2011 Aug 2011 Sep 2011 Nov 2011 Jan 2012 Feb 2012 Apr 2012 May 2012 Jul 2012 Sep 2012 Oct 2012 Dec 2012 0.45 0.40 0.35 0.30 0.25 0.20 0.15 0.10 0.05 0.00 Ecuador United Kingdom United States Sweden Australia Spain Russia Japan Belarus Cuba Ecuador United Kingdom United States Sweden Australia Russia Belarus Spain Paraguay Europe Consult Aid Appeal Make Statement Coerce Disapprove Intend to Cooperate Cooperate (Diplomatic) Reject Cooperate (Material)

Our favorite component 0.40 0.35 0.30 0.25 time steps senders receivers action types 0.20 0.15 0.10 0.05 0.00 Jan 2011 Mar 2011 May 2011 Jun 2011 Aug 2011 Sep 2011 Nov 2011 Jan 2012 Feb 2012 Apr 2012 May 2012 Jul 2012 Sep 2012 Oct 2012 Dec 2012 0.45 0.40 0.35 0.30 0.25 0.20 0.15 0.10 0.05 0.00 Ecuador United Kingdom United States Sweden Australia Spain Russia Japan Belarus Cuba Ecuador United Kingdom United States Sweden Australia Russia Belarus Spain Paraguay Europe Consult Aid Appeal Make Statement Coerce Disapprove Intend to Cooperate Cooperate (Diplomatic) Reject Cooperate (Material)

Our favorite component 0.40 0.35 0.30 0.25 time steps senders receivers action types 0.20 0.15 0.10 0.05 0.00 Jan 2011 Mar 2011 May 2011 Jun 2011 Aug 2011 Sep 2011 Nov 2011 Jan 2012 Feb 2012 Apr 2012 May 2012 Jul 2012 Sep 2012 Oct 2012 Dec 2012 0.45 0.40 0.35 0.30 0.25 0.20 0.15 0.10 0.05 0.00 Ecuador United Kingdom United States Sweden Australia Spain Russia Japan Belarus Cuba Ecuador United Kingdom United States Sweden Australia Russia Belarus Spain Paraguay Europe Consult Aid Appeal Make Statement Coerce Disapprove Intend to Cooperate Cooperate (Diplomatic) Reject Cooperate (Material)

Thanks! Come to poster 897