Mining Triadic Closure Patterns in Social Networks

Similar documents
Complex-Network Modelling and Inference

An Introduction to Exponential-Family Random Graph Models

Modeling of Growing Networks with Directional Attachment and Communities

Influence Maximization in Dynamic Social Networks

RaRE: Social Rank Regulated Large-scale Network Embedding

Interact with Strangers

Power Laws & Rich Get Richer

Web Structure Mining Nodes, Links and Influence

Networks as vectors of their motif frequencies and 2-norm distance as a measure of similarity

A LINE GRAPH as a model of a social network

CS 277: Data Mining. Mining Web Link Structure. CS 277: Data Mining Lectures Analyzing Web Link Structure Padhraic Smyth, UC Irvine

Inferring Friendship from Check-in Data of Location-Based Social Networks

MobiHoc 2014 MINIMUM-SIZED INFLUENTIAL NODE SET SELECTION FOR SOCIAL NETWORKS UNDER THE INDEPENDENT CASCADE MODEL

Analysis & Generative Model for Trust Networks

Modeling population growth in online social networks

New Directions in Computer Science

Reducing Computation Time for the Analysis of Large Social Science Datasets

Exploring the Patterns of Human Mobility Using Heterogeneous Traffic Trajectory Data

Opinion Dynamics on Triad Scale Free Network

Believe it Today or Tomorrow? Detecting Untrustworthy Information from Dynamic Multi-Source Data

A Bivariate Point Process Model with Application to Social Media User Content Generation

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Link Prediction. Eman Badr Mohammed Saquib Akmal Khan

A New Space for Comparing Graphs

Temporal patterns in information and social systems

Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data

Analysis of Multiview Legislative Networks with Structured Matrix Factorization: Does Twitter Influence Translate to the Real World?

College Algebra. Word Problems

Introduction to Forecasting

Efficient Respondents Selection for Biased Survey using Online Social Networks

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018

Analysis of Algorithms. Unit 5 - Intractable Problems

Methods and tools for semantic social network analysis

A Framework of Detecting Burst Events from Micro-blogging Streams

Similarity Measures for Link Prediction Using Power Law Degree Distribution

Online Advertising is Big Business

Review. Gibbs sampling. Relational learning (properties of sets of entities) MH with proposal. failure mode: lock-down

StructInf: Mining Structural Influence from Social Streams

Random Effects Models for Network Data

11 : Gaussian Graphic Models and Ising Models

Beyond correlation networks

Prediction of Citations for Academic Papers

Finding Love in a Hopeless Place Applying Graphical Analysis to Advance the Frontier of Matching Algorithms

Supporting Statistical Hypothesis Testing Over Graphs

DS504/CS586: Big Data Analytics Graph Mining II

DS504/CS586: Big Data Analytics Graph Mining II

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

Stat 315c: Introduction

Beating Social Pulse: Understanding Information Propagation via Online Social Tagging Systems 1

Online Advertising is Big Business

Spam ain t as Diverse as It Seems: Throttling OSN Spam with Templates Underneath

ECEN 689 Special Topics in Data Science for Communications Networks

6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search

Socio-spatial Properties of Online Location-based Social Networks

Yahoo! Labs Nov. 1 st, Liangjie Hong, Ph.D. Candidate Dept. of Computer Science and Engineering Lehigh University

Click Prediction and Preference Ranking of RSS Feeds

Lesson 3 - Linear Functions

Ch. 12: Workload Forecasting

Social and Technological Network Analysis. Lecture 11: Spa;al and Social Network Analysis. Dr. Cecilia Mascolo

Joint user knowledge and matrix factorization for recommender systems

Point-of-Interest Recommendations: Learning Potential Check-ins from Friends

Facebook Friends! and Matrix Functions

6.207/14.15: Networks Lecture 12: Generalized Random Graphs

arxiv: v1 [physics.soc-ph] 15 Dec 2009

Predicting New Search-Query Cluster Volume

Incorporating Social Context and Domain Knowledge for Entity Recognition

Friends or Foes: Detecting Dishonest Recommenders in Online Social Networks

Kristina Lerman USC Information Sciences Institute

Random Walk Based Algorithms for Complex Network Analysis

Data Mining Techniques

Modeling Dynamic Evolution of Online Friendship Network

What is this Page Known for? Computing Web Page Reputations. Outline

Degree Distribution: The case of Citation Networks

Why does attention to web articles fall with time?

How does Google rank webpages?

Densest subgraph computation and applications in finding events on social media

CS246 Final Exam, Winter 2011

Kernel methods, kernel SVM and ridge regression

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Dynamic modeling of organizational coordination over the course of the Katrina disaster

arxiv:cond-mat/ v1 3 Sep 2004

Multimedia analysis and retrieval

Link Prediction with Personalized Social Influence

Quadratic Application Problems

Department of Computer Science, Guiyang University, Guiyang , GuiZhou, China

Social Interaction Based Video Recommendation: Recommending YouTube Videos to Facebook Users

Recommendation Systems

A Continuous-Time Model of Topic Co-occurrence Trends

Mining Topic-level Opinion Influence in Microblog

Uncovering News-Twitter Reciprocity via Interaction Patterns

Mining Newsgroups Using Networks Arising From Social Behavior by Rakesh Agrawal et al. Presented by Will Lee

Deterministic Decentralized Search in Random Graphs

SOUTH COAST COASTAL RECREATION METHODS

Blog Community Discovery and Evolution

Distributed ML for DOSNs: giving power back to users

Online Social Networks and Media. Link Analysis and Web Search

Reductionist View: A Priori Algorithm and Vector-Space Text Retrieval. Sargur Srihari University at Buffalo The State University of New York

Understanding Social Characteristic from Spatial Proximity in Mobile Social Network

Small-world structure of earthquake network

EVOLUTION is manifested to be a common property of. Interest-aware Information Diffusion in Evolving Social Networks

Transcription:

Mining Triadic Closure Patterns in Social Networks Hong Huang, University of Goettingen Jie Tang, Tsinghua University Sen Wu, Stanford University Lu Liu, Northwestern University Xiaoming Fu, University of Goettingen

Networked World 1.26 billion users 700 billion minutes/month 280 million users 80% of users are 80-90 s 555 million users.5 billion tweets/day 560 million users influencing our daily life 79 million users per month 9.65 billion items/year 500 million users 35 billion on 11/11 800 million users ~50% revenue from network life 2/17

A Trillion Dollar Opportunity Social networks already become a bridge to connect our daily physical life and the virtual web space On2Off [1] [1] Online to Offline is trillion dollar business http://techcrunch.com/2010/08/07/why-online2offline-commerce-is-a-trillion-dollar-opportunity/ 3

Triangle Laws Real social networks have a lot of triangles Friends of friends are friends Any patterns? 2X the friends, 2X the triangles? B A C How many different structured triads can we have? Christos Faloutsos s keynote speech on Apr.9 4

Triads in networks 0 1 2 3 4 5 6 7 8 9 10 11 12 Milo R, Itzkovitz S, Kashtan N, et al.. Superfamilies of evolved and designed networks. Science, 2004 5

Triads in networks 0 1 2 3 4 5 6 7 8 9 10 11 12 Milo R, Itzkovitz S, Kashtan N, et al.. Superfamilies of evolved and designed networks. Science, 2004 6

Open Triad to Triadic Closure Open Triad Closed Triad 7

Open Triad to Triadic Closure Open Triad Closed Triad However, the formation mechanism is not clear 8

Problem Formalization B t 1 t 2 Given network G t = V, E, Y T are candidate open triad: Goal: Predict the formation of triadic closure A C f: ( G t, Y t, X t t=1, T) Y T+1 t 3 t 3 > t 2 > t 1 9

Dataset 44 thousand newly formed closed triads per day 360 thousand new links Time span: Aug 29 th, 2012 Sep 29 th, 2012 Weibo 200 out-degree per user 700 thousand nodes 400 million following links 10

Observation - Network Topology Y-axis: probability that each open triad forms triadic closures 11

Observation - Demography L(A, B) means A and B are from the same city 0 female; 1 male e.g., 001 means A and B are female while C is male.

Observation - Demography L(A, B) means A and B are from the same city 0 female; 1 male e.g., 001 means A and B are female while C is male.

Observation - Social Role 0 ordinary user 1 opinion leader (top 1% PageRank) e.g., 001 means A and B are ordinary user while C is opinion leader. 14

Summary Intuitions: Men are more inclined to form triadic closure Triads of opinion leaders themselves are more likely to be closed. Correlation 15

Considered the intuitions and correlations THE PROPOSED MODEL AND RESULTS 16

Triad Factor Graph (TriadFG) Model Correlation Factor h Latent Variable Input network Map candidate open triads Example: to nodes in model Model Whether three users come from the same place? Attribute factor f 17

Solution Given a network G = {V, E, X, Y} Objective function: φ θ = logp θ (Y X, G) P Y X, G P X Y P Y G Tr d = 1 exp { α Z j f j x ij, y i } 1 i=1 j=1 attribute factor f θ = ( α j, {μ k }) 1 exp { μ Z k h k (Y Trc )} 2 c k Correlation factor h 18

Learning Algorithm Lou T, Tang J, Hopcroft J, et al. Learning to predict reciprocity and triadic closure in social networks[j]. ACM Transactions on Knowledge Discovery from Data (TKDD), 2013, 7(2): 5. 19

Results on the Weibo data Baselines: SVM, Logistic Algorithm Precision Recall F1 Accuracy SVM 0.890 0.844 0.866 0.882 Logistic 0.882 0.913 0.897 0.885 TriadFG 0.901 0.953 0.926 0.931 20

Factor Contribution Analysis Demography(D) Popularity(S) Network Topology(N) Structural hole (H) 21

Conclusion Problem: Triadic closure formation prediction B Observations Network Topology Demography A? C Social Role Solution: TriadFG model Future work 22

23

Attribute factor Definition Feature Function Network topology Is open triad 5/2/0/4/1/3 1/1/1/1/0/0 Demography A,B,C from the same place 1 A,C from the same place 1 C is female 1 B is female 1 Social role A/B/C is popular user 1/0/1 A,B,C are all popular user 1 Two users are popular 1 One user is popular 1 A/B/C is structural hole spanner 1/0/1 Two users are structural hole spanner 1 One user is structural hole spanner 1 25

Structural hole When two separate clusters possess nonredundant information, there is said to be a structural hole between them 26

Observation - Social Role 0 ordinary user; 1 opinion 0 ordinary user; 1 structural leader hole spanner e.g., 001 means A and B are e.g., 001 means A and B are ordinary user while C is opinion ordinary user while C is leader. structural hole spanner. Lou T, Tang J. Mining structural hole spanners through information diffusion in social networks, 27 www2013

Popular users in Weibo vs. Twitter The rich get richer (Both) P 1XX > P(0XX), validates preferential attachment In twitter, popular users functions in triadic closure formation, while in Weibo reverse In Twitter, P X1X > P(X0X) In Weibo, ordinary users have more chances to connect other users. Popular users in China are more close 28

Qualitative Case Study 29