Jure Leskovec Stanford University

Size: px
Start display at page:

Download "Jure Leskovec Stanford University"

Transcription

1 Jure Leskovec Stanford University

2 2 Part 1: Models for networks Part 2: Information flows in networks Information spread in social media

3 3 Information flows through networks Analyzing underlying mechanisms for the real-time spread of information through on-line networks Motivating questions: How to track messages that spread through networks? How to predict the spread of information? How to identify networks over which the messages spread?

4 4 What are basic units of information? Would like to track units of information that: correspond to pieces of information: events, articles, vary over the order of days, and can be handled at large scale Ideas: (1) Cascading links to articles Textual fragments: (2) URLs and hashtags on Twitter (3) Phrases inside quotes: Engadget BBC Slashdot Obscure tech story Small tech blog NYT Wired CNN

5 [SDM 07] 8/11/2012 Jure Leskovec KDD Summer School Posts Blogs Time ordered hyperlinks Information cascade Identify cascades graphs induced by a time ordered propagation of hyper-links [Adamic-Adar 05] [SDM 07]

6 [SDM 07] 8/11/2012 Jure Leskovec KDD Summer School Cascade shapes (ordered by decreasing frequency) 10 million posts and 350,000 cascades Cascades are mainly stars (wide and bushy trees) Interesting relation between the cascade frequency and structure

7 7 URLs and re-tweets: Explicit information diffusion mechanism on Twitter B sees A s tweet and forwards it to its follower by re-tweeting follows A information By following re-tweet (URL, #tag) cascades we can establish the information flow B

8 [KDD 09] 8/11/2012 Jure Leskovec KDD Summer School Meme-tracking: Extract textual fragments that travel relatively unchanged, through many articles: Look for phrases inside quotes: About 1.25 quotes per document in Spinn3r data Why it works? Quotes are integral parts of journalistic practices tend to follow iterations of a story as it evolves are attributed to individuals and have time and location

9 [KDD 09] Quote: Our opponent is someone who sees America, it seems, as being so imperfect, imperfect enough that he s palling around with terrorists who would target their own country. 8/11/2012 Jure Leskovec (@jure), KDD Summer School

10 [KDD 09] Goal: Find mutational variants of a quote Objective: In a DAG of approx. quote inclusion, delete min total edge weight s.t. each component has a single sink Our basic units are quotes Length 4, freq. 10 Gives 22M quotes DAG-partitioning is NP-hard but heuristics are effective: Gives ~35,000 non-trivial clusters BCD ABC CEF 8/11/2012 Jure Leskovec (@jure), KDD Summer School 2012 BDXCY ABCD ABXCE CEFP UVCEXF Nodes are phrases Edges are inclusion relations Edges have weights ABCEFG ABCDEFGH CEFPQR 10

11 [KDD 09] Volume over time of top 50 largest total clusters in Jul /11/2012 Jure Leskovec KDD Summer School

12 12 There are many other ways to trace information through networks: Trace chain letters [Liben-Nowell-Kleinberg, 08] Use text classifiers to predict whether there was information flow between two blog posts [Adar- Adamic, 05] Trace the spread Facebook Page Fans over the Facebook network [Sun et al. 09] Diffusion of favoriting a photo on Flickr [Cha et al. 09] Product recommendations [Leskovec et al. 06]

13

14 [WSDM 11] 14 Q: How does information attention rise and decay? [Wu-Huberman 07] [Szabo-Huberman, 08] Item i: Piece of information (e.g., quote, url, hashtag) Volume x i (t): # of times i was mentioned at time t Volume = number of mentions = attention = popularity Q: What are typical classes of shapes of x i (t)?

15 15 Given: Volume of an item over time i.e., number of mentions of a quote over time Goal: Want to discover types of shapes of volume time series

16 [WSDM 11] 16 Goal: Cluster time series & find cluster centers Time series distance function needs to be: 200 x i (t) 500 x j (t) t (time) Invariance to scaling, a t (time) x i (t) x k (t) 10 t (time) 50 t (time) Invariance to translation, q d( x i, x j ) = min a, q ( ) x ( t) a x ( t q) K-Spectral Centroid clustering [WSDM 11] t i j 2

17 [WSDM 11] 17 Newspaper Pro Blog TV Agency Blogs Quotes: 1 year, 172M docs, 343M quotes Same 6 shapes for Twitter: 580M tweets, 8M #tags Same shapes also for query popularity [Kulkarni et al. 11]

18 18 How much attention will information get? How many sites mention information at particular time? Traditional view: In a network nodes spread information to their neighbors Problem: a The network may be unknown Idea: Predict the future number c of mentions based on who got e infected in the past volume time b? now d

19 19 How much attention will information get? Who reports the information and when? 1h: Gizmodo, Engadget, Wired 2h: Reuters, Associated Press 3h: New York Times, CNN Q: How many sites will mention the information at hour 4, 5,...? Motivating question: If NYT mentions info at time t How many additional mentions does this generate (on other sites) at time t+1, t+2,?

20 20 Idea: Predict the volume based on who mentioned in the past Solution: Linear Influence Model (LIM) Model the global influence of each node Predict future volume from node influences Advantages: No knowledge of network needed Contagion can jump between the nodes

21 21 t M(t) x(t) 1 u, w 2 2 v, x, y 3 3? K=1 contagion: x t volume (number of new mentions) at time t M t set of newly infected nodes at time t A t = t <t M(t ) nodes infected before t How does LIM predict the future number of infections x t + 1? Each node u has an influence function I u (t): After node u gets infected, how many other nodes tend to get infected Predict future volume using the influence functions of nodes infected in the past

22 [ICDM 10] 22 Node u has an influence function I u (t): I u (t): After node u gets mentions, how many other nodes tend to mention t hours later I u (t) e.g.: Influence function of CNN: How many sites say the info after CNN says it? Estimate the influence function from past data How to predict future volume x i (t + 1)? Predict future volume using the influence functions of nodes infected in the past t

23 [ICDM 10] 23 The LIM model: Volume x i (t) of i at time t A i (t) a set of nodes that mentioned i before time t And let: I u (t): influence function of u t u : time when u mentioned i Volume Predict future volume as a sum of influences: x ( t + 1) = I ( t t ) i u A i u ( t) u I u t u t v t w I v I w

24 [ICDM 10] I u After node u mentions the info, I u (t) other mentions tend to occur t hours later I u (t) is not observable, need to estimate it We make no assumption about the shape of I u (t) Want to set influence functions I u (t) such that we minimize the error: x i( t + 1) Iu ( t tu ) i t u A ( t) 8/11/2012 Jure Leskovec (@jure), KDD Summer School 2012 i 2 24

25 [ICDM 10] 25 Discrete non-parametric influence functions: Discrete time units I u (t) non-negative vector of length L I u (t) = [I u (1), I u (2), I u (3),, I u (L)] I u Find I u (t) by solving a least-squares-like problem: L min I u, u x + i( t 1) Iu( t tu) i t u Ai ( t) 2

26 [ICDM 10] 26 Take top 1,000 quotes by the total volume: Total 372,000 mentions on 16,000 websites Build LIM on 100 highest-volume websites x i (t) number of mentions across 16,000 websites A i (t) which of 100 sites mentioned quote i and when Improvement in L2-norm over 1-time lag predictor Bursty phrases Steady phrases Overall AR 7.21% 8.30% 7.41% ARMA 6.85% 8.71% 7.75% LIM (N=100) 20.06% 6.24% 14.31%

27 [ICDM 10] 27 Influence functions give insights: Q: NYT writes a post on politics, how many people tend to mention it next day? A: Influence function of NYT for political phrases! Experimental setup: 5 media types: Newspapers, Pro Blogs, TVs, News agencies, Blogs 6 topics: Politics, nation, entertainment, business, technology, sports For all phrases in the topic, estimate average influence function by media type

28 [ICDM 10] 28 Influence functions: Business (and politics) are driven by mainstream media Entertainment (and sports) are driven by blogs and TV Newspapers and news agencies do not influence the volume Business Entertainment

29

30 [KDD 10, NIPS 10] 30 But how does information really spread? We only see the infection but not the edge Can we reconstruct a (hidden) network?

31 [KDD 10, NIPS 10] 31 There is a hidden diffusion network: a c e We only see the times C when nodes get infected : Cascade c 1 = {(a, 1), (c, 2), (b, 3), (e, 4)} Cascade c 2 = {(c, 1), (a, 4), (b, 5), (d, 6)} b Want to infer who-infects-whom network! d

32 [KDD 10, NIPS 10] 32 Process We observe It s hidden Virus propagation Viruses propagate through the network We only observe when people get sick But NOT who infected whom Word of mouth & Viral marketing Recommendations and influence propagate We only observe when people buy products But NOT who influenced whom Can we infer the underlying network?

33 [KDD 10] 33 Goal: Find a graph G that best explains the observed infection times C = { cc = {(u, t)} } Given a graph G, define the likelihood P(C G) of cascades C occurring in G More precisely, we need: P c (u, v) = f(t u, t v ) diffusion model prob. that u infects v in cascade c P(c T) prob. that c spread in a particular tree pattern T P(c G) prob. that cascade c occurred in G P(C G) prob. that a set of cascades C occurred in G a c e b d

34 34 Why is inferring G hard: (1) Given a single G: How to efficiently compute P(C G)? (2) Now that we can evaluate a single G How to efficiently find G * that maximizes P(C G)? There are O(2 N*N ) different graphs!! N number of nodes

35 [KDD 10] 35 Consider 1 cascade: The model Cascade reaches node u at time t u, and spreads to u s neighbors v: With prob. β cascade propagates along edge (u, v) and infects node v some Δ time units later: t v = t u + Δ More formally: P c u, v β f(t v t u ) Incubation time: f t v t u e Δ t if Δ t > 0,else ε ε captures influence external to the network At any time a node can get externally infected with prob. ε β a c ε e tv ε b ε d

36 [KDD 10] 36 Given node infection times and tree T: c = { (a, 1), (c, 2), (b, 3), (e, 4) } T T = { a b, a c, b e } a b Prob. that c propagates in pattern T c e d Graph G Edges that propagated Edges that failed to propagate Approximate it as:

37 [KDD 10] 37 How likely is c to spread in graph G? c = {(a, 1), (c, 2), (b, 3), (e, 4)} a b a b a b c d c d c d e Need to consider all possible ways for c to spread in G (i.e., all spanning trees T): e e Consider the most likely propagation tree

38 [KDD 10] 38 Score of a graph G for a set of cascades C: We assume cascades c Care independent Want to find the best graph G : The problem is NP-hard: MAX-k-COVER [KDD 10]

39 [KDD 10] 39 Greedy hill-climbing algorithm to max P(C G): Start with empty G 0 (G with no edges) Add k edges (k is a parameter) At every step i + 1 add an edge e to G i that maximizes the marginal improvement arg max e G i P C G i e P(C G i ) Add e to the graph: G i+1 = G i e

40 40 Theorem [KDD 10]: Function P(C G) is monotonic, and submodular in the edges of G F C (A {x}) F C (A) F C (B {x}) F C (B) Gain of adding an edge to a small graph Gain of adding an edge to a large graph A B VxV Consequence: Greedy hill-climbing gives near optimal solution Approximation guarantee (~0.63 of OPT)

41 41 Memetracker data: Aug 08 Sept million news and blog articles Extract 343 million phrases Record times t i (w) when site w mentions quote i Given times when sites mention quotes Infer the network of information diffusion: Who tends to copy (repeat after) whom

42 [KDD 10] 5,000 news sites: Blogs Mainstream media 8/11/2012 Jure Leskovec KDD Summer School

43 [KDD 10] Blogs Mainstream media 8/11/2012 Jure Leskovec KDD Summer School

44 [KDD 07] Want to read things before others do. Detect blue & yellow soon but miss red. Detect all stories but late. 8/11/2012 Jure Leskovec KDD Summer School

45 8/11/2012 Jure Leskovec KDD Summer School Given a budget (e.g., of 3 blogs) Select sites to cover the most of the network Blogosphere

46 Question: Which websites should one read to catch big stories? Idea: Each blog covers part of the network Each dot is a blog Proximity is based on the number of common cascades 8/11/2012 Jure Leskovec (@jure), KDD Summer School

47 8/11/2012 Jure Leskovec KDD Summer School Which blogs to read to be most up to date? Our solution % of stories detected (higher is better) In-links Out-links # posts (used by Technorati) Random Number of selected blogs

48 8/11/2012 Jure Leskovec KDD Summer School Messages arriving through networks from real-time sources requires new ways of thinking about information dynamics and consumption: Tracking information through (implicit) networks Quantify the dynamics of online media Predict the diffusion of information And infer networks of information diffusion

49 49 A framework for tracking information through the news and social networks, to quantify the dynamics of social media Predicting the diffusion of information Inferring networks of diffusion and influence: Many possible further directions: Epidimiology Viral Marketing

50 8/11/2012 Jure Leskovec KDD Summer School [KDD 09] Meme-tracking and the Dynamics of the News Cycle, by J. Leskovec, L. Backstrom, J. Kleinberg. KDD, [WSDM 11] Patterns of Temporal Variation in Online Media by J. Yang, J. Leskovec. ACM International Conference on Web Search and Data Minig (WSDM), [ICDM 10] Modeling Information Diffusion in Implicit Networks by J. Yang, J. Leskovec. IEEE International Conference On Data Mining (ICDM), [KDD 10] Inferring Networks of Diffusion and Influence by M. Gomez-Rodriguez, J. Leskovec, A. Krause. KDD, [NIPS 10] On the Convexity of Latent Social Network Inference by S. A. Myers, J. Leskovec. Neural Information Processing Systems (NIPS), [KDD 07] Cost-effective Outbreak Detection in Networks by J. Leskovec, A. Krause, C. Guestrin, C. Faloutsos, J. VanBriesen, N. Glance. KDD [SDM 07] Cascading Behavior in Large Blog Graphs by J. Leskovec, M. McGlohon, C. Faloutsos, N. Glance, M. Hurst. SDM,

51 51 [Adar-Adamic 05] E. Adar and L. A. Adamic, Tracking Information Epidemics in Blogspace, Web Intelligence, 2005 [Goyal et al. 10] Goyal, A., Bonchi, F. and Lakshmanan, L.V.S., Learning influence probabilities in social networks, WSDM '10 [Kulkarni et al. 11] A. Kulkarni, J. Teevan, J. M. Svore, and S. T. Dumais, Understanding Temporal Query Dynamics, in WSDM [LibenNowell-Kleinberg 08] D. Liben-Nowell and J. Kleinberg, Tracing the Flow of Information on a Global Scale Using Internet Chain-Letter Data, PNAS [Cha et al. 09] Cha, M. and Mislove, Alan and Gummadi, Krishna P., A measurement-driven analysis of information propagation in the flickr social network. In WWW '09. [De Choudhury 10] De Choudhury, M., Lin, Y.-R., Sundaram, H., Candan, K.S., Xie, L. & Kelliher, A. How Does the Data Sampling Strategy Impact the Discovery of Information Diffusion in Social Media? ICWSM '10 Gruhl, D., Guha, R., Liben-Nowell, D. & Tomkins, A. Information Diffusion Through Blogspace. WWW 2004.

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu 12/3/2013 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu 10/24/2012 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

More information

CS224W: Analysis of Networks Jure Leskovec, Stanford University

CS224W: Analysis of Networks Jure Leskovec, Stanford University Announcements: Please fill HW Survey Weekend Office Hours starting this weekend (Hangout only) Proposal: Can use 1 late period CS224W: Analysis of Networks Jure Leskovec, Stanford University http://cs224w.stanford.edu

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu Find most influential set S of size k: largest expected cascade size f(s) if set S is activated

More information

Greedy Maximization Framework for Graph-based Influence Functions

Greedy Maximization Framework for Graph-based Influence Functions Greedy Maximization Framework for Graph-based Influence Functions Edith Cohen Google Research Tel Aviv University HotWeb '16 1 Large Graphs Model relations/interactions (edges) between entities (nodes)

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu How to organize/navigate it? First try: Human curated Web directories Yahoo, DMOZ, LookSmart

More information

Influence Maximization in Continuous Time Diffusion Networks

Influence Maximization in Continuous Time Diffusion Networks Manuel Gomez-Rodriguez 1,2 Bernhard Schölkopf 1 1 MPI for Intelligent Systems and 2 Stanford University MANUELGR@STANFORD.EDU BS@TUEBINGEN.MPG.DE Abstract The problem of finding the optimal set of source

More information

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Web Search: How to Organize the Web? Ranking Nodes on Graphs Hubs and Authorities PageRank How to Solve PageRank

More information

CS 322: (Social and Information) Network Analysis Jure Leskovec Stanford University

CS 322: (Social and Information) Network Analysis Jure Leskovec Stanford University CS 322: (Social and Inormation) Network Analysis Jure Leskovec Stanord University Initially some nodes S are active Each edge (a,b) has probability (weight) p ab b 0.4 0.4 0.2 a 0.4 0.2 0.4 g 0.2 Node

More information

CSCI 3210: Computational Game Theory. Cascading Behavior in Networks Ref: [AGT] Ch 24

CSCI 3210: Computational Game Theory. Cascading Behavior in Networks Ref: [AGT] Ch 24 CSCI 3210: Computational Game Theory Cascading Behavior in Networks Ref: [AGT] Ch 24 Mohammad T. Irfan Email: mirfan@bowdoin.edu Web: www.bowdoin.edu/~mirfan Course Website: www.bowdoin.edu/~mirfan/csci-3210.html

More information

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Web Search: How to Organize the Web? Ranking Nodes on Graphs Hubs and Authorities PageRank How to Solve PageRank

More information

Learning Influence Functions from Incomplete Observations

Learning Influence Functions from Incomplete Observations Learning Influence Functions from Incomplete Observations Xinran He Ke Xu David Kempe Yan Liu University of Southern California, Los Angeles, CA 90089 {xinranhe, xuk, dkempe, yanliu.cs}@usc.edu Abstract

More information

Predicting Trends in Social Networks via Dynamic Activeness Model

Predicting Trends in Social Networks via Dynamic Activeness Model Predicting Trends in Social Networks via Dynamic Activeness Model Shuyang Lin Xiangnan Kong Philip S. Yu Department of Computer Science Computer Science Department University of Illinois at Chicago King

More information

Maximizing the Spread of Influence through a Social Network. David Kempe, Jon Kleinberg, Éva Tardos SIGKDD 03

Maximizing the Spread of Influence through a Social Network. David Kempe, Jon Kleinberg, Éva Tardos SIGKDD 03 Maximizing the Spread of Influence through a Social Network David Kempe, Jon Kleinberg, Éva Tardos SIGKDD 03 Influence and Social Networks Economics, sociology, political science, etc. all have studied

More information

Characterizing Information Diffusion in Online Social Networks with Linear Diffusive Model

Characterizing Information Diffusion in Online Social Networks with Linear Diffusive Model Characterizing Information Diffusion in Online Social Networks with Linear Diffusive Model Feng Wang, Haiyan Wang, Kuai Xu, Jianhong Wu, Xiaohua Jia School of Mathematical and Natural Sciences, Arizona

More information

A note on modeling retweet cascades on Twitter

A note on modeling retweet cascades on Twitter A note on modeling retweet cascades on Twitter Ashish Goel 1, Kamesh Munagala 2, Aneesh Sharma 3, and Hongyang Zhang 4 1 Department of Management Science and Engineering, Stanford University, ashishg@stanford.edu

More information

Detecting Anti-majority Opinionists Using Value-weighted Mixture Voter Model

Detecting Anti-majority Opinionists Using Value-weighted Mixture Voter Model Detecting Anti-majority Opinionists Using Value-weighted Mixture Voter Model Masahiro Kimura, Kazumi Saito 2, Kouzou Ohara 3, and Hiroshi Motoda 4 Department of Electronics and Informatics, Ryukoku University

More information

Influence Maximization in Dynamic Social Networks

Influence Maximization in Dynamic Social Networks Influence Maximization in Dynamic Social Networks Honglei Zhuang, Yihan Sun, Jie Tang, Jialin Zhang and Xiaoming Sun Department of Computer Science and Technology, Tsinghua University Department of Computer

More information

Diffusion of Innovation and Influence Maximization

Diffusion of Innovation and Influence Maximization Diffusion of Innovation and Influence Maximization Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics

More information

Models of Communication Dynamics for Simulation of Information Diffusion

Models of Communication Dynamics for Simulation of Information Diffusion Models of Communication Dynamics for Simulation of Information Diffusion Konstantin Mertsalov, Malik Magdon-Ismail, Mark Goldberg Rensselaer Polytechnic Institute Department of Computer Science 11 8th

More information

CS224W: Analysis of Networks Jure Leskovec, Stanford University

CS224W: Analysis of Networks Jure Leskovec, Stanford University CS224W: Analysis of Networks Jure Leskovec, Stanford University http://cs224w.stanford.edu 10/30/17 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 2

More information

A Fast Approximation for Influence Maximization in Large Social Networks

A Fast Approximation for Influence Maximization in Large Social Networks A Fast Approximation for Influence Maximization in Large Social Networks ABSTRACT Jong-Ryul Lee Dept. of Computer Science, KAIST 29 Daehak-ro, Yuseong-gu, Daejeon, Korea jrlee@islab.kaist.ac.kr This paper

More information

Reconstructing Patterns of Information Diffusion from Incomplete Observations

Reconstructing Patterns of Information Diffusion from Incomplete Observations Reconstructing Patterns of Information Diffusion from Incomplete Observations Flavio Chierichetti Department of Computer Science Cornell University Ithaca, NY 14853 Jon Kleinberg Department of Computer

More information

MobiHoc 2014 MINIMUM-SIZED INFLUENTIAL NODE SET SELECTION FOR SOCIAL NETWORKS UNDER THE INDEPENDENT CASCADE MODEL

MobiHoc 2014 MINIMUM-SIZED INFLUENTIAL NODE SET SELECTION FOR SOCIAL NETWORKS UNDER THE INDEPENDENT CASCADE MODEL MobiHoc 2014 MINIMUM-SIZED INFLUENTIAL NODE SET SELECTION FOR SOCIAL NETWORKS UNDER THE INDEPENDENT CASCADE MODEL Jing (Selena) He Department of Computer Science, Kennesaw State University Shouling Ji,

More information

Beating Social Pulse: Understanding Information Propagation via Online Social Tagging Systems 1

Beating Social Pulse: Understanding Information Propagation via Online Social Tagging Systems 1 Journal of Universal Computer Science, vol. 18, no. 8 (2012, 1022-1031 submitted: 16/9/11, accepted: 14/12/11, appeared: 28/4/12 J.UCS Beating Social Pulse: Understanding Information Propagation via Online

More information

Jure Leskovec Joint work with Jaewon Yang, Julian McAuley

Jure Leskovec Joint work with Jaewon Yang, Julian McAuley Jure Leskovec (@jure) Joint work with Jaewon Yang, Julian McAuley Given a network, find communities! Sets of nodes with common function, role or property 2 3 Q: How and why do communities form? A: Strength

More information

Uncovering the Temporal Dynamics of Diffusion Networks

Uncovering the Temporal Dynamics of Diffusion Networks Manuel Gomez-Rodriguez,2 David Balduzzi Bernhard Schölkopf MPI for Intelligent Systems and 2 Stanford University MANUELGR@STANFORD.EDU BALDUZZI@TUEBINGEN.MPG.DE BS@TUEBINGEN.MPG.DE Abstract Time plays

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Lecture #9: Link Analysis Seoul National University 1 In This Lecture Motivation for link analysis Pagerank: an important graph ranking algorithm Flow and random walk formulation

More information

Why does attention to web articles fall with time?

Why does attention to web articles fall with time? Why does attention to web articles fall with time? M.V. Simkin and V.P. Roychowdhury Department of Electrical Engineering, University of California, Los Angeles, CA 90095-594 We analyze access statistics

More information

Active Learning and Optimized Information Gathering

Active Learning and Optimized Information Gathering Active Learning and Optimized Information Gathering Lecture 13 Submodularity (cont d) CS 101.2 Andreas Krause Announcements Homework 2: Due Thursday Feb 19 Project milestone due: Feb 24 4 Pages, NIPS format:

More information

Diffusion of information and social contagion

Diffusion of information and social contagion Diffusion of information and social contagion Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics

More information

Link Prediction. Eman Badr Mohammed Saquib Akmal Khan

Link Prediction. Eman Badr Mohammed Saquib Akmal Khan Link Prediction Eman Badr Mohammed Saquib Akmal Khan 11-06-2013 Link Prediction Which pair of nodes should be connected? Applications Facebook friend suggestion Recommendation systems Monitoring and controlling

More information

Cost and Preference in Recommender Systems Junhua Chen LESS IS MORE

Cost and Preference in Recommender Systems Junhua Chen LESS IS MORE Cost and Preference in Recommender Systems Junhua Chen, Big Data Research Center, UESTC Email:junmshao@uestc.edu.cn http://staff.uestc.edu.cn/shaojunming Abstract In many recommender systems (RS), user

More information

RaRE: Social Rank Regulated Large-scale Network Embedding

RaRE: Social Rank Regulated Large-scale Network Embedding RaRE: Social Rank Regulated Large-scale Network Embedding Authors: Yupeng Gu 1, Yizhou Sun 1, Yanen Li 2, Yang Yang 3 04/26/2018 The Web Conference, 2018 1 University of California, Los Angeles 2 Snapchat

More information

Overlapping Communities

Overlapping Communities Overlapping Communities Davide Mottin HassoPlattner Institute Graph Mining course Winter Semester 2017 Acknowledgements Most of this lecture is taken from: http://web.stanford.edu/class/cs224w/slides GRAPH

More information

Modeling, Analysis, and Control of Information Propagation in Multi-layer and Multiplex Networks. Osman Yağan

Modeling, Analysis, and Control of Information Propagation in Multi-layer and Multiplex Networks. Osman Yağan Modeling, Analysis, and Control of Information Propagation in Multi-layer and Multiplex Networks Osman Yağan Department of ECE Carnegie Mellon University Joint work with Y. Zhuang and V. Gligor (CMU) Alex

More information

9. Submodular function optimization

9. Submodular function optimization Submodular function maximization 9-9. Submodular function optimization Submodular function maximization Greedy algorithm for monotone case Influence maximization Greedy algorithm for non-monotone case

More information

Full Diffusion History Reconstruction in Networks

Full Diffusion History Reconstruction in Networks 205 IEEE International Conference on Big Data (Big Data) Full Diffusion History Reconstruction in Networks Zhen Chen, Hanghang Tong and Lei Ying School of Electrical, Computer and Energy Engineering School

More information

Learning the Nature of Information in Social Networks

Learning the Nature of Information in Social Networks Learning the Nature of Information in Social Networks Rakesh Agrawal Michalis Potamias Evimaria Terzi Microsoft Research Groupon Boston University rakesha@microsoft.com michalis@groupon.com evimaria@cs.bu.edu

More information

Kristina Lerman USC Information Sciences Institute

Kristina Lerman USC Information Sciences Institute Rethinking Network Structure Kristina Lerman USC Information Sciences Institute Università della Svizzera Italiana, December 16, 2011 Measuring network structure Central nodes Community structure Strength

More information

Info-Cluster Based Regional Influence Analysis in Social Networks

Info-Cluster Based Regional Influence Analysis in Social Networks Info-Cluster Based Regional Influence Analysis in Social Networks Chao Li,2,3, Zhongying Zhao,2,3,JunLuo, and Jianping Fan Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen

More information

MAE 298, Lecture 8 Feb 4, Web search and decentralized search on small-worlds

MAE 298, Lecture 8 Feb 4, Web search and decentralized search on small-worlds MAE 298, Lecture 8 Feb 4, 2008 Web search and decentralized search on small-worlds Search for information Assume some resource of interest is stored at the vertices of a network: Web pages Files in a file-sharing

More information

More spreaders but less influence: An analysis of indirect influence on Twitter

More spreaders but less influence: An analysis of indirect influence on Twitter More spreaders but less influence: An analysis of indirect influence on Twitter Xin Shuai a, Ying Ding b, Jerome Busemeyer c, Shanshan Chen c, Yuyin Sun b, Jie Tang d a School of Informatics and Computing,

More information

Online Social Networks and Media. Opinion formation on social networks

Online Social Networks and Media. Opinion formation on social networks Online Social Networks and Media Opinion formation on social networks Diffusion of items So far we have assumed that what is being diffused in the network is some discrete item: E.g., a virus, a product,

More information

Sparsification of Influence Networks

Sparsification of Influence Networks Sparsification of Influence Networks Michael Mathioudakis 1 mathiou@cs.toronto.edu Francesco Bonchi 2 bonchi@yahoo-inc.com Carlos Castillo 2 chato@yahoo-inc.com Aristides Gionis 2 gionis@yahoo-inc.com

More information

Finding Patterns in Blog Shapes and Blog Evolution

Finding Patterns in Blog Shapes and Blog Evolution Finding Patterns in Blog Shapes and Blog Evolution Mary McGlohon, Jure Leskovec, Christos Faloutsos Matthew Hurst, Natalie Glance January 2007 CMU-ML-07-100 Finding Patterns in Blog Shapes and Blog Evolution

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu Non-overlapping vs. overlapping communities 11/10/2010 Jure Leskovec, Stanford CS224W: Social

More information

Learning with Temporal Point Processes

Learning with Temporal Point Processes Learning with Temporal Point Processes t Manuel Gomez Rodriguez MPI for Software Systems Isabel Valera MPI for Intelligent Systems Slides/references: http://learning.mpi-sws.org/tpp-icml18 ICML TUTORIAL,

More information

Identification of Bursts in a Document Stream

Identification of Bursts in a Document Stream Identification of Bursts in a Document Stream Toshiaki FUJIKI 1, Tomoyuki NANNO 1, Yasuhiro SUZUKI 1 and Manabu OKUMURA 2 1 Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute

More information

6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search

6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search 6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search Daron Acemoglu and Asu Ozdaglar MIT September 30, 2009 1 Networks: Lecture 7 Outline Navigation (or decentralized search)

More information

On the Precision of Social and Information Networks

On the Precision of Social and Information Networks On the Precision of Social and Information Networks Kamesh Munagala (Duke) Reza Bosagh Zadeh (Stanford) Ashish Goel (Stanford) Aneesh Sharma(Twitter, Inc.) Information Networks Social Networks play an

More information

Dynamics of Real-world Networks

Dynamics of Real-world Networks Dynamics of Real-world Networks Thesis proposal Jurij Leskovec Machine Learning Department Carnegie Mellon University May 2, 2007 Thesis committee: Christos Faloutsos, CMU Avrim Blum, CMU John Lafferty,

More information

Unified Modeling of User Activities on Social Networking Sites

Unified Modeling of User Activities on Social Networking Sites Unified Modeling of User Activities on Social Networking Sites Himabindu Lakkaraju IBM Research - India Manyata Embassy Business Park Bangalore, Karnataka - 5645 klakkara@in.ibm.com Angshu Rai IBM Research

More information

Online Social Networks and Media. Link Analysis and Web Search

Online Social Networks and Media. Link Analysis and Web Search Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information

More information

Profit Maximization for Viral Marketing in Online Social Networks

Profit Maximization for Viral Marketing in Online Social Networks Maximization for Viral Marketing in Online Social Networks Jing TANG Interdisciplinary Graduate School Nanyang Technological University Email: tang0311@ntu.edu.sg Xueyan TANG School of Comp. Sci. & Eng.

More information

Modeling Indirect Influence on Twitter

Modeling Indirect Influence on Twitter Modeling Indirect Influence on Twitter Xin Shuai, Indiana University Bloomington, USA Ying Ding, Indiana University Bloomington, USA Jerome Busemeyer, Indiana University Bloomington, USA Shanshan Chen,

More information

Time sensitive influence maximization in social networks

Time sensitive influence maximization in social networks Time sensitive influence maximization in social networks Mohammadi, A, Saraee, MH and Mirzaei, A http://dx.doi.org/10.1177/0165551515602808 Title Authors Type URL Time sensitive influence maximization

More information

EM (cont.) November 26 th, Carlos Guestrin 1

EM (cont.) November 26 th, Carlos Guestrin 1 EM (cont.) Machine Learning 10701/15781 Carlos Guestrin Carnegie Mellon University November 26 th, 2007 1 Silly Example Let events be grades in a class w 1 = Gets an A P(A) = ½ w 2 = Gets a B P(B) = µ

More information

Conquering the Complexity of Time: Machine Learning for Big Time Series Data

Conquering the Complexity of Time: Machine Learning for Big Time Series Data Conquering the Complexity of Time: Machine Learning for Big Time Series Data Yan Liu Computer Science Department University of Southern California Mini-Workshop on Theoretical Foundations of Cyber-Physical

More information

Greedy Search in Social Networks

Greedy Search in Social Networks Greedy Search in Social Networks David Liben-Nowell Carleton College dlibenno@carleton.edu Joint work with Ravi Kumar, Jasmine Novak, Prabhakar Raghavan, and Andrew Tomkins. IPAM, Los Angeles 8 May 2007

More information

Supporting Statistical Hypothesis Testing Over Graphs

Supporting Statistical Hypothesis Testing Over Graphs Supporting Statistical Hypothesis Testing Over Graphs Jennifer Neville Departments of Computer Science and Statistics Purdue University (joint work with Tina Eliassi-Rad, Brian Gallagher, Sergey Kirshner,

More information

Web Structure Mining Nodes, Links and Influence

Web Structure Mining Nodes, Links and Influence Web Structure Mining Nodes, Links and Influence 1 Outline 1. Importance of nodes 1. Centrality 2. Prestige 3. Page Rank 4. Hubs and Authority 5. Metrics comparison 2. Link analysis 3. Influence model 1.

More information

Overlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach

Overlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach Overlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach Author: Jaewon Yang, Jure Leskovec 1 1 Venue: WSDM 2013 Presenter: Yupeng Gu 1 Stanford University 1 Background Community

More information

Time Constrained Influence Maximization in Social Networks

Time Constrained Influence Maximization in Social Networks Time Constrained Influence Maximization in Social Networks Bo Liu Facebook Menlo Park, CA, USA bol@fb.com Gao Cong, Dong Xu School of Computer Engineering Nanyang Technological University, Singapore {gaocong,

More information

ECS 253 / MAE 253, Lecture 15 May 17, I. Probability generating function recap

ECS 253 / MAE 253, Lecture 15 May 17, I. Probability generating function recap ECS 253 / MAE 253, Lecture 15 May 17, 2016 I. Probability generating function recap Part I. Ensemble approaches A. Master equations (Random graph evolution, cluster aggregation) B. Network configuration

More information

Back to the Past: Source Identification in Diffusion Networks from Partially Observed Cascades

Back to the Past: Source Identification in Diffusion Networks from Partially Observed Cascades Back to the Past: Source Identification in Diffusion Networks from Partially Observed Cascades Mehrdad Farajtabar Manuel Gomez-Rodriguez Nan Du Mohammad Zamani Hongyuan Zha Le Song Georgia Institute of

More information

Slides based on those in:

Slides based on those in: Spyros Kontogiannis & Christos Zaroliagis Slides based on those in: http://www.mmds.org High dim. data Graph data Infinite data Machine learning Apps Locality sensitive hashing PageRank, SimRank Filtering

More information

Hybrid Models for Text and Graphs. 10/23/2012 Analysis of Social Media

Hybrid Models for Text and Graphs. 10/23/2012 Analysis of Social Media Hybrid Models for Text and Graphs 10/23/2012 Analysis of Social Media Newswire Text Formal Primary purpose: Inform typical reader about recent events Broad audience: Explicitly establish shared context

More information

Uncovering the structure and temporal dynamics of information propagation

Uncovering the structure and temporal dynamics of information propagation Network Science : page of 4. c Cambridge University Press 24 doi:.7/nws.24.3 Uncovering the structure and temporal dynamics of information propagation MANUEL GOMEZ RODRIGUEZ Department of Empirical Inference,

More information

Communities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices

Communities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices Communities Via Laplacian Matrices Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices The Laplacian Approach As with betweenness approach, we want to divide a social graph into

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu Intro sessions to SNAP C++ and SNAP.PY: SNAP.PY: Friday 9/27, 4:5 5:30pm in Gates B03 SNAP

More information

Mining Newsgroups Using Networks Arising From Social Behavior by Rakesh Agrawal et al. Presented by Will Lee

Mining Newsgroups Using Networks Arising From Social Behavior by Rakesh Agrawal et al. Presented by Will Lee Mining Newsgroups Using Networks Arising From Social Behavior by Rakesh Agrawal et al. Presented by Will Lee wwlee1@uiuc.edu September 28, 2004 Motivation IR on newsgroups is challenging due to lack of

More information

Computing Competing Cascades on Signed Networks

Computing Competing Cascades on Signed Networks Noname manuscript No. (will be inserted by the editor Computing Competing Cascades on Signed Networks Ajitesh Srivastava Charalampos Chelmis Viktor K. Prasanna Received: date / Accepted: date Abstract

More information

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS DATA MINING LECTURE 3 Link Analysis Ranking PageRank -- Random walks HITS How to organize the web First try: Manually curated Web Directories How to organize the web Second try: Web Search Information

More information

Blog Community Discovery and Evolution

Blog Community Discovery and Evolution Blog Community Discovery and Evolution Mutual Awareness, Interactions and Community Stories Yu-Ru Lin, Hari Sundaram, Yun Chi, Junichi Tatemura and Belle Tseng What do people feel about Hurricane Katrina?

More information

Time-Critical Influence Maximization in Social Networks with Time-Delayed Diffusion Process

Time-Critical Influence Maximization in Social Networks with Time-Delayed Diffusion Process Time-Critical Influence Maximization in Social Networks with Time-Delayed Diffusion Process Wei Chen Microsoft Research Asia Beijing, China weic@microsoft.com Wei Lu University of British Columbia Vancouver,

More information

arxiv: v3 [physics.data-an] 15 Apr 2011

arxiv: v3 [physics.data-an] 15 Apr 2011 Modeling the Structure and Evolution of Discussion Cascades arxiv:.673v3 [physics.data-an] 5 Apr 2 ABSTRACT Vicenç Gómez Donders Institute for Brain Cognition and Behaviour Radboud University Nijmegen

More information

arxiv: v4 [cs.si] 21 May 2017

arxiv: v4 [cs.si] 21 May 2017 Submitted to manuscript 2017 No Time to Observe: Adaptive Influence Maximization with Partial Feedback Jing Yuan Shaojie Tang The University of Texas at Dallas arxiv:1609.00427v4 [cs.si] 21 May 2017 Although

More information

Online Social Networks and Media. Link Analysis and Web Search

Online Social Networks and Media. Link Analysis and Web Search Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information

More information

Machine Learning and Modeling for Social Networks

Machine Learning and Modeling for Social Networks Machine Learning and Modeling for Social Networks Olivia Woolley Meza, Izabela Moise, Nino Antulov-Fatulin, Lloyd Sanders 1 Spreading and Influence on social networks Computational Social Science D-GESS

More information

Robust Influence Maximization

Robust Influence Maximization Robust Influence Maximization Wei Chen Microsoft Research weic@microsoft.com Mingfei Zhao IIIS, Tsinghua University mingfeizhao@hotmail.com Tian Lin Tsinghua University lint0@mails.tsinghua.edu.cn Xuren

More information

Modeling Dynamic Evolution of Online Friendship Network

Modeling Dynamic Evolution of Online Friendship Network Commun. Theor. Phys. 58 (2012) 599 603 Vol. 58, No. 4, October 15, 2012 Modeling Dynamic Evolution of Online Friendship Network WU Lian-Ren ( ) 1,2, and YAN Qiang ( Ö) 1 1 School of Economics and Management,

More information

Diversity Regularization of Latent Variable Models: Theory, Algorithm and Applications

Diversity Regularization of Latent Variable Models: Theory, Algorithm and Applications Diversity Regularization of Latent Variable Models: Theory, Algorithm and Applications Pengtao Xie, Machine Learning Department, Carnegie Mellon University 1. Background Latent Variable Models (LVMs) are

More information

Mining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University

Mining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University Note to other teachers and users of these slides: We would be delighted if you found this our material useful in giving your own lectures. Feel free to use these slides verbatim, or to modify them to fit

More information

Spam ain t as Diverse as It Seems: Throttling OSN Spam with Templates Underneath

Spam ain t as Diverse as It Seems: Throttling OSN Spam with Templates Underneath Spam ain t as Diverse as It Seems: Throttling OSN Spam with Templates Underneath Hongyu Gao, Yi Yang, Kai Bu, Yan Chen, Doug Downey, Kathy Lee, Alok Choudhary Northwestern University, USA Zhejiang University,

More information

Measuring the Inspiration Rate of Topics in Bibliographic Networks

Measuring the Inspiration Rate of Topics in Bibliographic Networks Measuring the Inspiration Rate of Topics in Bibliographic Networks Livio Bioglio, Valentina Rho, and Ruggero G. Pensa Dept. of Computer Science, University of Turin, Turin, Italy {livio.bioglio,valentina.rho,ruggero.pensa}@unito.it

More information

Aditya Bhaskara CS 5968/6968, Lecture 1: Introduction and Review 12 January 2016

Aditya Bhaskara CS 5968/6968, Lecture 1: Introduction and Review 12 January 2016 Lecture 1: Introduction and Review We begin with a short introduction to the course, and logistics. We then survey some basics about approximation algorithms and probability. We also introduce some of

More information

Link Analysis. Reference: Introduction to Information Retrieval by C. Manning, P. Raghavan, H. Schutze

Link Analysis. Reference: Introduction to Information Retrieval by C. Manning, P. Raghavan, H. Schutze Link Analysis Reference: Introduction to Information Retrieval by C. Manning, P. Raghavan, H. Schutze 1 The Web as a Directed Graph Page A Anchor hyperlink Page B Assumption 1: A hyperlink between pages

More information

Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University.

Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University. Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University http://www.mmds.org #1: C4.5 Decision Tree - Classification (61 votes) #2: K-Means - Clustering

More information

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018 Lab 8: Measuring Graph Centrality - PageRank Monday, November 5 CompSci 531, Fall 2018 Outline Measuring Graph Centrality: Motivation Random Walks, Markov Chains, and Stationarity Distributions Google

More information

arxiv: v2 [cs.si] 17 Nov 2010

arxiv: v2 [cs.si] 17 Nov 2010 A Framework for Quantitative Analysis of Cascades on Networks arxiv:1011.3571v2 [cs.si] 17 Nov 2010 ABSTRACT Rumi Ghosh USC Information Sciences Institute 4676 Admiralty Way Marina del Rey, CA 90292 rumig@usc.edu

More information

Minimizing Seed Set Selection with Probabilistic Coverage Guarantee in a Social Network

Minimizing Seed Set Selection with Probabilistic Coverage Guarantee in a Social Network Minimizing Seed Set Selection with Probabilistic Coverage Guarantee in a Social Network Peng Zhang Purdue University zhan1456@purdue.edu Yajun Wang Microsoft yajunw@microsoft.com Wei Chen Microsoft weic@microsoft.com

More information

KINETICS OF COMPLEX SOCIAL CONTAGION. János Kertész Central European University. Pohang, May 27, 2016

KINETICS OF COMPLEX SOCIAL CONTAGION. János Kertész Central European University. Pohang, May 27, 2016 KINETICS OF COMPLEX SOCIAL CONTAGION János Kertész Central European University Pohang, May 27, 2016 Theory: Zhongyuan Ruan, Gerardo Iniguez, Marton Karsai, JK: Kinetics of social contagion Phys. Rev. Lett.

More information

DS504/CS586: Big Data Analytics Graph Mining II

DS504/CS586: Big Data Analytics Graph Mining II Welcome to DS504/CS586: Big Data Analytics Graph Mining II Prof. Yanhua Li Time: 6-8:50PM Thursday Location: AK233 Spring 2018 v Course Project I has been graded. Grading was based on v 1. Project report

More information

CS 277: Data Mining. Mining Web Link Structure. CS 277: Data Mining Lectures Analyzing Web Link Structure Padhraic Smyth, UC Irvine

CS 277: Data Mining. Mining Web Link Structure. CS 277: Data Mining Lectures Analyzing Web Link Structure Padhraic Smyth, UC Irvine CS 277: Data Mining Mining Web Link Structure Class Presentations In-class, Tuesday and Thursday next week 2-person teams: 6 minutes, up to 6 slides, 3 minutes/slides each person 1-person teams 4 minutes,

More information

Information Relaxation is Ultradiffusive

Information Relaxation is Ultradiffusive ABSTRACT Information Relaxation is Ultradiffusive We investigate how the overall response to a piece of information evolves and relaxes as a function of time in social networks like Reddit, Digg and Youtube.

More information

Mining Triadic Closure Patterns in Social Networks

Mining Triadic Closure Patterns in Social Networks Mining Triadic Closure Patterns in Social Networks Hong Huang, University of Goettingen Jie Tang, Tsinghua University Sen Wu, Stanford University Lu Liu, Northwestern University Xiaoming Fu, University

More information

ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties

ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 3 Centrality, Similarity, and Strength Ties Prof. James She james.she@ust.hk 1 Last lecture 2 Selected works from Tutorial

More information

Applying Latent Dirichlet Allocation to Group Discovery in Large Graphs

Applying Latent Dirichlet Allocation to Group Discovery in Large Graphs Lawrence Livermore National Laboratory Applying Latent Dirichlet Allocation to Group Discovery in Large Graphs Keith Henderson and Tina Eliassi-Rad keith@llnl.gov and eliassi@llnl.gov This work was performed

More information

BAYESIAN INFERENCE OF DIFFUSION NETWORKS WITH UNKNOWN INFECTION TIMES. Shohreh Shaghaghian and Mark Coates

BAYESIAN INFERENCE OF DIFFUSION NETWORKS WITH UNKNOWN INFECTION TIMES. Shohreh Shaghaghian and Mark Coates IEEE Statistical Signal Processing Workshop (SSP) BAYESIAN INFERENCE OF DIFFUSION NETWORKS WITH UNKNOWN INFECTION TIMES Shohreh Shaghaghian and Mark Coates Department of Electrical and Computer Engineering,

More information