CMU SCS : Multimedia Databases and Data Mining CMU SCS Must-read Material CMU SCS Must-read Material (cont d)

Size: px
Start display at page:

Download "CMU SCS : Multimedia Databases and Data Mining CMU SCS Must-read Material CMU SCS Must-read Material (cont d)"

Transcription

1 15-826: Multimedia Databases and Data Mining Lecture #26: Graph mining - patterns Christos Faloutsos Must-read Material Michalis Faloutsos, Petros Faloutsos and Christos Faloutsos, On Power-Law Relationships of the Internet Topology, SIGCOMM R. Albert, H. Jeong, and A.-L. Barabasi, Diameter of the World Wide Web Nature, 401, (1999). Reka Albert and Albert-Laszlo Barabasi Statistical mechanics of complex networks, Reviews of Modern Physics, 74, 47 (2002). Jure Leskovec, Jon Kleinberg, Christos Faloutsos Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations, KDD 2005, Chicago, IL, USA 2 Must-read Material (cont d) D. Chakrabarti and C. Faloutsos, Graph Mining: Laws, Generators and Algorithms, in ACM Computing Surveys, 3 8(1), 2006 J. Leskovec, D. Chakrabarti, J. Kleinberg, and C. Faloutsos, Realistic, Mathematically Tractable Graph Generation and Evolution, Using Kronecker Multiplication, in PKDD 2005, Porto, Portugal 3 1

2 Outline Part 1: Topology, laws and generators Part 2: PageRank, HITS and eigenvalues Part 3: Influence, communities 4 Thanks Deepayan Chakrabarti (CMU) Soumen Chakrabarti (IIT-Bombay) Michalis Faloutsos (UCR) George Siganos (UCR) 5 Introduction Internet Map [lumeta.com] Food Web [Martinez 91] Protein Interactions [genomebiology.com] Graphs are everywhere! Friendship Network [Moody 01] 6 2

3 Graph structures Physical networks Physical Internet Telephone lines Commodity distribution networks 7 Networks derived from "behavior" Telephone call patterns , Blogs, Web, Databases, XML Language processing Web of trust, epinions 8 Outline Topology & laws Generators Discussion and tools Motivating questions: 9 3

4 Motivating questions What do real graphs look like? What properties of nodes, edges are important to model? What local and global properties are important to measure? How to generate realistic graphs? 10 Given a graph: Motivating questions Are there un-natural sub -graphs? (criminals rings or terrorist cells)? How do P2P networks evolve? 11 Why should we care? Internet Map [lumeta.com] Food Web [Martinez 91] Protein Interactions [genomebiology.com] Friendship Network [Moody 01] 12 4

5 Why should we care? Models & patterns help with: A1: extrapolations: how will the Internet /Web look like next year? A2: algorithm design: what is a realistic network topology, to try a new routing protocol? to study virus/rumor propagation, and immunization? 13 Why should we care? (cont d) A3: Sampling: How to get a good sample of a network? A4: Abnormalities: is this sub-graph / sub -community / sub-network normal? (what is normal?) 14 Outline Topology & laws Static graphs Evolving graphs Generators Discussion and tools 15 5

6 Topology How does the Internet look like? Any rules? (Looks random right?) 16 Are real graphs random? random (Erdos-Renyi) graph 100 nodes, avg degree = 2 before layout after layout No obvious patterns (generated with: pajek /pajek/ ) 17 Laws and patterns Real graphs are NOT random!! Diameter in- and out- degree distributions other (surprising) patterns 18 6

7 Laws degree distributions Q: avg degree is ~2 - what is the most probable degree? count?? 2 degree 19 Laws degree distributions Q: avg degree is ~3 - what is the most probable degree? count?? count 2 degree 2 degree 20 S1.Power-law: outdegree O Frequency Exponent = slope O = Nov 97 Outdegree The plot is linear in log-log scale [FFF 99] freq = degree (-2.15) 21 7

8 S2.Power-law: rank R outdegree Exponent = slope R = R Dec 98 Rank: nodes in decreasing outdegree order The plot is a line in log-log scale 22 S3. Eigenvalues Let A be the adjacency matrix of graph The eigenvalue λ is: A v = λ v, where v some vector Eigenvalues are strongly related to graph topology A B C D 23 S3. Power-law: eigen E Eigenvalue Exponent = slope E = Dec 98 Rank of decreasing eigenvalue Eigenvalues in decreasing order (first 20) [Mihail+, 02]: R = 2 * E 24 8

9 S4. The Node Neighborhood N(h) = # of pairs of nodes within h hops 25 S4. The Node Neighborhood Q: average degree = 3 - how many neighbors should I expect within 1,2, h hops? Potential answer: 1 hop -> 3 neighbors 2 hops -> 3 * 3 h hops -> 3 h 26 S4. The Node Neighborhood Q: average degree = 3 - how many neighbors should I expect within 1,2, h hops? Potential answer: 1 hop -> 3 neighbors 2 hops -> 3 * 3 WE HAVE DUPLICATES! h hops -> 3 h 27 9

10 S4. The Node Neighborhood Q: average degree = 3 - how many neighbors should I expect within 1,2, h hops? Potential answer: 1 hop -> 3 neighbors 2 hops -> 3 * 3 avg degree: meaningless! h hops -> 3 h 28 S4. Power-law: hopplot H # of Pairs H = 4.86 # of Pairs H = 2.83 Hops Dec 98 Hops Router level 95 Pairs of nodes as a function of hops N(h)= h H 29 Observation Q: Intuition behind hop exponent? A: intrinsic=fractal dimensionality of the network... N(h) ~ h 1 N(h) ~ h

11 But: Q1: How about graphs from other domains? Q2: How about temporal evolution? 31 The Peer-to-Peer Topology [Jovanovic+] Frequency versus degree Number of adjacent peers follows a power-law 32 More Power laws Also hold for other web graphs [Barabasi+, 99], [Kumar+, 99] with additional rules (bi-partite cores follow power laws) 33 11

12 Time Evolution: rank R Domain level #days since Nov. 97 The rank exponent has not changed! [Siganos+, 03] 34 Outline Topology & laws Static graphs Evolving graphs Generators Discussion and tools 35 Any other laws? Yes! 36 12

13 Any other laws? Yes! Small diameter (~ constant!) six degrees of separation / Kevin Bacon small worlds [Watts and Strogatz] 37 Any other laws? Bow-tie, for the web [Kumar+ 99] IN, SCC, OUT, tendrils disconnected components 38 Any other laws? power-laws in communities (bi-partite cores) [Kumar+, 99] Log(count) n:1 n:3 n:2 Log(m) 2:3 core (m:n core) 39 13

14 Any other laws? Jellyfish for Internet [Tauro+ 01] core: ~clique ~5 concentric layers many 1-degree nodes 40 How about triangles? 41 Triangle Laws Real social networks have a lot of triangles 42 14

15 Triangle Laws Real social networks have a lot of triangles Friends of friends are friends Any patterns? 43 S5. Triangle Law: #1 [Tsourakakis ICDM 2008] HEP-TH ASN Epinions X-axis: # of Triangles a node participates in Y-axis: count of such nodes 1-44 S6. Triangle Law: #2 [Tsourakakis ICDM 2008] Reuters SN Epinions X-axis: degree Y-axis: mean # triangles Notice: slope ~ degree exponent (insets) 45 15

16 Triangle Law: Computations [Tsourakakis ICDM 2008] But: triangles are expensive to compute (3-way join; several approx. algos) Q: Can we do that quickly? 46 Triangle Law: Computations [Tsourakakis ICDM 2008] But: triangles are expensive to compute (3-way join; several approx. algos) Q: Can we do that quickly? A: Yes! #triangles = 1/6 Sum ( λ i 3 ) (and, because of skewness, we only need the top few eigenvalues! 47 Triangle Law: Computations [Tsourakakis ICDM 2008] x+ speed-up, (c) 2010 C. Faloutsos high accuracy 48 16

17 Outline Topology & laws Static graphs - eigenspokes Evolving graphs Generators Discussion and tools 49 EigenSpokes B. Aditya Prakash, Mukund Seshadri, Ashwin Sridharan, Sridhar Machiraju and Christos Faloutsos: EigenSpokes: Surprising Patterns and Scalable Community Chipping in Large Graphs, PAKDD 2010, Hyderabad, India, June (c) 2010 C. Faloutsos 50 EigenSpokes Eigenvectors of adjacency matrix equivalent to singular vectors (symmetric, undirected graph) (c) 2010 C. Faloutsos 51 17

18 EigenSpokes Eigenvectors of adjacency matrix equivalent to singular vectors (symmetric, undirected graph) details N N (c) 2010 C. Faloutsos 52 EigenSpokes 2 nd Principal component EE plot: u2 Scatter plot of scores of u1 vs u2 One would expect Many origin A few scattered ~randomly u1 1 st Principal component (c) 2010 C. Faloutsos 53 EigenSpokes EE plot: Scatter plot of scores of u1 vs u2 One would expect Many origin A few scattered ~randomly u2 90 o u (c) 2010 C. Faloutsos 54 18

19 EigenSpokes - pervasiveness Present in mobile social graph across time and space Patent citation graph (c) 2010 C. Faloutsos 55 EigenSpokes - explanation Near-cliques, or near -bipartite-cores, loosely connected (c) 2010 C. Faloutsos 56 EigenSpokes - explanation Near-cliques, or near -bipartite-cores, loosely connected (c) 2010 C. Faloutsos 57 19

20 EigenSpokes - explanation Near-cliques, or near -bipartite-cores, loosely connected spy plot of top 20 nodes So what? Extract nodes with high scores high connectivity Good communities (c) 2010 C. Faloutsos 58 Bipartite Communities! patents from same inventor(s) cut-and-paste bibliography! magnified bipartite community (c) 2010 C. Faloutsos 59 Outline Topology & laws Static graphs Weighted graphs Evolving graphs Generators Discussion and tools 60 20

21 How about weighted graphs? A: even more laws! 61 Observation W.1: Fortification Q: How do the weights of nodes relate to degree? (c) 2010 C. Faloutsos 62 Observation W.1: Fortification More donors, more $? $10 Reagan $5 $7 Clinton (c) 2010 C. Faloutsos 63 21

22 Observation W.1: fortification: Snapshot Power Law Weight: super-linear on in-degree exponent iw : 1.01 < iw < 1.26 More donors, even more $ $10 In-weights ($) Orgs-Candidates e.g. John Kerry, $10M received, from 1K donors $ Edges (# donors) (c) 2010 C. Faloutsos 64 Fortification: Snapshot Power Law At any time, total incoming weight of a node is proportional to in-degree with PL exponent iw : i.e < iw < 1.26, super-linear More donors, even more $ Orgs-Candidates In-weights ($) e.g. John Kerry, $10M received, from 1K donors Edges (# donors) CIKM Copyright: (c) 2010 Faloutsos, C. Faloutsos Tong (2008) 1-65 Summary of laws static, & weighted graphs Power laws for degree distributions... for eigenvalues, bi-partite cores Small diameter ( 6 degrees ) Bow-tie for web; jelly-fish for internet Power laws for triangles and for weighted graphs 66 22

23 Outline Topology & laws Static graphs Evolving graphs Generators Discussion and tools 67 Evolution of diameter? Prior analysis, on power-law-like graphs, hints that diameter ~ O(log(N)) or diameter ~ O( log(log(n))) i.e.., slowly increasing with network size Q: What is happening, in reality? 68 Evolution of diameter? Prior analysis, on power-law-like graphs, hints that diameter ~ O(log(N)) or diameter ~ O( log(log(n))) i.e.., slowly increasing with network size Q: What is happening, in reality? A: It shrinks(!!), towards a constant value 69 23

24 T1. Shrinking diameter [Leskovec+05a] diameter Citations among physics papers 2003: 29,555 papers 352,807 citations For each month M, create a graph of all citations up to month M time 70 T1. Shrinking diameter Authors & publications nodes 272 edges ,000 nodes 20,000 authors 38,000 papers 133,000 edges 71 T1. Shrinking diameter Patents & citations ,000 nodes 676,000 edges million nodes 16.5 million edges Each year is a datapoint 72 24

25 T1. Shrinking diameter Autonomous systems ,000 nodes 10,000 edges ,000 nodes 26,000 edges One graph per day diameter N 73 Temporal evolution of graphs N(t) nodes; E(t) edges at time t suppose that N(t+1) = 2 * N(t) Q: what is your guess for E(t+1) =? 2 * E(t) 74 Temporal evolution of graphs N(t) nodes; E(t) edges at time t suppose that N(t+1) = 2 * N(t) Q: what is your guess for E(t+1) =? 2 * E(t) A: over-doubled! 75 25

26 T2. Temporal evolution of graphs A: over-doubled - but obeying: E(t) ~ N(t) a for all t where 1<a<2 76 T2. Densification Power Law ArXiv: Physics papers and their citations E(t) 1.69 N(t) 77 T2. Densification Power Law ArXiv: Physics papers and their citations E(t) tree N(t) 78 26

27 T2. Densification Power Law ArXiv: Physics papers and their citations E(t) clique N(t) 79 T2. Densification Power Law U.S. Patents, citing each other E(t) 1.66 N(t) 80 T2. Densification Power Law Autonomous Systems E(t) 1.18 N(t) 81 27

28 T2. Densification Power Law ArXiv: authors & papers E(t) 1.15 N(t) 82 More on Time-evolving graphs M. McGlohon, L. Akoglu, and C. Faloutsos Weighted Graphs and Disconnected Components: Patterns and a Generator. SIG-KDD T3: Gelling Point Q1: How does the GCC emerge? 84 28

29 T3: Gelling Point Most real graphs display a gelling point After gelling point, they exhibit typical behavior. This is marked by a spike in IMDB diameter. Diameter t=1914 Time 85 T4: NLCC behavior Q2: How do NLCC s emerge and join with the GCC? (``NLCC = non-largest conn. components) Do they continue to grow in size? or do they shrink? or stabilize? 86 T4: NLCC behavior After the gelling point, the GCC takes off, but NLCC s remain ~constant (actually, oscillate). IMDB CC size 87 29

30 How do new edges appear? [LBKT 08] Microscopic Evolution of Social Networks Jure Leskovec, Lars Backstrom, Ravi Kumar, Andrew Tomkins. (ACM KDD), How do edges appear in time? [LBKT 08] Edge gap δ(d): inter-arrival time between d th and d+1 st edge What is the PDF of δ? Poisson? δ 89 T5: How do edges appear in time? [LBKT 08] LinkedIn Edge gap δ(d): inter-arrival time between d th and d+1 st edge CIKM Copyright: (c) 2010 Faloutsos, C. Faloutsos Tong (2008)

31 # in links T6 : popularity over time lag: days after post Post popularity + lag (c) 2010 C. Faloutsos 91 # in links (log) T6 : popularity over time days after post (log) Post popularity drops-off exponentially? POWER LAW! Exponent? (c) 2010 C. Faloutsos 92 T6 : popularity over time # in links (log) -1.6 Post popularity drops-off exponentially? POWER LAW! Exponent? -1.6 close to -1.5: Barabasi s stack model and like the zero-crossings of a random walk days after post (log) (c) 2010 C. Faloutsos 93 31

32 Summary of laws Power laws for degree distributions... for eigenvalues, bi-partite cores Small & shrinking diameter ( 6 degrees ) Bow-tie for web; jelly-fish for internet ``Densification Power Law, over time Triangles; weights Oscillating NLCC Power-law inter-arrival times 94 Outline Topology & laws Generators Erdos Renyi Degree-based Process-based Recursive generators Discussion and tools 95 Generators How to generate random, realistic graphs? Erdos-Renyi model: beautiful, but unrealistic degree-based generators process-based generators recursive/self-similar generators 96 32

33 random graph 100 nodes, avg degree = 2 Fascinating properties (phase transition) But: unrealistic (Poisson degree distribution!= power law) Erdos-Renyi 97 E-R model & Phase transition Pc vary avg degree D watch Pc = 1 Prob( there is a giant connected component) How do you expect it to be? 0?? D 98 E-R model & Phase transition Pc vary avg degree D watch Pc = 1 Prob( there is a giant connected component) How do you expect it to be? 0 N->infty D 0 N=10^3 D 99 33

34 Degree-based Figure out the degree distribution (eg., Zipf ) Assign degrees to nodes Put edges, so that they match the original degree distribution 100 Process-based Barabasi; Barabasi-Albert: Preferential attachment -> power-law tails! rich get richer [Kumar+]: preferential attachment + mimick Create communities 101 Process-based (cont d) [Fabrikant+, 02]: H.O.T.: connect to closest, high connectivity neighbor [Pennock+, 02]: Winner does NOT take all

35 Outline Topology & laws Generators Erdos Renyi Degree-based Process-based Recursive generators Discussion and tools 103 Recursive generators (RMAT [Chakrabarti+, 04]) Kronecker product 104 Wish list for a generator: Power-law-tail in- and out-degrees Power-law-tail scree plots shrinking/constant diameter Densification Power Law communities-within-communities

36 Graph Patterns Power Laws Effective Diameter Count vs Indegree Count vs Outdegree Hop-plot Eigenvalue vs Rank Network values vs Rank Count vs edge-stress 106 Wish list for a generator: Power-law-tail in- and out-degrees Power-law-tail scree plots shrinking/constant diameter Densification Power Law communities-within-communities Q: how to achieve all of them? 107 Wish list for a generator: Power-law-tail in- and out-degrees Power-law-tail scree plots shrinking/constant diameter Densification Power Law communities-within-communities Q: how to achieve all of them? A: Kronecker matrix product [Leskovec+05b]

37 Kronecker product 109 Kronecker product 110 Kronecker product N N*N N**

38 Kronecker Product Communities within communities 112 Properties of Kronecker graphs: Power-law-tail in- and out-degrees Power-law-tail scree plots constant diameter perfect Densification Power Law communities-within-communities 113 Properties of Kronecker graphs: Power-law-tail in- and out-degrees Power-law-tail scree plots constant diameter perfect Densification Power Law communities-within-communities and we can prove all of the above (first and only generator that does that)

39 Kronecker - patents Degree Scree Diameter D.P.L. 115 Outline Topology & laws Generators Discussion - tools 116 Power laws Q1: Why so many? A1:

40 Power laws Q1: Why so many? A1: self-similarity; rich-get-richer, etc we saw Newman s paper Can you recall other settings with power laws? 118 log(freq) a A famous power law: Zipf s law the Bible - rank vs frequency (log-log) log(rank) 119 Power laws, cont ed web hit counts [Huberman] Click-stream data [Montgomery+01]

41 u-id s Click-stream data url s Web Site Traffic log(count) Zipf yahoo log(freq) log(count) super-surfer log(freq) 121 Conclusions - Laws Laws and patterns: Power laws for degrees, eigenvalues, communities /cores Small diameter and shrinking diameter Bow-tie; jelly-fish densification power law Triangles; weights Oscillating NLCCs Power-law edge-inter-arrival times 122 Conclusions - Generators Preferential attachment (Barabasi) Variations Recursion Kronecker graphs

42 Conclusions Power laws rank/frequency plots ~ log-log NCDF log-log PDF Self-similarity / recursion / fractals correlation integral = hop-plot 124 Resources Generators: Kronecker (christos@cs.cmu.edu) BRITE INET: Other resources Visualization - graph algo s: Graphviz: pajek: /networks/pajek/ Kevin Bacon web site: /

43 References [Aiello+, '00] William Aiello, Fan R. K. Chung, Linyuan Lu: A random graph model for massive graphs. STOC 2000: [Albert+] Reka Albert, Hawoong Jeong, and Albert -Laszlo Barabasi: Diameter of the World Wide Web, Nature (1999) [Barabasi, '03] Albert-Laszlo Barabasi Linked: How Everything Is Connected to Everything Else and What It Means (Plume, 2003) 127 References, cont d [Barabasi+, '99] Albert-Laszlo Barabasi and Reka Albert. Emergence of scaling in random networks. Science, 286: , 1999 [Broder+, '00] Andrei Broder, Ravi Kumar, Farzin Maghoul, Prabhakar Raghavan, Sridhar Rajagopalan, Raymie Stata, Andrew Tomkins, and Janet Wiener. Graph structure in the web, WWW, References, cont d [Chakrabarti+, 04] RMAT: A recursive graph generator, D. Chakrabarti, Y. Zhan, C. Faloutsos, SIAM-DM 2004 [Dill+, '01] Stephen Dill, Ravi Kumar, Kevin S. McCurley, Sridhar Rajagopalan, D. Sivakumar, Andrew Tomkins: Self-similarity in the Web. VLDB 2001:

44 References, cont d [Fabrikant+, '02] A. Fabrikant, E. Koutsoupias, and C.H. Papadimitriou. Heuristically Optimized Trade-offs: A New Paradigm for Power Laws in the Internet. ICALP, Malaga, Spain, July 2002 [FFF, 99] M. Faloutsos, P. Faloutsos, and C. Faloutsos, "On power-law relationships of the Internet topology," in SIGCOMM, References, cont d [Jovanovic+, '01] M. Jovanovic, F.S. Annexstein, and K.A. Berman. Modeling Peer-to-Peer Network Topologies through "Small-World" Models and Power Laws. In TELFOR, Belgrade, Yugoslavia, November, 2001 [Kumar+ '99] Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, Andrew Tomkins: Extracting Large-Scale Knowledge Bases from the Web. VLDB 1999: References, cont d [Leland+, '94] W. E. Leland, M.S. Taqqu, W. Willinger, D.V. Wilson, On the Self-Similar Nature of Ethernet Traffic, IEEE Transactions on Networking, 2, 1, pp 1-15, Feb [Leskovec+05a] Jure Leskovec, Jon Kleinberg, Christos Faloutsos, Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations (KDD 2005)

45 References, cont d [Leskovec+05b] Jure Leskovec, Deepayan Chakrabarti, Jon Kleinberg, Christos Faloutsos Realistic, Mathematically Tractable Graph Generation and Evolution, Using Kronecker Multiplication (ECML/PKDD 2005), Porto, Portugal, References, cont d [Leskovec+07] Jure Leskovec and Christos Faloutsos, Scalable Modeling of Real Graphs using Kronecker Multiplication, ICML [Mihail+, '02] Milena Mihail, Christos H. Papadimitriou: On the Eigenvalue Power Law. RANDOM 2002: References, cont d [Milgram '67] Stanley Milgram: The Small World Problem, Psychology Today 1(1), (1967) [Montgomery+, 01] Alan L. Montgomery, Christos Faloutsos: Identifying Web Browsing Trends and Patterns. IEEE Computer 34(7): (2001)

46 References, cont d [Palmer+, 01] Chris Palmer, Georgos Siganos, Michalis Faloutsos, Christos Faloutsos and Phil Gibbons The connectivity and fault-tolerance of the Internet topology (NRDM 2001), Santa Barbara, CA, May 25, 2001 [Pennock+, '02] David M. Pennock, Gary William Flake, Steve Lawrence, Eric J. Glover, C. Lee Giles: Winners don't take all: Characterizing the competition for links on the web Proc. Natl. Acad. Sci. USA 99(8): (2002) 136 References, cont d [Schroeder, 91] Manfred Schroeder Fractals, Chaos, Power Laws: Minutes from an Infinite Paradise W H Freeman & Co., References, cont d [Siganos+, '03] G. Siganos, M. Faloutsos, P. Faloutsos, C. Faloutsos Power-Laws and the AS -level Internet Topology, Transactions on Networking, August [Watts+ Strogatz, '98] D. J. Watts and S. H. Strogatz Collective dynamics of 'small-world' networks, Nature, 393: (1998) [Watts, '03] Duncan J. Watts Six Degrees: The Science of a Connected Age W.W. Norton & Company; (February 2003)

Finding patterns in large, real networks

Finding patterns in large, real networks Thanks to Finding patterns in large, real networks Christos Faloutsos CMU www.cs.cmu.edu/~christos/talks/ucla-05 Deepayan Chakrabarti (CMU) Michalis Faloutsos (UCR) George Siganos (UCR) UCLA-IPAM 05 (c)

More information

Must-read material (1 of 2) : Multimedia Databases and Data Mining. Must-read material (2 of 2) Main outline

Must-read material (1 of 2) : Multimedia Databases and Data Mining. Must-read material (2 of 2) Main outline Must-read material (1 of 2) 15-826: Multimedia Databases and Data Mining Lecture #29: Graph mining - Generators & tools Fully Automatic Cross-Associations, by D. Chakrabarti, S. Papadimitriou, D. Modha

More information

Data mining in large graphs

Data mining in large graphs Data mining in large graphs Christos Faloutsos University www.cs.cmu.edu/~christos ALLADIN 2003 C. Faloutsos 1 Outline Introduction - motivation Patterns & Power laws Scalability & Fast algorithms Fractals,

More information

Realistic, Mathematically Tractable Graph Generation and Evolution, Using Kronecker Multiplication

Realistic, Mathematically Tractable Graph Generation and Evolution, Using Kronecker Multiplication Realistic, Mathematically Tractable Graph Generation and Evolution, Using Kronecker Multiplication Jurij Leskovec 1, Deepayan Chakrabarti 1, Jon Kleinberg 2, and Christos Faloutsos 1 1 School of Computer

More information

CMU SCS. Large Graph Mining. Christos Faloutsos CMU

CMU SCS. Large Graph Mining. Christos Faloutsos CMU Large Graph Mining Christos Faloutsos CMU Thank you! Hillol Kargupta NGDM 2007 C. Faloutsos 2 Outline Problem definition / Motivation Static & dynamic laws; generators Tools: CenterPiece graphs; fraud

More information

RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs

RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Leman Akoglu Mary McGlohon Christos Faloutsos Carnegie Mellon University, School of Computer Science {lakoglu, mmcgloho, christos}@cs.cmu.edu

More information

CMU SCS Mining Billion-node Graphs: Patterns, Generators and Tools

CMU SCS Mining Billion-node Graphs: Patterns, Generators and Tools Mining Billion-node Graphs: Patterns, Generators and Tools Christos Faloutsos CMU (on sabbatical at google) Tamara Kolda Thank you! C. Faloutsos (CMU + Google) 2 Our goal: Open source system for mining

More information

CS224W: Analysis of Networks Jure Leskovec, Stanford University

CS224W: Analysis of Networks Jure Leskovec, Stanford University CS224W: Analysis of Networks Jure Leskovec, Stanford University http://cs224w.stanford.edu 10/30/17 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 2

More information

RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs

RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs Submitted for Blind Review Abstract How do real, weighted graphs change over time? What patterns, if any, do they obey? Earlier studies

More information

Motivation Data mining: ~ find patterns (rules, outliers) Problem#1: How do real graphs look like? Problem#2: How do they evolve? Problem#3: How to ge

Motivation Data mining: ~ find patterns (rules, outliers) Problem#1: How do real graphs look like? Problem#2: How do they evolve? Problem#3: How to ge Graph Mining: Laws, Generators and Tools Christos Faloutsos CMU Thank you! Sharad Mehrotra UCI 2007 C. Faloutsos 2 Outline Problem definition / Motivation Static & dynamic laws; generators Tools: CenterPiece

More information

6.207/14.15: Networks Lecture 12: Generalized Random Graphs

6.207/14.15: Networks Lecture 12: Generalized Random Graphs 6.207/14.15: Networks Lecture 12: Generalized Random Graphs 1 Outline Small-world model Growing random networks Power-law degree distributions: Rich-Get-Richer effects Models: Uniform attachment model

More information

arxiv: v2 [stat.ml] 21 Aug 2009

arxiv: v2 [stat.ml] 21 Aug 2009 KRONECKER GRAPHS: AN APPROACH TO MODELING NETWORKS Kronecker graphs: An Approach to Modeling Networks arxiv:0812.4905v2 [stat.ml] 21 Aug 2009 Jure Leskovec Computer Science Department, Stanford University

More information

Networks as vectors of their motif frequencies and 2-norm distance as a measure of similarity

Networks as vectors of their motif frequencies and 2-norm distance as a measure of similarity Networks as vectors of their motif frequencies and 2-norm distance as a measure of similarity CS322 Project Writeup Semih Salihoglu Stanford University 353 Serra Street Stanford, CA semih@stanford.edu

More information

Deterministic Decentralized Search in Random Graphs

Deterministic Decentralized Search in Random Graphs Deterministic Decentralized Search in Random Graphs Esteban Arcaute 1,, Ning Chen 2,, Ravi Kumar 3, David Liben-Nowell 4,, Mohammad Mahdian 3, Hamid Nazerzadeh 1,, and Ying Xu 1, 1 Stanford University.

More information

Self Similar (Scale Free, Power Law) Networks (I)

Self Similar (Scale Free, Power Law) Networks (I) Self Similar (Scale Free, Power Law) Networks (I) E6083: lecture 4 Prof. Predrag R. Jelenković Dept. of Electrical Engineering Columbia University, NY 10027, USA {predrag}@ee.columbia.edu February 7, 2007

More information

arxiv:cond-mat/ v1 [cond-mat.dis-nn] 4 May 2000

arxiv:cond-mat/ v1 [cond-mat.dis-nn] 4 May 2000 Topology of evolving networks: local events and universality arxiv:cond-mat/0005085v1 [cond-mat.dis-nn] 4 May 2000 Réka Albert and Albert-László Barabási Department of Physics, University of Notre-Dame,

More information

preferential attachment

preferential attachment preferential attachment introduction to network analysis (ina) Lovro Šubelj University of Ljubljana spring 2018/19 preferential attachment graph models are ensembles of random graphs generative models

More information

Chapter 2 STATISTICAL PROPERTIES OF SOCIAL NETWORKS. Mary McGlohon. Leman Akoglu. Christos Faloutsos

Chapter 2 STATISTICAL PROPERTIES OF SOCIAL NETWORKS. Mary McGlohon. Leman Akoglu. Christos Faloutsos Chapter 2 STATISTICAL PROPERTIES OF SOCIAL NETWORKS Mary McGlohon School of Computer Science Carnegie Mellon University mmcgloho@cs.cmu.edu Leman Akoglu School of Computer Science Carnegie Mellon University

More information

T , Lecture 6 Properties and stochastic models of real-world networks

T , Lecture 6 Properties and stochastic models of real-world networks T-79.7003, Lecture 6 Properties and stochastic models of real-world networks Charalampos E. Tsourakakis 1 1 Aalto University November 1st, 2013 Properties of real-world networks Properties of real-world

More information

Weighted graphs and disconnected components

Weighted graphs and disconnected components Weighted graphs and disconnected components Patterns and a Generator Mary McGlohon Carnegie Mellon University School of Computer Science Forbes Ave. Pittsburgh, Penn. USA mmcgloho@cs.cmu.edu Leman Akoglu

More information

Weighted Graphs and Disconnected Components

Weighted Graphs and Disconnected Components Weighted Graphs and Disconnected Components Patterns and a Generator Mary McGlohon Carnegie Mellon University School of Computer Science Forbes Ave. Pittsburgh, Penn. USA mmcgloho@cs.cmu.edu Leman Akoglu

More information

1 Complex Networks - A Brief Overview

1 Complex Networks - A Brief Overview Power-law Degree Distributions 1 Complex Networks - A Brief Overview Complex networks occur in many social, technological and scientific settings. Examples of complex networks include World Wide Web, Internet,

More information

Dynamics of Real-world Networks

Dynamics of Real-world Networks Dynamics of Real-world Networks Thesis proposal Jurij Leskovec Machine Learning Department Carnegie Mellon University May 2, 2007 Thesis committee: Christos Faloutsos, CMU Avrim Blum, CMU John Lafferty,

More information

arxiv: v1 [physics.soc-ph] 15 Dec 2009

arxiv: v1 [physics.soc-ph] 15 Dec 2009 Power laws of the in-degree and out-degree distributions of complex networks arxiv:0912.2793v1 [physics.soc-ph] 15 Dec 2009 Shinji Tanimoto Department of Mathematics, Kochi Joshi University, Kochi 780-8515,

More information

arxiv:physics/ v3 [physics.soc-ph] 28 Jan 2007

arxiv:physics/ v3 [physics.soc-ph] 28 Jan 2007 arxiv:physics/0603229v3 [physics.soc-ph] 28 Jan 2007 Graph Evolution: Densification and Shrinking Diameters Jure Leskovec School of Computer Science, Carnegie Mellon University, Pittsburgh, PA Jon Kleinberg

More information

6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search

6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search 6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search Daron Acemoglu and Asu Ozdaglar MIT September 30, 2009 1 Networks: Lecture 7 Outline Navigation (or decentralized search)

More information

Complex-Network Modelling and Inference

Complex-Network Modelling and Inference Complex-Network Modelling and Inference Lecture 12: Random Graphs: preferential-attachment models Matthew Roughan http://www.maths.adelaide.edu.au/matthew.roughan/notes/

More information

A Scalable Null Model for Directed Graphs Matching All Degree Distributions: In, Out, and Reciprocal

A Scalable Null Model for Directed Graphs Matching All Degree Distributions: In, Out, and Reciprocal A Scalable Null Model for Directed Graphs Matching All Degree Distributions: In, Out, and Reciprocal Nurcan Durak, Tamara G. Kolda, Ali Pinar, and C. Seshadhri Sandia National Laboratories Livermore, CA

More information

A Scalable Null Model for Directed Graphs Matching All Degree Distributions: In, Out, and Reciprocal

A Scalable Null Model for Directed Graphs Matching All Degree Distributions: In, Out, and Reciprocal A Scalable Null Model for Directed Graphs Matching All Degree Distributions: In, Out, and Reciprocal Nurcan Durak, Tamara G. Kolda, Ali Pinar, and C. Seshadhri Sandia National Laboratories Livermore, CA

More information

Analysis & Generative Model for Trust Networks

Analysis & Generative Model for Trust Networks Analysis & Generative Model for Trust Networks Pranav Dandekar Management Science & Engineering Stanford University Stanford, CA 94305 ppd@stanford.edu ABSTRACT Trust Networks are a specific kind of social

More information

MAE 298, Lecture 4 April 9, Exploring network robustness

MAE 298, Lecture 4 April 9, Exploring network robustness MAE 298, Lecture 4 April 9, 2006 Switzerland Germany Spain Italy Japan Netherlands Russian Federation Sweden UK USA Unknown Exploring network robustness What is a power law? (Also called a Pareto Distribution

More information

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Web Search: How to Organize the Web? Ranking Nodes on Graphs Hubs and Authorities PageRank How to Solve PageRank

More information

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Web Search: How to Organize the Web? Ranking Nodes on Graphs Hubs and Authorities PageRank How to Solve PageRank

More information

A stochastic model for the link analysis of the Web

A stochastic model for the link analysis of the Web A stochastic model for the link analysis of the Web Paola Favati, IIT-CNR, Pisa. Grazia Lotti, University of Parma. Ornella Menchi and Francesco Romani, University of Pisa Three examples of link distribution

More information

Spectral Analysis of Directed Complex Networks. Tetsuro Murai

Spectral Analysis of Directed Complex Networks. Tetsuro Murai MASTER THESIS Spectral Analysis of Directed Complex Networks Tetsuro Murai Department of Physics, Graduate School of Science and Engineering, Aoyama Gakuin University Supervisors: Naomichi Hatano and Kenn

More information

Outline : Multimedia Databases and Data Mining. Indexing - similarity search. Indexing - similarity search. C.

Outline : Multimedia Databases and Data Mining. Indexing - similarity search. Indexing - similarity search. C. Outline 15-826: Multimedia Databases and Data Mining Lecture #30: Conclusions C. Faloutsos Goal: Find similar / interesting things Intro to DB Indexing - similarity search Points Text Time sequences; images

More information

The Diameter of Random Massive Graphs

The Diameter of Random Massive Graphs The Diameter of Random Massive Graphs Linyuan Lu October 30, 2000 Abstract Many massive graphs (such as the WWW graph and Call graphs) share certain universal characteristics which can be described by

More information

LINK ANALYSIS. Dr. Gjergji Kasneci Introduction to Information Retrieval WS

LINK ANALYSIS. Dr. Gjergji Kasneci Introduction to Information Retrieval WS LINK ANALYSIS Dr. Gjergji Kasneci Introduction to Information Retrieval WS 2012-13 1 Outline Intro Basics of probability and information theory Retrieval models Retrieval evaluation Link analysis Models

More information

Eigenvalues of Random Power Law Graphs (DRAFT)

Eigenvalues of Random Power Law Graphs (DRAFT) Eigenvalues of Random Power Law Graphs (DRAFT) Fan Chung Linyuan Lu Van Vu January 3, 2003 Abstract Many graphs arising in various information networks exhibit the power law behavior the number of vertices

More information

Mini course on Complex Networks

Mini course on Complex Networks Mini course on Complex Networks Massimo Ostilli 1 1 UFSC, Florianopolis, Brazil September 2017 Dep. de Fisica Organization of The Mini Course Day 1: Basic Topology of Equilibrium Networks Day 2: Percolation

More information

. p.1. Mathematical Models of the WWW and related networks. Alan Frieze

. p.1. Mathematical Models of the WWW and related networks. Alan Frieze . p.1 Mathematical Models of the WWW and related networks Alan Frieze The WWW is an example of a large real-world network.. p.2 The WWW is an example of a large real-world network. It grows unpredictably

More information

Deterministic scale-free networks

Deterministic scale-free networks Physica A 299 (2001) 559 564 www.elsevier.com/locate/physa Deterministic scale-free networks Albert-Laszlo Barabasi a;, Erzsebet Ravasz a, Tamas Vicsek b a Department of Physics, College of Science, University

More information

Models of Communication Dynamics for Simulation of Information Diffusion

Models of Communication Dynamics for Simulation of Information Diffusion Models of Communication Dynamics for Simulation of Information Diffusion Konstantin Mertsalov, Malik Magdon-Ismail, Mark Goldberg Rensselaer Polytechnic Institute Department of Computer Science 11 8th

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu How to organize/navigate it? First try: Human curated Web directories Yahoo, DMOZ, LookSmart

More information

MAE 298, Lecture 8 Feb 4, Web search and decentralized search on small-worlds

MAE 298, Lecture 8 Feb 4, Web search and decentralized search on small-worlds MAE 298, Lecture 8 Feb 4, 2008 Web search and decentralized search on small-worlds Search for information Assume some resource of interest is stored at the vertices of a network: Web pages Files in a file-sharing

More information

Networks as a tool for Complex systems

Networks as a tool for Complex systems Complex Networs Networ is a structure of N nodes and 2M lins (or M edges) Called also graph in Mathematics Many examples of networs Internet: nodes represent computers lins the connecting cables Social

More information

Link Analysis. Leonid E. Zhukov

Link Analysis. Leonid E. Zhukov Link Analysis Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Structural Analysis and Visualization

More information

Chaos, Complexity, and Inference (36-462)

Chaos, Complexity, and Inference (36-462) Chaos, Complexity, and Inference (36-462) Lecture 21 Cosma Shalizi 3 April 2008 Models of Networks, with Origin Myths Erdős-Rényi Encore Erdős-Rényi with Node Types Watts-Strogatz Small World Graphs Exponential-Family

More information

Faloutsos, Tong ICDE, 2009

Faloutsos, Tong ICDE, 2009 Large Graph Mining: Patterns, Tools and Case Studies Christos Faloutsos Hanghang Tong CMU Copyright: Faloutsos, Tong (29) 2-1 Outline Part 1: Patterns Part 2: Matrix and Tensor Tools Part 3: Proximity

More information

Overview of Network Theory

Overview of Network Theory Overview of Network Theory MAE 298, Spring 2009, Lecture 1 Prof. Raissa D Souza University of California, Davis Example social networks (Immunology; viral marketing; aliances/policy) M. E. J. Newman The

More information

Modeling of Growing Networks with Directional Attachment and Communities

Modeling of Growing Networks with Directional Attachment and Communities Modeling of Growing Networks with Directional Attachment and Communities Masahiro KIMURA, Kazumi SAITO, Naonori UEDA NTT Communication Science Laboratories 2-4 Hikaridai, Seika-cho, Kyoto 619-0237, Japan

More information

arxiv:cond-mat/ v1 3 Sep 2004

arxiv:cond-mat/ v1 3 Sep 2004 Effects of Community Structure on Search and Ranking in Information Networks Huafeng Xie 1,3, Koon-Kiu Yan 2,3, Sergei Maslov 3 1 New Media Lab, The Graduate Center, CUNY New York, NY 10016, USA 2 Department

More information

An Analysis on Link Structure Evolution Pattern of Web Communities

An Analysis on Link Structure Evolution Pattern of Web Communities DEWS2005 5-C-01 153-8505 4-6-1 E-mail: {imafuji,kitsure}@tkl.iis.u-tokyo.ac.jp 1999 2002 HITS An Analysis on Link Structure Evolution Pattern of Web Communities Noriko IMAFUJI and Masaru KITSUREGAWA Institute

More information

Social Networks- Stanley Milgram (1967)

Social Networks- Stanley Milgram (1967) Complex Networs Networ is a structure of N nodes and 2M lins (or M edges) Called also graph in Mathematics Many examples of networs Internet: nodes represent computers lins the connecting cables Social

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Lecture #9: Link Analysis Seoul National University 1 In This Lecture Motivation for link analysis Pagerank: an important graph ranking algorithm Flow and random walk formulation

More information

Chaos, Complexity, and Inference (36-462)

Chaos, Complexity, and Inference (36-462) Chaos, Complexity, and Inference (36-462) Lecture 21: More Networks: Models and Origin Myths Cosma Shalizi 31 March 2009 New Assignment: Implement Butterfly Mode in R Real Agenda: Models of Networks, with

More information

CS224W: Social and Information Network Analysis

CS224W: Social and Information Network Analysis CS224W: Social and Information Network Analysis Reaction Paper Adithya Rao, Gautam Kumar Parai, Sandeep Sripada Keywords: Self-similar networks, fractality, scale invariance, modularity, Kronecker graphs.

More information

Complex networks: an introduction

Complex networks: an introduction Alain Barrat Complex networks: an introduction CPT, Marseille, France ISI, Turin, Italy http://www.cpt.univ-mrs.fr/~barrat http://cxnets.googlepages.com Plan of the lecture I. INTRODUCTION II. I. Networks:

More information

Evolving network with different edges

Evolving network with different edges Evolving network with different edges Jie Sun, 1,2 Yizhi Ge, 1,3 and Sheng Li 1, * 1 Department of Physics, Shanghai Jiao Tong University, Shanghai, China 2 Department of Mathematics and Computer Science,

More information

Competition and multiscaling in evolving networks

Competition and multiscaling in evolving networks EUROPHYSICS LETTERS 5 May 2 Europhys. Lett., 54 (4), pp. 436 442 (2) Competition and multiscaling in evolving networs G. Bianconi and A.-L. Barabási,2 Department of Physics, University of Notre Dame -

More information

CS345a: Data Mining Jure Leskovec and Anand Rajaraman Stanford University

CS345a: Data Mining Jure Leskovec and Anand Rajaraman Stanford University CS345a: Data Mining Jure Leskovec and Anand Rajaraman Stanford University TheFind.com Large set of products (~6GB compressed) For each product A=ributes Related products Craigslist About 3 weeks of data

More information

Network Science (overview, part 1)

Network Science (overview, part 1) Network Science (overview, part 1) Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California rgera@nps.edu Excellence Through Knowledge Overview Current research Section 1:

More information

Directed Scale-Free Graphs

Directed Scale-Free Graphs Directed Scale-Free Graphs Béla Bollobás Christian Borgs Jennifer Chayes Oliver Riordan Abstract We introduce a model for directed scale-free graphs that grow with preferential attachment depending in

More information

Data Mining and Analysis: Fundamental Concepts and Algorithms

Data Mining and Analysis: Fundamental Concepts and Algorithms Data Mining and Analysis: Fundamental Concepts and Algorithms dataminingbook.info Mohammed J. Zaki 1 Wagner Meira Jr. 2 1 Department of Computer Science Rensselaer Polytechnic Institute, Troy, NY, USA

More information

Heavy Tails: The Origins and Implications for Large Scale Biological & Information Systems

Heavy Tails: The Origins and Implications for Large Scale Biological & Information Systems Heavy Tails: The Origins and Implications for Large Scale Biological & Information Systems Predrag R. Jelenković Dept. of Electrical Engineering Columbia University, NY 10027, USA {predrag}@ee.columbia.edu

More information

Link Prediction. Eman Badr Mohammed Saquib Akmal Khan

Link Prediction. Eman Badr Mohammed Saquib Akmal Khan Link Prediction Eman Badr Mohammed Saquib Akmal Khan 11-06-2013 Link Prediction Which pair of nodes should be connected? Applications Facebook friend suggestion Recommendation systems Monitoring and controlling

More information

Network models: dynamical growth and small world

Network models: dynamical growth and small world Network models: dynamical growth and small world Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics

More information

1 The preferential attachment mechanism

1 The preferential attachment mechanism Inference, Models and Simulation for Complex Systems CSCI 7-1 Lecture 14 2 October 211 Prof. Aaron Clauset 1 The preferential attachment mechanism This mechanism goes by many other names and has been reinvented

More information

networks in molecular biology Wolfgang Huber

networks in molecular biology Wolfgang Huber networks in molecular biology Wolfgang Huber networks in molecular biology Regulatory networks: components = gene products interactions = regulation of transcription, translation, phosphorylation... Metabolic

More information

Slides based on those in:

Slides based on those in: Spyros Kontogiannis & Christos Zaroliagis Slides based on those in: http://www.mmds.org High dim. data Graph data Infinite data Machine learning Apps Locality sensitive hashing PageRank, SimRank Filtering

More information

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS DATA MINING LECTURE 3 Link Analysis Ranking PageRank -- Random walks HITS How to organize the web First try: Manually curated Web Directories How to organize the web Second try: Web Search Information

More information

Adventures in random graphs: Models, structures and algorithms

Adventures in random graphs: Models, structures and algorithms BCAM January 2011 1 Adventures in random graphs: Models, structures and algorithms Armand M. Makowski ECE & ISR/HyNet University of Maryland at College Park armand@isr.umd.edu BCAM January 2011 2 Complex

More information

Must-read Material : Multimedia Databases and Data Mining. Indexing - Detailed outline. Outline. Faloutsos

Must-read Material : Multimedia Databases and Data Mining. Indexing - Detailed outline. Outline. Faloutsos Must-read Material 15-826: Multimedia Databases and Data Mining Tamara G. Kolda and Brett W. Bader. Tensor decompositions and applications. Technical Report SAND2007-6702, Sandia National Laboratories,

More information

Groups of vertices and Core-periphery structure. By: Ralucca Gera, Applied math department, Naval Postgraduate School Monterey, CA, USA

Groups of vertices and Core-periphery structure. By: Ralucca Gera, Applied math department, Naval Postgraduate School Monterey, CA, USA Groups of vertices and Core-periphery structure By: Ralucca Gera, Applied math department, Naval Postgraduate School Monterey, CA, USA Mostly observed real networks have: Why? Heavy tail (powerlaw most

More information

Greedy Search in Social Networks

Greedy Search in Social Networks Greedy Search in Social Networks David Liben-Nowell Carleton College dlibenno@carleton.edu Joint work with Ravi Kumar, Jasmine Novak, Prabhakar Raghavan, and Andrew Tomkins. IPAM, Los Angeles 8 May 2007

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University.

CS246: Mining Massive Datasets Jure Leskovec, Stanford University. CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu What is the structure of the Web? How is it organized? 2/7/2011 Jure Leskovec, Stanford C246: Mining Massive

More information

Shlomo Havlin } Anomalous Transport in Scale-free Networks, López, et al,prl (2005) Bar-Ilan University. Reuven Cohen Tomer Kalisky Shay Carmi

Shlomo Havlin } Anomalous Transport in Scale-free Networks, López, et al,prl (2005) Bar-Ilan University. Reuven Cohen Tomer Kalisky Shay Carmi Anomalous Transport in Complex Networs Reuven Cohen Tomer Kalisy Shay Carmi Edoardo Lopez Gene Stanley Shlomo Havlin } } Bar-Ilan University Boston University Anomalous Transport in Scale-free Networs,

More information

Communities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices

Communities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices Communities Via Laplacian Matrices Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices The Laplacian Approach As with betweenness approach, we want to divide a social graph into

More information

A spectral algorithm for computing social balance

A spectral algorithm for computing social balance A spectral algorithm for computing social balance Evimaria Terzi and Marco Winkler Abstract. We consider social networks in which links are associated with a sign; a positive (negative) sign indicates

More information

Alan Frieze Charalampos (Babis) E. Tsourakakis WAW June 12 WAW '12 1

Alan Frieze Charalampos (Babis) E. Tsourakakis WAW June 12 WAW '12 1 Alan Frieze af1p@random.math.cmu.edu Charalampos (Babis) E. Tsourakakis ctsourak@math.cmu.edu WAW 2012 22 June 12 WAW '12 1 Introduction Degree Distribution Diameter Highest Degrees Eigenvalues Open Problems

More information

Long Term Evolution of Networks CS224W Final Report

Long Term Evolution of Networks CS224W Final Report Long Term Evolution of Networks CS224W Final Report Jave Kane (SUNET ID javekane), Conrad Roche (SUNET ID conradr) Nov. 13, 2014 Introduction and Motivation Many studies in social networking and computer

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data Instructor: Yizhou Sun yzsun@ccs.neu.edu March 16, 2016 Methods to Learn Classification Clustering Frequent Pattern Mining Matrix Data Decision

More information

Complex Network Measurements: Estimating the Relevance of Observed Properties

Complex Network Measurements: Estimating the Relevance of Observed Properties Complex Network Measurements: Estimating the Relevance of Observed Properties Matthieu Latapy and Clémence Magnien LIP6 CNRS and UPMC avenue du président Kennedy, 756 Paris, France (Matthieu.Latapy,Clemence.Magnien)@lip6.fr

More information

Analyzing Evolution of Web Communities using a Series of Japanese Web Archives

Analyzing Evolution of Web Communities using a Series of Japanese Web Archives 53 8505 4 6 E-mail: {toyoda,kitsure}@tkl.iis.u-tokyo.ac.jp 999 2002 4 Analyzing Evolution of Web Communities using a Series of Japanese Web Archives Masashi TOYODA and Masaru KITSUREGAWA Institute of Industrial

More information

Online Social Networks and Media. Link Analysis and Web Search

Online Social Networks and Media. Link Analysis and Web Search Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information

More information

CS249: ADVANCED DATA MINING

CS249: ADVANCED DATA MINING CS249: ADVANCED DATA MINING Graph and Network Instructor: Yizhou Sun yzsun@cs.ucla.edu May 31, 2017 Methods Learnt Classification Clustering Vector Data Text Data Recommender System Decision Tree; Naïve

More information

INFO 2950 Intro to Data Science. Lecture 18: Power Laws and Big Data

INFO 2950 Intro to Data Science. Lecture 18: Power Laws and Big Data INFO 2950 Intro to Data Science Lecture 18: Power Laws and Big Data Paul Ginsparg Cornell University, Ithaca, NY 7 Apr 2016 1/25 Power Laws in log-log space y = cx k (k=1/2,1,2) log 10 y = k log 10 x +log

More information

Power Laws & Rich Get Richer

Power Laws & Rich Get Richer Power Laws & Rich Get Richer CMSC 498J: Social Media Computing Department of Computer Science University of Maryland Spring 2015 Hadi Amiri hadi@umd.edu Lecture Topics Popularity as a Network Phenomenon

More information

Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University.

Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University. Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University http://www.mmds.org #1: C4.5 Decision Tree - Classification (61 votes) #2: K-Means - Clustering

More information

Online Sampling of High Centrality Individuals in Social Networks

Online Sampling of High Centrality Individuals in Social Networks Online Sampling of High Centrality Individuals in Social Networks Arun S. Maiya and Tanya Y. Berger-Wolf Department of Computer Science University of Illinois at Chicago 85 S. Morgan, Chicago, IL 667,

More information

Temporal patterns in information and social systems

Temporal patterns in information and social systems Temporal patterns in information and social systems Matúš Medo University of Fribourg, Switzerland COST Workshop Quantifying scientific impact: networks, measures, insights? 12-13 February, 2015, Zurich

More information

ople are co-authors if there edge to each of them. derived from the five largest PH, HEP TH, HEP PH, place a time-stamp on each h paper, and for each perission to the arxiv. The the period from April 1992

More information

Modeling Dynamic Evolution of Online Friendship Network

Modeling Dynamic Evolution of Online Friendship Network Commun. Theor. Phys. 58 (2012) 599 603 Vol. 58, No. 4, October 15, 2012 Modeling Dynamic Evolution of Online Friendship Network WU Lian-Ren ( ) 1,2, and YAN Qiang ( Ö) 1 1 School of Economics and Management,

More information

Online Social Networks and Media. Link Analysis and Web Search

Online Social Networks and Media. Link Analysis and Web Search Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information

More information

RTG: A Recursive Realistic Graph Generator Using Random Typing

RTG: A Recursive Realistic Graph Generator Using Random Typing RTG: A Recursive Realistic Graph Generator Using Random Typing Leman Akoglu and Christos Faloutsos Carnegie Mellon University School of Computer Science {lakoglu,christos}@cs.cmu.edu Abstract. We propose

More information

Lecture VI Introduction to complex networks. Santo Fortunato

Lecture VI Introduction to complex networks. Santo Fortunato Lecture VI Introduction to complex networks Santo Fortunato Plan of the course I. Networks: definitions, characteristics, basic concepts in graph theory II. III. IV. Real world networks: basic properties

More information

Analyzing Evolution of Web Communities using a Series of Japanese Web Archives

Analyzing Evolution of Web Communities using a Series of Japanese Web Archives DEWS2003 2-P-05 53 8505 4 6 E-mail: {toyoda,kitsure}@tkl.iis.u-tokyo.ac.jp 999 2002 4 Analyzing Evolution of Web Communities using a Series of Japanese Web Archives Masashi TOYODA and Masaru KITSUREGAWA

More information

Link Analysis Information Retrieval and Data Mining. Prof. Matteo Matteucci

Link Analysis Information Retrieval and Data Mining. Prof. Matteo Matteucci Link Analysis Information Retrieval and Data Mining Prof. Matteo Matteucci Hyperlinks for Indexing and Ranking 2 Page A Hyperlink Page B Intuitions The anchor text might describe the target page B Anchor

More information

Similarity Measures for Link Prediction Using Power Law Degree Distribution

Similarity Measures for Link Prediction Using Power Law Degree Distribution Similarity Measures for Link Prediction Using Power Law Degree Distribution Srinivas Virinchi and Pabitra Mitra Dept of Computer Science and Engineering, Indian Institute of Technology Kharagpur-72302,

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu Intro sessions to SNAP C++ and SNAP.PY: SNAP.PY: Friday 9/27, 4:5 5:30pm in Gates B03 SNAP

More information