Comparing linear modularization criteria using the relational notation
|
|
- Clifford Cross
- 5 years ago
- Views:
Transcription
1 Comparing linear modularization criteria using the relational notation Université Pierre et Marie Curie Laboratoire de Statistique Théorique et Appliquée April 30th, /35
2 Table of contents 1 Introduction and objective Definitions The definition of community 2 Relational approach 3 Contributions 4 Applications 5 Conclusions 2/35
3 1 Introduction and objective Definitions The definition of community 2 Relational approach Mathematical Relational modelling Properties verified by linear modularization criteria 3 Contributions Linear criteria Comparison of linear criteria 4 Applications The generalized Louvain algorithm Examples: real large graphs 5 Conclusions 3/35
4 Description of the problem Nowadays, we can find networks everywhere: biology, sociology, computer programming, marketing, etc). cyber-marketing, cyber-security. It is difficult to analyse a network directly because of its big size. Therefore, we need to decompose it in clusters or modules modularize it. Different modularization criteria have been formulated in different contexts in the last few years and we need to compare them. Objective: Compare the partitions found by different linear criteria We will provide a unified notation of different linear modularization criteria to understand the properties of the clusters found by their optimization. Moreover, this notation allows to easily identify the criteria having a resolution limit. 4/35
5 Definitions Definitions and notations A graph G(V, E) is a set of objects V, called nodes, linked by edges E. 5/35
6 Definitions Definitions and notations A graph G(V, E) is a set of objects V, called nodes, linked by edges E. N = V is the number of nodes and M = E is the number of edges. 5/35
7 Definitions Definitions and notations A graph G(V, E) is a set of objects V, called nodes, linked by edges E. N = V is the number of nodes and M = E is the number of edges. A graph is completely described by a N N matrix called the Adjacency Matrix A defined as follows { 1 if there is an edge between nodes i and i, a ii = 0 otherwise. Example: given a graph with N = 6 and M = 7. its adjacency matrix is: A = (1) 5/35
8 Definitions Definitions and notations A graph G(V, E) is a set of objects V, called nodes, linked by edges E. N = V is the number of nodes and M = E is the number of edges. A graph is completely described by a N N matrix called the Adjacency Matrix A defined as follows { 1 if there is an edge between nodes i and i, a ii = 0 otherwise. Example: given a graph with N = 6 and M = 7. its adjacency matrix is: A = (1) The degree d i of node i is the number of edges incident to i. d i = i a ii = a i. = a.i The average degree of the graph is d av = 2M N. The Density of the graph is δ = 2M N 2. 5/35
9 Definitions Definitions and notations A graph G(V, E) is a set of objects V, called nodes, linked by edges E. N = V is the number of nodes and M = E is the number of edges. A graph is completely described by a N N matrix called the Adjacency Matrix A defined as follows { 1 if there is an edge between nodes i and i, a ii = 0 otherwise. Example: given a graph with N = 6 and M = 7. its adjacency matrix is: A = (1) The degree d i of node i is the number of edges incident to i. d i = i a ii = a i. = a.i The average degree of the graph is d av = 2M N. The Density of the graph is δ = 2M N 2. Just in case the graph is weighted we will denote the adjacency matrix W. 5/35
10 The definition of community What is a community? Each criterion is based on its own definition of community. Definitions of community: Densely connected groups of nodes with few connections between groups whose density is higher than the density of the whole graph. Set of nodes where the connections are not randomly distributed. many others... The words community, module and cluster will be treated as synonyms 6/35
11 Table of contents 1 Introduction and objective 2 Relational approach Mathematical Relational modelling Properties verified by linear modularization criteria 3 Contributions 4 Applications 5 Conclusions 7/35
12 Mathematical Relational modelling Mathematical Relational modelling Let X be a square matrix of order N defining an equivalence relation on V as follows: { 1 if i and i are in the same cluster i, i V V x ii = (2) 0 otherwise 8/35
13 Mathematical Relational modelling Mathematical Relational modelling Let X be a square matrix of order N defining an equivalence relation on V as follows: { 1 if i and i are in the same cluster i, i V V x ii = (2) 0 otherwise We present a modularization criterion as a function F to optimize: max X subject to the constraints of an equivalence relation: or min F (A, X). (3) X x ii {0, 1} Binarity (4) x ii = 1 i Reflexivity x ii x i i = 0 (i, i ) Symmetry x ii + x i i x ii 1 (i, i, i ) Transitivity Finding the exact solution of this problem turns impractical for large graphs, therefore we will use heuristics ad-hoc. 8/35
14 Properties verified by linear modularization criteria Properties verified by linear modularization criteria A criterion is linear if it can be written in the general form: N N F (X) = φ(a ii )x ii + K (5) i=1 i =1 9/35
15 Properties verified by linear modularization criteria Properties verified by linear modularization criteria A criterion is linear if it can be written in the general form: N N F (X) = φ(a ii )x ii + K (5) i=1 i =1 Besides that, the criterion has the property of General balance if it can be written in the form: N N N N F (X) = φ(a ii )x ii + φ(a ii ) x ii (6) i=1 i =1 i=1 i =1 where K is any constant depending only on the original data; φ(a ii ) 0 i, i and φ(a ii ) 0 i, i are non negative functions verifying: N N i=1 i =1 φii > 0 and N N φ i=1 i =1 ii > 0;. N N N N The quantities φ(a ii )x ii and φ(a ii ) x ii are called positive (+) i=1 i =1 and negative (-) agreements respectively. i=1 i =1 9/35
16 Properties verified by linear modularization criteria The impact of the property of General balance Let κ denote the number of clusters obtained after optimization of the criterion. Given a graph 10/35
17 Properties verified by linear modularization criteria The impact of the property of General balance Let κ denote the number of clusters obtained after optimization of the criterion. If (-) agreements missing ( φ ii = 0 i, i ) Given a graph then all nodes are clustered together, κ = 1 10/35
18 Properties verified by linear modularization criteria The impact of the property of General balance Let κ denote the number of clusters obtained after optimization of the criterion. If (-) agreements If (+) agreements missing ( φ ii = 0 i, i ) missing (φ ii = 0 i, i ) Given a graph then all nodes are clustered together, κ = 1 then all nodes are separated, κ = N 10/35
19 Properties verified by linear modularization criteria Contribution: Different levels of general balance for linear criteria Property of Local balance A balanced linear criterion whose functions φ ii and φ ii satisfy φ ii + φ ii = K L (i, i ) where K L is a constant depending only upon the pair (i, i ) has the property of local balance. Therefore K L must not depend on global properties of the graph. 11/35
20 Properties verified by linear modularization criteria Contribution: Different levels of general balance for linear criteria Property of Local balance A balanced linear criterion whose functions φ ii and φ ii satisfy φ ii + φ ii = K L (i, i ) where K L is a constant depending only upon the pair (i, i ) has the property of local balance. Therefore K L must not depend on global properties of the graph. Property of Global balance A balanced linear criterion whose functions φ ii and φ ii satisfy N N ( ) φii + φ ii = KG i=1 i =1 where K G is a constant depending on global properties of the graph has the property of global balance. 11/35
21 Properties verified by linear modularization criteria Contribution: Different levels of general balance for linear criteria Null Model A global balanced linear criterion whose functions φ ii and φ ii satisfy the following conditions: N N N N φ ii = φ ii i=1 i =1 i=1 i =1 φ ii + φ ii = K G (i, i ) where K G is a constant depending on global properties of the graph is a criterion based on a null model. 12/35
22 Properties verified by linear modularization criteria Different levels of general balance for linear criteria Figure: Venn Diagram of criteria verifying the property of general balance. 13/35
23 Properties verified by linear modularization criteria Resolution limit [Fortunato and Barthelemy (2006)] A null model has a resolution limit. Figure: A null model may fail to identify modules smaller than a scale depending on the global network, even cliques (complete graphs with m nodes k m ) connected by single edges. 14/35
24 Table of contents 1 Introduction and objective 2 Relational approach 3 Contributions Linear criteria Comparison of linear criteria 4 Applications 5 Conclusions 15/35
25 Linear criteria Some linear criteria in relational notation Criterion Zahn-Condorcet (1785, 1964) Owsiński - Zadrożny (1986) Newman-Girvan (2004) Relational notation N N F ZC (X) = (a ii x ii + ā ii x ii ) F ZOZ (X) = i=1 i =1 N i=1 i =1 with 0 < α < 1 F NG (X) = 1 N 2M N ((1 α)a ii x ii + αā ii x ii ) i=1 i =1 N (a ii a i.a.i 2M ) x ii Table: Relational notation of linear modularity functions. 16/35
26 Linear criteria Some linear criteria in relational notation (continuation) Criterion Deviation to Uniformity (2013) Deviation to Indetermination (2013) The Balanced Modularity (2013) Relational notation F UNIF (X) = 1 N N 2M F DI (X) = 1 2M F BM (X) = N N i=1 i =1 N i=1 i =1 i=1 i =1 where P ii = ai.a.i 2M and P ii = ( a ii 2M ) N 2 x ii ( a ii a i. N a.i N + 2M ) N 2 x ii N ( (aii P ii ) x ii + (ā ii P ii ) x ii ) ( ā ii (N ai.)(n a.i ) N 2 2M ) Table: Relational notation of linear modularity functions. 17/35
27 Linear criteria Existing linear criteria: Deviation to a particular structure Principe: A random graph does not have community structure. Uniformity structure Independence structure Multiplicative model: Indetermination structure Additive model: Geometric mean Arithmetic mean 18/35
28 Linear criteria Properties of these linear criteria The 6 criteria have the property of General balance. Criterion Zahn-Condorcet Owsiński-Zadrożny Newman-Girvan Deviation to Uniformity Deviation to Indetermination Balanced modularity Global balance Local Null Balance model X X X X X X Table: Balance Property for Linear criteria 19/35
29 Comparison of linear criteria The number of clusters The number of clusters found by the generalized Louvain algorithm for two real networks. Network Jazz Internet N = 198 N = 69,949 M = 2,742 M = 351,380 Criterion κ κ Zahn-Condorcet 38 40,123 Owsiński-Zadrożny 6 α = 2% 456 α < 1% Deviation to Uniformity Newman-Girvan 4 46 Deviation to Indetermination 6 39 Balanced Modularity 5 41 How to explain these differences? 20/35
30 Comparison of linear criteria First approach: the deviation form notation Criterion Notation F (X) = Zahn-Condorcet F ZC (X) = Owsiński-Zadrożny F OZ (X) = Deviation to uniformity F UNIF (X) = Newman-Girvan F NG (X) = Deviation to indetermination F DI (X) = N N i=1 i =1 N i=1 i =1 N N i=1 i =1 ( a ii 1 2 N (φ ii φ ii )x ii ) x ii N (a ii α)x ii N i=1 i =1 N N ( i=1 i =1 N N i=1 i =1 ( a ii 2M N 2 ) x ii a ii ai.a.i 2M ( a ii ) x ii ( ai. N + a.i N 2M N 2 )) x ii 21/35
31 Comparison of linear criteria First approach: the deviation form notation The (+) agreements term is the same for the 5 criteria: φ ZC ii = φoz ii = φng ii = φdi ii = φunif ii = a ii (i, i ). So we have to analyse the (-) agreements term. We distinguish 2 cases: 22/35
32 Comparison of linear criteria First approach: the deviation form notation The (+) agreements term is the same for the 5 criteria: φ ZC ii = φoz ii = φng ii = φdi ii = φunif ii = a ii (i, i ). So we have to analyse the (-) agreements term. We distinguish 2 cases: Case 1: φ is a constant not depending on each pair of nodes. Thas is the case of Zahn-Condorcet, Owsiński-Zadrożny and Deviation to Uniformity criteria. 22/35
33 Comparison of linear criteria First approach: The degree distribution Case 2: φ depends on the degree of each pair of nodes as for Newman-Girvan and Deviation to Indetermination criteria. 23/35
34 Comparison of linear criteria First approach: The degree distribution Case 2: φ depends on the degree of each pair of nodes as for Newman-Girvan and Deviation to Indetermination criteria. Many real networks fall into the class of scale-free networks, meaning that their degree distribution follows a power-law which distinguish them from random networks [Barabasi and Albert (1999)]. In a scale-free network a few nodes called hubs have many connexions whereas most nodes have few connexions. 23/35
35 Comparison of linear criteria Comparison between Newman-Girvan, Deviation to Indetermination and the Balanced Modularity Maximizing the Balanced Modularity turns out to maximize the following expressions depending upon the Newman-Girvan criterion and the Deviation to Indetermination respectively. N N ( ) (ai. d av )(a.i d av ) F BM = 2F NG + x ii. 2M(1 δ) i=1 i =1 ( F BM = 2F DI ) N δ i=1 i =1 N ( (ai. d av )(a.i d av ) N 2 (1 δ) The Balanced Modularity behaves as a regulator between the Newman-Girvan criterion and the Deviation to Indetermination. ) x ii. 24/35
36 Comparison of linear criteria Second approach: Impact of merging two clusters Now let us suppose we want to merge two clusters C 1 and C 2 in the network of sizes n 1 and n 2 respectively. Let us suppose as well they are connected by l edges and they have average degree d 1 av et d 2 av respectively. 25/35
37 Comparison of linear criteria Impact of merging two clusters What is the contribution of merging two clusters to the value of each criterion? 26/35
38 Comparison of linear criteria Impact of merging two clusters What is the contribution of merging two clusters to the value of each criterion? The contribution C of merging two clusters will be: C = n 1 i C 1 n 2 i C 2 (φ ii φ ii ) (7) 26/35
39 Comparison of linear criteria Impact of merging two clusters What is the contribution of merging two clusters to the value of each criterion? The contribution C of merging two clusters will be: C = n 1 i C 1 n 2 i C 2 (φ ii φ ii ) (7) The objective is to compare function φ(.) to function φ(.) If C > 0 the criterion merges the two clusters, it is a gain. If C < 0 the criterion separates the two clusters, it is a cost. 26/35
40 Comparison of linear criteria The Contribution of merging two clusters Contribution of merging two clusters for linear criteria. n 1 n 2 Criterion: F C F = (φ ii φ ii ) i C 1 i C ( 2 Zahn-Condorcet C ZC = l n ) 1n 2 2 Owsiński-Zadrożny C OZ = (l n 1 n 2 α) 0 < α < 1 Deviation to Uniformity C UNIF = ( (l n 1 n 2 δ) d Newman-Girvan C NG = l n 1 n avd 1 2 ) av 2 2M Deviation to Indetermination C DI = ( ( d 1 l n 1 n av 2 N + d av 2 N 2M N 2 )) 27/35
41 Comparison of linear criteria Summary by criterion Criterion Zahn-Condorcet Owsiński-Zadrożny Deviation to Uniformity Characteristics of the clustering The density of edges of each cluster is at least equal to 50%. No resolution limit. For real networks the optimal partition contains many small clusters or single nodes. It gives the choice to define the minimum required within-cluster density, α. For α = 0.5 the Owsiński-Zadrożny criterion the Zahn-Condorcet criterion. No resolution limit. A particular case of Owsiński-Zadrożny criterion with α = δ. The density of within cluster edges of each cluster is at least δ. It has a resolution limit. 28/35
42 Comparison of linear criteria Summary by criterion Criterion Newman-Girvan Deviation to Indetermination Balanced modularity Characteristics of the clustering It has a resolution limit. The contribution depends on the degree distribution of the clusters. The optimal partition has no single nodes. It has a resolution limit. The contribution depends on the degree distribution of the clusters. It has a resolution limit. The contribution depends on the degree distribution of the clusters. Depending upon δ and d av this criterion behaves like a regulator between the NG criterion and the DI criterion. 29/35
43 Table of contents 1 Introduction and objective 2 Relational approach 3 Contributions 4 Applications The generalized Louvain algorithm Examples: real large graphs 5 Conclusions 30/35
44 Examples: real large graphs Modularizing large graphs with the generalized Louvain algorithm Number of clusters found by the generalized Louvain algorithm (see [Campigotto et al. (2014)]) Web nd.edu Amazon Youtube WebBase01 N 325k 334k 1M 118M M 1M 925k 3M 1B δ Criterion κ κ κ κ ZC 201, , ,849 71,806,729 UNIF ,584 2,777,580 NG ,567 2,759,248 DI ,985 2,770,647 BM ,410 2,736,808 Ref: Zahn-Condorcet (ZC), Deviation to Uniformity (UNIF), Newman-Girvan (NG), Deviation to Indetermination(DI) and Balanced Modularity (BM) 31/35
45 Examples: real large graphs Running times obtained for each criterion N M NG ZC DI UNIF BM karate arxiv 9k 48k internet 70k 351k webndu 325k 2M webuk M 783M webbase M 1B 430 1, Table: Running times (in seconds) obtained for each criterion 32/35
46 Table of contents 1 Introduction and objective 2 Relational approach 3 Contributions 4 Applications 5 Conclusions 33/35
47 Conclusions We presented six different modularization criteria in Relational notation. 34/35
48 Conclusions We presented six different modularization criteria in Relational notation. We described and clearly defined the property of balance by making the link between this property and the resolution limit property. 34/35
49 Conclusions We presented six different modularization criteria in Relational notation. We described and clearly defined the property of balance by making the link between this property and the resolution limit property. The generic Louvain algorithm allowed us to modularize real large graphs and we could compare the number of clusters found by the different criteria. 34/35
50 Conclusions We presented six different modularization criteria in Relational notation. We described and clearly defined the property of balance by making the link between this property and the resolution limit property. The generic Louvain algorithm allowed us to modularize real large graphs and we could compare the number of clusters found by the different criteria. We characterized the partitions found by six linear modularization criteria. We saw that two criteria who have a local definition are based on a the density of within-cluster edges (Zahn-Condorcet and Owsiński-Zadrożny), whereas others are based on a null model (Newman-Girvan, Deviation to Uniformity, Deviation to Indetermination and the Balanced Modularity). These criteria have a resolution limit. 34/35
51 Thanks for your attention! 35/35
Comparison of linear modularization criteria using the relational formalism, an approach to easily identify resolution limit
Comparison of linear modularization criteria using the relational formalism, an approach to easily identify resolution limit Patricia Conde-Céspedes, Jean-François Marcotorchino, Emmanuel Viennet, 1 L2TI
More informationThe Trouble with Community Detection
The Trouble with Community Detection Aaron Clauset Santa Fe Institute 7 April 2010 Nonlinear Dynamics of Networks Workshop U. Maryland, College Park Thanks to National Science Foundation REU Program James
More informationModularity and Graph Algorithms
Modularity and Graph Algorithms David Bader Georgia Institute of Technology Joe McCloskey National Security Agency 12 July 2010 1 Outline Modularity Optimization and the Clauset, Newman, and Moore Algorithm
More informationOn Resolution Limit of the Modularity in Community Detection
The Ninth International Symposium on Operations Research and Its Applications (ISORA 10) Chengdu-Jiuzhaigou, China, August 19 23, 2010 Copyright 2010 ORSC & APORC, pp. 492 499 On Resolution imit of the
More information1 Matrix notation and preliminaries from spectral graph theory
Graph clustering (or community detection or graph partitioning) is one of the most studied problems in network analysis. One reason for this is that there are a variety of ways to define a cluster or community.
More information1 Complex Networks - A Brief Overview
Power-law Degree Distributions 1 Complex Networks - A Brief Overview Complex networks occur in many social, technological and scientific settings. Examples of complex networks include World Wide Web, Internet,
More informationA Dimensionality Reduction Framework for Detection of Multiscale Structure in Heterogeneous Networks
Shen HW, Cheng XQ, Wang YZ et al. A dimensionality reduction framework for detection of multiscale structure in heterogeneous networks. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 27(2): 341 357 Mar. 2012.
More informationOn Defining and Computing Communities
AU Herning Aarhus University martino@hih.au.dk UPF, Barcelona, November 29th 2012 Zacharys Karate Club 25 26 12 5 17 7 11 28 32 6 24 27 29 20 1 22 9 30 34 3 2 13 15 33 4 14 18 16 31 8 21 10 23 19 A community
More informationMultislice community detection
Multislice community detection P. J. Mucha, T. Richardson, K. Macon, M. A. Porter, J.-P. Onnela Jukka-Pekka JP Onnela Harvard University NetSci2010, MIT; May 13, 2010 Outline (1) Background (2) Multislice
More informationarxiv: v1 [physics.soc-ph] 20 Oct 2010
arxiv:00.098v [physics.soc-ph] 20 Oct 200 Spectral methods for the detection of network community structure: A comparative analysis Hua-Wei Shen and Xue-Qi Cheng Institute of Computing Technology, Chinese
More informationSocial and Technological Network Analysis. Lecture 4: Modularity and Overlapping Dr. Cecilia Mascolo
Social and Technological Network Analysis Lecture 4: Modularity and Overlapping Communi@es Dr. Cecilia Mascolo In This Lecture We will describe the concept of Modularity and Modularity Op@miza@on. We will
More informationModularity in several random graph models
Modularity in several random graph models Liudmila Ostroumova Prokhorenkova 1,3 Advanced Combinatorics and Network Applications Lab Moscow Institute of Physics and Technology Moscow, Russia Pawe l Pra
More informationMultimedia Databases - 68A6 Final Term - exercises
Multimedia Databases - 8A Final Term - exercises Exercises for the preparation to the final term June, the 1th 00 quiz 1. approximation of cosine similarity An approximate computation of the cosine similarity
More informationNonparametric Bayesian Matrix Factorization for Assortative Networks
Nonparametric Bayesian Matrix Factorization for Assortative Networks Mingyuan Zhou IROM Department, McCombs School of Business Department of Statistics and Data Sciences The University of Texas at Austin
More informationPartition-Merge: Distributed Inference and Modularity Optimization
Partition-Merge: Distributed Inference and Modularity Optimization Vincent Blondel Universite Catholique de Louvain, Belgium Kyomin Jung Seoul National University, South Korea Pushmeet Kohli DeepMind,
More informationNetworks. Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource
Networks Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource Networks in biology Protein-Protein Interaction Network of Yeast Transcriptional regulatory network of E.coli Experimental
More informationCS224W: Social and Information Network Analysis Jure Leskovec, Stanford University
CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu Non-overlapping vs. overlapping communities 11/10/2010 Jure Leskovec, Stanford CS224W: Social
More informationMining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University
Note to other teachers and users of these slides: We would be delighted if you found this our material useful in giving your own lectures. Feel free to use these slides verbatim, or to modify them to fit
More informationGraph Clustering Algorithms
PhD Course on Graph Mining Algorithms, Università di Pisa February, 2018 Clustering: Intuition to Formalization Task Partition a graph into natural groups so that the nodes in the same cluster are more
More informationWhen are networks truly modular?
Physica D 224 (2006) 20 26 www.elsevier.com/locate/physd When are networks truly modular? Jörg Reichardt, Stefan Bornholdt Institute for Theoretical Physics, University of Bremen, Otto-Hahn-Allee, 28359
More informationCS224W: Analysis of Networks Jure Leskovec, Stanford University
CS224W: Analysis of Networks Jure Leskovec, Stanford University http://cs224w.stanford.edu 10/30/17 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu 2
More informationNetwork Observational Methods and. Quantitative Metrics: II
Network Observational Methods and Whitney topics Quantitative Metrics: II Community structure (some done already in Constraints - I) The Zachary Karate club story Degree correlation Calculating degree
More informationEmerging communities in networks a flow of ties
Emerging communities in networks a flow of ties Małgorzata J. Krawczyk, Przemysław Gawroński, Krzysztof Kułakowski Complex Systems Group Faculty of Physics and Applied Computer Science AGH University of
More informationThe Role of Network Science in Biology and Medicine. Tiffany J. Callahan Computational Bioscience Program Hunter/Kahn Labs
The Role of Network Science in Biology and Medicine Tiffany J. Callahan Computational Bioscience Program Hunter/Kahn Labs Network Analysis Working Group 09.28.2017 Network-Enabled Wisdom (NEW) empirically
More informationComputational Lower Bounds for Community Detection on Random Graphs
Computational Lower Bounds for Community Detection on Random Graphs Bruce Hajek, Yihong Wu, Jiaming Xu Department of Electrical and Computer Engineering Coordinated Science Laboratory University of Illinois
More informationCommunity detection in time-dependent, multiscale, and multiplex networks
Community detection in time-dependent, multiscale, and multiplex networks P. J. Mucha, T. Richardson, K. Macon, M. A. Porter, J.-P. Onnela Jukka-Pekka Onnela Harvard University MERSIH, November 13, 2009
More informationAdventures in random graphs: Models, structures and algorithms
BCAM January 2011 1 Adventures in random graphs: Models, structures and algorithms Armand M. Makowski ECE & ISR/HyNet University of Maryland at College Park armand@isr.umd.edu BCAM January 2011 2 Complex
More informationCommunity Detection. Data Analytics - Community Detection Module
Community Detection Data Analytics - Community Detection Module Zachary s karate club Members of a karate club (observed for 3 years). Edges represent interactions outside the activities of the club. Community
More informationProject in Computational Game Theory: Communities in Social Networks
Project in Computational Game Theory: Communities in Social Networks Eldad Rubinstein November 11, 2012 1 Presentation of the Original Paper 1.1 Introduction In this section I present the article [1].
More informationJure Leskovec Joint work with Jaewon Yang, Julian McAuley
Jure Leskovec (@jure) Joint work with Jaewon Yang, Julian McAuley Given a network, find communities! Sets of nodes with common function, role or property 2 3 Q: How and why do communities form? A: Strength
More informationRelations Between Adjacency And Modularity Graph Partitioning: Principal Component Analysis vs. Modularity Component Analysis
Relations Between Adjacency And Modularity Graph Partitioning: Principal Component Analysis vs. Modularity Component Analysis Hansi Jiang Carl Meyer North Carolina State University October 27, 2015 1 /
More informationCommunities, Spectral Clustering, and Random Walks
Communities, Spectral Clustering, and Random Walks David Bindel Department of Computer Science Cornell University 26 Sep 2011 20 21 19 16 22 28 17 18 29 26 27 30 23 1 25 5 8 24 2 4 14 3 9 13 15 11 10 12
More informationarxiv: v2 [physics.soc-ph] 13 Jun 2009
Alternative approach to community detection in networks A.D. Medus and C.O. Dorso Departamento de Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Pabellón 1, Ciudad Universitaria,
More informationUncovering structure in biological networks: A model-based approach
Uncovering structure in biological networks: A model-based approach J-J Daudin, F. Picard, S. Robin, M. Mariadassou UMR INA-PG / ENGREF / INRA, Paris Mathématique et Informatique Appliquées Statistics
More informationCSI 445/660 Part 6 (Centrality Measures for Networks) 6 1 / 68
CSI 445/660 Part 6 (Centrality Measures for Networks) 6 1 / 68 References 1 L. Freeman, Centrality in Social Networks: Conceptual Clarification, Social Networks, Vol. 1, 1978/1979, pp. 215 239. 2 S. Wasserman
More informationOverlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach
Overlapping Community Detection at Scale: A Nonnegative Matrix Factorization Approach Author: Jaewon Yang, Jure Leskovec 1 1 Venue: WSDM 2013 Presenter: Yupeng Gu 1 Stanford University 1 Background Community
More informationFinding Common Modules in a Time-Varying Network with Application to the Drosophila Melanogaster Gene Regulation Network
Finding Common Modules in a Time-Varying Network with Application to the Drosophila Melanogaster Gene Regulation Network Abstract Finding functional modules in gene regulation networks is an important
More information1 Matrix notation and preliminaries from spectral graph theory
Graph clustering (or community detection or graph partitioning) is one of the most studied problems in network analysis. One reason for this is that there are a variety of ways to define a cluster or community.
More informationChapter 11. Matrix Algorithms and Graph Partitioning. M. E. J. Newman. June 10, M. E. J. Newman Chapter 11 June 10, / 43
Chapter 11 Matrix Algorithms and Graph Partitioning M. E. J. Newman June 10, 2016 M. E. J. Newman Chapter 11 June 10, 2016 1 / 43 Table of Contents 1 Eigenvalue and Eigenvector Eigenvector Centrality The
More informationCooperative Game Theory Approaches for Network Partitioning
Cooperative Game Theory Approaches for Network Partitioning Konstantin Avrachenkov, Aleksei Kondratev, Vladimir Mazalov To cite this version: Konstantin Avrachenkov, Aleksei Kondratev, Vladimir Mazalov.
More informationThe non-backtracking operator
The non-backtracking operator Florent Krzakala LPS, Ecole Normale Supérieure in collaboration with Paris: L. Zdeborova, A. Saade Rome: A. Decelle Würzburg: J. Reichardt Santa Fe: C. Moore, P. Zhang Berkeley:
More informationCS224W: Social and Information Network Analysis Jure Leskovec, Stanford University
CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu 10/24/2012 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu
More informationErzsébet Ravasz Advisor: Albert-László Barabási
Hierarchical Networks Erzsébet Ravasz Advisor: Albert-László Barabási Introduction to networks How to model complex networks? Clustering and hierarchy Hierarchical organization of cellular metabolism The
More informationA Multiobjective GO based Approach to Protein Complex Detection
Available online at www.sciencedirect.com Procedia Technology 4 (2012 ) 555 560 C3IT-2012 A Multiobjective GO based Approach to Protein Complex Detection Sumanta Ray a, Moumita De b, Anirban Mukhopadhyay
More informationA new centrality measure for probabilistic diffusion in network
ACSIJ Advances in Computer Science: an International Journal, Vol. 3, Issue 5, No., September 204 ISSN : 2322-557 A new centrality measure for probabilistic diffusion in network Kiyotaka Ide, Akira Namatame,
More informationQuantum Annealing Approaches to Graph Partitioning on the D-Wave System
Quantum Annealing Approaches to Graph Partitioning on the D-Wave System 2017 D-Wave QUBITS Users Conference Applications 1: Optimization S. M. Mniszewski, smm@lanl.gov H. Ushijima-Mwesigwa, hayato@lanl.gov
More informationA Modified Method Using the Bethe Hessian Matrix to Estimate the Number of Communities
Journal of Advanced Statistics, Vol. 3, No. 2, June 2018 https://dx.doi.org/10.22606/jas.2018.32001 15 A Modified Method Using the Bethe Hessian Matrix to Estimate the Number of Communities Laala Zeyneb
More informationMarkov Chains and Spectral Clustering
Markov Chains and Spectral Clustering Ning Liu 1,2 and William J. Stewart 1,3 1 Department of Computer Science North Carolina State University, Raleigh, NC 27695-8206, USA. 2 nliu@ncsu.edu, 3 billy@ncsu.edu
More informationNetwork Science (overview, part 1)
Network Science (overview, part 1) Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California rgera@nps.edu Excellence Through Knowledge Overview Current research Section 1:
More informationProtein Complex Identification by Supervised Graph Clustering
Protein Complex Identification by Supervised Graph Clustering Yanjun Qi 1, Fernanda Balem 2, Christos Faloutsos 1, Judith Klein- Seetharaman 1,2, Ziv Bar-Joseph 1 1 School of Computer Science, Carnegie
More informationA Study on Relationship between Modularity and Diffusion Dynamics in Networks from Spectral Analysis Perspective
A Study on Relationship between Modularity and Diffusion Dynamics in Networks from Spectral Analysis Perspective Kiyotaka Ide, Akira Namatame Department of Computer Science National Defense Academy of
More informationLecture VI Introduction to complex networks. Santo Fortunato
Lecture VI Introduction to complex networks Santo Fortunato Plan of the course I. Networks: definitions, characteristics, basic concepts in graph theory II. III. IV. Real world networks: basic properties
More informationOn efficient use of entropy centrality for social network analysis and community detection
On efficient use of entropy centrality for social network analysis and community detection ALEXANDER G. NIKOLAEV, RAIHAN RAZIB, ASHWIN KUCHERIYA PRESENTER: PRIYA BALACHANDRAN MARY ICSI 445/660 12/1/2015
More informationV 6 Network analysis
V 6 Network analysis - Dijkstra algorithm: compute shortest pathways - Graph layout - Network robustness - Graph modularity Fri, May 4, 2018 Bioinformatics 3 SS 18 V 6 1 The Shortest Path Problem Problem:
More informationInteraction Network Analysis
CSI/BIF 5330 Interaction etwork Analsis Young-Rae Cho Associate Professor Department of Computer Science Balor Universit Biological etworks Definition Maps of biochemical reactions, interactions, regulations
More informationPreface. Contributors
CONTENTS Foreword Preface Contributors PART I INTRODUCTION 1 1 Networks in Biology 3 Björn H. Junker 1.1 Introduction 3 1.2 Biology 101 4 1.2.1 Biochemistry and Molecular Biology 4 1.2.2 Cell Biology 6
More informationCommunity structure in complex networks. Santo Fortunato
Community structure in complex networks Santo Fortunato Me in a nutshell Computational Social Science Networks Science of Science Computational social science Science of science - c P(c/c) Agricultural
More informationIdentifying communities within energy landscapes
Identifying communities within energy landscapes Claire P. Massen and Jonathan P. K. Doye University Chemical Laboratory, Lensfield Road, Cambridge CB2 1EW, United Kingdom (Dated: February 2, 28) arxiv:cond-mat/412469v1
More informationCommunity Detection in Signed Networks: an Error-Correcting Code Approach
Community Detection in Signed Networks: an Error-Correcting Code Approach Cheng-Shang Chang, Duan-Shin Lee, Li-Heng Liou, and Sheng-Min Lu Institute of Communications Engineering, National Tsing Hua University
More informationNetworks as vectors of their motif frequencies and 2-norm distance as a measure of similarity
Networks as vectors of their motif frequencies and 2-norm distance as a measure of similarity CS322 Project Writeup Semih Salihoglu Stanford University 353 Serra Street Stanford, CA semih@stanford.edu
More informationIV. Analyse de réseaux biologiques
IV. Analyse de réseaux biologiques Catherine Matias CNRS - Laboratoire de Probabilités et Modèles Aléatoires, Paris catherine.matias@math.cnrs.fr http://cmatias.perso.math.cnrs.fr/ ENSAE - 2014/2015 Sommaire
More informationSpectral Graph Theory for. Dynamic Processes on Networks
Spectral Graph Theory for Dynamic Processes on etworks Piet Van Mieghem in collaboration with Huijuan Wang, Dragan Stevanovic, Fernando Kuipers, Stojan Trajanovski, Dajie Liu, Cong Li, Javier Martin-Hernandez,
More informationV 5 Robustness and Modularity
Bioinformatics 3 V 5 Robustness and Modularity Mon, Oct 29, 2012 Network Robustness Network = set of connections Failure events: loss of edges loss of nodes (together with their edges) loss of connectivity
More information9.5 HONORS Determine Odd and Even Functions Graphically and Algebraically
9.5 HONORS Determine Odd and Even Functions Graphically and Algebraically Use this blank page to compile the most important things you want to remember for cycle 9.5: 181 Even and Odd Functions Even Functions:
More informationarxiv: v1 [physics.soc-ph] 20 May 2016
Community Detection Using Multilayer Edge Mixture Model Han Zhang, Chang-Dong Wang, and Jian-Huang Lai School of Data and Computer Science, Sun Yat-sen University, Guangzhou, P. R. China. Philip S. Yu,
More informationarxiv: v1 [cs.si] 8 Oct 2018
An ensemble based on a bi-objective evolutionary spectral algorithm for graph clustering Camila P.S. Tautenhain and Mariá C.V. Nascimento Instituto de Ciência e Tecnologia, Universidade Federal de São
More informationStability-optimization algorithms for the detection of community structure in networks
Stability-optimization algorithms for the detection of community structure in networks Marianne McKenzie Keble College University of Oxford A thesis submitted for the degree of MSc in Mathematics and the
More informationEvolution of a social network: The role of cultural diversity
PHYSICAL REVIEW E 73, 016135 2006 Evolution of a social network: The role of cultural diversity A. Grabowski 1, * and R. A. Kosiński 1,2, 1 Central Institute for Labour Protection National Research Institute,
More informationNetwork Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai
Network Biology: Understanding the cell s functional organization Albert-László Barabási Zoltán N. Oltvai Outline: Evolutionary origin of scale-free networks Motifs, modules and hierarchical networks Network
More informationDeterministic modularity optimization
EPJ B proofs (will be inserted by the editor) Deterministic modularity optimization S. Lehmann 1,2,a and L.K. Hansen 1 1 Informatics and Mathematical Modeling, Technical University of Denmark 2 Center
More informationDRAGAN STEVANOVIĆ. Key words. Modularity matrix; Community structure; Largest eigenvalue; Complete multipartite
A NOTE ON GRAPHS WHOSE LARGEST EIGENVALUE OF THE MODULARITY MATRIX EQUALS ZERO SNJEŽANA MAJSTOROVIĆ AND DRAGAN STEVANOVIĆ Abstract. Informally, a community within a graph is a subgraph whose vertices are
More informationUncovering the Digital Divide and the Physical Divide in Senegal Using Mobile Phone Data
Uncovering the Digital Divide and the Physical Divide in Senegal Using Mobile Phone Data Song Gao, Bo Yan, Li Gong, Blake Regalia, Yiting Ju, Yingjie Hu STKO Lab, Department of Geography, University of
More informationCommunities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices
Communities Via Laplacian Matrices Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices The Laplacian Approach As with betweenness approach, we want to divide a social graph into
More informationDesign and characterization of chemical space networks
Design and characterization of chemical space networks Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-University Bonn 16 August 2015 Network representations of chemical spaces
More informationClustering of Pathogenic Genes in Human Co-regulatory Network. Michael Colavita Mentor: Soheil Feizi Fifth Annual MIT PRIMES Conference May 17, 2015
Clustering of Pathogenic Genes in Human Co-regulatory Network Michael Colavita Mentor: Soheil Feizi Fifth Annual MIT PRIMES Conference May 17, 2015 Topics Background Genetic Background Regulatory Networks
More informationCausality and communities in neural networks
Causality and communities in neural networks Leonardo Angelini, Daniele Marinazzo, Mario Pellicoro, Sebastiano Stramaglia TIRES-Center for Signal Detection and Processing - Università di Bari, Bari, Italy
More informationMini course on Complex Networks
Mini course on Complex Networks Massimo Ostilli 1 1 UFSC, Florianopolis, Brazil September 2017 Dep. de Fisica Organization of The Mini Course Day 1: Basic Topology of Equilibrium Networks Day 2: Percolation
More informationAsymptotic Modularity of some Graph Classes
Asymptotic Modularity of some Graph Classes Fabien de Montgolfier 1, Mauricio Soto 1, and Laurent Viennot 2 1 LIAFA, UMR 7089 CNRS - Université Paris Diderot. 2 INRIA and Université Paris Diderot fm@liafa.jussieu.fr
More informationCohesive subgroups. MGT 780 Spring 2014 Steve Borgatti
Cohesive subgroups MGT 780 Spring 2014 Steve Borgatti What are cohesive subgroups? Loosely speaking, network clumps Regions of high density and low distance Initially conceived of as formalizations of
More informationA DISCUSSION OF KEISLER S ORDER
A DISCUSSION OF KEISLER S ORDER ADELE PADGETT Abstract. In model theory, the complex numbers as an algebraically closed field are often given as an example of a simple, well-behaved mathematical structure,
More informationA LINE GRAPH as a model of a social network
A LINE GRAPH as a model of a social networ Małgorzata Krawczy, Lev Muchni, Anna Mańa-Krasoń, Krzysztof Kułaowsi AGH Kraów Stern School of Business of NY University outline - ideas, definitions, milestones
More informationBayesian methods for graph clustering
Author manuscript, published in "Advances in data handling and business intelligence (2009) 229-239" DOI : 10.1007/978-3-642-01044-6 Bayesian methods for graph clustering P. Latouche, E. Birmelé, and C.
More informationCohesive Subgroups. October 4, Based on slides by Steve Borgatti. Modifications (many sensible) by Rich DeJordy
Cohesive Subgroups October 4, 26 Based on slides by Steve Borgatti. Modifications (many sensible) by Rich DeJordy Before We Start Any questions on cohesion? Why do we care about Cohesive Subgroups 1. They
More informationStatistical Evaluation of Spectral Methods for Anomaly Detection in Static Networks
Statistical Evaluation of Spectral Methods for Anomaly Detection in Static Networks Tomilayo Komolafe 1, A. Valeria Quevedo 1,2, Srijan Sengupta 1 and William H. Woodall 1 arxiv:1711.01378v3 [stat.ap]
More informationThe Beginning of Graph Theory. Theory and Applications of Complex Networks. Eulerian paths. Graph Theory. Class Three. College of the Atlantic
Theory and Applications of Complex Networs 1 Theory and Applications of Complex Networs 2 Theory and Applications of Complex Networs Class Three The Beginning of Graph Theory Leonhard Euler wonders, can
More informationCS224W: Social and Information Network Analysis Jure Leskovec, Stanford University
CS224W: Social and Information Network Analysis Jure Leskovec Stanford University Jure Leskovec, Stanford University http://cs224w.stanford.edu Task: Find coalitions in signed networks Incentives: European
More informationData Mining Techniques
Data Mining Techniques CS 622 - Section 2 - Spring 27 Pre-final Review Jan-Willem van de Meent Feedback Feedback https://goo.gl/er7eo8 (also posted on Piazza) Also, please fill out your TRACE evaluations!
More informationData science with multilayer networks: Mathematical foundations and applications
Data science with multilayer networks: Mathematical foundations and applications CDSE Days University at Buffalo, State University of New York Monday April 9, 2018 Dane Taylor Assistant Professor of Mathematics
More informationarxiv:cond-mat/ v1 [cond-mat.dis-nn] 27 Mar 2006
Statistical Mechanics of Community Detection arxiv:cond-mat/0603718v1 [cond-mat.dis-nn] 27 Mar 2006 Jörg Reichardt 1 and Stefan Bornholdt 1 1 Institute for Theoretical Physics, University of Bremen, Otto-Hahn-Allee,
More informationCS224W: Social and Information Network Analysis Jure Leskovec, Stanford University
CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu Intro sessions to SNAP C++ and SNAP.PY: SNAP.PY: Friday 9/27, 4:5 5:30pm in Gates B03 SNAP
More informationNetwork models: random graphs
Network models: random graphs Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Structural Analysis
More informationFinding normalized and modularity cuts by spectral clustering. Ljubjana 2010, October
Finding normalized and modularity cuts by spectral clustering Marianna Bolla Institute of Mathematics Budapest University of Technology and Economics marib@math.bme.hu Ljubjana 2010, October Outline Find
More informationA New Facet Generating Procedure for the Stable Set Polytope
1 / 22 A New Facet Generating Procedure for the Stable Set Polytope Álinson S. Xavier a Manoel Campêlo b a Mestrado e Doutorado em Ciência da Computação Universidade Federal do Ceará Fortaleza, Brazil
More informationStatistical analysis of biological networks.
Statistical analysis of biological networks. Assessing the exceptionality of network motifs S. Schbath Jouy-en-Josas/Evry/Paris, France http://genome.jouy.inra.fr/ssb/ Colloquium interactions math/info,
More informationGraph Detection and Estimation Theory
Introduction Detection Estimation Graph Detection and Estimation Theory (and algorithms, and applications) Patrick J. Wolfe Statistics and Information Sciences Laboratory (SISL) School of Engineering and
More informationPartitions versus sets : a case of duality
Partitions versus sets : a case of duality Laurent Lyaudet Université d Orléans - LIFO, Rue Léonard de Vinci, B.P. 6759, F-45067 Orléans Cedex 2, France Frédéric Mazoit 1 Université Bordeaux - LaBRI, 351,
More informationPROBABILISTIC LATENT SEMANTIC ANALYSIS
PROBABILISTIC LATENT SEMANTIC ANALYSIS Lingjia Deng Revised from slides of Shuguang Wang Outline Review of previous notes PCA/SVD HITS Latent Semantic Analysis Probabilistic Latent Semantic Analysis Applications
More informationStatistical mechanics of community detection
Statistical mechanics of community detection Jörg Reichardt and Stefan Bornholdt Institute for Theoretical Physics, University of Bremen, Otto-Hahn-Allee, D-28359 Bremen, Germany Received 22 December 2005;
More informationSpiking Neural P Systems and Modularization of Complex Networks from Cortical Neural Network to Social Networks
Spiking Neural P Systems and Modularization of Complex Networks from Cortical Neural Network to Social Networks Adam Obtu lowicz Institute of Mathematics, Polish Academy of Sciences Śniadeckich 8, P.O.B.
More informationAn Efficient Algorithm for Protein-Protein Interaction Network Analysis to Discover Overlapping Functional Modules
An Efficient Algorithm for Protein-Protein Interaction Network Analysis to Discover Overlapping Functional Modules Ying Liu 1 Department of Computer Science, Mathematics and Science, College of Professional
More information