Centrality Measures. Leonid E. Zhukov

Similar documents
Centrality Measures. Leonid E. Zhukov

Node and Link Analysis

Node Centrality and Ranking on Networks

Web Structure Mining Nodes, Links and Influence

Diffusion and random walks on graphs

CSI 445/660 Part 6 (Centrality Measures for Networks) 6 1 / 68

Unit 5: Centrality. ICPSR University of Michigan, Ann Arbor Summer 2015 Instructor: Ann McCranie

Network models: random graphs

Link Analysis. Leonid E. Zhukov

Epidemics on networks

Epidemics on networks

Epidemics and information spreading

Power laws. Leonid E. Zhukov

UNIT 4 RANK CORRELATION (Rho AND KENDALL RANK CORRELATION

STABILITY AND CONTINUITY OF CENTRALITY MEASURES IN WEIGHTED GRAPHS. Department of Electrical and Systems Engineering, University of Pennsylvania

Diffusion of Innovation and Influence Maximization

Diffusion of Innovation

Understand the difference between symmetric and asymmetric measures

Degree Distribution: The case of Citation Networks

Introduction Centrality Measures Implementation Applications Limitations Homework. Centrality Metrics. Ron Hagan, Yunhe Feng, and Jordan Bush

Network models: dynamical growth and small world

Applications to network analysis: Eigenvector centrality indices Lecture notes

Lecture 13 Spectral Graph Algorithms

Data Mining and Analysis: Fundamental Concepts and Algorithms

Centrality in Social Networks

Lecture: Modeling graphs with electrical networks

1 Complex Networks - A Brief Overview

CS224W: Social and Information Network Analysis Lada Adamic

Session III: New ETSI Model on Wideband Speech and Noise Transmission Quality Phase II. STF Validation results

Politician Family Networks and Electoral Outcomes: Evidence from the Philippines. Online Appendix. Cesi Cruz, Julien Labonne, and Pablo Querubin

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

6.207/14.15: Networks Lectures 4, 5 & 6: Linear Dynamics, Markov Chains, Centralities

Peripheries of Cohesive Subsets

Centrality. Structural Importance of Nodes

Mining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University

Finding central nodes in large networks

Freeman (2005) - Graphic Techniques for Exploring Social Network Data

BIOL 4605/7220 CH 20.1 Correlation

Pseudocode for calculating Eigenfactor TM Score and Article Influence TM Score using data from Thomson-Reuters Journal Citations Reports

Research Article Analysis of Unweighted Amino Acids Network

1. Pearson linear correlation calculating testing multiple correlation. 2. Spearman rank correlation calculating testing 3. Other correlation measures

τ xy N c N d n(n 1) / 2 n +1 y = n 1 12 v n n(n 1) ρ xy Brad Thiessen

AXIOMS FOR CENTRALITY

Peripheries of cohesive subsets

Diffusion of information and social contagion

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Harmonic distances, centralities and systemic stability in heterogeneous interbank networks

Bonacich Measures as Equilibria in Network Models

Complex Social System, Elections. Introduction to Network Analysis 1

Kristina Lerman USC Information Sciences Institute

UNCORRECTED PROOF. A Graph-theoretic perspective on centrality. Stephen P. Borgatti a,, Martin G. Everett b,1. 1. Introduction

Measuring Segregation in Social Networks

Online Sampling of High Centrality Individuals in Social Networks

Nonparametric Independence Tests

Unit 14: Nonparametric Statistical Methods

AIR FORCE INSTITUTE OF TECHNOLOGY

Slide 7.1. Theme 7. Correlation

Graph Metrics and Dimension Reduction

Lecture 1: From Data to Graphs, Weighted Graphs and Graph Laplacian

Discrete-time Consensus Filters on Directed Switching Graphs

MATH 829: Introduction to Data Mining and Analysis Clustering II

Bargaining in Global Communication Networks

Lecture 1: Graphs, Adjacency Matrices, Graph Laplacian

Focus was on solving matrix inversion problems Now we look at other properties of matrices Useful when A represents a transformations.

Bivariate Paired Numerical Data

Provision of non-excludable public goods on networks: from equilibrium to centrality measures

Markov Chains and Spectral Clustering

Lecture 1 and 2: Introduction and Graph theory basics. Spring EE 194, Networked estimation and control (Prof. Khan) January 23, 2012

Lecture 10: Dimension Reduction Techniques

A note on top-k lists: average distance between two top-k lists

Móstoles, Spain. Keywords: complex networks, dual graph, line graph, line digraph.

Local and Consistent Centrality Measures in Networks 1

Bargaining, Information Networks and Interstate

The Social Climbing Game

Combinatorial Types of Tropical Eigenvector

Machine Learning for Data Science (CS4786) Lecture 11

Network Theory with Applications to Economics and Finance

Proof of Theorem 1. Tao Lei CSAIL,MIT. Here we give the proofs of Theorem 1 and other necessary lemmas or corollaries.

Spatial Autocorrelation (2) Spatial Weights

Lecture: Some Practical Considerations (3 of 4)

Copula modeling for discrete data

Fuzzy order-equivalence for similarity measures

Complexity Theory of Polynomial-Time Problems

Qualifying Examination Winter 2017

Spectral Graph Theory Tools. Analysis of Complex Networks

International Journal of Business, Humanities and Social Science Vol. 1, No.1, October 2012 THE HYBRID MODEL OF AGENT S POWER IN SOCIAL NETWORKS

DECISION MAKING SUPPORT AND EXPERT SYSTEMS

Proximity data visualization with h-plots

Correlation and Regression

Dimension Reduction and Low-dimensional Embedding

Class President: A Network Approach to Popularity. Due July 18, 2014

Dimension Reduction Techniques. Presented by Jie (Jerry) Yu

MSc / PhD Course Advanced Biostatistics. dr. P. Nazarov

18.312: Algebraic Combinatorics Lionel Levine. Lecture 19

Link Analysis Information Retrieval and Data Mining. Prof. Matteo Matteucci

Computing Trusted Authority Scores in Peer-to-Peer Web Search Networks

Overview of clustering analysis. Yuehua Cui

Link Analysis Ranking

An Algorithmist s Toolkit September 10, Lecture 1

Networks in the Real World

Transcription:

Centrality Measures Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Structural Analysis and Visualization of Networks Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 1 / 22

Graph-theoretic measures Which vertices are important? M.Grandjean, 2014 Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 2 / 22

Graph-theoretic measures The eccentricity ɛ(v) of a vertex v is the maximum distance between v and any other vertex u of the graph ɛ(v) = max u V d(u, v) Graph diameter is the maximum eccentricity d = max v V ɛ(v) Graph radius is the minimum eccentricity r = min v V ɛ(v). A point v is a central point of a graph if the eccentricity of the point equals the graph radius ɛ(v) = r from Eric Weisstein MathWorld Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 3 / 22

Graph-theoretic measures Graph center is a set of of vertices with graph eccentricity equal to the graph radius ɛ(v) = r - set of central points Graph periphery is a set of vertices that have graph eccentricities equal to the graph diameter ɛ(v) = d from Eric Weisstein MathWorld Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 4 / 22

Centrality Measures Sociology. Most "important"actors: actor location in the social network Actor centrality - involvement with other actors, many ties, source or recipient. Undirected network. Actor prestige - recipient (object) of many ties, ties directed to an actor. Directed network. In this lecture: undirected graphs, symmetric matrix A ij = A ji, A = A T Linton Freeman, 1979 Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 5 / 22

Three graphs Star graph Circle graph Line Graph Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 6 / 22

Degree centrality Degree centrality: number of nearest neighbours C D (i) = k(i) = j A ij = j A ji Normalized degree centrality C D (i) = 1 n 1 C D(i) High centrality degree -direct contact with many other actors Low degree - not active, peripheral actor Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 7 / 22

Closeness centrality Closeness centrality: how close an actor to all the other actors in network C C (i) = j 1 d(i, j) Normalized closeness centrality C C (i) = (n 1)C C (i) Actor in the center can quickly interact with all others, short communication path to others, minimal number of steps to reach others [*** Harmonic centrality = j 1 d(i,j) ***] Alex Bavelas, 1948 Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 8 / 22

Betweenness centrality Betweenness centrality: number of shortest paths going through the actor σ st (i) C B (i) = σ st (i) σ st Normalized betweenness centrality s t i C B (i) = 2 (n 1)(n 2) C B(i) Probability that a communication from s to t will go through i (geodesics) Linton Freeman, 1977 Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 9 / 22

Eigenvector centrality Importance of a node depends on the importance of its neighbors (recursive definition) v i j A ij v j v i = 1 A ij v j λ j Av = λv Select an eigenvector associated with largest eigenvalue λ = λ 1, v = v 1 Phillip Bonacich, 1972. Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 10 / 22

Centrality examples Closeness centrality from www.activenetworks.net Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 11 / 22

Centrality examples Betweenness centrality from www.activenetworks.net Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 12 / 22

Centrality examples Eigenvector centrality from www.activenetworks.net Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 13 / 22

Katz status index Weighted count of all paths coming to the node: the weight of path of length n is counted with attenuation factor β n, β < 1 λ 1 k i = β j A ij + β 2 j A 2 ij + β 3 j A 3 ij +... k = (βa + β 2 A 2 + β 3 A 3 +...)e = (β n A n )e = ( (βa) n I)e n=1 n=0 (βa) n = (I βa) 1 n=0 k = ((I βa) 1 I)e (I βa)k = βae Leo Katz, 1953 k = β(i βa) 1 Ae Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 14 / 22

Bonacich centrality Two-parametric centrality measure c(α, β) β - degree to which an individual status is a function of the statuses of those to whom he is connected (can be positive if connected to powerful and negative, if connected to powerless ) α - normalization parameter c i (α, β) = j (α + βc j )A ij c = αae + βac (I βa)c = αae c = α(i βa) 1 Ae α - found from normalization c 2 = c 2 i = 1 Phillip Bonacich, 1987 Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 15 / 22

Modified versions Katz centrality (Newman): x i = α j A ij x j + β i Alpha-centrality (Bonacich): x = αax + β x = (I αa) 1 β x i = α j A ij x j + 1 x = αax + e x = (I αa) 1 e Bonacich, 2001,Newman, 2010 Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 16 / 22

Centrality examples A) Degree centrality B) Closeness centrality C) Betweenness centrality D) Eigenvector centrality E) Katz centrality F) Alpha centrality from Claudio Rocchini Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 17 / 22

Centralization Centralization (network measure) - how central the most central node in the network in relation to all other nodes. C x = N i [C x (p ) C x (p i )] max N i [C x (p ) C x (p i )] C x - one of the centrality measures p - node with the largest centrality value max - is taken over all graphs with the same number of nodes (for degree, closeness and betweenness the most centralized structure is the star graph) Linton Freeman, 1979 Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 18 / 22

Prestige Prestige - measure of node importance in directed graphs Degree prestige k in (i) Proximity prestige (closeness) Status or Rank prestige (Katz, Bonacich) Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 19 / 22

Metrics comparison Pearson correlation coefficient r = n i=1 (X i X )(Y i Ȳ ) n i=1 (X n i X ) 2 i=1 (Y i Ȳ ) 2 Shows linear dependence between variables, 1 r 1 (perfect when related by linear function) Spearman rank correlation coefficient (Sperman s rho): Convert raw scores to ranks - sort by score: X i x i, Y i y i ρ = 1 6 n i=1 (x i y i ) 2 n(n 2 1) Shows strength of monotonic association (perfect for monotone increasing/decreasing relationship) Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 20 / 22

Ranking comparison The Kendall tau rank distance is a metric that counts the number of pairwise disagreements between two ranking lists Kendall rank correlation coefficient, commonly referred to as Kendall s tau coefficient τ = n c n d n(n 1)/2 n c - number of concordant pairs, n d - number of discordant pairs 1 τ 1, perfect agreement τ = 1, reversed τ = 1 Example Rank 1 A B C D E Rank 2 C D A B E τ = 6 4 5(5 1)/2 = 0.2 Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 21 / 22

References Centrality in Social Networks. Conceptual Clarification, Linton C. Freeman, Social Networks, 1, 215-239, 1979 Power and Centrality: A Family of Measures, Phillip Bonacich, The American Journal of Sociology, Vol. 92, No. 5, 1170-1182, 1987 A new status index derived from sociometric analysis, L. Katz, Psychometrika, 19, 39-43, 1953. Eigenvector-like measures of centrality for asymmetric relations, Phillip Bonacich, Paulette Lloyd, Social Networks 23, 191?201, 2001 Leonid E. Zhukov (HSE) Lecture 5 10.02.2015 22 / 22