Abstract

SELEE, TERESA MARGARET. Stochastic Matrices: Ergodicity Coefficients, and Applications to Ranking. (Under the direction of Ilse C. F. Ipsen.)

We present two different views of (row) stochastic matrices, which are nonnegative matrices with row sums equal to one. For applications to ranking, we examine the computation of a dominant left eigenvector of a stochastic matrix. The stochastic matrix of interest is called the Google matrix and contains information about how pages of the Internet are linked to one another. The dominant left eigenvector of the Google matrix yields a ranking for each Web page, which helps to determine the order in which search results are returned. These results are presented in Chapter 2. Chapter 3 presents results for coefficients of ergodicity, which measure the rate at which products of stochastic matrices, especially products whose number of factors is unbounded, converge to a matrix of rank one. Ergodicity arises in the context of Markov chains and signals the tendency of the rows of such products to equalize. We present unified notation and definitions for coefficients of ergodicity applied to stochastic matrices, extend the definitions to general complex matrices, and illustrate a connection between ergodicity coefficients and inclusion regions for eigenvalues.


Stochastic Matrices: Ergodicity Coefficients, and Applications to Ranking

by
Teresa Margaret Selee

A dissertation submitted to the Graduate Faculty of North Carolina State University in partial fulfillment of the requirements for the Degree of Doctor of Philosophy

Applied Mathematics

Raleigh, North Carolina
2009

APPROVED BY:
I. C. F. Ipsen, Chair of Advisory Committee
S. L. Campbell
C. T. Kelley
C. D. Meyer

Dedication

This work is dedicated to my family and friends. Without your love and support, this would not have been possible.

Biography

Teresa Selee was born with bright eyes and an adventurous spirit in the Cleveland suburb of Elyria, Ohio. She spent the first eighteen years of her life growing up in North Ridgeville, Ohio. She was always eager to learn, looked forward to school, and did well in most subjects. It wasn't until her senior year of high school that she realized her true interest in math. Upon graduation, she attended Youngstown State University on a full academic scholarship. After a year of studying math, she added a second major in economics. Academically she was growing, but she yearned for more. During her third year of college she traveled across the pond to spend a semester at Oxford Brookes University, in Oxford, England. Before returning to the U.S., she spent six weeks traveling in Europe, visiting more than ten countries. During her travels, she realized the world was a classroom, as she learned about art, history, culture, food, and so much more. Reluctantly, Teresa left Europe and returned to YSU for her senior year, earning high honors for her Bachelor's degrees in Mathematics and Economics. Unsure what to do with her degrees, Teresa worked for a few months before realizing how much she missed the challenging environment of the classroom. After being accepted to a PhD

program in mathematics at North Carolina State University, Teresa took off traveling again. She spent two months in Australia, driving around the country and visiting friends. She then spent another two months in Northern/Eastern Europe and Ireland, visiting more than ten countries. Since she started graduate school six years ago, Teresa has had the opportunities to travel for work and pleasure to Paris, Düsseldorf, Boston, New Orleans, San Diego, San Francisco, Livermore and Chicago, as well as three cross-country drives. Now that she has completed her degree, Teresa is moving to Atlanta to work as a Research Scientist at Georgia Tech Research Institute. She is looking forward to the adventures awaiting her in a new city, with a new job, and especially with her soon-to-be husband Steve Stanislav.

Acknowledgements

Thanks to all those involved in my academic life. To my high school mathematics teacher, Mr. Adam Baillie, who helped me to recognize my potential and develop my math skills. To Doug Faires, Nate Ritchey, and the rest of the Mathematics department at YSU, who were an endless source of enrichment, encouragement, and advice. You were supportive, and provided me so many opportunities that have helped me to develop into the person I am today. To Teresa Riley and Rochelle Ruffer, two amazing professors, without whom I would never have earned my Economics degree. To Tammy Kolda, Philip Kegelmeyer, and Josh Griffin, whose time and energy helped me to have a great experience as a summer intern at Sandia National Laboratory in Livermore, CA. Thank you for growing my interest in tensors, clustering, and data mining. To the faculty and students, past and present, of the student Numerical Analysis seminar. Thank you for being a willing audience for practice talks, and for the useful feedback.

To the administrative staff in the Math department at NCSU. You are the backbone of this department and the key to keeping everything moving smoothly. Thank you for everything you do. To my committee, Steve Campbell, Tim Kelley, and Carl Meyer, for your support, advice, and assistance, not only during my thesis writing, but throughout my career at NCSU. To my advisor, Ilse Ipsen, without whom this work would not be possible. Thank you for your time, ideas, feedback, and advice. Thank you also for the opportunities you afforded me to work, travel, and expand my mathematical horizons. Thanks to my family, past, present, and future. To my parents, who always supported me, and knew I could do this even when I had my doubts. Thank you for all the time and effort you've invested in me. To my sister, Jacquelyn, who in the last 25 years has gone from an annoyance to one of my best friends. Thank you for always being there. I am so thankful to have you as a friend. To my countless other family, grandparents, aunts, uncles, cousins, and more for always being the fun, crazy, loving people I call family. To my future in-laws and new family. Thank you for your support, and I am excited to become a part of your family.

Thanks to my friends. To my officemates: Kelly Dickson, Laura Ellwein, Morgan Root, and Julie Beier. I'm so happy to have met you and spent so many days with you. Without your daily support and encouragement, I never would have survived this experience. To my fellow students and friends who became my family in North Carolina, including Brian Adams, Steve and Lindsay May, Kelly Sweetingham, Ryan and Anna Hart Siskind, Iti and Tony Klein, Kristen Devault, Sarah Muccio, Laura Ellwein, Kelly Dickson, Julie Beier, Keri Kehoe, John Telliho, Corey Winton, Morgan and Lis Root, Ross and Anna Braymer, and many more. You have always been there for work and fun, school and play, whether it be working through a tricky math problem, studying for exams, preparing for presentations, going out for food and drinks, watching and playing sports, or leaving town for a weekend of fun. I am so thankful to have all of you in my life. To Amory Starkey, for always believing in me. You are such a wonderful person and I wish you every happiness for your future. To Mary Spann, who has been through many ups and downs with me for almost twenty years. Finally, thanks to my soon-to-be husband Steve Stanislav. You are everything in my world. Thank you for your love, support, friendship, advice, and the countless meals you've cooked for me. I cannot wait to spend the rest of my life with you.

Table of Contents

List of Figures
1 Introduction
2 PageRank Computation, with Special Attention to Dangling Nodes
  2.1 Introduction
  2.2 The ingredients
  2.3 Lumping
    2.3.1 Similarity transformation
    2.3.2 Expression for PageRank
    2.3.3 Algorithm
    2.3.4 Several dangling node vectors
  2.4 PageRanks of dangling versus nondangling nodes
  2.5 Only dangling nodes
3.1 Introduction to ergodicity coefficients
3.2 Definitions and Notation
3.3 Weak ergodicity
  General ergodicity coefficients for stochastic matrices
    First class of ergodicity coefficients
    Second class of ergodicity coefficients
  Specific ergodicity coefficients for stochastic matrices
    Ergodicity coefficients of stochastic matrices in the 1-norm
    Ergodicity coefficients of stochastic matrices in the ∞-norm
    Ergodicity coefficients of stochastic matrices in the p-norm
    Connections between τ_1(S) and τ_∞(S)
  Ergodicity coefficients for general matrices
    Relations between different p-norm ergodicity coefficients
    Explicit forms for τ_p(w, A) for specific values of p
  Ergodicity coefficients for matrices with special properties
    Constant row sum matrices
    Doubly stochastic matrices
    Nonnegative, irreducible matrices
    Ergodicity coefficients with the dominant left eigenvector
  Applications
    Condition Numbers
    Eigenvalue bounds
    Connection of τ_2 to Singular Values
    Connection of τ_2 to Lehmann Bounds
4 Conclusions and Future Work
  Contributions from Chapter 2
  Contributions and Future work for Chapter 3
References

List of Figures

Figure 2.1  A simple model of the link structure of the Web. The sphere ND represents the set of nondangling nodes, and D represents the set of dangling nodes. The submatrix H_11 represents all the links from nondangling nodes to nondangling nodes, while the submatrix H_12 represents links from nondangling to dangling nodes.

Figure 2.2  Sources of PageRank. Nondangling nodes receive their PageRank from v_1 and w_1, distributed through the links H_11. In contrast, the PageRank of the dangling nodes comes from v_2, w_2, and the PageRank of the nondangling nodes through the links H_12.

Figure 2.3  Sources of PageRank when w_1 = 0. The nondangling nodes receive their PageRank only from v_1. The dangling nodes, in contrast, receive their PageRank from v_2 and w_2, as well as from the PageRank of the nondangling nodes filtered through the links H_12.

Figure 2.4  Sources of PageRank when w_2 = 0. The dangling nodes receive their PageRank only from v_2, and from the PageRank of the nondangling nodes filtered through the links H_12.

Chapter 1

Introduction

Stochastic matrices have been studied in many contexts, in part because of their relation to Markov chain theory and the study of products of stochastic matrices. Additionally, stochastic matrices have many useful properties, such as a dominant eigenvalue of 1 and a dominant right eigenvector equal to the vector of all ones. In this work, we consider two different applications for stochastic matrices. Chapter 2 describes an application to ranking with respect to Google and the PageRank vector. The material in Chapter 2 was published in December 2007 in the SIAM Journal on Matrix Analysis and Applications [54]. Chapter 3 contains results on the coefficient of ergodicity, which can be thought of as a bound on the subdominant eigenvalues of a matrix.

For applications to ranking, we present a simple algorithm for computing the PageRank (stationary distribution) of the stochastic Google matrix G. The algorithm lumps all dangling nodes into a single node. We express lumping as a similarity transformation of G and show that the PageRank of the nondangling nodes can be computed

separately from that of the dangling nodes. The algorithm applies the power method only to the smaller lumped matrix, but the convergence rate is the same as that of the power method applied to the full matrix G. The efficiency of the algorithm increases as the number of dangling nodes increases. We also extend the expression for PageRank and the algorithm to more general Google matrices that have several different dangling node vectors, when it is required to distinguish among different classes of dangling nodes. We analyze the effect of the dangling node vector on the PageRank and show that the PageRank of the dangling nodes depends strongly on that of the nondangling nodes but not vice versa.

We also consider stochastic matrices and their relationship to ergodicity coefficients. These coefficients were originally introduced in the context of rates of convergence of finite, inhomogeneous Markov chains. In general, ergodicity deals with the long-term behavior of dynamical systems. Thus for systems of finite, inhomogeneous Markov chains, ergodicity refers to the long-term behavior of products of stochastic matrices. In this work we focus only on finite-dimensional matrices. For some information on infinite-dimensional stochastic matrices, see Isaacson and Madsen [57], Paz [80], Paz and Reichaw [82], and Rhodius [89]. An ergodicity coefficient τ_1(S) is defined for an n × n stochastic matrix S as

τ_1(S) = max { ‖S^T z‖_1 : ‖z‖_1 = 1, z^T e = 0 },

where the maximum is taken over all z ∈ R^n and e is a column vector of all ones. Notice that this form of an ergodicity coefficient is simply the norm of a matrix restricted to

a subspace. In addition to being continuous and bounded, τ_1(S) is submultiplicative, and bounds the magnitude of the second largest eigenvalue of S. Thus

τ_1(S_1 S_2) ≤ τ_1(S_1) τ_1(S_2)   and   |λ| ≤ τ_1(S) for all eigenvalues λ ≠ 1 of S,

for stochastic matrices S_1 and S_2. These two properties combine to yield an upper bound on the magnitude of subdominant eigenvalues of the product of stochastic matrices. The upper bound is computed as the product of the ergodicity coefficients of the individual matrices. That is, for any eigenvalue λ ≠ 1 of the stochastic matrix S_1 S_2,

|λ| ≤ τ_1(S_1 S_2) ≤ τ_1(S_1) τ_1(S_2).

This inequality gives an eigenvalue bound for products of stochastic matrices, and is useful since, in general, the eigenvalues of a product of matrices are not equal to the product of the eigenvalues of the matrices. Chapter 3 discusses coefficients of ergodicity from a linear algebra point of view. We choose to express these coefficients as matrix norms restricted to a subspace, although we mention other expressions as well. We present properties and explicit forms for various ergodicity coefficients, and simplify results and notation from the last 100 years, writing everything in language more familiar to the numerical linear algebra community. We conclude with new and existing applications, including condition numbers, eigenvalue and singular value bounds.
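As a concrete illustration of these properties, the following minimal Python sketch (all data and names are illustrative) evaluates τ_1 for small random stochastic matrices and checks the eigenvalue bound and the product bound numerically. It uses the standard characterization that the maximum in the definition of τ_1 is attained at vectors of the form (e_i − e_j)/2; that fact is assumed here rather than derived in the text.

import numpy as np

def tau_1(S):
    # tau_1(S) = max ||S^T z||_1 over ||z||_1 = 1, z^T e = 0, evaluated at the
    # extreme points z = (e_i - e_j)/2, i.e. (1/2) max_{i,j} sum_k |s_ik - s_jk|.
    n = S.shape[0]
    return max(0.5 * np.abs(S[i] - S[j]).sum() for i in range(n) for j in range(n))

def random_stochastic(n, rng):
    S = rng.random((n, n))
    return S / S.sum(axis=1, keepdims=True)        # rows sum to one

rng = np.random.default_rng(0)
S1, S2 = random_stochastic(5, rng), random_stochastic(5, rng)

# |lambda| <= tau_1(S1) for every eigenvalue lambda != 1 of the stochastic matrix S1
lam = np.sort(np.abs(np.linalg.eigvals(S1)))[::-1]
print(lam[1] <= tau_1(S1) + 1e-12)                 # True

# |lambda| <= tau_1(S1 S2) <= tau_1(S1) * tau_1(S2) by submultiplicativity
print(tau_1(S1 @ S2) <= tau_1(S1) * tau_1(S2) + 1e-12)   # True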

Chapter 2

PageRank Computation, with Special Attention to Dangling Nodes

2.1 Introduction

The order in which the search engine Google displays the Web pages is determined, to a large extent, by the PageRank vector [19, 77]. The PageRank vector contains, for every Web page, a ranking that reflects the importance of the Web page. Mathematically, the PageRank vector π is the stationary distribution of the so-called Google matrix, a sparse stochastic matrix whose dimension exceeds 11.5 billion [39]. The Google matrix

G is a convex combination of two stochastic matrices,

G = αS + (1 − α)E, 0 ≤ α < 1,

where the matrix S represents the link structure of the Web, and the primary purpose of the rank-one matrix E is to force uniqueness for π. In particular, element (i, j) of S is nonzero if Web page i contains a link pointing to Web page j. However, not all Web pages contain links to other pages. Image files or pdf files, and uncrawled or protected pages, have no links to other pages. These pages are called dangling nodes, and their number may exceed the number of nondangling pages [31, section 2]. The rows in the matrix S corresponding to dangling nodes would be zero if left untreated. Several ideas have been proposed to deal with the zero rows and force S to be stochastic [31]. The most popular approach adds artificial links to the dangling nodes, by replacing zero rows in the matrix with the same vector w, so that the matrix S is stochastic.

It is natural as well as efficient to exclude the dangling nodes with their artificial links from the PageRank computation. This can be done, for instance, by lumping all the dangling nodes into a single node [70]. In section 2.3, we provide a rigorous justification for lumping the dangling nodes in the Google matrix G, by expressing lumping as a similarity transformation of G (Theorem 2.3.1). We show that the PageRank of the nondangling nodes can be computed separately from that of the dangling nodes (Theorem 2.3.2), and we present an efficient algorithm for computing PageRank by applying the power method only to the much smaller, lumped matrix (section 2.3.3). Because the dangling nodes are excluded from most of the computations,

the operation count depends, to a large extent, on only the number of nondangling nodes, as opposed to the total number of Web pages. The algorithm has the same convergence rate as the power method applied to G, but is much faster because it operates on a much smaller matrix. The efficiency of the algorithm increases as the number of dangling nodes increases.

Many other algorithms have been proposed for computing PageRank, including classical iterative methods [2, 16, 68], Krylov subspace methods [34, 35], extrapolation methods [17, 18, 48, 61, 60], and aggregation/disaggregation methods [20, 55, 69]; see also the survey papers [8, 66] and the book [67]. Our algorithm is faster than the power method applied to the full Google matrix G, but retains all the advantages of the power method: It is simple to implement and requires minimal storage. Unlike Krylov subspace methods, our algorithm exhibits predictable convergence behavior and is insensitive to changes in the matrix [34]. Moreover, our algorithm should become more competitive as the Web frontier expands and the number of dangling nodes increases. The algorithms in [68, 70] are special cases of our algorithm because our algorithm allows the dangling node and personalization vectors to be different, and thereby facilitates the implementation of TrustRank [41]. TrustRank is designed to diminish the harm done by link spamming and was patented by Google in March 2005. Moreover, our algorithm can be extended to a more general Google matrix that contains several different dangling node vectors (section 2.3.4).

In section 2.4 we examine how the PageRanks of the dangling and nondangling nodes influence each other, as well as the effect of the dangling node vector w on the PageRanks of dangling and nondangling nodes. In particular we show (Theorem 2.4.1)

that the PageRanks of the dangling nodes depend strongly on the PageRanks of the nondangling nodes but not vice versa. Finally, in section 2.5, we consider a (theoretical) extreme case, where the Web consists solely of dangling nodes. We present a Jordan decomposition for general rank-one matrices (Theorems 2.5.1 and 2.5.2) and deduce from it a Jordan decomposition for a Google matrix of rank one (Corollary 2.5.3).

2.2 The ingredients

Let n be the number of Web pages and k the number of nondangling nodes among the Web pages, 1 ≤ k < n. We model the link structure of the Web by the n × n matrix

H ≡ [H_11  H_12; 0  0],

where the k × k matrix H_11 represents the links among the nondangling nodes, and H_12 represents the links from nondangling to dangling nodes; see Figure 2.1. The n − k zero rows in H are associated with the dangling nodes.

Figure 2.1: A simple model of the link structure of the Web. The sphere ND represents the set of nondangling nodes, and D represents the set of dangling nodes. The submatrix H_11 represents all the links from nondangling nodes to nondangling nodes, while the submatrix H_12 represents links from nondangling to dangling nodes.

The elements in the nonzero rows of H are nonnegative and sum to one,

H_11 ≥ 0, H_12 ≥ 0, H_11 e + H_12 e = e, where e ≡ [1, ..., 1]^T,

and the inequalities are to be interpreted elementwise. To obtain a stochastic matrix, we add artificial links to the dangling nodes. That is, we replace each zero row in H by the same dangling node vector

w = [w_1; w_2], w ≥ 0, ‖w‖ ≡ w^T e = 1.

Here w_1 is k × 1, w_2 is (n − k) × 1, ‖·‖ denotes the one norm (maximal column sum), and the superscript T denotes the transpose. The resulting matrix

S ≡ H + d w^T = [H_11  H_12; e w_1^T  e w_2^T], where d ≡ [0; e],

is stochastic, that is, S ≥ 0 and Se = e. Finally, so as to work with a stochastic matrix that has a unique stationary distribution, one selects a personalization vector

v = [v_1; v_2], v ≥ 0, ‖v‖ = 1,

where v_1 is k × 1 and v_2 is (n − k) × 1, and defines the Google matrix as the convex combination

G ≡ αS + (1 − α) e v^T, 0 ≤ α < 1.

Although the stochastic matrix G may not be primitive or irreducible, its eigenvalue 1 is distinct and the magnitude of all other eigenvalues is bounded by α [32, 47, 60, 61, 107]. Therefore G has a unique stationary distribution,

π^T G = π^T, π ≥ 0, ‖π‖ = 1.

The stationary distribution π is called PageRank. Element i of π represents the PageRank for Web page i. If we partition the PageRank conformally with G,

π = [π_1; π_2],

then π_1 represents the PageRank associated with the nondangling nodes and π_2 represents the PageRank of the dangling nodes. The identity matrix of order n will be denoted by I_n ≡ [e_1 ... e_n], or simply by I.

2.3 Lumping

We show that lumping can be viewed as a similarity transformation of the Google matrix; we derive an expression for PageRank in terms of the stationary distribution

of the lumped matrix; we present an algorithm for computing PageRank that is based on lumping; and we extend everything to a Google matrix that has several different dangling node vectors, when it is required to distinguish among different classes of dangling nodes.

It was observed in [70] that the Google matrix represents a lumpable Markov chain. The concept of lumping was originally introduced for general Markov matrices, to speed up the computation of the stationary distribution or to obtain bounds [25, 40, 59, 62]. Below we paraphrase lumpability [62, Theorem 6.3.2] in matrix terms: Let P be a permutation matrix and

P M P^T = [M_11 ... M_1,k+1; ... ; M_k+1,1 ... M_k+1,k+1]

be a partition of a stochastic matrix M. Then M is lumpable with respect to this partition if each vector M_ij e is a multiple of the all-ones vector e, i ≠ j, 1 ≤ i, j ≤ k + 1. The Google matrix G is lumpable if all dangling nodes are lumped into a single node [70, Proposition 1].

We condense the notation in section 2.2 and write the Google matrix as

G = [G_11  G_12; e u_1^T  e u_2^T], where u = [u_1; u_2] ≡ αw + (1 − α)v,   (2.1)

G_11 is k × k, and G_12 is k × (n − k). Here element (i, j) of G_11 corresponds to block M_ij, 1 ≤ i, j ≤ k; row i of G_12 corresponds to block M_i,k+1, 1 ≤ i ≤ k; column i of e u_1^T

corresponds to M_k+1,i, 1 ≤ i ≤ k; and e u_2^T corresponds to M_k+1,k+1.

2.3.1 Similarity transformation

We show that lumping the dangling nodes in the Google matrix can be accomplished by a similarity transformation that reduces G to block upper triangular form.

Theorem 2.3.1. With the notation in section 2.2 and the matrix G as partitioned in (2.1), let

X ≡ [I_k  0; 0  L], where L ≡ I_{n−k} − (1/(n−k)) ê e^T and ê ≡ e − e_1 = [0, 1, ..., 1]^T.

Then

X G X^{-1} = [G^(1)  *; 0  0], where G^(1) ≡ [G_11  G_12 e; u_1^T  u_2^T e].

The matrix G^(1) is stochastic of order k + 1 with the same nonzero eigenvalues as G.

Proof. From

X^{-1} = [I_k  0; 0  L^{-1}], L^{-1} = I_{n−k} + ê e^T,

it follows that

X G X^{-1} = [G_11  G_12 (I + ê e^T); e_1 u_1^T  e_1 u_2^T (I + ê e^T)]

has the same eigenvalues as G. In order to reveal the eigenvalues, we choose a different partitioning, separate the leading k + 1 rows and columns, and observe that

G_12 (I + ê e^T) e_1 = G_12 e, u_2^T (I + ê e^T) e_1 = u_2^T e

to obtain the block triangular matrix

X G X^{-1} = [G^(1)  *; 0  0]

with at least n − k − 1 zero eigenvalues.

2.3.2 Expression for PageRank

We give an expression for the PageRank π in terms of the stationary distribution σ of the small matrix G^(1).

Theorem 2.3.2. With the notation in section 2.2 and the matrix G as partitioned in (2.1), let

σ^T [G_11  G_12 e; u_1^T  u_2^T e] = σ^T, σ ≥ 0, ‖σ‖ = 1,

and partition σ^T = [σ_{1:k}^T  σ_{k+1}], where σ_{k+1} is a scalar. Then the PageRank equals

π^T = [σ_{1:k}^T   σ^T [G_12; u_2^T]].

Proof. As in the proof of Theorem 2.3.1, we write

X G X^{-1} = [G^(1)  G^(2); 0  0], where G^(2) ≡ [G_12; u_2^T] (I + ê e^T) [e_2 ... e_{n−k}].

The vector [σ^T  σ^T G^(2)] is an eigenvector for X G X^{-1} associated with the eigenvalue λ = 1. Hence

π̂^T ≡ [σ^T  σ^T G^(2)] X

is an eigenvector of G associated with λ = 1 and a multiple of the stationary distribution π of G. Since G^(1) has the same nonzero eigenvalues as G, and the dominant eigenvalue 1 of G is distinct [32, 47, 60, 61, 107], the stationary distribution σ of G^(1) is unique.

Next we express π̂ in terms of quantities in the matrix G. We return to the original partitioning, which separates the leading k elements,

π̂^T = [σ_{1:k}^T  (σ_{k+1}  σ^T G^(2))] [I_k  0; 0  L].

Multiplying out,

π̂^T = [σ_{1:k}^T  (σ_{k+1}  σ^T G^(2)) L]

shows that π̂ has the same leading k elements as σ.

We now examine the trailing n − k components of π̂^T. To this end we partition the matrix L = I_{n−k} − (1/(n−k)) ê e^T and distinguish the first row and column,

L = [1  0; −(1/(n−k)) e   I − (1/(n−k)) e e^T].

Then the eigenvector part associated with the dangling nodes is

z^T ≡ [σ_{k+1}  σ^T G^(2)] L = [σ_{k+1} − (1/(n−k)) σ^T G^(2) e    σ^T G^(2) (I − (1/(n−k)) e e^T)].

To remove the terms containing G^(2) in z, we simplify

(I + ê e^T) [e_2 ... e_{n−k}] e = (I + ê e^T) ê = (n − k) ê.

Hence

G^(2) e = (n − k) [G_12; u_2^T] ê   (2.2)

and

(1/(n−k)) σ^T G^(2) e = σ^T [G_12; u_2^T] ê = σ^T [G_12; u_2^T] e − σ^T [G_12; u_2^T] e_1 = σ_{k+1} − σ^T [G_12; u_2^T] e_1,

where we used ê = e − e_1, and the fact that σ is the stationary distribution of G^(1), so

σ_{k+1} = σ^T [G_12; u_2^T] e.

Therefore the leading element of z equals

z_1 = σ_{k+1} − (1/(n−k)) σ^T G^(2) e = σ^T [G_12; u_2^T] e_1.

For the remaining elements of z, we use (2.2) to simplify

G^(2) (I − (1/(n−k)) e e^T) = G^(2) − (1/(n−k)) G^(2) e e^T = G^(2) − [G_12; u_2^T] ê e^T.

Replacing

(I + ê e^T) [e_2 ... e_{n−k}] = [e_2 ... e_{n−k}] + ê e^T

in G^(2) yields

z_{2:n−k}^T = σ^T G^(2) (I − (1/(n−k)) e e^T) = σ^T [G_12; u_2^T] [e_2 ... e_{n−k}].

Therefore the eigenvector part associated with the dangling nodes is

z^T = [z_1  z_{2:n−k}^T] = σ^T [G_12; u_2^T],

and

π̂^T = [σ_{1:k}^T   σ^T [G_12; u_2^T]].

Since π is unique, as discussed in section 2.2, we conclude that π̂ = π if π̂^T e = 1. This follows, again, from the fact that σ is the stationary distribution of G^(1) and σ^T [G_12; u_2^T] e = σ_{k+1}.

2.3.3 Algorithm

We present an algorithm, based on Theorem 2.3.2, for computing the PageRank π from the stationary distribution σ of the lumped matrix

G^(1) ≡ [G_11  G_12 e; u_1^T  u_2^T e].

The input to the algorithm consists of the nonzero elements of the hyperlink matrix H, the personalization vector v, the dangling node vector w, and the amplification factor α. The output of the algorithm is an approximation π̂ to the PageRank π, which is computed from an approximation σ̂ of σ.

Algorithm 2.3.3.
% Inputs: H, v, w, α.  Output: π̂
% Power method applied to G^(1):
Choose a starting vector σ̂^T = [σ̂_{1:k}^T  σ̂_{k+1}] with σ̂ ≥ 0, ‖σ̂‖ = 1.
While not converged
    σ̂_{1:k}^T = α σ̂_{1:k}^T H_11 + (1 − α) v_1^T + α σ̂_{k+1} w_1^T
    σ̂_{k+1} = 1 − σ̂_{1:k}^T e
end while
% Recover PageRank:
π̂^T = [σ̂_{1:k}^T   α σ̂_{1:k}^T H_12 + (1 − α) v_2^T + α σ̂_{k+1} w_2^T]

Each iteration of the power method applied to G^(1) involves a sparse matrix vector multiply with the k × k matrix H_11, as well as several vector operations. Thus the dangling nodes are excluded from the power method computation. The convergence rate of the power method applied to G is α [56]. Algorithm 2.3.3 has the same convergence rate, because G^(1) has the same nonzero eigenvalues as G (see Theorem 2.3.1), but is much faster because it operates on a smaller matrix whose dimension does not depend on the number of dangling nodes. The final step in Algorithm 2.3.3 recovers π via a single sparse matrix vector multiply with the k × (n − k) matrix H_12, as well as several vector operations.
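As a sketch of how Algorithm 2.3.3 can be realized, the following Python code runs the lumped power method on small, dense random data and compares the recovered PageRank with the power method applied to the full matrix G. All sizes and data are assumed for illustration; in practice H_11 and H_12 would be stored as sparse matrices.

import numpy as np

rng = np.random.default_rng(1)
n, k, alpha = 8, 5, 0.85                          # n pages, k nondangling (assumed)
H11, H12 = rng.random((k, k)), rng.random((k, n - k))
row = np.hstack([H11, H12]).sum(axis=1, keepdims=True)
H11, H12 = H11 / row, H12 / row                   # nonzero rows of H sum to one
v = rng.random(n); v /= v.sum()                   # personalization vector
w = rng.random(n); w /= w.sum()                   # dangling node vector
v1, v2, w1, w2 = v[:k], v[k:], w[:k], w[k:]

# Power method applied to the lumped matrix G^(1)
sigma = np.full(k + 1, 1.0 / (k + 1))
for _ in range(200):
    s1 = alpha * (sigma[:k] @ H11) + (1 - alpha) * v1 + alpha * sigma[k] * w1
    sigma = np.append(s1, 1.0 - s1.sum())

# Recover PageRank
pi_hat = np.concatenate([sigma[:k],
                         alpha * (sigma[:k] @ H12) + (1 - alpha) * v2
                         + alpha * sigma[k] * w2])

# Reference: power method on the full matrix G = alpha (H + d w^T) + (1 - alpha) e v^T
H = np.vstack([np.hstack([H11, H12]), np.zeros((n - k, n))])
d = np.concatenate([np.zeros(k), np.ones(n - k)])
G = alpha * (H + np.outer(d, w)) + (1 - alpha) * np.outer(np.ones(n), v)
x = np.full(n, 1.0 / n)
for _ in range(200):
    x = x @ G
print(np.allclose(pi_hat, x))                     # True, up to iteration error

Note that the iteration only touches the k × k block H_11; the dangling part enters through the single scalar σ̂_{k+1}, which is why the cost per step is governed by the number of nondangling nodes.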

Algorithm 2.3.3 is significantly faster than the power method applied to the full Google matrix G, but it retains all advantages of the power method: It is simple to implement and requires minimal storage. Unlike Krylov subspace methods, Algorithm 2.3.3 exhibits predictable convergence behavior and is insensitive to changes in the matrix [34]. The methods in [68, 70] are special cases of Algorithm 2.3.3 because Algorithm 2.3.3 allows the dangling node vector to be different from the personalization vector, thereby facilitating the implementation of TrustRank [41]. TrustRank allows zero elements in the personalization vector v in order to diminish the harm done by link spamming. Algorithm 2.3.3 can also be extended to the situation in which the Google matrix has several different dangling node vectors; see section 2.3.4.

The power method in Algorithm 2.3.3 corresponds to Stage 1 of the algorithm in [70]. However, Stage 2 of that algorithm involves the power method on a rank-two matrix of order n − k + 1. In contrast, Algorithm 2.3.3 simply performs a single matrix vector multiply with the k × (n − k) matrix H_12. There is no proof that the two-stage algorithm in [70] does compute the PageRank.

2.3.4 Several dangling node vectors

So far we have treated all dangling nodes in the same way, by assigning them the same dangling node vector w. However, one dangling node vector may be inadequate for an advanced Web search. For instance, one may want to distinguish different types of dangling node pages based on their functions (e.g., text files, image files, videos, etc.); or one may want to personalize a Web search and assign different vectors to dangling node pages pertaining to different topics, different languages, or different domains; see

the discussion in [70, section 8.2].

To facilitate such a model for an advanced Web search, we extend the single class of dangling nodes to m ≥ 1 different classes, by assigning a different dangling node vector w_i to each class, 1 ≤ i ≤ m. As a consequence we need to extend lumping to a more general Google matrix that is obtained by replacing the n − k zero rows in the hyperlink matrix H by m ≥ 1 possibly different dangling node vectors w_1, ..., w_m. The more general Google matrix, with blocks of sizes k, k_1, ..., k_m, is

F ≡ [F_11  F_12  ...  F_1,m+1; e u_11^T  e u_12^T  ...  e u_1,m+1^T; ... ; e u_m1^T  e u_m2^T  ...  e u_m,m+1^T],

where

u_i ≡ [u_i1; ...; u_i,m+1] ≡ α w_i + (1 − α) v.

Let π be the PageRank associated with F,

π^T F = π^T, π ≥ 0, ‖π‖ = 1.
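As a small illustration of this structure, the following sketch (assumed block sizes and random data; all names are illustrative) assembles F for m = 2 dangling-node classes directly from the definition above and confirms that F is row stochastic, with every row in dangling class i equal to u_i^T = (α w_i + (1 − α) v)^T.

import numpy as np

rng = np.random.default_rng(2)
k, k1, k2, alpha = 4, 3, 2, 0.85                  # assumed block sizes
n = k + k1 + k2
Hnd = rng.random((k, n)); Hnd /= Hnd.sum(axis=1, keepdims=True)  # nondangling rows of H
v = rng.random(n); v /= v.sum()                   # personalization vector
w1 = rng.random(n); w1 /= w1.sum()                # dangling node vector, class 1
w2 = rng.random(n); w2 /= w2.sum()                # dangling node vector, class 2
u1, u2 = alpha * w1 + (1 - alpha) * v, alpha * w2 + (1 - alpha) * v

F = np.vstack([alpha * Hnd + (1 - alpha) * np.outer(np.ones(k), v),  # blocks F_1j
               np.tile(u1, (k1, 1)),              # class-1 dangling rows: e u_1^T
               np.tile(u2, (k2, 1))])             # class-2 dangling rows: e u_2^T
print(np.allclose(F @ np.ones(n), 1.0))           # F is row stochastic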

We explain our approach for the case where F has two types of dangling nodes,

F = [F_11  F_12  F_13; e u_11^T  e u_12^T  e u_13^T; e u_21^T  e u_22^T  e u_23^T],

with blocks of sizes k, k_1, and k_2. We perform the lumping by a sequence of similarity transformations that starts at the bottom of the matrix. The first similarity transformation lumps the dangling nodes represented by u_2 and leaves the leading block of order k + k_1 unchanged,

X_1 ≡ [I  0; 0  L_1],

where the leading identity block has order k + k_1 and L_1 lumps the k_2 trailing rows and columns of F,

L_1 ≡ I − (1/k_2) ê e^T, L_1^{-1} ≡ I + ê e^T, ê = e − e_1 = [0, 1, ..., 1]^T.

Applying the similarity transformation to F gives

X_1 F X_1^{-1} = [F_11  F_12  F_13 e  F̃_13; e u_11^T  e u_12^T  (u_13^T e) e  e ũ_13^T; u_21^T  u_22^T  u_23^T e  ũ_23^T; 0  0  0  0],

with blocks of sizes k, k_1, 1, and k_2 − 1, and with

F̃_13 ≡ F_13 L_1^{-1} [e_2 ... e_{k_2}], ũ_j3^T ≡ u_j3^T L_1^{-1} [e_2 ... e_{k_2}], j = 1, 2.

The leading diagonal block of order k + k_1 + 1 is a stochastic matrix with the same nonzero eigenvalues as F. Before applying the second similarity transformation that lumps the dangling nodes represented by u_1, we move the rows with u_1 (and corresponding columns) to the bottom of the nonzero matrix, merely to keep the notation simple. The move is accomplished by the permutation matrix

P_1 ≡ [e_1 ... e_k  e_{k+k_1+1}  e_{k+1} ... e_{k+k_1}  e_{k+k_1+2} ... e_n].

The symmetrically permuted matrix

P_1 X_1 F X_1^{-1} P_1^T = [F_11  F_13 e  F_12  F̃_13; u_21^T  u_23^T e  u_22^T  ũ_23^T; e u_11^T  (u_13^T e) e  e u_12^T  e ũ_13^T; 0  0  0  0]

retains a leading diagonal block that is stochastic. Now we repeat the lumping on dangling nodes represented by u_1, by means of the similarity transformation

X_2 ≡ [I  0  0; 0  L_2  0; 0  0  I],

with diagonal blocks of orders k + 1, k_1, and k_2 − 1, where L_2 lumps the trailing k_1 nonzero rows,

L_2 ≡ I − (1/k_1) ê e^T, L_2^{-1} ≡ I + ê e^T.

The similarity transformation produces the lumped matrix

X_2 P_1 X_1 F X_1^{-1} P_1^T X_2^{-1} = [F_11  F_13 e  F_12 e  F̃_12  F̃_13; u_21^T  u_23^T e  u_22^T e  ũ_22^T  ũ_23^T; u_11^T  u_13^T e  u_12^T e  ũ_12^T  ũ_13^T; 0  0  0  0  0],

with blocks of sizes k, 1, 1, k_1 − 1, and k_2 − 1. Finally, for notational purposes, we restore the original ordering of dangling nodes by permuting rows and columns k + 1 and k + 2,

P_2 ≡ [e_1 ... e_k  e_{k+2}  e_{k+1}  e_{k+3} ... e_n].

The final lumped matrix is

P_2 X_2 P_1 X_1 F X_1^{-1} P_1^T X_2^{-1} P_2^T = [F^(1)  *; 0  0], where F^(1) ≡ [F_11  F_12 e  F_13 e; u_11^T  u_12^T e  u_13^T e; u_21^T  u_22^T e  u_23^T e].

The above discussion for m = 2 illustrates how to extend Theorems 2.3.1 and 2.3.2 to any number m of dangling node vectors.

Theorem. Define

X_i ≡ [I  0  0; 0  L_i  0; 0  0  I],

with diagonal blocks of orders k + (i − 1) + Σ_{j=1}^{m−i} k_j, k_{m−i+1}, and 1 − i + Σ_{j=m−i+2}^{m} k_j, and

P_i ≡ [e_1 ... e_k  e_{r+i}  e_{k+1} ... e_{r+i−1}  e_{r+i+1} ... e_n], r = k + Σ_{j=1}^{m−i} k_j.

Then

P_m X_m P_{m−1} X_{m−1} ... P_1 X_1 F X_1^{-1} P_1^T ... X_m^{-1} P_m^T = [F^(1)  *; 0  0],

where the lumped matrix

F^(1) ≡ [F_11  F_12 e  ...  F_1,m+1 e; u_11^T  u_12^T e  ...  u_1,m+1^T e; ... ; u_m1^T  u_m2^T e  ...  u_m,m+1^T e]

is stochastic of order k + m with the same nonzero eigenvalues as F.

Theorem. Let ρ be the stationary distribution of the lumped matrix

F^(1) ≡ [F_11  F_12 e  ...  F_1,m+1 e; u_11^T  u_12^T e  ...  u_1,m+1^T e; ... ; u_m1^T  u_m2^T e  ...  u_m,m+1^T e];   (2.3)

that is, ρ^T F^(1) = ρ^T, ρ ≥ 0, ‖ρ‖ = 1. With the partition ρ^T = [ρ_{1:k}^T  ρ_{k+1:k+m}^T], where ρ_{k+1:k+m} is m × 1, the PageRank of F equals

π^T = [ρ_{1:k}^T   ρ^T [F_12  ...  F_1,m+1; u_12^T  ...  u_1,m+1^T; ... ; u_m2^T  ...  u_m,m+1^T]].

2.4 PageRanks of dangling versus nondangling nodes

We examine how the PageRanks of dangling and nondangling nodes influence each other, as well as the effect of the dangling node vector on the PageRanks. From Theorem 2.3.2 and Algorithm 2.3.3, we see that the PageRank π_1 of the nondangling nodes can be computed separately from the PageRank π_2 of the dangling nodes, and that π_2 depends directly on π_1. The expressions below make this even clearer.

Theorem 2.4.1. With the notation in section 2.2,

π_1^T = ((1 − α) v_1^T + ρ w_1^T) (I − α H_11)^{-1},
π_2^T = α π_1^T H_12 + (1 − α) v_2^T + α (1 − ‖π_1‖) w_2^T,

where

ρ ≡ α (1 − (1 − α) v_1^T (I − α H_11)^{-1} e) / (1 + α w_1^T (I − α H_11)^{-1} e) ≥ 0.

Proof. Rather than using Theorem 2.3.2, we found it easier just to start from scratch. From G = α(H + d w^T) + (1 − α) e v^T and the fact that π^T e = 1, it follows that π is the solution to the linear system

π^T = (1 − α) v^T (I − αH − α d w^T)^{-1},

whose coefficient matrix is a strictly row diagonally dominant M-matrix [2, equation (5)], [16, equation (2), Proposition 2.4]. Since R ≡ I − αH is also an M-matrix, it is nonsingular, and the elements of R^{-1} are nonnegative [9, section 6]. The Sherman-Morrison formula [36, section 2.1.3] implies that

(R − α d w^T)^{-1} = R^{-1} + α R^{-1} d w^T R^{-1} / (1 − α w^T R^{-1} d).

Substituting this into the expression for π gives

π^T = (1 − α) v^T R^{-1} + (α (1 − α) v^T R^{-1} d / (1 − α w^T R^{-1} d)) w^T R^{-1}.   (2.4)

We now show that the denominator 1 − α w^T R^{-1} d > 0. Using the partition

R^{-1} = (I − αH)^{-1} = [(I − α H_11)^{-1}   α (I − α H_11)^{-1} H_12; 0  I]

yields

1 − α w^T R^{-1} d = 1 − α (α w_1^T (I − α H_11)^{-1} H_12 e + w_2^T e).   (2.5)

Rewrite the term involving H_12 by observing that H_11 e + H_12 e = e and that I − α H_11 is an M-matrix, so

0 ≤ α (I − α H_11)^{-1} H_12 e = e − (1 − α)(I − α H_11)^{-1} e.   (2.6)

Substituting this into (2.5) and using 1 = w^T e = w_1^T e + w_2^T e shows that the denominator in the Sherman-Morrison formula is positive,

1 − α w^T R^{-1} d = (1 − α)(1 + α w_1^T (I − α H_11)^{-1} e) > 0.

Furthermore, 0 ≤ α < 1 implies 1 − α w^T R^{-1} d > 1 − α. Substituting the simplified denominator into the expression (2.4) for π yields

π^T = (1 − α) v^T R^{-1} + α (v^T R^{-1} d / (1 + α w_1^T (I − α H_11)^{-1} e)) w^T R^{-1}.

We obtain for π_1

π_1^T = ((1 − α) v_1^T + α (v^T R^{-1} d / (1 + α w_1^T (I − α H_11)^{-1} e)) w_1^T) (I − α H_11)^{-1}.

Combining the partitioning of R^{-1}, (2.6), and v_1^T e + v_2^T e = 1 gives

v^T R^{-1} d = [v_1^T  v_2^T] [(I − α H_11)^{-1}   α (I − α H_11)^{-1} H_12; 0  I] [0; e] = α v_1^T (I − α H_11)^{-1} H_12 e + v_2^T e = 1 − (1 − α) v_1^T (I − α H_11)^{-1} e.

Hence π_1^T = ((1 − α) v_1^T + ρ w_1^T) (I − α H_11)^{-1} with ρ ≥ 0. To obtain the expression for π_2, observe that the second block element in π^T (I − αH − α d w^T) = (1 − α) v^T equals

−α π_1^T H_12 + π_2^T − α (π_2^T e) w_2^T = (1 − α) v_2^T.

The result follows from π_1^T e + π_2^T e = 1.

Remark. We draw the following conclusions from Theorem 2.4.1 with regard to how dangling and nondangling nodes accumulate PageRank; see Figure 2.2.

The PageRank π_1 of the nondangling nodes does not depend on the connectivity among the dangling nodes (elements of w_2), the personalization vector for the

dangling nodes (elements of v_2), or the links from nondangling to dangling nodes (elements of H_12). To be specific, π_1 does not depend on individual elements of w_2, v_2, and H_12. Rather, the dependence is on the norms, through ‖v_2‖ = 1 − ‖v_1‖, ‖w_2‖ = 1 − ‖w_1‖, and H_12 e = e − H_11 e. The PageRank π_1 of the nondangling nodes does not depend on the PageRank π_2 of the dangling nodes or their number, because π_1 can be computed without knowledge of π_2. The nondangling nodes receive their PageRank π_1 from their personalization vector v_1 and the dangling node vector w_1, both of which are distributed through the links H_11. The dangling nodes receive their PageRank π_2 from three sources: the associated part v_2 of the personalization vector; the associated part w_2 of the dangling node vector; and the PageRank π_1 of the nondangling nodes filtered through the connecting links H_12.

Figure 2.2: Sources of PageRank. Nondangling nodes receive their PageRank from v_1 and w_1, distributed through the links H_11. In contrast, the PageRank of the dangling nodes comes from v_2, w_2, and the PageRank of the nondangling nodes through the links H_12.

The links H_12 determine how much PageRank flows from nondangling to dangling nodes. The influence of the associated dangling node vector w_2 on the PageRank π_2 of the dangling nodes diminishes as the combined PageRank ‖π_1‖ of the nondangling nodes increases.

Taking norms in Theorem 2.4.1 gives a bound on the combined PageRank of the nondangling nodes. As in section 2.2, the norm is ‖z‖ ≡ z^T e for z ≥ 0.

Corollary. With the assumptions of Theorem 2.4.1,

‖π_1‖ = ((1 − α) ‖v_1‖_H + α ‖w_1‖_H) / (1 + α ‖w_1‖_H),

where ‖z‖_H ≡ z^T (I − α H_11)^{-1} e for any z ≥ 0, and (1 − α) ‖z‖ ≤ ‖z‖_H ≤ (1/(1 − α)) ‖z‖.

Proof. Since (I − α H_11)^{-1} is nonsingular with nonnegative elements, ‖·‖_H is a norm. Let ‖·‖_∞ be the infinity norm (maximal row sum). Then the Hölder inequality [36, section 2.2.2] implies for any z ≥ 0,

‖z‖_H ≤ ‖z‖ ‖(I − α H_11)^{-1}‖_∞ ≤ (1/(1 − α)) ‖z‖.

As for the lower bound,

‖z‖_H ≥ ‖z‖ − α z^T H_11 e ≥ (1 − α) ‖z‖.
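The following sketch (assumed small random data; names are illustrative) checks the closed-form expression for π_1 in Theorem 2.4.1 and the norm formula of the corollary against the PageRank obtained from the power method on the full matrix G.

import numpy as np

rng = np.random.default_rng(3)
n, k, alpha = 8, 5, 0.85
H11, H12 = rng.random((k, k)), rng.random((k, n - k))
row = np.hstack([H11, H12]).sum(axis=1, keepdims=True)
H11, H12 = H11 / row, H12 / row
v = rng.random(n); v /= v.sum(); w = rng.random(n); w /= w.sum()
v1, v2, w1, w2 = v[:k], v[k:], w[:k], w[k:]

# Reference PageRank from the power method on the full G
H = np.vstack([np.hstack([H11, H12]), np.zeros((n - k, n))])
d = np.concatenate([np.zeros(k), np.ones(n - k)])
G = alpha * (H + np.outer(d, w)) + (1 - alpha) * np.outer(np.ones(n), v)
pi = np.full(n, 1.0 / n)
for _ in range(300):
    pi = pi @ G

# Theorem 2.4.1: pi_1^T = ((1-alpha) v_1^T + rho w_1^T)(I - alpha H_11)^{-1}
M = np.eye(k) - alpha * H11
Minv_e = np.linalg.solve(M, np.ones(k))           # (I - alpha H_11)^{-1} e
rho = alpha * (1 - (1 - alpha) * (v1 @ Minv_e)) / (1 + alpha * (w1 @ Minv_e))
pi1 = np.linalg.solve(M.T, (1 - alpha) * v1 + rho * w1)
print(np.allclose(pi1, pi[:k]))                   # True

# Corollary: ||pi_1|| = ((1-alpha)||v_1||_H + alpha ||w_1||_H) / (1 + alpha ||w_1||_H)
nv, nw = v1 @ Minv_e, w1 @ Minv_e                 # ||z||_H = z^T (I - alpha H_11)^{-1} e
print(np.isclose(pi1.sum(), ((1 - alpha) * nv + alpha * nw) / (1 + alpha * nw)))  # True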

Figure 2.3: Sources of PageRank when w_1 = 0. The nondangling nodes receive their PageRank only from v_1. The dangling nodes, in contrast, receive their PageRank from v_2 and w_2, as well as from the PageRank of the nondangling nodes filtered through the links H_12.

The corollary implies that the combined PageRank ‖π_1‖ of the nondangling nodes is an increasing function of ‖w_1‖. In particular, when w_1 = 0, the combined PageRank ‖π_1‖ is minimal among all w, and the dangling node vector w_2 has a stronger influence on the PageRank π_2 of the dangling nodes. The dangling nodes act like a sink and absorb more PageRank because there are no links back to the nondangling nodes; see Figure 2.3. When w_1 = 0 we get

π_1^T = (1 − α) v_1^T (I − α H_11)^{-1},   (2.8)
π_2^T = α π_1^T H_12 + (1 − α) v_2^T + α (1 − ‖π_1‖) w_2^T.

In the other extreme case, when w_2 = 0, the dangling nodes are not connected to each other; see Figure 2.4:

π_1^T = ((1 − α) v_1^T + ρ w_1^T) (I − α H_11)^{-1},   (2.9)
π_2^T = α π_1^T H_12 + (1 − α) v_2^T.

In this case the PageRank π_1 of the nondangling nodes has only a positive influence on the PageRank of the dangling nodes.

Figure 2.4: Sources of PageRank when w_2 = 0. The dangling nodes receive their PageRank only from v_2, and from the PageRank of the nondangling nodes filtered through the links H_12.

An expression for π when dangling node and personalization vectors are the same, i.e., w = v, was given in [26]:

π^T = (1 − α) (1 + α v^T R^{-1} d / (1 − α v^T R^{-1} d)) v^T R^{-1}, where R ≡ I − αH.

In this case the PageRank vector π is a multiple of the vector v^T (I − αH)^{-1}.

2.5 Only dangling nodes

We examine the (theoretical) extreme case when all Web pages are dangling nodes. In this case the matrices S and G have rank one. We first derive a Jordan decomposition for general matrices of rank one, before we present a Jordan form for a Google matrix of rank one. We start with rank-one matrices that are diagonalizable. The vector e_j denotes the jth column of the identity matrix I.

Theorem 2.5.1 (eigenvalue decomposition). Let A = y z^T ≠ 0 be a real square matrix with λ ≡ z^T y ≠ 0. If z has an element z_j ≠ 0, then X^{-1} A X = λ e_j e_j^T, where

X ≡ I + y e_j^T − (1/z_j) e_j z^T, X^{-1} = I − e_j e_j^T − (1/λ) y z^T + ((1 + y_j)/λ) e_j z^T.

Proof. The matrix A has a repeated eigenvalue zero and a distinct nonzero eigenvalue λ with right eigenvector y and left eigenvector z. From

λ y e_j^T = A X = λ X e_j e_j^T and X^{-1} A = e_j z^T

it follows that X^{-1} X = I and X^{-1} A X = λ e_j e_j^T.

Now we consider rank-one matrices that are not diagonalizable. In this case all eigenvalues are zero, and the matrix has a Jordan block of order two.

Theorem 2.5.2 (Jordan decomposition). Let A = y z^T ≠ 0 be a real square matrix with z^T y = 0. Then y and z have elements y_j z_j ≠ 0 ≠ y_k z_k, j < k. Define a symmetric permutation matrix P so that P e_k = e_{j+1} and P e_j = e_j. Set ŷ ≡ P y and û ≡ P z − e_{j+1}. Then X^{-1} A X = e_j e_{j+1}^T with

X ≡ P (I + ŷ e_j^T − (1/û_j) e_j û^T), X^{-1} = (I − e_j e_j^T + (1/ŷ_k) ŷ û^T − ((1 + ŷ_j)/ŷ_k) e_j û^T) P.

Proof. To satisfy z^T y = 0 for y ≠ 0 and z ≠ 0, we must have y_j z_j ≠ 0 and y_k z_k ≠ 0 for some j < k. Since A is a rank-one matrix with all eigenvalues equal to zero, it must have a

Jordan block of the form [0 1; 0 0]. To reveal this Jordan block, set ẑ ≡ P z,

X̂ ≡ I + ŷ e_j^T − (1/û_j) e_j û^T, X̂^{-1} = I − e_j e_j^T + (1/ŷ_k) ŷ û^T − ((1 + ŷ_j)/ŷ_k) e_j û^T.

Then the matrix Â ≡ ŷ ẑ^T has a Jordan decomposition X̂^{-1} Â X̂ = e_j e_{j+1}^T. This follows from û_j = ẑ_j, ŷ e_{j+1}^T = Â X̂ = X̂ e_j e_{j+1}^T, and X̂^{-1} Â = e_j ẑ^T. Finally, we undo the permutation by means of X ≡ P X̂, X^{-1} = X̂^{-1} P, so that X^{-1} X = I and X^{-1} A X = e_j e_{j+1}^T.

Theorems 2.5.1 and 2.5.2 can also be derived from [51, Theorem 1.4].

In the (theoretical) extreme case when all Web pages are dangling nodes, the Google matrix is diagonalizable of rank one.

Corollary 2.5.3 (rank-one Google matrix). With the notation in section 2.2 and (2.1), let G = e u^T. Let u_j ≠ 0 be a nonzero element of u. Then X^{-1} G X = e_j e_j^T with

X = I + e e_j^T − (1/u_j) e_j u^T and X^{-1} = I − e_j e_j^T − e u^T + 2 e_j u^T.

In particular, π^T = e_j^T X^{-1} = u^T.

Proof. Since 1 = u^T e ≠ 0, the Google matrix is diagonalizable, and the expression in Theorem 2.5.1 applies.

Corollary 2.5.3 can also be derived from [107, Theorems 2.1, 2.3].
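A quick numerical illustration of the corollary, with assumed random data: when every page is a dangling node, G = e u^T and the PageRank is u itself.

import numpy as np

rng = np.random.default_rng(4)
n, alpha = 6, 0.85
v = rng.random(n); v /= v.sum()                   # personalization vector
w = rng.random(n); w /= w.sum()                   # dangling node vector
u = alpha * w + (1 - alpha) * v                   # u as in (2.1)
G = np.outer(np.ones(n), u)                       # all pages dangling: G = e u^T
print(np.allclose(u @ G, u), np.isclose(u.sum(), 1.0))   # u^T G = u^T and ||u|| = 1, so pi = u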

Chapter 3

3.1 Introduction to ergodicity coefficients

Ergodicity coefficients were originally introduced in the context of rates of convergence of finite, inhomogeneous Markov chains. In general, ergodicity deals with the long-term behavior of dynamical systems. Thus for systems of finite, inhomogeneous Markov chains, ergodicity refers to the long-term behavior of products of stochastic matrices. An ergodicity coefficient τ_1(S) is defined for an n × n stochastic matrix S as

τ_1(S) = max { ‖S^T z‖_1 : ‖z‖_1 = 1, z^T e = 0 },

where the maximum is taken over all z ∈ R^n and e is a column vector of all ones. Notice that this form of an ergodicity coefficient is simply the norm of a matrix restricted to a subspace. In addition to being continuous and bounded, τ_1(S) is submultiplicative, and bounds the magnitude of the second largest eigenvalue of S. Thus τ_1(S_1 S_2) ≤

τ_1(S_1) τ_1(S_2) and |λ| ≤ τ_1(S) for all eigenvalues λ ≠ 1 of S, for stochastic matrices S_1 and S_2. These two properties combine to yield an upper bound on the magnitude of subdominant eigenvalues of the product of stochastic matrices. The upper bound is computed as the product of the ergodicity coefficients of the individual matrices. That is, for any eigenvalue λ ≠ 1 of the stochastic matrix S_1 S_2,

|λ| ≤ τ_1(S_1 S_2) ≤ τ_1(S_1) τ_1(S_2).

This inequality gives an eigenvalue bound for products of stochastic matrices, and is useful since, in general, the eigenvalues of a product of matrices are not equal to the product of the eigenvalues of the matrices.

This chapter discusses coefficients of ergodicity from a linear algebra point of view. We choose to express these coefficients as matrix norms restricted to a subspace, although we mention other expressions as well. We present properties and explicit forms for various ergodicity coefficients, and simplify results and notation from more than 75 references spanning more than 100 years, writing everything in language more familiar to the numerical linear algebra community. We conclude with new and existing applications, including condition numbers, eigenvalue and singular value bounds.

3.2 Definitions and Notation

We adopt the following notation for this chapter. We denote matrices by capital letters, e.g., A. All vectors are assumed to be column vectors, and are denoted by lowercase letters, e.g., a. Unless otherwise noted in a theorem or proof, the (i, j) entry of A is a_ij, and the i-th element of vector a is a_i. We use the common notation of I as the appropriate-sized identity matrix. We use

0 as a matrix, vector, and scalar, depending on the situation. The column vector e_i is the canonical vector with a 1 in the i-th position and 0's elsewhere. The column vector of all ones is denoted e. We denote the transpose of x ∈ R^{n×1} as x^T ∈ R^{1×n}, and the conjugate transpose of x ∈ C^{n×1} as x^* ∈ C^{1×n}.

We define a (row) stochastic matrix S ∈ R^{n×n} so that s_ij ≥ 0 for all i, j and the elements in each row of S sum to 1 (Σ_{j=1}^{n} s_ij = 1 for i = 1, ..., n, or Se = e). Further, we define a doubly stochastic matrix P ∈ R^{n×n} as a matrix that is both row and column stochastic, so that p_ij ≥ 0 for all i, j and the elements of each row and each column sum to one, so that Pe = e and P^T e = e. A probability vector v is defined so that v_i ≥ 0 for all i and Σ_i v_i = 1.

We also employ several norms throughout this chapter. The one-norm of a column vector x is ‖x‖_1 = Σ_i |x_i|, and the infinity-norm of a column vector x is ‖x‖_∞ = max_i |x_i|. The p-norm of a column vector x is ‖x‖_p = (Σ_i |x_i|^p)^{1/p}. The one-norm of a matrix A is the maximal column sum, ‖A‖_1 = max_j Σ_i |a_ij|, and the infinity-norm is the maximal row sum, ‖A‖_∞ = max_i Σ_j |a_ij|. In general, the p-norm of a matrix A ∈ C^{m×n} is ‖A‖_p = max_{‖x‖_p = 1} ‖Ax‖_p for x ∈ C^n. The Frobenius matrix norm is ‖A‖_F, with ‖A‖_F^2 = Σ_{i,j} |a_ij|^2 = trace(A^* A) [76, p. 279].

Finally, we use the terms ergodicity coefficient and coefficient of ergodicity interchangeably.

3.3 Weak ergodicity

To understand the history and development of coefficients of ergodicity, we begin with a short discussion of the history of weak ergodicity for products and sequences of stochastic matrices. Seneta [95, §1], [97, §1], [98] credits Kolmogorov [65] with the following definition of weak ergodicity for a sequence.

Definition. Let {S_k}, k ≥ 1, be a sequence of n × n stochastic matrices, and define T^(p,r) = S_{p+1} S_{p+2} ... S_{p+r}, with (i, j) entry t_ij^(p,r). The sequence {S_k} is weakly ergodic if for all i, j, s = 1, ..., n and p ≥ 0,

t_is^(p,r) − t_js^(p,r) → 0 as r → ∞.

Essentially, a sequence of stochastic matrices is weakly ergodic if, as the number of factors in a product approaches infinity, the rows of the product tend to equalize. Throughout the ergodicity literature, there appears to be some debate over the origins of necessary and sufficient conditions for weak ergodicity. Seneta gives excellent summaries of the discussion of the history and development of these conditions in [95, §1], [97, §1], and [98, Ch. 3-4]. For additional information, some ergodicity papers include those by Cohn [23, 24], Dobrushin [28], Hajnal [42, 43], Kingman [63], and Paz and Reichaw [82]. These papers are available in English, and we've found them to be more accessible and available than some of the older or foreign-language papers, including some from the 1920s-1950s by Bernštein, Doeblin, Dynkin, Sapogov, and Sarymsakov [10, 11, 12, 13, 14, 15, 27, 29, 30, 92, 93, 94].
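To make the definition concrete, the following sketch (random data; illustrative names) forms a growing product of random stochastic factors and tracks both the row spread max_{i,j,s} |t_is − t_js| from the definition and τ_1 of the product, together with the submultiplicative bound τ_1(S_1) ... τ_1(S_r) discussed in section 3.1. The τ_1 computation again assumes the standard characterization via the extreme points (e_i − e_j)/2.

import numpy as np

def tau_1(S):
    # (1/2) max_{i,j} sum_s |s_is - s_js|: the definition of tau_1 evaluated at
    # the extreme points (e_i - e_j)/2 (a standard characterization, assumed here)
    n = S.shape[0]
    return max(0.5 * np.abs(S[i] - S[j]).sum() for i in range(n) for j in range(n))

rng = np.random.default_rng(5)
T, bound = np.eye(4), 1.0
for r in range(1, 7):
    S = rng.random((4, 4)); S /= S.sum(axis=1, keepdims=True)   # factor S_r
    T, bound = T @ S, bound * tau_1(S)            # T = S_1 ... S_r and tau_1(S_1)...tau_1(S_r)
    spread = (T.max(axis=0) - T.min(axis=0)).max()   # max_{i,j,s} |t_is - t_js|
    print(r, round(spread, 6), round(tau_1(T), 6), round(bound, 6))
# Both the row spread and tau_1(T) shrink as r grows, and tau_1(T) <= bound throughout.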


More information

Here is an example of a block diagonal matrix with Jordan Blocks on the diagonal: J

Here is an example of a block diagonal matrix with Jordan Blocks on the diagonal: J Class Notes 4: THE SPECTRAL RADIUS, NORM CONVERGENCE AND SOR. Math 639d Due Date: Feb. 7 (updated: February 5, 2018) In the first part of this week s reading, we will prove Theorem 2 of the previous class.

More information

MATH 315 Linear Algebra Homework #1 Assigned: August 20, 2018

MATH 315 Linear Algebra Homework #1 Assigned: August 20, 2018 Homework #1 Assigned: August 20, 2018 Review the following subjects involving systems of equations and matrices from Calculus II. Linear systems of equations Converting systems to matrix form Pivot entry

More information

Math 307 Learning Goals. March 23, 2010

Math 307 Learning Goals. March 23, 2010 Math 307 Learning Goals March 23, 2010 Course Description The course presents core concepts of linear algebra by focusing on applications in Science and Engineering. Examples of applications from recent

More information

A Note on Google s PageRank

A Note on Google s PageRank A Note on Google s PageRank According to Google, google-search on a given topic results in a listing of most relevant web pages related to the topic. Google ranks the importance of webpages according to

More information

Lecture 1: Review of linear algebra

Lecture 1: Review of linear algebra Lecture 1: Review of linear algebra Linear functions and linearization Inverse matrix, least-squares and least-norm solutions Subspaces, basis, and dimension Change of basis and similarity transformations

More information

Markov Chains and Spectral Clustering

Markov Chains and Spectral Clustering Markov Chains and Spectral Clustering Ning Liu 1,2 and William J. Stewart 1,3 1 Department of Computer Science North Carolina State University, Raleigh, NC 27695-8206, USA. 2 nliu@ncsu.edu, 3 billy@ncsu.edu

More information

MAA704, Perron-Frobenius theory and Markov chains.

MAA704, Perron-Frobenius theory and Markov chains. November 19, 2013 Lecture overview Today we will look at: Permutation and graphs. Perron frobenius for non-negative. Stochastic, and their relation to theory. Hitting and hitting probabilities of chain.

More information

Columbus State Community College Mathematics Department Public Syllabus

Columbus State Community College Mathematics Department Public Syllabus Columbus State Community College Mathematics Department Public Syllabus Course and Number: MATH 2568 Elementary Linear Algebra Credits: 4 Class Hours Per Week: 4 Prerequisites: MATH 2153 with a C or higher

More information

Review Questions REVIEW QUESTIONS 71

Review Questions REVIEW QUESTIONS 71 REVIEW QUESTIONS 71 MATLAB, is [42]. For a comprehensive treatment of error analysis and perturbation theory for linear systems and many other problems in linear algebra, see [126, 241]. An overview of

More information

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS DATA MINING LECTURE 3 Link Analysis Ranking PageRank -- Random walks HITS How to organize the web First try: Manually curated Web Directories How to organize the web Second try: Web Search Information

More information

SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices)

SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Chapter 14 SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Today we continue the topic of low-dimensional approximation to datasets and matrices. Last time we saw the singular

More information

TMA Calculus 3. Lecture 21, April 3. Toke Meier Carlsen Norwegian University of Science and Technology Spring 2013

TMA Calculus 3. Lecture 21, April 3. Toke Meier Carlsen Norwegian University of Science and Technology Spring 2013 TMA4115 - Calculus 3 Lecture 21, April 3 Toke Meier Carlsen Norwegian University of Science and Technology Spring 2013 www.ntnu.no TMA4115 - Calculus 3, Lecture 21 Review of last week s lecture Last week

More information

CONTENTS. Preface List of Symbols and Notation

CONTENTS. Preface List of Symbols and Notation CONTENTS Preface List of Symbols and Notation xi xv 1 Introduction and Review 1 1.1 Deterministic and Stochastic Models 1 1.2 What is a Stochastic Process? 5 1.3 Monte Carlo Simulation 10 1.4 Conditional

More information

1 Multiply Eq. E i by λ 0: (λe i ) (E i ) 2 Multiply Eq. E j by λ and add to Eq. E i : (E i + λe j ) (E i )

1 Multiply Eq. E i by λ 0: (λe i ) (E i ) 2 Multiply Eq. E j by λ and add to Eq. E i : (E i + λe j ) (E i ) Direct Methods for Linear Systems Chapter Direct Methods for Solving Linear Systems Per-Olof Persson persson@berkeleyedu Department of Mathematics University of California, Berkeley Math 18A Numerical

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu 2/7/2012 Jure Leskovec, Stanford C246: Mining Massive Datasets 2 Web pages are not equally important www.joe-schmoe.com

More information

THE PERTURBATION BOUND FOR THE SPECTRAL RADIUS OF A NON-NEGATIVE TENSOR

THE PERTURBATION BOUND FOR THE SPECTRAL RADIUS OF A NON-NEGATIVE TENSOR THE PERTURBATION BOUND FOR THE SPECTRAL RADIUS OF A NON-NEGATIVE TENSOR WEN LI AND MICHAEL K. NG Abstract. In this paper, we study the perturbation bound for the spectral radius of an m th - order n-dimensional

More information

Comprehensive Introduction to Linear Algebra

Comprehensive Introduction to Linear Algebra Comprehensive Introduction to Linear Algebra WEB VERSION Joel G Broida S Gill Williamson N = a 11 a 12 a 1n a 21 a 22 a 2n C = a 11 a 12 a 1n a 21 a 22 a 2n a m1 a m2 a mn a m1 a m2 a mn Comprehensive

More information

Math 1553, Introduction to Linear Algebra

Math 1553, Introduction to Linear Algebra Learning goals articulate what students are expected to be able to do in a course that can be measured. This course has course-level learning goals that pertain to the entire course, and section-level

More information

Linear Algebra review Powers of a diagonalizable matrix Spectral decomposition

Linear Algebra review Powers of a diagonalizable matrix Spectral decomposition Linear Algebra review Powers of a diagonalizable matrix Spectral decomposition Prof. Tesler Math 283 Fall 2018 Also see the separate version of this with Matlab and R commands. Prof. Tesler Diagonalizing

More information

The Google Markov Chain: convergence speed and eigenvalues

The Google Markov Chain: convergence speed and eigenvalues U.U.D.M. Project Report 2012:14 The Google Markov Chain: convergence speed and eigenvalues Fredrik Backåker Examensarbete i matematik, 15 hp Handledare och examinator: Jakob Björnberg Juni 2012 Department

More information

Numerical Linear Algebra And Its Applications

Numerical Linear Algebra And Its Applications Numerical Linear Algebra And Its Applications Xiao-Qing JIN 1 Yi-Min WEI 2 August 29, 2008 1 Department of Mathematics, University of Macau, Macau, P. R. China. 2 Department of Mathematics, Fudan University,

More information

Matrices. Chapter What is a Matrix? We review the basic matrix operations. An array of numbers a a 1n A = a m1...

Matrices. Chapter What is a Matrix? We review the basic matrix operations. An array of numbers a a 1n A = a m1... Chapter Matrices We review the basic matrix operations What is a Matrix? An array of numbers a a n A = a m a mn with m rows and n columns is a m n matrix Element a ij in located in position (i, j The elements

More information

Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora

Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora Scribe: Today we continue the

More information

How does Google rank webpages?

How does Google rank webpages? Linear Algebra Spring 016 How does Google rank webpages? Dept. of Internet and Multimedia Eng. Konkuk University leehw@konkuk.ac.kr 1 Background on search engines Outline HITS algorithm (Jon Kleinberg)

More information

Quick Introduction to Nonnegative Matrix Factorization

Quick Introduction to Nonnegative Matrix Factorization Quick Introduction to Nonnegative Matrix Factorization Norm Matloff University of California at Davis 1 The Goal Given an u v matrix A with nonnegative elements, we wish to find nonnegative, rank-k matrices

More information

Math Linear Algebra Final Exam Review Sheet

Math Linear Algebra Final Exam Review Sheet Math 15-1 Linear Algebra Final Exam Review Sheet Vector Operations Vector addition is a component-wise operation. Two vectors v and w may be added together as long as they contain the same number n of

More information

CS 246 Review of Linear Algebra 01/17/19

CS 246 Review of Linear Algebra 01/17/19 1 Linear algebra In this section we will discuss vectors and matrices. We denote the (i, j)th entry of a matrix A as A ij, and the ith entry of a vector as v i. 1.1 Vectors and vector operations A vector

More information

Computational Economics and Finance

Computational Economics and Finance Computational Economics and Finance Part II: Linear Equations Spring 2016 Outline Back Substitution, LU and other decomposi- Direct methods: tions Error analysis and condition numbers Iterative methods:

More information

Updating Markov Chains Carl Meyer Amy Langville

Updating Markov Chains Carl Meyer Amy Langville Updating Markov Chains Carl Meyer Amy Langville Department of Mathematics North Carolina State University Raleigh, NC A. A. Markov Anniversary Meeting June 13, 2006 Intro Assumptions Very large irreducible

More information

Markov Chains and Stochastic Sampling

Markov Chains and Stochastic Sampling Part I Markov Chains and Stochastic Sampling 1 Markov Chains and Random Walks on Graphs 1.1 Structure of Finite Markov Chains We shall only consider Markov chains with a finite, but usually very large,

More information

Pseudocode for calculating Eigenfactor TM Score and Article Influence TM Score using data from Thomson-Reuters Journal Citations Reports

Pseudocode for calculating Eigenfactor TM Score and Article Influence TM Score using data from Thomson-Reuters Journal Citations Reports Pseudocode for calculating Eigenfactor TM Score and Article Influence TM Score using data from Thomson-Reuters Journal Citations Reports Jevin West and Carl T. Bergstrom November 25, 2008 1 Overview There

More information

Comparison of perturbation bounds for the stationary distribution of a Markov chain

Comparison of perturbation bounds for the stationary distribution of a Markov chain Linear Algebra and its Applications 335 (00) 37 50 www.elsevier.com/locate/laa Comparison of perturbation bounds for the stationary distribution of a Markov chain Grace E. Cho a, Carl D. Meyer b,, a Mathematics

More information

Lecture 15 Perron-Frobenius Theory

Lecture 15 Perron-Frobenius Theory EE363 Winter 2005-06 Lecture 15 Perron-Frobenius Theory Positive and nonnegative matrices and vectors Perron-Frobenius theorems Markov chains Economic growth Population dynamics Max-min and min-max characterization

More information

Properties of Matrices and Operations on Matrices

Properties of Matrices and Operations on Matrices Properties of Matrices and Operations on Matrices A common data structure for statistical analysis is a rectangular array or matris. Rows represent individual observational units, or just observations,

More information

Online Social Networks and Media. Link Analysis and Web Search

Online Social Networks and Media. Link Analysis and Web Search Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information

More information

Using Markov Chains To Model Human Migration in a Network Equilibrium Framework

Using Markov Chains To Model Human Migration in a Network Equilibrium Framework Using Markov Chains To Model Human Migration in a Network Equilibrium Framework Jie Pan Department of Mathematics and Computer Science Saint Joseph s University Philadelphia, PA 19131 Anna Nagurney School

More information

MAT 1302B Mathematical Methods II

MAT 1302B Mathematical Methods II MAT 1302B Mathematical Methods II Alistair Savage Mathematics and Statistics University of Ottawa Winter 2015 Lecture 19 Alistair Savage (uottawa) MAT 1302B Mathematical Methods II Winter 2015 Lecture

More information

Solving the Yang-Baxter Matrix Equation

Solving the Yang-Baxter Matrix Equation The University of Southern Mississippi The Aquila Digital Community Honors Theses Honors College 5-7 Solving the Yang-Baxter Matrix Equation Mallory O Jennings Follow this and additional works at: http://aquilausmedu/honors_theses

More information

Review of Basic Concepts in Linear Algebra

Review of Basic Concepts in Linear Algebra Review of Basic Concepts in Linear Algebra Grady B Wright Department of Mathematics Boise State University September 7, 2017 Math 565 Linear Algebra Review September 7, 2017 1 / 40 Numerical Linear Algebra

More information

Distributed Randomized Algorithms for the PageRank Computation Hideaki Ishii, Member, IEEE, and Roberto Tempo, Fellow, IEEE

Distributed Randomized Algorithms for the PageRank Computation Hideaki Ishii, Member, IEEE, and Roberto Tempo, Fellow, IEEE IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 55, NO. 9, SEPTEMBER 2010 1987 Distributed Randomized Algorithms for the PageRank Computation Hideaki Ishii, Member, IEEE, and Roberto Tempo, Fellow, IEEE Abstract

More information

Link Analysis Information Retrieval and Data Mining. Prof. Matteo Matteucci

Link Analysis Information Retrieval and Data Mining. Prof. Matteo Matteucci Link Analysis Information Retrieval and Data Mining Prof. Matteo Matteucci Hyperlinks for Indexing and Ranking 2 Page A Hyperlink Page B Intuitions The anchor text might describe the target page B Anchor

More information

ELE/MCE 503 Linear Algebra Facts Fall 2018

ELE/MCE 503 Linear Algebra Facts Fall 2018 ELE/MCE 503 Linear Algebra Facts Fall 2018 Fact N.1 A set of vectors is linearly independent if and only if none of the vectors in the set can be written as a linear combination of the others. Fact N.2

More information

MAT 2037 LINEAR ALGEBRA I web:

MAT 2037 LINEAR ALGEBRA I web: MAT 237 LINEAR ALGEBRA I 2625 Dokuz Eylül University, Faculty of Science, Department of Mathematics web: Instructor: Engin Mermut http://kisideuedutr/enginmermut/ HOMEWORK 2 MATRIX ALGEBRA Textbook: Linear

More information

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra. DS-GA 1002 Lecture notes 0 Fall 2016 Linear Algebra These notes provide a review of basic concepts in linear algebra. 1 Vector spaces You are no doubt familiar with vectors in R 2 or R 3, i.e. [ ] 1.1

More information

Lecture Note 12: The Eigenvalue Problem

Lecture Note 12: The Eigenvalue Problem MATH 5330: Computational Methods of Linear Algebra Lecture Note 12: The Eigenvalue Problem 1 Theoretical Background Xianyi Zeng Department of Mathematical Sciences, UTEP The eigenvalue problem is a classical

More information

MATRIX AND LINEAR ALGEBR A Aided with MATLAB

MATRIX AND LINEAR ALGEBR A Aided with MATLAB Second Edition (Revised) MATRIX AND LINEAR ALGEBR A Aided with MATLAB Kanti Bhushan Datta Matrix and Linear Algebra Aided with MATLAB Second Edition KANTI BHUSHAN DATTA Former Professor Department of Electrical

More information

A New Method to Find the Eigenvalues of Convex. Matrices with Application in Web Page Rating

A New Method to Find the Eigenvalues of Convex. Matrices with Application in Web Page Rating Applied Mathematical Sciences, Vol. 4, 200, no. 9, 905-9 A New Method to Find the Eigenvalues of Convex Matrices with Application in Web Page Rating F. Soleymani Department of Mathematics, Islamic Azad

More information

CENTRAL TEXAS COLLEGE SYLLABUS FOR MATH 2318 Linear Algebra. Semester Hours Credit: 3

CENTRAL TEXAS COLLEGE SYLLABUS FOR MATH 2318 Linear Algebra. Semester Hours Credit: 3 CENTRAL TEXAS COLLEGE SYLLABUS FOR MATH 2318 Linear Algebra Semester Hours Credit: 3 I. INTRODUCTION A. Linear Algebra is a three semester-hour course. This course introduces and provides models for application

More information

Paul Schrimpf. September 10, 2013

Paul Schrimpf. September 10, 2013 Systems of UBC Economics 526 September 10, 2013 More cardinality Let A and B be sets, the Cartesion product of A and B is A B := {(a, b) : a A, b B} Question If A and B are countable is A B countable?

More information

Foundations of Matrix Analysis

Foundations of Matrix Analysis 1 Foundations of Matrix Analysis In this chapter we recall the basic elements of linear algebra which will be employed in the remainder of the text For most of the proofs as well as for the details, the

More information

On the mathematical background of Google PageRank algorithm

On the mathematical background of Google PageRank algorithm Working Paper Series Department of Economics University of Verona On the mathematical background of Google PageRank algorithm Alberto Peretti, Alberto Roveda WP Number: 25 December 2014 ISSN: 2036-2919

More information

642:550, Summer 2004, Supplement 6 The Perron-Frobenius Theorem. Summer 2004

642:550, Summer 2004, Supplement 6 The Perron-Frobenius Theorem. Summer 2004 642:550, Summer 2004, Supplement 6 The Perron-Frobenius Theorem. Summer 2004 Introduction Square matrices whose entries are all nonnegative have special properties. This was mentioned briefly in Section

More information

This operation is - associative A + (B + C) = (A + B) + C; - commutative A + B = B + A; - has a neutral element O + A = A, here O is the null matrix

This operation is - associative A + (B + C) = (A + B) + C; - commutative A + B = B + A; - has a neutral element O + A = A, here O is the null matrix 1 Matrix Algebra Reading [SB] 81-85, pp 153-180 11 Matrix Operations 1 Addition a 11 a 12 a 1n a 21 a 22 a 2n a m1 a m2 a mn + b 11 b 12 b 1n b 21 b 22 b 2n b m1 b m2 b mn a 11 + b 11 a 12 + b 12 a 1n

More information

Introduction to Search Engine Technology Introduction to Link Structure Analysis. Ronny Lempel Yahoo Labs, Haifa

Introduction to Search Engine Technology Introduction to Link Structure Analysis. Ronny Lempel Yahoo Labs, Haifa Introduction to Search Engine Technology Introduction to Link Structure Analysis Ronny Lempel Yahoo Labs, Haifa Outline Anchor-text indexing Mathematical Background Motivation for link structure analysis

More information

Topics in linear algebra

Topics in linear algebra Chapter 6 Topics in linear algebra 6.1 Change of basis I want to remind you of one of the basic ideas in linear algebra: change of basis. Let F be a field, V and W be finite dimensional vector spaces over

More information

Announcements: Warm-up Exercise:

Announcements: Warm-up Exercise: Fri Mar 30 5.6 Discrete dynamical systems: Google page rank, using some notes I wrote a while ago. I'll probably also bring a handout to class. There are other documents on the internet about this subject.

More information

Linear Algebra Massoud Malek

Linear Algebra Massoud Malek CSUEB Linear Algebra Massoud Malek Inner Product and Normed Space In all that follows, the n n identity matrix is denoted by I n, the n n zero matrix by Z n, and the zero vector by θ n An inner product

More information

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018 Lab 8: Measuring Graph Centrality - PageRank Monday, November 5 CompSci 531, Fall 2018 Outline Measuring Graph Centrality: Motivation Random Walks, Markov Chains, and Stationarity Distributions Google

More information

Data Mining Recitation Notes Week 3

Data Mining Recitation Notes Week 3 Data Mining Recitation Notes Week 3 Jack Rae January 28, 2013 1 Information Retrieval Given a set of documents, pull the (k) most similar document(s) to a given query. 1.1 Setup Say we have D documents

More information

A First Course in Linear Algebra

A First Course in Linear Algebra A First Course in Li... Authored by Mr. Mohammed K A K... 6.0" x 9.0" (15.24 x 22.86 cm) Black & White on White paper 130 pages ISBN-13: 9781502901811 ISBN-10: 1502901811 Digital Proofer A First Course

More information

Notes on Linear Algebra and Matrix Theory

Notes on Linear Algebra and Matrix Theory Massimo Franceschet featuring Enrico Bozzo Scalar product The scalar product (a.k.a. dot product or inner product) of two real vectors x = (x 1,..., x n ) and y = (y 1,..., y n ) is not a vector but a

More information

NONCOMMUTATIVE POLYNOMIAL EQUATIONS. Edward S. Letzter. Introduction

NONCOMMUTATIVE POLYNOMIAL EQUATIONS. Edward S. Letzter. Introduction NONCOMMUTATIVE POLYNOMIAL EQUATIONS Edward S Letzter Introduction My aim in these notes is twofold: First, to briefly review some linear algebra Second, to provide you with some new tools and techniques

More information

The Solution of Linear Systems AX = B

The Solution of Linear Systems AX = B Chapter 2 The Solution of Linear Systems AX = B 21 Upper-triangular Linear Systems We will now develop the back-substitution algorithm, which is useful for solving a linear system of equations that has

More information

PageRank: The Math-y Version (Or, What To Do When You Can t Tear Up Little Pieces of Paper)

PageRank: The Math-y Version (Or, What To Do When You Can t Tear Up Little Pieces of Paper) PageRank: The Math-y Version (Or, What To Do When You Can t Tear Up Little Pieces of Paper) In class, we saw this graph, with each node representing people who are following each other on Twitter: Our

More information

Eigenvalues of Exponentiated Adjacency Matrices

Eigenvalues of Exponentiated Adjacency Matrices Eigenvalues of Exponentiated Adjacency Matrices Tracy Powell Lesley A Ward, Advisor Estelle Basor, Reader November 21, 25 Department of Mathematics 2 Chapter Problem A graph G is made up of vertices, or

More information

Review of Vectors and Matrices

Review of Vectors and Matrices A P P E N D I X D Review of Vectors and Matrices D. VECTORS D.. Definition of a Vector Let p, p, Á, p n be any n real numbers and P an ordered set of these real numbers that is, P = p, p, Á, p n Then P

More information