Authoritative Sources in a Hyperlinked Enviroment

Size: px
Start display at page:

Download "Authoritative Sources in a Hyperlinked Enviroment"

Transcription

1 Authoritative Sources in a Hyperlinked Enviroment Dimitris Sakavalas June 15, 2010

2 Outline Introduction Problem description Authorities and Broad-topic queries Algorithm outline Construction of a Focused Subgraph of the www Expanding R into the base set S Heuristics impoving base set S Computing Hubs and Authorities Iterative Algorithm Elements of Linear Algebra Convergence of the Iterative Algorithm Similar page queries Multiple Sets of Hubs and Authorities Non-Principal Eigenvectors and Clustering of G Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 2/25

3 Introduction Problem description www-search Discover pages relevant to a given query string Discover the most authoritative pages (subjective, requires human evaluation) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 3/25

4 Introduction Problem description www-search Discover pages relevant to a given query string Discover the most authoritative pages (subjective, requires human evaluation) Queries Specic queries Scarcity Problem Broad-topic queries Abundance Problem Similar-page queries Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 3/25

5 Introduction Authorities and Broad-topic queries In search of a denition of authority Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 4/25

6 Introduction Authorities and Broad-topic queries In search of a denition of authority No endogenous measure of a page to assess authority (ex. Text-based ranking functions for queries: \Harvard", \search engines", \automobile manufacturers", images) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 4/25

7 Introduction Authorities and Broad-topic queries In search of a denition of authority No endogenous measure of a page to assess authority (ex. Text-based ranking functions for queries: \Harvard", \search engines", \automobile manufacturers", images) Solution: Analysis of the link structure Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 4/25

8 Introduction Authorities and Broad-topic queries In search of a denition of authority No endogenous measure of a page to assess authority (ex. Text-based ranking functions for queries: \Harvard", \search engines", \automobile manufacturers", images) Solution: Analysis of the link structure Pitfalls: Advertising links Navigational links Universal Popularity links Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 4/25

9 Introduction Authorities and Broad-topic queries In search of a denition of authority No endogenous measure of a page to assess authority (ex. Text-based ranking functions for queries: \Harvard", \search engines", \automobile manufacturers", images) Solution: Analysis of the link structure Pitfalls: Advertising links Navigational links Universal Popularity links Question.How reliable is page p? Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 4/25

10 Introduction Algorithm outline Hubs Pages that link to many Authorities Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 5/25

11 Introduction Algorithm outline Hubs Pages that link to many Authorities Outline of the algorithm 1 Create a focused subgraph of the www from the output of a text-based engine (Small collection of pages likely to contain the most authoritative pages) 2 Identify hubs and authorities Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 5/25

12 Construction of a Focused Subgraph of the www Directed graph G = (V; E) of the www. V the collection of hyperlinked pages of the www V : (nodes correspond to the pages) (p; q) E There is a link from p to q G[W]. Subgraph of G induced on W V Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 6/25

13 Construction of a Focused Subgraph of the www Directed graph G = (V; E) of the www. V the collection of hyperlinked pages of the www V : (nodes correspond to the pages) (p; q) E There is a link from p to q G[W]. Subgraph of G induced on W V Goal Construct G = G[S ] such that 1 S is relatively small (computational cost) 2 S rich in relevant pages (search quality) 3 S contains most of the strongest authorities Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 6/25

14 Construction of a Focused Subgraph of the www Directed graph G = (V; E) of the www. V the collection of hyperlinked pages of the www V : (nodes correspond to the pages) (p; q) E There is a link from p to q G[W]. Subgraph of G induced on W V Goal Construct G = G[S ] such that 1 S is relatively small (computational cost) 2 S rich in relevant pages (search quality) 3 S contains most of the strongest authorities Observation S rich in relevant pages many links to authorities Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 6/25

15 Construction of a Focused Subgraph of the www Notation Γ + (p): the set of all pages p points to Γ (p): the set of all pages pointing to p E: a text-based search engine Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 7/25

16 Construction of a Focused Subgraph of the www Notation Γ + (p): the set of all pages p points to Γ (p): the set of all pages pointing to p E: a text-based search engine 1st Step Root set R ={t highest ranked pages for the query from E} R satises (i); (ii). Generally doesn't satisfy (iii) and is structureless (few links) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 7/25

17 Construction of a Focused Subgraph of the www Notation Γ + (p): the set of all pages p points to Γ (p): the set of all pages pointing to p E: a text-based search engine 1st Step Root set R ={t highest ranked pages for the query from E} R satises (i); (ii). Generally doesn't satisfy (iii) and is structureless (few links) Observation If a = R, strong authority, there is a high probability that p R such that a Γ + (p) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 7/25

18 Expanding R into the base set S 2nd Step Idea:Expand R into the Base set S by adding strong authorities and relevant pages Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 8/25

19 Expanding R into the base set S 2nd Step Idea:Expand R into the Base set S by adding strong authorities and relevant pages Subgraph(; E ; t ; d) t; d N Set S := R For each page p R Add all pages in Γ + (p) to S If Γ (p) d then, Add all pages in Γ (p) to S Else Add an arbitary set of d pages from Γ (p) to S End. Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 8/25

20 Expanding R into the base set S Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 9/25

21 Heuristics impoving S Links Transverse: link between pages with dierent domain name Intrinsic: link between pages with the same domain name (navigational) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 10/25

22 Heuristics impoving S Links Transverse: link between pages with dierent domain name Intrinsic: link between pages with the same domain name (navigational) 3rd Step:Heuristics 1 Delete all Intrinsic links from the graph G 2 Allow up to a small number m pages from a single domain to point to any given page p Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 10/25

23 Heuristics impoving S Links Transverse: link between pages with dierent domain name Intrinsic: link between pages with the same domain name (navigational) 3rd Step:Heuristics 1 Delete all Intrinsic links from the graph G 2 Allow up to a small number m pages from a single domain to point to any given page p Finally we obtain a small subgraph G, relatively focused on the query, containing many relevant pages and strong authorities. Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 10/25

24 Hubs and Authorities Purely In-degree ordering of Authorities Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 11/25

25 Hubs and Authorities Purely In-degree ordering of Authorities Problem. \Universally popular" pages, large in-degree regardless of the underlying query topic (ex.amazon Books) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 11/25

26 Hubs and Authorities Purely In-degree ordering of Authorities Problem. \Universally popular" pages, large in-degree regardless of the underlying query topic (ex.amazon Books) Observation Authorities: high in-degree, considerable overlap in the sets of pages that point to them(hubs) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 11/25

27 Hubs and Authorities Observation (mutually reinforcing relationship) Good Hub: a page that points to many good authorities Good Authority: a page that is pointed by many good hubs Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 12/25

28 Hubs and Authorities Observation (mutually reinforcing relationship) Good Hub: a page that points to many good authorities Good Authority: a page that is pointed by many good hubs Denitions p S = {1; 2; :::; n} assign: authority weight x <p> 0 hub weight y <p> 0 Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 12/25

29 Hubs and Authorities Observation (mutually reinforcing relationship) Good Hub: a page that points to many good authorities Good Authority: a page that is pointed by many good hubs Denitions p S = {1; 2; :::; n} assign: authority weight x <p> 0 hub weight y <p> 0 corresponding vectors x = x <1> x <2>. x <n> and y = y <1> y <2>. y <n> Normalized such that x = y = 1 example.for norm 1 ; n i=1 x <i> = n i=1 y <i> = 1 Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 12/25

30 Hubs and Authorities Denitions I(x ; y) (x -weight update): x <p> ; x <p> q:(q;p) E y <q> Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 13/25

31 Hubs and Authorities Denitions I(x ; y) (x -weight update): x <p> ; x <p> q:(q;p) E O(x ; y) (y-weight update): y <p> ; y <p> q:(p;q) E y <q> x <q> ; Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 13/25

32 Hubs and Authorities Denitions I(x ; y) (x -weight update): x <p> ; x <p> q:(q;p) E O(x ; y) (y-weight update): y <p> ; y <p> q:(p;q) E y <q> x <q> ; Idea Find the disired \equilibrium" values for the weights by applying I; O operations iterativelly. Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 13/25

33 Hubs and Authorities Denitions I(x ; y) (x -weight update): x <p> ; x <p> q:(q;p) E O(x ; y) (y-weight update): y <p> ; y <p> q:(p;q) E y <q> x <q> ; Idea Find the disired \equilibrium" values for the weights by applying I; O operations iterativelly. Question. Does this \equilibrium" exist? Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 13/25

34 Hubs and Authorities Denitions I(x ; y) (x -weight update): x <p> ; x <p> q:(q;p) E O(x ; y) (y-weight update): y <p> ; y <p> q:(p;q) E y <q> x <q> ; Idea Find the disired \equilibrium" values for the weights by applying I; O operations iterativelly. Question. Does this \equilibrium" exist? YES (linear algebra) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 13/25

35 Iterative algorithm Iterate(G ; k) z := (1; 1; :::; 1) R n Set x 0 := z Set y 0 := z For i = 1; 2; :::; k Apply the I(x i 1 ; y i 1 ) operation to obtain new x weights x i Apply the O(x i ; y i 1) operation to obtain new y weights y i Normalize x i, obtaining x i Normalize y i, obtaining y i End Return (x k ; y k ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 14/25

36 Iterative algorithm Iterate(G ; k) z := (1; 1; :::; 1) R n Set x 0 := z Set y 0 := z For i = 1; 2; :::; k Apply the I(x i 1 ; y i 1 ) operation to obtain new x weights x i Apply the O(x i ; y i 1) operation to obtain new y weights y i Normalize x i, obtaining x i Normalize y i, obtaining y i End Return (x k ; y k ) Filter(G ; k ; c) (x k ; y k ) :=Iterate(G ; k) Report the pages with the c largest coordinates in x k as authorities Report the pages with the c largest coordinates in y k as hubs Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 14/25

37 Elements of Linear Algebra Notation Matrix M C n n Eigenvalues i (M ) Eigenvectors! i (M ) eigenspace V subspace of C n multiplicity of : m = dim(v ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 15/25

38 Elements of Linear Algebra Notation Matrix M C n n Eigenvalues i (M ) Eigenvectors! i (M ) eigenspace V subspace of C n multiplicity of : m = dim(v ) Denitions Positive matrix M : If M i;j > 0; i ; j Primitive matrix M : If m N; M m : positive Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 15/25

39 Elements of Linear Algebra Let M R n n symmetric, nonnegative matrix (M i;j 0; i ; j ) Theorems 1 M has at most n distinct eigenvalues and = n m i Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 16/25

40 Elements of Linear Algebra Let M R n n symmetric, nonnegative matrix (M i;j 0; i ; j ) Theorems 1 M has at most n distinct eigenvalues and m i = n 2 M has only real eigenvalues i Denote them 1 (M ) 2 (M ) ::: n (M ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 16/25

41 Elements of Linear Algebra Let M R n n symmetric, nonnegative matrix (M i;j 0; i ; j ) Theorems 1 M has at most n distinct eigenvalues and m i = n 2 M has only real eigenvalues i Denote them 1 (M ) 2 (M ) ::: n (M ) 3 (Perron-Frobenius) If M is primitive then: i. Largest eigenvalue 1 (M ) > 0 and m = 1 ii. 1 > i i 1 iii. 1 has a corresponding eigenvector! 1 (M ) with all entries positive (the principal eigenvector) Consider! 1 (M ) normalized,! 1 (M ) = 1 Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 16/25

42 Elements of Linear Algebra Let M R n n symmetric, nonnegative matrix (M i;j 0; i ; j ) Theorems 1 M has at most n distinct eigenvalues and m i = n 2 M has only real eigenvalues i Denote them 1 (M ) 2 (M ) ::: n (M ) 3 (Perron-Frobenius) If M is primitive then: i. Largest eigenvalue 1 (M ) > 0 and m = 1 ii. 1 > i i 1 iii. 1 has a corresponding eigenvector! 1 (M ) with all entries positive (the principal eigenvector) Consider! 1 (M ) normalized,! 1 (M ) = 1 4 If M is primitive and vector v not orthogonal to the principal eigenvector! 1 (M ), let v k be the unit vector in the direction of M k k v. Then v k! 1 (M ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 16/25

43 Elements of Linear Algebra Observation Note that we can consider! 1 (M ) as a vector of the orthonormal base of V 1. Moreover dimv 1 = m 1 = 1 and all entries in! 1 (M ) are positive. Hence the principal eigenvector! 1 (M ) is unique. Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 17/25

44 Connection of Iterative algorithm to linear Algebra Let A be the adjacency matrix of G Observation Operations I(x ; y) and O(x ; y) can be written as x A y and y Ax respectivelly Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 18/25

45 Connection of Iterative algorithm to linear Algebra Let A be the adjacency matrix of G Observation Operations I(x ; y) and O(x ; y) can be written as x A y and y Ax respectivelly Iterate Algorithm The algorithm initializes x 0 = y 0 = z =. 1 step of thalgorithm we have the following steps: I(x i ; y i 1) x i = A y i 1 O(x i ; y i ) y i = Ax i 1. In the and i th Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 18/25

46 Connection of Iterative algorithm to linear Algebra Let A be the adjacency matrix of G Observation Operations I(x ; y) and O(x ; y) can be written as x A y and y Ax respectivelly Iterate Algorithm The algorithm initializes x 0 = y 0 = z =. 1 step of thalgorithm we have the following steps: I(x i ; y i 1) x i = A y i 1 1. In the and i th O(x i ; y i ) y i = Ax i x i = A Ax i 1 = (A A) i x 0 y i = AA y i 1 = (AA ) i y 0 } x i = (A A) i z y i = (AA ) i z ( ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 18/25

47 Convergence of the Iterative Algorithm Observation We consider x i ; y i normalized in each step as in Iterate algorithm. So in every step x i = y i = 1 For simplicity, matrices AA and A A are considered primitive Obviously z c 0; c R n 0 Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 19/25

48 Convergence of the Iterative Algorithm Observation We consider x i ; y i normalized in each step as in Iterate algorithm. So in every step x i = y i = 1 For simplicity, matrices AA and A A are considered primitive Obviously z c 0; c R n 0 Theorem The sequences {x i } and {y i } produced by Iterate Algorithm converge to limits x ; y respectively Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 19/25

49 Convergence of the Iterative Algorithm Observation We consider x i ; y i normalized in each step as in Iterate algorithm. So in every step x i = y i = 1 For simplicity, matrices AA and A A are considered primitive Obviously z c 0; c R n 0 Theorem The sequences {x i } and {y i } produced by Iterate Algorithm converge to limits x ; y respectively Proof. A A primitive, z not orthogonal to the principal eigenvector! 1 (A A) = x, x i the unit vector in the direction (A A) i z Hence from Theorem 4 {x i } x, simirarly {y i } y =! 1 (AA ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 19/25

50 Conclusion Conclusion The process of iterated I=O operations will converge. The desired equilibrium values for the x ; y weights, are the values x ; y. Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 20/25

51 Conclusion Conclusion The process of iterated I=O operations will converge. The desired equilibrium values for the x ; y weights, are the values x ; y. Observation In practice the convergence of Iterate is quite rapid, so one can compute x ; y weights by starting from any initial vectors x 0 ; y 0, and performing a xed number of I; O operations Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 20/25

52 Conclusion Conclusion The process of iterated I=O operations will converge. The desired equilibrium values for the x ; y weights, are the values x ; y. Observation In practice the convergence of Iterate is quite rapid, so one can compute x ; y weights by starting from any initial vectors x 0 ; y 0, and performing a xed number of I; O operations Hyperlink-Induced Topic Search (HITS) algorithm (Ask.com) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 20/25

53 Similar-Page Queries Query. Find pages similar to p Idea Initiate search with the page p instead of a query string Use the link structure to infer \similarity among pages" Similar pages to p Strongest authorities in local region of p Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 21/25

54 Similar-Page Queries Query. Find pages similar to p Idea Initiate search with the page p instead of a query string Use the link structure to infer \similarity among pages" Similar pages to p Strongest authorities in local region of p Adaptation of the broad-topic query method 1 Find t pages pointing to p Assemble root set R p 2 Expand R p to base set S p as before 3 Search for authorities and hubs in the focused subgraph G p Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 21/25

55 Multiple Sets of Hubs and Authorities Situation. For a query string, relevant pages grouped into clusters. Reasons Dierent meanings of. ex. \jaguar\ String arises as term in multiple technical communities. ex. \randomized algorithms" String refers to a highly polarized issue. ex.\abortion" Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 22/25

56 Multiple Sets of Hubs and Authorities Situation. For a query string, relevant pages grouped into clusters. Reasons Dierent meanings of. ex. \jaguar\ String arises as term in multiple technical communities. ex. \randomized algorithms" String refers to a highly polarized issue. ex.\abortion" Problem. Identify the clusters Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 22/25

57 Multiple Sets of Hubs and Authorities Situation. For a query string, relevant pages grouped into clusters. Reasons Dierent meanings of. ex. \jaguar\ String arises as term in multiple technical communities. ex. \randomized algorithms" String refers to a highly polarized issue. ex.\abortion" Problem. Identify the clusters Idea Clusters represented in focused subgraph G as densely linked subgraphs(collection of hubs and authorities) Extract densely linked collections of hubs and authorities through non-principal eigenvectors of AA and A A Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 22/25

58 Non-Principal Eigenvectors and Clustering of G Proposition AA and A A have the same multiset of eigenvalues, and their eigenvectors can be chosen so that! i (AA ) = A! i (A A) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 23/25

59 Non-Principal Eigenvectors and Clustering of G Proposition AA and A A have the same multiset of eigenvalues, and their eigenvectors can be chosen so that! i (AA ) = A! i (A A) Observation For each pair of eigenvectors x i =! i (A A); y i =! i (AA ) operation I(x i ; y i ) keeps x weights parallel to x i : x = A! i (AA ) = A A! i (A A) = i x i Simirarly operation O(x i ; y i ) keeps x weights parallel to y i Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 23/25

60 Non-Principal Eigenvectors and Clustering of G Proposition AA and A A have the same multiset of eigenvalues, and their eigenvectors can be chosen so that! i (AA ) = A! i (A A) Observation For each pair of eigenvectors x i =! i (A A); y i =! i (AA ) operation I(x i ; y i ) keeps x weights parallel to x i : x = A! i (AA ) = A A! i (A A) = i x i Simirarly operation O(x i ; y i ) keeps x weights parallel to y i Non-principal eigenvectors have both positive and negative entries. Hence each pair of (x i ; y i ) provide us with two densely connected set of hubs and authorities Pages that correspond to the c coordinates with the most positive values Pages that correspond to the c coordinates with the most negative values Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 23/25

61 Non-Principal Eigenvectors and Clustering of G Description of the method For each of the st few non-principal eigenvectors (x i ; y i ) nd the two densely connected set of hubs and authorities through an algorithm similar(but less clean conceptually) to the Iterate algorithm Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 24/25

62 Non-Principal Eigenvectors and Clustering of G Description of the method For each of the st few non-principal eigenvectors (x i ; y i ) nd the two densely connected set of hubs and authorities through an algorithm similar(but less clean conceptually) to the Iterate algorithm The pages with large coordinates in the rst few nonprincipal eigenvectors tend to recur, so that essentially the same collection of hubs and authorities will often be generated by several of the strongest nonprincipal eigenvectors Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 24/25

63 Similar Concepts in Other Areas Social networks \Standing" Bibliometrics \Impact" Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 25/25

64 Similar Concepts in Other Areas Social networks \Standing" Bibliometrics \Impact" Economics (Wassily Leontief 1941-Nobel prize 1973) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 25/25

Data Mining and Matrices

Data Mining and Matrices Data Mining and Matrices 10 Graphs II Rainer Gemulla, Pauli Miettinen Jul 4, 2013 Link analysis The web as a directed graph Set of web pages with associated textual content Hyperlinks between webpages

More information

1 Searching the World Wide Web

1 Searching the World Wide Web Hubs and Authorities in a Hyperlinked Environment 1 Searching the World Wide Web Because diverse users each modify the link structure of the WWW within a relatively small scope by creating web-pages on

More information

Link Analysis Ranking

Link Analysis Ranking Link Analysis Ranking How do search engines decide how to rank your query results? Guess why Google ranks the query results the way it does How would you do it? Naïve ranking of query results Given query

More information

Introduction to Search Engine Technology Introduction to Link Structure Analysis. Ronny Lempel Yahoo Labs, Haifa

Introduction to Search Engine Technology Introduction to Link Structure Analysis. Ronny Lempel Yahoo Labs, Haifa Introduction to Search Engine Technology Introduction to Link Structure Analysis Ronny Lempel Yahoo Labs, Haifa Outline Anchor-text indexing Mathematical Background Motivation for link structure analysis

More information

CS 3750 Advanced Machine Learning. Applications of SVD and PCA (LSA and Link analysis) Cem Akkaya

CS 3750 Advanced Machine Learning. Applications of SVD and PCA (LSA and Link analysis) Cem Akkaya CS 375 Advanced Machine Learning Applications of SVD and PCA (LSA and Link analysis) Cem Akkaya Outline SVD and LSI Kleinberg s Algorithm PageRank Algorithm Vector Space Model Vector space model represents

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu How to organize/navigate it? First try: Human curated Web directories Yahoo, DMOZ, LookSmart

More information

Hyperlinked-Induced Topic Search (HITS) identifies. authorities as good content sources (~high indegree) HITS [Kleinberg 99] considers a web page

Hyperlinked-Induced Topic Search (HITS) identifies. authorities as good content sources (~high indegree) HITS [Kleinberg 99] considers a web page IV.3 HITS Hyperlinked-Induced Topic Search (HITS) identifies authorities as good content sources (~high indegree) hubs as good link sources (~high outdegree) HITS [Kleinberg 99] considers a web page a

More information

A Note on Google s PageRank

A Note on Google s PageRank A Note on Google s PageRank According to Google, google-search on a given topic results in a listing of most relevant web pages related to the topic. Google ranks the importance of webpages according to

More information

Online Social Networks and Media. Link Analysis and Web Search

Online Social Networks and Media. Link Analysis and Web Search Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information

More information

Link Analysis. Leonid E. Zhukov

Link Analysis. Leonid E. Zhukov Link Analysis Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Structural Analysis and Visualization

More information

Link Analysis. Reference: Introduction to Information Retrieval by C. Manning, P. Raghavan, H. Schutze

Link Analysis. Reference: Introduction to Information Retrieval by C. Manning, P. Raghavan, H. Schutze Link Analysis Reference: Introduction to Information Retrieval by C. Manning, P. Raghavan, H. Schutze 1 The Web as a Directed Graph Page A Anchor hyperlink Page B Assumption 1: A hyperlink between pages

More information

Faloutsos, Tong ICDE, 2009

Faloutsos, Tong ICDE, 2009 Large Graph Mining: Patterns, Tools and Case Studies Christos Faloutsos Hanghang Tong CMU Copyright: Faloutsos, Tong (29) 2-1 Outline Part 1: Patterns Part 2: Matrix and Tensor Tools Part 3: Proximity

More information

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018 Lab 8: Measuring Graph Centrality - PageRank Monday, November 5 CompSci 531, Fall 2018 Outline Measuring Graph Centrality: Motivation Random Walks, Markov Chains, and Stationarity Distributions Google

More information

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS DATA MINING LECTURE 3 Link Analysis Ranking PageRank -- Random walks HITS How to organize the web First try: Manually curated Web Directories How to organize the web Second try: Web Search Information

More information

Wiki Definition. Reputation Systems I. Outline. Introduction to Reputations. Yury Lifshits. HITS, PageRank, SALSA, ebay, EigenTrust, VKontakte

Wiki Definition. Reputation Systems I. Outline. Introduction to Reputations. Yury Lifshits. HITS, PageRank, SALSA, ebay, EigenTrust, VKontakte Reputation Systems I HITS, PageRank, SALSA, ebay, EigenTrust, VKontakte Yury Lifshits Wiki Definition Reputation is the opinion (more technically, a social evaluation) of the public toward a person, a

More information

CS 277: Data Mining. Mining Web Link Structure. CS 277: Data Mining Lectures Analyzing Web Link Structure Padhraic Smyth, UC Irvine

CS 277: Data Mining. Mining Web Link Structure. CS 277: Data Mining Lectures Analyzing Web Link Structure Padhraic Smyth, UC Irvine CS 277: Data Mining Mining Web Link Structure Class Presentations In-class, Tuesday and Thursday next week 2-person teams: 6 minutes, up to 6 slides, 3 minutes/slides each person 1-person teams 4 minutes,

More information

How does Google rank webpages?

How does Google rank webpages? Linear Algebra Spring 016 How does Google rank webpages? Dept. of Internet and Multimedia Eng. Konkuk University leehw@konkuk.ac.kr 1 Background on search engines Outline HITS algorithm (Jon Kleinberg)

More information

Applications to network analysis: Eigenvector centrality indices Lecture notes

Applications to network analysis: Eigenvector centrality indices Lecture notes Applications to network analysis: Eigenvector centrality indices Lecture notes Dario Fasino, University of Udine (Italy) Lecture notes for the second part of the course Nonnegative and spectral matrix

More information

Detailed Proof of The PerronFrobenius Theorem

Detailed Proof of The PerronFrobenius Theorem Detailed Proof of The PerronFrobenius Theorem Arseny M Shur Ural Federal University October 30, 2016 1 Introduction This famous theorem has numerous applications, but to apply it you should understand

More information

Spectra of Adjacency and Laplacian Matrices

Spectra of Adjacency and Laplacian Matrices Spectra of Adjacency and Laplacian Matrices Definition: University of Alicante (Spain) Matrix Computing (subject 3168 Degree in Maths) 30 hours (theory)) + 15 hours (practical assignment) Contents 1. Spectra

More information

Online Social Networks and Media. Link Analysis and Web Search

Online Social Networks and Media. Link Analysis and Web Search Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information

More information

SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices)

SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Chapter 14 SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Today we continue the topic of low-dimensional approximation to datasets and matrices. Last time we saw the singular

More information

1 Matrix notation and preliminaries from spectral graph theory

1 Matrix notation and preliminaries from spectral graph theory Graph clustering (or community detection or graph partitioning) is one of the most studied problems in network analysis. One reason for this is that there are a variety of ways to define a cluster or community.

More information

6.207/14.15: Networks Lectures 4, 5 & 6: Linear Dynamics, Markov Chains, Centralities

6.207/14.15: Networks Lectures 4, 5 & 6: Linear Dynamics, Markov Chains, Centralities 6.207/14.15: Networks Lectures 4, 5 & 6: Linear Dynamics, Markov Chains, Centralities 1 Outline Outline Dynamical systems. Linear and Non-linear. Convergence. Linear algebra and Lyapunov functions. Markov

More information

STA141C: Big Data & High Performance Statistical Computing

STA141C: Big Data & High Performance Statistical Computing STA141C: Big Data & High Performance Statistical Computing Lecture 6: Numerical Linear Algebra: Applications in Machine Learning Cho-Jui Hsieh UC Davis April 27, 2017 Principal Component Analysis Principal

More information

Chapter 3 Transformations

Chapter 3 Transformations Chapter 3 Transformations An Introduction to Optimization Spring, 2014 Wei-Ta Chu 1 Linear Transformations A function is called a linear transformation if 1. for every and 2. for every If we fix the bases

More information

Lecture 15 Perron-Frobenius Theory

Lecture 15 Perron-Frobenius Theory EE363 Winter 2005-06 Lecture 15 Perron-Frobenius Theory Positive and nonnegative matrices and vectors Perron-Frobenius theorems Markov chains Economic growth Population dynamics Max-min and min-max characterization

More information

Fiedler s Theorems on Nodal Domains

Fiedler s Theorems on Nodal Domains Spectral Graph Theory Lecture 7 Fiedler s Theorems on Nodal Domains Daniel A. Spielman September 19, 2018 7.1 Overview In today s lecture we will justify some of the behavior we observed when using eigenvectors

More information

Google Page Rank Project Linear Algebra Summer 2012

Google Page Rank Project Linear Algebra Summer 2012 Google Page Rank Project Linear Algebra Summer 2012 How does an internet search engine, like Google, work? In this project you will discover how the Page Rank algorithm works to give the most relevant

More information

MultiRank and HAR for Ranking Multi-relational Data, Transition Probability Tensors, and Multi-Stochastic Tensors

MultiRank and HAR for Ranking Multi-relational Data, Transition Probability Tensors, and Multi-Stochastic Tensors MultiRank and HAR for Ranking Multi-relational Data, Transition Probability Tensors, and Multi-Stochastic Tensors Michael K. Ng Centre for Mathematical Imaging and Vision and Department of Mathematics

More information

Machine Learning. Principal Components Analysis. Le Song. CSE6740/CS7641/ISYE6740, Fall 2012

Machine Learning. Principal Components Analysis. Le Song. CSE6740/CS7641/ISYE6740, Fall 2012 Machine Learning CSE6740/CS7641/ISYE6740, Fall 2012 Principal Components Analysis Le Song Lecture 22, Nov 13, 2012 Based on slides from Eric Xing, CMU Reading: Chap 12.1, CB book 1 2 Factor or Component

More information

(a) II and III (b) I (c) I and III (d) I and II and III (e) None are true.

(a) II and III (b) I (c) I and III (d) I and II and III (e) None are true. 1 Which of the following statements is always true? I The null space of an m n matrix is a subspace of R m II If the set B = {v 1,, v n } spans a vector space V and dimv = n, then B is a basis for V III

More information

Complex Social System, Elections. Introduction to Network Analysis 1

Complex Social System, Elections. Introduction to Network Analysis 1 Complex Social System, Elections Introduction to Network Analysis 1 Complex Social System, Network I person A voted for B A is more central than B if more people voted for A In-degree centrality index

More information

ECEN 689 Special Topics in Data Science for Communications Networks

ECEN 689 Special Topics in Data Science for Communications Networks ECEN 689 Special Topics in Data Science for Communications Networks Nick Duffield Department of Electrical & Computer Engineering Texas A&M University Lecture 8 Random Walks, Matrices and PageRank Graphs

More information

Complex Networks CSYS/MATH 303, Spring, Prof. Peter Dodds

Complex Networks CSYS/MATH 303, Spring, Prof. Peter Dodds Complex Networks CSYS/MATH 303, Spring, 2011 Prof. Peter Dodds Department of Mathematics & Statistics Center for Complex Systems Vermont Advanced Computing Center University of Vermont Licensed under the

More information

Linear Algebra: Characteristic Value Problem

Linear Algebra: Characteristic Value Problem Linear Algebra: Characteristic Value Problem . The Characteristic Value Problem Let < be the set of real numbers and { be the set of complex numbers. Given an n n real matrix A; does there exist a number

More information

CSI 445/660 Part 6 (Centrality Measures for Networks) 6 1 / 68

CSI 445/660 Part 6 (Centrality Measures for Networks) 6 1 / 68 CSI 445/660 Part 6 (Centrality Measures for Networks) 6 1 / 68 References 1 L. Freeman, Centrality in Social Networks: Conceptual Clarification, Social Networks, Vol. 1, 1978/1979, pp. 215 239. 2 S. Wasserman

More information

Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora

Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora Scribe: Today we continue the

More information

Link Analysis Information Retrieval and Data Mining. Prof. Matteo Matteucci

Link Analysis Information Retrieval and Data Mining. Prof. Matteo Matteucci Link Analysis Information Retrieval and Data Mining Prof. Matteo Matteucci Hyperlinks for Indexing and Ranking 2 Page A Hyperlink Page B Intuitions The anchor text might describe the target page B Anchor

More information

Linear algebra and applications to graphs Part 1

Linear algebra and applications to graphs Part 1 Linear algebra and applications to graphs Part 1 Written up by Mikhail Belkin and Moon Duchin Instructor: Laszlo Babai June 17, 2001 1 Basic Linear Algebra Exercise 1.1 Let V and W be linear subspaces

More information

Eigenvalues of Exponentiated Adjacency Matrices

Eigenvalues of Exponentiated Adjacency Matrices Eigenvalues of Exponentiated Adjacency Matrices Tracy Powell Lesley A Ward, Advisor Estelle Basor, Reader November 21, 25 Department of Mathematics 2 Chapter Problem A graph G is made up of vertices, or

More information

Conceptual Questions for Review

Conceptual Questions for Review Conceptual Questions for Review Chapter 1 1.1 Which vectors are linear combinations of v = (3, 1) and w = (4, 3)? 1.2 Compare the dot product of v = (3, 1) and w = (4, 3) to the product of their lengths.

More information

Node and Link Analysis

Node and Link Analysis Node and Link Analysis Leonid E. Zhukov School of Applied Mathematics and Information Science National Research University Higher School of Economics 10.02.2014 Leonid E. Zhukov (HSE) Lecture 5 10.02.2014

More information

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data.

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data. Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two

More information

Web Structure Mining Nodes, Links and Influence

Web Structure Mining Nodes, Links and Influence Web Structure Mining Nodes, Links and Influence 1 Outline 1. Importance of nodes 1. Centrality 2. Prestige 3. Page Rank 4. Hubs and Authority 5. Metrics comparison 2. Link analysis 3. Influence model 1.

More information

Lecture 7 Mathematics behind Internet Search

Lecture 7 Mathematics behind Internet Search CCST907 Hidden Order in Daily Life: A Mathematical Perspective Lecture 7 Mathematics behind Internet Search Dr. S. P. Yung (907A) Dr. Z. Hua (907B) Department of Mathematics, HKU Outline Google is the

More information

PROBABILISTIC LATENT SEMANTIC ANALYSIS

PROBABILISTIC LATENT SEMANTIC ANALYSIS PROBABILISTIC LATENT SEMANTIC ANALYSIS Lingjia Deng Revised from slides of Shuguang Wang Outline Review of previous notes PCA/SVD HITS Latent Semantic Analysis Probabilistic Latent Semantic Analysis Applications

More information

1 Inner Product and Orthogonality

1 Inner Product and Orthogonality CSCI 4/Fall 6/Vora/GWU/Orthogonality and Norms Inner Product and Orthogonality Definition : The inner product of two vectors x and y, x x x =.., y =. x n y y... y n is denoted x, y : Note that n x, y =

More information

googling it: how google ranks search results Courtney R. Gibbons October 17, 2017

googling it: how google ranks search results Courtney R. Gibbons October 17, 2017 googling it: how google ranks search results Courtney R. Gibbons October 17, 2017 Definition (Relevance) (noun): the quality or state of being closely connected or appropriate: this film has contemporary

More information

IR: Information Retrieval

IR: Information Retrieval / 44 IR: Information Retrieval FIB, Master in Innovation and Research in Informatics Slides by Marta Arias, José Luis Balcázar, Ramon Ferrer-i-Cancho, Ricard Gavaldá Department of Computer Science, UPC

More information

Spectral Analysis of k-balanced Signed Graphs

Spectral Analysis of k-balanced Signed Graphs Spectral Analysis of k-balanced Signed Graphs Leting Wu 1, Xiaowei Ying 1, Xintao Wu 1, Aidong Lu 1 and Zhi-Hua Zhou 2 1 University of North Carolina at Charlotte, USA, {lwu8,xying, xwu,alu1}@uncc.edu

More information

6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search

6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search 6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search Daron Acemoglu and Asu Ozdaglar MIT September 30, 2009 1 Networks: Lecture 7 Outline Navigation (or decentralized search)

More information

Web Ranking. Classification (manual, automatic) Link Analysis (today s lesson)

Web Ranking. Classification (manual, automatic) Link Analysis (today s lesson) Link Analysis Web Ranking Documents on the web are first ranked according to their relevance vrs the query Additional ranking methods are needed to cope with huge amount of information Additional ranking

More information

Chapter 6. Eigenvalues. Josef Leydold Mathematical Methods WS 2018/19 6 Eigenvalues 1 / 45

Chapter 6. Eigenvalues. Josef Leydold Mathematical Methods WS 2018/19 6 Eigenvalues 1 / 45 Chapter 6 Eigenvalues Josef Leydold Mathematical Methods WS 2018/19 6 Eigenvalues 1 / 45 Closed Leontief Model In a closed Leontief input-output-model consumption and production coincide, i.e. V x = x

More information

22m:033 Notes: 7.1 Diagonalization of Symmetric Matrices

22m:033 Notes: 7.1 Diagonalization of Symmetric Matrices m:33 Notes: 7. Diagonalization of Symmetric Matrices Dennis Roseman University of Iowa Iowa City, IA http://www.math.uiowa.edu/ roseman May 3, Symmetric matrices Definition. A symmetric matrix is a matrix

More information

Parallel Singular Value Decomposition. Jiaxing Tan

Parallel Singular Value Decomposition. Jiaxing Tan Parallel Singular Value Decomposition Jiaxing Tan Outline What is SVD? How to calculate SVD? How to parallelize SVD? Future Work What is SVD? Matrix Decomposition Eigen Decomposition A (non-zero) vector

More information

Node Centrality and Ranking on Networks

Node Centrality and Ranking on Networks Node Centrality and Ranking on Networks Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Social

More information

Math 1553, Introduction to Linear Algebra

Math 1553, Introduction to Linear Algebra Learning goals articulate what students are expected to be able to do in a course that can be measured. This course has course-level learning goals that pertain to the entire course, and section-level

More information

Quadratic forms. Here. Thus symmetric matrices are diagonalizable, and the diagonalization can be performed by means of an orthogonal matrix.

Quadratic forms. Here. Thus symmetric matrices are diagonalizable, and the diagonalization can be performed by means of an orthogonal matrix. Quadratic forms 1. Symmetric matrices An n n matrix (a ij ) n ij=1 with entries on R is called symmetric if A T, that is, if a ij = a ji for all 1 i, j n. We denote by S n (R) the set of all n n symmetric

More information

1 Last time: least-squares problems

1 Last time: least-squares problems MATH Linear algebra (Fall 07) Lecture Last time: least-squares problems Definition. If A is an m n matrix and b R m, then a least-squares solution to the linear system Ax = b is a vector x R n such that

More information

series. Utilize the methods of calculus to solve applied problems that require computational or algebraic techniques..

series. Utilize the methods of calculus to solve applied problems that require computational or algebraic techniques.. 1 Use computational techniques and algebraic skills essential for success in an academic, personal, or workplace setting. (Computational and Algebraic Skills) MAT 203 MAT 204 MAT 205 MAT 206 Calculus I

More information

Link Analysis and Web Search

Link Analysis and Web Search Link Analysis and Web Search Episode 11 Baochun Li Professor Department of Electrical and Computer Engineering University of Toronto Link Analysis and Web Search (Chapter 13, 14) Information networks and

More information

LINEAR ALGEBRA 1, 2012-I PARTIAL EXAM 3 SOLUTIONS TO PRACTICE PROBLEMS

LINEAR ALGEBRA 1, 2012-I PARTIAL EXAM 3 SOLUTIONS TO PRACTICE PROBLEMS LINEAR ALGEBRA, -I PARTIAL EXAM SOLUTIONS TO PRACTICE PROBLEMS Problem (a) For each of the two matrices below, (i) determine whether it is diagonalizable, (ii) determine whether it is orthogonally diagonalizable,

More information

Web Ranking. Classification (manual, automatic) Link Analysis (today s lesson)

Web Ranking. Classification (manual, automatic) Link Analysis (today s lesson) Link Analysis Web Ranking Documents on the web are first ranked according to their relevance vrs the query Additional ranking methods are needed to cope with huge amount of information Additional ranking

More information

Spectral Graph Theory

Spectral Graph Theory Spectral Graph Theory Aaron Mishtal April 27, 2016 1 / 36 Outline Overview Linear Algebra Primer History Theory Applications Open Problems Homework Problems References 2 / 36 Outline Overview Linear Algebra

More information

Notes on Linear Algebra and Matrix Theory

Notes on Linear Algebra and Matrix Theory Massimo Franceschet featuring Enrico Bozzo Scalar product The scalar product (a.k.a. dot product or inner product) of two real vectors x = (x 1,..., x n ) and y = (y 1,..., y n ) is not a vector but a

More information

Calculating Web Page Authority Using the PageRank Algorithm

Calculating Web Page Authority Using the PageRank Algorithm Jacob Miles Prystowsky and Levi Gill Math 45, Fall 2005 1 Introduction 1.1 Abstract In this document, we examine how the Google Internet search engine uses the PageRank algorithm to assign quantitatively

More information

1. The Polar Decomposition

1. The Polar Decomposition A PERSONAL INTERVIEW WITH THE SINGULAR VALUE DECOMPOSITION MATAN GAVISH Part. Theory. The Polar Decomposition In what follows, F denotes either R or C. The vector space F n is an inner product space with

More information

Latent Semantic Analysis. Hongning Wang

Latent Semantic Analysis. Hongning Wang Latent Semantic Analysis Hongning Wang CS@UVa VS model in practice Document and query are represented by term vectors Terms are not necessarily orthogonal to each other Synonymy: car v.s. automobile Polysemy:

More information

Final Review Written by Victoria Kala SH 6432u Office Hours R 12:30 1:30pm Last Updated 11/30/2015

Final Review Written by Victoria Kala SH 6432u Office Hours R 12:30 1:30pm Last Updated 11/30/2015 Final Review Written by Victoria Kala vtkala@mathucsbedu SH 6432u Office Hours R 12:30 1:30pm Last Updated 11/30/2015 Summary This review contains notes on sections 44 47, 51 53, 61, 62, 65 For your final,

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University.

CS246: Mining Massive Datasets Jure Leskovec, Stanford University. CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu What is the structure of the Web? How is it organized? 2/7/2011 Jure Leskovec, Stanford C246: Mining Massive

More information

Principal Component Analysis

Principal Component Analysis Machine Learning Michaelmas 2017 James Worrell Principal Component Analysis 1 Introduction 1.1 Goals of PCA Principal components analysis (PCA) is a dimensionality reduction technique that can be used

More information

On the mathematical background of Google PageRank algorithm

On the mathematical background of Google PageRank algorithm Working Paper Series Department of Economics University of Verona On the mathematical background of Google PageRank algorithm Alberto Peretti, Alberto Roveda WP Number: 25 December 2014 ISSN: 2036-2919

More information

Bruce Hendrickson Discrete Algorithms & Math Dept. Sandia National Labs Albuquerque, New Mexico Also, CS Department, UNM

Bruce Hendrickson Discrete Algorithms & Math Dept. Sandia National Labs Albuquerque, New Mexico Also, CS Department, UNM Latent Semantic Analysis and Fiedler Retrieval Bruce Hendrickson Discrete Algorithms & Math Dept. Sandia National Labs Albuquerque, New Mexico Also, CS Department, UNM Informatics & Linear Algebra Eigenvectors

More information

Multimedia Databases - 68A6 Final Term - exercises

Multimedia Databases - 68A6 Final Term - exercises Multimedia Databases - 8A Final Term - exercises Exercises for the preparation to the final term June, the 1th 00 quiz 1. approximation of cosine similarity An approximate computation of the cosine similarity

More information

7. Symmetric Matrices and Quadratic Forms

7. Symmetric Matrices and Quadratic Forms Linear Algebra 7. Symmetric Matrices and Quadratic Forms CSIE NCU 1 7. Symmetric Matrices and Quadratic Forms 7.1 Diagonalization of symmetric matrices 2 7.2 Quadratic forms.. 9 7.4 The singular value

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu 2/7/2012 Jure Leskovec, Stanford C246: Mining Massive Datasets 2 Web pages are not equally important www.joe-schmoe.com

More information

Chapter 7: Symmetric Matrices and Quadratic Forms

Chapter 7: Symmetric Matrices and Quadratic Forms Chapter 7: Symmetric Matrices and Quadratic Forms (Last Updated: December, 06) These notes are derived primarily from Linear Algebra and its applications by David Lay (4ed). A few theorems have been moved

More information

MA 265 FINAL EXAM Fall 2012

MA 265 FINAL EXAM Fall 2012 MA 265 FINAL EXAM Fall 22 NAME: INSTRUCTOR S NAME:. There are a total of 25 problems. You should show work on the exam sheet, and pencil in the correct answer on the scantron. 2. No books, notes, or calculators

More information

Properties of Matrices and Operations on Matrices

Properties of Matrices and Operations on Matrices Properties of Matrices and Operations on Matrices A common data structure for statistical analysis is a rectangular array or matris. Rows represent individual observational units, or just observations,

More information

MATH 829: Introduction to Data Mining and Analysis Clustering II

MATH 829: Introduction to Data Mining and Analysis Clustering II his lecture is based on U. von Luxburg, A Tutorial on Spectral Clustering, Statistics and Computing, 17 (4), 2007. MATH 829: Introduction to Data Mining and Analysis Clustering II Dominique Guillot Departments

More information

Riemann Hypotheses. Alex J. Best 4/2/2014. WMS Talks

Riemann Hypotheses. Alex J. Best 4/2/2014. WMS Talks Riemann Hypotheses Alex J. Best WMS Talks 4/2/2014 In this talk: 1 Introduction 2 The original hypothesis 3 Zeta functions for graphs 4 More assorted zetas 5 Back to number theory 6 Conclusion The Riemann

More information

Introduction. Chapter One

Introduction. Chapter One Chapter One Introduction The aim of this book is to describe and explain the beautiful mathematical relationships between matrices, moments, orthogonal polynomials, quadrature rules and the Lanczos and

More information

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Web Search: How to Organize the Web? Ranking Nodes on Graphs Hubs and Authorities PageRank How to Solve PageRank

More information

Linear Algebra (part 1) : Vector Spaces (by Evan Dummit, 2017, v. 1.07) 1.1 The Formal Denition of a Vector Space

Linear Algebra (part 1) : Vector Spaces (by Evan Dummit, 2017, v. 1.07) 1.1 The Formal Denition of a Vector Space Linear Algebra (part 1) : Vector Spaces (by Evan Dummit, 2017, v. 1.07) Contents 1 Vector Spaces 1 1.1 The Formal Denition of a Vector Space.................................. 1 1.2 Subspaces...................................................

More information

Bounds on the Ratio of Eigenvalues

Bounds on the Ratio of Eigenvalues Bounds on the Ratio of Eigenvalues Tracy Powell Lesley A Ward, Advisor Estelle Basor, Reader December 0, 2005 Department of Mathematics Abstract In this paper, we find lower and upper bounds for the leading

More information

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra. DS-GA 1002 Lecture notes 0 Fall 2016 Linear Algebra These notes provide a review of basic concepts in linear algebra. 1 Vector spaces You are no doubt familiar with vectors in R 2 or R 3, i.e. [ ] 1.1

More information

Index. Copyright (c)2007 The Society for Industrial and Applied Mathematics From: Matrix Methods in Data Mining and Pattern Recgonition By: Lars Elden

Index. Copyright (c)2007 The Society for Industrial and Applied Mathematics From: Matrix Methods in Data Mining and Pattern Recgonition By: Lars Elden Index 1-norm, 15 matrix, 17 vector, 15 2-norm, 15, 59 matrix, 17 vector, 15 3-mode array, 91 absolute error, 15 adjacency matrix, 158 Aitken extrapolation, 157 algebra, multi-linear, 91 all-orthogonality,

More information

On the convergence of weighted-average consensus

On the convergence of weighted-average consensus On the convergence of weighted-average consensus Francisco Pedroche Miguel Rebollo Carlos Carrascosa Alberto Palomares arxiv:307.7562v [math.oc] 29 Jul 203 Abstract In this note we give sufficient conditions

More information

Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University.

Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University. Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University http://www.mmds.org #1: C4.5 Decision Tree - Classification (61 votes) #2: K-Means - Clustering

More information

Link Mining PageRank. From Stanford C246

Link Mining PageRank. From Stanford C246 Link Mining PageRank From Stanford C246 Broad Question: How to organize the Web? First try: Human curated Web dictionaries Yahoo, DMOZ LookSmart Second try: Web Search Information Retrieval investigates

More information

MATH36001 Perron Frobenius Theory 2015

MATH36001 Perron Frobenius Theory 2015 MATH361 Perron Frobenius Theory 215 In addition to saying something useful, the Perron Frobenius theory is elegant. It is a testament to the fact that beautiful mathematics eventually tends to be useful,

More information

0.1 Naive formulation of PageRank

0.1 Naive formulation of PageRank PageRank is a ranking system designed to find the best pages on the web. A webpage is considered good if it is endorsed (i.e. linked to) by other good webpages. The more webpages link to it, and the more

More information

1 T 1 = where 1 is the all-ones vector. For the upper bound, let v 1 be the eigenvector corresponding. u:(u,v) E v 1(u)

1 T 1 = where 1 is the all-ones vector. For the upper bound, let v 1 be the eigenvector corresponding. u:(u,v) E v 1(u) CME 305: Discrete Mathematics and Algorithms Instructor: Reza Zadeh (rezab@stanford.edu) Final Review Session 03/20/17 1. Let G = (V, E) be an unweighted, undirected graph. Let λ 1 be the maximum eigenvalue

More information

Dimensionality Reduction: PCA. Nicholas Ruozzi University of Texas at Dallas

Dimensionality Reduction: PCA. Nicholas Ruozzi University of Texas at Dallas Dimensionality Reduction: PCA Nicholas Ruozzi University of Texas at Dallas Eigenvalues λ is an eigenvalue of a matrix A R n n if the linear system Ax = λx has at least one non-zero solution If Ax = λx

More information

1.3 Convergence of Regular Markov Chains

1.3 Convergence of Regular Markov Chains Markov Chains and Random Walks on Graphs 3 Applying the same argument to A T, which has the same λ 0 as A, yields the row sum bounds Corollary 0 Let P 0 be the transition matrix of a regular Markov chain

More information

Nonnegative Matrices I

Nonnegative Matrices I Nonnegative Matrices I Daisuke Oyama Topics in Economic Theory September 26, 2017 References J. L. Stuart, Digraphs and Matrices, in Handbook of Linear Algebra, Chapter 29, 2006. R. A. Brualdi and H. J.

More information

Math 4153 Exam 3 Review. The syllabus for Exam 3 is Chapter 6 (pages ), Chapter 7 through page 137, and Chapter 8 through page 182 in Axler.

Math 4153 Exam 3 Review. The syllabus for Exam 3 is Chapter 6 (pages ), Chapter 7 through page 137, and Chapter 8 through page 182 in Axler. Math 453 Exam 3 Review The syllabus for Exam 3 is Chapter 6 (pages -2), Chapter 7 through page 37, and Chapter 8 through page 82 in Axler.. You should be sure to know precise definition of the terms we

More information

1. What is the determinant of the following matrix? a 1 a 2 4a 3 2a 2 b 1 b 2 4b 3 2b c 1. = 4, then det

1. What is the determinant of the following matrix? a 1 a 2 4a 3 2a 2 b 1 b 2 4b 3 2b c 1. = 4, then det What is the determinant of the following matrix? 3 4 3 4 3 4 4 3 A 0 B 8 C 55 D 0 E 60 If det a a a 3 b b b 3 c c c 3 = 4, then det a a 4a 3 a b b 4b 3 b c c c 3 c = A 8 B 6 C 4 D E 3 Let A be an n n matrix

More information

Validated computation tool for the Perron-Frobenius eigenvalue using graph theory

Validated computation tool for the Perron-Frobenius eigenvalue using graph theory Validated computation tool for the Perron-Frobenius eigenvalue using graph theory Kaori Nagatou Kyushu University, JAPAN [ Joint work with Yutaka Ishii, Kyushu University ] AGA Seminar in INI, March 13,

More information