Authoritative Sources in a Hyperlinked Enviroment
|
|
- Angelina Stevenson
- 5 years ago
- Views:
Transcription
1 Authoritative Sources in a Hyperlinked Enviroment Dimitris Sakavalas June 15, 2010
2 Outline Introduction Problem description Authorities and Broad-topic queries Algorithm outline Construction of a Focused Subgraph of the www Expanding R into the base set S Heuristics impoving base set S Computing Hubs and Authorities Iterative Algorithm Elements of Linear Algebra Convergence of the Iterative Algorithm Similar page queries Multiple Sets of Hubs and Authorities Non-Principal Eigenvectors and Clustering of G Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 2/25
3 Introduction Problem description www-search Discover pages relevant to a given query string Discover the most authoritative pages (subjective, requires human evaluation) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 3/25
4 Introduction Problem description www-search Discover pages relevant to a given query string Discover the most authoritative pages (subjective, requires human evaluation) Queries Specic queries Scarcity Problem Broad-topic queries Abundance Problem Similar-page queries Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 3/25
5 Introduction Authorities and Broad-topic queries In search of a denition of authority Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 4/25
6 Introduction Authorities and Broad-topic queries In search of a denition of authority No endogenous measure of a page to assess authority (ex. Text-based ranking functions for queries: \Harvard", \search engines", \automobile manufacturers", images) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 4/25
7 Introduction Authorities and Broad-topic queries In search of a denition of authority No endogenous measure of a page to assess authority (ex. Text-based ranking functions for queries: \Harvard", \search engines", \automobile manufacturers", images) Solution: Analysis of the link structure Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 4/25
8 Introduction Authorities and Broad-topic queries In search of a denition of authority No endogenous measure of a page to assess authority (ex. Text-based ranking functions for queries: \Harvard", \search engines", \automobile manufacturers", images) Solution: Analysis of the link structure Pitfalls: Advertising links Navigational links Universal Popularity links Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 4/25
9 Introduction Authorities and Broad-topic queries In search of a denition of authority No endogenous measure of a page to assess authority (ex. Text-based ranking functions for queries: \Harvard", \search engines", \automobile manufacturers", images) Solution: Analysis of the link structure Pitfalls: Advertising links Navigational links Universal Popularity links Question.How reliable is page p? Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 4/25
10 Introduction Algorithm outline Hubs Pages that link to many Authorities Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 5/25
11 Introduction Algorithm outline Hubs Pages that link to many Authorities Outline of the algorithm 1 Create a focused subgraph of the www from the output of a text-based engine (Small collection of pages likely to contain the most authoritative pages) 2 Identify hubs and authorities Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 5/25
12 Construction of a Focused Subgraph of the www Directed graph G = (V; E) of the www. V the collection of hyperlinked pages of the www V : (nodes correspond to the pages) (p; q) E There is a link from p to q G[W]. Subgraph of G induced on W V Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 6/25
13 Construction of a Focused Subgraph of the www Directed graph G = (V; E) of the www. V the collection of hyperlinked pages of the www V : (nodes correspond to the pages) (p; q) E There is a link from p to q G[W]. Subgraph of G induced on W V Goal Construct G = G[S ] such that 1 S is relatively small (computational cost) 2 S rich in relevant pages (search quality) 3 S contains most of the strongest authorities Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 6/25
14 Construction of a Focused Subgraph of the www Directed graph G = (V; E) of the www. V the collection of hyperlinked pages of the www V : (nodes correspond to the pages) (p; q) E There is a link from p to q G[W]. Subgraph of G induced on W V Goal Construct G = G[S ] such that 1 S is relatively small (computational cost) 2 S rich in relevant pages (search quality) 3 S contains most of the strongest authorities Observation S rich in relevant pages many links to authorities Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 6/25
15 Construction of a Focused Subgraph of the www Notation Γ + (p): the set of all pages p points to Γ (p): the set of all pages pointing to p E: a text-based search engine Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 7/25
16 Construction of a Focused Subgraph of the www Notation Γ + (p): the set of all pages p points to Γ (p): the set of all pages pointing to p E: a text-based search engine 1st Step Root set R ={t highest ranked pages for the query from E} R satises (i); (ii). Generally doesn't satisfy (iii) and is structureless (few links) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 7/25
17 Construction of a Focused Subgraph of the www Notation Γ + (p): the set of all pages p points to Γ (p): the set of all pages pointing to p E: a text-based search engine 1st Step Root set R ={t highest ranked pages for the query from E} R satises (i); (ii). Generally doesn't satisfy (iii) and is structureless (few links) Observation If a = R, strong authority, there is a high probability that p R such that a Γ + (p) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 7/25
18 Expanding R into the base set S 2nd Step Idea:Expand R into the Base set S by adding strong authorities and relevant pages Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 8/25
19 Expanding R into the base set S 2nd Step Idea:Expand R into the Base set S by adding strong authorities and relevant pages Subgraph(; E ; t ; d) t; d N Set S := R For each page p R Add all pages in Γ + (p) to S If Γ (p) d then, Add all pages in Γ (p) to S Else Add an arbitary set of d pages from Γ (p) to S End. Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 8/25
20 Expanding R into the base set S Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 9/25
21 Heuristics impoving S Links Transverse: link between pages with dierent domain name Intrinsic: link between pages with the same domain name (navigational) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 10/25
22 Heuristics impoving S Links Transverse: link between pages with dierent domain name Intrinsic: link between pages with the same domain name (navigational) 3rd Step:Heuristics 1 Delete all Intrinsic links from the graph G 2 Allow up to a small number m pages from a single domain to point to any given page p Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 10/25
23 Heuristics impoving S Links Transverse: link between pages with dierent domain name Intrinsic: link between pages with the same domain name (navigational) 3rd Step:Heuristics 1 Delete all Intrinsic links from the graph G 2 Allow up to a small number m pages from a single domain to point to any given page p Finally we obtain a small subgraph G, relatively focused on the query, containing many relevant pages and strong authorities. Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 10/25
24 Hubs and Authorities Purely In-degree ordering of Authorities Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 11/25
25 Hubs and Authorities Purely In-degree ordering of Authorities Problem. \Universally popular" pages, large in-degree regardless of the underlying query topic (ex.amazon Books) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 11/25
26 Hubs and Authorities Purely In-degree ordering of Authorities Problem. \Universally popular" pages, large in-degree regardless of the underlying query topic (ex.amazon Books) Observation Authorities: high in-degree, considerable overlap in the sets of pages that point to them(hubs) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 11/25
27 Hubs and Authorities Observation (mutually reinforcing relationship) Good Hub: a page that points to many good authorities Good Authority: a page that is pointed by many good hubs Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 12/25
28 Hubs and Authorities Observation (mutually reinforcing relationship) Good Hub: a page that points to many good authorities Good Authority: a page that is pointed by many good hubs Denitions p S = {1; 2; :::; n} assign: authority weight x <p> 0 hub weight y <p> 0 Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 12/25
29 Hubs and Authorities Observation (mutually reinforcing relationship) Good Hub: a page that points to many good authorities Good Authority: a page that is pointed by many good hubs Denitions p S = {1; 2; :::; n} assign: authority weight x <p> 0 hub weight y <p> 0 corresponding vectors x = x <1> x <2>. x <n> and y = y <1> y <2>. y <n> Normalized such that x = y = 1 example.for norm 1 ; n i=1 x <i> = n i=1 y <i> = 1 Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 12/25
30 Hubs and Authorities Denitions I(x ; y) (x -weight update): x <p> ; x <p> q:(q;p) E y <q> Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 13/25
31 Hubs and Authorities Denitions I(x ; y) (x -weight update): x <p> ; x <p> q:(q;p) E O(x ; y) (y-weight update): y <p> ; y <p> q:(p;q) E y <q> x <q> ; Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 13/25
32 Hubs and Authorities Denitions I(x ; y) (x -weight update): x <p> ; x <p> q:(q;p) E O(x ; y) (y-weight update): y <p> ; y <p> q:(p;q) E y <q> x <q> ; Idea Find the disired \equilibrium" values for the weights by applying I; O operations iterativelly. Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 13/25
33 Hubs and Authorities Denitions I(x ; y) (x -weight update): x <p> ; x <p> q:(q;p) E O(x ; y) (y-weight update): y <p> ; y <p> q:(p;q) E y <q> x <q> ; Idea Find the disired \equilibrium" values for the weights by applying I; O operations iterativelly. Question. Does this \equilibrium" exist? Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 13/25
34 Hubs and Authorities Denitions I(x ; y) (x -weight update): x <p> ; x <p> q:(q;p) E O(x ; y) (y-weight update): y <p> ; y <p> q:(p;q) E y <q> x <q> ; Idea Find the disired \equilibrium" values for the weights by applying I; O operations iterativelly. Question. Does this \equilibrium" exist? YES (linear algebra) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 13/25
35 Iterative algorithm Iterate(G ; k) z := (1; 1; :::; 1) R n Set x 0 := z Set y 0 := z For i = 1; 2; :::; k Apply the I(x i 1 ; y i 1 ) operation to obtain new x weights x i Apply the O(x i ; y i 1) operation to obtain new y weights y i Normalize x i, obtaining x i Normalize y i, obtaining y i End Return (x k ; y k ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 14/25
36 Iterative algorithm Iterate(G ; k) z := (1; 1; :::; 1) R n Set x 0 := z Set y 0 := z For i = 1; 2; :::; k Apply the I(x i 1 ; y i 1 ) operation to obtain new x weights x i Apply the O(x i ; y i 1) operation to obtain new y weights y i Normalize x i, obtaining x i Normalize y i, obtaining y i End Return (x k ; y k ) Filter(G ; k ; c) (x k ; y k ) :=Iterate(G ; k) Report the pages with the c largest coordinates in x k as authorities Report the pages with the c largest coordinates in y k as hubs Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 14/25
37 Elements of Linear Algebra Notation Matrix M C n n Eigenvalues i (M ) Eigenvectors! i (M ) eigenspace V subspace of C n multiplicity of : m = dim(v ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 15/25
38 Elements of Linear Algebra Notation Matrix M C n n Eigenvalues i (M ) Eigenvectors! i (M ) eigenspace V subspace of C n multiplicity of : m = dim(v ) Denitions Positive matrix M : If M i;j > 0; i ; j Primitive matrix M : If m N; M m : positive Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 15/25
39 Elements of Linear Algebra Let M R n n symmetric, nonnegative matrix (M i;j 0; i ; j ) Theorems 1 M has at most n distinct eigenvalues and = n m i Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 16/25
40 Elements of Linear Algebra Let M R n n symmetric, nonnegative matrix (M i;j 0; i ; j ) Theorems 1 M has at most n distinct eigenvalues and m i = n 2 M has only real eigenvalues i Denote them 1 (M ) 2 (M ) ::: n (M ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 16/25
41 Elements of Linear Algebra Let M R n n symmetric, nonnegative matrix (M i;j 0; i ; j ) Theorems 1 M has at most n distinct eigenvalues and m i = n 2 M has only real eigenvalues i Denote them 1 (M ) 2 (M ) ::: n (M ) 3 (Perron-Frobenius) If M is primitive then: i. Largest eigenvalue 1 (M ) > 0 and m = 1 ii. 1 > i i 1 iii. 1 has a corresponding eigenvector! 1 (M ) with all entries positive (the principal eigenvector) Consider! 1 (M ) normalized,! 1 (M ) = 1 Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 16/25
42 Elements of Linear Algebra Let M R n n symmetric, nonnegative matrix (M i;j 0; i ; j ) Theorems 1 M has at most n distinct eigenvalues and m i = n 2 M has only real eigenvalues i Denote them 1 (M ) 2 (M ) ::: n (M ) 3 (Perron-Frobenius) If M is primitive then: i. Largest eigenvalue 1 (M ) > 0 and m = 1 ii. 1 > i i 1 iii. 1 has a corresponding eigenvector! 1 (M ) with all entries positive (the principal eigenvector) Consider! 1 (M ) normalized,! 1 (M ) = 1 4 If M is primitive and vector v not orthogonal to the principal eigenvector! 1 (M ), let v k be the unit vector in the direction of M k k v. Then v k! 1 (M ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 16/25
43 Elements of Linear Algebra Observation Note that we can consider! 1 (M ) as a vector of the orthonormal base of V 1. Moreover dimv 1 = m 1 = 1 and all entries in! 1 (M ) are positive. Hence the principal eigenvector! 1 (M ) is unique. Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 17/25
44 Connection of Iterative algorithm to linear Algebra Let A be the adjacency matrix of G Observation Operations I(x ; y) and O(x ; y) can be written as x A y and y Ax respectivelly Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 18/25
45 Connection of Iterative algorithm to linear Algebra Let A be the adjacency matrix of G Observation Operations I(x ; y) and O(x ; y) can be written as x A y and y Ax respectivelly Iterate Algorithm The algorithm initializes x 0 = y 0 = z =. 1 step of thalgorithm we have the following steps: I(x i ; y i 1) x i = A y i 1 O(x i ; y i ) y i = Ax i 1. In the and i th Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 18/25
46 Connection of Iterative algorithm to linear Algebra Let A be the adjacency matrix of G Observation Operations I(x ; y) and O(x ; y) can be written as x A y and y Ax respectivelly Iterate Algorithm The algorithm initializes x 0 = y 0 = z =. 1 step of thalgorithm we have the following steps: I(x i ; y i 1) x i = A y i 1 1. In the and i th O(x i ; y i ) y i = Ax i x i = A Ax i 1 = (A A) i x 0 y i = AA y i 1 = (AA ) i y 0 } x i = (A A) i z y i = (AA ) i z ( ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 18/25
47 Convergence of the Iterative Algorithm Observation We consider x i ; y i normalized in each step as in Iterate algorithm. So in every step x i = y i = 1 For simplicity, matrices AA and A A are considered primitive Obviously z c 0; c R n 0 Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 19/25
48 Convergence of the Iterative Algorithm Observation We consider x i ; y i normalized in each step as in Iterate algorithm. So in every step x i = y i = 1 For simplicity, matrices AA and A A are considered primitive Obviously z c 0; c R n 0 Theorem The sequences {x i } and {y i } produced by Iterate Algorithm converge to limits x ; y respectively Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 19/25
49 Convergence of the Iterative Algorithm Observation We consider x i ; y i normalized in each step as in Iterate algorithm. So in every step x i = y i = 1 For simplicity, matrices AA and A A are considered primitive Obviously z c 0; c R n 0 Theorem The sequences {x i } and {y i } produced by Iterate Algorithm converge to limits x ; y respectively Proof. A A primitive, z not orthogonal to the principal eigenvector! 1 (A A) = x, x i the unit vector in the direction (A A) i z Hence from Theorem 4 {x i } x, simirarly {y i } y =! 1 (AA ) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 19/25
50 Conclusion Conclusion The process of iterated I=O operations will converge. The desired equilibrium values for the x ; y weights, are the values x ; y. Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 20/25
51 Conclusion Conclusion The process of iterated I=O operations will converge. The desired equilibrium values for the x ; y weights, are the values x ; y. Observation In practice the convergence of Iterate is quite rapid, so one can compute x ; y weights by starting from any initial vectors x 0 ; y 0, and performing a xed number of I; O operations Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 20/25
52 Conclusion Conclusion The process of iterated I=O operations will converge. The desired equilibrium values for the x ; y weights, are the values x ; y. Observation In practice the convergence of Iterate is quite rapid, so one can compute x ; y weights by starting from any initial vectors x 0 ; y 0, and performing a xed number of I; O operations Hyperlink-Induced Topic Search (HITS) algorithm (Ask.com) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 20/25
53 Similar-Page Queries Query. Find pages similar to p Idea Initiate search with the page p instead of a query string Use the link structure to infer \similarity among pages" Similar pages to p Strongest authorities in local region of p Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 21/25
54 Similar-Page Queries Query. Find pages similar to p Idea Initiate search with the page p instead of a query string Use the link structure to infer \similarity among pages" Similar pages to p Strongest authorities in local region of p Adaptation of the broad-topic query method 1 Find t pages pointing to p Assemble root set R p 2 Expand R p to base set S p as before 3 Search for authorities and hubs in the focused subgraph G p Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 21/25
55 Multiple Sets of Hubs and Authorities Situation. For a query string, relevant pages grouped into clusters. Reasons Dierent meanings of. ex. \jaguar\ String arises as term in multiple technical communities. ex. \randomized algorithms" String refers to a highly polarized issue. ex.\abortion" Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 22/25
56 Multiple Sets of Hubs and Authorities Situation. For a query string, relevant pages grouped into clusters. Reasons Dierent meanings of. ex. \jaguar\ String arises as term in multiple technical communities. ex. \randomized algorithms" String refers to a highly polarized issue. ex.\abortion" Problem. Identify the clusters Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 22/25
57 Multiple Sets of Hubs and Authorities Situation. For a query string, relevant pages grouped into clusters. Reasons Dierent meanings of. ex. \jaguar\ String arises as term in multiple technical communities. ex. \randomized algorithms" String refers to a highly polarized issue. ex.\abortion" Problem. Identify the clusters Idea Clusters represented in focused subgraph G as densely linked subgraphs(collection of hubs and authorities) Extract densely linked collections of hubs and authorities through non-principal eigenvectors of AA and A A Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 22/25
58 Non-Principal Eigenvectors and Clustering of G Proposition AA and A A have the same multiset of eigenvalues, and their eigenvectors can be chosen so that! i (AA ) = A! i (A A) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 23/25
59 Non-Principal Eigenvectors and Clustering of G Proposition AA and A A have the same multiset of eigenvalues, and their eigenvectors can be chosen so that! i (AA ) = A! i (A A) Observation For each pair of eigenvectors x i =! i (A A); y i =! i (AA ) operation I(x i ; y i ) keeps x weights parallel to x i : x = A! i (AA ) = A A! i (A A) = i x i Simirarly operation O(x i ; y i ) keeps x weights parallel to y i Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 23/25
60 Non-Principal Eigenvectors and Clustering of G Proposition AA and A A have the same multiset of eigenvalues, and their eigenvectors can be chosen so that! i (AA ) = A! i (A A) Observation For each pair of eigenvectors x i =! i (A A); y i =! i (AA ) operation I(x i ; y i ) keeps x weights parallel to x i : x = A! i (AA ) = A A! i (A A) = i x i Simirarly operation O(x i ; y i ) keeps x weights parallel to y i Non-principal eigenvectors have both positive and negative entries. Hence each pair of (x i ; y i ) provide us with two densely connected set of hubs and authorities Pages that correspond to the c coordinates with the most positive values Pages that correspond to the c coordinates with the most negative values Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 23/25
61 Non-Principal Eigenvectors and Clustering of G Description of the method For each of the st few non-principal eigenvectors (x i ; y i ) nd the two densely connected set of hubs and authorities through an algorithm similar(but less clean conceptually) to the Iterate algorithm Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 24/25
62 Non-Principal Eigenvectors and Clustering of G Description of the method For each of the st few non-principal eigenvectors (x i ; y i ) nd the two densely connected set of hubs and authorities through an algorithm similar(but less clean conceptually) to the Iterate algorithm The pages with large coordinates in the rst few nonprincipal eigenvectors tend to recur, so that essentially the same collection of hubs and authorities will often be generated by several of the strongest nonprincipal eigenvectors Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 24/25
63 Similar Concepts in Other Areas Social networks \Standing" Bibliometrics \Impact" Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 25/25
64 Similar Concepts in Other Areas Social networks \Standing" Bibliometrics \Impact" Economics (Wassily Leontief 1941-Nobel prize 1973) Dimitris Sakavalas, Authoritative Sources in a Hyperlinked Enviroment 25/25
Data Mining and Matrices
Data Mining and Matrices 10 Graphs II Rainer Gemulla, Pauli Miettinen Jul 4, 2013 Link analysis The web as a directed graph Set of web pages with associated textual content Hyperlinks between webpages
More information1 Searching the World Wide Web
Hubs and Authorities in a Hyperlinked Environment 1 Searching the World Wide Web Because diverse users each modify the link structure of the WWW within a relatively small scope by creating web-pages on
More informationLink Analysis Ranking
Link Analysis Ranking How do search engines decide how to rank your query results? Guess why Google ranks the query results the way it does How would you do it? Naïve ranking of query results Given query
More informationIntroduction to Search Engine Technology Introduction to Link Structure Analysis. Ronny Lempel Yahoo Labs, Haifa
Introduction to Search Engine Technology Introduction to Link Structure Analysis Ronny Lempel Yahoo Labs, Haifa Outline Anchor-text indexing Mathematical Background Motivation for link structure analysis
More informationCS 3750 Advanced Machine Learning. Applications of SVD and PCA (LSA and Link analysis) Cem Akkaya
CS 375 Advanced Machine Learning Applications of SVD and PCA (LSA and Link analysis) Cem Akkaya Outline SVD and LSI Kleinberg s Algorithm PageRank Algorithm Vector Space Model Vector space model represents
More informationCS224W: Social and Information Network Analysis Jure Leskovec, Stanford University
CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu How to organize/navigate it? First try: Human curated Web directories Yahoo, DMOZ, LookSmart
More informationHyperlinked-Induced Topic Search (HITS) identifies. authorities as good content sources (~high indegree) HITS [Kleinberg 99] considers a web page
IV.3 HITS Hyperlinked-Induced Topic Search (HITS) identifies authorities as good content sources (~high indegree) hubs as good link sources (~high outdegree) HITS [Kleinberg 99] considers a web page a
More informationA Note on Google s PageRank
A Note on Google s PageRank According to Google, google-search on a given topic results in a listing of most relevant web pages related to the topic. Google ranks the importance of webpages according to
More informationOnline Social Networks and Media. Link Analysis and Web Search
Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information
More informationLink Analysis. Leonid E. Zhukov
Link Analysis Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Structural Analysis and Visualization
More informationLink Analysis. Reference: Introduction to Information Retrieval by C. Manning, P. Raghavan, H. Schutze
Link Analysis Reference: Introduction to Information Retrieval by C. Manning, P. Raghavan, H. Schutze 1 The Web as a Directed Graph Page A Anchor hyperlink Page B Assumption 1: A hyperlink between pages
More informationFaloutsos, Tong ICDE, 2009
Large Graph Mining: Patterns, Tools and Case Studies Christos Faloutsos Hanghang Tong CMU Copyright: Faloutsos, Tong (29) 2-1 Outline Part 1: Patterns Part 2: Matrix and Tensor Tools Part 3: Proximity
More informationLab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018
Lab 8: Measuring Graph Centrality - PageRank Monday, November 5 CompSci 531, Fall 2018 Outline Measuring Graph Centrality: Motivation Random Walks, Markov Chains, and Stationarity Distributions Google
More informationDATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS
DATA MINING LECTURE 3 Link Analysis Ranking PageRank -- Random walks HITS How to organize the web First try: Manually curated Web Directories How to organize the web Second try: Web Search Information
More informationWiki Definition. Reputation Systems I. Outline. Introduction to Reputations. Yury Lifshits. HITS, PageRank, SALSA, ebay, EigenTrust, VKontakte
Reputation Systems I HITS, PageRank, SALSA, ebay, EigenTrust, VKontakte Yury Lifshits Wiki Definition Reputation is the opinion (more technically, a social evaluation) of the public toward a person, a
More informationCS 277: Data Mining. Mining Web Link Structure. CS 277: Data Mining Lectures Analyzing Web Link Structure Padhraic Smyth, UC Irvine
CS 277: Data Mining Mining Web Link Structure Class Presentations In-class, Tuesday and Thursday next week 2-person teams: 6 minutes, up to 6 slides, 3 minutes/slides each person 1-person teams 4 minutes,
More informationHow does Google rank webpages?
Linear Algebra Spring 016 How does Google rank webpages? Dept. of Internet and Multimedia Eng. Konkuk University leehw@konkuk.ac.kr 1 Background on search engines Outline HITS algorithm (Jon Kleinberg)
More informationApplications to network analysis: Eigenvector centrality indices Lecture notes
Applications to network analysis: Eigenvector centrality indices Lecture notes Dario Fasino, University of Udine (Italy) Lecture notes for the second part of the course Nonnegative and spectral matrix
More informationDetailed Proof of The PerronFrobenius Theorem
Detailed Proof of The PerronFrobenius Theorem Arseny M Shur Ural Federal University October 30, 2016 1 Introduction This famous theorem has numerous applications, but to apply it you should understand
More informationSpectra of Adjacency and Laplacian Matrices
Spectra of Adjacency and Laplacian Matrices Definition: University of Alicante (Spain) Matrix Computing (subject 3168 Degree in Maths) 30 hours (theory)) + 15 hours (practical assignment) Contents 1. Spectra
More informationOnline Social Networks and Media. Link Analysis and Web Search
Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information
More informationSVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices)
Chapter 14 SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Today we continue the topic of low-dimensional approximation to datasets and matrices. Last time we saw the singular
More information1 Matrix notation and preliminaries from spectral graph theory
Graph clustering (or community detection or graph partitioning) is one of the most studied problems in network analysis. One reason for this is that there are a variety of ways to define a cluster or community.
More information6.207/14.15: Networks Lectures 4, 5 & 6: Linear Dynamics, Markov Chains, Centralities
6.207/14.15: Networks Lectures 4, 5 & 6: Linear Dynamics, Markov Chains, Centralities 1 Outline Outline Dynamical systems. Linear and Non-linear. Convergence. Linear algebra and Lyapunov functions. Markov
More informationSTA141C: Big Data & High Performance Statistical Computing
STA141C: Big Data & High Performance Statistical Computing Lecture 6: Numerical Linear Algebra: Applications in Machine Learning Cho-Jui Hsieh UC Davis April 27, 2017 Principal Component Analysis Principal
More informationChapter 3 Transformations
Chapter 3 Transformations An Introduction to Optimization Spring, 2014 Wei-Ta Chu 1 Linear Transformations A function is called a linear transformation if 1. for every and 2. for every If we fix the bases
More informationLecture 15 Perron-Frobenius Theory
EE363 Winter 2005-06 Lecture 15 Perron-Frobenius Theory Positive and nonnegative matrices and vectors Perron-Frobenius theorems Markov chains Economic growth Population dynamics Max-min and min-max characterization
More informationFiedler s Theorems on Nodal Domains
Spectral Graph Theory Lecture 7 Fiedler s Theorems on Nodal Domains Daniel A. Spielman September 19, 2018 7.1 Overview In today s lecture we will justify some of the behavior we observed when using eigenvectors
More informationGoogle Page Rank Project Linear Algebra Summer 2012
Google Page Rank Project Linear Algebra Summer 2012 How does an internet search engine, like Google, work? In this project you will discover how the Page Rank algorithm works to give the most relevant
More informationMultiRank and HAR for Ranking Multi-relational Data, Transition Probability Tensors, and Multi-Stochastic Tensors
MultiRank and HAR for Ranking Multi-relational Data, Transition Probability Tensors, and Multi-Stochastic Tensors Michael K. Ng Centre for Mathematical Imaging and Vision and Department of Mathematics
More informationMachine Learning. Principal Components Analysis. Le Song. CSE6740/CS7641/ISYE6740, Fall 2012
Machine Learning CSE6740/CS7641/ISYE6740, Fall 2012 Principal Components Analysis Le Song Lecture 22, Nov 13, 2012 Based on slides from Eric Xing, CMU Reading: Chap 12.1, CB book 1 2 Factor or Component
More information(a) II and III (b) I (c) I and III (d) I and II and III (e) None are true.
1 Which of the following statements is always true? I The null space of an m n matrix is a subspace of R m II If the set B = {v 1,, v n } spans a vector space V and dimv = n, then B is a basis for V III
More informationComplex Social System, Elections. Introduction to Network Analysis 1
Complex Social System, Elections Introduction to Network Analysis 1 Complex Social System, Network I person A voted for B A is more central than B if more people voted for A In-degree centrality index
More informationECEN 689 Special Topics in Data Science for Communications Networks
ECEN 689 Special Topics in Data Science for Communications Networks Nick Duffield Department of Electrical & Computer Engineering Texas A&M University Lecture 8 Random Walks, Matrices and PageRank Graphs
More informationComplex Networks CSYS/MATH 303, Spring, Prof. Peter Dodds
Complex Networks CSYS/MATH 303, Spring, 2011 Prof. Peter Dodds Department of Mathematics & Statistics Center for Complex Systems Vermont Advanced Computing Center University of Vermont Licensed under the
More informationLinear Algebra: Characteristic Value Problem
Linear Algebra: Characteristic Value Problem . The Characteristic Value Problem Let < be the set of real numbers and { be the set of complex numbers. Given an n n real matrix A; does there exist a number
More informationCSI 445/660 Part 6 (Centrality Measures for Networks) 6 1 / 68
CSI 445/660 Part 6 (Centrality Measures for Networks) 6 1 / 68 References 1 L. Freeman, Centrality in Social Networks: Conceptual Clarification, Social Networks, Vol. 1, 1978/1979, pp. 215 239. 2 S. Wasserman
More informationLecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora
princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora Scribe: Today we continue the
More informationLink Analysis Information Retrieval and Data Mining. Prof. Matteo Matteucci
Link Analysis Information Retrieval and Data Mining Prof. Matteo Matteucci Hyperlinks for Indexing and Ranking 2 Page A Hyperlink Page B Intuitions The anchor text might describe the target page B Anchor
More informationLinear algebra and applications to graphs Part 1
Linear algebra and applications to graphs Part 1 Written up by Mikhail Belkin and Moon Duchin Instructor: Laszlo Babai June 17, 2001 1 Basic Linear Algebra Exercise 1.1 Let V and W be linear subspaces
More informationEigenvalues of Exponentiated Adjacency Matrices
Eigenvalues of Exponentiated Adjacency Matrices Tracy Powell Lesley A Ward, Advisor Estelle Basor, Reader November 21, 25 Department of Mathematics 2 Chapter Problem A graph G is made up of vertices, or
More informationConceptual Questions for Review
Conceptual Questions for Review Chapter 1 1.1 Which vectors are linear combinations of v = (3, 1) and w = (4, 3)? 1.2 Compare the dot product of v = (3, 1) and w = (4, 3) to the product of their lengths.
More informationNode and Link Analysis
Node and Link Analysis Leonid E. Zhukov School of Applied Mathematics and Information Science National Research University Higher School of Economics 10.02.2014 Leonid E. Zhukov (HSE) Lecture 5 10.02.2014
More informationStructure in Data. A major objective in data analysis is to identify interesting features or structure in the data.
Structure in Data A major objective in data analysis is to identify interesting features or structure in the data. The graphical methods are very useful in discovering structure. There are basically two
More informationWeb Structure Mining Nodes, Links and Influence
Web Structure Mining Nodes, Links and Influence 1 Outline 1. Importance of nodes 1. Centrality 2. Prestige 3. Page Rank 4. Hubs and Authority 5. Metrics comparison 2. Link analysis 3. Influence model 1.
More informationLecture 7 Mathematics behind Internet Search
CCST907 Hidden Order in Daily Life: A Mathematical Perspective Lecture 7 Mathematics behind Internet Search Dr. S. P. Yung (907A) Dr. Z. Hua (907B) Department of Mathematics, HKU Outline Google is the
More informationPROBABILISTIC LATENT SEMANTIC ANALYSIS
PROBABILISTIC LATENT SEMANTIC ANALYSIS Lingjia Deng Revised from slides of Shuguang Wang Outline Review of previous notes PCA/SVD HITS Latent Semantic Analysis Probabilistic Latent Semantic Analysis Applications
More information1 Inner Product and Orthogonality
CSCI 4/Fall 6/Vora/GWU/Orthogonality and Norms Inner Product and Orthogonality Definition : The inner product of two vectors x and y, x x x =.., y =. x n y y... y n is denoted x, y : Note that n x, y =
More informationgoogling it: how google ranks search results Courtney R. Gibbons October 17, 2017
googling it: how google ranks search results Courtney R. Gibbons October 17, 2017 Definition (Relevance) (noun): the quality or state of being closely connected or appropriate: this film has contemporary
More informationIR: Information Retrieval
/ 44 IR: Information Retrieval FIB, Master in Innovation and Research in Informatics Slides by Marta Arias, José Luis Balcázar, Ramon Ferrer-i-Cancho, Ricard Gavaldá Department of Computer Science, UPC
More informationSpectral Analysis of k-balanced Signed Graphs
Spectral Analysis of k-balanced Signed Graphs Leting Wu 1, Xiaowei Ying 1, Xintao Wu 1, Aidong Lu 1 and Zhi-Hua Zhou 2 1 University of North Carolina at Charlotte, USA, {lwu8,xying, xwu,alu1}@uncc.edu
More information6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search
6.207/14.15: Networks Lecture 7: Search on Networks: Navigation and Web Search Daron Acemoglu and Asu Ozdaglar MIT September 30, 2009 1 Networks: Lecture 7 Outline Navigation (or decentralized search)
More informationWeb Ranking. Classification (manual, automatic) Link Analysis (today s lesson)
Link Analysis Web Ranking Documents on the web are first ranked according to their relevance vrs the query Additional ranking methods are needed to cope with huge amount of information Additional ranking
More informationChapter 6. Eigenvalues. Josef Leydold Mathematical Methods WS 2018/19 6 Eigenvalues 1 / 45
Chapter 6 Eigenvalues Josef Leydold Mathematical Methods WS 2018/19 6 Eigenvalues 1 / 45 Closed Leontief Model In a closed Leontief input-output-model consumption and production coincide, i.e. V x = x
More information22m:033 Notes: 7.1 Diagonalization of Symmetric Matrices
m:33 Notes: 7. Diagonalization of Symmetric Matrices Dennis Roseman University of Iowa Iowa City, IA http://www.math.uiowa.edu/ roseman May 3, Symmetric matrices Definition. A symmetric matrix is a matrix
More informationParallel Singular Value Decomposition. Jiaxing Tan
Parallel Singular Value Decomposition Jiaxing Tan Outline What is SVD? How to calculate SVD? How to parallelize SVD? Future Work What is SVD? Matrix Decomposition Eigen Decomposition A (non-zero) vector
More informationNode Centrality and Ranking on Networks
Node Centrality and Ranking on Networks Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Social
More informationMath 1553, Introduction to Linear Algebra
Learning goals articulate what students are expected to be able to do in a course that can be measured. This course has course-level learning goals that pertain to the entire course, and section-level
More informationQuadratic forms. Here. Thus symmetric matrices are diagonalizable, and the diagonalization can be performed by means of an orthogonal matrix.
Quadratic forms 1. Symmetric matrices An n n matrix (a ij ) n ij=1 with entries on R is called symmetric if A T, that is, if a ij = a ji for all 1 i, j n. We denote by S n (R) the set of all n n symmetric
More information1 Last time: least-squares problems
MATH Linear algebra (Fall 07) Lecture Last time: least-squares problems Definition. If A is an m n matrix and b R m, then a least-squares solution to the linear system Ax = b is a vector x R n such that
More informationseries. Utilize the methods of calculus to solve applied problems that require computational or algebraic techniques..
1 Use computational techniques and algebraic skills essential for success in an academic, personal, or workplace setting. (Computational and Algebraic Skills) MAT 203 MAT 204 MAT 205 MAT 206 Calculus I
More informationLink Analysis and Web Search
Link Analysis and Web Search Episode 11 Baochun Li Professor Department of Electrical and Computer Engineering University of Toronto Link Analysis and Web Search (Chapter 13, 14) Information networks and
More informationLINEAR ALGEBRA 1, 2012-I PARTIAL EXAM 3 SOLUTIONS TO PRACTICE PROBLEMS
LINEAR ALGEBRA, -I PARTIAL EXAM SOLUTIONS TO PRACTICE PROBLEMS Problem (a) For each of the two matrices below, (i) determine whether it is diagonalizable, (ii) determine whether it is orthogonally diagonalizable,
More informationWeb Ranking. Classification (manual, automatic) Link Analysis (today s lesson)
Link Analysis Web Ranking Documents on the web are first ranked according to their relevance vrs the query Additional ranking methods are needed to cope with huge amount of information Additional ranking
More informationSpectral Graph Theory
Spectral Graph Theory Aaron Mishtal April 27, 2016 1 / 36 Outline Overview Linear Algebra Primer History Theory Applications Open Problems Homework Problems References 2 / 36 Outline Overview Linear Algebra
More informationNotes on Linear Algebra and Matrix Theory
Massimo Franceschet featuring Enrico Bozzo Scalar product The scalar product (a.k.a. dot product or inner product) of two real vectors x = (x 1,..., x n ) and y = (y 1,..., y n ) is not a vector but a
More informationCalculating Web Page Authority Using the PageRank Algorithm
Jacob Miles Prystowsky and Levi Gill Math 45, Fall 2005 1 Introduction 1.1 Abstract In this document, we examine how the Google Internet search engine uses the PageRank algorithm to assign quantitatively
More information1. The Polar Decomposition
A PERSONAL INTERVIEW WITH THE SINGULAR VALUE DECOMPOSITION MATAN GAVISH Part. Theory. The Polar Decomposition In what follows, F denotes either R or C. The vector space F n is an inner product space with
More informationLatent Semantic Analysis. Hongning Wang
Latent Semantic Analysis Hongning Wang CS@UVa VS model in practice Document and query are represented by term vectors Terms are not necessarily orthogonal to each other Synonymy: car v.s. automobile Polysemy:
More informationFinal Review Written by Victoria Kala SH 6432u Office Hours R 12:30 1:30pm Last Updated 11/30/2015
Final Review Written by Victoria Kala vtkala@mathucsbedu SH 6432u Office Hours R 12:30 1:30pm Last Updated 11/30/2015 Summary This review contains notes on sections 44 47, 51 53, 61, 62, 65 For your final,
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University.
CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu What is the structure of the Web? How is it organized? 2/7/2011 Jure Leskovec, Stanford C246: Mining Massive
More informationPrincipal Component Analysis
Machine Learning Michaelmas 2017 James Worrell Principal Component Analysis 1 Introduction 1.1 Goals of PCA Principal components analysis (PCA) is a dimensionality reduction technique that can be used
More informationOn the mathematical background of Google PageRank algorithm
Working Paper Series Department of Economics University of Verona On the mathematical background of Google PageRank algorithm Alberto Peretti, Alberto Roveda WP Number: 25 December 2014 ISSN: 2036-2919
More informationBruce Hendrickson Discrete Algorithms & Math Dept. Sandia National Labs Albuquerque, New Mexico Also, CS Department, UNM
Latent Semantic Analysis and Fiedler Retrieval Bruce Hendrickson Discrete Algorithms & Math Dept. Sandia National Labs Albuquerque, New Mexico Also, CS Department, UNM Informatics & Linear Algebra Eigenvectors
More informationMultimedia Databases - 68A6 Final Term - exercises
Multimedia Databases - 8A Final Term - exercises Exercises for the preparation to the final term June, the 1th 00 quiz 1. approximation of cosine similarity An approximate computation of the cosine similarity
More information7. Symmetric Matrices and Quadratic Forms
Linear Algebra 7. Symmetric Matrices and Quadratic Forms CSIE NCU 1 7. Symmetric Matrices and Quadratic Forms 7.1 Diagonalization of symmetric matrices 2 7.2 Quadratic forms.. 9 7.4 The singular value
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University
CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu 2/7/2012 Jure Leskovec, Stanford C246: Mining Massive Datasets 2 Web pages are not equally important www.joe-schmoe.com
More informationChapter 7: Symmetric Matrices and Quadratic Forms
Chapter 7: Symmetric Matrices and Quadratic Forms (Last Updated: December, 06) These notes are derived primarily from Linear Algebra and its applications by David Lay (4ed). A few theorems have been moved
More informationMA 265 FINAL EXAM Fall 2012
MA 265 FINAL EXAM Fall 22 NAME: INSTRUCTOR S NAME:. There are a total of 25 problems. You should show work on the exam sheet, and pencil in the correct answer on the scantron. 2. No books, notes, or calculators
More informationProperties of Matrices and Operations on Matrices
Properties of Matrices and Operations on Matrices A common data structure for statistical analysis is a rectangular array or matris. Rows represent individual observational units, or just observations,
More informationMATH 829: Introduction to Data Mining and Analysis Clustering II
his lecture is based on U. von Luxburg, A Tutorial on Spectral Clustering, Statistics and Computing, 17 (4), 2007. MATH 829: Introduction to Data Mining and Analysis Clustering II Dominique Guillot Departments
More informationRiemann Hypotheses. Alex J. Best 4/2/2014. WMS Talks
Riemann Hypotheses Alex J. Best WMS Talks 4/2/2014 In this talk: 1 Introduction 2 The original hypothesis 3 Zeta functions for graphs 4 More assorted zetas 5 Back to number theory 6 Conclusion The Riemann
More informationIntroduction. Chapter One
Chapter One Introduction The aim of this book is to describe and explain the beautiful mathematical relationships between matrices, moments, orthogonal polynomials, quadrature rules and the Lanczos and
More informationThanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides
Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides Web Search: How to Organize the Web? Ranking Nodes on Graphs Hubs and Authorities PageRank How to Solve PageRank
More informationLinear Algebra (part 1) : Vector Spaces (by Evan Dummit, 2017, v. 1.07) 1.1 The Formal Denition of a Vector Space
Linear Algebra (part 1) : Vector Spaces (by Evan Dummit, 2017, v. 1.07) Contents 1 Vector Spaces 1 1.1 The Formal Denition of a Vector Space.................................. 1 1.2 Subspaces...................................................
More informationBounds on the Ratio of Eigenvalues
Bounds on the Ratio of Eigenvalues Tracy Powell Lesley A Ward, Advisor Estelle Basor, Reader December 0, 2005 Department of Mathematics Abstract In this paper, we find lower and upper bounds for the leading
More informationDS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.
DS-GA 1002 Lecture notes 0 Fall 2016 Linear Algebra These notes provide a review of basic concepts in linear algebra. 1 Vector spaces You are no doubt familiar with vectors in R 2 or R 3, i.e. [ ] 1.1
More informationIndex. Copyright (c)2007 The Society for Industrial and Applied Mathematics From: Matrix Methods in Data Mining and Pattern Recgonition By: Lars Elden
Index 1-norm, 15 matrix, 17 vector, 15 2-norm, 15, 59 matrix, 17 vector, 15 3-mode array, 91 absolute error, 15 adjacency matrix, 158 Aitken extrapolation, 157 algebra, multi-linear, 91 all-orthogonality,
More informationOn the convergence of weighted-average consensus
On the convergence of weighted-average consensus Francisco Pedroche Miguel Rebollo Carlos Carrascosa Alberto Palomares arxiv:307.7562v [math.oc] 29 Jul 203 Abstract In this note we give sufficient conditions
More informationSlide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University.
Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University http://www.mmds.org #1: C4.5 Decision Tree - Classification (61 votes) #2: K-Means - Clustering
More informationLink Mining PageRank. From Stanford C246
Link Mining PageRank From Stanford C246 Broad Question: How to organize the Web? First try: Human curated Web dictionaries Yahoo, DMOZ LookSmart Second try: Web Search Information Retrieval investigates
More informationMATH36001 Perron Frobenius Theory 2015
MATH361 Perron Frobenius Theory 215 In addition to saying something useful, the Perron Frobenius theory is elegant. It is a testament to the fact that beautiful mathematics eventually tends to be useful,
More information0.1 Naive formulation of PageRank
PageRank is a ranking system designed to find the best pages on the web. A webpage is considered good if it is endorsed (i.e. linked to) by other good webpages. The more webpages link to it, and the more
More information1 T 1 = where 1 is the all-ones vector. For the upper bound, let v 1 be the eigenvector corresponding. u:(u,v) E v 1(u)
CME 305: Discrete Mathematics and Algorithms Instructor: Reza Zadeh (rezab@stanford.edu) Final Review Session 03/20/17 1. Let G = (V, E) be an unweighted, undirected graph. Let λ 1 be the maximum eigenvalue
More informationDimensionality Reduction: PCA. Nicholas Ruozzi University of Texas at Dallas
Dimensionality Reduction: PCA Nicholas Ruozzi University of Texas at Dallas Eigenvalues λ is an eigenvalue of a matrix A R n n if the linear system Ax = λx has at least one non-zero solution If Ax = λx
More information1.3 Convergence of Regular Markov Chains
Markov Chains and Random Walks on Graphs 3 Applying the same argument to A T, which has the same λ 0 as A, yields the row sum bounds Corollary 0 Let P 0 be the transition matrix of a regular Markov chain
More informationNonnegative Matrices I
Nonnegative Matrices I Daisuke Oyama Topics in Economic Theory September 26, 2017 References J. L. Stuart, Digraphs and Matrices, in Handbook of Linear Algebra, Chapter 29, 2006. R. A. Brualdi and H. J.
More informationMath 4153 Exam 3 Review. The syllabus for Exam 3 is Chapter 6 (pages ), Chapter 7 through page 137, and Chapter 8 through page 182 in Axler.
Math 453 Exam 3 Review The syllabus for Exam 3 is Chapter 6 (pages -2), Chapter 7 through page 37, and Chapter 8 through page 82 in Axler.. You should be sure to know precise definition of the terms we
More information1. What is the determinant of the following matrix? a 1 a 2 4a 3 2a 2 b 1 b 2 4b 3 2b c 1. = 4, then det
What is the determinant of the following matrix? 3 4 3 4 3 4 4 3 A 0 B 8 C 55 D 0 E 60 If det a a a 3 b b b 3 c c c 3 = 4, then det a a 4a 3 a b b 4b 3 b c c c 3 c = A 8 B 6 C 4 D E 3 Let A be an n n matrix
More informationValidated computation tool for the Perron-Frobenius eigenvalue using graph theory
Validated computation tool for the Perron-Frobenius eigenvalue using graph theory Kaori Nagatou Kyushu University, JAPAN [ Joint work with Yutaka Ishii, Kyushu University ] AGA Seminar in INI, March 13,
More information