SlideShare uma empresa Scribd logo
1 de 27
Q3: How does Google
rank webpages?
Mung Chiang
Networks: Friends, Money, and Bytes
Webpages form a network

 Links in text: since mid-20th century…

 Hyperlinks in webpages
    Early 1990s: web, browser, portal…
    Mid to late 1990s: search…
    Directed graph

 Huge and sparse
    N=40—60 Billion webpages out there…
    And very few links in/out of most webpages
Which webpages are more
       important?
 Usefulness of ranking is hard to measure

 So rank by importance

 Quantify node importance:
    Count the number of links?
    More important links point to this page?


 Turn a seemingly cyclic statmeent to characterize
 an equilibrium of a recursive definition
General themes
Network consists of
   Topology: graphs, matrices
   Functionality: what you do on the graph


We’ll see 3 matrices and a model of the “search
and navigation” functionality
Try 1
Add up importance scores through incoming links
Try 2
Normalize by the spread of importance




Is there a set of consistent scores?
Example
Calculation
What does Google do?
Crawling the web

Storing and indexing the pages

Computing two scores to rank pages per search
   Relevant scores
   Importance scores
Remember vector and
     matrix?
Matrix multiplication
Example
The first matrix
Iterations
Dangling nodes
The second matrix
Mandatory score-spreading
Too many consistent
      scores
The third matrix
Randomization
Pagerank algorithm
Example
Matrix
Iterations
PageRank result
3
2
4
8
1
5
7
6
Parallel with DPC
Both are special cases of “power method” using
non-negative matrix theory
The challenge of scale
Numerical linear algebra methods

A few more tricks
SEO
How to increase your website rank?




How Google reacts?
   Early 2011
   May 2012
Summary
Hyperlinked webpages form a network

Connectivity pattern provides a hint on
importance



Pagerank uniquely defines and efficiently
computes a consistent set of importance scores

Which can be viewed as the dominant
eigenvector of the Google matrix

Mais conteúdo relacionado

Semelhante a Q3 (20)

Incremental Page Rank Computation on Evolving Graphs : NOTES
Incremental Page Rank Computation on Evolving Graphs : NOTESIncremental Page Rank Computation on Evolving Graphs : NOTES
Incremental Page Rank Computation on Evolving Graphs : NOTES
 
PageRank Algorithm
PageRank AlgorithmPageRank Algorithm
PageRank Algorithm
 
J046045558
J046045558J046045558
J046045558
 
Macran
MacranMacran
Macran
 
Discovering knowledge using web structure mining
Discovering knowledge using web structure miningDiscovering knowledge using web structure mining
Discovering knowledge using web structure mining
 
Pagerank
PagerankPagerank
Pagerank
 
Comparative study of different ranking algorithms adopted by search engine
Comparative study of  different ranking algorithms adopted by search engineComparative study of  different ranking algorithms adopted by search engine
Comparative study of different ranking algorithms adopted by search engine
 
INTRODUCCION A LA FINANZA
INTRODUCCION A LA FINANZAINTRODUCCION A LA FINANZA
INTRODUCCION A LA FINANZA
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
PAGERANK
PAGERANKPAGERANK
PAGERANK
 
PAGERANKS
PAGERANKSPAGERANKS
PAGERANKS
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Introduccion a las Finanzas
Introduccion a las FinanzasIntroduccion a las Finanzas
Introduccion a las Finanzas
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 

Q3