SlideShare a Scribd company logo
1 of 22
Download to read offline
PageRank
指導⽼老師 許慶昇
研究⽣生 鍾聖彥
Reference
Bing Liu (2011). Web Data Mining: Exploring Hyperlinks,
Contents, and Usage Data. 2nd Ed., Springer-Verlang Berlin
Heigelberg.
• Page, Lawrence, et al. "The PageRank citation ranking: bringing
order to the web." (1999).
• Brin, Sergey, and Lawrence Page. "The anatomy of a large-scale
hypertextual Web search engine." Computer networks and ISDN
systems 30.1 (1998): 107-117.
• PageRank,http://en.wikipedia.org/wiki/PageRank
• 台灣搜尋引擎優化與⾏行銷研究院 - SEO & SEM Blog
http://seo.dns.com.tw/
• Croft, W. Bruce, Donald Metzler, and Trevor Strohman. Search
engines: Information retrieval in practice. Reading: AddisonWesley, 2010.
•
Why	
  This	
  Order?
PageRank was developed by Larry Page (hence the
name Page-Rank) and Sergey Brin.
Not Page 1, 2, 3………
Larry Page
•

Born March 26, 1973(age 40)

•

American computer scientist

•

Co-founder and CEO of
Google Inc.

•

As of 2013, Page's personal
wealth is estimated to be US
$20.3 billion, ranking him #13
on the Forbes 400 list of the
400 richest Americans.

•

Inventor of PageRank, the
foundation of Google's search
ranking algorithm.
Sergey Brin

•

Born August 21, 1973

•

Russian

•

Co-founder of Google

•

As of 2013, his personal
wealth was estimated to be
$24.4 billion.
What is PageRank ?
PageRank is able to order search results so that more
important and central Web pages are given preference.
(Sergey Brin, Lawrence Page,1998)
A method for rating the importance of web pages
objectively and mechanically using the link structure of
the web. (Sergey Brin, Lawrence Page,1998).
The year 1998 was an important year for web link
analysis and web search(Bing Liu ,2011).
Applications
PageRank has applications in search, browsing, and
traffic estimation.(Sergey Brin, Lawrence Page,1998).
To test the utility of PageRank for search, we built a
web search engine called Google.(Sergey Brin,
Lawrence Page,1998).
Link Structure of the Web

Backlinks and Forward links:
!

A and B are C’s backlinks
C is A and B’s forward link
Definition of PageRank

PageRank 的運算公式被設計為「⼀一個網站的 PageRank 值,來
⾃自於加總所有連結到該網站的網站之 PageRank 值除以本⾝身的
導出連結數」
Definition of PageRank
此公式是⼀一個會收斂的運算
以上述例⼦子,假設每個網⾴頁的 PageRank 值都是均等的,則計算⽅方法如下(每階段的 PR
值使⽤用前⼀一階段的運算 結果):

1. PR(A)=PR(B)=PR(C)=1/3=0.33
2. PR(A)=0.33 PR(B)=0.33/2=0.17 PR(C)=0.33/2+0.33=0.5
3. PR(A)=0.5 PR(B)= 0.33/2=0.17 PR(C)=0.33/2+0.17=0.33	

4. PR(A)=0.33 PR(B)=0.5/2=0.25 PR(C)=0.5/2+0.17=0.42
5. 依此類推...	

最後趨近:
PR(A)=0.4 PR(B)=0.2 PR(C)=0.4
!
Convergence Properties
Definition of PageRank
A Simple Version of PageRank

u: a web page
Bu: the set of u’s backlinks
Nv: the number of forward links of page v
c: the normalization factor to make ||R||L1 = 1 (||
R||L1= |R1 + … + Rn|) (so that the total rank of all
web pages is constant).
A Problem with Simplified
PageRank

During each iteration, the loop accumulates rank but
never distributes rank to other pages.
Random Surfer Model
The standing probability distribution of a
random walk on the graph of the web.
Simply keeps clicking successive links at
random.
Modified Version of
PageRank

E(u):The additional factor E can be viewed as a way
of modeling this behavior: the surfer periodically
“gets bored" and jumps to a random page chosen
based on the distribution in E.
Searching with PageRank
Searching with PageRank
Reference
Bing Liu (2011). Web Data Mining: Exploring Hyperlinks,
Contents, and Usage Data. 2nd Ed., Springer-Verlang Berlin
Heigelberg.
• Page, Lawrence, et al. "The PageRank citation ranking: bringing
order to the web." (1999).
• Brin, Sergey, and Lawrence Page. "The anatomy of a large-scale
hypertextual Web search engine." Computer networks and ISDN
systems 30.1 (1998): 107-117.
• PageRank,http://en.wikipedia.org/wiki/PageRank
• 台灣搜尋引擎優化與⾏行銷研究院 - SEO & SEM Blog
http://seo.dns.com.tw/
• Croft, W. Bruce, Donald Metzler, and Trevor Strohman. Search
engines: Information retrieval in practice. Reading: AddisonWesley, 2010.
•

More Related Content

What's hot (19)

Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank (1)
Pagerank (1)Pagerank (1)
Pagerank (1)
 
Pagerank
PagerankPagerank
Pagerank
 
prueba
prueba prueba
prueba
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank
PagerankPagerank
Pagerank
 
Pagerank (1)
Pagerank (1)Pagerank (1)
Pagerank (1)
 

Viewers also liked

Fci2012 pieve di soligo low
Fci2012 pieve di soligo lowFci2012 pieve di soligo low
Fci2012 pieve di soligo loweventiFFF
 
Firearms
FirearmsFirearms
Firearmskfritze
 
Integrating UX Activities in Global Conferences - UX @ Enterprise 2014
Integrating UX Activities in Global Conferences - UX @ Enterprise 2014Integrating UX Activities in Global Conferences - UX @ Enterprise 2014
Integrating UX Activities in Global Conferences - UX @ Enterprise 2014Omer Nahshon
 
Did Communism threaten America's internal security after World War 2?
Did Communism threaten America's internal security after World War 2?Did Communism threaten America's internal security after World War 2?
Did Communism threaten America's internal security after World War 2?Solous
 
Hogar santa helena
Hogar santa helenaHogar santa helena
Hogar santa helenaIndeleblia
 
Second hand knowledge presentation
Second hand knowledge presentationSecond hand knowledge presentation
Second hand knowledge presentationSolous
 
Living for a year in Shanghai
Living for a year in ShanghaiLiving for a year in Shanghai
Living for a year in ShanghaiDawn Lee
 
Enterprise UX - UXI Live 2013
Enterprise UX - UXI Live 2013Enterprise UX - UXI Live 2013
Enterprise UX - UXI Live 2013Omer Nahshon
 
Living for a Year in Shanghai
Living for a Year in ShanghaiLiving for a Year in Shanghai
Living for a Year in ShanghaiDawn Lee
 
World Literature Paper
World Literature PaperWorld Literature Paper
World Literature PaperSolous
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data miningMai Mustafa
 
Pagerank Algorithm Explained
Pagerank Algorithm ExplainedPagerank Algorithm Explained
Pagerank Algorithm Explainedjdhaar
 
Google Page Rank Algorithm
Google Page Rank AlgorithmGoogle Page Rank Algorithm
Google Page Rank AlgorithmOmkar Dash
 

Viewers also liked (18)

Fci2012 pieve di soligo low
Fci2012 pieve di soligo lowFci2012 pieve di soligo low
Fci2012 pieve di soligo low
 
Jishin
JishinJishin
Jishin
 
Android google mapv2
Android google mapv2Android google mapv2
Android google mapv2
 
Firearms
FirearmsFirearms
Firearms
 
Integrating UX Activities in Global Conferences - UX @ Enterprise 2014
Integrating UX Activities in Global Conferences - UX @ Enterprise 2014Integrating UX Activities in Global Conferences - UX @ Enterprise 2014
Integrating UX Activities in Global Conferences - UX @ Enterprise 2014
 
Did Communism threaten America's internal security after World War 2?
Did Communism threaten America's internal security after World War 2?Did Communism threaten America's internal security after World War 2?
Did Communism threaten America's internal security after World War 2?
 
Hogar santa helena
Hogar santa helenaHogar santa helena
Hogar santa helena
 
Second hand knowledge presentation
Second hand knowledge presentationSecond hand knowledge presentation
Second hand knowledge presentation
 
Living for a year in Shanghai
Living for a year in ShanghaiLiving for a year in Shanghai
Living for a year in Shanghai
 
How online social ties and product-related risks influence purchase intention...
How online social ties and product-related risks influence purchase intention...How online social ties and product-related risks influence purchase intention...
How online social ties and product-related risks influence purchase intention...
 
Enterprise UX - UXI Live 2013
Enterprise UX - UXI Live 2013Enterprise UX - UXI Live 2013
Enterprise UX - UXI Live 2013
 
類神經網路
類神經網路類神經網路
類神經網路
 
Living for a Year in Shanghai
Living for a Year in ShanghaiLiving for a Year in Shanghai
Living for a Year in Shanghai
 
World Literature Paper
World Literature PaperWorld Literature Paper
World Literature Paper
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data mining
 
Pagerank Algorithm Explained
Pagerank Algorithm ExplainedPagerank Algorithm Explained
Pagerank Algorithm Explained
 
Google Page Rank Algorithm
Google Page Rank AlgorithmGoogle Page Rank Algorithm
Google Page Rank Algorithm
 
Romantic era
Romantic eraRomantic era
Romantic era
 

Similar to Page rank

Similar to Page rank (20)

Web mining
Web miningWeb mining
Web mining
 
Page rank by university of michagain.ppt
Page rank by university of michagain.pptPage rank by university of michagain.ppt
Page rank by university of michagain.ppt
 
Google page rank
Google page rankGoogle page rank
Google page rank
 
Search Engine Optimization(SEO)
Search Engine Optimization(SEO)Search Engine Optimization(SEO)
Search Engine Optimization(SEO)
 
Page rank algortihm
Page rank algortihmPage rank algortihm
Page rank algortihm
 
Dm page rank
Dm page rankDm page rank
Dm page rank
 
PageRank & Searching
PageRank & SearchingPageRank & Searching
PageRank & Searching
 
Implementing page rank algorithm using hadoop map reduce
Implementing page rank algorithm using hadoop map reduceImplementing page rank algorithm using hadoop map reduce
Implementing page rank algorithm using hadoop map reduce
 
Search Engine working, Crawlers working, Search Engine mechanism
Search Engine working, Crawlers working, Search Engine mechanismSearch Engine working, Crawlers working, Search Engine mechanism
Search Engine working, Crawlers working, Search Engine mechanism
 
Pagerank
PagerankPagerank
Pagerank
 
Link analysis for web search
Link analysis for web searchLink analysis for web search
Link analysis for web search
 
Search engine
Search engineSearch engine
Search engine
 
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCHLINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
 
Seo and page rank algorithm
Seo and page rank algorithmSeo and page rank algorithm
Seo and page rank algorithm
 
Google page rank
Google page rankGoogle page rank
Google page rank
 
Ranking algorithms
Ranking algorithmsRanking algorithms
Ranking algorithms
 
IRJET- Page Ranking Algorithms – A Comparison
IRJET- Page Ranking Algorithms – A ComparisonIRJET- Page Ranking Algorithms – A Comparison
IRJET- Page Ranking Algorithms – A Comparison
 
I04015559
I04015559I04015559
I04015559
 
Page Rank Link Farm Detection
Page Rank Link Farm DetectionPage Rank Link Farm Detection
Page Rank Link Farm Detection
 
PageRank algorithm and its variations: A Survey report
PageRank algorithm and its variations: A Survey reportPageRank algorithm and its variations: A Survey report
PageRank algorithm and its variations: A Survey report
 

More from Department of Information Management Ming Chuan University, Taiwan (6)

Android googlemapv2 keyApplicance
Android googlemapv2 keyApplicanceAndroid googlemapv2 keyApplicance
Android googlemapv2 keyApplicance
 
Greedy minimum spanning tree- prim's algorithm
Greedy minimum spanning tree- prim's algorithmGreedy minimum spanning tree- prim's algorithm
Greedy minimum spanning tree- prim's algorithm
 
Dynamic programming lcs
Dynamic programming lcsDynamic programming lcs
Dynamic programming lcs
 
Examining the impact of rich media on consumer willingness to pay in online ...
Examining the impact of rich media  on consumer willingness to pay in online ...Examining the impact of rich media  on consumer willingness to pay in online ...
Examining the impact of rich media on consumer willingness to pay in online ...
 
No sql
No sqlNo sql
No sql
 
Semantic web
Semantic webSemantic web
Semantic web
 

Recently uploaded

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 

Recently uploaded (20)

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

Page rank

  • 2. Reference Bing Liu (2011). Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data. 2nd Ed., Springer-Verlang Berlin Heigelberg. • Page, Lawrence, et al. "The PageRank citation ranking: bringing order to the web." (1999). • Brin, Sergey, and Lawrence Page. "The anatomy of a large-scale hypertextual Web search engine." Computer networks and ISDN systems 30.1 (1998): 107-117. • PageRank,http://en.wikipedia.org/wiki/PageRank • 台灣搜尋引擎優化與⾏行銷研究院 - SEO & SEM Blog http://seo.dns.com.tw/ • Croft, W. Bruce, Donald Metzler, and Trevor Strohman. Search engines: Information retrieval in practice. Reading: AddisonWesley, 2010. •
  • 4. PageRank was developed by Larry Page (hence the name Page-Rank) and Sergey Brin. Not Page 1, 2, 3………
  • 5. Larry Page • Born March 26, 1973(age 40) • American computer scientist • Co-founder and CEO of Google Inc. • As of 2013, Page's personal wealth is estimated to be US $20.3 billion, ranking him #13 on the Forbes 400 list of the 400 richest Americans. • Inventor of PageRank, the foundation of Google's search ranking algorithm.
  • 6. Sergey Brin • Born August 21, 1973 • Russian • Co-founder of Google • As of 2013, his personal wealth was estimated to be $24.4 billion.
  • 7. What is PageRank ? PageRank is able to order search results so that more important and central Web pages are given preference. (Sergey Brin, Lawrence Page,1998) A method for rating the importance of web pages objectively and mechanically using the link structure of the web. (Sergey Brin, Lawrence Page,1998). The year 1998 was an important year for web link analysis and web search(Bing Liu ,2011).
  • 8. Applications PageRank has applications in search, browsing, and traffic estimation.(Sergey Brin, Lawrence Page,1998). To test the utility of PageRank for search, we built a web search engine called Google.(Sergey Brin, Lawrence Page,1998).
  • 9. Link Structure of the Web Backlinks and Forward links: ! A and B are C’s backlinks C is A and B’s forward link
  • 10. Definition of PageRank PageRank 的運算公式被設計為「⼀一個網站的 PageRank 值,來 ⾃自於加總所有連結到該網站的網站之 PageRank 值除以本⾝身的 導出連結數」
  • 12. 此公式是⼀一個會收斂的運算 以上述例⼦子,假設每個網⾴頁的 PageRank 值都是均等的,則計算⽅方法如下(每階段的 PR 值使⽤用前⼀一階段的運算 結果): 1. PR(A)=PR(B)=PR(C)=1/3=0.33 2. PR(A)=0.33 PR(B)=0.33/2=0.17 PR(C)=0.33/2+0.33=0.5 3. PR(A)=0.5 PR(B)= 0.33/2=0.17 PR(C)=0.33/2+0.17=0.33 4. PR(A)=0.33 PR(B)=0.5/2=0.25 PR(C)=0.5/2+0.17=0.42 5. 依此類推... 最後趨近: PR(A)=0.4 PR(B)=0.2 PR(C)=0.4 !
  • 13.
  • 15. Definition of PageRank A Simple Version of PageRank u: a web page Bu: the set of u’s backlinks Nv: the number of forward links of page v c: the normalization factor to make ||R||L1 = 1 (|| R||L1= |R1 + … + Rn|) (so that the total rank of all web pages is constant).
  • 16.
  • 17. A Problem with Simplified PageRank During each iteration, the loop accumulates rank but never distributes rank to other pages.
  • 18. Random Surfer Model The standing probability distribution of a random walk on the graph of the web. Simply keeps clicking successive links at random.
  • 19. Modified Version of PageRank E(u):The additional factor E can be viewed as a way of modeling this behavior: the surfer periodically “gets bored" and jumps to a random page chosen based on the distribution in E.
  • 22. Reference Bing Liu (2011). Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data. 2nd Ed., Springer-Verlang Berlin Heigelberg. • Page, Lawrence, et al. "The PageRank citation ranking: bringing order to the web." (1999). • Brin, Sergey, and Lawrence Page. "The anatomy of a large-scale hypertextual Web search engine." Computer networks and ISDN systems 30.1 (1998): 107-117. • PageRank,http://en.wikipedia.org/wiki/PageRank • 台灣搜尋引擎優化與⾏行銷研究院 - SEO & SEM Blog http://seo.dns.com.tw/ • Croft, W. Bruce, Donald Metzler, and Trevor Strohman. Search engines: Information retrieval in practice. Reading: AddisonWesley, 2010. •