Motivation When searching for information on the WWW, a user submits a query to a search engine. The engine returns, as the query's result, a list of Web sites, which is usually a huge set, so the ranking of these Web sites is very important. Because much information is contained in the link structure of the WWW, information such as which pages link to which others can be used to augment search algorithms.
SALSA----Idea SALSA is based upon the theory of Markov chains and relies on the stochastic properties of random walks performed on a collection of sites. The input to the scheme is a collection of sites C built around a topic t. Intuition suggests that authoritative sites on topic t should be visible from many sites in the subgraph induced by C. Thus, a random walk on this subgraph will visit t-authorities with high probability.
SALSA----Idea SALSA combines the theory of random walks with the notion of two distinct types of Web sites, hubs and authorities, and analyzes two different Markov chains: a chain of hubs and a chain of authorities. Analyzing both chains allows the approach to give each Web site two distinct scores, a hub score and an authority score.
SALSA The principal community of authorities (hubs) found by SALSA is composed of the sites whose entries in the principal eigenvector of A (H) are the highest.
SALSA----Conclusion SALSA is a new stochastic approach for link structure analysis, which examines random walks on graphs derived from the link structure. The principal community of authorities (hubs) corresponds to the sites that are most frequently visited by the random walk defined by the authority (hub) Markov chain.
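The authority random walk described above can be illustrated with a small sketch. It assumes one common way of defining the authority chain: from the current authority, step backward along a uniformly chosen in-link to a hub, then forward along a uniformly chosen out-link of that hub. The function name, the edge-list input, and the example graph are hypothetical, and the slide that defined the chains in detail did not survive extraction, so this is not necessarily the exact construction used in the deck.

```python
def salsa_authority_scores(edges, iterations=100):
    """Estimate authority scores by simulating the authority chain.

    edges: list of (hub, authority) pairs, i.e. links hub -> authority.
    Assumption: one step = back along a random in-link, then forward
    along a random out-link of the hub that was reached.
    """
    out_links = {}
    in_links = {}
    for hub, auth in edges:
        out_links.setdefault(hub, []).append(auth)
        in_links.setdefault(auth, []).append(hub)

    authorities = sorted(in_links)
    # Start from the uniform distribution over authorities.
    score = {a: 1.0 / len(authorities) for a in authorities}

    for _ in range(iterations):
        new = {a: 0.0 for a in authorities}
        for a, mass in score.items():
            back = mass / len(in_links[a])  # mass sent to each hub linking to a
            for hub in in_links[a]:
                forward = back / len(out_links[hub])  # split among the hub's targets
                for target in out_links[hub]:
                    new[target] += forward
        score = new
    return score


# Hypothetical example: a2 is linked from three hubs, a1 and a3 from one each.
example_edges = [("h1", "a1"), ("h1", "a2"), ("h2", "a2"),
                 ("h3", "a2"), ("h3", "a3")]
example_scores = salsa_authority_scores(example_edges)
```

In this example the walk spends most of its time on a2, the site with the most in-links, which matches the intuition that authorities are the sites visited most frequently by the authority chain.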
The PageRank Citation Ranking: Bringing Order to the Web Larry Page et al., Stanford University
PageRank----Idea Every page has some number of forward links (outedges) and backlinks (inedges).
PageRank----Idea A page has high rank if the sum of the ranks of its backlinks is high. This covers both the case when a page has many backlinks and the case when a page has a few highly ranked backlinks.
PageRank----Definition u: a web page; F_u: the set of pages u points to; B_u: the set of pages that point to u; N_u = |F_u|: the number of links from u; c: a factor used for normalization. The equation is recursive, but it may be computed by starting with any set of ranks and iterating the computation until it converges.
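The equation this slide refers to was an image and did not survive extraction. Using only the symbols defined above, the simplified ranking from the PageRank paper can be written as follows; this is a reconstruction from the definitions, not a copy of the slide:

```latex
R(u) = c \sum_{v \in B_u} \frac{R(v)}{N_v}
```

Each page v splits its rank evenly among its N_v forward links, and u collects the shares sent by the pages in B_u.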
PageRank----Definition A problem with the above definition is the rank sink: if two web pages point to each other but to no other page, and some other page points to one of them, then during the iteration this loop will accumulate rank but never distribute any rank.
PageRank----Definition The modified definition adds a term E(u), a vector over the web pages (for example uniform, or concentrated on a favorite page) that corresponds to a source of rank. E(u) is a user-designed parameter.
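A minimal sketch of the iteration with a source of rank follows. It assumes the modified definition has the form R'(u) = c * (sum over v in B_u of R'(v)/N_v + E(u)), with c chosen so that the ranks sum to 1, as in the PageRank paper; the slide's own equation was an image and is not available here. The function name, the dictionary-based graph, and the example values of E are hypothetical, and the sketch assumes every page has at least one forward link.

```python
def pagerank(links, source, iterations=50):
    """Iterate the rank equation with a rank source E.

    links:  dict mapping each page u to the list F_u of pages it points to.
    source: dict mapping pages to E(u); missing pages count as 0.
    Assumption: every page has at least one forward link.
    """
    pages = sorted(links)
    rank = {u: 1.0 / len(pages) for u in pages}  # any starting ranks work

    for _ in range(iterations):
        # Start each page with its share of the rank source E(u).
        new = {u: source.get(u, 0.0) for u in pages}
        for v, forward in links.items():
            share = rank[v] / len(forward)  # R(v) / N_v
            for u in forward:
                new[u] += share
        # The normalization factor c keeps the ranks summing to 1.
        total = sum(new.values())
        rank = {u: r / total for u, r in new.items()}
    return rank


# Hypothetical rank sink: a -> b, and b and c point only to each other.
sink_graph = {"a": ["b"], "b": ["c"], "c": ["b"]}

no_source = pagerank(sink_graph, source={})
uniform_source = pagerank(sink_graph, source={"a": 0.05, "b": 0.05, "c": 0.05})
```

Without a source, the loop between b and c ends up holding all of the rank and a is left with none. With a uniform E, every page, including a, keeps a positive rank.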
PageRank----Random Surfer Model