SlideShare uma empresa Scribd logo
1 de 20
Introduction to SEO
Search engine optimization (SEO) : SEO works in two ways
Pay Per Click.
Search engine's "natural" or un-paid ("organic") search
results.
Search engine optimization (SEO) is the process of
affecting the visibility of a website or a web page in
a search engine's "natural" or un-paid ("organic") search
results. In general, the earlier (or higher ranked on the
search results page), and more frequently a site appears
in the search results list, the more visitors it will receive
from the search engine's users (Wikipedia)
Optimizing search engines ("organic")
search results.
factors which affects your web content search visibility:-
Title:- Title tag should be mention properly. Search Engines doesn't use the title tag 100%
of the time. Occasionally, Google pulls the title from the anchor text of a link to that page.
Make sure your words in Title and web link words matches.
Snippet :-The snippet is the description for the page that appears beneath the title.
Google may pull this from the page‟s meta description tag. Put relevant sentences, it
helps to best search queries.
Bolding :-Google bolds the query words anywhere they appear in the search result.
Cached link :- If the page is down or loading slowly, a searcher can still get to the
information via the cache. If the page is accidentally deleted, the webmaster can retrieve
the data from the cache to recreate the page. Also, the cache shows when the page was
crawled.
Optimizing search engines ("organic")
search results.
Meta data Other items in a search result include the URL, the page
size etc.
Sitelink:- Link to other sites helps, improve search results.
Introduction to webmasters.(Google and Bing Search Engines)
Want to avoid negative search results about your web content ?
Put unwanted links in robots.txt in webmaster tools.
This all operations could be done through HTML Meta tags and
using webmaster tool from search engines.
Google PageRank
Algorithm
Agenda :-
Fact.
Understanding Page Rank Algorithm.
Analysis.
Case Discussion.
Practical implementation.
Agenda :-
Fact.
Understanding Page Rank Algorithm.
Simple calculation of PageRank.
Analysis of PageRank Algorithm.
Case Discussion.
Practical Implementation.
References.
Fact
Developed by Larry Page and Sergey Brin in
1988
Trade Mark of Google.
Patterned by Stanford University.
Back Bone of Google Search Technology.
Understanding Simple PR
Algorithm.
Every inbound link increase the weightage of
a page.
Page Rank is based on numbers of pages
linked to that page.
Highest PageRank is 1 but in real world Indicated with
numbers between 1 – 10(using a logarithmic scale.)
Hence , appropriate SERP listing.
Calculated by Nature and Numbers of back links.
Indicated on Google toolbar.
Definition of PageRank
Algorithm.
Assume a small universe of four web pages: A, B, C and D , then page rank calculated
as:
PR(A) = PR(B) + PR(C) + PR(D)
Where page B had a link to pages C and A, page C had a link to page A, and
page D had links to all three pages. Then PR is:-
PR(A) = PR(B)/2 + PR(C)/1 + PR(D)/3
Let denote , outbound links by L() then,
PR(A) = PR(B)/L(B) + PR(C)/L(C)+ PR(D)/L(D), final summation will be,
Understanding PageRank
PR(u) = ∑ PR(v)/L(v) , for every v ∈ Bu
i.e. the PageRank value for a page u is dependent on the PageRank values for each page v contained
in the set Bu (the set containing all pages linking to page u), divided by the number L(v) of links from
page v.
Introduction to Damping factor(By SergeyBrin)(d = 0.85) :-
The PageRank theory holds that an imaginary surfer who is randomly clicking on links will eventually
stop clicking. The probability, at any step, that the person will continue is a damping factor d ,
generally assumed value is 0.85
So generalized PageRank algorithm is :-
PR( Pi ) = 1- d/N + d ∑ P(j)/L(j)
Understanding PageRank
A Simple Example:- Consider a small universe (A Set of N pages)where , we have only to web
pages, then
Guess 1
st
:- Say, initial page rank of each page is 1.0 and d = 0.85
PR(A) = (1 – d) + d(PR(B)/1) and PR(B) = (1 – d) + d(PR(A)/1) We get,
PR(A) = 0.15 + 0.85 * 1 = 1 and PR(B) = 0.15 + 0.85 * 1 = 1
Guess 2
nd
:- Say, initial page rank of each page is 40 and d = 0.85
PR(A) = (1 – d) + d(PR(B)/1) and PR(B) = (1 – d) + d(PR(A)/1) We get,
Understanding PageRank
First Calculation:-
PR(A)= 0.15 + 0.85 * 40 = 34.25 ,
PR(B) = 0.15 + 0.85 * 0.385875 = 9.1775
Second Calculation:-
PR(A)= 0.15 + 0.85 * 29.1775 = 24.950875
PR(B) = 0.15 + 0.85 * 24.950875 = 21.35824375 and so on …
On Kth Calculation:- When the sum of PageRank of each page is equal to number of pages present in
the set , that would be your page rank of page.
Average page rank never cross to 1.
Linear system of equations
Assume in small set „x‟, we have
Pages 1, 2 , 3 , 4 then transition
Matrix will be, A =
Please note some observations here:-
Page 1:- donates = 1/3+1/3+1/3 = 1 and gains 1+1/2 = 1.5 importance.
Page 2:- donates = 1/3+1/2 = 0.83 and gains 1/3 = 0.33 importance.
Page 3 :- donates = 1 = 1 and gains 1/3 +1/2+1/2 = 1.33 importance.
Page 4 :- donates = 1/3+1/2 = 0.83 and gains 1/2 = 0.5 importance.
Solving Linear Equation:-
Arrange Linear system =
of equations
We get linear equation = ->
Solving this equation by
substitution method(substitute value of x2, we and ), we get,
Solving Linear Equation:-
We get a vector eigenvectors corresponding to
the Eigen value 1 are of the form
Here we don‟t know about value of x1, choose
x1/12 as some constant so we could get Eigen
vector, whose average value is 1.
Solving Linear Equation:-
We could choose as a 1/31,
So that, sum of
PR(x) = 0.38 + 0.132 +0.29 + 0.19 = 0.992
(Since PR never cross 1 and average/Maximum PR will be 1)
How PR help you ?
How it is use full to me ?
Linking your web content
with many links can increase
your search visibility and
A outbound link from highly
Ranked page optimize your
search query.
Google Bomb :-
The terms Google bomb is creating large
numbers of links, that cause a web page to have
a high ranking for searches on unrelated or off
topic keyword phrases, often for comical or
satirical purposes.
Example of Google bomb:- Search For
“completely wrong” in Google.
References:-
I would like to thanks to Dr. Vinayak Joshi, Department of Mathematics,
University of Pune, Who introduce me to this algorithm and motivated me
to deliver a session in 2009.
Wikipedia http://en.wikipedia.org/wiki/PageRank
Department of Mathematics, Cornell University, Lecture 3 and 6
Linear Algebra by Vivek Sahai and Vikas Bist.
Questions?
Questions?
Thanks !

Mais conteúdo relacionado

Destaque

Scientometrics and semantic maps for development (Author: Iina Hellsten)
Scientometrics and semantic maps for development (Author: Iina Hellsten)Scientometrics and semantic maps for development (Author: Iina Hellsten)
Scientometrics and semantic maps for development (Author: Iina Hellsten)Sarah Cummings
 
Webometrics Revisited in Big Data Age_DISC2013
Webometrics Revisited in Big Data Age_DISC2013Webometrics Revisited in Big Data Age_DISC2013
Webometrics Revisited in Big Data Age_DISC2013Han Woo PARK
 
Scientometric Analysis
Scientometric AnalysisScientometric Analysis
Scientometric Analysissumitbanshal
 
Bibliometrics and scientometrics
Bibliometrics and scientometricsBibliometrics and scientometrics
Bibliometrics and scientometricsguest633b30
 
Bibliometrics, Scintometrics, Citation analysis, Content analysis
Bibliometrics, Scintometrics, Citation analysis, Content analysisBibliometrics, Scintometrics, Citation analysis, Content analysis
Bibliometrics, Scintometrics, Citation analysis, Content analysisSumit Ranjan
 
Webometrics and Studies of Cultural Diffusion -Psy Gangnam Style on YouTube
Webometrics and Studies of Cultural Diffusion-Psy Gangnam Style on YouTubeWebometrics and Studies of Cultural Diffusion-Psy Gangnam Style on YouTube
Webometrics and Studies of Cultural Diffusion -Psy Gangnam Style on YouTubeHan Woo PARK
 
Search Engine Optimization PPT
Search Engine Optimization PPT Search Engine Optimization PPT
Search Engine Optimization PPT Kranthi Shaik
 

Destaque (11)

Au 2015
Au 2015Au 2015
Au 2015
 
Scientometrics and semantic maps for development (Author: Iina Hellsten)
Scientometrics and semantic maps for development (Author: Iina Hellsten)Scientometrics and semantic maps for development (Author: Iina Hellsten)
Scientometrics and semantic maps for development (Author: Iina Hellsten)
 
Scientometrics class
Scientometrics classScientometrics class
Scientometrics class
 
Scientometrics 2010-85-2
Scientometrics 2010-85-2Scientometrics 2010-85-2
Scientometrics 2010-85-2
 
Webometrics Revisited in Big Data Age_DISC2013
Webometrics Revisited in Big Data Age_DISC2013Webometrics Revisited in Big Data Age_DISC2013
Webometrics Revisited in Big Data Age_DISC2013
 
Scientometric Analysis
Scientometric AnalysisScientometric Analysis
Scientometric Analysis
 
Bibliometrics and scientometrics
Bibliometrics and scientometricsBibliometrics and scientometrics
Bibliometrics and scientometrics
 
Bibliometrics, Scintometrics, Citation analysis, Content analysis
Bibliometrics, Scintometrics, Citation analysis, Content analysisBibliometrics, Scintometrics, Citation analysis, Content analysis
Bibliometrics, Scintometrics, Citation analysis, Content analysis
 
Webometrics and Studies of Cultural Diffusion -Psy Gangnam Style on YouTube
Webometrics and Studies of Cultural Diffusion-Psy Gangnam Style on YouTubeWebometrics and Studies of Cultural Diffusion-Psy Gangnam Style on YouTube
Webometrics and Studies of Cultural Diffusion -Psy Gangnam Style on YouTube
 
S.e.o by h@r$h!l
S.e.o by h@r$h!lS.e.o by h@r$h!l
S.e.o by h@r$h!l
 
Search Engine Optimization PPT
Search Engine Optimization PPT Search Engine Optimization PPT
Search Engine Optimization PPT
 

Semelhante a Optimizing search engines (20)

Analysis Of Algorithm
Analysis Of AlgorithmAnalysis Of Algorithm
Analysis Of Algorithm
 
Page rank2
Page rank2Page rank2
Page rank2
 
Cloud Computing Project
Cloud Computing ProjectCloud Computing Project
Cloud Computing Project
 
Dm page rank
Dm page rankDm page rank
Dm page rank
 
Ranking Web Pages
Ranking Web PagesRanking Web Pages
Ranking Web Pages
 
Seo and page rank algorithm
Seo and page rank algorithmSeo and page rank algorithm
Seo and page rank algorithm
 
Search engine optimization
Search engine optimizationSearch engine optimization
Search engine optimization
 
How Google Works
How Google WorksHow Google Works
How Google Works
 
Local Approximation of PageRank
Local Approximation of PageRankLocal Approximation of PageRank
Local Approximation of PageRank
 
PageRank
PageRankPageRank
PageRank
 
J046045558
J046045558J046045558
J046045558
 
I04015559
I04015559I04015559
I04015559
 
Page Rank Link Farm Detection
Page Rank Link Farm DetectionPage Rank Link Farm Detection
Page Rank Link Farm Detection
 
PageRank & Searching
PageRank & SearchingPageRank & Searching
PageRank & Searching
 
PageRank Algorithm
PageRank AlgorithmPageRank Algorithm
PageRank Algorithm
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data mining
 
Search engine
Search engineSearch engine
Search engine
 
Page Rank
Page RankPage Rank
Page Rank
 
Page Rank
Page RankPage Rank
Page Rank
 
Page Rank
Page RankPage Rank
Page Rank
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Último (20)

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Optimizing search engines

  • 1.
  • 2. Introduction to SEO Search engine optimization (SEO) : SEO works in two ways Pay Per Click. Search engine's "natural" or un-paid ("organic") search results. Search engine optimization (SEO) is the process of affecting the visibility of a website or a web page in a search engine's "natural" or un-paid ("organic") search results. In general, the earlier (or higher ranked on the search results page), and more frequently a site appears in the search results list, the more visitors it will receive from the search engine's users (Wikipedia)
  • 3. Optimizing search engines ("organic") search results. factors which affects your web content search visibility:- Title:- Title tag should be mention properly. Search Engines doesn't use the title tag 100% of the time. Occasionally, Google pulls the title from the anchor text of a link to that page. Make sure your words in Title and web link words matches. Snippet :-The snippet is the description for the page that appears beneath the title. Google may pull this from the page‟s meta description tag. Put relevant sentences, it helps to best search queries. Bolding :-Google bolds the query words anywhere they appear in the search result. Cached link :- If the page is down or loading slowly, a searcher can still get to the information via the cache. If the page is accidentally deleted, the webmaster can retrieve the data from the cache to recreate the page. Also, the cache shows when the page was crawled.
  • 4. Optimizing search engines ("organic") search results. Meta data Other items in a search result include the URL, the page size etc. Sitelink:- Link to other sites helps, improve search results. Introduction to webmasters.(Google and Bing Search Engines) Want to avoid negative search results about your web content ? Put unwanted links in robots.txt in webmaster tools. This all operations could be done through HTML Meta tags and using webmaster tool from search engines.
  • 5. Google PageRank Algorithm Agenda :- Fact. Understanding Page Rank Algorithm. Analysis. Case Discussion. Practical implementation.
  • 6. Agenda :- Fact. Understanding Page Rank Algorithm. Simple calculation of PageRank. Analysis of PageRank Algorithm. Case Discussion. Practical Implementation. References.
  • 7. Fact Developed by Larry Page and Sergey Brin in 1988 Trade Mark of Google. Patterned by Stanford University. Back Bone of Google Search Technology.
  • 8. Understanding Simple PR Algorithm. Every inbound link increase the weightage of a page. Page Rank is based on numbers of pages linked to that page. Highest PageRank is 1 but in real world Indicated with numbers between 1 – 10(using a logarithmic scale.) Hence , appropriate SERP listing. Calculated by Nature and Numbers of back links. Indicated on Google toolbar.
  • 9. Definition of PageRank Algorithm. Assume a small universe of four web pages: A, B, C and D , then page rank calculated as: PR(A) = PR(B) + PR(C) + PR(D) Where page B had a link to pages C and A, page C had a link to page A, and page D had links to all three pages. Then PR is:- PR(A) = PR(B)/2 + PR(C)/1 + PR(D)/3 Let denote , outbound links by L() then, PR(A) = PR(B)/L(B) + PR(C)/L(C)+ PR(D)/L(D), final summation will be,
  • 10. Understanding PageRank PR(u) = ∑ PR(v)/L(v) , for every v ∈ Bu i.e. the PageRank value for a page u is dependent on the PageRank values for each page v contained in the set Bu (the set containing all pages linking to page u), divided by the number L(v) of links from page v. Introduction to Damping factor(By SergeyBrin)(d = 0.85) :- The PageRank theory holds that an imaginary surfer who is randomly clicking on links will eventually stop clicking. The probability, at any step, that the person will continue is a damping factor d , generally assumed value is 0.85 So generalized PageRank algorithm is :- PR( Pi ) = 1- d/N + d ∑ P(j)/L(j)
  • 11. Understanding PageRank A Simple Example:- Consider a small universe (A Set of N pages)where , we have only to web pages, then Guess 1 st :- Say, initial page rank of each page is 1.0 and d = 0.85 PR(A) = (1 – d) + d(PR(B)/1) and PR(B) = (1 – d) + d(PR(A)/1) We get, PR(A) = 0.15 + 0.85 * 1 = 1 and PR(B) = 0.15 + 0.85 * 1 = 1 Guess 2 nd :- Say, initial page rank of each page is 40 and d = 0.85 PR(A) = (1 – d) + d(PR(B)/1) and PR(B) = (1 – d) + d(PR(A)/1) We get,
  • 12. Understanding PageRank First Calculation:- PR(A)= 0.15 + 0.85 * 40 = 34.25 , PR(B) = 0.15 + 0.85 * 0.385875 = 9.1775 Second Calculation:- PR(A)= 0.15 + 0.85 * 29.1775 = 24.950875 PR(B) = 0.15 + 0.85 * 24.950875 = 21.35824375 and so on … On Kth Calculation:- When the sum of PageRank of each page is equal to number of pages present in the set , that would be your page rank of page. Average page rank never cross to 1.
  • 13. Linear system of equations Assume in small set „x‟, we have Pages 1, 2 , 3 , 4 then transition Matrix will be, A = Please note some observations here:- Page 1:- donates = 1/3+1/3+1/3 = 1 and gains 1+1/2 = 1.5 importance. Page 2:- donates = 1/3+1/2 = 0.83 and gains 1/3 = 0.33 importance. Page 3 :- donates = 1 = 1 and gains 1/3 +1/2+1/2 = 1.33 importance. Page 4 :- donates = 1/3+1/2 = 0.83 and gains 1/2 = 0.5 importance.
  • 14. Solving Linear Equation:- Arrange Linear system = of equations We get linear equation = -> Solving this equation by substitution method(substitute value of x2, we and ), we get,
  • 15. Solving Linear Equation:- We get a vector eigenvectors corresponding to the Eigen value 1 are of the form Here we don‟t know about value of x1, choose x1/12 as some constant so we could get Eigen vector, whose average value is 1.
  • 16. Solving Linear Equation:- We could choose as a 1/31, So that, sum of PR(x) = 0.38 + 0.132 +0.29 + 0.19 = 0.992 (Since PR never cross 1 and average/Maximum PR will be 1)
  • 17. How PR help you ? How it is use full to me ? Linking your web content with many links can increase your search visibility and A outbound link from highly Ranked page optimize your search query.
  • 18. Google Bomb :- The terms Google bomb is creating large numbers of links, that cause a web page to have a high ranking for searches on unrelated or off topic keyword phrases, often for comical or satirical purposes. Example of Google bomb:- Search For “completely wrong” in Google.
  • 19. References:- I would like to thanks to Dr. Vinayak Joshi, Department of Mathematics, University of Pune, Who introduce me to this algorithm and motivated me to deliver a session in 2009. Wikipedia http://en.wikipedia.org/wiki/PageRank Department of Mathematics, Cornell University, Lecture 3 and 6 Linear Algebra by Vivek Sahai and Vikas Bist.