INSIDE GOOGLE SEARCH
CONTENTS
1. What is a search engine?
2. Examples of search engines
3. Google introduction
4. What happens when we do a web search?
5. Spiders and crawlers
6. Googlebot
7. Google’s Query Processor
8. Google’s Indexer
9. Advantages
10. Disadvantages
11. Conclusion
12. Reference
WHAT IS A SEARCH ENGINE?
Search engine:
It is a website dedicated to searching other
websites and their contents.
It is a program that searches documents for
specified keywords.
It returns a list of the documents where the
keywords were found.
EXAMPLES OF SEARCH ENGINES.
There are many search engines, but some of the most
popular are:
Google
Yahoo
Ask.com
AltaVista
Dogpile
Bing
GOOGLE INTRODUCTION.
Google began as a research project in 1996 by Larry Page and
Sergey Brin, who were both PhD students at Stanford
University.
They thought that a search engine that could analyze the
relationships between websites would produce better results than
other search engines.
They called their new creation "BackRub", because it checked
backlinks to estimate a site's importance.
The logo they had then was much different from today's logo.
The name was changed to Google in 1997, when Larry Page and
Sergey Brin registered the domain google.com. The company was
incorporated in September 1998.
Today, Google is a publicly traded company that operates one of
the most used search engines in the world.
The company currently employs 8,000 people and is based
in Mountain View, California.
It also has offices in other places, such as Seattle,
Washington.
Since its introduction in 1996, Google has grown beyond search
and now offers a wide variety of services, such as Blogger, Orkut,
and Gmail.
WHAT HAPPENS WHEN WE DO A WEB SEARCH?
When we do a Google search, we are not actually
searching the web; we are searching
Google's index of the web.
Google builds this index with software programs called
spiders.
Spiders start by fetching a few web pages,
then they follow the links on those pages and fetch the
pages they point to.
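
The fetch-and-follow loop described above can be sketched in a few lines. This is only an illustration under stated assumptions: the toy web (TOY_WEB), the example URLs, and the crawl function are hypothetical, and a real spider would download pages over HTTP and parse the links out of them.

```python
from collections import deque

# Hypothetical in-memory "web": each URL maps to the links found on that page.
TOY_WEB = {
    "a.example/": ["a.example/about", "b.example/"],
    "a.example/about": ["a.example/"],
    "b.example/": ["c.example/"],
    "c.example/": [],
}

def crawl(seeds, fetch_links, limit=100):
    """Breadth-first crawl: fetch the seed pages, then follow their links."""
    seen = set(seeds)      # URLs already queued, so no page is fetched twice
    queue = deque(seeds)   # URLs waiting to be fetched
    order = []             # URLs in the order they were fetched
    while queue and len(order) < limit:
        url = queue.popleft()
        order.append(url)
        for link in fetch_links(url):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return order

# Start from one seed page and look links up in the toy web.
visited = crawl(["a.example/"], lambda url: TOY_WEB.get(url, []))
print(visited)
```

The seen set is what keeps the spider from looping forever when pages link back to each other.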
SPIDERS OR CRAWLERS.

A spider, also known as a robot or a crawler, is a program that
follows, or "crawls", links throughout the Internet,
grabbing content from sites and adding it to search engine indexes.
Spiders can only follow links from one page to another and from
one site to another. That is the primary reason why links to your
site are so important.
Spiders find Web pages by following links from other Web pages,
but you can also submit your Web pages directly to a search
engine or directory and request a visit by their spider.
GOOGLEBOT

Googlebot is Google’s web crawling robot, which finds and
retrieves pages on the web and hands them off to the Google
indexer.
It functions much like our web browser, by sending a request to
a web server for a web page, downloading the entire page, then
handing it off to Google’s indexer.
Googlebot consists of many computers requesting and fetching
pages much more quickly than you can with your web browser.
Googlebot can request thousands of different pages simultaneously.
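
As a rough illustration of requesting many pages at once, the sketch below uses a thread pool. It is not how Googlebot is built: fake_fetch is a hypothetical stand-in for a real HTTP request, and the URLs and worker count are made up for the example.

```python
from concurrent.futures import ThreadPoolExecutor

def fake_fetch(url):
    """Hypothetical stand-in for an HTTP request; returns placeholder page text."""
    return f"<html>content of {url}</html>"

def fetch_all(urls, max_workers=8):
    """Fetch many pages concurrently and return a {url: page} mapping."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map runs fake_fetch on several URLs at a time and
        # yields the results in the same order as the input list.
        pages = list(pool.map(fake_fetch, urls))
    return dict(zip(urls, pages))

urls = [f"site{i}.example/" for i in range(20)]
pages = fetch_all(urls)
print(len(pages), "pages fetched")
```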
GOOGLE’S QUERY PROCESSOR

The query processor has several parts, including the user
interface (search box), the “engine” that evaluates queries and
matches them to relevant documents, and the results formatter.
PageRank is Google’s system for ranking web pages. A page with
a higher PageRank is deemed more important and is more likely to
be listed above a page with a lower PageRank.
Google considers over a hundred factors in computing a
PageRank and determining which documents are most relevant to a
query, including the popularity of the page, the position and size of
the search terms within the page, and the proximity of the search
terms to one another on the page.
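
To make the idea of link-based importance concrete, here is a minimal sketch of the classic published PageRank iteration on a toy link graph. It covers only the link-popularity part, not the other ranking factors mentioned above, and it should not be read as Google's production system. The graph (TOY_LINKS), the damping value of 0.85, and the iteration count are assumptions chosen for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Iteratively estimate PageRank for a {page: [linked pages]} graph."""
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}  # start with equal rank everywhere
    for _ in range(iterations):
        # Every page gets a small base share regardless of its links.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page, outgoing in links.items():
            if outgoing:
                # A page splits its rank evenly among the pages it links to.
                share = damping * rank[page] / len(outgoing)
                for target in outgoing:
                    new_rank[target] += share
            else:
                # A page with no links spreads its rank over all pages.
                for target in pages:
                    new_rank[target] += damping * rank[page] / n
        rank = new_rank
    return rank

# Hypothetical link graph: C is linked to by A, B and D; nothing links to D.
TOY_LINKS = {
    "A": ["B", "C"],
    "B": ["C"],
    "C": ["A"],
    "D": ["C"],
}
ranks = pagerank(TOY_LINKS)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

In this toy graph the page with the most incoming links ends up ranked highest, and the page nobody links to ends up lowest.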
Google applies machine-learning techniques to improve its
performance automatically by learning relationships and
associations within the stored data. One example is the
spelling-correcting system.
Google gives more priority to pages that have search terms near
each other and in the same order as the query. Google can also
match multi-word phrases and sentences.
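
One simple way to picture "near each other and in the same order" is to measure the gap between two query terms in a page. The sketch below is an assumption-laden toy, not Google's scoring: the function ordered_gap, the sample pages, and the rule "smaller gap ranks first" are all invented for illustration.

```python
def ordered_gap(text, first, second):
    """Smallest distance in words from `first` to a later `second`, or None."""
    best = None
    last_first = None  # position of the most recent occurrence of `first`
    for i, word in enumerate(text.lower().split()):
        if word == first:
            last_first = i
        elif word == second and last_first is not None:
            gap = i - last_first
            if best is None or gap < best:
                best = gap
    return best

# Hypothetical pages for the query "search engine".
PAGES = {
    "d1": "search engine basics",
    "d2": "search the web with a fast engine",
    "d3": "engine search",  # both terms present, but in the wrong order
}
gaps = {doc: ordered_gap(text, "search", "engine") for doc, text in PAGES.items()}

# Pages where the terms appear in query order, closest pair first.
ranked = sorted((d for d in PAGES if gaps[d] is not None), key=gaps.get)
print(gaps, ranked)
```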
LET’S SEE HOW GOOGLE PROCESSES A QUERY.
GOOGLE’S INDEXER.
Googlebot gives the indexer the full text of the pages it finds.
These pages are stored in Google’s index database.
This index is sorted alphabetically by search term, with each index
entry storing a list of documents in which the term appears and the
location within the text where it occurs.
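
The structure described here is usually called an inverted index. A minimal sketch follows, assuming whitespace-separated words and two made-up documents (DOCS); the function name build_index is hypothetical.

```python
from collections import defaultdict

def build_index(docs):
    """Map each term to {doc_id: [positions where the term occurs]}."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for position, word in enumerate(text.lower().split()):
            index[word][doc_id].append(position)
    return index

# Hypothetical documents.
DOCS = {
    "doc1": "google indexes the web",
    "doc2": "spiders crawl the web and the index grows",
}
index = build_index(DOCS)

# Print the entries alphabetically by term, as in the description above.
for term in sorted(index):
    print(term, dict(index[term]))
```

Looking up a term is then a single dictionary access, which is why the index is searched instead of the pages themselves.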
To improve search performance, Google ignores (doesn’t index)
common words called stop words.
Stop words are so common that they do little to narrow a search,
and therefore they can safely be discarded.
The indexer also ignores some punctuation and multiple spaces, and
converts all letters to lowercase, to improve Google’s
performance.
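
These clean-up steps can be sketched as a small normalization function. The stop-word list below is a short illustrative assumption, not Google's actual list, and this sketch drops all punctuation, whereas the text above says only some punctuation is ignored.

```python
import re

# Illustrative stop-word list (an assumption for this example).
STOP_WORDS = {"a", "an", "and", "in", "of", "the", "to"}

def normalize(text):
    """Lowercase, strip punctuation, collapse spaces, and drop stop words."""
    text = text.lower()
    text = re.sub(r"[^\w\s]", " ", text)  # replace punctuation with spaces
    words = text.split()                  # split() also collapses repeated spaces
    return [w for w in words if w not in STOP_WORDS]

terms = normalize("The  Indexer, in short:  ignores   Punctuation!")
print(terms)
```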
Advantages
The Google search box can be used as a calculator, a unit
converter and a dictionary.
It can also be used to find airport conditions, track airline flights,
find stock information, look up information in white and yellow
pages and get movie listings for your home location.
You can look up Universal Product Codes, and VINs to get
vehicle information.
Google has options for image search, article search and even
search for government documents.
It searches according to the terms you type and also searches
for other terms with the same meaning.
It is fast and reliable, and it has its own dictionary, calculator,
and spell checker.
Disadvantages
It doesn’t support full Boolean searching. You can only make
use of the default AND, the forced AND and the OR operator in
your search.
It only indexes the first 101 kilobytes of a web page. Another
search engine, Yahoo for example, indexes up to 500 kilobytes
of the text of web pages.
Although it does stem words, it doesn’t allow for truncation. You
can’t put in part of a word and get Google to “guess the rest”.
Google isn’t good for most “deep web” searches, which is why
libraries subscribe to specialized databases. However, Google is
improving in some specialized areas, such as Google Scholar, which
searches scholarly documents, Google Book Search, which searches
the full text of thousands of books, and “Find in a Library”, which
searches the OCLC database. OCLC stands for Online Computer
Library Center and is a worldwide library cooperative.
Reference
www.google.co.in
www.google.com/webmaster/tools
www.googleguide.com
www.optimum7.com
www.searchenginejournal.com
www.prattibrary.org
Conclusion
In conclusion, this presentation covered the algorithm behind
Google search, spam protection over links, how websites are
crawled and indexed on Google's servers, and how one can maintain
a website through Google Webmaster Tools.
