SlideShare uma empresa Scribd logo
1 de 29
How SearcH engineS
      work



     Presentation
          by
       cHinna
What is Search Engine
  Search engine is a software program that
searches for sites based on the words that you
          designate as search terms.

  "Search engine" is the popular term for an
      Information Retrieval (IR) system.




                                                 2
Motto of search engines
A web search engine is designed to search for
information on the World Wide Web and
FTP servers. The search results are generally
presented in a list of results often referred to
as SERPS, or "search engine results pages".
The information may consist of web pages,
images, information and other types of files.




                                                   3
Purpose of Search Engines
Helping people find what they’re looking
 for
  • Starts with an "information need"
  • Convert to a query
  • Gets results
In the materials available
  • Web pages
  • Other formats
  • Deep Web

                                            4
HISTORY
Archie – First search tool for the Internet

Gopher – indexed plain text documents

Jughead – searched the files stored in
 Gopher index systems

Wandex – First Web search engine

                                               5
How web search engines work

search engine operates in the following
                order:

            Web Crawling
              Indexing
             Searching




                                          6
How do Search Engine Works
   Spiders




   Robots




                                     7
Search is Not a Panacea
Search can’t find what’s not there
  • The content is hugely important
Information Architecture is vital
Usable sites have good navigation and
 structure




                                         8
Search Engine Modules


A query processor
A search and matching function
A ranking capability
Summarizing and Presenting documents.




                                         9
Search Engines Mode of Working in
             Earlier Days
From 1990-1998 (1st Generation of search
  tools):
  • Looked at title of web pages
  • Ranking was based on page content
     • Looked at number of times the search term
       appeared on the page
     • Looked at metatags




                                                   10
SEO (Search Engine Optimization)
Used by companies to get a higher result in
 search engines
White hat: Using legitimate techniques
Black hat: Using illegal techniques to trick
 the search engine, like paying sites to link
 to you.




                                                11
Search Processing




                    12
Search is Only as Good as the Content
Users blame the search engine
  • Even when the content is unavailable
Understand the scope of site or intranet
  • Kinds of information
  • Divided sites: products / corporate info
  • Dates
  • Languages
  • Sources and data silos: databases...
  • Update processes

                                               13
Making a Searchable Index
Store text to search it later
Many ways to gather text
  • Crawl (spider) via HTTP
  • Read files on file servers
  • Access databases (HTTP or API)
  • Data silos via local APIs
  • Applications, CMSs, via Web Services
Security and Access Control


                                           14
Robot Indexing Diagram




                         Sour




                         15
What the Index Needs
Basic information for document or record
  • File name / URL / record ID
  • Title or equivalent
  • Size, date, MIME type
Full text of item
More metadata
  • Product name, picture ID
  • Category, topic, or subject
  • Other attributes, for relevance ranking and display



                                                          16
Simple Index Diagram




                       17
Index Issues
Stopwords
Stemming
Metadata
  • Explicit (tags)
  • Implicit (context)
Semantics
  • CMS and Database fields
  • XML tags and attributes


                                18
Search Query Processing
What happens after you click the search
 button, and before retrieval starts.
Usually in this order
  • Handle character set, maybe language
  • Look for operators and organize the query
  • Look for field names or metadata
  • Extract words (just like the indexer)
  • Deal with letter casing


                                                19
Search and Retrieval
Retrieval: find files with query terms
Not the same as relevance ranking
Recall: find all
 relevant items
Precision: find only
 relevant items
Increasing one
 decreases the other



                                          20
Retrieval = Matching
Single-word queries
  • Find items containing that word
Multi-word queries: combine lists
  • Any: every item with any query word
  • All: only items with every word
  • Phrases: find only items with all words in
    order
Boolean and complex queries
  • Use algorithm to combine lists

                                                 21
Why Searches Fail
Empty search
Nothing on the site on that topic (scope)
Misspelling or typing mistakes
Vocabulary differences
Restrictive search defaults
Restrictive search choices
Software failure


                                             22
Relevance Ranking
Theory: sort the matching items, so the most
 relevant ones appear first
Can't really know what the user wants
Relevance is hard to define and situational
Short queries tend to be deeply ambiguous
  • What do people mean when they type “bank”?
First 10 results are the most important



                                                 23
Relevance Processing
Sorting documents on various criteria
Start with words matching query terms
Citation and link analysis
  • Like old library Citation Indexes
  • Not only hypertext, but the links
  • Google PageRank
    • Incoming links
    • Authority of linkers
Taxonomies and external metadata

                                         24
Search Results Interface
What users see after they click the Search
 button
The most visible part of search
Elements of the results page
  • Page layout and navigation
  • Results header
  • List of results items
  • Results footer


                                              25
Search Suggestions
Human judgment beats algorithms
Great for frequent, ambiguous searches
  • Use search log to identify best candidates
Recommend good starting pages
    • Product information, FAQs, etc.
Requires human resources
  • That means money and time
More static than algorithmic search


                                                 26
Search Metrics

           Number of searches
     Number of matches searches
Traffic from search to high-value pages
 Relate search changes to other metrics




                                          27
Query Example
Consider the Query Mahendra Singh Dhoni

   A good answer contains all the three words, and more
frequently the better, we call this Term Frequency(TF)

 Some Query terms are more important those have better
discriminating power than others

 For example an answer containing only "Dhoni" is likely to
be better than an answer containing only “Mahendra“
We call this Inverse Document Frequency (IDF)


                                                              28
Search Will Never Be Perfect
Search engines can’t read minds
  • User queries are short and ambiguous
Some things will help
  • Design a usable interface
  • Show match words in context
  • Keep index current and complete
  • Adjust heuristic weighting
  • Maintain suggestions and synonyms
  • Consider faceted metadata search

                                           29

Mais conteúdo relacionado

Mais procurados

Search engines and web directories
Search engines and web directoriesSearch engines and web directories
Search engines and web directoriesKJ Limbo
 
Internet Search Presentation
Internet Search PresentationInternet Search Presentation
Internet Search PresentationSteve Guinan
 
Types and overview of Search Engine
Types and overview of Search Engine  Types and overview of Search Engine
Types and overview of Search Engine `Shweta Bhavsar
 
On page SEO Optimization & it's Techniques
On page SEO Optimization & it's TechniquesOn page SEO Optimization & it's Techniques
On page SEO Optimization & it's TechniquesPratibha Maurya
 
Introduction to web page
Introduction to web pageIntroduction to web page
Introduction to web pageMahmoud Shaqria
 
Search Engines Presentation
Search Engines PresentationSearch Engines Presentation
Search Engines PresentationJSCHO9
 
Internet and search engine
Internet and search engineInternet and search engine
Internet and search engineDeepak John
 
Types of Web Browser
Types of Web BrowserTypes of Web Browser
Types of Web BrowserRajesh Yadav
 
Web browser(pp ts)
Web browser(pp ts)Web browser(pp ts)
Web browser(pp ts)darpan1118
 

Mais procurados (20)

Search Engine
Search EngineSearch Engine
Search Engine
 
Search engine
Search engineSearch engine
Search engine
 
Search engine
Search engineSearch engine
Search engine
 
Search engines and web directories
Search engines and web directoriesSearch engines and web directories
Search engines and web directories
 
Search engines
Search enginesSearch engines
Search engines
 
Internet Search Presentation
Internet Search PresentationInternet Search Presentation
Internet Search Presentation
 
How search engine work ppt
How search engine work pptHow search engine work ppt
How search engine work ppt
 
Search Engine
Search EngineSearch Engine
Search Engine
 
Types and overview of Search Engine
Types and overview of Search Engine  Types and overview of Search Engine
Types and overview of Search Engine
 
Search engine ppt
Search engine pptSearch engine ppt
Search engine ppt
 
SEO PPT
SEO PPTSEO PPT
SEO PPT
 
Search engine
Search engineSearch engine
Search engine
 
On page SEO Optimization & it's Techniques
On page SEO Optimization & it's TechniquesOn page SEO Optimization & it's Techniques
On page SEO Optimization & it's Techniques
 
Searching techniques
Searching techniquesSearching techniques
Searching techniques
 
Introduction to web page
Introduction to web pageIntroduction to web page
Introduction to web page
 
Search Engines Presentation
Search Engines PresentationSearch Engines Presentation
Search Engines Presentation
 
Internet and search engine
Internet and search engineInternet and search engine
Internet and search engine
 
Types of Web Browser
Types of Web BrowserTypes of Web Browser
Types of Web Browser
 
Web browser(pp ts)
Web browser(pp ts)Web browser(pp ts)
Web browser(pp ts)
 
Search Engine
Search Engine Search Engine
Search Engine
 

Destaque

How Do Search Engines Work
How Do Search Engines WorkHow Do Search Engines Work
How Do Search Engines WorkPromozSEO
 
Engine Presentation
Engine PresentationEngine Presentation
Engine PresentationZeenoside
 
4-stroke Diesel Engine
4-stroke Diesel Engine4-stroke Diesel Engine
4-stroke Diesel EngineMridul Rawat
 
Search Engine Powerpoint
Search Engine PowerpointSearch Engine Powerpoint
Search Engine Powerpoint201014161
 
Basic SEO Presentation
Basic SEO PresentationBasic SEO Presentation
Basic SEO PresentationPaul Kortman
 
INTERNAL COMBUSTION ENGINES PPT
INTERNAL COMBUSTION ENGINES PPT INTERNAL COMBUSTION ENGINES PPT
INTERNAL COMBUSTION ENGINES PPT AKASH1001
 
Introduction to SEO
Introduction to SEOIntroduction to SEO
Introduction to SEORand Fishkin
 
Diesel engine Powerpoint
Diesel engine PowerpointDiesel engine Powerpoint
Diesel engine Powerpointkaushdave
 

Destaque (10)

Information organization
Information organization Information organization
Information organization
 
How Do Search Engines Work
How Do Search Engines WorkHow Do Search Engines Work
How Do Search Engines Work
 
Engine Presentation
Engine PresentationEngine Presentation
Engine Presentation
 
4-stroke Diesel Engine
4-stroke Diesel Engine4-stroke Diesel Engine
4-stroke Diesel Engine
 
Search Engine Powerpoint
Search Engine PowerpointSearch Engine Powerpoint
Search Engine Powerpoint
 
Basic SEO Presentation
Basic SEO PresentationBasic SEO Presentation
Basic SEO Presentation
 
Diesel engine
Diesel engineDiesel engine
Diesel engine
 
INTERNAL COMBUSTION ENGINES PPT
INTERNAL COMBUSTION ENGINES PPT INTERNAL COMBUSTION ENGINES PPT
INTERNAL COMBUSTION ENGINES PPT
 
Introduction to SEO
Introduction to SEOIntroduction to SEO
Introduction to SEO
 
Diesel engine Powerpoint
Diesel engine PowerpointDiesel engine Powerpoint
Diesel engine Powerpoint
 

Semelhante a How search engines work

Charting Searchland, ACM SIG Data Mining
Charting Searchland, ACM SIG Data MiningCharting Searchland, ACM SIG Data Mining
Charting Searchland, ACM SIG Data MiningValeria de Paiva
 
Search engines by Gulshan K Maheshwari(QAU)
Search engines by Gulshan  K Maheshwari(QAU)Search engines by Gulshan  K Maheshwari(QAU)
Search engines by Gulshan K Maheshwari(QAU)GulshanKumar368
 
How search engines work Anand Saini
How search engines work Anand SainiHow search engines work Anand Saini
How search engines work Anand SainiDr,Saini Anand
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notesAnandh Arumugakan
 
Web search engines and search technology
Web search engines and search technologyWeb search engines and search technology
Web search engines and search technologyStefanos Anastasiadis
 
UX: internal search for e-commerce
UX: internal search for e-commerceUX: internal search for e-commerce
UX: internal search for e-commerceMyriam Jessier
 
How to SEO a Terrific - and Profitable - User Experience
How to SEO a Terrific - and Profitable - User ExperienceHow to SEO a Terrific - and Profitable - User Experience
How to SEO a Terrific - and Profitable - User ExperienceBrightEdge
 
Evaluating search engines
Evaluating search enginesEvaluating search engines
Evaluating search enginesPhil Bradley
 
Seo Beginners Guide SriG Systems
Seo Beginners Guide SriG SystemsSeo Beginners Guide SriG Systems
Seo Beginners Guide SriG SystemsSriG Systems
 
TechFuse 2013 - Break down the walls SharePoint 2013
TechFuse 2013 - Break down the walls SharePoint 2013TechFuse 2013 - Break down the walls SharePoint 2013
TechFuse 2013 - Break down the walls SharePoint 2013Avtex
 
Internet Research Presentation
Internet Research PresentationInternet Research Presentation
Internet Research Presentationadeason
 
Advanced google searching (1)
Advanced google searching (1)Advanced google searching (1)
Advanced google searching (1)Brenda Crawford
 
Search Engines Other than Google
Search Engines Other than GoogleSearch Engines Other than Google
Search Engines Other than GoogleDr Trivedi
 
SPConnections - What's new in SharePoint 2013 Search
SPConnections - What's new in SharePoint 2013 SearchSPConnections - What's new in SharePoint 2013 Search
SPConnections - What's new in SharePoint 2013 SearchAgnes Molnar
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologiesenterprisesearchmeetup
 
Data analytics and SEO to grow your international business | John Caldwell | ...
Data analytics and SEO to grow your international business | John Caldwell | ...Data analytics and SEO to grow your international business | John Caldwell | ...
Data analytics and SEO to grow your international business | John Caldwell | ...Enterprise Ireland
 
ECIR-2014: Multilanguage Content Discovery Through Entity Driven Search
ECIR-2014: Multilanguage Content Discovery Through Entity Driven SearchECIR-2014: Multilanguage Content Discovery Through Entity Driven Search
ECIR-2014: Multilanguage Content Discovery Through Entity Driven SearchAntonio David Pérez Morales
 

Semelhante a How search engines work (20)

Charting Searchland, ACM SIG Data Mining
Charting Searchland, ACM SIG Data MiningCharting Searchland, ACM SIG Data Mining
Charting Searchland, ACM SIG Data Mining
 
Searchland2
Searchland2Searchland2
Searchland2
 
Search engines by Gulshan K Maheshwari(QAU)
Search engines by Gulshan  K Maheshwari(QAU)Search engines by Gulshan  K Maheshwari(QAU)
Search engines by Gulshan K Maheshwari(QAU)
 
How search engines work Anand Saini
How search engines work Anand SainiHow search engines work Anand Saini
How search engines work Anand Saini
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
 
Search engines
Search enginesSearch engines
Search engines
 
Web search engines and search technology
Web search engines and search technologyWeb search engines and search technology
Web search engines and search technology
 
UX: internal search for e-commerce
UX: internal search for e-commerceUX: internal search for e-commerce
UX: internal search for e-commerce
 
How to SEO a Terrific - and Profitable - User Experience
How to SEO a Terrific - and Profitable - User ExperienceHow to SEO a Terrific - and Profitable - User Experience
How to SEO a Terrific - and Profitable - User Experience
 
Evaluating search engines
Evaluating search enginesEvaluating search engines
Evaluating search engines
 
Seo Beginners Guide SriG Systems
Seo Beginners Guide SriG SystemsSeo Beginners Guide SriG Systems
Seo Beginners Guide SriG Systems
 
TechFuse 2013 - Break down the walls SharePoint 2013
TechFuse 2013 - Break down the walls SharePoint 2013TechFuse 2013 - Break down the walls SharePoint 2013
TechFuse 2013 - Break down the walls SharePoint 2013
 
Internet Research Presentation
Internet Research PresentationInternet Research Presentation
Internet Research Presentation
 
Advanced google searching (1)
Advanced google searching (1)Advanced google searching (1)
Advanced google searching (1)
 
Search Engines Other than Google
Search Engines Other than GoogleSearch Engines Other than Google
Search Engines Other than Google
 
SPConnections - What's new in SharePoint 2013 Search
SPConnections - What's new in SharePoint 2013 SearchSPConnections - What's new in SharePoint 2013 Search
SPConnections - What's new in SharePoint 2013 Search
 
Riley-o.com
Riley-o.comRiley-o.com
Riley-o.com
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
 
Data analytics and SEO to grow your international business | John Caldwell | ...
Data analytics and SEO to grow your international business | John Caldwell | ...Data analytics and SEO to grow your international business | John Caldwell | ...
Data analytics and SEO to grow your international business | John Caldwell | ...
 
ECIR-2014: Multilanguage Content Discovery Through Entity Driven Search
ECIR-2014: Multilanguage Content Discovery Through Entity Driven SearchECIR-2014: Multilanguage Content Discovery Through Entity Driven Search
ECIR-2014: Multilanguage Content Discovery Through Entity Driven Search
 

Último

Booking open Available Pune Call Girls Talegaon Dabhade 6297143586 Call Hot ...
Booking open Available Pune Call Girls Talegaon Dabhade  6297143586 Call Hot ...Booking open Available Pune Call Girls Talegaon Dabhade  6297143586 Call Hot ...
Booking open Available Pune Call Girls Talegaon Dabhade 6297143586 Call Hot ...Call Girls in Nagpur High Profile
 
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...priyasharma62062
 
Gurley shaw Theory of Monetary Economics.
Gurley shaw Theory of Monetary Economics.Gurley shaw Theory of Monetary Economics.
Gurley shaw Theory of Monetary Economics.Vinodha Devi
 
Vip Call US 📞 7738631006 ✅Call Girls In Sakinaka ( Mumbai )
Vip Call US 📞 7738631006 ✅Call Girls In Sakinaka ( Mumbai )Vip Call US 📞 7738631006 ✅Call Girls In Sakinaka ( Mumbai )
Vip Call US 📞 7738631006 ✅Call Girls In Sakinaka ( Mumbai )Pooja Nehwal
 
Webinar on E-Invoicing for Fintech Belgium
Webinar on E-Invoicing for Fintech BelgiumWebinar on E-Invoicing for Fintech Belgium
Webinar on E-Invoicing for Fintech BelgiumFinTech Belgium
 
Top Rated Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated  Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Top Rated  Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Call Girls in Nagpur High Profile
 
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...dipikadinghjn ( Why You Choose Us? ) Escorts
 
Call Girls Koregaon Park Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Koregaon Park Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Koregaon Park Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Koregaon Park Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Call Girls Banaswadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Banaswadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Banaswadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Banaswadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
( Jasmin ) Top VIP Escorts Service Dindigul 💧 7737669865 💧 by Dindigul Call G...
( Jasmin ) Top VIP Escorts Service Dindigul 💧 7737669865 💧 by Dindigul Call G...( Jasmin ) Top VIP Escorts Service Dindigul 💧 7737669865 💧 by Dindigul Call G...
( Jasmin ) Top VIP Escorts Service Dindigul 💧 7737669865 💧 by Dindigul Call G...dipikadinghjn ( Why You Choose Us? ) Escorts
 
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...dipikadinghjn ( Why You Choose Us? ) Escorts
 
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...ssifa0344
 
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...dipikadinghjn ( Why You Choose Us? ) Escorts
 
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...priyasharma62062
 
falcon-invoice-discounting-unlocking-prime-investment-opportunities
falcon-invoice-discounting-unlocking-prime-investment-opportunitiesfalcon-invoice-discounting-unlocking-prime-investment-opportunities
falcon-invoice-discounting-unlocking-prime-investment-opportunitiesFalcon Invoice Discounting
 
Top Rated Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
Top Rated  Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...Top Rated  Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
Top Rated Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...Call Girls in Nagpur High Profile
 
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...priyasharma62062
 
Business Principles, Tools, and Techniques in Participating in Various Types...
Business Principles, Tools, and Techniques  in Participating in Various Types...Business Principles, Tools, and Techniques  in Participating in Various Types...
Business Principles, Tools, and Techniques in Participating in Various Types...jeffreytingson
 

Último (20)

Booking open Available Pune Call Girls Talegaon Dabhade 6297143586 Call Hot ...
Booking open Available Pune Call Girls Talegaon Dabhade  6297143586 Call Hot ...Booking open Available Pune Call Girls Talegaon Dabhade  6297143586 Call Hot ...
Booking open Available Pune Call Girls Talegaon Dabhade 6297143586 Call Hot ...
 
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
Mira Road Memorable Call Grls Number-9833754194-Bhayandar Speciallty Call Gir...
 
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
Call Girls in New Ashok Nagar, (delhi) call me [9953056974] escort service 24X7
 
Gurley shaw Theory of Monetary Economics.
Gurley shaw Theory of Monetary Economics.Gurley shaw Theory of Monetary Economics.
Gurley shaw Theory of Monetary Economics.
 
Vip Call US 📞 7738631006 ✅Call Girls In Sakinaka ( Mumbai )
Vip Call US 📞 7738631006 ✅Call Girls In Sakinaka ( Mumbai )Vip Call US 📞 7738631006 ✅Call Girls In Sakinaka ( Mumbai )
Vip Call US 📞 7738631006 ✅Call Girls In Sakinaka ( Mumbai )
 
Webinar on E-Invoicing for Fintech Belgium
Webinar on E-Invoicing for Fintech BelgiumWebinar on E-Invoicing for Fintech Belgium
Webinar on E-Invoicing for Fintech Belgium
 
Top Rated Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated  Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...Top Rated  Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
Top Rated Pune Call Girls Lohegaon ⟟ 6297143586 ⟟ Call Me For Genuine Sex Se...
 
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
VIP Independent Call Girls in Bandra West 🌹 9920725232 ( Call Me ) Mumbai Esc...
 
Call Girls Koregaon Park Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Koregaon Park Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Koregaon Park Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Koregaon Park Call Me 7737669865 Budget Friendly No Advance Booking
 
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
From Luxury Escort Service Kamathipura : 9352852248 Make on-demand Arrangemen...
 
Call Girls Banaswadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Banaswadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Banaswadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Banaswadi Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
( Jasmin ) Top VIP Escorts Service Dindigul 💧 7737669865 💧 by Dindigul Call G...
( Jasmin ) Top VIP Escorts Service Dindigul 💧 7737669865 💧 by Dindigul Call G...( Jasmin ) Top VIP Escorts Service Dindigul 💧 7737669865 💧 by Dindigul Call G...
( Jasmin ) Top VIP Escorts Service Dindigul 💧 7737669865 💧 by Dindigul Call G...
 
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
VIP Independent Call Girls in Taloja 🌹 9920725232 ( Call Me ) Mumbai Escorts ...
 
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...
Solution Manual for Financial Accounting, 11th Edition by Robert Libby, Patri...
 
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
VIP Call Girl in Mumbai Central 💧 9920725232 ( Call Me ) Get A New Crush Ever...
 
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...
Kharghar Blowjob Housewife Call Girls NUmber-9833754194-CBD Belapur Internati...
 
falcon-invoice-discounting-unlocking-prime-investment-opportunities
falcon-invoice-discounting-unlocking-prime-investment-opportunitiesfalcon-invoice-discounting-unlocking-prime-investment-opportunities
falcon-invoice-discounting-unlocking-prime-investment-opportunities
 
Top Rated Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
Top Rated  Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...Top Rated  Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
Top Rated Pune Call Girls Sinhagad Road ⟟ 6297143586 ⟟ Call Me For Genuine S...
 
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...
Navi Mumbai Cooperetive Housewife Call Girls-9833754194-Natural Panvel Enjoye...
 
Business Principles, Tools, and Techniques in Participating in Various Types...
Business Principles, Tools, and Techniques  in Participating in Various Types...Business Principles, Tools, and Techniques  in Participating in Various Types...
Business Principles, Tools, and Techniques in Participating in Various Types...
 

How search engines work

  • 1. How SearcH engineS work Presentation by cHinna
  • 2. What is Search Engine Search engine is a software program that searches for sites based on the words that you designate as search terms. "Search engine" is the popular term for an Information Retrieval (IR) system. 2
  • 3. Motto of search engines A web search engine is designed to search for information on the World Wide Web and FTP servers. The search results are generally presented in a list of results often referred to as SERPS, or "search engine results pages". The information may consist of web pages, images, information and other types of files. 3
  • 4. Purpose of Search Engines Helping people find what they’re looking for • Starts with an "information need" • Convert to a query • Gets results In the materials available • Web pages • Other formats • Deep Web 4
  • 5. HISTORY Archie – First search tool for the Internet Gopher – indexed plain text documents Jughead – searched the files stored in Gopher index systems Wandex – First Web search engine 5
  • 6. How web search engines work search engine operates in the following order: Web Crawling Indexing Searching 6
  • 7. How do Search Engine Works  Spiders  Robots 7
  • 8. Search is Not a Panacea Search can’t find what’s not there • The content is hugely important Information Architecture is vital Usable sites have good navigation and structure 8
  • 9. Search Engine Modules A query processor A search and matching function A ranking capability Summarizing and Presenting documents. 9
  • 10. Search Engines Mode of Working in Earlier Days From 1990-1998 (1st Generation of search tools): • Looked at title of web pages • Ranking was based on page content • Looked at number of times the search term appeared on the page • Looked at metatags 10
  • 11. SEO (Search Engine Optimization) Used by companies to get a higher result in search engines White hat: Using legitimate techniques Black hat: Using illegal techniques to trick the search engine, like paying sites to link to you. 11
  • 13. Search is Only as Good as the Content Users blame the search engine • Even when the content is unavailable Understand the scope of site or intranet • Kinds of information • Divided sites: products / corporate info • Dates • Languages • Sources and data silos: databases... • Update processes 13
  • 14. Making a Searchable Index Store text to search it later Many ways to gather text • Crawl (spider) via HTTP • Read files on file servers • Access databases (HTTP or API) • Data silos via local APIs • Applications, CMSs, via Web Services Security and Access Control 14
  • 16. What the Index Needs Basic information for document or record • File name / URL / record ID • Title or equivalent • Size, date, MIME type Full text of item More metadata • Product name, picture ID • Category, topic, or subject • Other attributes, for relevance ranking and display 16
  • 18. Index Issues Stopwords Stemming Metadata • Explicit (tags) • Implicit (context) Semantics • CMS and Database fields • XML tags and attributes 18
  • 19. Search Query Processing What happens after you click the search button, and before retrieval starts. Usually in this order • Handle character set, maybe language • Look for operators and organize the query • Look for field names or metadata • Extract words (just like the indexer) • Deal with letter casing 19
  • 20. Search and Retrieval Retrieval: find files with query terms Not the same as relevance ranking Recall: find all relevant items Precision: find only relevant items Increasing one decreases the other 20
  • 21. Retrieval = Matching Single-word queries • Find items containing that word Multi-word queries: combine lists • Any: every item with any query word • All: only items with every word • Phrases: find only items with all words in order Boolean and complex queries • Use algorithm to combine lists 21
  • 22. Why Searches Fail Empty search Nothing on the site on that topic (scope) Misspelling or typing mistakes Vocabulary differences Restrictive search defaults Restrictive search choices Software failure 22
  • 23. Relevance Ranking Theory: sort the matching items, so the most relevant ones appear first Can't really know what the user wants Relevance is hard to define and situational Short queries tend to be deeply ambiguous • What do people mean when they type “bank”? First 10 results are the most important 23
  • 24. Relevance Processing Sorting documents on various criteria Start with words matching query terms Citation and link analysis • Like old library Citation Indexes • Not only hypertext, but the links • Google PageRank • Incoming links • Authority of linkers Taxonomies and external metadata 24
  • 25. Search Results Interface What users see after they click the Search button The most visible part of search Elements of the results page • Page layout and navigation • Results header • List of results items • Results footer 25
  • 26. Search Suggestions Human judgment beats algorithms Great for frequent, ambiguous searches • Use search log to identify best candidates Recommend good starting pages • Product information, FAQs, etc. Requires human resources • That means money and time More static than algorithmic search 26
  • 27. Search Metrics Number of searches Number of matches searches Traffic from search to high-value pages Relate search changes to other metrics 27
  • 28. Query Example Consider the Query Mahendra Singh Dhoni A good answer contains all the three words, and more frequently the better, we call this Term Frequency(TF) Some Query terms are more important those have better discriminating power than others For example an answer containing only "Dhoni" is likely to be better than an answer containing only “Mahendra“ We call this Inverse Document Frequency (IDF) 28
  • 29. Search Will Never Be Perfect Search engines can’t read minds • User queries are short and ambiguous Some things will help • Design a usable interface • Show match words in context • Keep index current and complete • Adjust heuristic weighting • Maintain suggestions and synonyms • Consider faceted metadata search 29

Notas do Editor

  1. Adaptive Path
  2. Adaptive Path
  3. Adaptive Path
  4. Adaptive Path