SlideShare uma empresa Scribd logo
1 de 31
OSLO STOCKHOLM LONDON BOSTON SINGAPORE
Search Analytics
Comperio - Seminar on Searchdriven
Websites and Analytics of Searchlogs
Stockholm Digital Days 2013-05-22
Bo Engren
Agenda
• What is Search(log) Analytics?
• Improving Search
• Best Practices & Administration
• QA
Web Analytics vs. Search Analytics
The difference between Web
Analytics and Search Analytics is
that Web shows what the users
actually have been doing, Search
shows their intent.
(and btw Search Analytics isn’t SEO either)
The challenges with Search
I can’t find
what I’m
looking for
Content
is old
Duplicates
and
versions
Not
maintained
Too many
choices
Language
and
domain
vocabulary
Poor user
experience
etc…
The relevancy threshold
By raising the
relevance with
40%, we can
move the
search solution
from low to
high trust.
Tuning relevancy - toolboxes
The search team
Best Practices & Administration
Operational steps for good search
DEFINE
SCOPE
IMPLEMENT
RELEASE
MAINTAIN
Understand business
needs
• Understand what you are
trying to achieve
• Plan and define goals
• Identify good trends, ROI
Measure and refine
• Monitor and use query
information
• Mine query logs
• Measure effectiveness
of search towards a
target
Output and benefits
• Better search
• Better results
• Enhanced usability
• Enhanced revenues
Search
customer
Analyzing search logs – fundamentals
When you have defined your business needs
Monitor your search logs...
...again and again and again
Look for
• Specific queries
• General queries
• Queries with zero results
• Filter away junk!
Know your search distribution
350 10.0000
0
500
20%
80%
Similar
searches
Unique
searches
Frequency
Query term
Can we find
patterns in
this type of
searches?
Take good
care of your
top queries
Frequent queries
Visualized Search history. Most frequent query terms
Unique queries example
A lot of
product code
searches
Sample Query report
Zero Result Queries
9.95% of today’s queries
return no results
Create a synonym for the query
Select time period
Empty result sets
How do we fix empty result sets?
• Investigate why!
– Spelling errors?
– Semantics?
– UI difficulties?
• Correct the underlying causes
Create Synonyms
Top/Frequent queries
How do we serve frequent queries best?
• Ensure good relevance
• Apply best bets
• If ambient, present options to narrow results
• If specific, make sure user get to the goal
Content Search - Refiners
• Filters are based on words in documents
• Words are used to tag the document with predefined set of Filter
names
Result Refiners
Enables filtering
Boosts and Blocks
• Boosting is the process of changing the
“natural” rank to alter the position of a document
within the result set
Apply selected Linguistic Features
• Automatic language detection
• Approximate matching (spell checking) “cort”, “court”
• Lemmatization Noun: “car”  “cars”
Verb: “break”  “break”, “breaks”, “broke”
•
• Synonyms “color” = “colour”
“car” = “automobile”
• Proper Name and Phrasing /Spellcheck “Venus Williams”, “French Open”
• Anti-phrasing (Stopwords) “[I want a] Nikon camera”
• Character Normalization “Molière -> Moliere”
• Tokenization (CJK support) “market-shares” -> “market shares”
• Phonetic Search “Eyvind”, “Oyvind” -> “Eyvind”
• Automatic spelltuning Based on index contents
When implemented properly can drastically improve the
usefulness of a search
Search statistics – several tools available
• Start with the searchlogs:
– Use the built in tools
– Loggparsers (IIS loggparser etc.)
– Webanalytics tools (Google Analytics,
Webtrends etc.)
– Log management (logstash, kibana)
– Big data (Hadoop, pig)
Visual searchresults
Comperio internal Knowledge Management DB February 2013
Statistic analysis – Best Practice
• Zero hit results  key to monitor and remove
• Analyze the Top queries
• Trends over time – group by day/week/month
• Separate internal and external searches
• Group the queries for better understanding (for
example products, documents, persons)
Examples of Metrics for Search
Analytics – select a few initally
Search perspective
Measures Definition
Metric
type
Total queries Total number of search queries #
Clicks Total number of clicks that goes from search results to final file or page #
Satisfied queries Percentage of search results with at least one click %
Opportunity queries Percentage of search results with no click %
Visits with keyword searches Percentage of web visitors that use search %
Visits with guided product search Percentage of web visitors that use guided product search %
Visits with browsing searches Percentage of web visitors that use browseing searches e.g. listings %
Search result exits Percentage of web visitors that exit the website on the search result page %
Searches with zero results Percentage of searches that end up with zero results %
Search depth Depth after search result page #
Refined searches Number of searches refined with new query text after result view #
Result relevancy Relevancy of search results, based on recall/precision test model and test set #
Query suggestion use Number of searches performed with suggested queries #
Related queries Number of searches with related queries used #
Filtered queries Number of searches with query refinement filters #
Time to destination Time spent from search to final result Time
Result sidebar use Percentage of clicks on sidebar results on result page views %
Advanced queries Number of advanced queries performed with boolean or filter operators #
Best bets use Percentage of clicks on manual top results when displayed %
Improve results of searches - Best
Practice
Improve similar searches (fat head)
• Autocomplete
• Best bets
Improve uniqe searches (long tail)
• Spellchecking
• Synonyms
• Adjust your content
Internal searches – do we understand
the context of the user?
• Start with the User
– Study/test your User Stories.
Example: You are going to start a new project.
Do you find what you need to get started?
– Use Online surverys for deeper insights
All search platforms need maintenance
• A team that specializes in search
and related technologies
– Front end search specialists
– Search analysts
• Examples of Tasks
– Sounding board for proposed projects or reported
problems
– Cataloguing agreed search best practice
– Control vocabularies and taxonomies
– Monitoring and tuning
– In-house training
Search Analytics – Summary 1
• Make someone responsible for search - Appoint a
Search Manager
• Set a search strategy which enables the business
strategy and is in line with overall IT-strategy
• Make the Business Case
• Measure and Monitor Search Queries = Search
Analytics
• Enable User Feedback
• Raise quality of information by adding metadata and
doing content lifecycle management
• Add metadata - manual, mandatory or automatic?
Search Analytics - Summary 2
• Establish processes to deliver feedback to your
Stakeholders regarding the search logs
– Separate External and Internal sites?
• Educate information creators - simple handouts and
sit-downs
• Apply spelling suggestions, key-matches and auto-
complete
• What can we do as Editors and what do we need
Techies to do?
– You can do more than you think!
Thanks for listening
and time for QA!

Mais conteúdo relacionado

Destaque (10)

Catalogo COREP
Catalogo COREPCatalogo COREP
Catalogo COREP
 
El rey leon
El rey leonEl rey leon
El rey leon
 
Developing strategic thinking acumen
Developing strategic thinking acumenDeveloping strategic thinking acumen
Developing strategic thinking acumen
 
Dash LLM
Dash LLMDash LLM
Dash LLM
 
Cv renny mathew_hw engineer (2)
Cv renny mathew_hw engineer (2)Cv renny mathew_hw engineer (2)
Cv renny mathew_hw engineer (2)
 
DOC108.PDF
DOC108.PDFDOC108.PDF
DOC108.PDF
 
Presentation on personal loan
Presentation on personal loanPresentation on personal loan
Presentation on personal loan
 
cover letter and cv 2017
cover letter and cv 2017cover letter and cv 2017
cover letter and cv 2017
 
SharePoint 2013 Enterprise Search Prjoect Learnings - Comperio
SharePoint 2013 Enterprise Search Prjoect Learnings - ComperioSharePoint 2013 Enterprise Search Prjoect Learnings - Comperio
SharePoint 2013 Enterprise Search Prjoect Learnings - Comperio
 
Virksomhetssøk for prosjekt - Comperio
Virksomhetssøk for prosjekt  - ComperioVirksomhetssøk for prosjekt  - Comperio
Virksomhetssøk for prosjekt - Comperio
 

Semelhante a Search Analytics - Comperio

"Unstoppable Traffic" SEO cheat sheets for you
"Unstoppable Traffic" SEO cheat sheets for you"Unstoppable Traffic" SEO cheat sheets for you
"Unstoppable Traffic" SEO cheat sheets for you
vidyamittal
 
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
Simplilearn
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
enterprisesearchmeetup
 

Semelhante a Search Analytics - Comperio (20)

Keyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEOKeyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEO
 
Best Practices for Enterprise Search
Best Practices for Enterprise SearchBest Practices for Enterprise Search
Best Practices for Enterprise Search
 
HighRoad U Webinar: How to Create a Keyword Strategy
HighRoad U Webinar: How to Create a Keyword StrategyHighRoad U Webinar: How to Create a Keyword Strategy
HighRoad U Webinar: How to Create a Keyword Strategy
 
Keywords and Keyword Research by Bruce Clay
Keywords and Keyword Research by Bruce ClayKeywords and Keyword Research by Bruce Clay
Keywords and Keyword Research by Bruce Clay
 
Tuning Up Site Search - IA Summit 2007
Tuning Up Site Search - IA Summit 2007Tuning Up Site Search - IA Summit 2007
Tuning Up Site Search - IA Summit 2007
 
"Unstoppable Traffic" SEO cheat sheets for you
"Unstoppable Traffic" SEO cheat sheets for you"Unstoppable Traffic" SEO cheat sheets for you
"Unstoppable Traffic" SEO cheat sheets for you
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise Search
 
KEYWORD RESEARCH & SEO
KEYWORD RESEARCH & SEO KEYWORD RESEARCH & SEO
KEYWORD RESEARCH & SEO
 
Secrets to Identify Highly-Effective SEO Keywords
Secrets to Identify Highly-Effective SEO KeywordsSecrets to Identify Highly-Effective SEO Keywords
Secrets to Identify Highly-Effective SEO Keywords
 
Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...
 
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
 
Search engine optimization
Search engine optimizationSearch engine optimization
Search engine optimization
 
Search Engine Optimization
Search Engine OptimizationSearch Engine Optimization
Search Engine Optimization
 
SEO for Beginners-- What is Search Engine Optimization (SEO) ?
SEO for Beginners-- What is Search Engine Optimization (SEO) ?SEO for Beginners-- What is Search Engine Optimization (SEO) ?
SEO for Beginners-- What is Search Engine Optimization (SEO) ?
 
Search Engine Optimization | Derin Dolen
Search Engine Optimization | Derin DolenSearch Engine Optimization | Derin Dolen
Search Engine Optimization | Derin Dolen
 
Search Quality Management
Search Quality ManagementSearch Quality Management
Search Quality Management
 
Search Solutions 2015: Towards a new model of search relevance testing
Search Solutions 2015:  Towards a new model of search relevance testingSearch Solutions 2015:  Towards a new model of search relevance testing
Search Solutions 2015: Towards a new model of search relevance testing
 
The Power of SEO
The Power of SEOThe Power of SEO
The Power of SEO
 
MMM, Search!
MMM, Search!MMM, Search!
MMM, Search!
 
Relevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search TechnologiesRelevancy and Search Quality Analysis - Search Technologies
Relevancy and Search Quality Analysis - Search Technologies
 

Mais de Comperio - Search Matters.

Produktivitet 1.0 - Comperio Seminar oktober 2012
Produktivitet 1.0 - Comperio Seminar oktober 2012Produktivitet 1.0 - Comperio Seminar oktober 2012
Produktivitet 1.0 - Comperio Seminar oktober 2012
Comperio - Search Matters.
 
Search solutions for big data and collaboration - Comperio seminar October 2012
Search solutions for big data and collaboration - Comperio seminar October 2012Search solutions for big data and collaboration - Comperio seminar October 2012
Search solutions for big data and collaboration - Comperio seminar October 2012
Comperio - Search Matters.
 

Mais de Comperio - Search Matters. (15)

Samhandlingsløsninger med søk på tvers av kilder
Samhandlingsløsninger med søk på tvers av kilderSamhandlingsløsninger med søk på tvers av kilder
Samhandlingsløsninger med søk på tvers av kilder
 
Søkeløsningen dine kolleger drømmer om
Søkeløsningen dine kolleger drømmer omSøkeløsningen dine kolleger drømmer om
Søkeløsningen dine kolleger drømmer om
 
SharePoint Search mot 360 og ProArc
SharePoint Search mot 360 og ProArcSharePoint Search mot 360 og ProArc
SharePoint Search mot 360 og ProArc
 
NDC lightning SharePoint 2013 and Enterprise Search
NDC lightning SharePoint 2013 and Enterprise SearchNDC lightning SharePoint 2013 and Enterprise Search
NDC lightning SharePoint 2013 and Enterprise Search
 
Improve Performance in Fast Search for SharePoint - Comperio
Improve Performance in Fast Search for SharePoint - ComperioImprove Performance in Fast Search for SharePoint - Comperio
Improve Performance in Fast Search for SharePoint - Comperio
 
Search Driven Websites - Comperio
Search Driven Websites - ComperioSearch Driven Websites - Comperio
Search Driven Websites - Comperio
 
Welcome virksomhetssøk og sosial samhandling - Comperio
Welcome virksomhetssøk og sosial samhandling - ComperioWelcome virksomhetssøk og sosial samhandling - Comperio
Welcome virksomhetssøk og sosial samhandling - Comperio
 
Yammer and office 365 roadmap update - Comperio seminar oslo14 May2013
Yammer and office 365 roadmap update - Comperio seminar oslo14 May2013Yammer and office 365 roadmap update - Comperio seminar oslo14 May2013
Yammer and office 365 roadmap update - Comperio seminar oslo14 May2013
 
Information wants to be free - Comperio seminar oslo14may2013
Information wants to be free - Comperio seminar oslo14may2013Information wants to be free - Comperio seminar oslo14may2013
Information wants to be free - Comperio seminar oslo14may2013
 
Fileserver Search Assessment - Comperio
Fileserver Search Assessment - ComperioFileserver Search Assessment - Comperio
Fileserver Search Assessment - Comperio
 
Sökmotorn i SharePoint 2013 - Comperio
Sökmotorn i SharePoint 2013 - ComperioSökmotorn i SharePoint 2013 - Comperio
Sökmotorn i SharePoint 2013 - Comperio
 
Big Data – good news for Enterprise Search
Big Data – good news for Enterprise SearchBig Data – good news for Enterprise Search
Big Data – good news for Enterprise Search
 
Produktivitet 1.0 - Comperio Seminar oktober 2012
Produktivitet 1.0 - Comperio Seminar oktober 2012Produktivitet 1.0 - Comperio Seminar oktober 2012
Produktivitet 1.0 - Comperio Seminar oktober 2012
 
Search solutions for big data and collaboration - Comperio seminar October 2012
Search solutions for big data and collaboration - Comperio seminar October 2012Search solutions for big data and collaboration - Comperio seminar October 2012
Search solutions for big data and collaboration - Comperio seminar October 2012
 
Hvordan lykkes med intern Facebook og Google
Hvordan lykkes med intern Facebook og GoogleHvordan lykkes med intern Facebook og Google
Hvordan lykkes med intern Facebook og Google
 

Último

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Último (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

Search Analytics - Comperio

  • 1. OSLO STOCKHOLM LONDON BOSTON SINGAPORE Search Analytics Comperio - Seminar on Searchdriven Websites and Analytics of Searchlogs Stockholm Digital Days 2013-05-22 Bo Engren
  • 2. Agenda • What is Search(log) Analytics? • Improving Search • Best Practices & Administration • QA
  • 3. Web Analytics vs. Search Analytics The difference between Web Analytics and Search Analytics is that Web shows what the users actually have been doing, Search shows their intent. (and btw Search Analytics isn’t SEO either)
  • 4. The challenges with Search I can’t find what I’m looking for Content is old Duplicates and versions Not maintained Too many choices Language and domain vocabulary Poor user experience etc…
  • 5. The relevancy threshold By raising the relevance with 40%, we can move the search solution from low to high trust.
  • 6. Tuning relevancy - toolboxes
  • 8. Best Practices & Administration
  • 9. Operational steps for good search DEFINE SCOPE IMPLEMENT RELEASE MAINTAIN Understand business needs • Understand what you are trying to achieve • Plan and define goals • Identify good trends, ROI Measure and refine • Monitor and use query information • Mine query logs • Measure effectiveness of search towards a target Output and benefits • Better search • Better results • Enhanced usability • Enhanced revenues Search customer
  • 10. Analyzing search logs – fundamentals When you have defined your business needs Monitor your search logs... ...again and again and again Look for • Specific queries • General queries • Queries with zero results • Filter away junk!
  • 11. Know your search distribution 350 10.0000 0 500 20% 80% Similar searches Unique searches Frequency Query term Can we find patterns in this type of searches? Take good care of your top queries
  • 12. Frequent queries Visualized Search history. Most frequent query terms
  • 13. Unique queries example A lot of product code searches
  • 15. Zero Result Queries 9.95% of today’s queries return no results Create a synonym for the query Select time period
  • 16. Empty result sets How do we fix empty result sets? • Investigate why! – Spelling errors? – Semantics? – UI difficulties? • Correct the underlying causes
  • 18. Top/Frequent queries How do we serve frequent queries best? • Ensure good relevance • Apply best bets • If ambient, present options to narrow results • If specific, make sure user get to the goal
  • 19. Content Search - Refiners • Filters are based on words in documents • Words are used to tag the document with predefined set of Filter names Result Refiners Enables filtering
  • 20. Boosts and Blocks • Boosting is the process of changing the “natural” rank to alter the position of a document within the result set
  • 21. Apply selected Linguistic Features • Automatic language detection • Approximate matching (spell checking) “cort”, “court” • Lemmatization Noun: “car”  “cars” Verb: “break”  “break”, “breaks”, “broke” • • Synonyms “color” = “colour” “car” = “automobile” • Proper Name and Phrasing /Spellcheck “Venus Williams”, “French Open” • Anti-phrasing (Stopwords) “[I want a] Nikon camera” • Character Normalization “Molière -> Moliere” • Tokenization (CJK support) “market-shares” -> “market shares” • Phonetic Search “Eyvind”, “Oyvind” -> “Eyvind” • Automatic spelltuning Based on index contents When implemented properly can drastically improve the usefulness of a search
  • 22. Search statistics – several tools available • Start with the searchlogs: – Use the built in tools – Loggparsers (IIS loggparser etc.) – Webanalytics tools (Google Analytics, Webtrends etc.) – Log management (logstash, kibana) – Big data (Hadoop, pig)
  • 23. Visual searchresults Comperio internal Knowledge Management DB February 2013
  • 24. Statistic analysis – Best Practice • Zero hit results  key to monitor and remove • Analyze the Top queries • Trends over time – group by day/week/month • Separate internal and external searches • Group the queries for better understanding (for example products, documents, persons)
  • 25. Examples of Metrics for Search Analytics – select a few initally Search perspective Measures Definition Metric type Total queries Total number of search queries # Clicks Total number of clicks that goes from search results to final file or page # Satisfied queries Percentage of search results with at least one click % Opportunity queries Percentage of search results with no click % Visits with keyword searches Percentage of web visitors that use search % Visits with guided product search Percentage of web visitors that use guided product search % Visits with browsing searches Percentage of web visitors that use browseing searches e.g. listings % Search result exits Percentage of web visitors that exit the website on the search result page % Searches with zero results Percentage of searches that end up with zero results % Search depth Depth after search result page # Refined searches Number of searches refined with new query text after result view # Result relevancy Relevancy of search results, based on recall/precision test model and test set # Query suggestion use Number of searches performed with suggested queries # Related queries Number of searches with related queries used # Filtered queries Number of searches with query refinement filters # Time to destination Time spent from search to final result Time Result sidebar use Percentage of clicks on sidebar results on result page views % Advanced queries Number of advanced queries performed with boolean or filter operators # Best bets use Percentage of clicks on manual top results when displayed %
  • 26. Improve results of searches - Best Practice Improve similar searches (fat head) • Autocomplete • Best bets Improve uniqe searches (long tail) • Spellchecking • Synonyms • Adjust your content
  • 27. Internal searches – do we understand the context of the user? • Start with the User – Study/test your User Stories. Example: You are going to start a new project. Do you find what you need to get started? – Use Online surverys for deeper insights
  • 28. All search platforms need maintenance • A team that specializes in search and related technologies – Front end search specialists – Search analysts • Examples of Tasks – Sounding board for proposed projects or reported problems – Cataloguing agreed search best practice – Control vocabularies and taxonomies – Monitoring and tuning – In-house training
  • 29. Search Analytics – Summary 1 • Make someone responsible for search - Appoint a Search Manager • Set a search strategy which enables the business strategy and is in line with overall IT-strategy • Make the Business Case • Measure and Monitor Search Queries = Search Analytics • Enable User Feedback • Raise quality of information by adding metadata and doing content lifecycle management • Add metadata - manual, mandatory or automatic?
  • 30. Search Analytics - Summary 2 • Establish processes to deliver feedback to your Stakeholders regarding the search logs – Separate External and Internal sites? • Educate information creators - simple handouts and sit-downs • Apply spelling suggestions, key-matches and auto- complete • What can we do as Editors and what do we need Techies to do? – You can do more than you think!
  • 31. Thanks for listening and time for QA!