May 22nd 2013, Kiev
Enterprise search portals
SharePoint 2013
Alexey Kozhemiakin
or “How to make a cool search”
Who’s speaking to you?
• Solution Architect @epam
• Focusing on search
• SharePoint Search FAST/2010/2013
• Apache Lucene, Solr, Elasticsearch, Oracle Endeca…
• http://powersearching.wordpress.com
Agenda
• Enterprise Search Portal
• Insight into SP2013 Search
• Key changes from SP2010
• A bit of magic – relevancy calculation
• Search governance, useful hints & tips
Key search patterns
• I know what I’m searching for and where to find it
• I know what I’m searching for but don’t know where to find it
• I don’t know what I’m searching for
http://aghy.hu/AghyBlog_EN/Lists/Posts/Post.aspx?ID=199
Enterprise Search Portal
• Demand:
  • Fast-growing enterprises
  • A zoo of internal systems
• Solution:
  • “Google” inside the enterprise
• Quick wins for business:
  • Single point of smart search and information retrieval
  • Less time spent by employees on searching
  • Better internal communication and simpler reuse of content
But after deployment…
• «... Search sucks»
• Out-of-the-box search knows nothing about you
• Typical “but…” objections:
  • «… Microsoft takes care of a decent search algorithm»
  • «… we’re not sure we can do better»
  • «… we don’t need search, everybody knows where the content is»
  • «… make our search like Facebook/Google/Bing» (instead of requirements)
Why it’s hard
• Ambiguous short queries
• Unstructured content that is not optimized for search
• Content users and content creators have different active vocabularies
• Limited resources ($), whereas internet search engines have:
  • Automated and manual testing of search quality (assessors)
  • Continuous improvement
Search architecture in SP2013
Search is a two-phase process
• Matching – all docs with keywords
  • Linguistics: stemming, phonetics
  • Synonyms
• Ranking
  • «Features»
    • TF-IDF, BM25
    • Field weights
    • File type
    • Modification date
    • Popularity
    • …
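To make the ranking phase concrete, here is a minimal sketch of the textbook Okapi BM25 formula in Python. It is not SharePoint's internal implementation; the parameter values k1=1.2 and b=0.75 are commonly used defaults, and the tiny document collection is made up for illustration.

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document against the query terms with Okapi BM25."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: the number of documents in which each term occurs.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue  # matching phase: terms absent from the doc add nothing
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturates and is normalized by document length.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores

# Hypothetical documents, already tokenized.
docs = [
    "sharepoint search ranking model".split(),
    "vacation policy for employees".split(),
    "search search tips for sharepoint search".split(),
]
scores = bm25_scores(["search"], docs)
```

The second document does not contain the query term and scores zero, while the third scores highest because the term occurs there more often.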
Ranking in FAST
• Linear combination of features
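A linear combination of features can be sketched in a few lines of Python. The feature names, values and weights below are hypothetical and only illustrate the idea; they are not FAST's actual rank profile.

```python
def linear_rank(features, weights):
    """Total rank = sum of weight * feature value over all features."""
    return sum(weights.get(name, 0.0) * value for name, value in features.items())

def contributions(features, weights):
    """Per-feature share of the total rank, useful for explaining a score."""
    return {name: weights.get(name, 0.0) * value for name, value in features.items()}

# Hypothetical weights and feature values (assumptions for illustration only).
weights = {"term_match": 2.0, "freshness": 0.5, "static_rank": 1.0, "proximity": 1.5}
docs = {
    "doc_a": {"term_match": 0.8, "freshness": 0.2, "static_rank": 0.5, "proximity": 0.4},
    "doc_b": {"term_match": 0.6, "freshness": 0.9, "static_rank": 0.3, "proximity": 0.1},
}

# Order documents by descending total rank.
ranked = sorted(docs, key=lambda name: linear_rank(docs[name], weights), reverse=True)
```

Because the model is linear, the contribution of each component can be read off directly, which is what makes such a ranking easy to explain and tune by hand.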
Ranking in FAST
• Impact of each component to final rank
[Chart: contribution of each component to the final rank. Series: term:fast, term:search, freshness, static rank, proximity. Categories: 1st, 2nd, 3rd, 4th. Value axis: 0 to 8000.]
Migration FAST->SP2013
Ranking in SP2013
• Default Relevancy Model
• Two neural networks
• Freshness is not included in ranking
• Features
Type            Instance
BM25            BM25
Static          UrlDepth
BucketedStatic  InternalFileType
BucketedStatic  Language
Static          ClickDistance
Static          QueryLogClicks
Static          QueryLogSkips
Static          LastClicks
Static          EventRate
MinSpan - soft  Title
MinSpan - soft  Title
MinSpan - soft  Title
MinSpan - soft  Content
Ranking in SP2013
• Default relevancy model
Explain rank
• /_layouts/15/explainrank.aspx
• rankdetail property
Explain rank
• Manual validation in Excel
Search Governance
1. Search analytics
2. Fine tuning and adaptation
3. Regular testing
4. Security assessment
5. Promotion within company
6. Content optimization and basic SEO
1. Search analytics
• Search analytics
• Search analytics
• Search analytics
• Obey! Use Search analytics
1. Search analytics
• OOTB in SP2013
• Most popular queries
• «No Results/abandoned» queries
• 3rd party tools (Google Analytics, Omniture,
WebTrends)
• Measure search quality (!)
• % click on results
• Which results
• Return after clicks
• Session analysis
• Query segmentation
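The quality measures above can be computed from a query log. The sketch below assumes a hypothetical log format (query text, number of results, rank of the clicked result or None); it does not use any SharePoint API.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class QueryEvent:
    query: str
    result_count: int
    clicked_rank: Optional[int]  # None means the user clicked nothing

def summarize(events: List[QueryEvent]) -> dict:
    """Aggregate basic search quality metrics from logged query events."""
    total = len(events)
    clicks = sum(1 for e in events if e.clicked_rank is not None)
    return {
        # Share of queries that led to a click on some result.
        "click_through_rate": clicks / total if total else 0.0,
        "popular": Counter(e.query for e in events),
        # Queries that returned nothing at all.
        "no_results": Counter(e.query for e in events if e.result_count == 0),
        # Queries that returned results, none of which was clicked.
        "abandoned": Counter(
            e.query for e in events if e.result_count > 0 and e.clicked_rank is None
        ),
    }

# Made-up events for illustration.
events = [
    QueryEvent("vacation policy", 12, 1),
    QueryEvent("vacation policy", 12, None),
    QueryEvent("timesheet", 0, None),
    QueryEvent("expense report", 30, 3),
]
summary = summarize(events)
```

Grouping the same counters by query class rather than by exact query text is one simple way to approach query segmentation.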
Query segmentation
• Analyze and improve not only the top N queries, but whole classes of queries
2. Fine tuning
• Authoritative Pages
• Quick win – content source priority
• Query Rules
• Smart search for users
• Synonyms
• Separate mapping file
• Expansion only
• Term set synonyms do NOT work
• Relevancy models
Authoritative Pages
• Impacts ClickDistance
• ClickDistance and UrlDepth have a high impact on the total score (see explain rank)
• Configured in Central Administration (CA) or via CSOM
Query Rules (Rule + Action)
• The tool to make search smarter
• Interactive feedback to user queries
• Post processing of queries
• Leverage navigational queries
• …
Conditions for Query Rules
• Query Matches Keyword Exactly
• Advanced Query Text Match
• Query Matches Dictionary Exactly
• Query Contains Action Term
• Query More Common in Source
• Result Type Commonly Clicked
Actions for Query Rules
• Create and display a result block
• Change ranked search results
• Best Bets
• XRANK
• Works additive to total rank
• Not explained in rankdetail
• How to choose correct value?
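As an illustration of the XRANK action, the sketch below builds a KQL query string in Python. The `XRANK(cb=...)` form with a constant boost follows the KQL syntax reference; the helper name, the example expressions and the boost value 100 are assumptions chosen for illustration.

```python
def xrank(match_expr: str, rank_expr: str, constant_boost: float) -> str:
    """Build a KQL query that adds constant_boost to the rank of items
    matching rank_expr, without changing which items match match_expr."""
    return f"({match_expr}) XRANK(cb={constant_boost:g}) ({rank_expr})"

# Hypothetical example: promote presentations for a navigational query.
query = xrank("enterprise search", "filetype:pptx", 100)
```

Since the boost is added to the total rank, a sensible value depends on the rank scores the model typically produces for the query, which can be inspected on the explain rank page.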
Templates for Query Rules
• Typical navigational keywords from our portal
• Software, soft, download, install
• How to
• Policy, Blog
• Portal
• Music, Video
• Presentation, Documents, Report
• Training, tutorial
• Book, ebook
• You will have different ones!
Custom Rank Models
• Collect query judgments
• Tune neural network coefficients using machine learning
• Gradient Descent, LambdaRank
• Microsoft.Office.Server.Search.RankerTuning
Custom Rank Models
• Manually modify a new or very simple model (not the default one!)
• A/B testing of weights
• Measure, measure: Precision, NDCG
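Both metrics are easy to compute once query judgments exist. The sketch below uses the common exponential-gain form of NDCG and assumes graded judgments (0 = irrelevant, higher = better) listed in the order the results were returned; the sample judgments are made up.

```python
import math

def precision_at_k(relevances, k):
    """Fraction of the top k results judged relevant (grade > 0)."""
    return sum(1 for r in relevances[:k] if r > 0) / k

def dcg_at_k(relevances, k):
    """Discounted cumulative gain: relevant results count less further down."""
    return sum((2 ** r - 1) / math.log2(i + 2) for i, r in enumerate(relevances[:k]))

def ndcg_at_k(relevances, k):
    """DCG normalized by the DCG of the ideal ordering of the same judgments."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0

# Judgments for one query under two hypothetical weight settings (A/B test).
model_a = [3, 2, 0, 1]
model_b = [0, 1, 2, 3]
```

With these judgments both models have the same precision at 4, but NDCG prefers model A because it places the best results first.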
Custom Rank Models
• Example of simple model – people search
3. Search quality testing
• Why is it needed? It’s your compass.
• «Unit testing»
• Periodic manual testing
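«Unit testing» of search can be as simple as a list of queries with the result each one must return near the top. The sketch below assumes a hypothetical `search` callable that returns result URLs in rank order; in practice it would wrap the real search service, while here it is a stub.

```python
from typing import Callable, Dict, List

def failing_cases(
    search: Callable[[str], List[str]],
    expectations: Dict[str, str],
    top_n: int = 3,
) -> List[str]:
    """Return the queries whose expected URL is missing from the top_n results."""
    failures = []
    for query, expected_url in expectations.items():
        if expected_url not in search(query)[:top_n]:
            failures.append(query)
    return failures

# Stub index standing in for the real search service (assumption).
FAKE_INDEX = {
    "vacation policy": ["/hr/vacation-policy", "/hr/faq"],
    "install vpn": ["/it/news", "/it/old-vpn"],
}

def fake_search(query: str) -> List[str]:
    return FAKE_INDEX.get(query, [])

expectations = {
    "vacation policy": "/hr/vacation-policy",
    "install vpn": "/it/vpn-setup",
}
failures = failing_cases(fake_search, expectations)
```

Running such a suite after every change to query rules, synonyms or rank models shows immediately whether a tweak for one query broke another.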
4. Security «audit»
• Search reveals breaches in security
• Security by obscurity
• Examples of queries:
• «confidential»
• Salaries, performance reviews
• Solution – automatic monitoring of sensitive
queries
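Automatic monitoring can start as a scan of the query log for sensitive terms. The term list and the (user, query) log format below are assumptions for illustration; a real list would come from the security team.

```python
import re

# Hypothetical watch list.
SENSITIVE_TERMS = ["confidential", "salary", "salaries", "performance review"]

def flag_sensitive(queries, terms=SENSITIVE_TERMS):
    """Return the (user, query) pairs whose query mentions a sensitive term."""
    pattern = re.compile("|".join(re.escape(t) for t in terms), re.IGNORECASE)
    return [(user, query) for user, query in queries if pattern.search(query)]

# Made-up log entries.
log = [
    ("alice", "confidential budget 2013"),
    ("bob", "vacation policy"),
    ("carol", "Salaries by grade"),
]
flagged = flag_sensitive(log)
```

The flagged queries are a starting point for checking what those searches actually return, which is where permission gaps show up.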
5. Adoption of content
• Work with departments
• Get their help with monitoring search for their queries
• Guidelines for formatting content
• Basic SEO
• Titles
• Friendly URLs
• Custom meta tags <meta name=…
• Title, description
• Custom tags automatically appear in crawled properties
6. Promotion within company
• Image – «you will find everything here»
• Integrate with other portals
• Offer search as a service
• «Global search» widget
• Badges, gamification
Promotion
• Social Best-bets
Semantic search
• Cannot be solved in general
• Analytics + fine tuning
• See practices above
• NLP – question answering
• Rocket science
• English only
• Part of speech tagging, dependency parsing
• Stanford NLP, OpenNLP, IR
«References»
• Patents - http://goo.gl/20sbR
• Explain Rank page - http://goo.gl/o3ZmN
• How SP2013 relevancy models work - http://goo.gl/arf0P
• MS Enterprise Search approach - http://goo.gl/x8SDO
• Customizing ranking models in SP 2013 - http://goo.gl/lBJAp
Thanks
Skype: Alexey_Kozhemiakin
Email: Alexey.Kozhemiakin@gmail.com
Blog: http://powersearching.wordpress.com
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

Spcua 2013 Alexey Kozhemiakin Enterprise Search

  • 1. May 22nd 2013, Kiev Enterprise search portals SharePoint 2013 Alexey Kozhemiakin
  • 2. May 22nd 2013, Kiev or “How to make a cool search” Alexey Kozhemiakin
  • 3. Who’s speaking to you? • Solution Architect @epam • Focusing on search • SharePoint Search FAST/2010/2013 • Apache Lucene, Solr, elasticsearch, Oracle Endeca… • http://powersearching.wordpress.com 3
  • 4. Agenda • Enterprise Search Portal • Insight into SP2013 Search • Key changes from SP2010 • A bit of magic – relevancy calculation • Search governance, useful hints & tips 4
  • 5. Key search patterns • I know what I’m searching for and where to find it • I know what I’m searching for but don’t know where to find it • I don’t know what I’m searching for 5 http://aghy.hu/AghyBlog_EN/Lists/Posts/Post.aspx?ID=199
  • 6. Enterprise Search Portal • Demand: • Fast-growing enterprises • Zoo of internal systems • Solution: • “Google” inside the enterprise • Quick wins for business: • Single point of smart search and information retrieval • Reduced search time per employee • Better internal communication and simplified reuse of content 6
  • 7. But after deployment… • «Search sucks» • Out-of-the-box search knows nothing about you • Typical «But…» responses: • «… Microsoft takes care of a decent search algorithm» • «… we’re not sure we can do better» • «… we don’t need search, everybody knows where the content is» • «… make our search like Facebook/Google/Bing» (instead of requirements) 7
  • 8. Why it’s hard • Ambiguous short queries • Unstructured, unoptimized content • Different active vocabularies of content users and content creators • Limited resources ($), while internet search has: • Automated and manual testing of search quality (assessors) • Continuous improvement 8
  • 10. Search is a two-phase process • Matching – all docs with keywords • Linguistics: stemming, phonetics • Synonyms • Ranking • «Features» • TF-IDF, BM25 • Field weights • File type • Modification date • Popularity • … 10
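The two phases on slide 10 can be sketched in a few lines. This is a minimal illustration under stated assumptions, not SharePoint's implementation: the whitespace tokenizer, the requirement that every query term matches, and the parameter values `k1=1.2` and `b=0.75` are choices made for the example. The scoring is the standard Okapi BM25 formula with a non-negative IDF variant.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization, no stemming or synonyms.
    return text.lower().split()


def bm25_search(query, docs, k1=1.2, b=0.75):
    """Return (doc_index, score) pairs, best match first."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n_docs
    terms = tokenize(query)
    # Document frequency: in how many documents each query term occurs.
    df = {t: sum(1 for doc in tokenized if t in doc) for t in terms}

    results = []
    for idx, doc in enumerate(tokenized):
        tf = Counter(doc)
        # Phase 1, matching: keep only documents containing every query term.
        if not all(tf[t] > 0 for t in terms):
            continue
        # Phase 2, ranking: sum the BM25 contribution of each query term.
        score = 0.0
        for t in terms:
            idf = math.log(1 + (n_docs - df[t] + 0.5) / (df[t] + 0.5))
            length_norm = 1 - b + b * len(doc) / avg_len
            score += idf * tf[t] * (k1 + 1) / (tf[t] + k1 * length_norm)
        results.append((idx, score))
    return sorted(results, key=lambda r: r[1], reverse=True)


docs = [
    "fast search server",
    "search search portal for enterprise search",
    "cooking recipes",
]
ranked = bm25_search("search", docs)
```

Here the third document is dropped in the matching phase, and the second document ranks first because the term occurs three times in it.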
  • 11. Ranking in FAST • Linear combination of features 11
  • 12. Ranking in FAST • Impact of each component on the final rank 12 [Chart: rank score (axis 0 to 8000) for the 1st to 4th results, broken down by component: term:fast, term:search, freshness, static rank, proximity]
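A linear rank model of the kind described on slides 11 and 12 is a weighted sum of feature values, which makes the contribution of each component easy to inspect. The sketch below is illustrative only: the feature names and weights are hypothetical, not the values FAST uses.

```python
def linear_rank(features, weights):
    """Return (total_score, per_feature_contributions) for one document.

    Features without a weight contribute nothing to the score.
    """
    contributions = {
        name: weights.get(name, 0.0) * value
        for name, value in features.items()
    }
    return sum(contributions.values()), contributions


# Hypothetical feature values and weights for a single result.
features = {"bm25": 3.0, "freshness": 4.0}
weights = {"bm25": 2.0, "freshness": 0.5}
total, parts = linear_rank(features, weights)
```

Comparing `parts` across the top results gives the same per-component breakdown that the chart on slide 12 shows.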
  • 15. Ranking in SP2013 • Default Relevancy Model • Two neural networks • Freshness is not included in ranking • Features (Type / Instance): BM25 / BM25; Static / UrlDepth; BucketedStatic / InternalFileType; BucketedStatic / Language; Static / ClickDistance; Static / QueryLogClicks; Static / QueryLogSkips; Static / LastClicks; Static / EventRate; MinSpan - soft / Title; MinSpan - soft / Title; MinSpan - soft / Title; MinSpan - soft / Content 15
  • 16. Ranking in SP2013 16 • Default relevancy model
  • 18. Explain rank • Manual validation in Excel 18
  • 19. 19
  • 20. Search Governance 1. Search analytics 2. Fine tuning and adaptation 3. Regular testing 4. Security assessment 5. Promotion within the company 6. Content optimization and basic SEO 20
  • 21. 1. Search analytics • Search analytics • Search analytics • Search analytics • Obey! Use Search analytics 21
  • 22. 1. Search analytics • OOTB in SP2013 • Most popular queries • «No Results/abandoned» queries • 3rd party tools (Google Analytics, Omniture, WebTrends) • Measure search quality (!) • % click on results • Which results • Return after clicks • Session analysis • Query segmentation 22
  • 23. Query segmentation • Analyze and improve not only the top N queries, but classes of queries 23
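Query segmentation and the quality measures from slide 22 can be combined into one small report. The sketch below assumes a hypothetical log format of `(query, result_count, clicked)` tuples and a keyword-based segment map; the segment names and keywords are illustrative and would differ per portal.

```python
from collections import defaultdict

# Hypothetical segments; the first matching segment wins.
SEGMENT_KEYWORDS = {
    "how-to": ("how to", "tutorial", "training"),
    "software": ("download", "install", "software"),
    "policy": ("policy",),
}


def segment_of(query):
    """Assign a query to the first segment whose keyword it contains."""
    q = query.lower()
    for segment, keywords in SEGMENT_KEYWORDS.items():
        if any(k in q for k in keywords):
            return segment
    return "other"


def segment_report(log):
    """Aggregate click rate and no-result rate per query segment."""
    stats = defaultdict(lambda: {"queries": 0, "clicked": 0, "no_results": 0})
    for query, result_count, clicked in log:
        s = stats[segment_of(query)]
        s["queries"] += 1
        s["clicked"] += int(clicked)
        s["no_results"] += int(result_count == 0)
    return {
        seg: {
            "queries": s["queries"],
            "click_rate": s["clicked"] / s["queries"],
            "no_result_rate": s["no_results"] / s["queries"],
        }
        for seg, s in stats.items()
    }


log = [
    ("how to install office", 10, True),
    ("vacation policy", 0, False),
    ("how to book a room", 5, False),
]
report = segment_report(log)
```

A segment with a low click rate or a high no-result rate is a candidate for a query rule, a synonym, or content work, even when none of its individual queries reaches the top N.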
  • 24. 2. Fine tuning • Authoritative Pages • Quick win – content source priority • Query Rules • Smart search for users • Synonyms • Separate mapping file • Expansion only • Term set synonyms do NOT work • Relevancy models 24
  • 25. Authoritative Pages • Impact ClickDistance • ClickDistance and UrlDepth have a high impact on the total score (see explain rank) • Configured in Central Administration (CA) or via CSOM 25
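To see why authoritative pages matter, it helps to treat click distance as the number of link hops from the nearest authoritative page. The sketch below computes that with a breadth-first search over a hypothetical link graph; it illustrates the concept and is not how SharePoint computes the feature internally.

```python
from collections import deque


def click_distances(links, authoritative):
    """Breadth-first search from all authoritative pages at once.

    links: dict mapping a page to the pages it links to.
    Returns the minimum number of clicks needed to reach each reachable page.
    """
    dist = {page: 0 for page in authoritative}
    queue = deque(authoritative)
    while queue:
        page = queue.popleft()
        for target in links.get(page, ()):
            if target not in dist:
                dist[target] = dist[page] + 1
                queue.append(target)
    return dist


# Hypothetical portal structure.
links = {"home": ["hr", "it"], "hr": ["policy"], "it": []}
before = click_distances(links, ["home"])
after = click_distances(links, ["home", "hr"])
```

Marking `hr` as authoritative moves `policy` from two clicks to one, which is the kind of quick win slide 24 refers to.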
  • 26. Query Rules (Rule + Action) • The tool to make search smarter • Interactive feedback to user queries • Post processing of queries • Leverage navigational queries • … 26
  • 27. Condition for Query Rules • Query Matches Keyword Exactly • Advanced Query Text Match • Query Matches Dictionary Exactly • Query Contains Action Term • Query More Common in Source • Result Type Commonly Clicked 27
  • 28. Actions for Query Rules • Create and display a result block • Change ranked search results • Best Bets • XRANK • Works additively on the total rank • Not explained in rankdetail • How to choose the correct value? 28
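In KQL, XRANK takes a match expression on the left and a rank expression on the right, and its `cb` parameter is a constant boost added to the score of items that also match the rank expression. The helper below only builds such a query string; the function name and the example expressions are hypothetical, and the boost value of 100 is an arbitrary illustration rather than a recommendation.

```python
def xrank_query(match_expr, boost_expr, cb):
    """Build a KQL query that boosts matches of boost_expr by a constant."""
    return f"({match_expr}) XRANK(cb={cb}) ({boost_expr})"


# Hypothetical rule: prefer presentations for "training" queries.
kql = xrank_query("training", "FileType:pptx", 100)
```

Because the boost is additive, one way to pick a value is to compare it with the score range of typical results on the explain rank page.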
  • 29. Templates for QueryRules • Typical navigational keywords from our portal • Software, soft, download, install • How to • Policy, Blog • Portal • Music, Video • Presentation, Documents, Report • Training, tutorial • Book, ebook • You will have different ones! 29
  • 30. Custom Rank Models • Collect query judgments • Tune neural network coefficients using machine learning • Gradient Descent, Lambda Rank • Microsoft.Office.Server.Search.RankerTuning 30
  • 31. Custom Rank Models • Manually modify a new or very simple model (not the default one!) • A/B testing of weights • Measure, measure: Precision, NDCG 31
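Both metrics named on slide 31 are simple to compute once query judgments exist. In the sketch below each result list is represented by the graded relevance of its items in ranked order (0 means irrelevant). Two assumptions: DCG uses the linear-gain form, and the ideal ordering is taken from the judged list itself rather than from all judged documents for the query.

```python
import math


def precision_at_k(relevances, k):
    """Share of the top k results that are relevant (grade > 0)."""
    return sum(1 for r in relevances[:k] if r > 0) / k


def dcg_at_k(relevances, k):
    """Discounted cumulative gain with linear gain and log2 discount."""
    return sum(
        rel / math.log2(position + 2)
        for position, rel in enumerate(relevances[:k])
    )


def ndcg_at_k(relevances, k):
    """DCG normalized by the DCG of the best possible ordering."""
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0


# Hypothetical judgments for the same query under two rank models.
model_a = [3, 2, 1, 0]
model_b = [0, 1, 2, 3]
better = "A" if ndcg_at_k(model_a, 4) > ndcg_at_k(model_b, 4) else "B"
```

Averaging NDCG over the whole judged query set for each model gives a single number to compare in an A/B test of weights.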
  • 32. Custom Rank Models • Example of simple model – people search 32
  • 33. 3. Search quality testing • Why is it needed? It’s your compass. • «Unit testing» • Periodic manual testing 33
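Search «unit testing» can be as simple as a golden set: queries paired with a document that must appear near the top. The sketch below assumes a hypothetical `search` callable that returns a ranked list of URLs; in practice it would wrap whatever query API the portal exposes.

```python
def run_golden_tests(search, golden, top_n=3):
    """Return (query, expected_url, actual_top_results) for each failure."""
    failures = []
    for query, expected_url in golden.items():
        results = search(query)[:top_n]
        if expected_url not in results:
            failures.append((query, expected_url, results))
    return failures


# Stand-in for a real search call, for illustration only.
def fake_search(query):
    index = {
        "vacation policy": ["/hr/policy", "/news"],
        "vpn": ["/news", "/blog"],
    }
    return index.get(query, [])


golden = {"vacation policy": "/hr/policy", "vpn": "/it/vpn"}
failures = run_golden_tests(fake_search, golden)
```

Running the golden set after every change to query rules, synonyms, or rank models shows whether a fix for one query broke another.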
  • 34. 4. Security «audit» • Search reveals breaches in security • Security by obscurity • Examples of queries: • «confidential» • Salaries, performance reviews • Solution – automatic monitoring of sensitive queries 34
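One reading of "automatic monitoring of sensitive queries" is to run a fixed list of sensitive queries on a schedule as a low-privilege test account and report anything that comes back. The sketch below follows that reading; `search_as_user` is a hypothetical callable, and the query list simply reuses the examples from the slide.

```python
SENSITIVE_QUERIES = ["confidential", "salaries", "performance reviews"]


def audit_sensitive_queries(search_as_user, queries=SENSITIVE_QUERIES):
    """Return {query: hits} for every sensitive query that returns results.

    search_as_user is assumed to run the query with the permissions of a
    regular employee, so any hit is a document that account can see.
    """
    findings = {}
    for query in queries:
        hits = search_as_user(query)
        if hits:
            findings[query] = list(hits)
    return findings


# Stand-in for a real search call, for illustration only.
def fake_search_as_user(query):
    return ["/hr/salaries.xlsx"] if query == "salaries" else []


findings = audit_sensitive_queries(fake_search_as_user)
```

An empty result means the test account found nothing; any entry points at content whose permissions should be reviewed.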
  • 35. 5. Adoption of content • Work with departments • Get help with search monitoring of their queries • Guideline to format content • Basic SEO • Titles • Friendly URLs • Custom meta tags <meta name=… • Title, description • Custom meta tags automatically appear in crawled properties 35
  • 36. 6. Promotion within company • Image – «you will find everything here» • Integrate with other portals • Propose Search as a service • Widget «Global search» • Badges, gamification 36
  • 38. Semantic search • Cannot be solved in general • Analytics + fine tuning • See practices above • NLP – question answering • Rocket science • English only • Part of speech tagging, dependency parsing • Stanford NLP, Open NLP, IR 38
  • 39. «References» • Patents - http://goo.gl/20sbR • Explain Rank page - http://goo.gl/o3ZmN • How SP2013 relevancy models work - http://goo.gl/arf0P • MS Enterprise Search approach - http://goo.gl/x8SDO • Customizing ranking models in SP 2013 - http://goo.gl/lBJAp 39
  • 40. May 22nd 2013, Kiev Thanks Skype: Alexey_Kozhemiakin Email: Alexey.Kozhemiakin@gmail.com Blog: http://powersearching.wordpress.com 40