SlideShare a Scribd company logo
1 of 12
Apache Solr vs Oracle Endeca
07-01-2015
Pedro Melo Pereira
1 Key concepts
– Enterprise search platform
– Faceted search
2 Projects overview
3 Apache Solr
4 Oracle Endeca
5 Feature comparison
6 Conclusion
2
Summary
• Enterprise search platform
The practice of identifying and enabling specific content across the enterprise to be indexed, searched
and displayed to authorized users.
3
1. Key Concepts
1 Collection
2 Indexing
3 Query
Parser
4 Query
Engine
5 Post
Processor
6 Formatter
Content
Indexing
Query
Processing
1. Crawls directories and
websites, extracts content
from databases and other
repositories. Arranges for
content to be transferred to it
on a regular basis so it can
notify the search engine that
new information is available
2. Creates a searchable index
from all the content, often with
some value added processing
such as metadata extraction
and auto-summarization
(groups information into logical
categories)
3. Accepts searcher queries and
encodes them for optimal use
4. Passes query over index and
finds documents matching
search criteria
5. Sorts documents and applies
logic to the results such as
categorization, clustering and
recommendations
6. Streams out and formats
results
How do they work
• Faceted search
– Its the dynamic clustering of items or search results into categories that let users drill into search
results by any value in any field. Each facet displayed shows the number of hits that match that
category. Users can “drill down” by applying specific constraints to the search results. Also called
faceted browsing, faceted navigation, guided navigation and parametric search.
The example started out with all digital cameras, then the user selected the constraints “$400-$500”
and “SLR” from the Price and Digital camera type facets.
4
1. Key Concepts
• Faceted search benefits
– Superior feedback: Users can see at a glance a summary of the search results and how those results
break down by different criteria
– No surprises or dead ends: Users know how many results match before they click. Values with zero
counts are normally removed to reduce visual noise and eliminate the possibility of a user accidentaly
selecting a constraint that would lead to no results
– No selection hierarchy is imposed: Users are generally free to add or remove constraints in any order
5
1. Key Concepts
Apache Solr Oracle Endeca
An open source community supported
tool that allows IT to implement a
faceted search capability based on text
queries to an index of your data model
(e.g. products)
A mature product that provides all the
GUI based tools needed to allow IT and
business to quickly deploy search and
navigation built on queries to text and
object based data model.
More extensible Faster time to market
Faceted search – text search based Guided navigation – data model based
Limited tools Robust integrated tool set
6
2. Projects overview
Solr is a highly popular open source enterprise search platform from Apache. It uses the Lucene Java search
library at its core for full-text indexing and search, it has REST-like HTTP/XML and JSON apis that make it
usable from most programming languages.
Apache Lucene and Apache Solr projects were merged in 2010.
Strengths
• Free
• More powerful and extensible (e.g. freedom to build custom ranking algorithms)
• Larger adoption by the industry
• Larger community / modules / documentation
• Based on industry proven modules
Weaknesses
• No out of the box GUI for business users. Has to be implemented by IT
• No reporting
• It’s considered a framework not a product
7
3. Apache Solr
“Oracle had struggled to develop a strategy for enterprise search that would define it as a Leader. To do this, it has repurposed
Oracle Secure Enterprise Search as a tool that informs all its applications.
The acquisition of Endeca catapults Oracle forward in terms of search facility, though, at Oracle, Endeca is more prominent as
a means of improving business intelligence than as a search product.”
Strengths – Gartner report 2013/05
• Oracle offers strong flexibility for the design of conversational search capabilities to reduce the ambiguity
of results
• Oracle has very strong experience in e-commerce use cases
• Oracle has invested particularly strongly in the searching and analysing of structured data for hybrid
structured / unstructured use cases
Weaknesses
• Oracle has changed the model of pricing by data record to a price by processor (Oracle’s long standing
model). Clients indicate that they are often dissatisfied with this new model.
• Oracle is positioning Endeca as a search technology in the e-commerce arena, which might
weaken its development as a stand-alone enterprise search engine.
8
4. Oracle Endeca
Feature Apache Solr Oracle Endeca
Data modeling XML editing GUI tool set that supports configuration
and joining data from multiple sources
Index inspection Velocity based application that
supports search
Robust reference application to inspect
data and explore features
Business users n/a GUI based business suite to manage
configurations
Merchandising n/a GUI to manage merchandising rules
Reporting n/a Out of the box reports for search,
navigation and merchandising
Relevance ranking Extend a class to create what you
want
Limited to adjusting modules
XQuery n/a Xquery based ad-hoc querying with XML
support
9
5. Feature comparison
Feature Apache Solr Oracle Endeca
Aggregating records n/a Rollup records based on a property to
support variants
Hierarquical dimensions n/a Possible to define hierarchies for ranges
Internationalization Out of the box only supports
English. Has to use external
modules to support it
Licensed support for multiple languages
Clustering Manually configured by IT by using
external modules
Automatic organization of search results
into sets that share attributes
Scalability Based on Apache Zookeeper. Easy
to scale up. More powerful
Linear scalability out of the box. Easier
to manage
10
5. Feature comparison
Apache Solr
Strengths
• Fully integrated with Lucene (same project, different
modules).
• More freedom to customize and adapt to business
needs.
• More powerful api.
• Larger adoption / community.
Weaknesses
• No out of the box features for business users.
• More time to market for IT to implement features
(e.g. reporting, business Backoffice).
11
6. Conclusion
Oracle Endeca
Strengths
• Aligned with Oracle’s long-term goals to make it the
e-commerce reference for enterprise search.
• Out of the box features for business users
(backoffice).
Weaknesses
• Separate index. No integration with Lucene.
• Api more constrained. Possibly more difficult to
integrate to diverse business needs.
• Smaller adoption / community.
Q&A

More Related Content

What's hot

Transitioning from infomaker to bi publisher ppt
Transitioning from infomaker to bi publisher pptTransitioning from infomaker to bi publisher ppt
Transitioning from infomaker to bi publisher ppt
p6academy
 
Al kannan obiee implementation with ebs
Al kannan   obiee implementation with ebsAl kannan   obiee implementation with ebs
Al kannan obiee implementation with ebs
ravihrms
 

What's hot (20)

ATG - Commerce Service Center (CSC)
ATG - Commerce Service Center (CSC)ATG - Commerce Service Center (CSC)
ATG - Commerce Service Center (CSC)
 
Oracle Application Extensions for Oracle Endeca - for Application DBA's
Oracle Application Extensions for Oracle Endeca - for Application DBA'sOracle Application Extensions for Oracle Endeca - for Application DBA's
Oracle Application Extensions for Oracle Endeca - for Application DBA's
 
ATG Product Modules [INTERESTING VIEW]
ATG Product Modules [INTERESTING VIEW]ATG Product Modules [INTERESTING VIEW]
ATG Product Modules [INTERESTING VIEW]
 
Oracle bi ee architecture
Oracle bi ee architectureOracle bi ee architecture
Oracle bi ee architecture
 
Introduction to oracle bi 12c
Introduction to oracle bi 12cIntroduction to oracle bi 12c
Introduction to oracle bi 12c
 
OBIEE ARCHITECTURE.ppt
OBIEE ARCHITECTURE.pptOBIEE ARCHITECTURE.ppt
OBIEE ARCHITECTURE.ppt
 
Oracle OBIEE Data integration
Oracle OBIEE Data integrationOracle OBIEE Data integration
Oracle OBIEE Data integration
 
Upgrading To OBIEE 12C - Key Things Your Need To Know About
Upgrading To OBIEE 12C - Key Things Your Need To Know AboutUpgrading To OBIEE 12C - Key Things Your Need To Know About
Upgrading To OBIEE 12C - Key Things Your Need To Know About
 
Oracle Cloud ERP Report and Analytics | What are Cloud ERP reporting Options ...
Oracle Cloud ERP Report and Analytics | What are Cloud ERP reporting Options ...Oracle Cloud ERP Report and Analytics | What are Cloud ERP reporting Options ...
Oracle Cloud ERP Report and Analytics | What are Cloud ERP reporting Options ...
 
Oracle business intelligence overview
Oracle business intelligence overviewOracle business intelligence overview
Oracle business intelligence overview
 
Building Oracle BIEE (OBIEE) Reports, Dashboards
Building Oracle BIEE (OBIEE) Reports, DashboardsBuilding Oracle BIEE (OBIEE) Reports, Dashboards
Building Oracle BIEE (OBIEE) Reports, Dashboards
 
Oracle Business Intelligence Enterprise Edition
Oracle Business Intelligence Enterprise EditionOracle Business Intelligence Enterprise Edition
Oracle Business Intelligence Enterprise Edition
 
OBIEE11g Architecture & Internals : Collaborate'11, Orlando 2011
OBIEE11g Architecture & Internals : Collaborate'11, Orlando 2011OBIEE11g Architecture & Internals : Collaborate'11, Orlando 2011
OBIEE11g Architecture & Internals : Collaborate'11, Orlando 2011
 
Planning your move to the cloud: SaaS Enablement and User Experience (Oracle ...
Planning your move to the cloud: SaaS Enablement and User Experience (Oracle ...Planning your move to the cloud: SaaS Enablement and User Experience (Oracle ...
Planning your move to the cloud: SaaS Enablement and User Experience (Oracle ...
 
Obiee introductionbuildingreports
Obiee introductionbuildingreportsObiee introductionbuildingreports
Obiee introductionbuildingreports
 
Transitioning from infomaker to bi publisher ppt
Transitioning from infomaker to bi publisher pptTransitioning from infomaker to bi publisher ppt
Transitioning from infomaker to bi publisher ppt
 
Rest and Hateoas APIs
Rest and Hateoas APIsRest and Hateoas APIs
Rest and Hateoas APIs
 
OBIEE - Introduction & building reports
OBIEE - Introduction & building reportsOBIEE - Introduction & building reports
OBIEE - Introduction & building reports
 
Metadata Creation In OBIEE
Metadata Creation In OBIEEMetadata Creation In OBIEE
Metadata Creation In OBIEE
 
Al kannan obiee implementation with ebs
Al kannan   obiee implementation with ebsAl kannan   obiee implementation with ebs
Al kannan obiee implementation with ebs
 

Viewers also liked

Systems Migration
Systems MigrationSystems Migration
Systems Migration
richchihlee
 
Big data and enterprise search trends 120827nn
Big data and enterprise search trends 120827nnBig data and enterprise search trends 120827nn
Big data and enterprise search trends 120827nn
Cathy McKnight
 

Viewers also liked (13)

Endeca vs solr
Endeca vs solrEndeca vs solr
Endeca vs solr
 
Systems Migration
Systems MigrationSystems Migration
Systems Migration
 
Endeca: Developing A Best Practice Search Experience
Endeca: Developing A Best Practice Search ExperienceEndeca: Developing A Best Practice Search Experience
Endeca: Developing A Best Practice Search Experience
 
Big data and enterprise search trends 120827nn
Big data and enterprise search trends 120827nnBig data and enterprise search trends 120827nn
Big data and enterprise search trends 120827nn
 
Solr: 4 big features
Solr: 4 big featuresSolr: 4 big features
Solr: 4 big features
 
EBS-endeca-technical-considerations
EBS-endeca-technical-considerationsEBS-endeca-technical-considerations
EBS-endeca-technical-considerations
 
Facettensuche mit Lucene und Solr
Facettensuche mit Lucene und SolrFacettensuche mit Lucene und Solr
Facettensuche mit Lucene und Solr
 
Faceted Search and Solr
Faceted Search and SolrFaceted Search and Solr
Faceted Search and Solr
 
The Enterprise Search Market in a Nutshell
The Enterprise Search Market in a NutshellThe Enterprise Search Market in a Nutshell
The Enterprise Search Market in a Nutshell
 
Oracle Commerce Using ATG & Endeca - Do It Yourself Series
Oracle Commerce Using ATG & Endeca - Do It Yourself SeriesOracle Commerce Using ATG & Endeca - Do It Yourself Series
Oracle Commerce Using ATG & Endeca - Do It Yourself Series
 
Q4 2016 investor deck
Q4 2016 investor deckQ4 2016 investor deck
Q4 2016 investor deck
 
Solr facets and custom indices
Solr facets and custom indicesSolr facets and custom indices
Solr facets and custom indices
 
Titan: Big Graph Data with Cassandra
Titan: Big Graph Data with CassandraTitan: Big Graph Data with Cassandra
Titan: Big Graph Data with Cassandra
 

Similar to Apache Solr vs Oracle Endeca

Introducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinarIntroducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinar
Lucidworks (Archived)
 
Effective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCMEffective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCM
Fishbowl Solutions
 

Similar to Apache Solr vs Oracle Endeca (20)

Introducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinarIntroducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinar
 
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recallICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
 
Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...
Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...
Jeroen Kleinhoven (Treparel), Turn Big Content into Business Insights - Data ...
 
Effective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCMEffective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCM
 
Webinar: Fusion 3.1 - What's New
Webinar: Fusion 3.1 - What's NewWebinar: Fusion 3.1 - What's New
Webinar: Fusion 3.1 - What's New
 
Oracle analytics cloud overview feb 2017
Oracle analytics cloud overview   feb 2017Oracle analytics cloud overview   feb 2017
Oracle analytics cloud overview feb 2017
 
Search for All with Elastic Enterprise Search
Search for All with Elastic Enterprise Search Search for All with Elastic Enterprise Search
Search for All with Elastic Enterprise Search
 
Webinar: Personalized Retail Search & Recommendations with Fusion
Webinar: Personalized Retail Search & Recommendations with FusionWebinar: Personalized Retail Search & Recommendations with Fusion
Webinar: Personalized Retail Search & Recommendations with Fusion
 
Nida event oracle business analytics 1 sep2016
Nida event   oracle business analytics 1 sep2016Nida event   oracle business analytics 1 sep2016
Nida event oracle business analytics 1 sep2016
 
What the #$* is a Business Catalog and why you need it
What the #$* is a Business Catalog and why you need it What the #$* is a Business Catalog and why you need it
What the #$* is a Business Catalog and why you need it
 
Webinar: Site Search in an Hour with Fusion
Webinar: Site Search in an Hour with FusionWebinar: Site Search in an Hour with Fusion
Webinar: Site Search in an Hour with Fusion
 
PatSeer Overview
PatSeer OverviewPatSeer Overview
PatSeer Overview
 
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
 
Getting started-oracle-analytics-cloud
Getting started-oracle-analytics-cloudGetting started-oracle-analytics-cloud
Getting started-oracle-analytics-cloud
 
Solr 101
Solr 101Solr 101
Solr 101
 
Delivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphsDelivering a Linked Data warehouse and realising the power of graphs
Delivering a Linked Data warehouse and realising the power of graphs
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
 
Liwp consider opensource2010
Liwp consider opensource2010Liwp consider opensource2010
Liwp consider opensource2010
 
Ultra search
Ultra searchUltra search
Ultra search
 

Recently uploaded

%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
masabamasaba
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
shinachiaurasa2
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
masabamasaba
 

Recently uploaded (20)

%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
WSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go PlatformlessWSO2CON2024 - It's time to go Platformless
WSO2CON2024 - It's time to go Platformless
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
 
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 

Apache Solr vs Oracle Endeca

  • 1. Apache Solr vs Oracle Endeca 07-01-2015 Pedro Melo Pereira
  • 2. 1 Key concepts – Enterprise search platform – Faceted search 2 Projects overview 3 Apache Solr 4 Oracle Endeca 5 Feature comparison 6 Conclusion 2 Summary
  • 3. • Enterprise search platform The practice of identifying and enabling specific content across the enterprise to be indexed, searched and displayed to authorized users. 3 1. Key Concepts 1 Collection 2 Indexing 3 Query Parser 4 Query Engine 5 Post Processor 6 Formatter Content Indexing Query Processing 1. Crawls directories and websites, extracts content from databases and other repositories. Arranges for content to be transferred to it on a regular basis so it can notify the search engine that new information is available 2. Creates a searchable index from all the content, often with some value added processing such as metadata extraction and auto-summarization (groups information into logical categories) 3. Accepts searcher queries and encodes them for optimal use 4. Passes query over index and finds documents matching search criteria 5. Sorts documents and applies logic to the results such as categorization, clustering and recommendations 6. Streams out and formats results How do they work
  • 4. • Faceted search – Its the dynamic clustering of items or search results into categories that let users drill into search results by any value in any field. Each facet displayed shows the number of hits that match that category. Users can “drill down” by applying specific constraints to the search results. Also called faceted browsing, faceted navigation, guided navigation and parametric search. The example started out with all digital cameras, then the user selected the constraints “$400-$500” and “SLR” from the Price and Digital camera type facets. 4 1. Key Concepts
  • 5. • Faceted search benefits – Superior feedback: Users can see at a glance a summary of the search results and how those results break down by different criteria – No surprises or dead ends: Users know how many results match before they click. Values with zero counts are normally removed to reduce visual noise and eliminate the possibility of a user accidentaly selecting a constraint that would lead to no results – No selection hierarchy is imposed: Users are generally free to add or remove constraints in any order 5 1. Key Concepts
  • 6. Apache Solr Oracle Endeca An open source community supported tool that allows IT to implement a faceted search capability based on text queries to an index of your data model (e.g. products) A mature product that provides all the GUI based tools needed to allow IT and business to quickly deploy search and navigation built on queries to text and object based data model. More extensible Faster time to market Faceted search – text search based Guided navigation – data model based Limited tools Robust integrated tool set 6 2. Projects overview
  • 7. Solr is a highly popular open source enterprise search platform from Apache. It uses the Lucene Java search library at its core for full-text indexing and search, it has REST-like HTTP/XML and JSON apis that make it usable from most programming languages. Apache Lucene and Apache Solr projects were merged in 2010. Strengths • Free • More powerful and extensible (e.g. freedom to build custom ranking algorithms) • Larger adoption by the industry • Larger community / modules / documentation • Based on industry proven modules Weaknesses • No out of the box GUI for business users. Has to be implemented by IT • No reporting • It’s considered a framework not a product 7 3. Apache Solr
  • 8. “Oracle had struggled to develop a strategy for enterprise search that would define it as a Leader. To do this, it has repurposed Oracle Secure Enterprise Search as a tool that informs all its applications. The acquisition of Endeca catapults Oracle forward in terms of search facility, though, at Oracle, Endeca is more prominent as a means of improving business intelligence than as a search product.” Strengths – Gartner report 2013/05 • Oracle offers strong flexibility for the design of conversational search capabilities to reduce the ambiguity of results • Oracle has very strong experience in e-commerce use cases • Oracle has invested particularly strongly in the searching and analysing of structured data for hybrid structured / unstructured use cases Weaknesses • Oracle has changed the model of pricing by data record to a price by processor (Oracle’s long standing model). Clients indicate that they are often dissatisfied with this new model. • Oracle is positioning Endeca as a search technology in the e-commerce arena, which might weaken its development as a stand-alone enterprise search engine. 8 4. Oracle Endeca
  • 9. Feature Apache Solr Oracle Endeca Data modeling XML editing GUI tool set that supports configuration and joining data from multiple sources Index inspection Velocity based application that supports search Robust reference application to inspect data and explore features Business users n/a GUI based business suite to manage configurations Merchandising n/a GUI to manage merchandising rules Reporting n/a Out of the box reports for search, navigation and merchandising Relevance ranking Extend a class to create what you want Limited to adjusting modules XQuery n/a Xquery based ad-hoc querying with XML support 9 5. Feature comparison
  • 10. Feature Apache Solr Oracle Endeca Aggregating records n/a Rollup records based on a property to support variants Hierarquical dimensions n/a Possible to define hierarchies for ranges Internationalization Out of the box only supports English. Has to use external modules to support it Licensed support for multiple languages Clustering Manually configured by IT by using external modules Automatic organization of search results into sets that share attributes Scalability Based on Apache Zookeeper. Easy to scale up. More powerful Linear scalability out of the box. Easier to manage 10 5. Feature comparison
  • 11. Apache Solr Strengths • Fully integrated with Lucene (same project, different modules). • More freedom to customize and adapt to business needs. • More powerful api. • Larger adoption / community. Weaknesses • No out of the box features for business users. • More time to market for IT to implement features (e.g. reporting, business Backoffice). 11 6. Conclusion Oracle Endeca Strengths • Aligned with Oracle’s long-term goals to make it the e-commerce reference for enterprise search. • Out of the box features for business users (backoffice). Weaknesses • Separate index. No integration with Lucene. • Api more constrained. Possibly more difficult to integrate to diverse business needs. • Smaller adoption / community.
  • 12. Q&A