SlideShare uma empresa Scribd logo
1 de 27
Baixar para ler offline
Replacing	
  GSA	
  with	
  Lucidworks	
  
Fusion	
  	
  
Evan	
  Sayer	
  
Senior	
  Search	
  Engineer	
  
Lucidworks	
  
	
  
Guy	
  Sperry	
  
Enterprise	
  Content	
  Management	
  &	
  Big	
  Data	
  Architect	
  	
  
County	
  of	
  Sacramento	
  
Introduc)on	
  
•  Lucidworks	
  
–  Founded	
  in	
  2007	
  
–  Contributes	
  ~70%	
  of	
  the	
  open-­‐source	
  code	
  commiJed	
  
to	
  the	
  Apache	
  Lucene/Solr	
  project	
  
•  Lucidworks	
  Fusion:	
  our	
  enterprise	
  search	
  
plaNorm	
  built	
  on	
  top	
  of	
  Apache	
  Solr	
  
•  Apache	
  Solr:	
  the	
  most	
  popular	
  open-­‐source	
  
enterprise	
  search	
  engine	
  on	
  Earth	
  
Google	
  Search	
  Appliance	
  (GSA)	
  
•  Google’s	
  enterprise	
  search	
  
soluPon	
  offered	
  from	
  2002-­‐2016	
  
•  One-­‐stop	
  shopping:	
  a	
  complete	
  
enterprise-­‐search	
  soluPon	
  in	
  one	
  
box	
  
•  EoL	
  as	
  of	
  February	
  2016,	
  support	
  
phased-­‐out	
  completely	
  by	
  2018	
  
GSA	
  Strengths	
  
•  Easy	
  to	
  setup	
  and	
  configure	
  –	
  “plug	
  and	
  play”	
  
–  Lower	
  start-­‐up	
  cost	
  and	
  lower	
  Pme-­‐to-­‐value	
  than	
  
many	
  other	
  contemporary	
  soluPons	
  
–  RelaPvely	
  straighNorward	
  to	
  operate	
  on	
  an	
  ongoing	
  
basis	
  
–  Achieve	
  a	
  decent	
  search	
  experience	
  quite	
  quickly	
  and	
  
easily	
  
•  Takeaway:	
  GSA	
  minimized	
  necessary	
  investment	
  
in	
  technical	
  experPse	
  
Replacing	
  GSA	
  with	
  Fusion	
  
•  Easy	
  to	
  setup	
  and	
  configure,	
  “plug	
  and	
  play”	
  
–  Fusion	
  Index	
  Workbench	
  
•  Quickly	
  connect	
  to	
  and	
  ingest	
  data	
  
•  IntuiPvely	
  iterate	
  on	
  improving	
  search	
  results	
  
•  Easily	
  A/B	
  test	
  tweaks	
  to	
  ETL	
  logic	
  
–  Dashboards	
  and	
  Log	
  AnalyPcs	
  
–  Monitoring/alerPng	
  APIs	
  that	
  integrate	
  with	
  common	
  
tools	
  to	
  ease	
  ongoing	
  maintenance	
  
GSA	
  Strengths	
  	
  
•  Out-­‐of-­‐box	
  search	
  UI	
  
–  Highly	
  useful	
  during	
  
development,	
  iteraPng	
  on	
  
relevancy	
  improvements,	
  etc.	
  
–  Customizable	
  enough	
  to	
  use	
  
as	
  an	
  end-­‐user	
  search	
  UI	
  
•  Takeaway:	
  GSA	
  minimized	
  
necessary	
  investment	
  in	
  
technical	
  experPse	
  
Replacing	
  GSA	
  with	
  Fusion	
  
•  Out-­‐of-­‐box	
  search	
  UI	
  
–  Lucidworks	
  View	
  
•  Highly	
  
customizable/”skin-­‐able”	
  
•  Fully	
  open-­‐source:	
  
hJps://github.com/
lucidworks/lucidworks-­‐
view	
  
•  Built	
  on	
  top	
  of	
  a	
  modern	
  
stack	
  (AngularJS)	
  
GSA	
  Strengths	
  
•  Broad	
  support	
  for	
  connecPng	
  to,	
  ingesPng,	
  
and	
  securing	
  data	
  
– Many	
  out-­‐of-­‐box	
  connectors	
  to	
  common	
  sources:	
  
CRM,	
  Wikis,	
  databases	
  etc.	
  
– Extensible	
  connector	
  framework	
  
•  Takeaway:	
  GSA	
  minimized	
  necessary	
  
investment	
  in	
  technical	
  experPse	
  
Replacing	
  GSA	
  with	
  Fusion	
  
•  Broad	
  support	
  for	
  connecPng	
  to,	
  ingesPng,	
  and	
  securing	
  data	
  
–  Fusion	
  ships	
  with	
  ~40	
  connectors	
  to	
  common	
  sources	
  
•  JDBC,	
  Web,	
  Alfresco,	
  Box,	
  Dropbox,	
  Drupal,	
  Github,	
  Google	
  Drive,	
  Jive,	
  JIRA,	
  
Sharepoint,	
  MongoDB,	
  Hadoop/HDFS,	
  Salesforce,	
  Slack,	
  lots	
  more…	
  
•  Fusion	
  connectors’	
  security-­‐trimming	
  funcPonality	
  secures	
  content/searches	
  
out-­‐of-­‐box	
  
–  Fusion	
  Index	
  Pipelines	
  enable	
  easily	
  pushing	
  data	
  into	
  the	
  index	
  as	
  
well,	
  via	
  a	
  REST	
  API	
  
–  Custom	
  connector	
  development	
  via	
  Fusion’s	
  Connectors	
  API	
  
	
  
GSA	
  Weaknesses	
  
•  Broad	
  theme:	
  insufficient	
  control	
  over	
  the	
  search	
  experience	
  
–  Relevancy	
  tuning	
  and	
  controls	
  are	
  exceedingly	
  opaque	
  
•  “Source	
  Biasing”:	
  +/-­‐	
  [strong|medium|weak]	
  
–  Lack	
  of	
  control	
  over	
  indexing	
  workflow	
  
•  Custom	
  metadata	
  processing	
  was	
  a	
  chore,	
  if	
  feasible	
  
–  Oren	
  referred	
  to	
  as	
  a	
  “black	
  box”	
  design	
  
•  Non-­‐trivial	
  to	
  scale	
  
–  Appliance	
  packaging	
  restricts	
  freedom	
  in	
  scaling	
  up	
  
–  Per-­‐document	
  pricing	
  model	
  
•  Incorrect	
  facet	
  counts!?	
  
Fusion	
  –	
  Fine-­‐grained	
  Control	
  over	
  *Everything*	
  
Fusion	
  –	
  Fine-­‐grained	
  Control	
  over	
  *Everything*	
  
•  Fusion	
  Index	
  Pipelines	
  
–  True	
  fine-­‐grained	
  control	
  over	
  ETL;	
  as	
  much	
  or	
  as	
  liJle	
  as	
  desired	
  
•  For	
  content	
  from	
  source	
  X,	
  I	
  want	
  to	
  redact	
  this	
  set	
  of	
  keywords	
  
•  For	
  content	
  from	
  source	
  Y,	
  I	
  want	
  to	
  extract	
  the	
  Ptle	
  from	
  this	
  HTML	
  tag	
  
•  For	
  content	
  from	
  source	
  Z,	
  I	
  want	
  to	
  lookup	
  the	
  authorized	
  groups	
  from	
  another	
  database,	
  and	
  add	
  
them	
  to	
  a	
  field	
  in	
  each	
  document	
  
•  Fusion	
  Query	
  Pipelines	
  
–  True	
  fine-­‐grained	
  control	
  over	
  request/response	
  logic	
  at	
  query-­‐Pme	
  
•  For	
  queries	
  containing	
  keyword	
  X,	
  I	
  want	
  to	
  rewrite	
  the	
  query	
  to	
  be	
  something	
  else	
  
•  For	
  queries	
  in	
  language	
  Y,	
  I	
  want	
  to	
  boost	
  results	
  matching	
  in	
  this	
  separate	
  set	
  of	
  fields	
  
•  For	
  matching	
  documents	
  containing	
  keyword	
  Z,	
  I	
  want	
  to	
  redact	
  all	
  occurrences	
  of	
  Z	
  before	
  returning	
  
the	
  results	
  
–  Fusion	
  signals:	
  collect	
  users’	
  queries+clicks	
  and	
  aggregate	
  them	
  over	
  Pme	
  
•  UPlize	
  this	
  knowledge	
  to	
  dynamically	
  boost	
  the	
  most	
  commonly-­‐clicked	
  item(s)	
  for	
  a	
  given	
  query	
  
•  ConPnually	
  improve	
  relevancy	
  without	
  manual	
  human	
  input	
  
•  If	
  you’re	
  already	
  familiar	
  with	
  Solr/Lucene,	
  hack	
  away!	
  J	
  
Fusion	
  –	
  Fine-­‐grained	
  Control	
  over	
  *Everything*	
  
•  Scaling	
  
–  Fusion	
  uPlizes	
  best-­‐in-­‐class	
  Apache	
  Solr	
  as	
  the	
  backend	
  search	
  engine	
  
•  Scale	
  to	
  billions	
  of	
  documents	
  linearly	
  
–  Fusion	
  services	
  scale	
  independently	
  
•  As	
  opposed	
  to	
  GSA,	
  which	
  scaled	
  in	
  units	
  of	
  enPre	
  appliances	
  
•  If	
  you	
  want	
  to	
  ingest	
  content	
  faster,	
  add	
  addiPonal	
  connectors	
  nodes	
  
•  If	
  you	
  want	
  to	
  enable	
  greater	
  query	
  throughput,	
  add	
  addiPonal	
  query-­‐processing	
  nodes	
  	
  
–  StraighNorward	
  APIs/processes	
  for	
  provisioning	
  addiPonal	
  nodes	
  
•  Just	
  spin	
  up	
  a	
  new	
  node,	
  install	
  Fusion,	
  and	
  point	
  it	
  at	
  the	
  central	
  cluster	
  manager	
  
(Apache	
  Zookeeper)	
  
•  Easily	
  overlay	
  Fusion	
  on	
  top	
  of	
  any	
  exisPng	
  Solr	
  cluster	
  
Fusion	
  as	
  a	
  plaDorm	
  
•  Get	
  started	
  with	
  ease:	
  hJps://lucidworks.com/products/fusion/download/	
  
1.  Point	
  Fusion	
  at	
  your	
  data	
  
2.  Setup	
  a	
  simple	
  baseline	
  search	
  app	
  with	
  Lucidworks	
  View	
  
3.  Iterate	
  on	
  the	
  actual	
  search	
  experience	
  to	
  your	
  heart’s	
  content	
  J	
  
•  Delve	
  into	
  the	
  details	
  (or	
  don’t!)	
  
–  Fusion	
  provides	
  the	
  necessary	
  framework	
  to	
  tackle	
  tough	
  and/or	
  use-­‐case-­‐specific	
  search	
  
problems	
  
–  Anything	
  but	
  a	
  “black	
  box”	
  design	
  
–  Most	
  components	
  are	
  customizable	
  and	
  extensible	
  
•  Implement	
  your	
  own	
  Fusion	
  components	
  in	
  Java	
  using	
  our	
  APIs	
  
•  Scale	
  with	
  minimal	
  effort,	
  maximal	
  flexibility	
  
–  Scale	
  linearly	
  up	
  to	
  billions	
  of	
  docs	
  with	
  Apache	
  Solr	
  
–  Self-­‐service	
  APIs	
  for	
  se{ng	
  up	
  addiPonal	
  nodes	
  to	
  expand	
  capacity	
  
–  Per-­‐node	
  instead	
  of	
  per-­‐doc	
  pricing	
  means	
  fewer	
  surprises	
  when	
  it’s	
  Pme	
  to	
  renew	
  licenses	
  
	
  
“Fusion gave us the features we needed to replace
Google Search Appliance in a matter of weeks. With
Fusion’s out-of-the-box capabilities, we skipped months
in our dev cycle so we could focus our team where they
would have the most impact. We cut our licensing costs
by 50% and improved application usability. The
Lucidworks professional services team amplified our
success even further.
“We’re all Fusion from here on out!”
Lourduraju Pamishetty

Senior IT Application Architect

Infoblox
Customers	
  
Who’ve	
  Made	
  
the	
  Switch	
  
Fusion	
  as	
  a	
  plaDorm	
  
•  Accurate	
  facet	
  counts	
  
– What	
  a	
  concept!	
  J	
  
•  Take	
  Fusion	
  for	
  a	
  spin:	
  
hJps://lucidworks.com/products/fusion/
download/	
  
Agenda	
  
•  IntroducPon	
  to	
  County	
  of	
  Sacramento	
  
•  Why	
  Sacramento	
  County	
  is	
  search	
  first	
  for	
  data	
  delivery	
  
•  How	
  Fusion	
  helps	
  us	
  meet	
  our	
  data	
  delivery	
  challenges	
  
•  How	
  Fusion	
  has	
  helped	
  us	
  fill	
  gaps	
  ler	
  by	
  GSA	
  rePrement	
  
	
  
Sacramento	
  County	
  
•  34	
  departments	
  and	
  affiliated	
  organizaPons	
  serving	
  1.5	
  
million	
  people	
  
•  Commitment	
  to	
  open	
  government	
  and	
  transparency	
  
•  CiPzen	
  engagement	
  
Why	
  Sacramento	
  County	
  is	
  Search	
  First	
  
•  Enterprise	
  apps,	
  data	
  snackers	
  and	
  LOB	
  apps	
  
–  ADABAS	
  (Mainframe)	
  
–  RDBMS	
  
–  CDH	
  
–  ECM	
  
	
  
•  Diverse,	
  heterogeneous	
  data	
  environment	
  
	
  
•  Our	
  challenge:	
  securely	
  deliver	
  prompt	
  access	
  to	
  relevant	
  data	
  
Fusion/Solr	
  in	
  Sacramento	
  County	
  
•  Documents	
  and	
  content	
  
–  Cross-­‐repository	
  search	
  
–  Source	
  repository	
  security	
  
•  GIS	
  
•  Cross-­‐Source	
  Data	
  Processing	
  and	
  AnalyPcs	
  
–  Fusion	
  connectors	
  
–  Spark	
  in	
  Fusion	
  
•  Log	
  Analysis	
  
•  NOSQL	
  
–  Why	
  be	
  MEAN	
  when	
  you	
  can	
  be	
  SANE?	
  
Gaps	
  LeH	
  by	
  GSA	
  
Fusion	
  was	
  our	
  final	
  GSA	
  patch	
  
AgendaSearch.saccounty.net	
  
•  The	
  Brown	
  Act	
  
–  Make	
  public	
  meePngs	
  accessible	
  to	
  ciPzens	
  
–  Maintain	
  transparency	
  
	
  
•  AgendaSearch	
  
–  Search	
  and	
  consume	
  public	
  documents	
  
–  Integrate	
  with	
  agenda	
  management	
  
–  Lucidworks	
  View	
  
–  Has	
  reduced	
  PRAs	
  
Immediate	
  Win	
  with	
  View	
  
•  County	
  Legal	
  Counsel	
  
•  ~2	
  million	
  document	
  archive	
  
•  Document	
  level	
  security	
  
•  IntuiPve	
  and	
  feature	
  rich	
  UI	
  
•  Search	
  soluPon	
  delivered	
  before	
  lunch	
  
Q&A	
  
	
  
Resources:	
  
	
  
•  Download	
  Fusion:	
  hJps://lucidworks.com/products/fusion/download/	
  
•  Lucene/Solr	
  RevoluPon	
  2016	
  –	
  Oct	
  11-­‐14	
  –	
  Boston,	
  MA:	
  lucenerevoluPon.org	
  
	
  
	
  
	
  

Mais conteúdo relacionado

Mais procurados

Streaming Aggregation in Solr - New Horizons for Search: Presented by Erick E...
Streaming Aggregation in Solr - New Horizons for Search: Presented by Erick E...Streaming Aggregation in Solr - New Horizons for Search: Presented by Erick E...
Streaming Aggregation in Solr - New Horizons for Search: Presented by Erick E...
Lucidworks
 
Building a Real-Time News Search Engine: Presented by Ramkumar Aiyengar, Bloo...
Building a Real-Time News Search Engine: Presented by Ramkumar Aiyengar, Bloo...Building a Real-Time News Search Engine: Presented by Ramkumar Aiyengar, Bloo...
Building a Real-Time News Search Engine: Presented by Ramkumar Aiyengar, Bloo...
Lucidworks
 
Writing Continuous Applications with Structured Streaming PySpark API
Writing Continuous Applications with Structured Streaming PySpark APIWriting Continuous Applications with Structured Streaming PySpark API
Writing Continuous Applications with Structured Streaming PySpark API
Databricks
 
Webinar: Rapid Solr Development with Fusion
Webinar: Rapid Solr Development with FusionWebinar: Rapid Solr Development with Fusion
Webinar: Rapid Solr Development with Fusion
Lucidworks
 
Unifying your data management with Hadoop
Unifying your data management with HadoopUnifying your data management with Hadoop
Unifying your data management with Hadoop
Jayant Shekhar
 

Mais procurados (20)

Data Science with Solr and Spark
Data Science with Solr and SparkData Science with Solr and Spark
Data Science with Solr and Spark
 
Webinar: Solr 6 Deep Dive - SQL and Graph
Webinar: Solr 6 Deep Dive - SQL and GraphWebinar: Solr 6 Deep Dive - SQL and Graph
Webinar: Solr 6 Deep Dive - SQL and Graph
 
Your Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
Your Big Data Stack is Too Big!: Presented by Timothy Potter, LucidworksYour Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
Your Big Data Stack is Too Big!: Presented by Timothy Potter, Lucidworks
 
Deduplication Using Solr: Presented by Neeraj Jain, Stubhub
Deduplication Using Solr: Presented by Neeraj Jain, StubhubDeduplication Using Solr: Presented by Neeraj Jain, Stubhub
Deduplication Using Solr: Presented by Neeraj Jain, Stubhub
 
Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...
Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...
Ubiquitous Solr - A Database's Not-So-Evil Twin: Presented by Ayon Sinha, Wal...
 
Journey of Implementing Solr at Target: Presented by Raja Ramachandran, Target
Journey of Implementing Solr at Target: Presented by Raja Ramachandran, TargetJourney of Implementing Solr at Target: Presented by Raja Ramachandran, Target
Journey of Implementing Solr at Target: Presented by Raja Ramachandran, Target
 
Streaming Aggregation in Solr - New Horizons for Search: Presented by Erick E...
Streaming Aggregation in Solr - New Horizons for Search: Presented by Erick E...Streaming Aggregation in Solr - New Horizons for Search: Presented by Erick E...
Streaming Aggregation in Solr - New Horizons for Search: Presented by Erick E...
 
Big Data Processing with Spark and .NET - Microsoft Ignite 2019
Big Data Processing with Spark and .NET - Microsoft Ignite 2019Big Data Processing with Spark and .NET - Microsoft Ignite 2019
Big Data Processing with Spark and .NET - Microsoft Ignite 2019
 
Introduction to Spark SQL training workshop
Introduction to Spark SQL training workshopIntroduction to Spark SQL training workshop
Introduction to Spark SQL training workshop
 
Building a Real-Time News Search Engine: Presented by Ramkumar Aiyengar, Bloo...
Building a Real-Time News Search Engine: Presented by Ramkumar Aiyengar, Bloo...Building a Real-Time News Search Engine: Presented by Ramkumar Aiyengar, Bloo...
Building a Real-Time News Search Engine: Presented by Ramkumar Aiyengar, Bloo...
 
H-Hypermap - Heatmap Analytics at Scale: Presented by David Smiley, D W Smile...
H-Hypermap - Heatmap Analytics at Scale: Presented by David Smiley, D W Smile...H-Hypermap - Heatmap Analytics at Scale: Presented by David Smiley, D W Smile...
H-Hypermap - Heatmap Analytics at Scale: Presented by David Smiley, D W Smile...
 
Writing Continuous Applications with Structured Streaming PySpark API
Writing Continuous Applications with Structured Streaming PySpark APIWriting Continuous Applications with Structured Streaming PySpark API
Writing Continuous Applications with Structured Streaming PySpark API
 
Informational Referential Integrity Constraints Support in Apache Spark with ...
Informational Referential Integrity Constraints Support in Apache Spark with ...Informational Referential Integrity Constraints Support in Apache Spark with ...
Informational Referential Integrity Constraints Support in Apache Spark with ...
 
Real-Time Analytics with Solr: Presented by Yonik Seeley, Cloudera
Real-Time Analytics with Solr: Presented by Yonik Seeley, ClouderaReal-Time Analytics with Solr: Presented by Yonik Seeley, Cloudera
Real-Time Analytics with Solr: Presented by Yonik Seeley, Cloudera
 
Webinar: Rapid Solr Development with Fusion
Webinar: Rapid Solr Development with FusionWebinar: Rapid Solr Development with Fusion
Webinar: Rapid Solr Development with Fusion
 
Introduction to Cloudera Search Training
Introduction to Cloudera Search TrainingIntroduction to Cloudera Search Training
Introduction to Cloudera Search Training
 
Solr JDBC: Presented by Kevin Risden, Avalon Consulting
Solr JDBC: Presented by Kevin Risden, Avalon ConsultingSolr JDBC: Presented by Kevin Risden, Avalon Consulting
Solr JDBC: Presented by Kevin Risden, Avalon Consulting
 
10 Keys to Solr's Future: Presented by Grant Ingersoll, Lucidworks
10 Keys to Solr's Future: Presented by Grant Ingersoll, Lucidworks10 Keys to Solr's Future: Presented by Grant Ingersoll, Lucidworks
10 Keys to Solr's Future: Presented by Grant Ingersoll, Lucidworks
 
Unifying your data management with Hadoop
Unifying your data management with HadoopUnifying your data management with Hadoop
Unifying your data management with Hadoop
 
Never Stop Exploring - Pushing the Limits of Solr: Presented by Anirudha Jadh...
Never Stop Exploring - Pushing the Limits of Solr: Presented by Anirudha Jadh...Never Stop Exploring - Pushing the Limits of Solr: Presented by Anirudha Jadh...
Never Stop Exploring - Pushing the Limits of Solr: Presented by Anirudha Jadh...
 

Destaque

Destaque (20)

Slash n near real time indexing
Slash n   near real time indexingSlash n   near real time indexing
Slash n near real time indexing
 
Apache Solr 5.0 and beyond
Apache Solr 5.0 and beyondApache Solr 5.0 and beyond
Apache Solr 5.0 and beyond
 
Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...
Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...
Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...
 
Understanding the Solr security framework - Lucene Solr Revolution 2015
Understanding the Solr security framework - Lucene Solr Revolution 2015Understanding the Solr security framework - Lucene Solr Revolution 2015
Understanding the Solr security framework - Lucene Solr Revolution 2015
 
What's New in Apache Solr 4.10
What's New in Apache Solr 4.10What's New in Apache Solr 4.10
What's New in Apache Solr 4.10
 
What's new in Solr 5.0
What's new in Solr 5.0What's new in Solr 5.0
What's new in Solr 5.0
 
Building a Solr Continuous Delivery Pipeline with Jenkins: Presented by James...
Building a Solr Continuous Delivery Pipeline with Jenkins: Presented by James...Building a Solr Continuous Delivery Pipeline with Jenkins: Presented by James...
Building a Solr Continuous Delivery Pipeline with Jenkins: Presented by James...
 
Scaling SolrCloud to a large number of Collections
Scaling SolrCloud to a large number of CollectionsScaling SolrCloud to a large number of Collections
Scaling SolrCloud to a large number of Collections
 
Ease of use in Apache Solr
Ease of use in Apache SolrEase of use in Apache Solr
Ease of use in Apache Solr
 
it's just search
it's just searchit's just search
it's just search
 
Cross Data Center Replication for the Enterprise: Presented by Adam Williams,...
Cross Data Center Replication for the Enterprise: Presented by Adam Williams,...Cross Data Center Replication for the Enterprise: Presented by Adam Williams,...
Cross Data Center Replication for the Enterprise: Presented by Adam Williams,...
 
Solr security frameworks
Solr security frameworksSolr security frameworks
Solr security frameworks
 
SolrCloud Cluster management via APIs
SolrCloud Cluster management via APIsSolrCloud Cluster management via APIs
SolrCloud Cluster management via APIs
 
Using Apache Solr for Images as Big Data: Presented by Kerry Koitzsch, Wipro...
Using Apache Solr for Images as Big Data: Presented by Kerry Koitzsch,  Wipro...Using Apache Solr for Images as Big Data: Presented by Kerry Koitzsch,  Wipro...
Using Apache Solr for Images as Big Data: Presented by Kerry Koitzsch, Wipro...
 
Downtown SF Lucene/Solr Meetup: Developing Scalable Search for User Generated...
Downtown SF Lucene/Solr Meetup: Developing Scalable Search for User Generated...Downtown SF Lucene/Solr Meetup: Developing Scalable Search for User Generated...
Downtown SF Lucene/Solr Meetup: Developing Scalable Search for User Generated...
 
Working with deeply nested documents in Apache Solr
Working with deeply nested documents in Apache SolrWorking with deeply nested documents in Apache Solr
Working with deeply nested documents in Apache Solr
 
Managing a SolrCloud cluster using APIs
Managing a SolrCloud cluster using APIsManaging a SolrCloud cluster using APIs
Managing a SolrCloud cluster using APIs
 
Coffee, Danish & Search: Presented by Alan Woodward & Charlie Hull, Flax
Coffee, Danish & Search: Presented by Alan Woodward & Charlie Hull, FlaxCoffee, Danish & Search: Presented by Alan Woodward & Charlie Hull, Flax
Coffee, Danish & Search: Presented by Alan Woodward & Charlie Hull, Flax
 
Deploying and managing Solr at scale
Deploying and managing Solr at scaleDeploying and managing Solr at scale
Deploying and managing Solr at scale
 
Building a Vibrant Search Ecosystem @ Bloomberg: Presented by Steven Bower & ...
Building a Vibrant Search Ecosystem @ Bloomberg: Presented by Steven Bower & ...Building a Vibrant Search Ecosystem @ Bloomberg: Presented by Steven Bower & ...
Building a Vibrant Search Ecosystem @ Bloomberg: Presented by Steven Bower & ...
 

Semelhante a Webinar: Replace Google Search Appliance with Lucidworks Fusion

SQL Analytics for Search Engineers - Timothy Potter, Lucidworksngineers
SQL Analytics for Search Engineers - Timothy Potter, LucidworksngineersSQL Analytics for Search Engineers - Timothy Potter, Lucidworksngineers
SQL Analytics for Search Engineers - Timothy Potter, Lucidworksngineers
Lucidworks
 
Ultra-scale e-Commerce Transaction Services with Lean Middleware
Ultra-scale e-Commerce Transaction Services with Lean Middleware Ultra-scale e-Commerce Transaction Services with Lean Middleware
Ultra-scale e-Commerce Transaction Services with Lean Middleware
WSO2
 
Hunk: Splunk Analytics for Hadoop
Hunk: Splunk Analytics for HadoopHunk: Splunk Analytics for Hadoop
Hunk: Splunk Analytics for Hadoop
Georg Knon
 

Semelhante a Webinar: Replace Google Search Appliance with Lucidworks Fusion (20)

Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
Building Scalable Big Data Infrastructure Using Open Source Software Presenta...
 
Cloudera Search Webinar: Big Data Search, Bigger Insights
Cloudera Search Webinar: Big Data Search, Bigger InsightsCloudera Search Webinar: Big Data Search, Bigger Insights
Cloudera Search Webinar: Big Data Search, Bigger Insights
 
SQL Analytics for Search Engineers - Timothy Potter, Lucidworksngineers
SQL Analytics for Search Engineers - Timothy Potter, LucidworksngineersSQL Analytics for Search Engineers - Timothy Potter, Lucidworksngineers
SQL Analytics for Search Engineers - Timothy Potter, Lucidworksngineers
 
Webinar: What's New in Fusion 4.1
Webinar: What's New in Fusion 4.1Webinar: What's New in Fusion 4.1
Webinar: What's New in Fusion 4.1
 
Taking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level – ArchitectureTaking Splunk to the Next Level – Architecture
Taking Splunk to the Next Level – Architecture
 
Implementing Site Search in CQ5 / AEM
Implementing Site Search in CQ5 / AEMImplementing Site Search in CQ5 / AEM
Implementing Site Search in CQ5 / AEM
 
How Open Source Embiggens Salesforce.com
How Open Source Embiggens Salesforce.comHow Open Source Embiggens Salesforce.com
How Open Source Embiggens Salesforce.com
 
Qtr 3 2012 Ppt
Qtr 3 2012 PptQtr 3 2012 Ppt
Qtr 3 2012 Ppt
 
OSGi Community Event 2010 - Migration from Java EE Application Server to Serv...
OSGi Community Event 2010 - Migration from Java EE Application Server to Serv...OSGi Community Event 2010 - Migration from Java EE Application Server to Serv...
OSGi Community Event 2010 - Migration from Java EE Application Server to Serv...
 
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stack
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stackAccelerating analytics in the cloud with the Starburst Presto + Alluxio stack
Accelerating analytics in the cloud with the Starburst Presto + Alluxio stack
 
Ultra-scale e-Commerce Transaction Services with Lean Middleware
Ultra-scale e-Commerce Transaction Services with Lean Middleware Ultra-scale e-Commerce Transaction Services with Lean Middleware
Ultra-scale e-Commerce Transaction Services with Lean Middleware
 
Patching is Your Friend in the New World Order of EPM and ERP Cloud
Patching is Your Friend in the New World Order of EPM and ERP CloudPatching is Your Friend in the New World Order of EPM and ERP Cloud
Patching is Your Friend in the New World Order of EPM and ERP Cloud
 
Webinar: Site Search in an Hour with Fusion
Webinar: Site Search in an Hour with FusionWebinar: Site Search in an Hour with Fusion
Webinar: Site Search in an Hour with Fusion
 
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
Justin Sheppard & Ankur Gupta from Sears Holdings Corporation - Single point ...
 
Fried dallas spug
Fried dallas spugFried dallas spug
Fried dallas spug
 
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
 
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recallICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
ICIC 2013 Conference Proceedings Andreas Pesenhofer max.recall
 
Accelerating workloads and bursting data with Google Dataproc & Alluxio
Accelerating workloads and bursting data with Google Dataproc & AlluxioAccelerating workloads and bursting data with Google Dataproc & Alluxio
Accelerating workloads and bursting data with Google Dataproc & Alluxio
 
Hunk: Splunk Analytics for Hadoop
Hunk: Splunk Analytics for HadoopHunk: Splunk Analytics for Hadoop
Hunk: Splunk Analytics for Hadoop
 
SplunkLive London 2014 Developer Presentation
SplunkLive London 2014  Developer PresentationSplunkLive London 2014  Developer Presentation
SplunkLive London 2014 Developer Presentation
 

Mais de Lucidworks

Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
Lucidworks
 

Mais de Lucidworks (20)

Search is the Tip of the Spear for Your B2B eCommerce Strategy
Search is the Tip of the Spear for Your B2B eCommerce StrategySearch is the Tip of the Spear for Your B2B eCommerce Strategy
Search is the Tip of the Spear for Your B2B eCommerce Strategy
 
Drive Agent Effectiveness in Salesforce
Drive Agent Effectiveness in SalesforceDrive Agent Effectiveness in Salesforce
Drive Agent Effectiveness in Salesforce
 
How Crate & Barrel Connects Shoppers with Relevant Products
How Crate & Barrel Connects Shoppers with Relevant ProductsHow Crate & Barrel Connects Shoppers with Relevant Products
How Crate & Barrel Connects Shoppers with Relevant Products
 
Lucidworks & IMRG Webinar – Best-In-Class Retail Product Discovery
Lucidworks & IMRG Webinar – Best-In-Class Retail Product DiscoveryLucidworks & IMRG Webinar – Best-In-Class Retail Product Discovery
Lucidworks & IMRG Webinar – Best-In-Class Retail Product Discovery
 
Connected Experiences Are Personalized Experiences
Connected Experiences Are Personalized ExperiencesConnected Experiences Are Personalized Experiences
Connected Experiences Are Personalized Experiences
 
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
 
[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...
[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...
[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...
 
Preparing for Peak in Ecommerce | eTail Asia 2020
Preparing for Peak in Ecommerce | eTail Asia 2020Preparing for Peak in Ecommerce | eTail Asia 2020
Preparing for Peak in Ecommerce | eTail Asia 2020
 
Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...
Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...
Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...
 
AI-Powered Linguistics and Search with Fusion and Rosette
AI-Powered Linguistics and Search with Fusion and RosetteAI-Powered Linguistics and Search with Fusion and Rosette
AI-Powered Linguistics and Search with Fusion and Rosette
 
The Service Industry After COVID-19: The Soul of Service in a Virtual Moment
The Service Industry After COVID-19: The Soul of Service in a Virtual MomentThe Service Industry After COVID-19: The Soul of Service in a Virtual Moment
The Service Industry After COVID-19: The Soul of Service in a Virtual Moment
 
Webinar: Smart answers for employee and customer support after covid 19 - Europe
Webinar: Smart answers for employee and customer support after covid 19 - EuropeWebinar: Smart answers for employee and customer support after covid 19 - Europe
Webinar: Smart answers for employee and customer support after covid 19 - Europe
 
Smart Answers for Employee and Customer Support After COVID-19
Smart Answers for Employee and Customer Support After COVID-19Smart Answers for Employee and Customer Support After COVID-19
Smart Answers for Employee and Customer Support After COVID-19
 
Applying AI & Search in Europe - featuring 451 Research
Applying AI & Search in Europe - featuring 451 ResearchApplying AI & Search in Europe - featuring 451 Research
Applying AI & Search in Europe - featuring 451 Research
 
Webinar: Accelerate Data Science with Fusion 5.1
Webinar: Accelerate Data Science with Fusion 5.1Webinar: Accelerate Data Science with Fusion 5.1
Webinar: Accelerate Data Science with Fusion 5.1
 
Webinar: 5 Must-Have Items You Need for Your 2020 Ecommerce Strategy
Webinar: 5 Must-Have Items You Need for Your 2020 Ecommerce StrategyWebinar: 5 Must-Have Items You Need for Your 2020 Ecommerce Strategy
Webinar: 5 Must-Have Items You Need for Your 2020 Ecommerce Strategy
 
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...
 
Apply Knowledge Graphs and Search for Real-World Decision Intelligence
Apply Knowledge Graphs and Search for Real-World Decision IntelligenceApply Knowledge Graphs and Search for Real-World Decision Intelligence
Apply Knowledge Graphs and Search for Real-World Decision Intelligence
 
Webinar: Building a Business Case for Enterprise Search
Webinar: Building a Business Case for Enterprise SearchWebinar: Building a Business Case for Enterprise Search
Webinar: Building a Business Case for Enterprise Search
 
Why Insight Engines Matter in 2020 and Beyond
Why Insight Engines Matter in 2020 and BeyondWhy Insight Engines Matter in 2020 and Beyond
Why Insight Engines Matter in 2020 and Beyond
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 

Webinar: Replace Google Search Appliance with Lucidworks Fusion

  • 1.
  • 2. Replacing  GSA  with  Lucidworks   Fusion     Evan  Sayer   Senior  Search  Engineer   Lucidworks     Guy  Sperry   Enterprise  Content  Management  &  Big  Data  Architect     County  of  Sacramento  
  • 3.
  • 4. Introduc)on   •  Lucidworks   –  Founded  in  2007   –  Contributes  ~70%  of  the  open-­‐source  code  commiJed   to  the  Apache  Lucene/Solr  project   •  Lucidworks  Fusion:  our  enterprise  search   plaNorm  built  on  top  of  Apache  Solr   •  Apache  Solr:  the  most  popular  open-­‐source   enterprise  search  engine  on  Earth  
  • 5. Google  Search  Appliance  (GSA)   •  Google’s  enterprise  search   soluPon  offered  from  2002-­‐2016   •  One-­‐stop  shopping:  a  complete   enterprise-­‐search  soluPon  in  one   box   •  EoL  as  of  February  2016,  support   phased-­‐out  completely  by  2018  
  • 6. GSA  Strengths   •  Easy  to  setup  and  configure  –  “plug  and  play”   –  Lower  start-­‐up  cost  and  lower  Pme-­‐to-­‐value  than   many  other  contemporary  soluPons   –  RelaPvely  straighNorward  to  operate  on  an  ongoing   basis   –  Achieve  a  decent  search  experience  quite  quickly  and   easily   •  Takeaway:  GSA  minimized  necessary  investment   in  technical  experPse  
  • 7. Replacing  GSA  with  Fusion   •  Easy  to  setup  and  configure,  “plug  and  play”   –  Fusion  Index  Workbench   •  Quickly  connect  to  and  ingest  data   •  IntuiPvely  iterate  on  improving  search  results   •  Easily  A/B  test  tweaks  to  ETL  logic   –  Dashboards  and  Log  AnalyPcs   –  Monitoring/alerPng  APIs  that  integrate  with  common   tools  to  ease  ongoing  maintenance  
  • 8. GSA  Strengths     •  Out-­‐of-­‐box  search  UI   –  Highly  useful  during   development,  iteraPng  on   relevancy  improvements,  etc.   –  Customizable  enough  to  use   as  an  end-­‐user  search  UI   •  Takeaway:  GSA  minimized   necessary  investment  in   technical  experPse  
  • 9. Replacing  GSA  with  Fusion   •  Out-­‐of-­‐box  search  UI   –  Lucidworks  View   •  Highly   customizable/”skin-­‐able”   •  Fully  open-­‐source:   hJps://github.com/ lucidworks/lucidworks-­‐ view   •  Built  on  top  of  a  modern   stack  (AngularJS)  
  • 10. GSA  Strengths   •  Broad  support  for  connecPng  to,  ingesPng,   and  securing  data   – Many  out-­‐of-­‐box  connectors  to  common  sources:   CRM,  Wikis,  databases  etc.   – Extensible  connector  framework   •  Takeaway:  GSA  minimized  necessary   investment  in  technical  experPse  
  • 11. Replacing  GSA  with  Fusion   •  Broad  support  for  connecPng  to,  ingesPng,  and  securing  data   –  Fusion  ships  with  ~40  connectors  to  common  sources   •  JDBC,  Web,  Alfresco,  Box,  Dropbox,  Drupal,  Github,  Google  Drive,  Jive,  JIRA,   Sharepoint,  MongoDB,  Hadoop/HDFS,  Salesforce,  Slack,  lots  more…   •  Fusion  connectors’  security-­‐trimming  funcPonality  secures  content/searches   out-­‐of-­‐box   –  Fusion  Index  Pipelines  enable  easily  pushing  data  into  the  index  as   well,  via  a  REST  API   –  Custom  connector  development  via  Fusion’s  Connectors  API    
  • 12. GSA  Weaknesses   •  Broad  theme:  insufficient  control  over  the  search  experience   –  Relevancy  tuning  and  controls  are  exceedingly  opaque   •  “Source  Biasing”:  +/-­‐  [strong|medium|weak]   –  Lack  of  control  over  indexing  workflow   •  Custom  metadata  processing  was  a  chore,  if  feasible   –  Oren  referred  to  as  a  “black  box”  design   •  Non-­‐trivial  to  scale   –  Appliance  packaging  restricts  freedom  in  scaling  up   –  Per-­‐document  pricing  model   •  Incorrect  facet  counts!?  
  • 13. Fusion  –  Fine-­‐grained  Control  over  *Everything*  
  • 14. Fusion  –  Fine-­‐grained  Control  over  *Everything*   •  Fusion  Index  Pipelines   –  True  fine-­‐grained  control  over  ETL;  as  much  or  as  liJle  as  desired   •  For  content  from  source  X,  I  want  to  redact  this  set  of  keywords   •  For  content  from  source  Y,  I  want  to  extract  the  Ptle  from  this  HTML  tag   •  For  content  from  source  Z,  I  want  to  lookup  the  authorized  groups  from  another  database,  and  add   them  to  a  field  in  each  document   •  Fusion  Query  Pipelines   –  True  fine-­‐grained  control  over  request/response  logic  at  query-­‐Pme   •  For  queries  containing  keyword  X,  I  want  to  rewrite  the  query  to  be  something  else   •  For  queries  in  language  Y,  I  want  to  boost  results  matching  in  this  separate  set  of  fields   •  For  matching  documents  containing  keyword  Z,  I  want  to  redact  all  occurrences  of  Z  before  returning   the  results   –  Fusion  signals:  collect  users’  queries+clicks  and  aggregate  them  over  Pme   •  UPlize  this  knowledge  to  dynamically  boost  the  most  commonly-­‐clicked  item(s)  for  a  given  query   •  ConPnually  improve  relevancy  without  manual  human  input   •  If  you’re  already  familiar  with  Solr/Lucene,  hack  away!  J  
  • 15. Fusion  –  Fine-­‐grained  Control  over  *Everything*   •  Scaling   –  Fusion  uPlizes  best-­‐in-­‐class  Apache  Solr  as  the  backend  search  engine   •  Scale  to  billions  of  documents  linearly   –  Fusion  services  scale  independently   •  As  opposed  to  GSA,  which  scaled  in  units  of  enPre  appliances   •  If  you  want  to  ingest  content  faster,  add  addiPonal  connectors  nodes   •  If  you  want  to  enable  greater  query  throughput,  add  addiPonal  query-­‐processing  nodes     –  StraighNorward  APIs/processes  for  provisioning  addiPonal  nodes   •  Just  spin  up  a  new  node,  install  Fusion,  and  point  it  at  the  central  cluster  manager   (Apache  Zookeeper)   •  Easily  overlay  Fusion  on  top  of  any  exisPng  Solr  cluster  
  • 16. Fusion  as  a  plaDorm   •  Get  started  with  ease:  hJps://lucidworks.com/products/fusion/download/   1.  Point  Fusion  at  your  data   2.  Setup  a  simple  baseline  search  app  with  Lucidworks  View   3.  Iterate  on  the  actual  search  experience  to  your  heart’s  content  J   •  Delve  into  the  details  (or  don’t!)   –  Fusion  provides  the  necessary  framework  to  tackle  tough  and/or  use-­‐case-­‐specific  search   problems   –  Anything  but  a  “black  box”  design   –  Most  components  are  customizable  and  extensible   •  Implement  your  own  Fusion  components  in  Java  using  our  APIs   •  Scale  with  minimal  effort,  maximal  flexibility   –  Scale  linearly  up  to  billions  of  docs  with  Apache  Solr   –  Self-­‐service  APIs  for  se{ng  up  addiPonal  nodes  to  expand  capacity   –  Per-­‐node  instead  of  per-­‐doc  pricing  means  fewer  surprises  when  it’s  Pme  to  renew  licenses    
  • 17. “Fusion gave us the features we needed to replace Google Search Appliance in a matter of weeks. With Fusion’s out-of-the-box capabilities, we skipped months in our dev cycle so we could focus our team where they would have the most impact. We cut our licensing costs by 50% and improved application usability. The Lucidworks professional services team amplified our success even further. “We’re all Fusion from here on out!” Lourduraju Pamishetty
 Senior IT Application Architect
 Infoblox
  • 18. Customers   Who’ve  Made   the  Switch  
  • 19. Fusion  as  a  plaDorm   •  Accurate  facet  counts   – What  a  concept!  J   •  Take  Fusion  for  a  spin:   hJps://lucidworks.com/products/fusion/ download/  
  • 20. Agenda   •  IntroducPon  to  County  of  Sacramento   •  Why  Sacramento  County  is  search  first  for  data  delivery   •  How  Fusion  helps  us  meet  our  data  delivery  challenges   •  How  Fusion  has  helped  us  fill  gaps  ler  by  GSA  rePrement    
  • 21. Sacramento  County   •  34  departments  and  affiliated  organizaPons  serving  1.5   million  people   •  Commitment  to  open  government  and  transparency   •  CiPzen  engagement  
  • 22. Why  Sacramento  County  is  Search  First   •  Enterprise  apps,  data  snackers  and  LOB  apps   –  ADABAS  (Mainframe)   –  RDBMS   –  CDH   –  ECM     •  Diverse,  heterogeneous  data  environment     •  Our  challenge:  securely  deliver  prompt  access  to  relevant  data  
  • 23. Fusion/Solr  in  Sacramento  County   •  Documents  and  content   –  Cross-­‐repository  search   –  Source  repository  security   •  GIS   •  Cross-­‐Source  Data  Processing  and  AnalyPcs   –  Fusion  connectors   –  Spark  in  Fusion   •  Log  Analysis   •  NOSQL   –  Why  be  MEAN  when  you  can  be  SANE?  
  • 24. Gaps  LeH  by  GSA   Fusion  was  our  final  GSA  patch  
  • 25. AgendaSearch.saccounty.net   •  The  Brown  Act   –  Make  public  meePngs  accessible  to  ciPzens   –  Maintain  transparency     •  AgendaSearch   –  Search  and  consume  public  documents   –  Integrate  with  agenda  management   –  Lucidworks  View   –  Has  reduced  PRAs  
  • 26. Immediate  Win  with  View   •  County  Legal  Counsel   •  ~2  million  document  archive   •  Document  level  security   •  IntuiPve  and  feature  rich  UI   •  Search  soluPon  delivered  before  lunch  
  • 27. Q&A     Resources:     •  Download  Fusion:  hJps://lucidworks.com/products/fusion/download/   •  Lucene/Solr  RevoluPon  2016  –  Oct  11-­‐14  –  Boston,  MA:  lucenerevoluPon.org