SlideShare uma empresa Scribd logo
1 de 61
Baixar para ler offline
Apache Solr
Robert Douglass, Acquia
Anniversary

• two years ago...
• at a conference called FOSDEM...
• the Apache Solr module was introduced.
Anniversary


• it was 20% software ....
• and 80% vaporware.
Anniversary
A lot has changed.
A lot has changed


• Acquia Search
Acquia Search

• Acquia’s hosted service - big success
• In combination with Acquia Stack Installer
  and a trial or basic subscription, 5 minutes
  to install and have Solr search running.
Acquia Search


• http://acquia.com/acquia-search
• http://acquia.com/downloads
A lot has changed


• Acquia Search
• Lucid Imagination
Lucid Imagination

• Venture funded Boston-based company
  (just like Acquia)
• Core Solr committers (just like Acquia)
• Solr support and services (very much like
  Acquia)
Lucid Imagination

• Accelerating pace of Solr development (just
  like Acquia)
• Big interest from government clients (just
  like Acquia)
Acquia
          and
   Lucid Imagination
Two good reasons to choose Drupal and Solr as
      long-term technology platforms.

              http://acquia.com

         http://lucidimagination.com
A lot has changed

• Acquia Search
• Lucid Imagination
• Drupal.org
Drupal.org
• ~50% of page requests get main content
  from Solr
 • site search
 • project listing pages (modules, themes,
    etc)
 • Issue queue listings
Drupal.org
Drupal.org
Drupal.org
Drupal.org

• Vastly improved search experience
 • Faceting
 • Better relevancy
 • Better performance
 • Better scalability
A lot has changed

• Acquia Search
• Lucid Imagination
• Drupal.org
• Whitehouse.gov
Whitehouse.gov
Whitehouse.gov
Whitehouse.gov
Whitehouse.gov
Some dreams
       are now reality
• Faceted search: Was new ... is now
  household word
• Spelling suggestions
• Field boosting
• Content recommendation
Some dreams
       are now reality
• Faceted search: Was new ... is now
  household word
• Spelling suggestions
• Field boosting
• Content recommendation
Improving spelling
       suggestions
• In your solr/conf directory, look into the
  file called spellings.txt
• It has two words in it:
   “pizza”
   “history”
• Tip: Get or build a dictionary to fill that file
  with correctly spelled words.
Improving spelling
          suggestions

•    Use taxonomy terms, vocabularies and
    synonyms
•   Use content types
•   Use user names
•   Use online dictionaries
Improving spelling
          suggestions
An idea for a contributed module

•    Use taxonomy terms, vocabularies and
    synonyms
•   Use content types
•   Use user names
•   Use online dictionaries
Some dreams
       are now reality
• Faceted search: Was new ... is now
  household word
• Spelling suggestions
• Field boosting
• Content recommendation
Some dreams reality
Some dreams reality
Some dreams reality
Some dreams
       are now reality
• Faceted search: Was new ... is now
  household word
• Spelling suggestions
• Field boosting
• Content recommendation
Some dreams reality
Some dreams reality
Some dreams reality
  Greater control over recommendations:


      A patch to review:

http://drupal.org/node/372767
2 Years of Pure Party

• ApacheSolr Views Statistics
• File search      •
• Multisite search • Autocomplete
• Comment search   • Display Suite
• CCK Date facets
Apache Solr + Views 3

• Views builds Solr query
• Views displays query results
• Build custom search forms with exposed
  filters
• Faceting works as well
Apache Solr + Views 3
Apache Solr + Views 3


http://acquia.com/node/911667
File Search
                    New blog post:

http://robshouse.net/blog-post/use-apache-solr-search-files

    http://acquia.com/blog/use-apache-solr-search-files

                           or

             http://acquia.com/node/1129446
File Search
File Search
Multisite Search
Comment Search
Comment Search
CCK Date Facets
CCK Date Facets

       Help test cck date facets:


http://drupal.org/node/558160
Statistics


http://drupal.org/project/apachesolr_stats
Statistics
Statistics
Statistics
Autocomplete


http://drupal.org/project/apachesolr_autocomplete
Autocomplete
Display Suite

http://drupal.org/project/ds
Display Suite
Stuff on the horizon
• Solr 1.5 features
 • GeoSpatial search
 • Autocomplete component
 • eDismax (Extended dismax) - supports
    raw Lucene syntax, among other thigngs.
    Opens the door for integration with
    Lucene API module. (http://drupal.org/
    project/luceneapi)
Stuff on the horizon


• Crawling with Nutch
• Acquia support for multisite, file and
  geospatial search
Urgent needs

• Drupal.org relaunch
  http://drupal.org/node/704062
• Test suites
• Drupal 7 version
House cleaning

• Help us refactor
• Better APIs
• Better Documentation
Any Questions?

Mais conteúdo relacionado

Mais procurados

Data Science with Solr and Spark
Data Science with Solr and SparkData Science with Solr and Spark
Data Science with Solr and SparkLucidworks
 
Introduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and UsecasesIntroduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and UsecasesRahul Jain
 
Introduction to Apache Solr
Introduction to Apache SolrIntroduction to Apache Solr
Introduction to Apache SolrAndy Jackson
 
Simple Fuzzy Name Matching in Solr: Presented by Chris Mack, Basis Technology
Simple Fuzzy Name Matching in Solr: Presented by Chris Mack, Basis TechnologySimple Fuzzy Name Matching in Solr: Presented by Chris Mack, Basis Technology
Simple Fuzzy Name Matching in Solr: Presented by Chris Mack, Basis TechnologyLucidworks
 
Battle of the giants: Apache Solr vs ElasticSearch
Battle of the giants: Apache Solr vs ElasticSearchBattle of the giants: Apache Solr vs ElasticSearch
Battle of the giants: Apache Solr vs ElasticSearchRafał Kuć
 
Scaling Recommendations, Semantic Search, & Data Analytics with solr
Scaling Recommendations, Semantic Search, & Data Analytics with solrScaling Recommendations, Semantic Search, & Data Analytics with solr
Scaling Recommendations, Semantic Search, & Data Analytics with solrTrey Grainger
 
Introduction to Apache Solr
Introduction to Apache SolrIntroduction to Apache Solr
Introduction to Apache SolrChristos Manios
 
Rapid Prototyping with Solr
Rapid Prototyping with SolrRapid Prototyping with Solr
Rapid Prototyping with SolrErik Hatcher
 
Building Intelligent Search Applications with Apache Solr and PHP5
Building Intelligent Search Applications with Apache Solr and PHP5Building Intelligent Search Applications with Apache Solr and PHP5
Building Intelligent Search Applications with Apache Solr and PHP5israelekpo
 
Beyond full-text searches with Lucene and Solr
Beyond full-text searches with Lucene and SolrBeyond full-text searches with Lucene and Solr
Beyond full-text searches with Lucene and SolrBertrand Delacretaz
 
Solr Recipes Workshop
Solr Recipes WorkshopSolr Recipes Workshop
Solr Recipes WorkshopErik Hatcher
 
Apache Solr/Lucene Internals by Anatoliy Sokolenko
Apache Solr/Lucene Internals  by Anatoliy SokolenkoApache Solr/Lucene Internals  by Anatoliy Sokolenko
Apache Solr/Lucene Internals by Anatoliy SokolenkoProvectus
 
Rebuilding Solr 6 Examples - Layer by Layer: Presented by Alexandre Rafalovit...
Rebuilding Solr 6 Examples - Layer by Layer: Presented by Alexandre Rafalovit...Rebuilding Solr 6 Examples - Layer by Layer: Presented by Alexandre Rafalovit...
Rebuilding Solr 6 Examples - Layer by Layer: Presented by Alexandre Rafalovit...Lucidworks
 
20130310 solr tuorial
20130310 solr tuorial20130310 solr tuorial
20130310 solr tuorialChris Huang
 
Why Is My Solr Slow?: Presented by Mike Drob, Cloudera
Why Is My Solr Slow?: Presented by Mike Drob, ClouderaWhy Is My Solr Slow?: Presented by Mike Drob, Cloudera
Why Is My Solr Slow?: Presented by Mike Drob, ClouderaLucidworks
 
Solr Application Development Tutorial
Solr Application Development TutorialSolr Application Development Tutorial
Solr Application Development TutorialErik Hatcher
 
Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...
Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...
Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...Lucidworks
 
The ultimate guide for Elasticsearch plugins
The ultimate guide for Elasticsearch pluginsThe ultimate guide for Elasticsearch plugins
The ultimate guide for Elasticsearch pluginsItamar
 

Mais procurados (20)

Data Science with Solr and Spark
Data Science with Solr and SparkData Science with Solr and Spark
Data Science with Solr and Spark
 
Introduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and UsecasesIntroduction to Lucene & Solr and Usecases
Introduction to Lucene & Solr and Usecases
 
Introduction to Apache Solr
Introduction to Apache SolrIntroduction to Apache Solr
Introduction to Apache Solr
 
Simple Fuzzy Name Matching in Solr: Presented by Chris Mack, Basis Technology
Simple Fuzzy Name Matching in Solr: Presented by Chris Mack, Basis TechnologySimple Fuzzy Name Matching in Solr: Presented by Chris Mack, Basis Technology
Simple Fuzzy Name Matching in Solr: Presented by Chris Mack, Basis Technology
 
Battle of the giants: Apache Solr vs ElasticSearch
Battle of the giants: Apache Solr vs ElasticSearchBattle of the giants: Apache Solr vs ElasticSearch
Battle of the giants: Apache Solr vs ElasticSearch
 
Scaling Recommendations, Semantic Search, & Data Analytics with solr
Scaling Recommendations, Semantic Search, & Data Analytics with solrScaling Recommendations, Semantic Search, & Data Analytics with solr
Scaling Recommendations, Semantic Search, & Data Analytics with solr
 
Introduction to Apache Solr
Introduction to Apache SolrIntroduction to Apache Solr
Introduction to Apache Solr
 
Rapid Prototyping with Solr
Rapid Prototyping with SolrRapid Prototyping with Solr
Rapid Prototyping with Solr
 
Building Intelligent Search Applications with Apache Solr and PHP5
Building Intelligent Search Applications with Apache Solr and PHP5Building Intelligent Search Applications with Apache Solr and PHP5
Building Intelligent Search Applications with Apache Solr and PHP5
 
Beyond full-text searches with Lucene and Solr
Beyond full-text searches with Lucene and SolrBeyond full-text searches with Lucene and Solr
Beyond full-text searches with Lucene and Solr
 
Solr Recipes Workshop
Solr Recipes WorkshopSolr Recipes Workshop
Solr Recipes Workshop
 
Apache Solr/Lucene Internals by Anatoliy Sokolenko
Apache Solr/Lucene Internals  by Anatoliy SokolenkoApache Solr/Lucene Internals  by Anatoliy Sokolenko
Apache Solr/Lucene Internals by Anatoliy Sokolenko
 
Rebuilding Solr 6 Examples - Layer by Layer: Presented by Alexandre Rafalovit...
Rebuilding Solr 6 Examples - Layer by Layer: Presented by Alexandre Rafalovit...Rebuilding Solr 6 Examples - Layer by Layer: Presented by Alexandre Rafalovit...
Rebuilding Solr 6 Examples - Layer by Layer: Presented by Alexandre Rafalovit...
 
Apache Solr
Apache SolrApache Solr
Apache Solr
 
20130310 solr tuorial
20130310 solr tuorial20130310 solr tuorial
20130310 solr tuorial
 
Apache Solr
Apache SolrApache Solr
Apache Solr
 
Why Is My Solr Slow?: Presented by Mike Drob, Cloudera
Why Is My Solr Slow?: Presented by Mike Drob, ClouderaWhy Is My Solr Slow?: Presented by Mike Drob, Cloudera
Why Is My Solr Slow?: Presented by Mike Drob, Cloudera
 
Solr Application Development Tutorial
Solr Application Development TutorialSolr Application Development Tutorial
Solr Application Development Tutorial
 
Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...
Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...
Downtown SF Lucene/Solr Meetup: Developing Scalable User Search for PlayStati...
 
The ultimate guide for Elasticsearch plugins
The ultimate guide for Elasticsearch pluginsThe ultimate guide for Elasticsearch plugins
The ultimate guide for Elasticsearch plugins
 

Semelhante a State-of-the-Art Drupal Search with Apache Solr

Tips for Tuning Solr Search: No Coding Required
Tips for Tuning Solr Search: No Coding RequiredTips for Tuning Solr Search: No Coding Required
Tips for Tuning Solr Search: No Coding RequiredAcquia
 
5 Reasons Your Site Needs Acquia Search
5 Reasons Your Site Needs Acquia Search5 Reasons Your Site Needs Acquia Search
5 Reasons Your Site Needs Acquia SearchAcquia
 
Plone for Education: Bibliographies
Plone for Education: BibliographiesPlone for Education: Bibliographies
Plone for Education: BibliographiesCristopher Ewing
 
44 Slides About 22 Modules
44 Slides About 22 Modules44 Slides About 22 Modules
44 Slides About 22 Modulesheyrocker
 
Intro to Apache Solr for Drupal
Intro to Apache Solr for DrupalIntro to Apache Solr for Drupal
Intro to Apache Solr for DrupalChris Caple
 
Digital Publishing with the OSCI Toolkit - Workshop MCN 2012
Digital Publishing with the OSCI Toolkit - Workshop MCN 2012Digital Publishing with the OSCI Toolkit - Workshop MCN 2012
Digital Publishing with the OSCI Toolkit - Workshop MCN 2012graybowman
 
Showcasing drupal
Showcasing drupalShowcasing drupal
Showcasing drupalOpevel
 
Challenges of Simple Documents: When Basic isn't so Basic - Cassandra Targett...
Challenges of Simple Documents: When Basic isn't so Basic - Cassandra Targett...Challenges of Simple Documents: When Basic isn't so Basic - Cassandra Targett...
Challenges of Simple Documents: When Basic isn't so Basic - Cassandra Targett...Lucidworks
 
Evaluating Drupal for the Enterprise
Evaluating Drupal for the EnterpriseEvaluating Drupal for the Enterprise
Evaluating Drupal for the Enterpriseultimike
 
Search all the things
Search all the thingsSearch all the things
Search all the thingscyberswat
 
Library Mashups & APIs
Library Mashups & APIsLibrary Mashups & APIs
Library Mashups & APIslibrarywebchic
 
Meet Solr For The Tirst Again
Meet Solr For The Tirst AgainMeet Solr For The Tirst Again
Meet Solr For The Tirst AgainVarun Thacker
 
Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Oscar Corcho
 
Solr + Hadoop: Interactive Search for Hadoop
Solr + Hadoop: Interactive Search for HadoopSolr + Hadoop: Interactive Search for Hadoop
Solr + Hadoop: Interactive Search for Hadoopgregchanan
 
Creating a Documentation Portal
Creating a Documentation PortalCreating a Documentation Portal
Creating a Documentation PortalSteve Anderson
 
7 Apache Process Cloudstack Developer Day
7 Apache Process Cloudstack Developer Day7 Apache Process Cloudstack Developer Day
7 Apache Process Cloudstack Developer DayKimihiko Kitase
 

Semelhante a State-of-the-Art Drupal Search with Apache Solr (20)

Tips for Tuning Solr Search: No Coding Required
Tips for Tuning Solr Search: No Coding RequiredTips for Tuning Solr Search: No Coding Required
Tips for Tuning Solr Search: No Coding Required
 
5 Reasons Your Site Needs Acquia Search
5 Reasons Your Site Needs Acquia Search5 Reasons Your Site Needs Acquia Search
5 Reasons Your Site Needs Acquia Search
 
Plone for Education: Bibliographies
Plone for Education: BibliographiesPlone for Education: Bibliographies
Plone for Education: Bibliographies
 
44 Slides About 22 Modules
44 Slides About 22 Modules44 Slides About 22 Modules
44 Slides About 22 Modules
 
Intro to Apache Solr for Drupal
Intro to Apache Solr for DrupalIntro to Apache Solr for Drupal
Intro to Apache Solr for Drupal
 
DrupalCon 2011 Highlight
DrupalCon 2011 HighlightDrupalCon 2011 Highlight
DrupalCon 2011 Highlight
 
Digital Publishing with the OSCI Toolkit - Workshop MCN 2012
Digital Publishing with the OSCI Toolkit - Workshop MCN 2012Digital Publishing with the OSCI Toolkit - Workshop MCN 2012
Digital Publishing with the OSCI Toolkit - Workshop MCN 2012
 
Wikipedia Cloud Search Webinar
Wikipedia Cloud Search WebinarWikipedia Cloud Search Webinar
Wikipedia Cloud Search Webinar
 
Showcasing drupal
Showcasing drupalShowcasing drupal
Showcasing drupal
 
SolrCloud on Hadoop
SolrCloud on HadoopSolrCloud on Hadoop
SolrCloud on Hadoop
 
Challenges of Simple Documents: When Basic isn't so Basic - Cassandra Targett...
Challenges of Simple Documents: When Basic isn't so Basic - Cassandra Targett...Challenges of Simple Documents: When Basic isn't so Basic - Cassandra Targett...
Challenges of Simple Documents: When Basic isn't so Basic - Cassandra Targett...
 
Evaluating Drupal for the Enterprise
Evaluating Drupal for the EnterpriseEvaluating Drupal for the Enterprise
Evaluating Drupal for the Enterprise
 
Search all the things
Search all the thingsSearch all the things
Search all the things
 
Library Mashups & APIs
Library Mashups & APIsLibrary Mashups & APIs
Library Mashups & APIs
 
Meet Solr For The Tirst Again
Meet Solr For The Tirst AgainMeet Solr For The Tirst Again
Meet Solr For The Tirst Again
 
Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?Why do they call it Linked Data when they want to say...?
Why do they call it Linked Data when they want to say...?
 
Solr + Hadoop: Interactive Search for Hadoop
Solr + Hadoop: Interactive Search for HadoopSolr + Hadoop: Interactive Search for Hadoop
Solr + Hadoop: Interactive Search for Hadoop
 
Creating a Documentation Portal
Creating a Documentation PortalCreating a Documentation Portal
Creating a Documentation Portal
 
Internet content as research data
Internet content as research dataInternet content as research data
Internet content as research data
 
7 Apache Process Cloudstack Developer Day
7 Apache Process Cloudstack Developer Day7 Apache Process Cloudstack Developer Day
7 Apache Process Cloudstack Developer Day
 

Mais de Robert Douglass

DevOps and the Death & Rebirth of Childhood Innocence
DevOps and the Death & Rebirth of Childhood InnocenceDevOps and the Death & Rebirth of Childhood Innocence
DevOps and the Death & Rebirth of Childhood InnocenceRobert Douglass
 
Open Source Music - OHM2013
Open Source Music - OHM2013Open Source Music - OHM2013
Open Source Music - OHM2013Robert Douglass
 
Classical:NEXT - Crowdfunding, with Steven Walter and Robert Douglass
Classical:NEXT - Crowdfunding, with Steven Walter and Robert DouglassClassical:NEXT - Crowdfunding, with Steven Walter and Robert Douglass
Classical:NEXT - Crowdfunding, with Steven Walter and Robert DouglassRobert Douglass
 
Why contributing to Drupal is awesome
Why contributing to Drupal is awesomeWhy contributing to Drupal is awesome
Why contributing to Drupal is awesomeRobert Douglass
 
Sell your code: Announcing the DroopyAppStore
Sell your code: Announcing the DroopyAppStoreSell your code: Announcing the DroopyAppStore
Sell your code: Announcing the DroopyAppStoreRobert Douglass
 
Drupal and Interactive Digital Marketing
Drupal and Interactive Digital MarketingDrupal and Interactive Digital Marketing
Drupal and Interactive Digital MarketingRobert Douglass
 
ApacheSolr presentation from "Do it With Drupal"
ApacheSolr presentation from "Do it With Drupal"ApacheSolr presentation from "Do it With Drupal"
ApacheSolr presentation from "Do it With Drupal"Robert Douglass
 

Mais de Robert Douglass (8)

DevOps and the Death & Rebirth of Childhood Innocence
DevOps and the Death & Rebirth of Childhood InnocenceDevOps and the Death & Rebirth of Childhood Innocence
DevOps and the Death & Rebirth of Childhood Innocence
 
Open Source Music - OHM2013
Open Source Music - OHM2013Open Source Music - OHM2013
Open Source Music - OHM2013
 
Classical:NEXT - Crowdfunding, with Steven Walter and Robert Douglass
Classical:NEXT - Crowdfunding, with Steven Walter and Robert DouglassClassical:NEXT - Crowdfunding, with Steven Walter and Robert Douglass
Classical:NEXT - Crowdfunding, with Steven Walter and Robert Douglass
 
Why contributing to Drupal is awesome
Why contributing to Drupal is awesomeWhy contributing to Drupal is awesome
Why contributing to Drupal is awesome
 
Sell your code: Announcing the DroopyAppStore
Sell your code: Announcing the DroopyAppStoreSell your code: Announcing the DroopyAppStore
Sell your code: Announcing the DroopyAppStore
 
The Business of Drupal
The Business of DrupalThe Business of Drupal
The Business of Drupal
 
Drupal and Interactive Digital Marketing
Drupal and Interactive Digital MarketingDrupal and Interactive Digital Marketing
Drupal and Interactive Digital Marketing
 
ApacheSolr presentation from "Do it With Drupal"
ApacheSolr presentation from "Do it With Drupal"ApacheSolr presentation from "Do it With Drupal"
ApacheSolr presentation from "Do it With Drupal"
 

Último

Landscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfLandscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfAarwolf Industries LLC
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentMahmoud Rabie
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsYoss Cohen
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Jeffrey Haguewood
 
Kuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorialKuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorialJoão Esperancinha
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...itnewsafrica
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 

Último (20)

Landscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfLandscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdf
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career Development
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platforms
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
 
Kuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorialKuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorial
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 

State-of-the-Art Drupal Search with Apache Solr

Notas do Editor

  1. This presentation was given on Sunday, February 7, 2010, in Brussels, at the FOSDEM conference.
  2. Two years ago, at a conference called FOSDEM, the Apache Solr module was introduced. Coincidentally, it was the day I started working for Acquia.
  3. It wasn’t really ready for prime time. In all I’d say it was 20% software and 80% vaporware.
  4. I also had more hair back then, which was not only longer and thicker, but less grey.
  5. A lot has changed.
  6. Since then Acquia has launched a hosted search service based on Apache Solr: http://acquia.com/products-services/acquia-search
  7. It’s been a big successs. Hosted by Acquia, using Amazon cloud architecture for great performance and high availability. If you want to try it all out and get up and running quickly, the Acquia Stack Installer and a free or basic subscription will get you there in around 5 minutes.
  8. http://acquia.com/acquia-search http://acquia.com/downloads
  9. Another important change is the funding and founding of Lucid Imagination. http://www.lucidimagination.com/About
  10. Like Acquia, Lucid Imagination is a venture funded software company based in the Boston area. Like Acquia they have core committers on their team. Like Acquia they’re in the business of providing support and services for their open source project.
  11. They have succeeded in raising the profile and awareness of Solr, and have also accelerated the pace of Solr development. Lucid Imagination has also captured a lot of interest from government clients.
  12. Acquia and Lucid Imagination represent two good reasons to choose Solr. It is a good long-term technology platform decision.
  13. One of the big changes of the past two years has affected all of us quite strongly. For some time, Drupal.org has been running Solr as its main search component.
  14. Around 50% of page requests to Drupal.org involve the Solr server in some way. These include site search, the project listings page, and the issue queue listings.
  15. Here is a search page. Note the opportunities to sort, and filter on facets including content type and author.
  16. Here’s the modules listing page. It’s especially helpful that you can filter on Drupal version type, project type, or do a keyword search that is limited just to modules.
  17. Here’s the issue queue advanced search. This too is powered by Solr.
  18. The advantages of Drupal.org switching to Solr search include a much better search experience. There’s faceting. There’s better relevancy. Better performance. Better scalability.
  19. Of course, one of the exciting announcements from last year was that President Obama’s website, Whitehouse.gov, had switched to Drupal.
  20. I bet you can guess that was quite happy about this.
  21. And quite proud.
  22. Proud of Drupal, and of Acquia,
  23. and of the Obama administration for working to foster openness in government.
  24. Two years ago the idea and concept of faceted search - the ability to easily drill down into search results - was new. Now everybody wants it, it’s become a de facto standard in new projects.
  25. Two years ago I promised we’d get spelling suggestions. We have them, though we’re still learning how to tune and improve them.
  26. I will give you a tip, though: find solr/conf/spellings.txt and add problem words to it. The default file that comes with Solr only has the words “pizza” and “history”.
  27. Someone here could make a contributed module to generate a spellings.txt. You could use taxonomy terms, vocabularies, synonyms, content types, user names, custom input forms, and even online dictionaries to generate the file.
  28. Two years ago I promised more control over tuning search results. Now there are all sorts of boosting and customization options. You can exclude content types from the index. You can boost or reduce the importance of individual fields or HTML elements when searching.
  29. This screen shows how you can use node attributes, like whether a node is promoted to the front page, or is sticky, to influence search rankings.
  30. This screen shows how you can boost or diminish the ranking of individual content types, or exclude content types from being indexed altogether.
  31. And this screen shows how you can use the HTML markup itself to give extra weight to some elements, or diminish the value of others.
  32. Two years ago I promised the ability to do content recommendation with Solr. That is now a reality and it works really well, leading to far lower bounce rates and more time spent on your site
  33. When Dries enabled content recommendation on http://Buytaert.net I spent about two hours re-discovering things that he’d written over the years. Every article had more context and background. You can always find something related and interesting to read.
  34. And with this pending patch, you have even greater control over content recommendation. You can limit the recommendations to certain content types, or certain taxonomy terms, or boost certain words. You can make as many different recommendation blocks as you need.
  35. We now have Views 3, File search, multisite search, comment search, cck date facets, statistics, autocomplete, and the glorious display suite.
  36. Views 3 builds custom solr queries (using the apachesolr_views module). Views 3 then displays query results, with all the Views goodies you’re familiar with (tables, grids, carousels, slideshows etc.) You can build custom search forms with exposed filters, and faceting works as well.
  37. Here you can see an example view that has an exposed keyword filter, a taxonomy facet block, and a table display including a sortable title column. All made using the views user interface.
  38. Read more about it at acquia dot com, node nine one one six six seven. http://acquia.com/node/911667
  39. It’s possible to search for text inside of uploaded files as well. There’s a brand new blog post on Acquia.com about this topic that I published today. http://acquia.com/node/1129446
  40. I searched for “merlinofchaos” and the text was found in a the zipped up tarball of Views3 which I attached to a node using a filefield.
  41. Here are examples of the word “Drupal” being found in a Microsoft Word file and this very Keynote presentation that you’re watching.
  42. Multisite search: sites share an index and can be filtered on the “Site” facet. You can either search across all your sites, or on the site that you’re currently on.
  43. Comment search: Comments get indexed as 1st class citizens and get their own search results. Even solved the problem of linking to a comment on page X. This is only available in the DRUPAL-6--2 branch of ApacheSolr.
  44. CCK Date Facets: Any CCK date field becomes a facet filter. You can drill down - year - month - day - hour.
  45. This is a patch, currently, and needs testing. Please help. http://drupal.org/node/558160
  46. Too few people analyze what’s happening on their site regarding search. When you look at normal analytics you see a lot of incoming links and keyword searches from sites like Google. Do you, however, analyze the keywords that people use on your own site search? This is golden information as they’re telling you exactly what they’re looking for. Do you retrace their steps and look at the search results they see? Is it what you’d expect? Are they finding what they’re looking for?
  47. The statistics module gives you insight to how many searches, search performance (.0001 second average - .543 sec maximum)
  48. And also which search filters are most often being applied.
  49. Here you can see how many searches from a music site are being filtered by genre or instrumentation.
  50. Ready for use. Go try it out.
  51. Autocomplete uses the contents of the index to suggest terms to you, as you type. If you finish one term it will suggest a common second term to go with it.
  52. Just a few days ago the display suite 1.0 was released. This is now the easiest way to customize your search results.
  53. Here, for example, are search results with full teasers and images.
  54. Solr 1.5 is just around the corner. The team at Lucid has set a brisk pace. It will include native geospatial seaerch. It will have an autocomplete request handler. It will have the very exciting eDismax request handler - full lucene syntax and perhaps opportunity for Lucene API module integration. (http://drupal.org/project/luceneapi)
  55. People often want to search non-Drupal sites along with their Drupal sites. This can be done by crawling those sites with Nutch and using the Nutch/Solr integration. Acquia support for multisite, file and geospatial search are on the way. (In the presentation I asked a show of hands for which of these three features would be most popular: People responded 1 - File, 2 - Multisite, and 3 - Geospatial in that order.
  56. There are urgent needs and immediate opportunities to help. #1 on the list is to assist with the Drupal.org redesign and relaunch. http://drupal.org/node/704062 Much of the new site is driven by Solr #2 is the glaring absence of test suites. This slows development due to regressions. #3 is the upgrade to Drupal 7.
  57. Like any two year long party, there’s a bit of of housecleaning that now has to be done. Better APIs, more abstraction, better documentation.