SlideShare a Scribd company logo
1 of 58
Download to read offline
Searching with Solr
            AN INTRODUCTION




           Tyler Harms
           Developer



           @harmstyler
           tyler@blendinteractive.com




                                        1


Saturday, November 10, 12
Why Implement Solr?
            SEARCHING WITH SOLR




           • Does your site need search?
           • Is google enough?

           • Do you need/want to control rankings?

           • Just text, or Structured Data?




                                                     2


Saturday, November 10, 12
What is Solr?
            SEARCHING WITH SOLR




                  Solr is a standalone enterprise
                  search server with a REST-like API.
                  You put documents in it [...] over
                  HTTP. You query it via HTTP GET
                  and receive [...] results.

                                                        3


Saturday, November 10, 12
4


Saturday, November 10, 12
Solr Versions
            SEARCHING WITH SOLR




           •   Current Version(s)
               •   Solr 3.6.1
               •   Solr 4
           •   Released Versions are always stable




                                                     5


Saturday, November 10, 12
$ wget http://(...)/3.6.1/apache-solr-3.6.1.tgz
                            $ tar -xzf apache-solr-3.6.1.tgz
                            $ cd apache-solr-3.6.1/example/
                            $ java -jar start.jar
                            (a lot of java log...)




                                                                              6


Saturday, November 10, 12
Search Alternatives
            SEARCHING WITH SOLR




           • Google
           • Lucene

           • elasticsearch

           • Whoosh

           • Xapien

           • Many Others




                                  7


Saturday, November 10, 12
NOT a Database Replacement
            SEARCHING WITH SOLR




           •   Solr is designed to live alongside your website as a separate
               web app




                                                                               8


Saturday, November 10, 12
Frontend
                   Database Master
                                                Servers[1..n]

                  Database Slaves[0..n]



                             Solr Master

                            Solr Slaves[0..n]



                                                                10
                                                                 9


Saturday, November 10, 12
Scaling Solr
            SEARCHING WITH SOLR




           •   Master/Slave Architecture
               •   Write to master -> Read from slaves
           •   Multicore Setup
               •   Multiple Solr ‘cores’ running alongside each other within the same install




                                                                                                10


Saturday, November 10, 12
Solr’s Data Model
            SEARCHING WITH
            SUB HEADLINE SOLR




           • Solr maintains a collection of documents
           • A document is a collection of fields and values

           • A field can occur multiple times in a doc

           • Documents are immutable

               •   They can be deleted and replaced by new versions, however.




                                                                                11


Saturday, November 10, 12
Querying
            SEARCHING WITH
            SUB HEADLINE SOLR




           • http request
           • http://localhost:8983/solr/select?q=blend&start=0&rows=10




                                                                         12


Saturday, November 10, 12
Solr Query Syntax
            SEARCHING WITH
            SUB HEADLINE SOLR




           • blend (value)
           • company:blend (field:value)

           • title:”Searching with Solr” AND text:apache

           • id:[* TO *]

           • *:* (all fields : all values)




                                                           13


Saturday, November 10, 12
Using Solr
            SEARCHING WITH
            SUB HEADLINE SOLR




           • Getting Data into Solr
           • Getting Data out of Solr




                                        14


Saturday, November 10, 12
Getting Data into Solr
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   POST it
                        <add>
                         <doc>
                           <field name="abstract">Lorem ipsum</field>
                           <field name="company">Blend Interactive</field>
                           <field name="text">Lorem Ipsum</field>
                           <field name="title">Some Title</field>
                         </doc>
                         [<doc> ... </doc>[<doc> ... </doc>]]
                        </add>

                                                                             15


Saturday, November 10, 12
Getting Data into Solr
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   POST it
                        <add>
                         <doc>
                           <field name="abstract">Lorem ipsum</field>
                           <field name="company">Blend Interactive</field>
                           <field name="text">Lorem Ipsum</field>
                           <field name="title">Some Title</field>
                         </doc>
                         [<doc> ... </doc>[<doc> ... </doc>]]
                        </add>

                                                                             16


Saturday, November 10, 12
Getting Data into Solr
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   POST it
                        <add>
                         <doc>
                           <field name="abstract">Lorem ipsum</field>
                           <field name="company">Blend Interactive</field>
                           <field name="text">Lorem Ipsum</field>
                           <field name="title">Some Title</field>
                         </doc>
                         [<doc> ... </doc>[<doc> ... </doc>]]
                        </add>

                                                                             17


Saturday, November 10, 12
Commiting
            SEARCHING WITH
            SUB HEADLINE SOLR




           • Nothing shows up in the index until you commit
           • You can just POST <commit/> to:

               •   http://<host>:<port>/solr/update




                                                              18


Saturday, November 10, 12
Getting Data out of Solr
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   http://localhost:8983/solr/select/?q=solr




                                                           19


Saturday, November 10, 12
<response>
                  <lst name="responseHeader">
                      <int name="status">0</int>
                      <int name="QTime">19</int>
                      <lst name="params">
                          <str name="q">solr</str>
                      </lst>
                  </lst>
                  <result name="response" numFound="1" start="0">
                      <doc>
                          <str name="abstract">
                          A brief introduction to using Apache Solr for implementing search for your website.
                          </str>
                          <str name="django_ct">codecamp.session</str>
                          <str name="django_id">19</str>
                          <str name="id">codecamp.session.19</str>
                          <str name="text">
                          Searching with Solr: An Introduction A brief introduction to using Apache Solr for
                          implementing search for your website.
                          </str>
                          <str name="title">Searching with Solr: An Introduction</str>
                      </doc>
                  </result>
               </response>

                                                                                                                20


Saturday, November 10, 12
<response>
                  <lst name="responseHeader">
                      <int name="status">0</int>
                      <int name="QTime">19</int>
                      <lst name="params">
                          <str name="q">solr</str>
                      </lst>
                  </lst>
                  <result name="response" numFound="1" start="0">
                      <doc>
                          <str name="abstract">
                          A brief introduction to using Apache Solr for implementing search for your website.
                          </str>
                          <str name="django_ct">codecamp.session</str>
                          <str name="django_id">19</str>
                          <str name="id">codecamp.session.19</str>
                          <str name="text">
                          Searching with Solr: An Introduction A brief introduction to using Apache Solr for
                          implementing search for your website.
                          </str>
                          <str name="title">Searching with Solr: An Introduction</str>
                      </doc>
                  </result>
               </response>

                                                                                                                21


Saturday, November 10, 12
<response>
                  <lst name="responseHeader">
                      <int name="status">0</int>
                      <int name="QTime">19</int>
                      <lst name="params">
                          <str name="q">solr</str>
                      </lst>
                  </lst>
                  <result name="response" numFound="1" start="0">
                      <doc>
                          <str name="abstract">
                          A brief introduction to using Apache Solr for implementing search for your website.
                          </str>
                          <str name="django_ct">codecamp.session</str>
                          <str name="django_id">19</str>
                          <str name="id">codecamp.session.19</str>
                          <str name="text">
                          Searching with Solr: An Introduction A brief introduction to using Apache Solr for
                          implementing search for your website.
                          </str>
                          <str name="title">Searching with Solr: An Introduction</str>
                      </doc>
                  </result>
               </response>

                                                                                                                22


Saturday, November 10, 12
Getting Data out of Solr: JSON
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   http://localhost:8983/solr/select/?q=solr&wt=json




                                                                   23


Saturday, November 10, 12
{
                    "responseHeader": {
                        "status":0,
                        "QTime":0,
                        "params": {
                            "wt":"json",
                            "q":"solr"
                        }
                    },
                    "response": {
                        "numFound":1,
                        "start":0,
                        "docs":[{
                            "django_id":"19",
                            "title":"Searching with Solr: An Introduction",
                            "text":"Searching with Solr: An IntroductionnA brief introduction to using Apache Solr for
                            implementing search for your website.",
                            "abstract":"A brief introduction to using Apache Solr for implementing search for your
                            website.",
                            "django_ct":"codecamp.session","id":"codecamp.session.19"
                        }]
                    }
               }


                                                                                                                          24


Saturday, November 10, 12
Deleting Data from Solr
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   POST it

          <delete><id>codecamp.session.19</id></delete>
          <delete><query>company:blend</query></delete>




                                                          25


Saturday, November 10, 12
The Solr Schema
            SEARCHING WITH SOLR




           •   schema.xml
               •   Defines ‘types’ used in the webapp
               •   Defines the fields
               •   Defines ‘copyfields’
               •   Read the schema inside the example project for more




                                                                         26


Saturday, November 10, 12
The Solr Schema
            SEARCHING WITH SOLR




           •   Types
               •   Define how a field and query should be processed
                   •   Word Stemming
                   •   Case Folding
                   •   How would you handle a search for ‘C.I.A.’?
               •   Dates, ints, floats, etc.. are defined here as well
               •   2 Modes
                   •   Index Time
                   •   Query Time


                                                                         27


Saturday, November 10, 12
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
              <analyzer type="index">
                 <tokenizer class="solr.WhitespaceTokenizerFactory"/>
                 <filter class="solr.StopFilterFactory" ignoreCase="true"
                 words="stopwords.txt"/>
              </analyzer>
              <analyzer type="query">
                 <tokenizer class="solr.WhitespaceTokenizerFactory"/>
                 <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
                 ignoreCase="true" expand="true"/>
                 <filter class="solr.StopFilterFactory" ignoreCase="true"
                 words="stopwords.txt"/>
              </analyzer>
           </fieldType>


                                                                                       28


Saturday, November 10, 12
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
              <analyzer type="index">
                 <tokenizer class="solr.WhitespaceTokenizerFactory"/>
                 <filter class="solr.StopFilterFactory" ignoreCase="true"
                 words="stopwords.txt"/>
              </analyzer>
              <analyzer type="query">
                 <tokenizer class="solr.WhitespaceTokenizerFactory"/>
                 <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
                 ignoreCase="true" expand="true"/>
                 <filter class="solr.StopFilterFactory" ignoreCase="true"
                 words="stopwords.txt"/>
              </analyzer>
           </fieldType>


                                                                                       29


Saturday, November 10, 12
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
              <analyzer type="index">
                 <tokenizer class="solr.WhitespaceTokenizerFactory"/>
                 <filter class="solr.StopFilterFactory" ignoreCase="true"
                 words="stopwords.txt"/>
              </analyzer>
              <analyzer type="query">
                 <tokenizer class="solr.WhitespaceTokenizerFactory"/>
                 <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
                 ignoreCase="true" expand="true"/>
                 <filter class="solr.StopFilterFactory" ignoreCase="true"
                 words="stopwords.txt"/>
              </analyzer>
           </fieldType>


                                                                                       30


Saturday, November 10, 12
Fields
            SEARCHING WITH SOLR




           •   The elements of a document
               •   Both Predefined and Dynamic
               •   Fields may occur multiple times
               •   May be indexed and/or stored




                                                     31


Saturday, November 10, 12
<fields>
   <!-- general -->
   <field name="id" type="string" indexed="true" stored="true" multiValued="false" required="true"/>
   <field name="django_ct" type="string" indexed="true" stored="true" multiValued="false" />
   <field name="django_id" type="string" indexed="true" stored="true" multiValued="false" />
   <!-- dynamic -->
   <dynamicField name="*_i" type="sint" indexed="true" stored="true"/>
   <dynamicField name="*_s" type="string" indexed="true" stored="true"/>
   <dynamicField name="*_l" type="slong" indexed="true" stored="true"/>
   <dynamicField name="*_t" type="text" indexed="true" stored="true"/>
   <dynamicField name="*_b" type="boolean" indexed="true" stored="true"/>
   <dynamicField name="*_f" type="sfloat" indexed="true" stored="true"/>
   <dynamicField name="*_d" type="sdouble" indexed="true" stored="true"/>
   <dynamicField name="*_dt" type="date" indexed="true" stored="true"/>
   <!-- app -->
   <field name="bio" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="title" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="text" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="abstract" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="full_name" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="company" type="text" indexed="true" stored="true" multiValued="false" />
</fields>                                                                                      32


Saturday, November 10, 12
<fields>
   <!-- general -->
   <field name="id" type="string" indexed="true" stored="true" multiValued="false" required="true"/>
   <field name="django_ct" type="string" indexed="true" stored="true" multiValued="false" />
   <field name="django_id" type="string" indexed="true" stored="true" multiValued="false" />
   <!-- dynamic -->
   <dynamicField name="*_i" type="sint" indexed="true" stored="true"/>
   <dynamicField name="*_s" type="string" indexed="true" stored="true"/>
   <dynamicField name="*_l" type="slong" indexed="true" stored="true"/>
   <dynamicField name="*_t" type="text" indexed="true" stored="true"/>
   <dynamicField name="*_b" type="boolean" indexed="true" stored="true"/>
   <dynamicField name="*_f" type="sfloat" indexed="true" stored="true"/>
   <dynamicField name="*_d" type="sdouble" indexed="true" stored="true"/>
   <dynamicField name="*_dt" type="date" indexed="true" stored="true"/>
   <!-- app -->
   <field name="bio" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="title" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="text" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="abstract" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="full_name" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="company" type="text" indexed="true" stored="true" multiValued="false" />
</fields>                                                                                      33


Saturday, November 10, 12
<fields>
   <!-- general -->
   <field name="id" type="string" indexed="true" stored="true" multiValued="false" required="true"/>
   <field name="django_ct" type="string" indexed="true" stored="true" multiValued="false" />
   <field name="django_id" type="string" indexed="true" stored="true" multiValued="false" />
   <!-- dynamic -->
   <dynamicField name="*_i" type="sint" indexed="true" stored="true"/>
   <dynamicField name="*_s" type="string" indexed="true" stored="true"/>
   <dynamicField name="*_l" type="slong" indexed="true" stored="true"/>
   <dynamicField name="*_t" type="text" indexed="true" stored="true"/>
   <dynamicField name="*_b" type="boolean" indexed="true" stored="true"/>
   <dynamicField name="*_f" type="sfloat" indexed="true" stored="true"/>
   <dynamicField name="*_d" type="sdouble" indexed="true" stored="true"/>
   <dynamicField name="*_dt" type="date" indexed="true" stored="true"/>
   <!-- app -->
   <field name="bio" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="title" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="text" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="abstract" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="full_name" type="text" indexed="true" stored="true" multiValued="false" />
   <field name="company" type="text" indexed="true" stored="true" multiValued="false" />
</fields>                                                                                      34


Saturday, November 10, 12
Copy Fields
            SEARCHING WITH SOLR




           •   Two Main Uses
               •   Analyze fields in different ways
               •   Concatenate Fields




                                                      35


Saturday, November 10, 12
<copyField source="bio" dest="df_text" />
                      <copyField source="year" dest="century" maxChars="2"/>




                                                                               36


Saturday, November 10, 12
<copyField source="bio" dest="df_text" />
                      <copyField source="year" dest="century" maxChars="2"/>




                                                                               37


Saturday, November 10, 12
<copyField source="bio" dest="df_text" />
                      <copyField source="year" dest="century" maxChars="2"/>

                                        2000 would be stored as 20
                                        Useful for custom faceting




                                                                               38


Saturday, November 10, 12
The Solr Config File
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   solrconfig.xml
               •   Defines request handlers, defaults, & caches
               •   Read the solrconfig.xml inside the example project for more




                                                                                 39


Saturday, November 10, 12
Other Solr Tools
            SEARCHING WITH
            SUB HEADLINE SOLR




           • Debug Query
           • Boost Functions

           • Search Faceting

           • Search Filters

           • Search Highlighting

           • Solr Admin




                                   40


Saturday, November 10, 12
Debug Query Option
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   Add &debugQuery=on to request parameters
           •   Returns a parsed form of the query




                                                          41


Saturday, November 10, 12
<lst name="debug">
              <str name="rawquerystring">solr</str>
              <str name="querystring">solr</str>
              <str name="parsedquery">text:solr</str>
              <str name="parsedquery_toString">text:solr</str>
              <lst name="explain">
                 <str name="codecamp.session.19">
                 1.2147729 = (MATCH) fieldWeight(text:solr in 17), product of: 1.4142135 =
                 tf(termFreq(text:solr)=2) 3.9267395 = idf(docFreq=2, maxDocs=56)
                 0.21875 = fieldNorm(field=text, doc=17)
                 </str>
              </lst>




                                                                                        42


Saturday, November 10, 12
<lst name="debug">
              <str name="rawquerystring">solr</str>
              <str name="querystring">solr</str>
              <str name="parsedquery">text:solr</str>
              <str name="parsedquery_toString">text:solr</str>
              <lst name="explain">
                 <str name="codecamp.session.19">
                 1.2147729 = (MATCH) fieldWeight(text:solr in 17), product of: 1.4142135 =
                 tf(termFreq(text:solr)=2) 3.9267395 = idf(docFreq=2, maxDocs=56)
                 0.21875 = fieldNorm(field=text, doc=17)
                 </str>
              </lst>




                                                                                        43


Saturday, November 10, 12
Boost Function
            SEARCHING WITH
            SUB HEADLINE SOLR




           • Allows you to influence results at query time
           • Really useful for tuning scoring

           • You can also boost at index time




                                                             44


Saturday, November 10, 12
Boost Function
            SEARCHING WITH
            SUB HEADLINE SOLR




           • Allows you to influence results at query time
           • Really useful for tuning scoring

           • You can also boost at index time




                                q=blend&qf=text^2 company



                                                             45


Saturday, November 10, 12
Boost Function
            SEARCHING WITH
            SUB HEADLINE SOLR




           • Allows you to influence results at query time
                                                             More information available -

           • Really useful for tuning scoring
                                                             http://wiki.apache.org/solr/
                                                             SolrRelevancyFAQ

                                                             Can use both dismax and

           • You can also boost at index time
                                                             standard query handlers, I use
                                                             dismax




                                q=blend&qf=text^2 company



                                                                                              46


Saturday, November 10, 12
Boost Function
            SEARCHING WITH
            SUB HEADLINE SOLR




           • Allows you to influence results at query time
                                                             More information available -

           • Really useful for tuning scoring
                                                             http://wiki.apache.org/solr/
                                                             SolrRelevancyFAQ

                                                             Can use both dismax and

           • You can also boost at index time
                                                             standard query handlers, I use
                                                             dismax




                                 &bq=text:blend^2



                                                                                              47


Saturday, November 10, 12
Solr Faceting
            SEARCHING WITH
            SUB HEADLINE SOLR




           • What is a facet?
           • “Interaction style where users filter a set of items by

             progressively selecting from only valid values of a  faceted
             classification system” - Keith Instone, SOASIS&T, July 8, 2004
           • What does it look like?

           • Make sure to use an untokenized field (e.g. string)

               •   “San Jose” != “san”+“jose”



                                                                              48


Saturday, November 10, 12
q=*:*
           facet=on
           facet.field=company




                                 49


Saturday, November 10, 12
Solr Filter Query
            SEARCHING WITH
            SUB HEADLINE SOLR




           • Used to narrow your search query
             • Restrict the super set of documents that can be returned

           • ‘fq’ parameter (short for Filter Query)




                                                                          50


Saturday, November 10, 12
Solr Filter Query
            SEARCHING WITH
            SUB HEADLINE SOLR




           • Used to narrow your search query
             • Restrict the super set of documents that can be returned

           • ‘fq’ parameter (short for Filter Query)




                                q=*:*
                                fq=company:blend


                                                                          51


Saturday, November 10, 12
Search Highlighting
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   Allow Solr to generate your highlight




                                                       52


Saturday, November 10, 12
Search Highlighting
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   Allow Solr to generate your highlight




                                                       53


Saturday, November 10, 12
hl=true
                            hl.simple.pre=<b>
                            hl.simple.post=</b>
                            hl.fragsize=200
                            hl.requireFieldMatch=false
                            hl.fl=text bio title
                            hl.snippets=1

                                                         54


Saturday, November 10, 12
Solr Admin
            SEARCHING WITH
            SUB HEADLINE SOLR




           • http://localhost:8983/solr/admin/
           • Built in app for testing all search options

               •   Field Analysis
               •   Schema Browser
               •   Full Query Interface
               •   Solr Statistics
               •   Solr Information
               •   Many More Options


                                                           55


Saturday, November 10, 12
Solr/Browse
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   Test your search configuration using the /browse
               requestHandler




                                                                  56


Saturday, November 10, 12
Resources
            SEARCHING WITH
            SUB HEADLINE SOLR




           •   Apache Solr Website
               •   http://lucene.apache.org/solr/
               •   Wiki, mailing list, bugs/features
           •   Books




                                                       57


Saturday, November 10, 12
58


Saturday, November 10, 12

More Related Content

What's hot

Couchdbkit djangocong-20100425
Couchdbkit djangocong-20100425Couchdbkit djangocong-20100425
Couchdbkit djangocong-20100425guest4f2eea
 
Cassandra 3.0 - JSON at scale - StampedeCon 2015
Cassandra 3.0 - JSON at scale - StampedeCon 2015Cassandra 3.0 - JSON at scale - StampedeCon 2015
Cassandra 3.0 - JSON at scale - StampedeCon 2015StampedeCon
 
Optimize Is (Not) Bad For You - Rafał Kuć, Sematext Group, Inc.
Optimize Is (Not) Bad For You - Rafał Kuć, Sematext Group, Inc.Optimize Is (Not) Bad For You - Rafał Kuć, Sematext Group, Inc.
Optimize Is (Not) Bad For You - Rafał Kuć, Sematext Group, Inc.Lucidworks
 
Scrap your query boilerplate with specql
Scrap your query boilerplate with specqlScrap your query boilerplate with specql
Scrap your query boilerplate with specqlTatu Tarvainen
 
Improving DSpace Backups, Restores & Migrations
Improving DSpace Backups, Restores & MigrationsImproving DSpace Backups, Restores & Migrations
Improving DSpace Backups, Restores & MigrationsTim Donohue
 
[WLDN] Supercharging word press development in 2018
[WLDN] Supercharging word press development in 2018[WLDN] Supercharging word press development in 2018
[WLDN] Supercharging word press development in 2018Adam Tomat
 
개발자를 위한 Amazon Lightsail Deep-Dive
개발자를 위한 Amazon Lightsail Deep-Dive개발자를 위한 Amazon Lightsail Deep-Dive
개발자를 위한 Amazon Lightsail Deep-Dive창훈 정
 
MySQL Without the MySQL -- Oh My!
MySQL Without the MySQL -- Oh My!MySQL Without the MySQL -- Oh My!
MySQL Without the MySQL -- Oh My!Dave Stokes
 
Rapid prototyping search applications with solr
Rapid prototyping search applications with solrRapid prototyping search applications with solr
Rapid prototyping search applications with solrLucidworks (Archived)
 
Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009,...
Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009,...Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009,...
Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009,...Erik Hatcher
 
Schemaless Solr and the Solr Schema REST API
Schemaless Solr and the Solr Schema REST APISchemaless Solr and the Solr Schema REST API
Schemaless Solr and the Solr Schema REST APIlucenerevolution
 

What's hot (12)

Couchdbkit djangocong-20100425
Couchdbkit djangocong-20100425Couchdbkit djangocong-20100425
Couchdbkit djangocong-20100425
 
Solr Masterclass Bangkok, June 2014
Solr Masterclass Bangkok, June 2014Solr Masterclass Bangkok, June 2014
Solr Masterclass Bangkok, June 2014
 
Cassandra 3.0 - JSON at scale - StampedeCon 2015
Cassandra 3.0 - JSON at scale - StampedeCon 2015Cassandra 3.0 - JSON at scale - StampedeCon 2015
Cassandra 3.0 - JSON at scale - StampedeCon 2015
 
Optimize Is (Not) Bad For You - Rafał Kuć, Sematext Group, Inc.
Optimize Is (Not) Bad For You - Rafał Kuć, Sematext Group, Inc.Optimize Is (Not) Bad For You - Rafał Kuć, Sematext Group, Inc.
Optimize Is (Not) Bad For You - Rafał Kuć, Sematext Group, Inc.
 
Scrap your query boilerplate with specql
Scrap your query boilerplate with specqlScrap your query boilerplate with specql
Scrap your query boilerplate with specql
 
Improving DSpace Backups, Restores & Migrations
Improving DSpace Backups, Restores & MigrationsImproving DSpace Backups, Restores & Migrations
Improving DSpace Backups, Restores & Migrations
 
[WLDN] Supercharging word press development in 2018
[WLDN] Supercharging word press development in 2018[WLDN] Supercharging word press development in 2018
[WLDN] Supercharging word press development in 2018
 
개발자를 위한 Amazon Lightsail Deep-Dive
개발자를 위한 Amazon Lightsail Deep-Dive개발자를 위한 Amazon Lightsail Deep-Dive
개발자를 위한 Amazon Lightsail Deep-Dive
 
MySQL Without the MySQL -- Oh My!
MySQL Without the MySQL -- Oh My!MySQL Without the MySQL -- Oh My!
MySQL Without the MySQL -- Oh My!
 
Rapid prototyping search applications with solr
Rapid prototyping search applications with solrRapid prototyping search applications with solr
Rapid prototyping search applications with solr
 
Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009,...
Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009,...Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009,...
Solr Flair: Search User Interfaces Powered by Apache Solr (ApacheCon US 2009,...
 
Schemaless Solr and the Solr Schema REST API
Schemaless Solr and the Solr Schema REST APISchemaless Solr and the Solr Schema REST API
Schemaless Solr and the Solr Schema REST API
 

Viewers also liked

Getting started with apache solr
Getting started with apache solrGetting started with apache solr
Getting started with apache solrHumayun Kabir
 
Getting Started with Solr
Getting Started with SolrGetting Started with Solr
Getting Started with SolrTravis Carlson
 
Writing custom queries scorers’ diversity and traps (lucene internals)(2)
Writing custom queries  scorers’ diversity and traps (lucene internals)(2)Writing custom queries  scorers’ diversity and traps (lucene internals)(2)
Writing custom queries scorers’ diversity and traps (lucene internals)(2)lucenerevolution
 
Andrzej bialecki lr-2013-dublin
Andrzej bialecki lr-2013-dublinAndrzej bialecki lr-2013-dublin
Andrzej bialecki lr-2013-dublinlucenerevolution
 
Introduction to Lucene and Solr - 1
Introduction to Lucene and Solr - 1Introduction to Lucene and Solr - 1
Introduction to Lucene and Solr - 1YI-CHING WU
 
เกี่ยวกับ Apache solr 4.0
เกี่ยวกับ Apache solr 4.0เกี่ยวกับ Apache solr 4.0
เกี่ยวกับ Apache solr 4.0Somkiat Puisungnoen
 
Apache Solr - search for everyone!
Apache Solr - search for everyone!Apache Solr - search for everyone!
Apache Solr - search for everyone!Jaran Flaath
 
NoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache HadoopNoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache HadoopDmitry Kan
 
Solr: Search at the Speed of Light
Solr: Search at the Speed of LightSolr: Search at the Speed of Light
Solr: Search at the Speed of LightErik Hatcher
 

Viewers also liked (9)

Getting started with apache solr
Getting started with apache solrGetting started with apache solr
Getting started with apache solr
 
Getting Started with Solr
Getting Started with SolrGetting Started with Solr
Getting Started with Solr
 
Writing custom queries scorers’ diversity and traps (lucene internals)(2)
Writing custom queries  scorers’ diversity and traps (lucene internals)(2)Writing custom queries  scorers’ diversity and traps (lucene internals)(2)
Writing custom queries scorers’ diversity and traps (lucene internals)(2)
 
Andrzej bialecki lr-2013-dublin
Andrzej bialecki lr-2013-dublinAndrzej bialecki lr-2013-dublin
Andrzej bialecki lr-2013-dublin
 
Introduction to Lucene and Solr - 1
Introduction to Lucene and Solr - 1Introduction to Lucene and Solr - 1
Introduction to Lucene and Solr - 1
 
เกี่ยวกับ Apache solr 4.0
เกี่ยวกับ Apache solr 4.0เกี่ยวกับ Apache solr 4.0
เกี่ยวกับ Apache solr 4.0
 
Apache Solr - search for everyone!
Apache Solr - search for everyone!Apache Solr - search for everyone!
Apache Solr - search for everyone!
 
NoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache HadoopNoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache Hadoop
 
Solr: Search at the Speed of Light
Solr: Search at the Speed of LightSolr: Search at the Speed of Light
Solr: Search at the Speed of Light
 

Similar to "Searching with Solr" - Tyler Harms, South Dakota Code Camp 2012

Responsive Facets with Apache Solr
Responsive Facets with Apache SolrResponsive Facets with Apache Solr
Responsive Facets with Apache SolrBrent Lemons
 
Solr introduction
Solr introductionSolr introduction
Solr introductionLap Tran
 
Scaling search in Oak with Solr
Scaling search in Oak with Solr Scaling search in Oak with Solr
Scaling search in Oak with Solr Tommaso Teofili
 
Building Enterprise Search Engines using Open Source Technologies
Building Enterprise Search Engines using Open Source TechnologiesBuilding Enterprise Search Engines using Open Source Technologies
Building Enterprise Search Engines using Open Source TechnologiesRahul Singh
 
Building Enterprise Search Engines using Open Source Technologies
Building Enterprise Search Engines using Open Source TechnologiesBuilding Enterprise Search Engines using Open Source Technologies
Building Enterprise Search Engines using Open Source TechnologiesAnant Corporation
 
Solr search engine with multiple table relation
Solr search engine with multiple table relationSolr search engine with multiple table relation
Solr search engine with multiple table relationJay Bharat
 
Meet Solr For The Tirst Again
Meet Solr For The Tirst AgainMeet Solr For The Tirst Again
Meet Solr For The Tirst AgainVarun Thacker
 
Hortonworks Technical Workshop - HDP Search
Hortonworks Technical Workshop - HDP Search Hortonworks Technical Workshop - HDP Search
Hortonworks Technical Workshop - HDP Search Hortonworks
 
Rails and the Apache SOLR Search Engine
Rails and the Apache SOLR Search EngineRails and the Apache SOLR Search Engine
Rails and the Apache SOLR Search EngineDavid Keener
 
Ease of use in Apache Solr
Ease of use in Apache SolrEase of use in Apache Solr
Ease of use in Apache SolrAnshum Gupta
 
Rapid prototyping with solr - By Erik Hatcher
Rapid prototyping with solr -  By Erik Hatcher Rapid prototyping with solr -  By Erik Hatcher
Rapid prototyping with solr - By Erik Hatcher lucenerevolution
 
Making your Drupal fly with Apache SOLR
Making your Drupal fly with Apache SOLRMaking your Drupal fly with Apache SOLR
Making your Drupal fly with Apache SOLRExove
 
JClouds at San Francisco Java User Group
JClouds at San Francisco Java User GroupJClouds at San Francisco Java User Group
JClouds at San Francisco Java User GroupMarakana Inc.
 
Introduction to Node.js: perspectives from a Drupal dev
Introduction to Node.js: perspectives from a Drupal devIntroduction to Node.js: perspectives from a Drupal dev
Introduction to Node.js: perspectives from a Drupal devmcantelon
 
Devops kc meetup_5_20_2013
Devops kc meetup_5_20_2013Devops kc meetup_5_20_2013
Devops kc meetup_5_20_2013Aaron Blythe
 
Rails ORM De-mystifying Active Record has_many
Rails ORM De-mystifying Active Record has_manyRails ORM De-mystifying Active Record has_many
Rails ORM De-mystifying Active Record has_manyBlazing Cloud
 
eZ Find workshop: advanced insights & recipes
eZ Find workshop: advanced insights & recipeseZ Find workshop: advanced insights & recipes
eZ Find workshop: advanced insights & recipesPaul Borgermans
 
Solr as your search and suggest engine karan nangru
Solr as your search and suggest engine   karan nangruSolr as your search and suggest engine   karan nangru
Solr as your search and suggest engine karan nangruIndicThreads
 

Similar to "Searching with Solr" - Tyler Harms, South Dakota Code Camp 2012 (20)

Responsive Facets with Apache Solr
Responsive Facets with Apache SolrResponsive Facets with Apache Solr
Responsive Facets with Apache Solr
 
Solr introduction
Solr introductionSolr introduction
Solr introduction
 
Scaling search in Oak with Solr
Scaling search in Oak with Solr Scaling search in Oak with Solr
Scaling search in Oak with Solr
 
Building Enterprise Search Engines using Open Source Technologies
Building Enterprise Search Engines using Open Source TechnologiesBuilding Enterprise Search Engines using Open Source Technologies
Building Enterprise Search Engines using Open Source Technologies
 
Building Enterprise Search Engines using Open Source Technologies
Building Enterprise Search Engines using Open Source TechnologiesBuilding Enterprise Search Engines using Open Source Technologies
Building Enterprise Search Engines using Open Source Technologies
 
Solr search engine with multiple table relation
Solr search engine with multiple table relationSolr search engine with multiple table relation
Solr search engine with multiple table relation
 
Meet Solr For The Tirst Again
Meet Solr For The Tirst AgainMeet Solr For The Tirst Again
Meet Solr For The Tirst Again
 
Hortonworks Technical Workshop - HDP Search
Hortonworks Technical Workshop - HDP Search Hortonworks Technical Workshop - HDP Search
Hortonworks Technical Workshop - HDP Search
 
Rails and the Apache SOLR Search Engine
Rails and the Apache SOLR Search EngineRails and the Apache SOLR Search Engine
Rails and the Apache SOLR Search Engine
 
Ease of use in Apache Solr
Ease of use in Apache SolrEase of use in Apache Solr
Ease of use in Apache Solr
 
Rapid Prototyping with Solr
Rapid Prototyping with SolrRapid Prototyping with Solr
Rapid Prototyping with Solr
 
Rapid prototyping with solr - By Erik Hatcher
Rapid prototyping with solr -  By Erik Hatcher Rapid prototyping with solr -  By Erik Hatcher
Rapid prototyping with solr - By Erik Hatcher
 
Making your Drupal fly with Apache SOLR
Making your Drupal fly with Apache SOLRMaking your Drupal fly with Apache SOLR
Making your Drupal fly with Apache SOLR
 
JClouds at San Francisco Java User Group
JClouds at San Francisco Java User GroupJClouds at San Francisco Java User Group
JClouds at San Francisco Java User Group
 
Introduction to Node.js: perspectives from a Drupal dev
Introduction to Node.js: perspectives from a Drupal devIntroduction to Node.js: perspectives from a Drupal dev
Introduction to Node.js: perspectives from a Drupal dev
 
Devops kc meetup_5_20_2013
Devops kc meetup_5_20_2013Devops kc meetup_5_20_2013
Devops kc meetup_5_20_2013
 
Rails ORM De-mystifying Active Record has_many
Rails ORM De-mystifying Active Record has_manyRails ORM De-mystifying Active Record has_many
Rails ORM De-mystifying Active Record has_many
 
Building Search Engines
Building Search EnginesBuilding Search Engines
Building Search Engines
 
eZ Find workshop: advanced insights & recipes
eZ Find workshop: advanced insights & recipeseZ Find workshop: advanced insights & recipes
eZ Find workshop: advanced insights & recipes
 
Solr as your search and suggest engine karan nangru
Solr as your search and suggest engine   karan nangruSolr as your search and suggest engine   karan nangru
Solr as your search and suggest engine karan nangru
 

More from Blend Interactive

"The Accessible Editor Workshop" by Corey Vilhauer, from Now What? Workshops ...
"The Accessible Editor Workshop" by Corey Vilhauer, from Now What? Workshops ..."The Accessible Editor Workshop" by Corey Vilhauer, from Now What? Workshops ...
"The Accessible Editor Workshop" by Corey Vilhauer, from Now What? Workshops ...Blend Interactive
 
"The Accessible Editor" by Corey Vilhauer, from DrupalCon 2018 in Nashville, ...
"The Accessible Editor" by Corey Vilhauer, from DrupalCon 2018 in Nashville, ..."The Accessible Editor" by Corey Vilhauer, from DrupalCon 2018 in Nashville, ...
"The Accessible Editor" by Corey Vilhauer, from DrupalCon 2018 in Nashville, ...Blend Interactive
 
"Click to Continue" by Sam Otis, from Content+Design Meetup, Oct. 4, 2017
"Click to Continue" by Sam Otis, from Content+Design Meetup, Oct. 4, 2017"Click to Continue" by Sam Otis, from Content+Design Meetup, Oct. 4, 2017
"Click to Continue" by Sam Otis, from Content+Design Meetup, Oct. 4, 2017Blend Interactive
 
"Never Knowing Enough: dealing with the self doubt that hinders your success....
"Never Knowing Enough: dealing with the self doubt that hinders your success...."Never Knowing Enough: dealing with the self doubt that hinders your success....
"Never Knowing Enough: dealing with the self doubt that hinders your success....Blend Interactive
 
"Making things real: Content strategy for realistic content management" - Con...
"Making things real: Content strategy for realistic content management" - Con..."Making things real: Content strategy for realistic content management" - Con...
"Making things real: Content strategy for realistic content management" - Con...Blend Interactive
 
"Making things real: Content strategy for realistic content management" - Con...
"Making things real: Content strategy for realistic content management" - Con..."Making things real: Content strategy for realistic content management" - Con...
"Making things real: Content strategy for realistic content management" - Con...Blend Interactive
 
Getting Started With User-Centered Content by Emileigh Barnes & Kate Garklavs...
Getting Started With User-Centered Content by Emileigh Barnes & Kate Garklavs...Getting Started With User-Centered Content by Emileigh Barnes & Kate Garklavs...
Getting Started With User-Centered Content by Emileigh Barnes & Kate Garklavs...Blend Interactive
 
Content Measurement and Analytics: Making Positive Change on the Web by Rick ...
Content Measurement and Analytics: Making Positive Change on the Web by Rick ...Content Measurement and Analytics: Making Positive Change on the Web by Rick ...
Content Measurement and Analytics: Making Positive Change on the Web by Rick ...Blend Interactive
 
“How Silos Learn: Working in the Idea Factory” by Amanda Costello - Now What?...
“How Silos Learn: Working in the Idea Factory” by Amanda Costello - Now What?...“How Silos Learn: Working in the Idea Factory” by Amanda Costello - Now What?...
“How Silos Learn: Working in the Idea Factory” by Amanda Costello - Now What?...Blend Interactive
 
"Adaptive Content, Context, and Controversy
"Adaptive Content, Context, and Controversy"Adaptive Content, Context, and Controversy
"Adaptive Content, Context, and ControversyBlend Interactive
 
"Empathy Behind the Algorithms" by Chris Corak - Now What? Conference 2017
"Empathy Behind the Algorithms" by Chris Corak - Now What? Conference 2017"Empathy Behind the Algorithms" by Chris Corak - Now What? Conference 2017
"Empathy Behind the Algorithms" by Chris Corak - Now What? Conference 2017Blend Interactive
 
“The Beauty of Brevity” by Ravi Jain - Now What? Conference 2017
“The Beauty of Brevity” by Ravi Jain - Now What? Conference 2017“The Beauty of Brevity” by Ravi Jain - Now What? Conference 2017
“The Beauty of Brevity” by Ravi Jain - Now What? Conference 2017Blend Interactive
 
“Why Content Projects Fail” by Deane Barker - Now What? Conference 2017
“Why Content Projects Fail” by Deane Barker - Now What? Conference 2017“Why Content Projects Fail” by Deane Barker - Now What? Conference 2017
“Why Content Projects Fail” by Deane Barker - Now What? Conference 2017Blend Interactive
 
Not Perfect, "Always Better: A Story of Inclusion" by Derek Featherstone - No...
Not Perfect, "Always Better: A Story of Inclusion" by Derek Featherstone - No...Not Perfect, "Always Better: A Story of Inclusion" by Derek Featherstone - No...
Not Perfect, "Always Better: A Story of Inclusion" by Derek Featherstone - No...Blend Interactive
 
Content Design for Mobile Devices - Now What? Conference 2017
Content Design for Mobile Devices - Now What? Conference 2017Content Design for Mobile Devices - Now What? Conference 2017
Content Design for Mobile Devices - Now What? Conference 2017Blend Interactive
 
"Making Things Real: Taking content strategy from abstract to functional" - M...
"Making Things Real: Taking content strategy from abstract to functional" - M..."Making Things Real: Taking content strategy from abstract to functional" - M...
"Making Things Real: Taking content strategy from abstract to functional" - M...Blend Interactive
 
“Writing for Your Audience — The Message, the Words, the Plan” – Business Sen...
“Writing for Your Audience — The Message, the Words, the Plan” – Business Sen...“Writing for Your Audience — The Message, the Words, the Plan” – Business Sen...
“Writing for Your Audience — The Message, the Words, the Plan” – Business Sen...Blend Interactive
 
"The Self-Directed Strategist: Building a Practice and Managing Organizationa...
"The Self-Directed Strategist: Building a Practice and Managing Organizationa..."The Self-Directed Strategist: Building a Practice and Managing Organizationa...
"The Self-Directed Strategist: Building a Practice and Managing Organizationa...Blend Interactive
 
Next Level Collaboration: The Future of Content and Design by Rebekah Cancino...
Next Level Collaboration: The Future of Content and Design by Rebekah Cancino...Next Level Collaboration: The Future of Content and Design by Rebekah Cancino...
Next Level Collaboration: The Future of Content and Design by Rebekah Cancino...Blend Interactive
 

More from Blend Interactive (20)

"The Accessible Editor Workshop" by Corey Vilhauer, from Now What? Workshops ...
"The Accessible Editor Workshop" by Corey Vilhauer, from Now What? Workshops ..."The Accessible Editor Workshop" by Corey Vilhauer, from Now What? Workshops ...
"The Accessible Editor Workshop" by Corey Vilhauer, from Now What? Workshops ...
 
"The Accessible Editor" by Corey Vilhauer, from DrupalCon 2018 in Nashville, ...
"The Accessible Editor" by Corey Vilhauer, from DrupalCon 2018 in Nashville, ..."The Accessible Editor" by Corey Vilhauer, from DrupalCon 2018 in Nashville, ...
"The Accessible Editor" by Corey Vilhauer, from DrupalCon 2018 in Nashville, ...
 
"Click to Continue" by Sam Otis, from Content+Design Meetup, Oct. 4, 2017
"Click to Continue" by Sam Otis, from Content+Design Meetup, Oct. 4, 2017"Click to Continue" by Sam Otis, from Content+Design Meetup, Oct. 4, 2017
"Click to Continue" by Sam Otis, from Content+Design Meetup, Oct. 4, 2017
 
"Never Knowing Enough: dealing with the self doubt that hinders your success....
"Never Knowing Enough: dealing with the self doubt that hinders your success...."Never Knowing Enough: dealing with the self doubt that hinders your success....
"Never Knowing Enough: dealing with the self doubt that hinders your success....
 
"Making things real: Content strategy for realistic content management" - Con...
"Making things real: Content strategy for realistic content management" - Con..."Making things real: Content strategy for realistic content management" - Con...
"Making things real: Content strategy for realistic content management" - Con...
 
"Making things real: Content strategy for realistic content management" - Con...
"Making things real: Content strategy for realistic content management" - Con..."Making things real: Content strategy for realistic content management" - Con...
"Making things real: Content strategy for realistic content management" - Con...
 
Getting Started With User-Centered Content by Emileigh Barnes & Kate Garklavs...
Getting Started With User-Centered Content by Emileigh Barnes & Kate Garklavs...Getting Started With User-Centered Content by Emileigh Barnes & Kate Garklavs...
Getting Started With User-Centered Content by Emileigh Barnes & Kate Garklavs...
 
Content Measurement and Analytics: Making Positive Change on the Web by Rick ...
Content Measurement and Analytics: Making Positive Change on the Web by Rick ...Content Measurement and Analytics: Making Positive Change on the Web by Rick ...
Content Measurement and Analytics: Making Positive Change on the Web by Rick ...
 
“How Silos Learn: Working in the Idea Factory” by Amanda Costello - Now What?...
“How Silos Learn: Working in the Idea Factory” by Amanda Costello - Now What?...“How Silos Learn: Working in the Idea Factory” by Amanda Costello - Now What?...
“How Silos Learn: Working in the Idea Factory” by Amanda Costello - Now What?...
 
"Adaptive Content, Context, and Controversy
"Adaptive Content, Context, and Controversy"Adaptive Content, Context, and Controversy
"Adaptive Content, Context, and Controversy
 
"Empathy Behind the Algorithms" by Chris Corak - Now What? Conference 2017
"Empathy Behind the Algorithms" by Chris Corak - Now What? Conference 2017"Empathy Behind the Algorithms" by Chris Corak - Now What? Conference 2017
"Empathy Behind the Algorithms" by Chris Corak - Now What? Conference 2017
 
“The Beauty of Brevity” by Ravi Jain - Now What? Conference 2017
“The Beauty of Brevity” by Ravi Jain - Now What? Conference 2017“The Beauty of Brevity” by Ravi Jain - Now What? Conference 2017
“The Beauty of Brevity” by Ravi Jain - Now What? Conference 2017
 
“Why Content Projects Fail” by Deane Barker - Now What? Conference 2017
“Why Content Projects Fail” by Deane Barker - Now What? Conference 2017“Why Content Projects Fail” by Deane Barker - Now What? Conference 2017
“Why Content Projects Fail” by Deane Barker - Now What? Conference 2017
 
Not Perfect, "Always Better: A Story of Inclusion" by Derek Featherstone - No...
Not Perfect, "Always Better: A Story of Inclusion" by Derek Featherstone - No...Not Perfect, "Always Better: A Story of Inclusion" by Derek Featherstone - No...
Not Perfect, "Always Better: A Story of Inclusion" by Derek Featherstone - No...
 
Content Design for Mobile Devices - Now What? Conference 2017
Content Design for Mobile Devices - Now What? Conference 2017Content Design for Mobile Devices - Now What? Conference 2017
Content Design for Mobile Devices - Now What? Conference 2017
 
"Making Things Real: Taking content strategy from abstract to functional" - M...
"Making Things Real: Taking content strategy from abstract to functional" - M..."Making Things Real: Taking content strategy from abstract to functional" - M...
"Making Things Real: Taking content strategy from abstract to functional" - M...
 
“Writing for Your Audience — The Message, the Words, the Plan” – Business Sen...
“Writing for Your Audience — The Message, the Words, the Plan” – Business Sen...“Writing for Your Audience — The Message, the Words, the Plan” – Business Sen...
“Writing for Your Audience — The Message, the Words, the Plan” – Business Sen...
 
Functional concepts in C#
Functional concepts in C#Functional concepts in C#
Functional concepts in C#
 
"The Self-Directed Strategist: Building a Practice and Managing Organizationa...
"The Self-Directed Strategist: Building a Practice and Managing Organizationa..."The Self-Directed Strategist: Building a Practice and Managing Organizationa...
"The Self-Directed Strategist: Building a Practice and Managing Organizationa...
 
Next Level Collaboration: The Future of Content and Design by Rebekah Cancino...
Next Level Collaboration: The Future of Content and Design by Rebekah Cancino...Next Level Collaboration: The Future of Content and Design by Rebekah Cancino...
Next Level Collaboration: The Future of Content and Design by Rebekah Cancino...
 

"Searching with Solr" - Tyler Harms, South Dakota Code Camp 2012

  • 1. Searching with Solr AN INTRODUCTION Tyler Harms Developer @harmstyler tyler@blendinteractive.com 1 Saturday, November 10, 12
  • 2. Why Implement Solr? SEARCHING WITH SOLR • Does your site need search? • Is google enough? • Do you need/want to control rankings? • Just text, or Structured Data? 2 Saturday, November 10, 12
  • 3. What is Solr? SEARCHING WITH SOLR Solr is a standalone enterprise search server with a REST-like API. You put documents in it [...] over HTTP. You query it via HTTP GET and receive [...] results. 3 Saturday, November 10, 12
  • 5. Solr Versions SEARCHING WITH SOLR • Current Version(s) • Solr 3.6.1 • Solr 4 • Released Versions are always stable 5 Saturday, November 10, 12
  • 6. $ wget http://(...)/3.6.1/apache-solr-3.6.1.tgz $ tar -xzf apache-solr-3.6.1.tgz $ cd apache-solr-3.6.1/example/ $ java -jar start.jar (a lot of java log...) 6 Saturday, November 10, 12
  • 7. Search Alternatives SEARCHING WITH SOLR • Google • Lucene • elasticsearch • Whoosh • Xapien • Many Others 7 Saturday, November 10, 12
  • 8. NOT a Database Replacement SEARCHING WITH SOLR • Solr is designed to live alongside your website as a separate web app 8 Saturday, November 10, 12
  • 9. Frontend Database Master Servers[1..n] Database Slaves[0..n] Solr Master Solr Slaves[0..n] 10 9 Saturday, November 10, 12
  • 10. Scaling Solr SEARCHING WITH SOLR • Master/Slave Architecture • Write to master -> Read from slaves • Multicore Setup • Multiple Solr ‘cores’ running alongside each other within the same install 10 Saturday, November 10, 12
  • 11. Solr’s Data Model SEARCHING WITH SUB HEADLINE SOLR • Solr maintains a collection of documents • A document is a collection of fields and values • A field can occur multiple times in a doc • Documents are immutable • They can be deleted and replaced by new versions, however. 11 Saturday, November 10, 12
  • 12. Querying SEARCHING WITH SUB HEADLINE SOLR • http request • http://localhost:8983/solr/select?q=blend&start=0&rows=10 12 Saturday, November 10, 12
  • 13. Solr Query Syntax SEARCHING WITH SUB HEADLINE SOLR • blend (value) • company:blend (field:value) • title:”Searching with Solr” AND text:apache • id:[* TO *] • *:* (all fields : all values) 13 Saturday, November 10, 12
  • 14. Using Solr SEARCHING WITH SUB HEADLINE SOLR • Getting Data into Solr • Getting Data out of Solr 14 Saturday, November 10, 12
  • 15. Getting Data into Solr SEARCHING WITH SUB HEADLINE SOLR • POST it <add> <doc> <field name="abstract">Lorem ipsum</field> <field name="company">Blend Interactive</field> <field name="text">Lorem Ipsum</field> <field name="title">Some Title</field> </doc> [<doc> ... </doc>[<doc> ... </doc>]] </add> 15 Saturday, November 10, 12
  • 16. Getting Data into Solr SEARCHING WITH SUB HEADLINE SOLR • POST it <add> <doc> <field name="abstract">Lorem ipsum</field> <field name="company">Blend Interactive</field> <field name="text">Lorem Ipsum</field> <field name="title">Some Title</field> </doc> [<doc> ... </doc>[<doc> ... </doc>]] </add> 16 Saturday, November 10, 12
  • 17. Getting Data into Solr SEARCHING WITH SUB HEADLINE SOLR • POST it <add> <doc> <field name="abstract">Lorem ipsum</field> <field name="company">Blend Interactive</field> <field name="text">Lorem Ipsum</field> <field name="title">Some Title</field> </doc> [<doc> ... </doc>[<doc> ... </doc>]] </add> 17 Saturday, November 10, 12
  • 18. Commiting SEARCHING WITH SUB HEADLINE SOLR • Nothing shows up in the index until you commit • You can just POST <commit/> to: • http://<host>:<port>/solr/update 18 Saturday, November 10, 12
  • 19. Getting Data out of Solr SEARCHING WITH SUB HEADLINE SOLR • http://localhost:8983/solr/select/?q=solr 19 Saturday, November 10, 12
  • 20. <response> <lst name="responseHeader"> <int name="status">0</int> <int name="QTime">19</int> <lst name="params"> <str name="q">solr</str> </lst> </lst> <result name="response" numFound="1" start="0"> <doc> <str name="abstract"> A brief introduction to using Apache Solr for implementing search for your website. </str> <str name="django_ct">codecamp.session</str> <str name="django_id">19</str> <str name="id">codecamp.session.19</str> <str name="text"> Searching with Solr: An Introduction A brief introduction to using Apache Solr for implementing search for your website. </str> <str name="title">Searching with Solr: An Introduction</str> </doc> </result> </response> 20 Saturday, November 10, 12
  • 21. <response> <lst name="responseHeader"> <int name="status">0</int> <int name="QTime">19</int> <lst name="params"> <str name="q">solr</str> </lst> </lst> <result name="response" numFound="1" start="0"> <doc> <str name="abstract"> A brief introduction to using Apache Solr for implementing search for your website. </str> <str name="django_ct">codecamp.session</str> <str name="django_id">19</str> <str name="id">codecamp.session.19</str> <str name="text"> Searching with Solr: An Introduction A brief introduction to using Apache Solr for implementing search for your website. </str> <str name="title">Searching with Solr: An Introduction</str> </doc> </result> </response> 21 Saturday, November 10, 12
  • 22. <response> <lst name="responseHeader"> <int name="status">0</int> <int name="QTime">19</int> <lst name="params"> <str name="q">solr</str> </lst> </lst> <result name="response" numFound="1" start="0"> <doc> <str name="abstract"> A brief introduction to using Apache Solr for implementing search for your website. </str> <str name="django_ct">codecamp.session</str> <str name="django_id">19</str> <str name="id">codecamp.session.19</str> <str name="text"> Searching with Solr: An Introduction A brief introduction to using Apache Solr for implementing search for your website. </str> <str name="title">Searching with Solr: An Introduction</str> </doc> </result> </response> 22 Saturday, November 10, 12
  • 23. Getting Data out of Solr: JSON SEARCHING WITH SUB HEADLINE SOLR • http://localhost:8983/solr/select/?q=solr&wt=json 23 Saturday, November 10, 12
  • 24. { "responseHeader": { "status":0, "QTime":0, "params": { "wt":"json", "q":"solr" } }, "response": { "numFound":1, "start":0, "docs":[{ "django_id":"19", "title":"Searching with Solr: An Introduction", "text":"Searching with Solr: An IntroductionnA brief introduction to using Apache Solr for implementing search for your website.", "abstract":"A brief introduction to using Apache Solr for implementing search for your website.", "django_ct":"codecamp.session","id":"codecamp.session.19" }] } } 24 Saturday, November 10, 12
  • 25. Deleting Data from Solr SEARCHING WITH SUB HEADLINE SOLR • POST it <delete><id>codecamp.session.19</id></delete> <delete><query>company:blend</query></delete> 25 Saturday, November 10, 12
  • 26. The Solr Schema SEARCHING WITH SOLR • schema.xml • Defines ‘types’ used in the webapp • Defines the fields • Defines ‘copyfields’ • Read the schema inside the example project for more 26 Saturday, November 10, 12
  • 27. The Solr Schema SEARCHING WITH SOLR • Types • Define how a field and query should be processed • Word Stemming • Case Folding • How would you handle a search for ‘C.I.A.’? • Dates, ints, floats, etc.. are defined here as well • 2 Modes • Index Time • Query Time 27 Saturday, November 10, 12
  • 28. <fieldType name="text" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/> </analyzer> <analyzer type="query"> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/> </analyzer> </fieldType> 28 Saturday, November 10, 12
  • 29. <fieldType name="text" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/> </analyzer> <analyzer type="query"> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/> </analyzer> </fieldType> 29 Saturday, November 10, 12
  • 30. <fieldType name="text" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/> </analyzer> <analyzer type="query"> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/> </analyzer> </fieldType> 30 Saturday, November 10, 12
  • 31. Fields SEARCHING WITH SOLR • The elements of a document • Both Predefined and Dynamic • Fields may occur multiple times • May be indexed and/or stored 31 Saturday, November 10, 12
  • 32. <fields> <!-- general --> <field name="id" type="string" indexed="true" stored="true" multiValued="false" required="true"/> <field name="django_ct" type="string" indexed="true" stored="true" multiValued="false" /> <field name="django_id" type="string" indexed="true" stored="true" multiValued="false" /> <!-- dynamic --> <dynamicField name="*_i" type="sint" indexed="true" stored="true"/> <dynamicField name="*_s" type="string" indexed="true" stored="true"/> <dynamicField name="*_l" type="slong" indexed="true" stored="true"/> <dynamicField name="*_t" type="text" indexed="true" stored="true"/> <dynamicField name="*_b" type="boolean" indexed="true" stored="true"/> <dynamicField name="*_f" type="sfloat" indexed="true" stored="true"/> <dynamicField name="*_d" type="sdouble" indexed="true" stored="true"/> <dynamicField name="*_dt" type="date" indexed="true" stored="true"/> <!-- app --> <field name="bio" type="text" indexed="true" stored="true" multiValued="false" /> <field name="title" type="text" indexed="true" stored="true" multiValued="false" /> <field name="text" type="text" indexed="true" stored="true" multiValued="false" /> <field name="abstract" type="text" indexed="true" stored="true" multiValued="false" /> <field name="full_name" type="text" indexed="true" stored="true" multiValued="false" /> <field name="company" type="text" indexed="true" stored="true" multiValued="false" /> </fields> 32 Saturday, November 10, 12
  • 33. <fields> <!-- general --> <field name="id" type="string" indexed="true" stored="true" multiValued="false" required="true"/> <field name="django_ct" type="string" indexed="true" stored="true" multiValued="false" /> <field name="django_id" type="string" indexed="true" stored="true" multiValued="false" /> <!-- dynamic --> <dynamicField name="*_i" type="sint" indexed="true" stored="true"/> <dynamicField name="*_s" type="string" indexed="true" stored="true"/> <dynamicField name="*_l" type="slong" indexed="true" stored="true"/> <dynamicField name="*_t" type="text" indexed="true" stored="true"/> <dynamicField name="*_b" type="boolean" indexed="true" stored="true"/> <dynamicField name="*_f" type="sfloat" indexed="true" stored="true"/> <dynamicField name="*_d" type="sdouble" indexed="true" stored="true"/> <dynamicField name="*_dt" type="date" indexed="true" stored="true"/> <!-- app --> <field name="bio" type="text" indexed="true" stored="true" multiValued="false" /> <field name="title" type="text" indexed="true" stored="true" multiValued="false" /> <field name="text" type="text" indexed="true" stored="true" multiValued="false" /> <field name="abstract" type="text" indexed="true" stored="true" multiValued="false" /> <field name="full_name" type="text" indexed="true" stored="true" multiValued="false" /> <field name="company" type="text" indexed="true" stored="true" multiValued="false" /> </fields> 33 Saturday, November 10, 12
  • 34. <fields> <!-- general --> <field name="id" type="string" indexed="true" stored="true" multiValued="false" required="true"/> <field name="django_ct" type="string" indexed="true" stored="true" multiValued="false" /> <field name="django_id" type="string" indexed="true" stored="true" multiValued="false" /> <!-- dynamic --> <dynamicField name="*_i" type="sint" indexed="true" stored="true"/> <dynamicField name="*_s" type="string" indexed="true" stored="true"/> <dynamicField name="*_l" type="slong" indexed="true" stored="true"/> <dynamicField name="*_t" type="text" indexed="true" stored="true"/> <dynamicField name="*_b" type="boolean" indexed="true" stored="true"/> <dynamicField name="*_f" type="sfloat" indexed="true" stored="true"/> <dynamicField name="*_d" type="sdouble" indexed="true" stored="true"/> <dynamicField name="*_dt" type="date" indexed="true" stored="true"/> <!-- app --> <field name="bio" type="text" indexed="true" stored="true" multiValued="false" /> <field name="title" type="text" indexed="true" stored="true" multiValued="false" /> <field name="text" type="text" indexed="true" stored="true" multiValued="false" /> <field name="abstract" type="text" indexed="true" stored="true" multiValued="false" /> <field name="full_name" type="text" indexed="true" stored="true" multiValued="false" /> <field name="company" type="text" indexed="true" stored="true" multiValued="false" /> </fields> 34 Saturday, November 10, 12
  • 35. Copy Fields SEARCHING WITH SOLR • Two Main Uses • Analyze fields in different ways • Concatenate Fields 35 Saturday, November 10, 12
  • 36. <copyField source="bio" dest="df_text" /> <copyField source="year" dest="century" maxChars="2"/> 36 Saturday, November 10, 12
  • 37. <copyField source="bio" dest="df_text" /> <copyField source="year" dest="century" maxChars="2"/> 37 Saturday, November 10, 12
  • 38. <copyField source="bio" dest="df_text" /> <copyField source="year" dest="century" maxChars="2"/> 2000 would be stored as 20 Useful for custom faceting 38 Saturday, November 10, 12
  • 39. The Solr Config File SEARCHING WITH SUB HEADLINE SOLR • solrconfig.xml • Defines request handlers, defaults, & caches • Read the solrconfig.xml inside the example project for more 39 Saturday, November 10, 12
  • 40. Other Solr Tools SEARCHING WITH SUB HEADLINE SOLR • Debug Query • Boost Functions • Search Faceting • Search Filters • Search Highlighting • Solr Admin 40 Saturday, November 10, 12
  • 41. Debug Query Option SEARCHING WITH SUB HEADLINE SOLR • Add &debugQuery=on to request parameters • Returns a parsed form of the query 41 Saturday, November 10, 12
  • 42. <lst name="debug"> <str name="rawquerystring">solr</str> <str name="querystring">solr</str> <str name="parsedquery">text:solr</str> <str name="parsedquery_toString">text:solr</str> <lst name="explain"> <str name="codecamp.session.19"> 1.2147729 = (MATCH) fieldWeight(text:solr in 17), product of: 1.4142135 = tf(termFreq(text:solr)=2) 3.9267395 = idf(docFreq=2, maxDocs=56) 0.21875 = fieldNorm(field=text, doc=17) </str> </lst> 42 Saturday, November 10, 12
  • 43. <lst name="debug"> <str name="rawquerystring">solr</str> <str name="querystring">solr</str> <str name="parsedquery">text:solr</str> <str name="parsedquery_toString">text:solr</str> <lst name="explain"> <str name="codecamp.session.19"> 1.2147729 = (MATCH) fieldWeight(text:solr in 17), product of: 1.4142135 = tf(termFreq(text:solr)=2) 3.9267395 = idf(docFreq=2, maxDocs=56) 0.21875 = fieldNorm(field=text, doc=17) </str> </lst> 43 Saturday, November 10, 12
  • 44. Boost Function SEARCHING WITH SUB HEADLINE SOLR • Allows you to influence results at query time • Really useful for tuning scoring • You can also boost at index time 44 Saturday, November 10, 12
  • 45. Boost Function SEARCHING WITH SUB HEADLINE SOLR • Allows you to influence results at query time • Really useful for tuning scoring • You can also boost at index time q=blend&qf=text^2 company 45 Saturday, November 10, 12
  • 46. Boost Function SEARCHING WITH SUB HEADLINE SOLR • Allows you to influence results at query time More information available - • Really useful for tuning scoring http://wiki.apache.org/solr/ SolrRelevancyFAQ Can use both dismax and • You can also boost at index time standard query handlers, I use dismax q=blend&qf=text^2 company 46 Saturday, November 10, 12
  • 47. Boost Function SEARCHING WITH SUB HEADLINE SOLR • Allows you to influence results at query time More information available - • Really useful for tuning scoring http://wiki.apache.org/solr/ SolrRelevancyFAQ Can use both dismax and • You can also boost at index time standard query handlers, I use dismax &bq=text:blend^2 47 Saturday, November 10, 12
  • 48. Solr Faceting SEARCHING WITH SUB HEADLINE SOLR • What is a facet? • “Interaction style where users filter a set of items by progressively selecting from only valid values of a  faceted classification system” - Keith Instone, SOASIS&T, July 8, 2004 • What does it look like? • Make sure to use an untokenized field (e.g. string) • “San Jose” != “san”+“jose” 48 Saturday, November 10, 12
  • 49. q=*:* facet=on facet.field=company 49 Saturday, November 10, 12
  • 50. Solr Filter Query SEARCHING WITH SUB HEADLINE SOLR • Used to narrow your search query • Restrict the super set of documents that can be returned • ‘fq’ parameter (short for Filter Query) 50 Saturday, November 10, 12
  • 51. Solr Filter Query SEARCHING WITH SUB HEADLINE SOLR • Used to narrow your search query • Restrict the super set of documents that can be returned • ‘fq’ parameter (short for Filter Query) q=*:* fq=company:blend 51 Saturday, November 10, 12
  • 52. Search Highlighting SEARCHING WITH SUB HEADLINE SOLR • Allow Solr to generate your highlight 52 Saturday, November 10, 12
  • 53. Search Highlighting SEARCHING WITH SUB HEADLINE SOLR • Allow Solr to generate your highlight 53 Saturday, November 10, 12
  • 54. hl=true hl.simple.pre=<b> hl.simple.post=</b> hl.fragsize=200 hl.requireFieldMatch=false hl.fl=text bio title hl.snippets=1 54 Saturday, November 10, 12
  • 55. Solr Admin SEARCHING WITH SUB HEADLINE SOLR • http://localhost:8983/solr/admin/ • Built in app for testing all search options • Field Analysis • Schema Browser • Full Query Interface • Solr Statistics • Solr Information • Many More Options 55 Saturday, November 10, 12
  • 56. Solr/Browse SEARCHING WITH SUB HEADLINE SOLR • Test your search configuration using the /browse requestHandler 56 Saturday, November 10, 12
  • 57. Resources SEARCHING WITH SUB HEADLINE SOLR • Apache Solr Website • http://lucene.apache.org/solr/ • Wiki, mailing list, bugs/features • Books 57 Saturday, November 10, 12