SlideShare uma empresa Scribd logo
1 de 74
BETTER SEARCH
    ENGINE TESTING




STPCON 2011 | EPUGH@O19S.COM | @DEP4B
WHY AM I QUALIFIED TO BE
           UP HERE?
•   President of OpenSource
    Connections

• Contributor  to CruiseControl
    and Continuum CI projects

• Member    of Apache Software
    Foundation

• Presenter at conferences
    (OSCON, ApacheCON, jTDS,
    ExpoQA, STPcon 2009!)
AUTHOR
WRITER
FATHER
AGILISTA
AGENDA
Why is Search Becoming More
         Important?


  What is a Search Engine?

   Techniques for Testing

         Wrap Up
WHY IS SEARCH
BECOMING MORE
  IMPORTANT?
INFORMATION IS EXPLODING

 “information workers ... are each bombarded with1.6
gigabytes of information on average every day through
  emails, reports, blogs, text messages,calls and more”.

• http://online.wsj.com/article/SB124252211780027326.html
UNSTRUCTURED



• emails, spreadsheets, documents, presentations, images,
 databases

  • 75%   unstructured to 25% structured
MANAGING DATA IS
              EXPENSIVE


•1   GB costs $.20 to store

•1   GB costs $3500 to Manage
WHAT DOES 3500 BUY YOU?


• 69% of respondents felt 50% or less of data could be found
 online

• Knowledge  workers spend 25% of their time engaged in
 search-related activities.
WHY NOT JUST USE GOOGLE


 • We   don’t want 44 million results, we want 1

 • we   want “the” answer, not “an” answer

 • we   tolerate inefficieny in the Internet search
                    As
John Allenhappy toputs it: “The Internet is
 • We are Paulos “satisfice”
the world's largest library.It's just that all
           the books are on
                the floor.”
WHAT IS A SEARCH
   ENGINE?
THREE STAGES OF SEARCH
CONTENT INDEXING
•- creating an index by crawling the content directories,
 databases, other repositories using an automated process
 (either pushing or pulling changes)

• create   an Index, which is a searchable key to a collection.

• In
   Enterprise Search, the indexing mechanism should be able
 to access company private data (with access privileges
 maintained)

• control
        indexing schedule - being able to index rapidly
 changing content quickly, other content more slowly.

• rather   than having the bot look for the data.
CONTENT INDEXING

• Indexing   may also support

  • Metadata   extraction

  • Auto-summarization, which  is analyse of the collection and
   group its content into categories or clusters.

• Metadatain turn becomes facets that can be used to tune the
 query to put emphasis on that category.
CONTENT INDEXING
QUERYING
FORMATTING
FACETING
Faceted or "guided navigation"
leverages metadata fields and
values to provide users with visible
options for narrowing or refining
their query.
- Peter Morville, Search Patterns




                                       26
Search
Stack


               User
Interface

               Search
Engine

                   Data
HOW DO WE TEST?
HOW DO WE TEST?


• Querying

• Formatting

• Content   Indexing

• Performance
WHO SHOULD TEST?
CHALLENGES
• Competing   business stakeholders:

 • Tester: When I search for “lamp shades”, I used to see these
   documents, now I see a differing set.

 • Business  Owner: How do I know that the new search
   engine is better?

 • User: My   pet feature “search within these results” works
   differently.

 • Marketing Guy: I want to control the results so the current
   marketing push for toilet paper brand X always shows up at
   the top.
CHALLENGES


• Stakeholders want a better search implementation, but
 perversely often want it to all work “the exact same way”.  
 Getting agreement across all the stakeholders for the
 project vision, and agree on the metrics is a challenge.
PERFECT SEARCH TESTER
         WOULD BE ALL OF
• Mathematician   • Business Analyst

• Librarian       • Systems   Engineer

• UX   Expert     • Geographer!

• Writer          • Psychologist

• Programmer
KNOWLEDGE TRANSFER

• If
   you don’t have the perfect team already, bring in experts and
  do domain knowledge transfer.

• Learn the vocabulary of search to better communicate
  together

  • “auto   complete” vs “auto suggest”

• Do “Search   for Content Team” brownbag sessions!
QUERY TESTING



• Often   called “relevancy testing”
TWO SCHOOLS OF
                THOUGHT


• “One True Answer”

• “I   know it when I see it”
“ONE TRUE ANSWER”

• Absolute Truth   / Matrix / Grid / TREC / Relevancy Assertions

 • The    correct answers for each search are known ahead of
   time

 • Humans   judges often decide these correct answers, stored
   as Relevancy Assertions

 • Can    be labor intensive to setup

• A “Numerical   Grade” is produced for comparision
PROBLEMS WITH THIS
             APPROACH
• Open  to gaming. TREC competition is swamped by
 “academic” search engine efforts that don’t work in the real
 world.

• Needa well understood data set with generally accepted
 answers.

  is it better to have an engine that gives modestly relevant
   results almost all the time, or an engine that gives really
 good answers sometimes, better on average than the other
     engine, but sometimes gives back complete garbage?
A/B TESTING
                                              Engine version 1 and
                                              version 2!

• Tracks   explicit or implicit preferences between engines A/B

• Often    dispenses with the notion of the "correct" answer

• Canbe easier to setup, but some fear the best answers will be
 missed by both engines
RELEVANCY



• Do   we have any defined relevancy metrics?

• Relevancy   is like porn.....
I KNOW IT WHEN I SEE IT!




  http://en.wikipedia.org/wiki/Les_Amants
BEYOND PRECISION AND
  RECALL: HOW ENGINES ARE
• Binary   vs. Non-Binary Grading Systems

  • Early TREC
             had binary judgements, only Yes/No on whether
   each doc was related to a test search

  • More    choices were later added

  •A system can use letter grades (A, B, C, D and F) or numeric
   grades

  • Another style asks testers to sort documents in their
   preferred order
CLASSIC MEASUREMENTS OF
    SEARCH RELEVANCY


• Recall: "Did
           I find all the documents I expected to get back? 
 What percent?"

• Precision: "Did
                the system bring back other documents that
 weren't relevant?  What percent were on target?"
NEWER IDEAS


• Rank: The   order of documents that were returned

  • Generally
            a 1 in 20 match in the #1 spot is better than a
   50% rate where all matches are on the second page.
INTERACTIVITY: WHAT
      NAVIGATORS OR
 VISUALIZATION WERE GIVEN
• Facets   and sorting: Clickable filters and sort options

• Unsupervised     Clustering: Related terms or phrases, or related
 searches

• Spelling   and thesaurus suggestions
SUBJECT DISAMBIGUATION,
   SENTIMENT, CONFLICTING
    INFORMATION, CROWD
            HINTS
• kidney   bean or kidney cell

• "best   football team in the UK"
SOURCES OF VARIANCE, AKA
      "PROBLEMS"
                 Note, this is talking
                 about comparing search
                 engine a to search engine
                 b. But I am thinking
                 more in the context of
                 search engive v1 to v2!
DIFFERENT GOALS



• Perfect/Human   vs. Best vs. Acceptable vs. Better than X

• Constrained   vs. Unconstrained Resources (time, cpu, storage)
SAMPLE SIZE


• Amount   of Data

 • Fixed   set or growing over time

• Number    of Testers (AB or Relevancy Judgments)

• Number    of Searches
VERTICAL VS. HORIZONTAL
         CONTENT


• Oneextreme: Specific demo may cover just one discipline, for
 example Medical Journals

• Other   extreme: Internet covers vastly disparate domains
USERS
• Experienced     vs. New Searcher

• Subject   Expert vs. Novice

• Spelling, typing   and computer proficiency

• InterfaceMedium (large visual display, small text display,
 audible, Braille, etc)

• Amount      of Effort to understand Search

• Willingness   to Iterate

• Searching    for specific answer vs. General Exploration
TYPE OF SEARCHES

• Length    / 1 or 2 words

• Full   question

• Sample     text

• Internet   Boolean

• Advanced     Boolean / Syntax / Proximity

  • Wildcard, Regex, etc.
PUNCTUATION


• Chemical

• Source    Code

• Units   of Measure

• Literal   vs. Search Operator
NOT EVEN GETTING INTO
    MULTI LINGUAL SEARCH



• How   do I test in languages I don’t understand?
GROK YOUR RESULTS
FORMATTING TESTING



• Directly   builds on most of our existing test skills.
PERSONAS & SCENARIOS
Persona 1: Going to be a mom
                     Oh my God                                                                                                               Needy
                     I’m actually
                     Pregnant!                                Narrative

                                                                Self Introduction
                                                                Hi all, I'm very new to this but i couldn't help but share my excitement. I have
                                                                just found out today that i am pregnant. It wasn't planned, me and my
                                                                partner of a year and a half were going to wait until we had our own place and
                                                                were married first but it looks like we have done it the other way round.

      What’s next? What am I
      supposed to do?                                           My only concern is that i don't really know how my boyfriend feels about it. I
      Guidance please!                                          know we need to discuss the options but i have really already made up my
                                                                mind about what i want to do. There is so much to consider, money, a decent
To interact with
people going                                                    place to live, being ready but i know i am ready and have been for a long time ( I
through the same                                                get extremely broody when i see my friends kids)
thing.

                                                                Should i just tell him how i feel or go with how he feels because i don't want to
                               Scenarios that typify -          lose him. He is a loving partner who would stand by me through anything i just
                               planned to get pregnant, but     don’t want him to feel like i am tying him down!!! I suppose i am feeling very
                                                                happy but also very confused at the same time!!!
                               hasn't done any research
                               Catch phrases - Nervous but
                               excited, giddy, Where do I       hp://forum.sofeminine.co.uk/forum/maternite1/__f468_maternite1-Oh-
                               start?                           my-god-i-m-pregnant.html
                               Tag lines - Wants to share,
                               has a million questions
                               Likely to say - Guide me,
                               help me get off to a good
                               start
Persona 2: New Mom
                                    Are my kids sick or is                                                                                                             Demanding
                                    this condition normal?
                                    How do I…?                                        Narrative



                                                                                      I have been hearing about women who claim that thier 2, 2 1/2 or 3 year old is not ready
                                                                                      for the poy. They claim its a nightmare and are waiting for their children to come around.

                                                                                      Maybe I grew up in the twilight zone, but I had always assumed that poy training was
                                                                                      something that is just done. Its done when:
                    How do I ensure my                                                   a) The child in question can sleep through the night and stay dry.
                    baby is latching on                                                  b) The child in question can speak to you, in full sentences. like, "apple juice, please" or
                    correctly?                                                           "wanna go to the park" or "momma I wanna hold you..."
                                                                                         c) The child in question knows they are soiled and can ask to be changed.

      What type of stroller                                                           Barring any of those things, a child is ready to be placed on the poy. using the poy was
      should I buy? What                                                              never negotiable in my family. When we hit the above milestones my mother trained us. We
      brand of car seat is                                                            just did it. If we complained she never put diapers on us, she just kept directing us back to
      best?                                                                           the poy. Her methods of redirection may be controversial (she told my brother that
                                                                                      unless he was a big boy he would not get a happy meal. Boys who pooed on themselves got
                                                                                      sad meals... lol!!! He straightened up and started using the poy at 2 1/2) but she was
                                                                                      never abusive or anything she just DIDNT ASK US. it was time to poy and that was it.
                                              Scenarios that Typify
                                                                                      The reasoning was that I used to drink from a bole, and sleep with my mother, and such,
                                              Likely to say -Are my kids sick or is
                                              this condition normal?                  now I don't. I also used to crap my pants, and that is no longer allowed after a certain point.
                                              Describes herself - wants to be a
                                              good mother, looking for expert
                                              advice, wants to get ideas from other   My question is this: why ask children if they are ready to use the poy, after they are
                                              moms                                    clearly ready to use it (with language tools and bladder control)? Why is it treated like
                                              Narrative- could be working mom,        something that is negotiable or that the child has a choice of either coming around to it or
                                              could be stay at home mom
                                                                                      not? I understand that children are sensitive and you have to follow their lead, at times.
                                              Questions likely to ask - sometimes
This picture captures my life perfectly: an   wants to ask questions/get expert       But allowing them to shit
              adult beverage                  advice
   siing on a book about underpants.
Scenario 1
                                              “I know went through this before with my first child,
Find old answer                               but cannot recall the answer”


                                         Preamble

                                         Experienced mom has a déjà vu moment about a
                                         previous problematic experience with her first child. She
                                         has a partial recollection of a piece of information                                             Success Factors
                                         related to the answer she seeks but she needs help in
                                                                                                                                          • Speed of Comprehension
                                         pulling
                                                                                                                                          • Directness to destination
                                                                                                                                          • Reduced:
                                                                                                                                             • Number of queries
                                                                                                                                             • Number of results
                                                                                                                                          • Indirect Knowledge Transfer

       Thinking aloud in the Family Room
                                                                                                                               Very nice – lists out related
                                                              Josh had not started to cry non-                              concerns for constipation. Let’s
          Hhhm I now I had the                                 stop for 3 hours when it finally                              see: ‘symptoms’, ‘cures’, ‘when to
         same issue with josh,     wwwaaaaaaaa                dawned on me that he had not had                             call the doctor’, ‘what other moms
         but what the heck did   wwwwaaaaaa . . . ggg            a movement for 3 days . . .                                   are saying’, ‘topic over view’
                 I do?            wwwaaaaaaaa . . .
                                                                               Let’s try querying that . . . “no
                                                                                                                                                   Ok – I’ll take ‘cures’ Alex for a 300
                                                                              poop” . . . Not likely . . . Uumm . ..
                                                                                                                                                     points and my personal sanity!
                                                                             “constipation”? Oh, might help to
                                                                                                                                                    Water . . . fruit juice . . . high-fiber
                                                                             specify who as well . . . “baby” . . .
                                                                                                                                                    baby foods - Ahhh prune juice . . .
                                                                                                                                                      prune juice! Now why didn’t I
                                                                                                                                                              remember that!




     After hours of frustration mother home alone has a   Mother starts to type in query but suggest-as-you-           Structured results quickly tip off the mother to the
     partial epiphany as to her child’s problem.          type search box hints to her to be more specific.            assorted aspects of constipation. She focuses in on
                                                                                                                       one of the aspects and has total recollection of her
                                                                                                                       previous experience.
Scenario 2
                                            It’s 2am and I don’t know who to ask?”
Urgent Question



                                              Preamble

                                              Mother of twins finds herself with panicked in the early
                                              morning hours with a new situation.




                                                                                                                                           Success Factors
                                                                                                                                           • Speed of Comprehension
                                                                                                                                           • Directness to answer

                   Crying in the Kitchen
                                                                             I don’t have to read hundreds of             ‘102’ . . . thank god !
                              wwwaaaaaaaa                                  pages on the internet . . . I just need            We’re safe
                            wwwwaaaaaa . . . ggg                                 a quick concise answer . . .
                             wwwaaaaaaaaa . . .
                                        wwwaaaaaaaa                           . . . at what temperature do I need to
             Crap! Who I am I          wwwwaaaaa . . . ggg                                 be worried . . ? !                     Ahhh . . . that’s helpful - other
           supposed to at this         wwwaaaaaaaaa . . .                                                                         conditions to know about . . .
         hour ! Why is it no body is                                                 Please [BabyCenter] show me the
         open when I need them ? !                                                              answer . . !                             That’s thorough : ‘What will the doctor do? ‘
                                                                    wwwaaaaaaaa                                                      Interesting ‘If fever is a defense against infection, is
                                                                  wwwwaaaaaa . . . ggg                                                   it really a good idea to try to bring it down?’
                                                                   wwwaaaaaaaaa . . .
                                                                          wwwaaaaaaaa
                                                                        wwwwaaaaaa . . . ggg
                                                                         wwwaaaaaaaaa . . .
                                                                                                                                                                             Let me book mark this
                                                                                                                                                                                   for later.




     In the middle of the night, a mother of twins finds        Mother starts to type in a query but notices the       The mother zooms in on the specific answer she
     herself alone, overwhelmed, and in dire need of an         suggest-as-you-type search box lets her narrow her     seeks. But then she notices collateral knowledge
     answer.                                                    question boosting her confidence she is going to get   she takes note of for later reading.
                                                                the answer she needs.
CONTENT INDEXING
               TESTING


• Leverages our normal testing skills. And typically what it really
 means is “Performance Testing”.

 • Lot’s   of “integration” testing.
PERFORMANCE TESTING
LEVELS OF SCALING
• Scale   High

  • There     is a quickly hit point of diminishing returns!

• Scale Wide

  • The   safety valve for lots of load.

• Scale   Deep

  • Scaling
          Deep? You are doing some crazy stuff with huge
   indexes!!
                                     65
SCALE WIDE (SLAVES)

• Too   many inbound queries!

• slaves
      poll master for
 changes

• index and config files
 transferred

• ALL   JAVA!

                                66
SCALE WIDE (SHARDING)
• Too     large of an index to query

• Split
      index over multiple Search
 servers

  •A      -> M: Server 1, N -> Z: Server 2

  • uniqueId.hash    % numServers

• Relevancy     typically balanced shards

• Requestsplit across shards, results
 aggregated to single response
                                   67
SCALE DEEP


• Combine  both scaling wide
 to handle number of queries
 with sharding to handle size
 of indexes!




                                68
WRAP UP
User
           Search
Methodology                  Interface         Engine
                                                                 Data

Concurrent
Streams
of
Work
                                                 Itera&on
2
Story:
Opera4onalize
Solr                Deploy
Solr
into
BabyCenter
Test
Environment




                                                    Itera&on
2
Story:
  Search
Analysis                    Integrate
Solr
into
Community
UI,
A/B
TesBng




                                                Itera&on
2
Story:
 Search
Experience                  Conceptual
Model
(Personas,
etc)
&
Mockups




OSC APPROACH TO SEARCH
OSC APPROACH TO SEARCH
RESOURCES

• http://www.scribd.com/doc/17563004/Why-You-Cant-Just-
    Google-for-Enterprise-Knowledge

• http://www.searchtools.com/info/user-interface.html

• http://www.alistapart.com/articles/testing-search-for-relevancy-
    and-precision/

•
SEARCHPATTERNS.ORG
        73
THANK YOU!



• twitter:   dep4b

• speakerrate:   http://www.speakerrate.com/epugh/

• email:   epugh@opensourceconnections.com

                          74

Mais conteúdo relacionado

Destaque

Developing and testing search engine algorithms –
Developing and testing search engine algorithms –Developing and testing search engine algorithms –
Developing and testing search engine algorithms –vonreventlow
 
Ppt stadas klmpok
Ppt stadas klmpokPpt stadas klmpok
Ppt stadas klmpokabiumi01
 
Run test satu sampel
Run test satu sampelRun test satu sampel
Run test satu sampelIpin Rahma
 
Meetup - Using CloudSearch with DynamoDB
Meetup - Using CloudSearch with DynamoDBMeetup - Using CloudSearch with DynamoDB
Meetup - Using CloudSearch with DynamoDBAmazon Web Services
 
Kendall coefficient of concordance
Kendall coefficient of concordance Kendall coefficient of concordance
Kendall coefficient of concordance Zha Jie
 
Search Patterns: Design for Discovery
Search Patterns: Design for DiscoverySearch Patterns: Design for Discovery
Search Patterns: Design for DiscoveryPeter Morville
 
The Modern Web, Part 1: Mobility
The Modern Web, Part 1: MobilityThe Modern Web, Part 1: Mobility
The Modern Web, Part 1: MobilityDavid Pallmann
 
Web of knowledge advanced features
Web of knowledge advanced featuresWeb of knowledge advanced features
Web of knowledge advanced featuresLisa Hartman
 
Using the internet for search
Using the internet for searchUsing the internet for search
Using the internet for searchDr-Heba Mustafa
 
The Modern Web Part 3: Social Networking
The Modern Web Part 3: Social NetworkingThe Modern Web Part 3: Social Networking
The Modern Web Part 3: Social NetworkingDavid Pallmann
 
Calculate your Water Footprint at H2O Conserve
Calculate your Water Footprint at H2O ConserveCalculate your Water Footprint at H2O Conserve
Calculate your Water Footprint at H2O Conserveguest5961519
 
I know how to search the internet,
I know how to search the internet,I know how to search the internet,
I know how to search the internet,Hindie Dershowitz
 
The Modern Web, Part 2: HTML5
The Modern Web, Part 2: HTML5The Modern Web, Part 2: HTML5
The Modern Web, Part 2: HTML5David Pallmann
 
How to search the Internet, a guide to save time and effort
How to search the Internet, a guide to save time and effortHow to search the Internet, a guide to save time and effort
How to search the Internet, a guide to save time and effortPete S
 
Google webmaster guide for starters
Google webmaster guide for startersGoogle webmaster guide for starters
Google webmaster guide for startersjatindsim
 
When worlds Collide: HTML5 Meets the Cloud
When worlds Collide: HTML5 Meets the CloudWhen worlds Collide: HTML5 Meets the Cloud
When worlds Collide: HTML5 Meets the CloudDavid Pallmann
 
Manuele Margni, CIRAIG - Behind the Water Footprint Stream: Metrics and Initi...
Manuele Margni, CIRAIG - Behind the Water Footprint Stream: Metrics and Initi...Manuele Margni, CIRAIG - Behind the Water Footprint Stream: Metrics and Initi...
Manuele Margni, CIRAIG - Behind the Water Footprint Stream: Metrics and Initi...CWS_2010
 

Destaque (20)

One sample runs test
One sample runs testOne sample runs test
One sample runs test
 
Amazon search test case document
Amazon search test case documentAmazon search test case document
Amazon search test case document
 
Developing and testing search engine algorithms –
Developing and testing search engine algorithms –Developing and testing search engine algorithms –
Developing and testing search engine algorithms –
 
Ppt stadas klmpok
Ppt stadas klmpokPpt stadas klmpok
Ppt stadas klmpok
 
Run test satu sampel
Run test satu sampelRun test satu sampel
Run test satu sampel
 
Meetup - Using CloudSearch with DynamoDB
Meetup - Using CloudSearch with DynamoDBMeetup - Using CloudSearch with DynamoDB
Meetup - Using CloudSearch with DynamoDB
 
Kendall coefficient of concordance
Kendall coefficient of concordance Kendall coefficient of concordance
Kendall coefficient of concordance
 
Search Patterns: Design for Discovery
Search Patterns: Design for DiscoverySearch Patterns: Design for Discovery
Search Patterns: Design for Discovery
 
The Modern Web, Part 1: Mobility
The Modern Web, Part 1: MobilityThe Modern Web, Part 1: Mobility
The Modern Web, Part 1: Mobility
 
Web of knowledge advanced features
Web of knowledge advanced featuresWeb of knowledge advanced features
Web of knowledge advanced features
 
Using the internet for search
Using the internet for searchUsing the internet for search
Using the internet for search
 
The Modern Web Part 3: Social Networking
The Modern Web Part 3: Social NetworkingThe Modern Web Part 3: Social Networking
The Modern Web Part 3: Social Networking
 
Calculate your Water Footprint at H2O Conserve
Calculate your Water Footprint at H2O ConserveCalculate your Water Footprint at H2O Conserve
Calculate your Water Footprint at H2O Conserve
 
Affordable web design
Affordable web designAffordable web design
Affordable web design
 
I know how to search the internet,
I know how to search the internet,I know how to search the internet,
I know how to search the internet,
 
The Modern Web, Part 2: HTML5
The Modern Web, Part 2: HTML5The Modern Web, Part 2: HTML5
The Modern Web, Part 2: HTML5
 
How to search the Internet, a guide to save time and effort
How to search the Internet, a guide to save time and effortHow to search the Internet, a guide to save time and effort
How to search the Internet, a guide to save time and effort
 
Google webmaster guide for starters
Google webmaster guide for startersGoogle webmaster guide for starters
Google webmaster guide for starters
 
When worlds Collide: HTML5 Meets the Cloud
When worlds Collide: HTML5 Meets the CloudWhen worlds Collide: HTML5 Meets the Cloud
When worlds Collide: HTML5 Meets the Cloud
 
Manuele Margni, CIRAIG - Behind the Water Footprint Stream: Metrics and Initi...
Manuele Margni, CIRAIG - Behind the Water Footprint Stream: Metrics and Initi...Manuele Margni, CIRAIG - Behind the Water Footprint Stream: Metrics and Initi...
Manuele Margni, CIRAIG - Behind the Water Footprint Stream: Metrics and Initi...
 

Semelhante a Better Search Engine Testing

Test Driven Search Relevancy w/ Quepid
Test Driven Search Relevancy w/ QuepidTest Driven Search Relevancy w/ Quepid
Test Driven Search Relevancy w/ QuepidDoug Turnbull
 
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DCTest Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DCLucidworks (Archived)
 
Hands on Usability Testing (Jonathan Rubin)
Hands on Usability Testing (Jonathan Rubin)Hands on Usability Testing (Jonathan Rubin)
Hands on Usability Testing (Jonathan Rubin)uxpa-dc
 
Information Search Skills
Information Search SkillsInformation Search Skills
Information Search Skillswendy0315
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...Dr. Haxel Consult
 
Evaluating search engines
Evaluating search enginesEvaluating search engines
Evaluating search enginesPhil Bradley
 
Design, Create, Evaluate Process (1).pptx
Design, Create, Evaluate Process (1).pptxDesign, Create, Evaluate Process (1).pptx
Design, Create, Evaluate Process (1).pptxLe Hung
 
Using the ebsc ohost research databases revised 2014
Using the ebsc ohost research databases revised 2014Using the ebsc ohost research databases revised 2014
Using the ebsc ohost research databases revised 2014Oak Hall School
 
UX Field Research Toolkit - A Workshop at Big Design - 2017
UX Field Research Toolkit - A Workshop at Big Design - 2017UX Field Research Toolkit - A Workshop at Big Design - 2017
UX Field Research Toolkit - A Workshop at Big Design - 2017Kelly Moran
 
Five selfish reasons to work reproducibly
Five selfish reasons to work reproduciblyFive selfish reasons to work reproducibly
Five selfish reasons to work reproduciblyFlorian Markowetz
 
Make Enterprise Search Less Broken
Make Enterprise Search Less BrokenMake Enterprise Search Less Broken
Make Enterprise Search Less BrokenHelen Lippell
 
Reactive Writing Techniques for Rewarding and Retaining Users
Reactive Writing Techniques for Rewarding and Retaining UsersReactive Writing Techniques for Rewarding and Retaining Users
Reactive Writing Techniques for Rewarding and Retaining Usersgrebstock
 
Owning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsOwning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsRobert H. McDonald
 
Online research and research skills
Online research and research skillsOnline research and research skills
Online research and research skillsLive Angga
 
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...Codemotion
 
Conducting a Summative Study of EHR Usability: Case Study
Conducting a Summative Study of EHR Usability: Case StudyConducting a Summative Study of EHR Usability: Case Study
Conducting a Summative Study of EHR Usability: Case StudyUXPA Boston
 

Semelhante a Better Search Engine Testing (20)

Test Driven Search Relevancy w/ Quepid
Test Driven Search Relevancy w/ QuepidTest Driven Search Relevancy w/ Quepid
Test Driven Search Relevancy w/ Quepid
 
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DCTest Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
 
Hands on Usability Testing (Jonathan Rubin)
Hands on Usability Testing (Jonathan Rubin)Hands on Usability Testing (Jonathan Rubin)
Hands on Usability Testing (Jonathan Rubin)
 
Information Search Skills
Information Search SkillsInformation Search Skills
Information Search Skills
 
MMM, Search!
MMM, Search!MMM, Search!
MMM, Search!
 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
 
Evaluating search engines
Evaluating search enginesEvaluating search engines
Evaluating search engines
 
Discovering discovery
Discovering discoveryDiscovering discovery
Discovering discovery
 
Design, Create, Evaluate Process (1).pptx
Design, Create, Evaluate Process (1).pptxDesign, Create, Evaluate Process (1).pptx
Design, Create, Evaluate Process (1).pptx
 
Using the ebsc ohost research databases revised 2014
Using the ebsc ohost research databases revised 2014Using the ebsc ohost research databases revised 2014
Using the ebsc ohost research databases revised 2014
 
In-Depth Interviews: Techniques and Best Practices
In-Depth Interviews: Techniques and Best PracticesIn-Depth Interviews: Techniques and Best Practices
In-Depth Interviews: Techniques and Best Practices
 
UX Field Research Toolkit - A Workshop at Big Design - 2017
UX Field Research Toolkit - A Workshop at Big Design - 2017UX Field Research Toolkit - A Workshop at Big Design - 2017
UX Field Research Toolkit - A Workshop at Big Design - 2017
 
Five selfish reasons to work reproducibly
Five selfish reasons to work reproduciblyFive selfish reasons to work reproducibly
Five selfish reasons to work reproducibly
 
Make Enterprise Search Less Broken
Make Enterprise Search Less BrokenMake Enterprise Search Less Broken
Make Enterprise Search Less Broken
 
Reactive Writing Techniques for Rewarding and Retaining Users
Reactive Writing Techniques for Rewarding and Retaining UsersReactive Writing Techniques for Rewarding and Retaining Users
Reactive Writing Techniques for Rewarding and Retaining Users
 
Owning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your PatronsOwning the Discovery Experience for Your Patrons
Owning the Discovery Experience for Your Patrons
 
Online research and research skills
Online research and research skillsOnline research and research skills
Online research and research skills
 
Interview workshop slides final
Interview workshop slides  finalInterview workshop slides  final
Interview workshop slides final
 
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
The Secrets of High Performance: Science Edition - Nicole Forsgren - Codemoti...
 
Conducting a Summative Study of EHR Usability: Case Study
Conducting a Summative Study of EHR Usability: Case StudyConducting a Summative Study of EHR Usability: Case Study
Conducting a Summative Study of EHR Usability: Case Study
 

Mais de OpenSource Connections

How To Structure Your Search Team for Success
How To Structure Your Search Team for SuccessHow To Structure Your Search Team for Success
How To Structure Your Search Team for SuccessOpenSource Connections
 
The right path to making search relevant - Taxonomy Bootcamp London 2019
The right path to making search relevant  - Taxonomy Bootcamp London 2019The right path to making search relevant  - Taxonomy Bootcamp London 2019
The right path to making search relevant - Taxonomy Bootcamp London 2019OpenSource Connections
 
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie HullHaystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie HullOpenSource Connections
 
Haystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
Haystack 2019 Lightning Talk - State of Apache Tika - Tim AllisonHaystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
Haystack 2019 Lightning Talk - State of Apache Tika - Tim AllisonOpenSource Connections
 
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...OpenSource Connections
 
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj BharadwajHaystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj BharadwajOpenSource Connections
 
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...OpenSource Connections
 
Haystack 2019 - Search-based recommendations at Politico - Ryan Kohl
Haystack 2019 - Search-based recommendations at Politico - Ryan KohlHaystack 2019 - Search-based recommendations at Politico - Ryan Kohl
Haystack 2019 - Search-based recommendations at Politico - Ryan KohlOpenSource Connections
 
Haystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon HughesHaystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon HughesOpenSource Connections
 
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey GraingerHaystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey GraingerOpenSource Connections
 
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...OpenSource Connections
 
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...OpenSource Connections
 
Haystack 2019 - Architectural considerations on search relevancy in the conte...
Haystack 2019 - Architectural considerations on search relevancy in the conte...Haystack 2019 - Architectural considerations on search relevancy in the conte...
Haystack 2019 - Architectural considerations on search relevancy in the conte...OpenSource Connections
 
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...OpenSource Connections
 
Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...OpenSource Connections
 
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...OpenSource Connections
 
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah ViaOpenSource Connections
 

Mais de OpenSource Connections (20)

Encores
EncoresEncores
Encores
 
Test driven relevancy
Test driven relevancyTest driven relevancy
Test driven relevancy
 
How To Structure Your Search Team for Success
How To Structure Your Search Team for SuccessHow To Structure Your Search Team for Success
How To Structure Your Search Team for Success
 
The right path to making search relevant - Taxonomy Bootcamp London 2019
The right path to making search relevant  - Taxonomy Bootcamp London 2019The right path to making search relevant  - Taxonomy Bootcamp London 2019
The right path to making search relevant - Taxonomy Bootcamp London 2019
 
Payloads and OCR with Solr
Payloads and OCR with SolrPayloads and OCR with Solr
Payloads and OCR with Solr
 
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie HullHaystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
Haystack 2019 Lightning Talk - The Future of Quepid - Charlie Hull
 
Haystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
Haystack 2019 Lightning Talk - State of Apache Tika - Tim AllisonHaystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
Haystack 2019 Lightning Talk - State of Apache Tika - Tim Allison
 
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
Haystack 2019 Lightning Talk - Relevance on 17 million full text documents - ...
 
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj BharadwajHaystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
Haystack 2019 Lightning Talk - Solr Cloud on Kubernetes - Manoj Bharadwaj
 
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
Haystack 2019 Lightning Talk - Quaerite a Search relevance evaluation toolkit...
 
Haystack 2019 - Search-based recommendations at Politico - Ryan Kohl
Haystack 2019 - Search-based recommendations at Politico - Ryan KohlHaystack 2019 - Search-based recommendations at Politico - Ryan Kohl
Haystack 2019 - Search-based recommendations at Politico - Ryan Kohl
 
Haystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon HughesHaystack 2019 - Search with Vectors - Simon Hughes
Haystack 2019 - Search with Vectors - Simon Hughes
 
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey GraingerHaystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
 
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
Haystack 2019 - Search Logs + Machine Learning = Auto-Tagging Inventory - Joh...
 
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
Haystack 2019 - Improving Search Relevance with Numeric Features in Elasticse...
 
Haystack 2019 - Architectural considerations on search relevancy in the conte...
Haystack 2019 - Architectural considerations on search relevancy in the conte...Haystack 2019 - Architectural considerations on search relevancy in the conte...
Haystack 2019 - Architectural considerations on search relevancy in the conte...
 
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
Haystack 2019 - Custom Solr Query Parser Design Option, and Pros & Cons - Ber...
 
Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...Haystack 2019 - Establishing a relevance focused culture in a large organizat...
Haystack 2019 - Establishing a relevance focused culture in a large organizat...
 
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
Haystack 2019 - Solving for Satisfaction: Introduction to Click Models - Eliz...
 
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
2019 Haystack - How The New York Times Tackles Relevance - Jeremiah Via
 

Último

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 

Último (20)

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 

Better Search Engine Testing

  • 1. BETTER SEARCH ENGINE TESTING STPCON 2011 | EPUGH@O19S.COM | @DEP4B
  • 2. WHY AM I QUALIFIED TO BE UP HERE? • President of OpenSource Connections • Contributor to CruiseControl and Continuum CI projects • Member of Apache Software Foundation • Presenter at conferences (OSCON, ApacheCON, jTDS, ExpoQA, STPcon 2009!)
  • 7. AGENDA Why is Search Becoming More Important? What is a Search Engine? Techniques for Testing Wrap Up
  • 8. WHY IS SEARCH BECOMING MORE IMPORTANT?
  • 9. INFORMATION IS EXPLODING “information workers ... are each bombarded with1.6 gigabytes of information on average every day through emails, reports, blogs, text messages,calls and more”. • http://online.wsj.com/article/SB124252211780027326.html
  • 10. UNSTRUCTURED • emails, spreadsheets, documents, presentations, images, databases • 75% unstructured to 25% structured
  • 11. MANAGING DATA IS EXPENSIVE •1 GB costs $.20 to store •1 GB costs $3500 to Manage
  • 12. WHAT DOES 3500 BUY YOU? • 69% of respondents felt 50% or less of data could be found online • Knowledge workers spend 25% of their time engaged in search-related activities.
  • 13. WHY NOT JUST USE GOOGLE • We don’t want 44 million results, we want 1 • we want “the” answer, not “an” answer • we tolerate inefficieny in the Internet search As John Allenhappy toputs it: “The Internet is • We are Paulos “satisfice” the world's largest library.It's just that all the books are on the floor.”
  • 14. WHAT IS A SEARCH ENGINE?
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20. THREE STAGES OF SEARCH
  • 21. CONTENT INDEXING •- creating an index by crawling the content directories, databases, other repositories using an automated process (either pushing or pulling changes) • create an Index, which is a searchable key to a collection. • In Enterprise Search, the indexing mechanism should be able to access company private data (with access privileges maintained) • control indexing schedule - being able to index rapidly changing content quickly, other content more slowly. • rather than having the bot look for the data.
  • 22. CONTENT INDEXING • Indexing may also support • Metadata extraction • Auto-summarization, which is analyse of the collection and group its content into categories or clusters. • Metadatain turn becomes facets that can be used to tune the query to put emphasis on that category.
  • 26. FACETING Faceted or "guided navigation" leverages metadata fields and values to provide users with visible options for narrowing or refining their query. - Peter Morville, Search Patterns 26
  • 27. Search
Stack User
Interface Search
Engine Data
  • 28. HOW DO WE TEST?
  • 29. HOW DO WE TEST? • Querying • Formatting • Content Indexing • Performance
  • 31. CHALLENGES • Competing business stakeholders: • Tester: When I search for “lamp shades”, I used to see these documents, now I see a differing set. • Business Owner: How do I know that the new search engine is better? • User: My pet feature “search within these results” works differently. • Marketing Guy: I want to control the results so the current marketing push for toilet paper brand X always shows up at the top.
  • 32. CHALLENGES • Stakeholders want a better search implementation, but perversely often want it to all work “the exact same way”.   Getting agreement across all the stakeholders for the project vision, and agree on the metrics is a challenge.
  • 33. PERFECT SEARCH TESTER WOULD BE ALL OF • Mathematician • Business Analyst • Librarian • Systems Engineer • UX Expert • Geographer! • Writer • Psychologist • Programmer
  • 34. KNOWLEDGE TRANSFER • If you don’t have the perfect team already, bring in experts and do domain knowledge transfer. • Learn the vocabulary of search to better communicate together • “auto complete” vs “auto suggest” • Do “Search for Content Team” brownbag sessions!
  • 35. QUERY TESTING • Often called “relevancy testing”
  • 36. TWO SCHOOLS OF THOUGHT • “One True Answer” • “I know it when I see it”
  • 37. “ONE TRUE ANSWER” • Absolute Truth / Matrix / Grid / TREC / Relevancy Assertions • The correct answers for each search are known ahead of time • Humans judges often decide these correct answers, stored as Relevancy Assertions • Can be labor intensive to setup • A “Numerical Grade” is produced for comparision
  • 38. PROBLEMS WITH THIS APPROACH • Open to gaming. TREC competition is swamped by “academic” search engine efforts that don’t work in the real world. • Needa well understood data set with generally accepted answers. is it better to have an engine that gives modestly relevant results almost all the time, or an engine that gives really good answers sometimes, better on average than the other engine, but sometimes gives back complete garbage?
  • 39. A/B TESTING Engine version 1 and version 2! • Tracks explicit or implicit preferences between engines A/B • Often dispenses with the notion of the "correct" answer • Canbe easier to setup, but some fear the best answers will be missed by both engines
  • 40. RELEVANCY • Do we have any defined relevancy metrics? • Relevancy is like porn.....
  • 41. I KNOW IT WHEN I SEE IT! http://en.wikipedia.org/wiki/Les_Amants
  • 42. BEYOND PRECISION AND RECALL: HOW ENGINES ARE • Binary vs. Non-Binary Grading Systems • Early TREC had binary judgements, only Yes/No on whether each doc was related to a test search • More choices were later added •A system can use letter grades (A, B, C, D and F) or numeric grades • Another style asks testers to sort documents in their preferred order
  • 43. CLASSIC MEASUREMENTS OF SEARCH RELEVANCY • Recall: "Did I find all the documents I expected to get back?  What percent?" • Precision: "Did the system bring back other documents that weren't relevant?  What percent were on target?"
  • 44. NEWER IDEAS • Rank: The order of documents that were returned • Generally a 1 in 20 match in the #1 spot is better than a 50% rate where all matches are on the second page.
  • 45. INTERACTIVITY: WHAT NAVIGATORS OR VISUALIZATION WERE GIVEN • Facets and sorting: Clickable filters and sort options • Unsupervised Clustering: Related terms or phrases, or related searches • Spelling and thesaurus suggestions
  • 46. SUBJECT DISAMBIGUATION, SENTIMENT, CONFLICTING INFORMATION, CROWD HINTS • kidney bean or kidney cell • "best football team in the UK"
  • 47.
  • 48. SOURCES OF VARIANCE, AKA "PROBLEMS" Note, this is talking about comparing search engine a to search engine b. But I am thinking more in the context of search engive v1 to v2!
  • 49. DIFFERENT GOALS • Perfect/Human vs. Best vs. Acceptable vs. Better than X • Constrained vs. Unconstrained Resources (time, cpu, storage)
  • 50. SAMPLE SIZE • Amount of Data • Fixed set or growing over time • Number of Testers (AB or Relevancy Judgments) • Number of Searches
  • 51. VERTICAL VS. HORIZONTAL CONTENT • Oneextreme: Specific demo may cover just one discipline, for example Medical Journals • Other extreme: Internet covers vastly disparate domains
  • 52. USERS • Experienced vs. New Searcher • Subject Expert vs. Novice • Spelling, typing and computer proficiency • InterfaceMedium (large visual display, small text display, audible, Braille, etc) • Amount of Effort to understand Search • Willingness to Iterate • Searching for specific answer vs. General Exploration
  • 53. TYPE OF SEARCHES • Length / 1 or 2 words • Full question • Sample text • Internet Boolean • Advanced Boolean / Syntax / Proximity • Wildcard, Regex, etc.
  • 54. PUNCTUATION • Chemical • Source Code • Units of Measure • Literal vs. Search Operator
  • 55. NOT EVEN GETTING INTO MULTI LINGUAL SEARCH • How do I test in languages I don’t understand?
  • 57. FORMATTING TESTING • Directly builds on most of our existing test skills.
  • 59. Persona 1: Going to be a mom Oh my God Needy I’m actually Pregnant! Narrative Self Introduction Hi all, I'm very new to this but i couldn't help but share my excitement. I have just found out today that i am pregnant. It wasn't planned, me and my partner of a year and a half were going to wait until we had our own place and were married first but it looks like we have done it the other way round. What’s next? What am I supposed to do? My only concern is that i don't really know how my boyfriend feels about it. I Guidance please! know we need to discuss the options but i have really already made up my mind about what i want to do. There is so much to consider, money, a decent To interact with people going place to live, being ready but i know i am ready and have been for a long time ( I through the same get extremely broody when i see my friends kids) thing. Should i just tell him how i feel or go with how he feels because i don't want to Scenarios that typify - lose him. He is a loving partner who would stand by me through anything i just planned to get pregnant, but don’t want him to feel like i am tying him down!!! I suppose i am feeling very happy but also very confused at the same time!!! hasn't done any research Catch phrases - Nervous but excited, giddy, Where do I hp://forum.sofeminine.co.uk/forum/maternite1/__f468_maternite1-Oh- start? my-god-i-m-pregnant.html Tag lines - Wants to share, has a million questions Likely to say - Guide me, help me get off to a good start
  • 60. Persona 2: New Mom Are my kids sick or is Demanding this condition normal? How do I…? Narrative I have been hearing about women who claim that thier 2, 2 1/2 or 3 year old is not ready for the poy. They claim its a nightmare and are waiting for their children to come around. Maybe I grew up in the twilight zone, but I had always assumed that poy training was something that is just done. Its done when: How do I ensure my a) The child in question can sleep through the night and stay dry. baby is latching on b) The child in question can speak to you, in full sentences. like, "apple juice, please" or correctly? "wanna go to the park" or "momma I wanna hold you..." c) The child in question knows they are soiled and can ask to be changed. What type of stroller Barring any of those things, a child is ready to be placed on the poy. using the poy was should I buy? What never negotiable in my family. When we hit the above milestones my mother trained us. We brand of car seat is just did it. If we complained she never put diapers on us, she just kept directing us back to best? the poy. Her methods of redirection may be controversial (she told my brother that unless he was a big boy he would not get a happy meal. Boys who pooed on themselves got sad meals... lol!!! He straightened up and started using the poy at 2 1/2) but she was never abusive or anything she just DIDNT ASK US. it was time to poy and that was it. Scenarios that Typify The reasoning was that I used to drink from a bole, and sleep with my mother, and such, Likely to say -Are my kids sick or is this condition normal? now I don't. I also used to crap my pants, and that is no longer allowed after a certain point. Describes herself - wants to be a good mother, looking for expert advice, wants to get ideas from other My question is this: why ask children if they are ready to use the poy, after they are moms clearly ready to use it (with language tools and bladder control)? Why is it treated like Narrative- could be working mom, something that is negotiable or that the child has a choice of either coming around to it or could be stay at home mom not? I understand that children are sensitive and you have to follow their lead, at times. Questions likely to ask - sometimes This picture captures my life perfectly: an wants to ask questions/get expert But allowing them to shit adult beverage advice siing on a book about underpants.
  • 61. Scenario 1 “I know went through this before with my first child, Find old answer but cannot recall the answer” Preamble Experienced mom has a déjà vu moment about a previous problematic experience with her first child. She has a partial recollection of a piece of information Success Factors related to the answer she seeks but she needs help in • Speed of Comprehension pulling • Directness to destination • Reduced: • Number of queries • Number of results • Indirect Knowledge Transfer Thinking aloud in the Family Room Very nice – lists out related Josh had not started to cry non- concerns for constipation. Let’s Hhhm I now I had the stop for 3 hours when it finally see: ‘symptoms’, ‘cures’, ‘when to same issue with josh, wwwaaaaaaaa dawned on me that he had not had call the doctor’, ‘what other moms but what the heck did wwwwaaaaaa . . . ggg a movement for 3 days . . . are saying’, ‘topic over view’ I do? wwwaaaaaaaa . . . Let’s try querying that . . . “no Ok – I’ll take ‘cures’ Alex for a 300 poop” . . . Not likely . . . Uumm . .. points and my personal sanity! “constipation”? Oh, might help to Water . . . fruit juice . . . high-fiber specify who as well . . . “baby” . . . baby foods - Ahhh prune juice . . . prune juice! Now why didn’t I remember that! After hours of frustration mother home alone has a Mother starts to type in query but suggest-as-you- Structured results quickly tip off the mother to the partial epiphany as to her child’s problem. type search box hints to her to be more specific. assorted aspects of constipation. She focuses in on one of the aspects and has total recollection of her previous experience.
  • 62. Scenario 2 It’s 2am and I don’t know who to ask?” Urgent Question Preamble Mother of twins finds herself with panicked in the early morning hours with a new situation. Success Factors • Speed of Comprehension • Directness to answer Crying in the Kitchen I don’t have to read hundreds of ‘102’ . . . thank god ! wwwaaaaaaaa pages on the internet . . . I just need We’re safe wwwwaaaaaa . . . ggg a quick concise answer . . . wwwaaaaaaaaa . . . wwwaaaaaaaa . . . at what temperature do I need to Crap! Who I am I wwwwaaaaa . . . ggg be worried . . ? ! Ahhh . . . that’s helpful - other supposed to at this wwwaaaaaaaaa . . . conditions to know about . . . hour ! Why is it no body is Please [BabyCenter] show me the open when I need them ? ! answer . . ! That’s thorough : ‘What will the doctor do? ‘ wwwaaaaaaaa Interesting ‘If fever is a defense against infection, is wwwwaaaaaa . . . ggg it really a good idea to try to bring it down?’ wwwaaaaaaaaa . . . wwwaaaaaaaa wwwwaaaaaa . . . ggg wwwaaaaaaaaa . . . Let me book mark this for later. In the middle of the night, a mother of twins finds Mother starts to type in a query but notices the The mother zooms in on the specific answer she herself alone, overwhelmed, and in dire need of an suggest-as-you-type search box lets her narrow her seeks. But then she notices collateral knowledge answer. question boosting her confidence she is going to get she takes note of for later reading. the answer she needs.
  • 63. CONTENT INDEXING TESTING • Leverages our normal testing skills. And typically what it really means is “Performance Testing”. • Lot’s of “integration” testing.
  • 65. LEVELS OF SCALING • Scale High • There is a quickly hit point of diminishing returns! • Scale Wide • The safety valve for lots of load. • Scale Deep • Scaling Deep? You are doing some crazy stuff with huge indexes!! 65
  • 66. SCALE WIDE (SLAVES) • Too many inbound queries! • slaves poll master for changes • index and config files transferred • ALL JAVA! 66
  • 67. SCALE WIDE (SHARDING) • Too large of an index to query • Split index over multiple Search servers •A -> M: Server 1, N -> Z: Server 2 • uniqueId.hash % numServers • Relevancy typically balanced shards • Requestsplit across shards, results aggregated to single response 67
  • 68. SCALE DEEP • Combine both scaling wide to handle number of queries with sharding to handle size of indexes! 68
  • 70. User
 Search Methodology Interface Engine Data Concurrent
Streams
of
Work Itera&on
2
Story: Opera4onalize
Solr Deploy
Solr
into
BabyCenter
Test
Environment Itera&on
2
Story: Search
Analysis Integrate
Solr
into
Community
UI,
A/B
TesBng Itera&on
2
Story: Search
Experience Conceptual
Model
(Personas,
etc)
&
Mockups OSC APPROACH TO SEARCH
  • 71. OSC APPROACH TO SEARCH
  • 72. RESOURCES • http://www.scribd.com/doc/17563004/Why-You-Cant-Just- Google-for-Enterprise-Knowledge • http://www.searchtools.com/info/user-interface.html • http://www.alistapart.com/articles/testing-search-for-relevancy- and-precision/ •
  • 74. THANK YOU! • twitter: dep4b • speakerrate: http://www.speakerrate.com/epugh/ • email: epugh@opensourceconnections.com 74

Notas do Editor

  1. \n
  2. \n
  3. \n
  4. Soon to be in a Spanish language newsletter!\n
  5. \n
  6. \n
  7. Wrap up should have plenty of time for questions\n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. \n
  17. \n
  18. \n
  19. \n
  20. \n
  21. he first stage is Content Indexing - creating an index by crawling the content directories, databases\nand other repositories using an automated process (a bot). The aim is to create an Index, which is a\nsearchable key to a collection. In Enterprise Search, the indexing mechanism should be able to\naccess company private data (with access privileges maintained), and you should have controlover\nthe indexing schedule - being able to index rapidly changing content quickly, other content more\nslowly. The bot may also be augmented with feed-in sources, which submit information to the index\nrather than having the bot look for the data.Indexing may also support Metadata extraction and\nAuto-summarization, which is to analyse the collection and group its content into categories or\nclusters. These in turn become facets that can be used to tune the query to put emphasis on that\ncategory.\n
  22. \n
  23. he first stage is Content Indexing - creating an index by crawling the content directories, databases\nand other repositories using an automated process (a bot). The aim is to create an Index, which is a\nsearchable key to a collection. In Enterprise Search, the indexing mechanism should be able to\naccess company private data (with access privileges maintained), and you should have controlover\nthe indexing schedule - being able to index rapidly changing content quickly, other content more\nslowly. The bot may also be augmented with feed-in sources, which submit information to the index\nrather than having the bot look for the data.Indexing may also support Metadata extraction and\nAuto-summarization, which is to analyse the collection and group its content into categories or\nclusters. These in turn become facets that can be used to tune the query to put emphasis on that\ncategory.\n
  24. engine processes the query, using the index, and finds any content that matches the search subject. The results can be further processed by sorting them by relevance or other clustering logic such as 'recommended' or 'best bets'.\n
  25. wolfram alpha the results are tabulated in some way for the end-user, usually according to a template. But this could also be any other sort of data visualization, such as a tag cloud. Again, this should take into account local user rights to the indexed data, and access control\nmust be applied to the search result content\n
  26. afd\n
  27. When the data being searched has high quality metadata, it lends itself to search using a faceted classification system. As Wikipedia puts it, this "allows the assignment of multiple classifications to an object, enabling the classifications to be ordered in multiple ways, rather than in a single, pre-determined, taxonomic order". This may sound complex, but you have almost certainly used such a system on an e-commerce site which separates products into different departments e.g. Electronics\n> Computers > Laptops. Enterprises are now bringing faceted search to document control, where the facets are metadata such as author, edition, keyword, access permissions, and so on.\n\n
  28. \n
  29. This is really about an existing suite of tests that are NOT under CI.\n
  30. \n
  31. \n
  32. \n
  33. \n
  34. \n
  35. \n
  36. \n
  37. \n
  38. It's nice to be able to compare one search engine to another in some unbiased way.  You point a couple search engines at the same documents and run some searches.  The test tool then pops a numerical grade.  These grades can either compare each engine to what humans have said are the correct answers, or compares one engine directy to another, and at least determins which one did better.  TREC has been one of the groups trying to perfect this type of testing over the years.\n
  39. As simple as this sounds, quite a few issues come up that make this more difficult than you might think:\nYou need to get a large set of documents and searches to test with.  This can be a problem for copyright reasons, among other things.\nYou need to come up with searches that represent the types of things real users would search for, and get everybody to agree on what's a fair test search.\nGetting humans to sit down and record what the best matches should be is very time consuming for large data sets.\nThen you need to decide which mathematical formulas you'll use to crank out the final scores.  For example, is it better to have an engine that gives modestly relevant results almost all the time, or an engine that gives really good answers sometimes, better on average than the other engine, but sometimes gives back complete garbage?\n\n
  40. \n
  41. \n
  42. famously said by Supreme Court Justice Potter Stewart in 1964 about the french film The Lovers. \nhe didn’t consider it obcene...\n
  43. \n
  44. \n
  45. \n
  46. \n
  47. \n
  48. \n
  49. \n
  50. \n
  51. \n
  52. \n
  53. \n
  54. \n
  55. \n
  56. \n
  57. \n
  58. \n
  59. \n
  60. \n
  61. \n
  62. Statistics: 45% of BC Users are in pregnancy\n
  63. http://uk.answers.yahoo.com/question/index?qid=20100905034150AAJ21bT \nhttp://www.diaryofanewmom.net/ \n
  64. \n
  65. \n
  66. \n
  67. \n
  68. \n
  69. Some debate on Solr 1.5 or Solr 3.0 to match lucene versions\n
  70. Look at moving to multiple Solr servers. If your queries run quickly with an acceptable avgTimePerRequest, then replicate your complete index across multiple Solr servers in a master/slave configuration. \n
  71. If your queries are running slowly, then use sharding to split the load of processing a single query across multiple sharded Solr servers. Both these are approaches that can be considered as scaling wider.\n
  72. \n
  73. \n
  74. \n
  75. \n
  76. \n
  77. Its both informative about patterns.. behaviour, structural, but also a call to arms for making the world better!\n
  78. \n