See conference video - http://www.lucidimagination.com/devzone/events/conferences/ApacheLuceneEurocon2011
In this case study I'll discuss architectural lessons learned from refactoring an existing REST-API backed by Apache Solr. The initial goal of the refactoring was to speed up data access while scaling from 5m documents to 20-50m documents stored in Solr. Under consideration was the hosting infrastructure, the REST API Java code and the Solr documents and configuration. In this talk I'll give a brief review of the results.
"Pimping" the Solr configuration, the client access and the document structure achieved better results. But the elementary lesson learned was, that a significant increase of data access speed can only be realized with a functional redesign and a simplification of the REST API. NO CAPS ON CORES & SHARDS) I'll explain how this led us directly to distinct Solr cores and why we dropped the introduction of Solr shards or a breathing cloud infrastructure.
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Lessons Learned: Refactoring a Solr-Based API App - Torsten Koester
1. Architectural lessons learned from refactoring a
Solr based API application.
Torsten Bøgh Köster (Shopping24) Apache Lucene Eurocon, 19.10.2011
2. Contents
Shopping24 and it‘s API
Technical scaling solutions
Sharding
Caching
Solr Cores
„Elastic“ infrastructure
business requirements as key
factor
3. @tboeghk
Software- and systems- architect
2 years experience with Solr
3 years experience with Lucene
Team of 7 Java developers currently at Shopping24
8. index fact time
•16 Gig Data
•Single-Core-Layout
•Up to 17s response time
•Machine size limited
•Stalled at solr version 1.4
•API designed for small
tools
10. ask the nerds
„Shard!“
That‘ll be fun!
„Use spare compute cores at Amazon?“
breathe load into the cloud
„Reduce that index size“
„Get rid of those long running queries!“