SlideShare a Scribd company logo
1 of 15
Leveraging Data:
Building a Stable Platform
Ophir Cohen, Data Platform Lead, ophirc@liveperson.com
Amit Fainer, Data QA Lead, amitfa@liveperson.com
May, 2013
Connection before content… 2
 Who was the commander of whom in the army?
 Who met his wife in India?
Agenda 3
 Connection before content
 LivePerson Is…
 Data platform requirements
 Quality challenges
 Architecture
 Development and production processes
 Case study: LivePerson BI Reports
LivePerson Is…
Mission:
4
Company
• Cloud-computing, SaaS pioneer since 1998
• IPO April 2000 (Nasdaq: LPSN); debt free
• 700+ employees
• LivePerson offers an extensive and rapidly-growing partner network
Customers
• 8,500 customers around the globe have chosen LivePerson to create secure,
reliable connections with their customers. LivePerson clients include:
• 8 of the top 10 Fortune 500 companies
•Top 10 of 15 commercial banks (Fortune 500)
•Top 4 of 5 telecommunication companies (Fortune 500)
•4 of the top 7 of the Forbes Global 2000
•5 of the top 6 software and services companies (Forbes 2000)
•8 of the top 10 of Interbrand's Best Global Brands
Service Delivery
• 1.8 billion visitors monitored per month
• 20 million connections per month
• Analyzes over 1.2 million documents and chat transcripts per month.
Mission
Creating
Meaningful
Customer
Connections
Live Chat and Click-to-Call
Vendor 2012
Enterprise Customer Success & Domain Expertise
Finance
High–Tech
Retail
Telecom
Travel
5
Requirements 6
 Massive Data flow (few TB a day)
 Different Data types, Different Producers
 Never Lose Data!
 Variety latency needs – Near real-time through Offline
 Data is accessible to everyone for Processing, in a standardized,
common paradigm, adopted by all consumers and producers
Quality Challenges 7
 Large volumes of Data – Automate or Die
 Bugs yield corrupted Data
 Produced data stays Forever
 Consumers need a standardized form to assure data integrity
Architecture 8
Kafka
Data Tier
Application Tier
Storm
Hadoop
Pig
Java MR
Hive
Architecture – Persistency Layer 9
Kafka
Data Tier
Application Tier
Storm
Hadoop
Pig
Java MR
Hive
Kafka (by LinkedIn):
• Queuing mechanism
• Persistency layer
• High availability layer
Architecture – Streaming Processing Layer 10
Kafka
Data Tier
Application Tier
Storm
Hadoop
Pig
Java MR
Hive
Storm (by Twitter)
• Stream processing
• Pluggable framework
Architecture – Batch Processing Layer 11
Kafka
Data Tier
Application Tier
Storm
Hadoop
Pig
Java MR
Hive
Hadoop (an Apache Project)
• Reliable, scalable, distributed
computing framework
• Rich eco-system
Develop, Test and Deploy at Scale 12
 Automated, Continuously integrated with built-in Performance
testing
 Satisfying Monitoring and Auditing needs of Tiers 1 through 5
 On going production tests
 Auditing mechanism
 Scrum
 Isolated production-mirrored environment for Testing
Case Study – LivePerson BI Reports 13
Case Study – LivePerson BI Reports 14
 Source to target
 Auditing tool as part of data integrity tests
 Load tests in real data env
Thank You 15
LivePerson Hire!
Feel free to reach out:
 ophirc@liveperson.com
 @ophchu
 amitfa@liveperson.com

More Related Content

More from Taldor Group

פיני מנדל תובנות עסקיות מיישומי Hadoop
פיני מנדל   תובנות עסקיות מיישומי Hadoopפיני מנדל   תובנות עסקיות מיישומי Hadoop
פיני מנדל תובנות עסקיות מיישומי HadoopTaldor Group
 
נתן פרידחי הקדמה לכנס Hadoop
נתן פרידחי   הקדמה לכנס Hadoopנתן פרידחי   הקדמה לכנס Hadoop
נתן פרידחי הקדמה לכנס HadoopTaldor Group
 
הערך העסקי שבאיכות הנתונים קוסטין מרזאה
הערך העסקי שבאיכות הנתונים   קוסטין מרזאההערך העסקי שבאיכות הנתונים   קוסטין מרזאה
הערך העסקי שבאיכות הנתונים קוסטין מרזאהTaldor Group
 
Dcl צביקה מנלה - סיפורי לקוחות
Dcl   צביקה מנלה - סיפורי לקוחותDcl   צביקה מנלה - סיפורי לקוחות
Dcl צביקה מנלה - סיפורי לקוחותTaldor Group
 
Taldor data quality einat shimoni - stki
Taldor data quality   einat shimoni - stkiTaldor data quality   einat shimoni - stki
Taldor data quality einat shimoni - stkiTaldor Group
 
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 32013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3Taldor Group
 
Loshin operationalizingdatagovernance
Loshin operationalizingdatagovernanceLoshin operationalizingdatagovernance
Loshin operationalizingdatagovernanceTaldor Group
 

More from Taldor Group (7)

פיני מנדל תובנות עסקיות מיישומי Hadoop
פיני מנדל   תובנות עסקיות מיישומי Hadoopפיני מנדל   תובנות עסקיות מיישומי Hadoop
פיני מנדל תובנות עסקיות מיישומי Hadoop
 
נתן פרידחי הקדמה לכנס Hadoop
נתן פרידחי   הקדמה לכנס Hadoopנתן פרידחי   הקדמה לכנס Hadoop
נתן פרידחי הקדמה לכנס Hadoop
 
הערך העסקי שבאיכות הנתונים קוסטין מרזאה
הערך העסקי שבאיכות הנתונים   קוסטין מרזאההערך העסקי שבאיכות הנתונים   קוסטין מרזאה
הערך העסקי שבאיכות הנתונים קוסטין מרזאה
 
Dcl צביקה מנלה - סיפורי לקוחות
Dcl   צביקה מנלה - סיפורי לקוחותDcl   צביקה מנלה - סיפורי לקוחות
Dcl צביקה מנלה - סיפורי לקוחות
 
Taldor data quality einat shimoni - stki
Taldor data quality   einat shimoni - stkiTaldor data quality   einat shimoni - stki
Taldor data quality einat shimoni - stki
 
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 32013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
2013 04 irm mdmdg - jon asprey 4 most asked dg questions v 1 3
 
Loshin operationalizingdatagovernance
Loshin operationalizingdatagovernanceLoshin operationalizingdatagovernance
Loshin operationalizingdatagovernance
 

Recently uploaded

SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Recently uploaded (20)

E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

Live person under_the_hood_taldor_for_publish

  • 1. Leveraging Data: Building a Stable Platform Ophir Cohen, Data Platform Lead, ophirc@liveperson.com Amit Fainer, Data QA Lead, amitfa@liveperson.com May, 2013
  • 2. Connection before content… 2  Who was the commander of whom in the army?  Who met his wife in India?
  • 3. Agenda 3  Connection before content  LivePerson Is…  Data platform requirements  Quality challenges  Architecture  Development and production processes  Case study: LivePerson BI Reports
  • 4. LivePerson Is… Mission: 4 Company • Cloud-computing, SaaS pioneer since 1998 • IPO April 2000 (Nasdaq: LPSN); debt free • 700+ employees • LivePerson offers an extensive and rapidly-growing partner network Customers • 8,500 customers around the globe have chosen LivePerson to create secure, reliable connections with their customers. LivePerson clients include: • 8 of the top 10 Fortune 500 companies •Top 10 of 15 commercial banks (Fortune 500) •Top 4 of 5 telecommunication companies (Fortune 500) •4 of the top 7 of the Forbes Global 2000 •5 of the top 6 software and services companies (Forbes 2000) •8 of the top 10 of Interbrand's Best Global Brands Service Delivery • 1.8 billion visitors monitored per month • 20 million connections per month • Analyzes over 1.2 million documents and chat transcripts per month. Mission Creating Meaningful Customer Connections Live Chat and Click-to-Call Vendor 2012
  • 5. Enterprise Customer Success & Domain Expertise Finance High–Tech Retail Telecom Travel 5
  • 6. Requirements 6  Massive Data flow (few TB a day)  Different Data types, Different Producers  Never Lose Data!  Variety latency needs – Near real-time through Offline  Data is accessible to everyone for Processing, in a standardized, common paradigm, adopted by all consumers and producers
  • 7. Quality Challenges 7  Large volumes of Data – Automate or Die  Bugs yield corrupted Data  Produced data stays Forever  Consumers need a standardized form to assure data integrity
  • 8. Architecture 8 Kafka Data Tier Application Tier Storm Hadoop Pig Java MR Hive
  • 9. Architecture – Persistency Layer 9 Kafka Data Tier Application Tier Storm Hadoop Pig Java MR Hive Kafka (by LinkedIn): • Queuing mechanism • Persistency layer • High availability layer
  • 10. Architecture – Streaming Processing Layer 10 Kafka Data Tier Application Tier Storm Hadoop Pig Java MR Hive Storm (by Twitter) • Stream processing • Pluggable framework
  • 11. Architecture – Batch Processing Layer 11 Kafka Data Tier Application Tier Storm Hadoop Pig Java MR Hive Hadoop (an Apache Project) • Reliable, scalable, distributed computing framework • Rich eco-system
  • 12. Develop, Test and Deploy at Scale 12  Automated, Continuously integrated with built-in Performance testing  Satisfying Monitoring and Auditing needs of Tiers 1 through 5  On going production tests  Auditing mechanism  Scrum  Isolated production-mirrored environment for Testing
  • 13. Case Study – LivePerson BI Reports 13
  • 14. Case Study – LivePerson BI Reports 14  Source to target  Auditing tool as part of data integrity tests  Load tests in real data env
  • 15. Thank You 15 LivePerson Hire! Feel free to reach out:  ophirc@liveperson.com  @ophchu  amitfa@liveperson.com

Editor's Notes

  1. We need to update this slide
  2. The biggest in the areaAll fields: finance, telecom etc…