SlideShare uma empresa Scribd logo
1 de 14
Big Data in REAL TIME
Ron Zavner
We’re Living in a Real Time World…
        Social                           User Tracking &                 Homeland Security
                                          Engagement




      eCommerce                       Financial Services                 Real Time Search




2                 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
The Flavors of Big Data Analytics




       Counting                                Correlating               Research




3                 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
Twitter in Numbers (March 2011)



     It takes a week for users to
     send    1 billion tweets
                                                       Source: http://blog.twitter.com/2011/03/numbers.html

4            ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
Twitter in Numbers (March 2011)



                   On average,
           140 million
      tweets get sent every day
                                                       Source: http://blog.twitter.com/2011/03/numbers.html

5            ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
Twitter in Numbers (March 2011)



            The highest
        throughput to date is
6,939 tweets/sec.
                                                       Source: http://blog.twitter.com/2011/03/numbers.html

6            ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
Twitter in Numbers (March 2011)



         460,000 new
          accounts
            are created daily
                                                       Source: http://blog.twitter.com/2011/03/numbers.html

7            ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
Challenge – Word Count
           Tweets




8
                                     ?
             ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
                                                                    Count
                                                                     Count
                                                                             Word:Count
Analyze the Problem
       Thousands of tweets per second to process
       Aggregate counters for each word
       Latency – less than a second
       System needs to linearly scale
       System needs to be fault tolerant
       Querying & Persisting Data
       Managing the system




9                ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
Tier Based Architecture?




10        ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
Data Grid 




11        ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
Putting it all together




12         ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
The 3 Most Popular Words on Twitter?



                  1. Just
                  2. Found
                  3. Love
                                                                 - August 2012

13        ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
Q&A




       RonZ@gigaspaces.com

14      ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved

Mais conteúdo relacionado

Semelhante a Big Data in Real Time

Search Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL BackendSearch Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL BackendSematext Group, Inc.
 
Bigdata analytics-twitter
Bigdata analytics-twitterBigdata analytics-twitter
Bigdata analytics-twitterdfilppi
 
Project Controls Expo, 13th Nov 2013 - "A new visual way to engage executive ...
Project Controls Expo, 13th Nov 2013 - "A new visual way to engage executive ...Project Controls Expo, 13th Nov 2013 - "A new visual way to engage executive ...
Project Controls Expo, 13th Nov 2013 - "A new visual way to engage executive ...Project Controls Expo
 
Learn The Characteristics Of World Class Communities
Learn The Characteristics Of World Class CommunitiesLearn The Characteristics Of World Class Communities
Learn The Characteristics Of World Class CommunitiesTelligent
 
Alfresco digital assetmanagement-042111-final
Alfresco digital assetmanagement-042111-finalAlfresco digital assetmanagement-042111-final
Alfresco digital assetmanagement-042111-finalEmil Loreto
 
Social Radar 3.0 Deck
Social Radar 3.0 DeckSocial Radar 3.0 Deck
Social Radar 3.0 DeckJohn Mumford
 
Big data and APIs for PHP developers - SXSW 2011
Big data and APIs for PHP developers - SXSW 2011Big data and APIs for PHP developers - SXSW 2011
Big data and APIs for PHP developers - SXSW 2011Eli White
 
Leveraging open source for big data stack
Leveraging open source for big data stackLeveraging open source for big data stack
Leveraging open source for big data stackFlytxt
 
How to measurably increase your email response rates webinar.041411.1
How to measurably increase your email response rates webinar.041411.1How to measurably increase your email response rates webinar.041411.1
How to measurably increase your email response rates webinar.041411.1Trinity Web Works
 
Digital Asset Management with Alfresco
Digital Asset Management with AlfrescoDigital Asset Management with Alfresco
Digital Asset Management with Alfrescorivetlogic
 
Transform your Classified business into Digital
Transform your Classified business into DigitalTransform your Classified business into Digital
Transform your Classified business into DigitalTANGERINE Digital
 
Sviluppare un backend serverless in real time attraverso GraphQL
Sviluppare un backend serverless in real time attraverso GraphQLSviluppare un backend serverless in real time attraverso GraphQL
Sviluppare un backend serverless in real time attraverso GraphQLAmazon Web Services
 
Big Traffic, Big Trouble: Big Data Security Analytics
Big Traffic, Big Trouble: Big Data Security AnalyticsBig Traffic, Big Trouble: Big Data Security Analytics
Big Traffic, Big Trouble: Big Data Security AnalyticsDataWorks Summit
 
Big Traffic, Big Trouble: Big Data - Tokyo
Big Traffic, Big Trouble: Big Data - TokyoBig Traffic, Big Trouble: Big Data - Tokyo
Big Traffic, Big Trouble: Big Data - TokyoDataWorks Summit
 
Aras ACE Conference PLM Keynote by Peter Schroer
Aras ACE Conference PLM Keynote by Peter SchroerAras ACE Conference PLM Keynote by Peter Schroer
Aras ACE Conference PLM Keynote by Peter SchroerAras
 
Social media it support.pptx
Social media  it support.pptxSocial media  it support.pptx
Social media it support.pptxPink Elephant
 

Semelhante a Big Data in Real Time (20)

Search Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL BackendSearch Analytics Business Value & NoSQL Backend
Search Analytics Business Value & NoSQL Backend
 
Bigdata analytics-twitter
Bigdata analytics-twitterBigdata analytics-twitter
Bigdata analytics-twitter
 
Search Analytics What? Why? How?
Search Analytics What? Why? How?Search Analytics What? Why? How?
Search Analytics What? Why? How?
 
Project Controls Expo, 13th Nov 2013 - "A new visual way to engage executive ...
Project Controls Expo, 13th Nov 2013 - "A new visual way to engage executive ...Project Controls Expo, 13th Nov 2013 - "A new visual way to engage executive ...
Project Controls Expo, 13th Nov 2013 - "A new visual way to engage executive ...
 
Learn The Characteristics Of World Class Communities
Learn The Characteristics Of World Class CommunitiesLearn The Characteristics Of World Class Communities
Learn The Characteristics Of World Class Communities
 
Alfresco digital assetmanagement-042111-final
Alfresco digital assetmanagement-042111-finalAlfresco digital assetmanagement-042111-final
Alfresco digital assetmanagement-042111-final
 
Social Radar 3.0 Deck
Social Radar 3.0 DeckSocial Radar 3.0 Deck
Social Radar 3.0 Deck
 
How To Use It With Safe
How To Use It With SafeHow To Use It With Safe
How To Use It With Safe
 
Big data and APIs for PHP developers - SXSW 2011
Big data and APIs for PHP developers - SXSW 2011Big data and APIs for PHP developers - SXSW 2011
Big data and APIs for PHP developers - SXSW 2011
 
Leveraging open source for big data stack
Leveraging open source for big data stackLeveraging open source for big data stack
Leveraging open source for big data stack
 
How to measurably increase your email response rates webinar.041411.1
How to measurably increase your email response rates webinar.041411.1How to measurably increase your email response rates webinar.041411.1
How to measurably increase your email response rates webinar.041411.1
 
Digital Asset Management with Alfresco
Digital Asset Management with AlfrescoDigital Asset Management with Alfresco
Digital Asset Management with Alfresco
 
Transform your Classified business into Digital
Transform your Classified business into DigitalTransform your Classified business into Digital
Transform your Classified business into Digital
 
Sviluppare un backend serverless in real time attraverso GraphQL
Sviluppare un backend serverless in real time attraverso GraphQLSviluppare un backend serverless in real time attraverso GraphQL
Sviluppare un backend serverless in real time attraverso GraphQL
 
Big Traffic, Big Trouble: Big Data Security Analytics
Big Traffic, Big Trouble: Big Data Security AnalyticsBig Traffic, Big Trouble: Big Data Security Analytics
Big Traffic, Big Trouble: Big Data Security Analytics
 
Big Traffic, Big Trouble: Big Data - Tokyo
Big Traffic, Big Trouble: Big Data - TokyoBig Traffic, Big Trouble: Big Data - Tokyo
Big Traffic, Big Trouble: Big Data - Tokyo
 
Aras ACE Conference PLM Keynote by Peter Schroer
Aras ACE Conference PLM Keynote by Peter SchroerAras ACE Conference PLM Keynote by Peter Schroer
Aras ACE Conference PLM Keynote by Peter Schroer
 
Social media it support.pptx
Social media  it support.pptxSocial media  it support.pptx
Social media it support.pptx
 
Big data by_mcal
Big data by_mcalBig data by_mcal
Big data by_mcal
 
Final_Bigdata_pret
Final_Bigdata_pretFinal_Bigdata_pret
Final_Bigdata_pret
 

Último

A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 

Último (20)

A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 

Big Data in Real Time

  • 1. Big Data in REAL TIME Ron Zavner
  • 2. We’re Living in a Real Time World… Social User Tracking & Homeland Security Engagement eCommerce Financial Services Real Time Search 2 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 3. The Flavors of Big Data Analytics Counting Correlating Research 3 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 4. Twitter in Numbers (March 2011) It takes a week for users to send 1 billion tweets Source: http://blog.twitter.com/2011/03/numbers.html 4 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 5. Twitter in Numbers (March 2011) On average, 140 million tweets get sent every day Source: http://blog.twitter.com/2011/03/numbers.html 5 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 6. Twitter in Numbers (March 2011) The highest throughput to date is 6,939 tweets/sec. Source: http://blog.twitter.com/2011/03/numbers.html 6 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 7. Twitter in Numbers (March 2011) 460,000 new accounts are created daily Source: http://blog.twitter.com/2011/03/numbers.html 7 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 8. Challenge – Word Count Tweets 8 ? ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved Count Count Word:Count
  • 9. Analyze the Problem  Thousands of tweets per second to process  Aggregate counters for each word  Latency – less than a second  System needs to linearly scale  System needs to be fault tolerant  Querying & Persisting Data  Managing the system 9 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 10. Tier Based Architecture? 10 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 11. Data Grid  11 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 12. Putting it all together 12 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 13. The 3 Most Popular Words on Twitter? 1. Just 2. Found 3. Love - August 2012 13 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved
  • 14. Q&A RonZ@gigaspaces.com 14 ® Copyright 2011 Gigaspaces Ltd. All Rights Reserved

Notas do Editor

  1. ActiveInsight