Presentation by Big Data Evangelist Dai Clegg (IBM): 'Babies, Buses and Movies; some examples of the value in big data analytics' at the Almere DataCapital Big Data Analytics seminar, 14 June, in Almere.
Here is another example of something the University of Southern California's Annenberg School of Communication did with the BigSheets technology in the IBM Big Data platform. USC Annenberg created the Film Forecaster tool and used it to correctly predict 2011's summer blockbusters by scraping Twitter and analyzing the tweets against a simple lexicon of words that indicate a positive or negative outlook for a movie. They made quite an impact, since this very solution was featured on ABC News (a national news network in the USA). More striking is the quote: the application was built by a communications Master's student who learned BigSheets in a day.
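To make the Film Forecaster idea concrete, here is a minimal sketch of lexicon-based sentiment scoring over tweets. The lexicon words and example tweets are invented for illustration; they are not USC Annenberg's actual lexicon or data, and the real tool was built in BigSheets rather than code.

```python
import re

# Toy lexicon: +1 for words signalling a positive showing, -1 for negative.
# (Invented example words, not the Film Forecaster's real lexicon.)
LEXICON = {
    "awesome": 1, "amazing": 1, "love": 1, "hilarious": 1,
    "boring": -1, "terrible": -1, "awful": -1, "skip": -1,
}

def score_tweet(text: str) -> int:
    """Sum the lexicon scores of every word appearing in the tweet."""
    words = re.findall(r"[a-z]+", text.lower())
    return sum(LEXICON.get(w, 0) for w in words)

def forecast(tweets: list[str]) -> float:
    """Average sentiment across all tweets mentioning a movie."""
    return sum(score_tweet(t) for t in tweets) / len(tweets)

tweets = [
    "This movie looks awesome, I love the trailer",
    "Hilarious and amazing, can't wait",
    "Boring sequel, skip it",
]
print(forecast(tweets))  # positive on balance, so forecast a strong opening
```

A positive average across a large tweet sample would flag a likely blockbuster; the real value of BigSheets was that this kind of analysis needed no programming at all.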
This picture is a little simplistic for two reasons. First, it gives pre-eminence to Netezza. That is because Netezza's simplicity, performance and agile support for ad-hoc analysis make it the default proposition for an analytic warehouse in a greenfield situation (though this is not necessarily true if there is an existing commitment to Power or to DB2). Secondly, it does not recognise the differentiation between exploratory analysis and repeated analysis.

If you are doing exploratory analysis of relational (i.e. structured) data, Netezza is the better platform; it thrives on ad-hoc analysis and has very rich tooling (INZA, SPSS etc.) for analytics. Clearly, exploratory analysis of unstructured data belongs on BigInsights. Exploratory analysis of something in between (e.g. CDRs) could be done on Netezza, but if the data is not already being loaded (and even in a Netezza customer the raw xDRs are probably not loaded into the warehouse), then exploration in a low-cost Hadoop grid makes tons of sense. We have at least one customer use case of this, where, once the analysis was repeatable, it was implemented in Netezza. But there are also use cases where the repeated analysis remains in BigInsights, exploiting its differentiating enterprise readiness.
If it's data in motion (remember the babies being monitored), it has to be real-time; it has to be Streams. That's the easy one. If it's unstructured data at rest, the best place to start is BigInsights, though you may load data into the relational warehouse subsequently for further insight. If it's relational data, it's unlikely you are going to move it to Hadoop. If it's semi-structured, you have a choice, and you'll be influenced by these other development factors:

- It may be that an organization has already developed a map-reduce solution that delivers high-value analysis of data unloaded from the corporate EDW. Is the right answer to say 'great, now you know the solution, re-code it in SQL using in-database analytics and implement it on your warehouse'? Maybe a better answer is to implement BigInsights to enterprise-harden the Hadoop environment and run the application as is, but with production-grade reliability and supportability.
- It may be that the volume is so huge that a data warehouse can't handle it, and certainly can't handle it economically (think Vestas).
- It may be better to go to the platform with more of the appropriate analytic skills or other development resources available.
- It may be that the customer wants to build their capability in Hadoop because they will have more challenging use cases later that will be clear-cut BigInsights use cases.
- It may be that the customer just wants to experiment cheaply and quickly (though actually that's more a BigInsights Basic Edition use case; we'll be looking to enterprise-harden it later).

But remember, these are influencers, not deciders. IBMers can adapt to whatever best matches the customer's needs, because of the comprehensive nature of our big data portfolio.
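The first-cut routing logic described above can be sketched as a simple function. This is an illustrative heuristic only: the product names follow the text, the `structure` categories are my simplification, and the semi-structured case deliberately returns no single answer because the development factors listed above decide it.

```python
def recommend_platform(in_motion: bool, structure: str) -> str:
    """First-cut platform choice.

    structure: one of 'unstructured', 'semi-structured', 'relational'.
    """
    if in_motion:
        # Data in motion has to be real-time: InfoSphere Streams.
        return "InfoSphere Streams"
    if structure == "unstructured":
        # Unstructured data at rest: start with BigInsights.
        return "BigInsights"
    if structure == "relational":
        # Relational data is unlikely to move to Hadoop.
        return "Netezza warehouse"
    # Semi-structured: no single answer; weigh the factors listed above.
    return "BigInsights or Netezza warehouse"

print(recommend_platform(True, "unstructured"))    # InfoSphere Streams
print(recommend_platform(False, "semi-structured"))
```

The point of the closing paragraph stands: these rules are influencers, not deciders, and a real engagement weighs skills, volume, economics and the customer's roadmap before picking a platform.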