SlideShare uma empresa Scribd logo
1 de 27
1© Cloudera, Inc. All rights reserved.
Driving Insights in A Connected World
Amy O’ Connor | Big Data Evangelist
Vijay Raja | Solutions Marketing
Powering the Internet of Things
with Apache Hadoop
2© Cloudera, Inc. All rights reserved.
Agenda & Overview
• IoT – A Revolution in the Making
• Key IoT Segments & Use Cases
• The IoT Ecosystem
• A Next gen IoT Analytics Engine
• Cloudera Enterprise – Real Time Analytics for IoT
• Reference Architecture
• IoT – Customer Case Studies
3© Cloudera, Inc. All rights reserved.
Internet of Things (IoT) – A Revolution In The Making
$1.7
Trillion
In Value
20%
Annual Growth
30 Billion
Things
250
Million
Connected Vehicles
Source - IDC & Gartner Estimates
Internet of
Things
IoT Markets - 2020
4© Cloudera, Inc. All rights reserved.
IoT – A Growing Universe of Connected Objects
Source: McKinsey Source: Business Intelligence
5© Cloudera, Inc. All rights reserved.
IoT Will Drive An Explosion of Data…
Data expected to explode to
44 ZB by 2020
Source: IDC
44 Trillion GB!80% of data will be
unstructured
6© Cloudera, Inc. All rights reserved.
It’s all kind of meaningless
unless you can make
sense of all that data
7© Cloudera, Inc. All rights reserved.
< 1% of data is currently utilized…
Mostly for anomaly detection or real-
time control; more can be used for
optimization and prediction
1%
Source: McKinsey Global - THE INTERNET OF THINGS:
MAPPING THE VALUE BEYOND THE HYPE, June 2015
8© Cloudera, Inc. All rights reserved.
Value is Maximized when Data is combined from
other sources
Value of Data is multiplied when you combine
and correlate it with other data from relevant
sources
Improvement in value that can be
unlocked by combining data from
multiple IoT applications and sources
SOURCE: McKinsey Global Institute analysis
“Interoperability would significantly improve performance by
combining sensor data from different machines and systems to provide
decision makers with an integrated view of performance across an
entire factory or oil rig.”
40%
9© Cloudera, Inc. All rights reserved.
IoT – Key Segments & Use Cases
Industrial IoT
Manufacturing
• Predictive Maintenance
• Operations Optimization
• Supply Chain Optimization
Healthcare
• Proactive & Connected Monitoring
• Early detection & Diagnosis
• Remote Measurements
Energy & Utilities
• Transmission & Distribution
• Smart-Grid & Smart Meters
• Ops & Predictive Maintenance
Govt. & Public Services
• Smart Cities
• Traffic Optimization
• Public Safety
Insurance
• Usage Based Insurance (UBI)
• Telematics for Insurance
• Insured Asset Management
Retail
• Automated Checkouts
• Footfall Analytics & Promos
• Inventory Optimization
Telecommunications
• Network Maintenance
• Connected Homes/ Cars
• Data Monetization
Mobility
• Telematics & Fleet Mgmt.
• Tracking & Remote Monitoring
• Condition Based Maintenance
Consumer IoT
Connected Cars
• Safety & Security
• Real-Time Diagnostics
• In-car Connectivity & Infotainment
Health & Lifestyle
• Wearables
• Health & Fitness Tracking
• Real-Time Remote Monitoring
Connected Homes
• Home Automation & Security
• Home Energy management
• Smart Appliances
Entertainment
• Interconnected Smart Devices
• Virtual Reality/ Interactive Gaming
• Drones
10© Cloudera, Inc. All rights reserved.
Internet of Things - Opportunity & Impact
Source- Ovum: Understanding the IoT Opportunity: An Industry Perspective - 2015
Manufacturing is a future hotbed for IoT deployment
The potential value that could be unlocked with IoT
applications in factory settings could be as much as
$3.7 trillion in 2025 - McKinsey Analysis$3.7T
50%Using real-time data to predict and prevent
breakdowns can reduce downtime by 50 percent
- McKinsey Analysis
Reduction in
Downtime
50%
Healthcare: IoT can enable cutting the costs of
chronic disease treatment by as much as
50 percent
- McKinsey Analysis
Reduction in
Costs
The number of connected vehicles will grow more
than six fold to over 250 million by 2020
- Gartner
250
Million
11© Cloudera, Inc. All rights reserved.
Polling Question
Where are you with respect to your IoT journey?
o We already have IoT use cases deployed
o We are actively working on deploying IoT use cases
o We are currently evaluating IoT use cases & technologies
o We will potentially deploy in the next two years
o We don’t see an IoT play in my organization
12© Cloudera, Inc. All rights reserved.
Performance Monitoring & Predictive
Maintenance of Heavy Equipment
Challenge:
• Continuously monitor performance of
heavy machinery and perform predictive
maintenance
Solution:
• Use Cloudera to parse large volume and
high velocity sensor data from equipment
• Process and analyze data for performance
analysis, advanced defect detection
HEAVY MACHINARY
» INDUSTRIAL IoT
» PREDICTIVE MAINTENANCE
» LOWERED COSTS
Industrial IoT – Heavy Machinery
DATA-DRIVEN
PROCESS
CASE STUDY
Change Image
13© Cloudera, Inc. All rights reserved.
Advanced analytics on streaming data to
reduce human space mission risks
Challenge:
• Over 2 TB/ hour of telemetry test data
streaming in from over 1200 sensors in
test environment
Solution:
• Cloudera cluster supporting high rate of
data ingest – up to ~300MB/sec
• Advanced analytics run on the
streaming data to check for issues or
determine patterns and reduce risk
AEROSPACE
» INDUSTRIAL IoT
» REMOTE MONITORING
» PREDICTIVE MAINTENANCE
Aerospace – Spacecraft Telemetry
DATA-DRIVEN
PROCESS
CASE STUDY
14© Cloudera, Inc. All rights reserved.
Enabling Smart Parking in Milton Keynes
with BT
Challenge:
• Increase efficiency and utilization of the
parking spots at Milton Keynes
Solution:
• Sensors installed in the parking bays sent
data streams to the Cloudera Data Hub
hosted by BT
• Real-time updates on parking availability
over the web/ smartphones
• Savings of £105M & reduced emissions
TELCOMMUNICATIONS
» SMART CITIES
» IMPROVED UTILIZATION
» COST REDUCTION
Smart Cities
CASE STUDY
15© Cloudera, Inc. All rights reserved.
The IoT Ecosystem
Consumer
Industrial
IoT Gateway
Cloud
Data Center
Data Analytics
Sensors/ Things
16© Cloudera, Inc. All rights reserved.
The IoT Ecosystem
Consumer
Industrial
IoT Gateway
Data Center
Data Analytics
Sensors/ Things
Data Characteristics
• Un-structured
• Intermittent
• Volume & Variety
Gateway
• Data Routing
• Edge-Processing
• Edge-Storage
Sensors/ Things
•To grow by 50X
•Drop in prices by
70% in last 5 years
Data Storage, Processing & Analytics
IOT Data Characteristics
• More processing in the
cloud
• Analytics on the cloud
IOT Data Analytics
• Key to Value Creation
• Combine data from multiple
sources & types
• Drive business insights
IOT Data Characteristics
• Distributed Data
Processing
• Cloud & On-Premise
Cloud
17© Cloudera, Inc. All rights reserved.
Key Attributes For Next Gen IoT Data Platform
Scale efficiently based on
your data growth
Effectively handle multiple
data-types and structures
Manage the complexity of
real-time IoT data ingest
Fundamentally Secure
Real-Time Analytics – Combine and
analyze data from multiple sources
Flexible deployment options
- Cloud & Distributed Data Processing
18© Cloudera, Inc. All rights reserved.
FILESYSTEM RELATIONAL
Cloudera Enterprise – Making Hadoop Fast, Easy, and Secure
OPERATIONS
Cloudera Manager
Cloudera Director
DATA
MANAGEMENT
Cloudera Navigator
Encrypt and KeyTrustee
Optimizer
BATCH
Sqoop
REAL-TIME
Kafka, Flume
PROCESS, ANALYZE, SERVE
UNIFIED SERVICES
RESOURCE MANAGEMENT
YARN
SECURITY
Sentry, RecordService
FILESYSTEM
HDFS
RELATIONAL
Kudu
NoSQL
HBase
STORE
INTEGRATE
BATCH
Spark, Hive, Pig
MapReduce
STREAM
Spark
SQL
Impala
SEARCH
Solr
SDK
Partners
CLOUDERA ENTERPRISE
19© Cloudera, Inc. All rights reserved.
Cloudera Enterprise – The Data & Analytics Platform for IoT
Sensors/ IoT
Data Sources
Internal Systems External Sources
BI Solutions Real-Time AppsSearch EDWDiscove
r
Machine
Learning
Data Center
Cloud
Sensor/ IoT Data
IoT Gateway
• Data Storage
• Data Processing
• Machine Learning
• Real-time Analytics
OPERATIONS
Cloudera Manager
Cloudera Director
DATA
MANAGEMENT
Cloudera Navigator
Encrypt and KeyTrustee
Optimizer
BATCH
Sqoop
REAL-TIME
Kafka, Flume
PROCESS, ANALYZE, SERVE
UNIFIED SERVICES
RESOURCE MANAGEMENT
YARN
SECURITY
Sentry, RecordService
FILESYSTEM
HDFS
RELATIONAL
Kudu
NoSQL
HBase
STORE
INTEGRATE
BATCH
Spark, Hive, Pig
MapReduce
STREAM
Spark
SQL
Impala
SEARCH
Solr
SDK
Partners
20© Cloudera, Inc. All rights reserved.
Cloudera Enterprise – Real Time Analytics for IoT
BI Solutions Real-Time AppsSearch EDWDiscover Machine
Learning
Deployment
Flexibility
Spark Streaming
Leadership in Spark
Integrated with EDH
Flexible Storage
Store any and all Data.
Kudu - Fast Analytics on
Fast Data
Real-Time Data
Processing
Data Security
Four pillars of security: Perimeter,
Access, Visibility, and Data
+ Record Service
Streaming Ingest
Kafka & Flume - Real-Time
Data Ingest for streaming,
high volume data
Sensor/ IoT Data Internal Systems External Sources
Centralized Mgmt.
Cloudera Manager for
centralized cluster
management
Manage Multiple Clusters – On
Premise or Cloud environment
- On Premise or Cloud
OPERATIONS
Cloudera Manager
Cloudera Director
DATA
MANAGEMENT
Cloudera Navigator
Encrypt and KeyTrustee
Optimizer
BATCH
Sqoop
REAL-TIME
Kafka, Flume
PROCESS, ANALYZE, SERVE
UNIFIED SERVICES
RESOURCE MANAGEMENT
YARN
SECURITY
Sentry, RecordService
FILESYSTEM
HDFS
RELATIONAL
Kudu
NoSQL
HBase
STORE
INTEGRATE
BATCH
Spark, Hive, Pig
MapReduce
STREAM
Spark
SQL
Impala
SEARCH
Solr
SDK
Partners
21© Cloudera, Inc. All rights reserved.
Cloudera – IoT Technical Architecture
IoT Enablers
Data Ingestion
(Kafka, JSON)
Data Transformation
and Enrichment
Data Processing &
Serving
Rule Mining, Pattern
Matching, Machine Learning
Distributed Data
Analytics
Data
Visualization
Data Storage
(Hbase, HDFS, Kudu)
22© Cloudera, Inc. All rights reserved.
Enabling US Auto Manufacturer to
Effectively Capture & Analyze Data from
over 30,000 Sensors in Every Car
Challenge:
• Effectively capture and analyze data
emitting from 30,000 sensors in every
car
Solution:
• Centralized data platform from Cloudera
for analytics of all of the sensor data
• Monitor individual component
performance and operational metrics
AUTOMOBILE
» CONNECTED CARS
» PREDICTIVE ANALYTICS
» REMOTE MONITORING
Connected Cars
DATA-DRIVEN
PROCESS
CASE STUDY
23© Cloudera, Inc. All rights reserved.
Helping 4+ million homes save hundreds of
millions of dollars in energy bills
Challenge:
• Bringing together diverse data sets
including streaming utility& sensor data
• Deriving Business Insights from all the
Data
Solution:
• Analytical Application on Cloudera EDH
• Savings of more than $320 Million for
subscribers
UTILITIES
» CONSUMER IoT
» PROCESS IMPROVEMENT
» COST REDUCTION
Smart Meters
CASE STUDY
24© Cloudera, Inc. All rights reserved.
Usage Based Insurance to reduce claims
by a large European insurance agency
Challenge:
• Gather, store and analyze sensor data
from millions of customers to analyze
driving habits & risks
Solution:
• Telemetry solution on Cloudera EDH -
increased policy renewal and customer
satisfaction rates
• Reduced the number of claims by 30%
resulting in savings in millions annually
INSURANCE
» USAGE BASED INSURANCE
» IMPROVED PROCESSES
» REDUCED COSTS
Usage-Based Insurance and Telematics
DATA-DRIVEN
PROCESS
CASE STUDY
25© Cloudera, Inc. All rights reserved.
Improve Parkinson's Disease Monitoring
and Treatment through IoT
Challenge:
• Collect and analyze data from wearables
(more than 300 readings per second)
from thousands of patients in real-time
Solution:
• Cloudera on Intel architecture to detect
patterns in patient data streaming from
thousands of wearables
• Continuously monitor the patients and
symptoms to accelerate breakthrough on
drug development
HEALTHCARE
» PATIENT 360°
» PREDICTIVE ANALYTICS
» IMPROVED CARE
Smart Healthcare
DATA-DRIVEN
PROCESS
CASE STUDY
26© Cloudera, Inc. All rights reserved.
The Cloudera Difference
Powerful Cluster Ops
Trusted by the pros
Cloud & Hybrid deployment
Integrated with AWS & Azure
Expert Support
Dedicated prescriptive help, just a click away
Real-Time IoT Analytics
The most experience with Spark
The Fastest Analytic SQL
Lowest latency, best concurrency
Fast, Updateable Analytic Storage
High throughput, low latency, and updates
Easy to ManageFast for Business Security without Compromise
Enterprise Encryption
Protects everything transparently
Access Policy Enforcement
Full-stack row/column-based RBAC & dynamic masking
Automated Data Management
Full-stack audit, lineage, discovery, and lifecycle
Secure Operations
Separation of duties, log data redaction
27© Cloudera, Inc. All rights reserved.
Thank You

Mais conteúdo relacionado

Mais procurados

Information Technology For Business Syllabus
Information Technology For Business   SyllabusInformation Technology For Business   Syllabus
Information Technology For Business SyllabusSahil Mahajan
 
Machine learning life cycle
Machine learning life cycleMachine learning life cycle
Machine learning life cycleRamjee Ganti
 
Topic 2: Cloud Computing Paradigms
Topic 2: Cloud Computing ParadigmsTopic 2: Cloud Computing Paradigms
Topic 2: Cloud Computing ParadigmsZubair Nabi
 
The data defined-vehicle_architecture
The data defined-vehicle_architectureThe data defined-vehicle_architecture
The data defined-vehicle_architectureSebastian Wedeniwski
 
Future of AI Smart Networks
Future of AI Smart NetworksFuture of AI Smart Networks
Future of AI Smart NetworksMelanie Swan
 
What Is Machine Learning? | Machine Learning Basics | Edureka
What Is Machine Learning? | Machine Learning Basics | EdurekaWhat Is Machine Learning? | Machine Learning Basics | Edureka
What Is Machine Learning? | Machine Learning Basics | EdurekaEdureka!
 
AI, Machine Learning and Deep Learning - The Overview
AI, Machine Learning and Deep Learning - The OverviewAI, Machine Learning and Deep Learning - The Overview
AI, Machine Learning and Deep Learning - The OverviewSpotle.ai
 
Building the Metaverse
Building the MetaverseBuilding the Metaverse
Building the MetaverseJon Radoff
 
PPT AI IN FINANCE.pptx
PPT AI IN FINANCE.pptxPPT AI IN FINANCE.pptx
PPT AI IN FINANCE.pptxVidyaBorade
 
A brief history of artificial intelligence for business
A brief history of artificial intelligence for businessA brief history of artificial intelligence for business
A brief history of artificial intelligence for businessJack C Crawford
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentationlpaviglianiti
 
SolarWinds Federal User Group 2016 - SolarWinds Enterprise Scalability, Integ...
SolarWinds Federal User Group 2016 - SolarWinds Enterprise Scalability, Integ...SolarWinds Federal User Group 2016 - SolarWinds Enterprise Scalability, Integ...
SolarWinds Federal User Group 2016 - SolarWinds Enterprise Scalability, Integ...SolarWinds
 
Top 10 Applications Of Artificial Intelligence | Edureka
Top 10 Applications Of Artificial Intelligence | EdurekaTop 10 Applications Of Artificial Intelligence | Edureka
Top 10 Applications Of Artificial Intelligence | EdurekaEdureka!
 
Artificial Intelligence in E-Commerce
Artificial Intelligence in E-CommerceArtificial Intelligence in E-Commerce
Artificial Intelligence in E-CommerceMd Javedul Ferdous
 
Microsoft & Internet of Things
Microsoft & Internet of ThingsMicrosoft & Internet of Things
Microsoft & Internet of ThingsMarlon Luz
 
What is Artificial Intelligence | Artificial Intelligence Tutorial For Beginn...
What is Artificial Intelligence | Artificial Intelligence Tutorial For Beginn...What is Artificial Intelligence | Artificial Intelligence Tutorial For Beginn...
What is Artificial Intelligence | Artificial Intelligence Tutorial For Beginn...Edureka!
 

Mais procurados (20)

Information Technology For Business Syllabus
Information Technology For Business   SyllabusInformation Technology For Business   Syllabus
Information Technology For Business Syllabus
 
Machine learning life cycle
Machine learning life cycleMachine learning life cycle
Machine learning life cycle
 
Topic 2: Cloud Computing Paradigms
Topic 2: Cloud Computing ParadigmsTopic 2: Cloud Computing Paradigms
Topic 2: Cloud Computing Paradigms
 
The data defined-vehicle_architecture
The data defined-vehicle_architectureThe data defined-vehicle_architecture
The data defined-vehicle_architecture
 
Future of AI Smart Networks
Future of AI Smart NetworksFuture of AI Smart Networks
Future of AI Smart Networks
 
Artificial intelligence in financial sector
Artificial intelligence in financial sectorArtificial intelligence in financial sector
Artificial intelligence in financial sector
 
Explore the Impact of AI on E-Commerce
Explore the Impact of AI on E-CommerceExplore the Impact of AI on E-Commerce
Explore the Impact of AI on E-Commerce
 
What Is Machine Learning? | Machine Learning Basics | Edureka
What Is Machine Learning? | Machine Learning Basics | EdurekaWhat Is Machine Learning? | Machine Learning Basics | Edureka
What Is Machine Learning? | Machine Learning Basics | Edureka
 
AI, Machine Learning and Deep Learning - The Overview
AI, Machine Learning and Deep Learning - The OverviewAI, Machine Learning and Deep Learning - The Overview
AI, Machine Learning and Deep Learning - The Overview
 
Building the Metaverse
Building the MetaverseBuilding the Metaverse
Building the Metaverse
 
PPT AI IN FINANCE.pptx
PPT AI IN FINANCE.pptxPPT AI IN FINANCE.pptx
PPT AI IN FINANCE.pptx
 
A brief history of artificial intelligence for business
A brief history of artificial intelligence for businessA brief history of artificial intelligence for business
A brief history of artificial intelligence for business
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentation
 
SolarWinds Federal User Group 2016 - SolarWinds Enterprise Scalability, Integ...
SolarWinds Federal User Group 2016 - SolarWinds Enterprise Scalability, Integ...SolarWinds Federal User Group 2016 - SolarWinds Enterprise Scalability, Integ...
SolarWinds Federal User Group 2016 - SolarWinds Enterprise Scalability, Integ...
 
Federated Learning
Federated LearningFederated Learning
Federated Learning
 
Top 10 Applications Of Artificial Intelligence | Edureka
Top 10 Applications Of Artificial Intelligence | EdurekaTop 10 Applications Of Artificial Intelligence | Edureka
Top 10 Applications Of Artificial Intelligence | Edureka
 
Artificial Intelligence in E-Commerce
Artificial Intelligence in E-CommerceArtificial Intelligence in E-Commerce
Artificial Intelligence in E-Commerce
 
Microsoft & Internet of Things
Microsoft & Internet of ThingsMicrosoft & Internet of Things
Microsoft & Internet of Things
 
What is Artificial Intelligence | Artificial Intelligence Tutorial For Beginn...
What is Artificial Intelligence | Artificial Intelligence Tutorial For Beginn...What is Artificial Intelligence | Artificial Intelligence Tutorial For Beginn...
What is Artificial Intelligence | Artificial Intelligence Tutorial For Beginn...
 
Final presentation on chatbot
Final presentation on chatbotFinal presentation on chatbot
Final presentation on chatbot
 

Destaque

Cloudera for Internet of Things
Cloudera for Internet of ThingsCloudera for Internet of Things
Cloudera for Internet of ThingsCloudera, Inc.
 
A Reference Architecture for IoT
A Reference Architecture for IoT A Reference Architecture for IoT
A Reference Architecture for IoT WSO2
 
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...Cloudera, Inc.
 
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...Cloudera, Inc.
 
A reference architecture for the internet of things
A reference architecture for the internet of thingsA reference architecture for the internet of things
A reference architecture for the internet of thingsCharles Gibbons
 
IoT Cloud architecture
IoT Cloud architectureIoT Cloud architecture
IoT Cloud architectureMachinePulse
 
Hadoop mapreduce performance study on arm cluster
Hadoop mapreduce performance study on arm clusterHadoop mapreduce performance study on arm cluster
Hadoop mapreduce performance study on arm clusterairbots
 
모바일 시대의 커뮤니케이션 패턴변화와 쟁점
모바일 시대의 커뮤니케이션 패턴변화와 쟁점모바일 시대의 커뮤니케이션 패턴변화와 쟁점
모바일 시대의 커뮤니케이션 패턴변화와 쟁점SCOTOSS
 
Hadoop World 2011: Changing Company Culture with Hadoop - Amy O'Connor, Nokia
Hadoop World 2011: Changing Company Culture with Hadoop - Amy O'Connor, NokiaHadoop World 2011: Changing Company Culture with Hadoop - Amy O'Connor, Nokia
Hadoop World 2011: Changing Company Culture with Hadoop - Amy O'Connor, NokiaCloudera, Inc.
 
Intuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with SearchIntuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with SearchCloudera, Inc.
 
BDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use case
BDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use caseBDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use case
BDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use caseDavid Lauzon
 
Ingesting hdfs intosolrusingsparktrimmed
Ingesting hdfs intosolrusingsparktrimmedIngesting hdfs intosolrusingsparktrimmed
Ingesting hdfs intosolrusingsparktrimmedwhoschek
 
Industrial Data Space Association - New Members, New Insights, New Future Dir...
Industrial Data Space Association - New Members, New Insights, New Future Dir...Industrial Data Space Association - New Members, New Insights, New Future Dir...
Industrial Data Space Association - New Members, New Insights, New Future Dir...Thorsten Huelsmann
 
Hadoop Self-Service Data Prep Fuels Analytics
Hadoop Self-Service Data Prep Fuels AnalyticsHadoop Self-Service Data Prep Fuels Analytics
Hadoop Self-Service Data Prep Fuels AnalyticsSenturus
 
Farming hadoop in_the_cloud
Farming hadoop in_the_cloudFarming hadoop in_the_cloud
Farming hadoop in_the_cloudSteve Loughran
 
Running 400-node Cassandra + Spark Clusters in Azure (Anubhav Kale, Microsoft...
Running 400-node Cassandra + Spark Clusters in Azure (Anubhav Kale, Microsoft...Running 400-node Cassandra + Spark Clusters in Azure (Anubhav Kale, Microsoft...
Running 400-node Cassandra + Spark Clusters in Azure (Anubhav Kale, Microsoft...DataStax
 

Destaque (20)

A Methodology for Building the Internet of Things
A Methodology for Building the Internet of ThingsA Methodology for Building the Internet of Things
A Methodology for Building the Internet of Things
 
Cloudera for Internet of Things
Cloudera for Internet of ThingsCloudera for Internet of Things
Cloudera for Internet of Things
 
A Reference Architecture for IoT
A Reference Architecture for IoT A Reference Architecture for IoT
A Reference Architecture for IoT
 
Big Data Application Architectures - IoT
Big Data Application Architectures - IoTBig Data Application Architectures - IoT
Big Data Application Architectures - IoT
 
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
The Business Advantage of Hadoop: Lessons from the Field – Cloudera Summer We...
 
IoT Data as Service with Hadoop
IoT Data as Service with HadoopIoT Data as Service with Hadoop
IoT Data as Service with Hadoop
 
IoT architecture
IoT architectureIoT architecture
IoT architecture
 
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...
Enterprise Hadoop in the Cloud. In Minutes. | How to Run Cloudera Enterprise ...
 
A reference architecture for the internet of things
A reference architecture for the internet of thingsA reference architecture for the internet of things
A reference architecture for the internet of things
 
IoT Cloud architecture
IoT Cloud architectureIoT Cloud architecture
IoT Cloud architecture
 
Hadoop mapreduce performance study on arm cluster
Hadoop mapreduce performance study on arm clusterHadoop mapreduce performance study on arm cluster
Hadoop mapreduce performance study on arm cluster
 
모바일 시대의 커뮤니케이션 패턴변화와 쟁점
모바일 시대의 커뮤니케이션 패턴변화와 쟁점모바일 시대의 커뮤니케이션 패턴변화와 쟁점
모바일 시대의 커뮤니케이션 패턴변화와 쟁점
 
Hadoop World 2011: Changing Company Culture with Hadoop - Amy O'Connor, Nokia
Hadoop World 2011: Changing Company Culture with Hadoop - Amy O'Connor, NokiaHadoop World 2011: Changing Company Culture with Hadoop - Amy O'Connor, Nokia
Hadoop World 2011: Changing Company Culture with Hadoop - Amy O'Connor, Nokia
 
Intuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with SearchIntuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with Search
 
BDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use case
BDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use caseBDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use case
BDM9 - Comparison of Oracle RDBMS and Cloudera Impala for a hospital use case
 
Ingesting hdfs intosolrusingsparktrimmed
Ingesting hdfs intosolrusingsparktrimmedIngesting hdfs intosolrusingsparktrimmed
Ingesting hdfs intosolrusingsparktrimmed
 
Industrial Data Space Association - New Members, New Insights, New Future Dir...
Industrial Data Space Association - New Members, New Insights, New Future Dir...Industrial Data Space Association - New Members, New Insights, New Future Dir...
Industrial Data Space Association - New Members, New Insights, New Future Dir...
 
Hadoop Self-Service Data Prep Fuels Analytics
Hadoop Self-Service Data Prep Fuels AnalyticsHadoop Self-Service Data Prep Fuels Analytics
Hadoop Self-Service Data Prep Fuels Analytics
 
Farming hadoop in_the_cloud
Farming hadoop in_the_cloudFarming hadoop in_the_cloud
Farming hadoop in_the_cloud
 
Running 400-node Cassandra + Spark Clusters in Azure (Anubhav Kale, Microsoft...
Running 400-node Cassandra + Spark Clusters in Azure (Anubhav Kale, Microsoft...Running 400-node Cassandra + Spark Clusters in Azure (Anubhav Kale, Microsoft...
Running 400-node Cassandra + Spark Clusters in Azure (Anubhav Kale, Microsoft...
 

Semelhante a Powering the Internet of Things with Apache Hadoop

Cloudera - IoT & Smart Cities
Cloudera - IoT & Smart CitiesCloudera - IoT & Smart Cities
Cloudera - IoT & Smart CitiesCloudera, Inc.
 
Cloudera - Enabling the IoT Revolution Driving Insights in a Connected World
Cloudera - Enabling the IoT Revolution Driving Insights in a Connected WorldCloudera - Enabling the IoT Revolution Driving Insights in a Connected World
Cloudera - Enabling the IoT Revolution Driving Insights in a Connected Worldandreas kuncoro
 
The Evolution of Data Architecture
The Evolution of Data ArchitectureThe Evolution of Data Architecture
The Evolution of Data ArchitectureWei-Chiu Chuang
 
CL2015 - Datacenter and Cloud Strategy and Planning
CL2015 - Datacenter and Cloud Strategy and PlanningCL2015 - Datacenter and Cloud Strategy and Planning
CL2015 - Datacenter and Cloud Strategy and PlanningCisco
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
AWS O&G Day - Ambyint and AWS
AWS O&G Day - Ambyint and AWSAWS O&G Day - Ambyint and AWS
AWS O&G Day - Ambyint and AWSAWS Summits
 
Presentation-Watson_IoT_Platform-Long-08Feb2016
Presentation-Watson_IoT_Platform-Long-08Feb2016Presentation-Watson_IoT_Platform-Long-08Feb2016
Presentation-Watson_IoT_Platform-Long-08Feb2016Nikhil Dikshit
 
IoT: Understanding its potential and what makes it tick! by Mark Torr
IoT: Understanding its potential and what makes it tick! by Mark TorrIoT: Understanding its potential and what makes it tick! by Mark Torr
IoT: Understanding its potential and what makes it tick! by Mark TorrBosnia Agile
 
Streaming Analytics for IoT with Apache Spark
Streaming Analytics for IoT with Apache SparkStreaming Analytics for IoT with Apache Spark
Streaming Analytics for IoT with Apache SparkImpetus Technologies
 
CWIN17 Frankfurt / Cloudera
CWIN17 Frankfurt / ClouderaCWIN17 Frankfurt / Cloudera
CWIN17 Frankfurt / ClouderaCapgemini
 
Introduction to ibm internet of things foundation
Introduction to ibm internet of things foundationIntroduction to ibm internet of things foundation
Introduction to ibm internet of things foundationBernard Kufluk
 
IoT Connected Brewery
IoT Connected BreweryIoT Connected Brewery
IoT Connected BreweryJason Hubbard
 
How Can Edge Computing and IoT Transform Your Business?
How Can Edge Computing and IoT Transform Your Business?How Can Edge Computing and IoT Transform Your Business?
How Can Edge Computing and IoT Transform Your Business?Amazon Web Services
 
Barga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteBarga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteRoger Barga
 
Mindsphere: an open cloud-based IoT operating system for Industry
Mindsphere: an open cloud-based IoT operating system for IndustryMindsphere: an open cloud-based IoT operating system for Industry
Mindsphere: an open cloud-based IoT operating system for IndustryIIoTWorld
 
Subscribed 2015: The Explosion of Smart Connected Things
Subscribed 2015: The Explosion of Smart Connected ThingsSubscribed 2015: The Explosion of Smart Connected Things
Subscribed 2015: The Explosion of Smart Connected ThingsZuora, Inc.
 
The Impact of IoT on Product Design
The Impact of IoT on Product DesignThe Impact of IoT on Product Design
The Impact of IoT on Product DesignAras
 
Oracle Digital Business Transformation and Internet of Things by Ermin Prašović
Oracle Digital Business Transformation and Internet of Things by Ermin PrašovićOracle Digital Business Transformation and Internet of Things by Ermin Prašović
Oracle Digital Business Transformation and Internet of Things by Ermin PrašovićBosnia Agile
 
SECON'2016. Семенченко Антон, Как тренды в Мобильной разработке и Интернете в...
SECON'2016. Семенченко Антон, Как тренды в Мобильной разработке и Интернете в...SECON'2016. Семенченко Антон, Как тренды в Мобильной разработке и Интернете в...
SECON'2016. Семенченко Антон, Как тренды в Мобильной разработке и Интернете в...SECON
 

Semelhante a Powering the Internet of Things with Apache Hadoop (20)

Cloudera - IoT & Smart Cities
Cloudera - IoT & Smart CitiesCloudera - IoT & Smart Cities
Cloudera - IoT & Smart Cities
 
Top 5 IoT Use Cases
Top 5 IoT Use CasesTop 5 IoT Use Cases
Top 5 IoT Use Cases
 
Cloudera - Enabling the IoT Revolution Driving Insights in a Connected World
Cloudera - Enabling the IoT Revolution Driving Insights in a Connected WorldCloudera - Enabling the IoT Revolution Driving Insights in a Connected World
Cloudera - Enabling the IoT Revolution Driving Insights in a Connected World
 
The Evolution of Data Architecture
The Evolution of Data ArchitectureThe Evolution of Data Architecture
The Evolution of Data Architecture
 
CL2015 - Datacenter and Cloud Strategy and Planning
CL2015 - Datacenter and Cloud Strategy and PlanningCL2015 - Datacenter and Cloud Strategy and Planning
CL2015 - Datacenter and Cloud Strategy and Planning
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
AWS O&G Day - Ambyint and AWS
AWS O&G Day - Ambyint and AWSAWS O&G Day - Ambyint and AWS
AWS O&G Day - Ambyint and AWS
 
Presentation-Watson_IoT_Platform-Long-08Feb2016
Presentation-Watson_IoT_Platform-Long-08Feb2016Presentation-Watson_IoT_Platform-Long-08Feb2016
Presentation-Watson_IoT_Platform-Long-08Feb2016
 
IoT: Understanding its potential and what makes it tick! by Mark Torr
IoT: Understanding its potential and what makes it tick! by Mark TorrIoT: Understanding its potential and what makes it tick! by Mark Torr
IoT: Understanding its potential and what makes it tick! by Mark Torr
 
Streaming Analytics for IoT with Apache Spark
Streaming Analytics for IoT with Apache SparkStreaming Analytics for IoT with Apache Spark
Streaming Analytics for IoT with Apache Spark
 
CWIN17 Frankfurt / Cloudera
CWIN17 Frankfurt / ClouderaCWIN17 Frankfurt / Cloudera
CWIN17 Frankfurt / Cloudera
 
Introduction to ibm internet of things foundation
Introduction to ibm internet of things foundationIntroduction to ibm internet of things foundation
Introduction to ibm internet of things foundation
 
IoT Connected Brewery
IoT Connected BreweryIoT Connected Brewery
IoT Connected Brewery
 
How Can Edge Computing and IoT Transform Your Business?
How Can Edge Computing and IoT Transform Your Business?How Can Edge Computing and IoT Transform Your Business?
How Can Edge Computing and IoT Transform Your Business?
 
Barga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteBarga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 Keynote
 
Mindsphere: an open cloud-based IoT operating system for Industry
Mindsphere: an open cloud-based IoT operating system for IndustryMindsphere: an open cloud-based IoT operating system for Industry
Mindsphere: an open cloud-based IoT operating system for Industry
 
Subscribed 2015: The Explosion of Smart Connected Things
Subscribed 2015: The Explosion of Smart Connected ThingsSubscribed 2015: The Explosion of Smart Connected Things
Subscribed 2015: The Explosion of Smart Connected Things
 
The Impact of IoT on Product Design
The Impact of IoT on Product DesignThe Impact of IoT on Product Design
The Impact of IoT on Product Design
 
Oracle Digital Business Transformation and Internet of Things by Ermin Prašović
Oracle Digital Business Transformation and Internet of Things by Ermin PrašovićOracle Digital Business Transformation and Internet of Things by Ermin Prašović
Oracle Digital Business Transformation and Internet of Things by Ermin Prašović
 
SECON'2016. Семенченко Антон, Как тренды в Мобильной разработке и Интернете в...
SECON'2016. Семенченко Антон, Как тренды в Мобильной разработке и Интернете в...SECON'2016. Семенченко Антон, Как тренды в Мобильной разработке и Интернете в...
SECON'2016. Семенченко Антон, Как тренды в Мобильной разработке и Интернете в...
 

Mais de Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

Mais de Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 

Último

SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Angel Borroy López
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Natan Silnitsky
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEEVICTOR MAESTRE RAMIREZ
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprisepreethippts
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...Technogeeks
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Matt Ray
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfFerryKemperman
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noidabntitsolutionsrishis
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 

Último (20)

SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
Alfresco TTL#157 - Troubleshooting Made Easy: Deciphering Alfresco mTLS Confi...
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
Taming Distributed Systems: Key Insights from Wix's Large-Scale Experience - ...
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
Cloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEECloud Data Center Network Construction - IEEE
Cloud Data Center Network Construction - IEEE
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
Odoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 EnterpriseOdoo 14 - eLearning Module In Odoo 14 Enterprise
Odoo 14 - eLearning Module In Odoo 14 Enterprise
 
What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...What is Advanced Excel and what are some best practices for designing and cre...
What is Advanced Excel and what are some best practices for designing and cre...
 
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
Open Source Summit NA 2024: Open Source Cloud Costs - OpenCost's Impact on En...
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
Introduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdfIntroduction Computer Science - Software Design.pdf
Introduction Computer Science - Software Design.pdf
 
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in NoidaBuds n Tech IT Solutions: Top-Notch Web Services in Noida
Buds n Tech IT Solutions: Top-Notch Web Services in Noida
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 

Powering the Internet of Things with Apache Hadoop

  • 1. 1© Cloudera, Inc. All rights reserved. Driving Insights in A Connected World Amy O’ Connor | Big Data Evangelist Vijay Raja | Solutions Marketing Powering the Internet of Things with Apache Hadoop
  • 2. 2© Cloudera, Inc. All rights reserved. Agenda & Overview • IoT – A Revolution in the Making • Key IoT Segments & Use Cases • The IoT Ecosystem • A Next gen IoT Analytics Engine • Cloudera Enterprise – Real Time Analytics for IoT • Reference Architecture • IoT – Customer Case Studies
  • 3. 3© Cloudera, Inc. All rights reserved. Internet of Things (IoT) – A Revolution In The Making $1.7 Trillion In Value 20% Annual Growth 30 Billion Things 250 Million Connected Vehicles Source - IDC & Gartner Estimates Internet of Things IoT Markets - 2020
  • 4. 4© Cloudera, Inc. All rights reserved. IoT – A Growing Universe of Connected Objects Source: McKinsey Source: Business Intelligence
  • 5. 5© Cloudera, Inc. All rights reserved. IoT Will Drive An Explosion of Data… Data expected to explode to 44 ZB by 2020 Source: IDC 44 Trillion GB!80% of data will be unstructured
  • 6. 6© Cloudera, Inc. All rights reserved. It’s all kind of meaningless unless you can make sense of all that data
  • 7. 7© Cloudera, Inc. All rights reserved. < 1% of data is currently utilized… Mostly for anomaly detection or real- time control; more can be used for optimization and prediction 1% Source: McKinsey Global - THE INTERNET OF THINGS: MAPPING THE VALUE BEYOND THE HYPE, June 2015
  • 8. 8© Cloudera, Inc. All rights reserved. Value is Maximized when Data is combined from other sources Value of Data is multiplied when you combine and correlate it with other data from relevant sources Improvement in value that can be unlocked by combining data from multiple IoT applications and sources SOURCE: McKinsey Global Institute analysis “Interoperability would significantly improve performance by combining sensor data from different machines and systems to provide decision makers with an integrated view of performance across an entire factory or oil rig.” 40%
  • 9. 9© Cloudera, Inc. All rights reserved. IoT – Key Segments & Use Cases Industrial IoT Manufacturing • Predictive Maintenance • Operations Optimization • Supply Chain Optimization Healthcare • Proactive & Connected Monitoring • Early detection & Diagnosis • Remote Measurements Energy & Utilities • Transmission & Distribution • Smart-Grid & Smart Meters • Ops & Predictive Maintenance Govt. & Public Services • Smart Cities • Traffic Optimization • Public Safety Insurance • Usage Based Insurance (UBI) • Telematics for Insurance • Insured Asset Management Retail • Automated Checkouts • Footfall Analytics & Promos • Inventory Optimization Telecommunications • Network Maintenance • Connected Homes/ Cars • Data Monetization Mobility • Telematics & Fleet Mgmt. • Tracking & Remote Monitoring • Condition Based Maintenance Consumer IoT Connected Cars • Safety & Security • Real-Time Diagnostics • In-car Connectivity & Infotainment Health & Lifestyle • Wearables • Health & Fitness Tracking • Real-Time Remote Monitoring Connected Homes • Home Automation & Security • Home Energy management • Smart Appliances Entertainment • Interconnected Smart Devices • Virtual Reality/ Interactive Gaming • Drones
  • 10. 10© Cloudera, Inc. All rights reserved. Internet of Things - Opportunity & Impact Source- Ovum: Understanding the IoT Opportunity: An Industry Perspective - 2015 Manufacturing is a future hotbed for IoT deployment The potential value that could be unlocked with IoT applications in factory settings could be as much as $3.7 trillion in 2025 - McKinsey Analysis$3.7T 50%Using real-time data to predict and prevent breakdowns can reduce downtime by 50 percent - McKinsey Analysis Reduction in Downtime 50% Healthcare: IoT can enable cutting the costs of chronic disease treatment by as much as 50 percent - McKinsey Analysis Reduction in Costs The number of connected vehicles will grow more than six fold to over 250 million by 2020 - Gartner 250 Million
  • 11. 11© Cloudera, Inc. All rights reserved. Polling Question Where are you with respect to your IoT journey? o We already have IoT use cases deployed o We are actively working on deploying IoT use cases o We are currently evaluating IoT use cases & technologies o We will potentially deploy in the next two years o We don’t see an IoT play in my organization
  • 12. 12© Cloudera, Inc. All rights reserved. Performance Monitoring & Predictive Maintenance of Heavy Equipment Challenge: • Continuously monitor performance of heavy machinery and perform predictive maintenance Solution: • Use Cloudera to parse large volume and high velocity sensor data from equipment • Process and analyze data for performance analysis, advanced defect detection HEAVY MACHINARY » INDUSTRIAL IoT » PREDICTIVE MAINTENANCE » LOWERED COSTS Industrial IoT – Heavy Machinery DATA-DRIVEN PROCESS CASE STUDY Change Image
  • 13. 13© Cloudera, Inc. All rights reserved. Advanced analytics on streaming data to reduce human space mission risks Challenge: • Over 2 TB/ hour of telemetry test data streaming in from over 1200 sensors in test environment Solution: • Cloudera cluster supporting high rate of data ingest – up to ~300MB/sec • Advanced analytics run on the streaming data to check for issues or determine patterns and reduce risk AEROSPACE » INDUSTRIAL IoT » REMOTE MONITORING » PREDICTIVE MAINTENANCE Aerospace – Spacecraft Telemetry DATA-DRIVEN PROCESS CASE STUDY
  • 14. 14© Cloudera, Inc. All rights reserved. Enabling Smart Parking in Milton Keynes with BT Challenge: • Increase efficiency and utilization of the parking spots at Milton Keynes Solution: • Sensors installed in the parking bays sent data streams to the Cloudera Data Hub hosted by BT • Real-time updates on parking availability over the web/ smartphones • Savings of £105M & reduced emissions TELCOMMUNICATIONS » SMART CITIES » IMPROVED UTILIZATION » COST REDUCTION Smart Cities CASE STUDY
  • 15. 15© Cloudera, Inc. All rights reserved. The IoT Ecosystem Consumer Industrial IoT Gateway Cloud Data Center Data Analytics Sensors/ Things
  • 16. 16© Cloudera, Inc. All rights reserved. The IoT Ecosystem Consumer Industrial IoT Gateway Data Center Data Analytics Sensors/ Things Data Characteristics • Un-structured • Intermittent • Volume & Variety Gateway • Data Routing • Edge-Processing • Edge-Storage Sensors/ Things •To grow by 50X •Drop in prices by 70% in last 5 years Data Storage, Processing & Analytics IOT Data Characteristics • More processing in the cloud • Analytics on the cloud IOT Data Analytics • Key to Value Creation • Combine data from multiple sources & types • Drive business insights IOT Data Characteristics • Distributed Data Processing • Cloud & On-Premise Cloud
  • 17. 17© Cloudera, Inc. All rights reserved. Key Attributes For Next Gen IoT Data Platform Scale efficiently based on your data growth Effectively handle multiple data-types and structures Manage the complexity of real-time IoT data ingest Fundamentally Secure Real-Time Analytics – Combine and analyze data from multiple sources Flexible deployment options - Cloud & Distributed Data Processing
  • 18. 18© Cloudera, Inc. All rights reserved. FILESYSTEM RELATIONAL Cloudera Enterprise – Making Hadoop Fast, Easy, and Secure OPERATIONS Cloudera Manager Cloudera Director DATA MANAGEMENT Cloudera Navigator Encrypt and KeyTrustee Optimizer BATCH Sqoop REAL-TIME Kafka, Flume PROCESS, ANALYZE, SERVE UNIFIED SERVICES RESOURCE MANAGEMENT YARN SECURITY Sentry, RecordService FILESYSTEM HDFS RELATIONAL Kudu NoSQL HBase STORE INTEGRATE BATCH Spark, Hive, Pig MapReduce STREAM Spark SQL Impala SEARCH Solr SDK Partners CLOUDERA ENTERPRISE
  • 19. 19© Cloudera, Inc. All rights reserved. Cloudera Enterprise – The Data & Analytics Platform for IoT Sensors/ IoT Data Sources Internal Systems External Sources BI Solutions Real-Time AppsSearch EDWDiscove r Machine Learning Data Center Cloud Sensor/ IoT Data IoT Gateway • Data Storage • Data Processing • Machine Learning • Real-time Analytics OPERATIONS Cloudera Manager Cloudera Director DATA MANAGEMENT Cloudera Navigator Encrypt and KeyTrustee Optimizer BATCH Sqoop REAL-TIME Kafka, Flume PROCESS, ANALYZE, SERVE UNIFIED SERVICES RESOURCE MANAGEMENT YARN SECURITY Sentry, RecordService FILESYSTEM HDFS RELATIONAL Kudu NoSQL HBase STORE INTEGRATE BATCH Spark, Hive, Pig MapReduce STREAM Spark SQL Impala SEARCH Solr SDK Partners
  • 20. 20© Cloudera, Inc. All rights reserved. Cloudera Enterprise – Real Time Analytics for IoT BI Solutions Real-Time AppsSearch EDWDiscover Machine Learning Deployment Flexibility Spark Streaming Leadership in Spark Integrated with EDH Flexible Storage Store any and all Data. Kudu - Fast Analytics on Fast Data Real-Time Data Processing Data Security Four pillars of security: Perimeter, Access, Visibility, and Data + Record Service Streaming Ingest Kafka & Flume - Real-Time Data Ingest for streaming, high volume data Sensor/ IoT Data Internal Systems External Sources Centralized Mgmt. Cloudera Manager for centralized cluster management Manage Multiple Clusters – On Premise or Cloud environment - On Premise or Cloud OPERATIONS Cloudera Manager Cloudera Director DATA MANAGEMENT Cloudera Navigator Encrypt and KeyTrustee Optimizer BATCH Sqoop REAL-TIME Kafka, Flume PROCESS, ANALYZE, SERVE UNIFIED SERVICES RESOURCE MANAGEMENT YARN SECURITY Sentry, RecordService FILESYSTEM HDFS RELATIONAL Kudu NoSQL HBase STORE INTEGRATE BATCH Spark, Hive, Pig MapReduce STREAM Spark SQL Impala SEARCH Solr SDK Partners
  • 21. 21© Cloudera, Inc. All rights reserved. Cloudera – IoT Technical Architecture IoT Enablers Data Ingestion (Kafka, JSON) Data Transformation and Enrichment Data Processing & Serving Rule Mining, Pattern Matching, Machine Learning Distributed Data Analytics Data Visualization Data Storage (Hbase, HDFS, Kudu)
  • 22. 22© Cloudera, Inc. All rights reserved. Enabling US Auto Manufacturer to Effectively Capture & Analyze Data from over 30,000 Sensors in Every Car Challenge: • Effectively capture and analyze data emitting from 30,000 sensors in every car Solution: • Centralized data platform from Cloudera for analytics of all of the sensor data • Monitor individual component performance and operational metrics AUTOMOBILE » CONNECTED CARS » PREDICTIVE ANALYTICS » REMOTE MONITORING Connected Cars DATA-DRIVEN PROCESS CASE STUDY
  • 23. 23© Cloudera, Inc. All rights reserved. Helping 4+ million homes save hundreds of millions of dollars in energy bills Challenge: • Bringing together diverse data sets including streaming utility& sensor data • Deriving Business Insights from all the Data Solution: • Analytical Application on Cloudera EDH • Savings of more than $320 Million for subscribers UTILITIES » CONSUMER IoT » PROCESS IMPROVEMENT » COST REDUCTION Smart Meters CASE STUDY
  • 24. 24© Cloudera, Inc. All rights reserved. Usage Based Insurance to reduce claims by a large European insurance agency Challenge: • Gather, store and analyze sensor data from millions of customers to analyze driving habits & risks Solution: • Telemetry solution on Cloudera EDH - increased policy renewal and customer satisfaction rates • Reduced the number of claims by 30% resulting in savings in millions annually INSURANCE » USAGE BASED INSURANCE » IMPROVED PROCESSES » REDUCED COSTS Usage-Based Insurance and Telematics DATA-DRIVEN PROCESS CASE STUDY
  • 25. 25© Cloudera, Inc. All rights reserved. Improve Parkinson's Disease Monitoring and Treatment through IoT Challenge: • Collect and analyze data from wearables (more than 300 readings per second) from thousands of patients in real-time Solution: • Cloudera on Intel architecture to detect patterns in patient data streaming from thousands of wearables • Continuously monitor the patients and symptoms to accelerate breakthrough on drug development HEALTHCARE » PATIENT 360° » PREDICTIVE ANALYTICS » IMPROVED CARE Smart Healthcare DATA-DRIVEN PROCESS CASE STUDY
  • 26. 26© Cloudera, Inc. All rights reserved. The Cloudera Difference Powerful Cluster Ops Trusted by the pros Cloud & Hybrid deployment Integrated with AWS & Azure Expert Support Dedicated prescriptive help, just a click away Real-Time IoT Analytics The most experience with Spark The Fastest Analytic SQL Lowest latency, best concurrency Fast, Updateable Analytic Storage High throughput, low latency, and updates Easy to ManageFast for Business Security without Compromise Enterprise Encryption Protects everything transparently Access Policy Enforcement Full-stack row/column-based RBAC & dynamic masking Automated Data Management Full-stack audit, lineage, discovery, and lifecycle Secure Operations Separation of duties, log data redaction
  • 27. 27© Cloudera, Inc. All rights reserved. Thank You

Notas do Editor

  1. IDC predicts the worldwide Internet of Things (IoT) market will grow from $655.8 billion in 2014 to $1.7 trillion in 2020 with a compound annual growth rate (CAGR) of 16.9%. The installed base of IoT endpoints will grow from 10.3 billion in 2014 to more than 29.5 billion in 2020 with a CAGR of 19.2%.
  2. IDC predicts the worldwide Internet of Things (IoT) market will grow from $655.8 billion in 2014 to $1.7 trillion in 2020 with a compound annual growth rate (CAGR) of 16.9%. The installed base of IoT endpoints will grow from 10.3 billion in 2014 to more than 29.5 billion in 2020 with a CAGR of 19.2%.
  3. Data is the key to IoT – all of the ability to gain insights out of all of this data DC’s Digital Universe study predicts the world's data will amount to 44 zettabytes by 2020, 10% of it from the internet of thing. The amount of data on the planet is set to grow 10-fold in the next six years to 2020 from around 4.4 zettabytes to 44ZB. That’s according to IDC’s annual Digital Universe study, which also predicted that, by 2020, the amount of information produced by machines, the so-called internet of things, will account for about 10% of data on earth.
  4. SLIDE 3: Takeaway — It’s all kind of meaningless unless you can make sense of all that data POV: Hadoop is a central component for success with IoT Action (General action step): Re-architect now to prepare for IoT deluge Benefit: Get ahead of IoT wave, stop talking - start doing Talking points: While internet of things holds a lot of promise, it’s all for not unless you can actually do something with the data that is the bi-product of IoT itself. The data deluge that comes with it requires a modernized approach to data management infrastructure, one that accounts for the new requirements to store, process and analyze enormous volumes of data. Successful organizations start from the architecture out are generally the most successful. Re-architecting now, at the onset of your IoT journey, will put you in a position to capitalize on all that data and deliver meaningful results. As we’ll discuss throughout this presentation, Hadoop is a central pillar to that modernized information architecture.
  5. Based on a recent report from McKinsey analysis, less than 1 percent of the data being generated by the 30,000 sensors on an offshore oil rig is currently used to make decisions. And of the data that are actually used—for example, in manufacturing automation systems on factory floors—most are used only for real-time control or anomaly detection. < 1% of data is currently utilized, mostly for anomaly detection or real-time control; more can be used for optimization and prediction algorithms. 99% of the data is not being utilized, analyzed or leveraged for business decision making. ~40% of all data is never stored; remainder is stored locally in silos for a short period but not utilized for business analytics. A critical challenge is how to utilize the flood gate of data generated by IoT devices for predication & optimization. Knowing what to do with data – predicting a machine failure before it happens. Where IoT data are being used, they are often used only for anomaly detection or real-time control, rather than for optimization or prediction, which is where much additional value can be derived. For example, in manufacturing, an increasing number of machines are “wired,” but this instrumentation is used primarily to control the tools or to send alarms when it detects something out of tolerance. The data from these tools are often not analyzed (or even collected in a place where they could be analyzed), even though the data could be used to optimize processes and head off disruptions.
  6. Importance of interoperability in generating maximum value from IoT applications. McKinsey estimates that situations in which two or more IoT systems must work together can account for about 40 percent of the total value that can be unlocked by the Internet of Things. For ex. Interoperability would significantly improve performance by combining sensor data from different machines and systems to provide decision makers with an integrated view of performance across an entire factory or oil rig.
  7. Lot of focus on Consumer IoT – around Nest Thermostats & Fitbit wearables and watches but there is huge IoT ecosystem outside of this, and outside of consumer IoT frames. In fact over 70% of the economic value in IoT will be generated in Industrial or B2B IoT settings For example Manufacturing is a huge area – where things like predictive maintenance and operations optimization can revolutionize manufacturing. Using real-time data to predict and prevent breakdowns can reduce downtime by 50 percent. Supply Chain Optimization and Real-Time View of the Supply Chain can drive down costs by 50%. Energy & Utilities: Smart Buildings: industrial zones, office parks, shopping malls, airports or seaports, IoT can help reduce the cost of energy, and building maintenance by up to 30 percent.” Healthcare: In Healthcare with connected & real-time monitoring, IoT can enable cutting the costs of chronic disease treatment by as much as 50 percent Connected cars will enable low speed crashes by 80% McKinsey estimates that B2B uses can generate nearly 70 percent of potential value enabled by IoT – Healthcare: IoT consists of technology mash-ups: devices integration, software, networks, and analytics. Two very different examples underline the spectrum of change within healthcare IoT: from niche to ubiquitous and from expected to unexpected. The deployment in a UK hospital's pediatric unit of McLaren Applied Technologies analytics technology, normally used for racing cars, for early warning of patient deterioration, is an example of how the industry can borrow innovation from other sectors. Google's ramp up of healthcare activity comes as no surprise; its nanoparticle project, designed to detect early signs of disease, exemplifies the growing raft of both tech-driven innovation and the special attention devoted to healthcare. IoT will be a key component of new mega-vendor healthcare strategies and partnerships. GE Healthcare's $1bn R&D budget for its campaign against cancer, announced in 2011, marries advanced cancer diagnostics, molecular imaging, and advanced tech for biopharmaceuticals and cancer research. Philips' rapprochement with Amazon Web Services (AWS), centered on the use of AWS's IoT platform to expand the capabilities of its HealthSuite platform, comes hot on the heels on IBM's launch of Watson Health and its string of healthcare analytics acquisitions. The involvement of
  8. Caterpillar VIMS – Vehicle Information Management System This application is an IoT application. It collects sensor data from a subset of the Caterpillar fleet and is used for performance analysis, defect detection, etc. Used mainly for processing for downstream BI and analysis applications. This use case was originally developed by Cloudera PS and has been extended and supported by Caterpillar
  9. Lockheed Martin One of the major projects is the Orion Multi-Purpose Crew Vehicle, which is designed for long-duration, human-rated deep space exploration. Orion will transport humans to interplanetary destinations beyond low Earth orbit, such as asteroids, the moon, and eventually Mars, and return them safely back to Earth. With human lives on the line, along with millions of dollars of highly technical equipment, testing of all the systems and subsystems for the Orion Space systems is long and exhaustive. Simulating the space environment on the ground and testing the system is a lengthy and expensive effort – requiring perfect results for each subsystem and its subsystems. These tests generate hundreds of megabytes per second of telemetry data that in very short time becomes petabytes of data that needs to managed, analyzed, and leveraged to validate healthy functioning of all the systems. For Orion, the telemetry is produced in a variety of simulation and test environments which includes at least 7 differ labs across the US. In simulating the real mission environment, test telemetry data is streamed to the testing system. This telemetry data contains mission-critical health information about equipment, and the test’s status. Knowing whether or not the test is progressing correctly can advise the test conductors to make decisions about the continuance or modification of a test scenario – a test that may take weeks to accomplish. Our work with the Orion Space capsule takes this streaming test data and saves it to a Hadoop-based cluster supporting high data rate ingest. Advanced analytics can be run on the streaming data to check for expected or indeterminate patterns. This method of data analytics for system testing in an online environment opens up new opportunities for the test conductors to significantly reduce the risk of missing critical test parameters. It also creates a highly cost-effective and productive test environment. Orion’s First Test: Orbited the Earth twice, traveling approximately 3,600 miles above the Earth’s surface 15 times farther than the International Space Station. Generated more than 80% of the return velocity experienced during a reentry from the moon, which allows engineers to model expected reentries from future missions in deep space. Orion’s next mission (EM-1) in 2018 2 weeks instead of 4 hours 4 times as many computers Twice as many instruments Subsystems that support Human Flight!
  10. One of the fastest growing cities in the UK, Milton Keynes has to support that expansion within local infrastructure constraints, while meeting stretching expenditure and carbon reduction targets. Joining forces with The Open University, BT and other partners, Milton Keynes Council formed a Smart City collaboration to rise to those challenges. There are around 25,000 parking spaces in Milton Keynes and forecasts suggest that perhaps as many as 12,000 more may be needed by 2020. Brian Matthews, head of transport at Milton Keynes Council, says: “If we don’t act soon, parking in Milton Keynes will become a big problem. But we know that around 7,000 existing spaces are empty at any one time and, in some cases, this is because people don’t know where to find them.” Better utilisation of existing parking spaces will save the Council a substantial sum. “It costs around £15,000 to create a new parking bay,” says Brian Matthews. “If we built new ones when there are 7,000 unused we could be wasting truly significant amounts of money.” A pilot was launched to manage the use of short-term parking spaces at Milton Keynes railway station. Designed by specialist technology provider Deteq working with BT, it involved installing sensors in each of the parking bays. Bonded to the tarmac, they’re powered by lithium-ion batteries with an over four-year lifespan. Detecting the arrival and departure of a vehicle, the sensors send information wirelessly to lamppost mounted solar powered repeaters. These aggregate the data and transmit it over the internet to the MK Data Hub, which is currently hosted by BT. There it’s processed and the resulting analysis made available on the Milton Keynes Council public information dashboard, as well as via a browser that displays bay status as red (occupied) or green (free) via an overlay to Google maps. The prize from full deployment will be a capital saving of at least £105 million, with reduced fuel use and vehicle emissions.
  11. 30-70% Drop in the price of MEMS sensors in past five years – McKinsey Research Diverse data types – from intermittent sensor readings of temperature and pressure to real-time location data or streaming live videos for video analytics Given the flexible, scalable nature of cloud-based infrastructure and the fact that machine data often originates off premises, we expect a lot of IoT data to be stored and processed in the cloud. The ideal IoT data platform can be deployed either on premise or in a public, hybrid, or private cloud environment. It should be possible to administer the platform via both a web-based interface and API calls. Gateways collects, aggregates, and optionally processes the data generated by the devices. The gateway can also accept and route commands sent from the backend to the respective device. Gateway is responsible for authenticating and authorizing the devices to participate in the workflow. It ensures secure communication between the devices and the centralized command center. The gateway is capable of dealing with multiple protocols and data formats.
  12. Some Pointers: Given the flexible, scalable nature of cloud-based infrastructure and the fact that machine data often originates off premises, we expect a lot of IoT data to be stored and processed in the cloud. The ideal IoT data platform can be deployed either on premise or in a public, hybrid, or private cloud environment. It should be possible to administer the platform via both a web-based interface and API calls. One of the scarcest resources in many IoT environments is likely to be network bandwidth, either because it is simply not available or because it is expensive. The volume, complexity, and growing geographical dispersal of IoT data calls for a database that is optimized to handle the type of data that IoT devices will generate. Support for real-time analytics and events The ability to analyze sensor data in real time is key to putting the information to work. The technology should also specify event-triggers to save the need to manually code threshold checks or event monitoring.
  13. Fast, Easy, Secure An enterprise data hub can store unlimited data, cost-effectively and reliably, for as long as you need, and lets users access that data in a variety of ways. Data can be collected, stored, processed, explored, modeled, and served in one unified platform. It’s connected to the systems you already rely on. Cloudera’s enterprise data hub, powered by Apache Hadoop, the popular open source distributed data platform, is differentiated in several crucial areas. We provide: Leading query performance. The enterprise management and governance that you require of all of your mission-critical infrastructure. Comprehensive, transparent, compliance-ready security at the core. An open source platform that is also built of open standards – projects that are supported by multiple vendors to ensure sustainability, portability, and compatibility. Our platform runs in your choice of environment, whether on-premises or in the cloud. === Cheat Sheet version: Our enterprise data hub is: One place for unlimited data Accessible to anyone Connected to the systems you already depend on Secure, governed, managed & compliant Built on open source and open standards Deployed however you want Coupled with the support and enablement you need to succeed. Important Note: Our EDH emphasizes “unified analytics” over “unified data”: It’s not practical or probable that customers will actually unify all their data. Much of it lives in the cloud or on storage (e.g. Isilon), in remote datacenters, is of uncertain value vs. cost of moving it to a hub, or security mandates preclude collocation. We enable customers to gather unlimited data, while bringing diverse processing and analytics to that data.
  14. How Cloudera’s EDH fits into the IoT Ecosystem Can ingest data from multiple sources including real-time streaming sensor data You can combine the sensor data with data other internal and external sources to drive business insights You can deploy EDH on prem (in your data center) or on hybrid cloud environments and still be able to manage it centrally And you can serve and analyze the data in a number of different ways - integrate it with existing BI solutions, do search or machine learning or integrate it with real time applications
  15. Streaming Ingest – Kafka & Flume plus Data Pipeline Visualization with our partner Streamsets Kudu – Real Time updates and real-time appends to data – Ideal for streaming data to query data as it lands Streaming Data Processing - Spark – Cloudera leadership in Spark Batch data processing – HDFS/ Hbase Capabilities Centralized Cluster Mgmt – Unified Monitoring & Troubleshooting with Cloudera Manager Deployment Flexibility - Can take this to Cloud easily (Director) – High Availability Hybrid Portable Deployment – Deploy a cluster in AWS & Google Cloud and effectively manage the Clusters with the same interface Security Features – Adding security features for Kafka + Record Service – Unified Access Policies irrespective access frameworks Build integrations between Kafka and the Gateway to push data back to the sensors
  16. Tesla Onboard sensors capture 30,000 + signal types from onboard sensors • Component data – how fast, voltages, temperatures, work performed   • Operational metrics – how many times charging port has been opened / closed, air conditioner operation metrics … What components and software were installed?
  17. Opower is a Utility Analytics company that provides 360-degree views into energy usage patterns and similar household comparisons to help consumers save energy. Challenge: With the advent of smart meters and ever-growing utility data streams, Opower recognized the need to capture, store and analyze this data in order to help consumers save energy. Solution: Opower built an analytical application on Cloudera Enterprise, leveraging Apache HBase, to bring together utility consumption data along with weather data, consumer behavior data, and other disparate sources of information. Benefit: By pulling together, processing, analyzing, and then presenting information to homeowners, Opower is helping more than four million homes save hundreds of millions of dollars on their energy bills.
  18. Assicurazioni Generali
  19. https://www.michaeljfox.org/foundation/publication-detail.html?id=555&category=7 The Michael J. Fox Foundation and Intel Join Forces to Improve Parkinson's Disease Monitoring and Treatment through Advanced Technologies August 13,2014 The Michael J. Fox Foundation for Parkinson’s Research (MJFF) and Intel Corporation announced today a collaboration aimed at improving research and treatment for Parkinson’s disease — a neurodegenerative brain disease second only to Alzheimer’s in worldwide prevalence. The collaboration includes a multiphase research study using a new big data analytics platform that detects patterns in participant data collected from wearable technologies used to monitor symptoms. This effort is an important step in enabling researchers and physicians to measure progression of the disease and to speed progress toward breakthroughs in drug development. “Nearly 200 years after Parkinson’s disease was first described by Dr. James Parkinson in 1817, we are still subjectively measuring Parkinson’s disease largely the same way doctors did then,” said Todd Sherer, PhD, CEO of The Michael J. Fox Foundation. “Data science and wearable computing hold the potential to transform our ability to capture and objectively measure patients’ actual experience of disease, with unprecedented implications for Parkinson’s drug development, diagnosis and treatment.” “The variability in Parkinson’s symptoms creates unique challenges in monitoring progression of the disease,” said Diane Bryant, senior vice president and general manager of Intel’s Data Center Group. “Emerging technologies can not only create a new paradigm for measurement of Parkinson’s, but as more data is made available to the medical community, it may also point to currently unidentified features of the disease that could lead to new areas of research.” Tracking an Invisible Enemy For nearly two decades, researchers have been refining advanced genomics and proteomics techniques to create increasingly sophisticated cellular profiles of Parkinson’s disease pathology. Advances in data collection and analysis now provide the opportunity to expand the value of this wealth of molecular data by correlating it with objective clinical characterization of the disease for use in drug development. The potential to collect and analyze data from thousands of individuals on measurable features of Parkinson’s, such as slowness of movement, tremor and sleep quality, could enable researchers to assemble a better picture of the clinical progression of Parkinson’s and track its relationship to molecular changes. Wearables can unobtrusively gather and transmit objective, experiential data in real time, 24 hours a day, seven days a week. With this approach, researchers could go from looking at a very small number of data points and burdensome pencil-and-paper patient diaries collected sporadically to analyzing hundreds of readings per second from thousands of patients and attaining a critical mass of data to detect patterns and make new discoveries. MJFF and Intel initiated a study earlier this year to evaluate the usability and accuracy of wearable devices for tracking agreed physiological features from participants and to develop a big data analytics platform to collect and analyze the data. The participants (16 Parkinson’s patients and nine control volunteers) wore the devices during two clinic visits and at home continuously over four days. Bret Parker, 46, of New York, is living with Parkinson’s and participated in the study. “I know that many doctors tell their patients to keep a log to track their Parkinson’s,” said Parker. “I am not a compliant patient on that front. I pay attention to my Parkinson’s, but it’s not everything I am all the time. The wearables did that monitoring for me in a way I didn’t even notice, and the study allowed me to take an active role in the process for developing a cure.” Intel data scientists are now correlating the collected data to clinical observations and patient diaries to gauge the devices’ accuracy, and are developing algorithms to measure symptoms and disease progression. Later this year, Intel and MJFF plan to launch a new mobile application that enables patients to report their medication intake as well as how they are feeling. The effort is part of the next phase of the study to enable medical researchers to study the effects of medication on motor symptoms via changes detected in sensor data from wearable devices. Collecting, Storing and Analyzing the Data To analyze the volume of data — more than 300 observations per second from each patient — Intel developed a big data analytics platform that integrates a number of software components including Cloudera® CDH* — an open-source software platform that collects, stores, and manages data. The data platform is deployed on a cloud infrastructure optimized on Intel® architecture, allowing scientists to focus on research rather than the underlying computing technologies. The platform supports an analytics application developed by Intel to process and detect changes in the data in real time. By detecting anomalies and changes in sensor and other data, the platform can provide researchers with a way to measure the progression of the disease objectively. In the near future, the platform could store other types of data such as patient, genome and clinical trial data. In addition, the platform could enable other advanced techniques such as machine learning and graph analytics to deliver more accurate predictive models that researchers could use to detect change in disease symptoms. These advances could provide unprecedented insights into the nature of Parkinson’s disease, helping scientists measure the efficacy of new drugs and assisting physicians with prognostic decisions. Shared Commitment to Open-Access Data MJFF and Intel share a commitment to increasing the rate of progress made possible by open access to data. The organizations aim to share data with the greater Parkinson’s community of physicians and researchers as well as invite them to submit their own de-identified patient and subject data for analysis. Teams may also choose to contribute de-identified patient data for inclusion in broader, population-scale studies. The Foundation has previously made de-identified data and bio-samples from its sponsored studies available to qualified researchers, including from individuals with a Parkinson’s-implicated mutation in their LRRK2 gene. MJFF has also opened access to resources from its landmark biomarker study the Parkinson’s Progression Markers Initiative (PPMI) since it launched in 2010. Parkinson’s scientists around the world have downloaded PPMI data more than 235,000 times to date.
  20. Every Hadoop platform gives you scalability and flexibility. Cloudera makes Hadoop fast, easy, and secure. Trap Questions: Spark: What matters to you in supporting Spark and Hadoop? Impala: How many BI users will you have? What additional budget have you allocated for Hive? Kudu: How do you plan to address operational data warehouse / time series use cases? Cloudera Navigator Optimizer: How do you know what data should be in Hadoop vs existing systems? Trap Questions: Cloudera Manager: How much downtime are you willing to accept during an upgrade? What if your operations tools fail during an outage? What does your team need to debug critical and latent issues? Cloudera Director: Where is your data being created? How do you plan to manage across environments? Are you prepared to train staff on both Amazon and on-premises Hadoop platforms? Expert Support: How can a core R&D group simultaneously respond to frequent customer issues and also build a culture of innovation? [only Cloudera has a back-line support team to address issues without bringing in R&D] Trap Questions: Navigator Encrypt/KeyTrustee: What is the impact of an information leak from intermediate MR results, log files, etc? Sentry/RecordService: How are you planning to secure access to sensitive data across Hive and Spark? Navigator: Do your governance needs extend beyond Hive? Manager: How will you keep end users from damaging your production environments?