SlideShare uma empresa Scribd logo
1 de 36
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Welcome to the Waitless World
Benefits of Transferring
Real-time Data to
Hadoop at Scale
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Guest Speakers
Ali Bajwa
Principal Partner Solutions Engineer, Hortonworks
Steve Roberts
Offering Manager, Power Systems Big Data & Analytics Solutions, IBM
Dan Potter
VP of Product Management & Marketing, Attunity
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
• Connected customers,
vehicles, devices
• Socially crowd-sourced
requirements
• Digital design and
analysis
• Digital prototypes and
tests (simulations)
• Connected factories,
sensors, devices
• Human-robotic
interaction
• 3D-printing on demand
• Connected trucks,
inventory
• Location, traffic,
weather-aware
distribution
• Real-time inventory
visibility
• Dynamic rerouting
• Connected customers,
devices
• Omni-channel demand
sensing
• Real-time
Recommendations
• Connected assets
• Remote service
monitoring & delivery
• Predictive maintenance
• OTA Updates
Development Manufacturing Distribution Marketing/Sales Service
The New Way of Business Is Fueled by Connected Data
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Technology Trends: Shifting the Data Paradigm
Artificial IntelligenceInternet of Things Cloud Computing Streaming Data
Industrial Internet
Connected business
Consumer devices
Smart devices
Autonomy
Prescriptive analytics
SaaS/PaaS applications
Ephemeral use cases
Operational efficiency
Collaboration
Real-time applications
Targeted retail
Recommendations
Industrial applications
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Hortonworks Enabling the Modern Data
Architecture
• Our durable and reliable mission continues…
• Make Hadoop an enterprise viable data platform
• Bring all data under management—all sources and types
• Enable pre and post transaction analysis
Hortonworks consistent and continuous track record of innovation
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Powering the Modern Data Architecture
DATA AT RESTDATA IN MOTION
ACTIONABLE
INTELLIGENCE
COMPLETE DATA
LIFECYCLE
MANAGEMENT
RUN CONTAINERIZED
APPLICATIONS
CONCURRENTLY
EDGECLOUD
H O L I S T I C M A N A G E M E N T , G O V E R N A N C E A N D S E C U R I T Y
ON-PREMISES
MULTI-WORKLOADS MULTI-TYPE MULTI-TIER
Data Science SQL Query Engine
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Hortonworks Value: Platform Flexibility
CloudSensors/Sources
 Constrained
 High-latency
 Localized context
 On-premise and cloud
 Low-latency
 Global context
Data Centers
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Hortonworks DataFlow and Analytics Reference Platform
Applications
Edge/Sensor/3rd Party Data Flow and Streaming Analytics and Data Science
Field Data Capture Office, Datacenter or Cloud
Industrial Protocols such as
OPC
Files / Other Unstructured
Data
Video
IoT Gateways
PLC / RTU
SCADA, DCS, Historians Hortonworks Data Platform
SQL
Hortonworks DataFlow
Data Flow
Managemen
t
Message
Queues
Stream
Processing
In-stream
Analytics
NoSQL
Machine
Learning
Resource Management
Distributed File StorageStructured Data Sets
Location 1
Time Series
Storage
Data
Acquisitio
n
Event
Processin
g
Location N
Time Series
Storage
Data
Acquisitio
n
Event
Processin
g
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Complementing Attunity and IBM Ecosystem
Applications
Edge/Sensor/3rd Party Data Flow and Streaming Analytics and Data Science
Field Data Capture Office, Datacenter or Cloud
Industrial Protocols such as
OPC
Files / Other Unstructured
Data
Video
IoT Gateways
PLC / RTU
SCADA, DCS, Historians Hortonworks Data Platform
SQL
Hortonworks DataFlow
Data Flow
Managemen
t
Message
Queues
IBM
Stream
Computing
In-stream
Analytics
NoSQL
Machine
Learning
Resource Management
Distributed File StorageStructured Data Sets
Location 1
Time Series
Storage
Data
Acquisitio
n
Event
Processin
g
Location N
Time Series
Storage
Data
Acquisitio
n
Event
Processin
g
IBM
Bluemix
IBM
Bluemix
IBM Spectrum Scale
IBM Watson
IBM Watson
IBM also resells HDP and HDF
IBM Big
SQL
DATA INGESTION
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Hortonworks DataPlane Service
A common set of services that:
⬢ Supports enterprise deployment strategy
and move to the cloud
⬢ Addresses compliance and regulatory
requirements for enterprise
⬢ Eliminates policy silos and ensures security
& governance moves with data
⬢ Simplifies data asset management and
provides access for analysts and data
scientists
⬢ Extensible to new services: Services
enablement layer brings new offerings to
market rapidly
Next Chapter: Announcing Hortonworks DataPlane Service
Enabling the Modern Data Architecture
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Enterprise Data Science at Scale
Enterprise- Grade
Leverage enterprise-
grade security,
governance and
operations
Tools
Enhance productivity by
enabling data scientists
to use their favorite
tools, technologies and
libraries
Deployment
Compress the time to
insight by deploying
models into production
faster
Data
Build more robust
models by using all the
data in the data lake
The Power of Data Science for your Enterprise
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
DATA AT
REST
DATA-IN-MOTION
HDP®
HORTONWORKS
DATA PLATFORM
Powered by Apache Hadoop®
HDF™
HORTONWORKS
DATAFLOW
Powered by Apache™ NiFi
DATA-AT-REST
Powering Modern
Data Applications
IBM Analytics  Hortonworks Resell
IBM DSX
IBM BigSQL
IBM Analytics  Re-sell
BigInsights’ existing customers migrated to HDP
IBM resells HDP & HDF
IBM Systems  Co-Sell
• IBM Power Systems (Compute)
• IBM Spectrum Scale (Storage)
+
Bringing it all Together
DATA INGESTION
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Centralized
Mainframes
Cognitive Era
E-Business
Distributed
Computing
Smarter Planet
Office
Productivity
Client/
Server
Personal
Computer
Data
Warehousing
Big Data &
Predictive Analytics
Cognitive
A new era of computing has emerged
Data InsightContext
Transactional
Database
Business
Intelligence
Big Data &
Analytics
Actionable
Insight in context
Reporting
Cloud
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Accelerated compute and storage delivered
on prem, in the cloud or via Watson
Power Systems is now part of
Cognitive Systems
REINVENTING COMPUTING FOR DATA-INTENSIVE
AND COGNITIVE WORKLOADS
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Open to the core for true differentiation in performance & cost
315+ OpenPOWER members across 31 countries
Ecosystem-driven
Customer Choice
Growing ecosystem of
OpenPOWER Servers
Growing ecosystem of
OpenPOWER Innovation
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Power Systems S822LC for Big Data - Not Just Another Intel Server
Linux by Redhat:
Redhat 7.2 Linux OS
Mellanox: InfiniBand/Ethernet Connectivity
in and out of server
HGST: Optional NVMe Adapters
Alpha Data with Xilinx FPGA:
Optional CAPI Accelerator
Broadcom: Optional PCIe Adapters
QLogic: Optional Fiber Channel PCIe
Samsung: SSDs & NVMe
Hynix, Samsung, Micron: DDR4
NVIDIA:
Tesla K80 GPU Accelerator
IBM: POWER8 CPU
16
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation 17
Available until Dec 31, 2017!
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
TCO at Scale with HDP on Power Systems with Elastic
Storage Server
18
• Up to 3X reduction of storage and
compute infrastructure moving to
Power Systems and Elastic Storage
Server vs commodity scale out x86
• More flexible and scalable vs EMC
Isilon using IBM Spectrum Scale
• Position for future growth, avoid
hitting the data center wall with
cluster sprawl
E E
InfiniBand (RDMA) / 40 GigE / 10 GigE
Scale Compute Nodes
• IBM Power Systems
• Only Hadoop services
and HDFS client
ESS
HDP HDP HDP HDP HDP
ESS
Elastic Storage Server
(Powered by Spectrum Scale and Power Systems)
C C C C CC
C Spectrum Scale Clientv
HDP Hortonworks Data Platform
Scale Storage as Required
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Artificial
Intelligence
and
Cognitive
Applications
Machine
Learning
Deep
Learning
(Neural Networks)
The deeper you go, the more value you gain, and the more you know
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
simple
machine
learning
deep
learning
accident
risk
rate
90%
inspection
times
10X
number of
inspections
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
enterprise-ready
software distribution
built on open source
tools for ease
of development
performance
faster training times
for data scientists
+
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
9Days
Acceleration training …. days become hours
4Hours
Recognition
Shape
Attenuation
Boundary
Recognition
Shape
Attenuation
Boundary
54x
Learning
runs with
Power 8
4Hours
4Hours
4Hours
4Hours
. . . . . . .
. . . . . . .
.
4Hours
What will you do?
Iterate more and create more accurate models?
Create more models?
Both?
IBM S822LC for HPC
Data Integration
for Modern
Analytics
Data Ingestion Patterns
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Data Integration for Modern Analytics
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Modern Data Ingest
METADATA
HIVE
OPTIMIZED
STREAM
OPTIMIZED
CHANGE DATA CAPTURE
CLOUD ON PREM
WAREHOUSE MAINFRAME RDBMS SAP
CDC (log-based) for high
performance, low latency and
low impact
Single platform for all key
enterprise systems
Hive-optimized for HDP and
Stream-optimized for HDF
Point-and-Click with NO
coding and NO agents
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
In Memory and File Optimized Data Transport
Real-Time Data Integration
Streaming Change Data Capture (CDC)
– Apply transactions
sequentially
– Stream batched changes
– Integrate with DW native
loaders to ingest and
merge
– Stream changes to Kafka
message brokers
R1
R1
R2
R1
R2
R1
R2Batch CDC
Data Warehouse
Ingest-Merge
SQL
n 2 1
SQL SQL
Transactional CDC
Message Encoded
CDC
Flexible Real-Time Options
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Simplify Data Integration
Zero Footprint Architecture
– CDC identifies source
updates by scanning
change logs
– No software agents
required on sources or
targets
– Minimal administrative
tasks
• Log based CDC
• Source specific optimization
Hadoop
File
s
RDBMS
EDW
Mainframe
Hadoop
Files
RDBMS
EDW
Kafka
Streamlined Process
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Simplify Data Integration
Go Agile with Automation
– No manual coding
– Automated end-to-end
– Optimized and configurable
• Target schema creation
• Heterogeneous data type
mapping
• Batch to CDC transition
• DDL change propagation
• Filtering
• Transformations
Hadoop
File
s
RDBMS
Mainframe
Hadoop
Files
RDBMS
Kafka
EDW EDW
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
– Intuitive web-based GUI
– Drag and drop, wizard-assisted
configuration steps
– Consistent process for all
sources and targets
Simplify Data Integration
Guided User Experience
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
zzzz
zz
RDBMS
Oracle
SQL Server
DB2 iSeries
DB2 z/OS
DB2 LUW
MySQL
PostgeSQL
Sybase ASE
Informix
DW
Exadata
Teradata
Netezza
Vertica
Hortonworks
Cloudera
MapR
HADOOP
DB2 for z/OS
IMS/DB
VSAM
SQL/MP
Enscribe
RMS
MAINFRAME
AWS RDS
Salesforce
Snowflake
CLOUD
RDBMS
Oracle
SQL Server
DB2 LUW
MySQL
PostgreSQL
Sybase ASE
Informix
DW
Microsoft PDW
Exadata
Teradata
Netezza
Vertica
Sybase IQ
Amazon Redshift
Actian Vector
SAP HANA
Hortonworks
Cloudera
MapR
Pivotal
Amazon EMR
HADOOP
MongoDB
NOSQL
Amazon RDS
Amazon Redshift
Amazon EMR
Google Cloud SQL
Google Cloud
Dataproc
Azure SQL DW
Azure SQL DB
CLOUD
Azure Event Hubs
Kafka
MapR
STREAMING
TARGETS
SOURCES
SAP
ECC on Oracle
ECC on SQL
ECC on DB2
SAP
HANA
12
Universal Data Integration
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Feeding the Data Lake with Attunity Replicate
Results
4500 applications
DB2 MF SQL Oracle
• Consolidating massive data
volumes for global analytics
• Hadoop Data Lake with Kafka
• Minimizing labor and cost
• Realizing faster insights and
competitive advantage
Fortune 100 auto maker
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
The results are impressive!
3x
Faster!
+
+
3 x faster than alternative solutions
Q&A
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
REFERENCE CHARTS
35
© 2015 IBM Corporation
Welcome to the Waitless World
© 2017 IBM Corporation
Hortonworks HDP 3X POWER8 Price-Performance
Guarantee
36
IBM Power Systems guarantees the Power S822LC for Big Data system built with POWER8 delivers at least a 3X price-performance
advantage vs. x86 based results when running a customer application/workload with Tez/Hive LLAP on Hortonworks HDP under the
conditions noted below. A Worker Node is a server carrying out the HDP query functions, with one Worker Node per server.
3X price-performance means that the customer's documented throughput performance on the cluster of S822LC for Big Data Worker Nodes divided by the
price of the cluster of Worker Nodes will be at least 3 times higher than the customer's documented throughput performance on the cluster of x86 based
Worker Nodes divided by the price of the cluster of x86 Worker Nodes.
EX: If queries per second on the cluster of S822LC Worker Nodes are 30,000 and 10,000 on the cluster of x86 based Worker Nodes, while the price of the S822LC Worker Node cluster is
$10,000, and the price of the x86 based Worker Node cluster is $10,000, then the Throughput Performance Per Price would be exactly 3 times higher and the guarantee would be met."
Notes:
1. Client’s Power S822LC for BD Worker Nodes and the x86 Worker Nodes must be running at similar utilization rates of at least 50% or higher, using the same software stack as described in Note #4, and which are configured similarly.
2. Client’s Power S822LC for BD performance cannot be constrained by I/O subsystem. Specifically, the I/O subsystem on the Power S822LC for BD Worker Node must achieve greater than or equal I/O bandwidth and operations per second than
the x86 Worker Node.
3. Client’s Power S822LC for BD Worker Node’s physical memory must be the same or greater than the physical memory on the x86 Worker Node.
4. Applicable software stack is Tez/Hive LLAP on HDP 2.6 or later for both the Power S822LC and x86-based Worker Nodes.
5. Client is responsible for demonstrating comparable real-world representative workload between the Power S822LC for BD Worker Node and the x86 Worker Node through the use of the IBM provided tools and comparable tools on x86 systems.
6. 3X guarantee is based on a list price for x86 servers from Dell, Cisco, HP or Lenovo based on E5-2600 v4 or earlier processor technology and the IBM S822LC for Big Data.
The IBM Power S822LC for Big Data servers (22-core/2.89 GHz) used as Worker Nodes must be purchased from IBM or an authorized IBM Business Partner prior to
September 30, 2017. The guarantee period is valid for three (3) months from the date of purchase. The x86-based Worker Nodes must be comparably configured
branded servers from Cisco, Dell, HP, or Lenovo and the client is responsible for all Hortonworks licenses.
3X throughput performance per price means that the customer's documented throughput performance on the cluster of Power S822LC for BD Worker Nodes based on
either queries, operations or transactions per second divided by the price of the cluster of Worker Nodes will be at least 3 times higher than the customer's same
documented throughput performance on the cluster of x86 Worker Nodes divided by the price of said cluster of x86 Worker Nodes.
Remediation: IBM will provide additional performance optimization and tuning services consistent with IBM Best Practices, at no charge. If unable to reach the
guaranteed level of price-performance, IBM will provide additional equally configured Worker Nodes to those already purchased to reach the guaranteed level of price-
performance.
Only Available until Dec 31, 2017!

Mais conteúdo relacionado

Mais procurados

Multi-tenant Hadoop - the challenge of maintaining high SLAS
Multi-tenant Hadoop - the challenge of maintaining high SLASMulti-tenant Hadoop - the challenge of maintaining high SLAS
Multi-tenant Hadoop - the challenge of maintaining high SLASDataWorks Summit
 
10 Lessons Learned from Meeting with 150 Banks Across the Globe
10 Lessons Learned from Meeting with 150 Banks Across the Globe10 Lessons Learned from Meeting with 150 Banks Across the Globe
10 Lessons Learned from Meeting with 150 Banks Across the GlobeDataWorks Summit
 
The Modern Data Platform - How to Conquer a New World with Old Problems
The Modern Data Platform - How to Conquer a New World with Old ProblemsThe Modern Data Platform - How to Conquer a New World with Old Problems
The Modern Data Platform - How to Conquer a New World with Old ProblemsDataWorks Summit/Hadoop Summit
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleHortonworks
 
Postgres Vision 2018: How to Consume your Database Platform On-premises
Postgres Vision 2018: How to Consume your Database Platform On-premisesPostgres Vision 2018: How to Consume your Database Platform On-premises
Postgres Vision 2018: How to Consume your Database Platform On-premisesEDB
 
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...DataWorks Summit
 
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...DataWorks Summit
 
Sprint's Data Modernization Journey
Sprint's Data Modernization JourneySprint's Data Modernization Journey
Sprint's Data Modernization JourneyHortonworks
 
Journey to Big Data: Main Issues, Solutions, Benefits
Journey to Big Data: Main Issues, Solutions, BenefitsJourney to Big Data: Main Issues, Solutions, Benefits
Journey to Big Data: Main Issues, Solutions, BenefitsDataWorks Summit
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB
 
Securing and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industrySecuring and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industryDataWorks Summit
 
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...DataWorks Summit
 
Making Bank Predictive and Real-Time
Making Bank Predictive and Real-TimeMaking Bank Predictive and Real-Time
Making Bank Predictive and Real-TimeDataWorks Summit
 
Data Science at Speed. At Scale.
Data Science at Speed. At Scale.Data Science at Speed. At Scale.
Data Science at Speed. At Scale.DataWorks Summit
 
Data Centric Transformation in Telecom
Data Centric Transformation in TelecomData Centric Transformation in Telecom
Data Centric Transformation in TelecomDataWorks Summit
 
Georgia Azure Event - Scalable cloud games using Microsoft Azure
Georgia Azure Event - Scalable cloud games using Microsoft AzureGeorgia Azure Event - Scalable cloud games using Microsoft Azure
Georgia Azure Event - Scalable cloud games using Microsoft AzureMicrosoft
 
Use dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application codeUse dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application codeDataWorks Summit
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data editionMark Kerzner
 
Enterprise Data Warehouse Optimization: 7 Keys to Success
Enterprise Data Warehouse Optimization: 7 Keys to SuccessEnterprise Data Warehouse Optimization: 7 Keys to Success
Enterprise Data Warehouse Optimization: 7 Keys to SuccessHortonworks
 

Mais procurados (20)

Multi-tenant Hadoop - the challenge of maintaining high SLAS
Multi-tenant Hadoop - the challenge of maintaining high SLASMulti-tenant Hadoop - the challenge of maintaining high SLAS
Multi-tenant Hadoop - the challenge of maintaining high SLAS
 
10 Lessons Learned from Meeting with 150 Banks Across the Globe
10 Lessons Learned from Meeting with 150 Banks Across the Globe10 Lessons Learned from Meeting with 150 Banks Across the Globe
10 Lessons Learned from Meeting with 150 Banks Across the Globe
 
The Modern Data Platform - How to Conquer a New World with Old Problems
The Modern Data Platform - How to Conquer a New World with Old ProblemsThe Modern Data Platform - How to Conquer a New World with Old Problems
The Modern Data Platform - How to Conquer a New World with Old Problems
 
Hadoop dev 01
Hadoop dev 01Hadoop dev 01
Hadoop dev 01
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at Scale
 
Postgres Vision 2018: How to Consume your Database Platform On-premises
Postgres Vision 2018: How to Consume your Database Platform On-premisesPostgres Vision 2018: How to Consume your Database Platform On-premises
Postgres Vision 2018: How to Consume your Database Platform On-premises
 
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
Not Just a necessary evil, it’s good for business: implementing PCI DSS contr...
 
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
Running Enterprise Workloads with an open source Hybrid Cloud Data Architectu...
 
Sprint's Data Modernization Journey
Sprint's Data Modernization JourneySprint's Data Modernization Journey
Sprint's Data Modernization Journey
 
Journey to Big Data: Main Issues, Solutions, Benefits
Journey to Big Data: Main Issues, Solutions, BenefitsJourney to Big Data: Main Issues, Solutions, Benefits
Journey to Big Data: Main Issues, Solutions, Benefits
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
 
Securing and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industrySecuring and governing a multi-tenant data lake within the financial industry
Securing and governing a multi-tenant data lake within the financial industry
 
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...
Data Offload for the Chief Data Officer – how to move data onto Hadoop withou...
 
Making Bank Predictive and Real-Time
Making Bank Predictive and Real-TimeMaking Bank Predictive and Real-Time
Making Bank Predictive and Real-Time
 
Data Science at Speed. At Scale.
Data Science at Speed. At Scale.Data Science at Speed. At Scale.
Data Science at Speed. At Scale.
 
Data Centric Transformation in Telecom
Data Centric Transformation in TelecomData Centric Transformation in Telecom
Data Centric Transformation in Telecom
 
Georgia Azure Event - Scalable cloud games using Microsoft Azure
Georgia Azure Event - Scalable cloud games using Microsoft AzureGeorgia Azure Event - Scalable cloud games using Microsoft Azure
Georgia Azure Event - Scalable cloud games using Microsoft Azure
 
Use dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application codeUse dependency injection to get Hadoop *out* of your application code
Use dependency injection to get Hadoop *out* of your application code
 
Oil and gas big data edition
Oil and gas  big data editionOil and gas  big data edition
Oil and gas big data edition
 
Enterprise Data Warehouse Optimization: 7 Keys to Success
Enterprise Data Warehouse Optimization: 7 Keys to SuccessEnterprise Data Warehouse Optimization: 7 Keys to Success
Enterprise Data Warehouse Optimization: 7 Keys to Success
 

Destaque

Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseStreamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseHortonworks
 
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017 Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017 Hortonworks
 
Eric Baldeschwieler Keynote from Storage Developers Conference
Eric Baldeschwieler Keynote from Storage Developers ConferenceEric Baldeschwieler Keynote from Storage Developers Conference
Eric Baldeschwieler Keynote from Storage Developers ConferenceHortonworks
 
Hortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptxHortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptxHortonworks
 
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks
 
Apache NiFi- MiNiFi meetup Slides
Apache NiFi- MiNiFi meetup SlidesApache NiFi- MiNiFi meetup Slides
Apache NiFi- MiNiFi meetup SlidesIsheeta Sanghi
 
How to Architect and Omnichannel Retail Solution to Achieve Real-Time Custome...
How to Architect and Omnichannel Retail Solution to Achieve Real-Time Custome...How to Architect and Omnichannel Retail Solution to Achieve Real-Time Custome...
How to Architect and Omnichannel Retail Solution to Achieve Real-Time Custome...Hortonworks
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsHortonworks
 
Cloudera and Qlik: Big Data Analytics for Business
Cloudera and Qlik: Big Data Analytics for BusinessCloudera and Qlik: Big Data Analytics for Business
Cloudera and Qlik: Big Data Analytics for BusinessData IQ Argentina
 
Security implementation on hadoop
Security implementation on hadoopSecurity implementation on hadoop
Security implementation on hadoopWei-Chiu Chuang
 
The Evolution of Data Architecture
The Evolution of Data ArchitectureThe Evolution of Data Architecture
The Evolution of Data ArchitectureWei-Chiu Chuang
 
Partner Ecosystem Showcase for Apache Ranger and Apache Atlas
Partner Ecosystem Showcase for Apache Ranger and Apache AtlasPartner Ecosystem Showcase for Apache Ranger and Apache Atlas
Partner Ecosystem Showcase for Apache Ranger and Apache AtlasDataWorks Summit
 
Building the Ideal Stack for Real-Time Analytics
Building the Ideal Stack for Real-Time AnalyticsBuilding the Ideal Stack for Real-Time Analytics
Building the Ideal Stack for Real-Time AnalyticsSingleStore
 
Webinar - Sehr empfehlenswert: wie man aus Daten durch maschinelles Lernen We...
Webinar - Sehr empfehlenswert: wie man aus Daten durch maschinelles Lernen We...Webinar - Sehr empfehlenswert: wie man aus Daten durch maschinelles Lernen We...
Webinar - Sehr empfehlenswert: wie man aus Daten durch maschinelles Lernen We...Cloudera, Inc.
 
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...Spark Summit
 
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Cloudera, Inc.
 
CWIN17 Frankfurt / Cloudera
CWIN17 Frankfurt / ClouderaCWIN17 Frankfurt / Cloudera
CWIN17 Frankfurt / ClouderaCapgemini
 
빅데이터윈윈 컨퍼런스_데이터시각화자료
빅데이터윈윈 컨퍼런스_데이터시각화자료빅데이터윈윈 컨퍼런스_데이터시각화자료
빅데이터윈윈 컨퍼런스_데이터시각화자료ABRC_DATA
 

Destaque (20)

Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSenseStreamline Apache Hadoop Operations with Apache Ambari and SmartSense
Streamline Apache Hadoop Operations with Apache Ambari and SmartSense
 
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017 Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
 
Eric Baldeschwieler Keynote from Storage Developers Conference
Eric Baldeschwieler Keynote from Storage Developers ConferenceEric Baldeschwieler Keynote from Storage Developers Conference
Eric Baldeschwieler Keynote from Storage Developers Conference
 
Hortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptxHortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptx
 
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
Hortonworks Data in Motion Webinar Series Part 7 Apache Kafka Nifi Better Tog...
 
Apache NiFi- MiNiFi meetup Slides
Apache NiFi- MiNiFi meetup SlidesApache NiFi- MiNiFi meetup Slides
Apache NiFi- MiNiFi meetup Slides
 
How to Architect and Omnichannel Retail Solution to Achieve Real-Time Custome...
How to Architect and Omnichannel Retail Solution to Achieve Real-Time Custome...How to Architect and Omnichannel Retail Solution to Achieve Real-Time Custome...
How to Architect and Omnichannel Retail Solution to Achieve Real-Time Custome...
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
 
Cloudera and Qlik: Big Data Analytics for Business
Cloudera and Qlik: Big Data Analytics for BusinessCloudera and Qlik: Big Data Analytics for Business
Cloudera and Qlik: Big Data Analytics for Business
 
Security implementation on hadoop
Security implementation on hadoopSecurity implementation on hadoop
Security implementation on hadoop
 
Softnix Messaging Server
Softnix Messaging ServerSoftnix Messaging Server
Softnix Messaging Server
 
The Evolution of Data Architecture
The Evolution of Data ArchitectureThe Evolution of Data Architecture
The Evolution of Data Architecture
 
Partner Ecosystem Showcase for Apache Ranger and Apache Atlas
Partner Ecosystem Showcase for Apache Ranger and Apache AtlasPartner Ecosystem Showcase for Apache Ranger and Apache Atlas
Partner Ecosystem Showcase for Apache Ranger and Apache Atlas
 
Building the Ideal Stack for Real-Time Analytics
Building the Ideal Stack for Real-Time AnalyticsBuilding the Ideal Stack for Real-Time Analytics
Building the Ideal Stack for Real-Time Analytics
 
Webinar - Sehr empfehlenswert: wie man aus Daten durch maschinelles Lernen We...
Webinar - Sehr empfehlenswert: wie man aus Daten durch maschinelles Lernen We...Webinar - Sehr empfehlenswert: wie man aus Daten durch maschinelles Lernen We...
Webinar - Sehr empfehlenswert: wie man aus Daten durch maschinelles Lernen We...
 
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
Apache Spark—Apache HBase Connector: Feature Rich and Efficient Access to HBa...
 
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets

 
CWIN17 Frankfurt / Cloudera
CWIN17 Frankfurt / ClouderaCWIN17 Frankfurt / Cloudera
CWIN17 Frankfurt / Cloudera
 
Softnix Security Data Lake
Softnix Security Data Lake Softnix Security Data Lake
Softnix Security Data Lake
 
빅데이터윈윈 컨퍼런스_데이터시각화자료
빅데이터윈윈 컨퍼런스_데이터시각화자료빅데이터윈윈 컨퍼런스_데이터시각화자료
빅데이터윈윈 컨퍼런스_데이터시각화자료
 

Semelhante a Benefits of Transferring Real-Time Data to Hadoop at Scale

IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter AnalyticsAdrian Turcu
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudDataWorks Summit/Hadoop Summit
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixNicolas Morales
 
IMS integration 2017
IMS integration 2017IMS integration 2017
IMS integration 2017Helene Lyon
 
Accelerating Innovation with Hybrid Cloud
Accelerating Innovation with Hybrid CloudAccelerating Innovation with Hybrid Cloud
Accelerating Innovation with Hybrid CloudJeff Jakubiak
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsDataWorks Summit
 
Real-time Big Data Analytics in the IBM SoftLayer Cloud with VoltDB
Real-time Big Data Analytics in the IBM SoftLayer Cloud with VoltDBReal-time Big Data Analytics in the IBM SoftLayer Cloud with VoltDB
Real-time Big Data Analytics in the IBM SoftLayer Cloud with VoltDBVoltDB
 
Why Infrastructure matters?!
Why Infrastructure matters?!Why Infrastructure matters?!
Why Infrastructure matters?!Gabi Bauer
 
Get Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceGet Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceIBM Cloud Data Services
 
The sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of ThingsThe sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of ThingsStephan Reimann
 
High Value Business Intelligence for IBM Platform compute environments
High Value Business Intelligence for IBM Platform compute environmentsHigh Value Business Intelligence for IBM Platform compute environments
High Value Business Intelligence for IBM Platform compute environmentsGabor Samu
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceKaran Sachdeva
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise DataWorks Summit
 
Digital Reinvention by NRB
Digital Reinvention by NRBDigital Reinvention by NRB
Digital Reinvention by NRBWilliam Poos
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data AnalyticsAmazon Web Services
 
Transformation Track AWS Cloud Experience Argentina - Why Enterprise Workload...
Transformation Track AWS Cloud Experience Argentina - Why Enterprise Workload...Transformation Track AWS Cloud Experience Argentina - Why Enterprise Workload...
Transformation Track AWS Cloud Experience Argentina - Why Enterprise Workload...Amazon Web Services LATAM
 
How to Increase Performance in IBM Cognos
How to Increase Performance in IBM CognosHow to Increase Performance in IBM Cognos
How to Increase Performance in IBM CognosCresco International
 
IMS08 the momentum driving the ims future
IMS08   the momentum driving the ims futureIMS08   the momentum driving the ims future
IMS08 the momentum driving the ims futureRobert Hain
 
SendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingSendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingAmazon Web Services
 

Semelhante a Benefits of Transferring Real-Time Data to Hadoop at Scale (20)

IBM Smarter Analytics
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
 
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the CloudBring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
Bring your SAP and Enterprise Data to Hadoop, Apache Kafka and the Cloud
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
 
IMS integration 2017
IMS integration 2017IMS integration 2017
IMS integration 2017
 
Accelerating Innovation with Hybrid Cloud
Accelerating Innovation with Hybrid CloudAccelerating Innovation with Hybrid Cloud
Accelerating Innovation with Hybrid Cloud
 
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for AnalyticsVerizon Centralizes Data into a Data Lake in Real Time for Analytics
Verizon Centralizes Data into a Data Lake in Real Time for Analytics
 
Real-time Big Data Analytics in the IBM SoftLayer Cloud with VoltDB
Real-time Big Data Analytics in the IBM SoftLayer Cloud with VoltDBReal-time Big Data Analytics in the IBM SoftLayer Cloud with VoltDB
Real-time Big Data Analytics in the IBM SoftLayer Cloud with VoltDB
 
Why Infrastructure matters?!
Why Infrastructure matters?!Why Infrastructure matters?!
Why Infrastructure matters?!
 
Haven 2 0
Haven 2 0 Haven 2 0
Haven 2 0
 
Get Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a ServiceGet Started Quickly with IBM's Hadoop as a Service
Get Started Quickly with IBM's Hadoop as a Service
 
The sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of ThingsThe sensor data challenge - Innovations (not only) for the Internet of Things
The sensor data challenge - Innovations (not only) for the Internet of Things
 
High Value Business Intelligence for IBM Platform compute environments
High Value Business Intelligence for IBM Platform compute environmentsHigh Value Business Intelligence for IBM Platform compute environments
High Value Business Intelligence for IBM Platform compute environments
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise
 
Digital Reinvention by NRB
Digital Reinvention by NRBDigital Reinvention by NRB
Digital Reinvention by NRB
 
Data & Analytics - Session 1 - Big Data Analytics
Data & Analytics - Session 1 -  Big Data AnalyticsData & Analytics - Session 1 -  Big Data Analytics
Data & Analytics - Session 1 - Big Data Analytics
 
Transformation Track AWS Cloud Experience Argentina - Why Enterprise Workload...
Transformation Track AWS Cloud Experience Argentina - Why Enterprise Workload...Transformation Track AWS Cloud Experience Argentina - Why Enterprise Workload...
Transformation Track AWS Cloud Experience Argentina - Why Enterprise Workload...
 
How to Increase Performance in IBM Cognos
How to Increase Performance in IBM CognosHow to Increase Performance in IBM Cognos
How to Increase Performance in IBM Cognos
 
IMS08 the momentum driving the ims future
IMS08   the momentum driving the ims futureIMS08   the momentum driving the ims future
IMS08 the momentum driving the ims future
 
SendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingSendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data Warehousing
 

Mais de Hortonworks

IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyHortonworks
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakHortonworks
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsHortonworks
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysHortonworks
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's NewHortonworks
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerHortonworks
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsHortonworks
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidHortonworks
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATAHortonworks
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Hortonworks
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseHortonworks
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationHortonworks
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementHortonworks
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHortonworks
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCHortonworks
 
4 Essential Steps for Managing Sensitive Data
4 Essential Steps for Managing Sensitive Data4 Essential Steps for Managing Sensitive Data
4 Essential Steps for Managing Sensitive DataHortonworks
 
5 Steps to Create a Company Culture that Embraces the Power of Data
5 Steps to Create a Company Culture that Embraces the Power of Data5 Steps to Create a Company Culture that Embraces the Power of Data
5 Steps to Create a Company Culture that Embraces the Power of DataHortonworks
 
Exploring the Heated-and Completely Unnecessary- Data Lake Debate
Exploring the Heated-and Completely Unnecessary- Data Lake DebateExploring the Heated-and Completely Unnecessary- Data Lake Debate
Exploring the Heated-and Completely Unnecessary- Data Lake DebateHortonworks
 
The Life of a Hadoop Administrator, with and without SmartSense
The Life of a Hadoop Administrator, with and without SmartSenseThe Life of a Hadoop Administrator, with and without SmartSense
The Life of a Hadoop Administrator, with and without SmartSenseHortonworks
 

Mais de Hortonworks (20)

IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with Cloudbreak
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log Events
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's New
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache Druid
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data Management
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
 
4 Essential Steps for Managing Sensitive Data
4 Essential Steps for Managing Sensitive Data4 Essential Steps for Managing Sensitive Data
4 Essential Steps for Managing Sensitive Data
 
5 Steps to Create a Company Culture that Embraces the Power of Data
5 Steps to Create a Company Culture that Embraces the Power of Data5 Steps to Create a Company Culture that Embraces the Power of Data
5 Steps to Create a Company Culture that Embraces the Power of Data
 
Exploring the Heated-and Completely Unnecessary- Data Lake Debate
Exploring the Heated-and Completely Unnecessary- Data Lake DebateExploring the Heated-and Completely Unnecessary- Data Lake Debate
Exploring the Heated-and Completely Unnecessary- Data Lake Debate
 
The Life of a Hadoop Administrator, with and without SmartSense
The Life of a Hadoop Administrator, with and without SmartSenseThe Life of a Hadoop Administrator, with and without SmartSense
The Life of a Hadoop Administrator, with and without SmartSense
 

Último

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 

Último (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

Benefits of Transferring Real-Time Data to Hadoop at Scale

  • 1. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Welcome to the Waitless World Benefits of Transferring Real-time Data to Hadoop at Scale
  • 2. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Guest Speakers Ali Bajwa Principal Partner Solutions Engineer, Hortonworks Steve Roberts Offering Manager, Power Systems Big Data & Analytics Solutions, IBM Dan Potter VP of Product Management & Marketing, Attunity
  • 3. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation • Connected customers, vehicles, devices • Socially crowd-sourced requirements • Digital design and analysis • Digital prototypes and tests (simulations) • Connected factories, sensors, devices • Human-robotic interaction • 3D-printing on demand • Connected trucks, inventory • Location, traffic, weather-aware distribution • Real-time inventory visibility • Dynamic rerouting • Connected customers, devices • Omni-channel demand sensing • Real-time Recommendations • Connected assets • Remote service monitoring & delivery • Predictive maintenance • OTA Updates Development Manufacturing Distribution Marketing/Sales Service The New Way of Business Is Fueled by Connected Data
  • 4. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Technology Trends: Shifting the Data Paradigm Artificial IntelligenceInternet of Things Cloud Computing Streaming Data Industrial Internet Connected business Consumer devices Smart devices Autonomy Prescriptive analytics SaaS/PaaS applications Ephemeral use cases Operational efficiency Collaboration Real-time applications Targeted retail Recommendations Industrial applications
  • 5. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Hortonworks Enabling the Modern Data Architecture • Our durable and reliable mission continues… • Make Hadoop an enterprise viable data platform • Bring all data under management—all sources and types • Enable pre and post transaction analysis Hortonworks consistent and continuous track record of innovation
  • 6. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Powering the Modern Data Architecture DATA AT RESTDATA IN MOTION ACTIONABLE INTELLIGENCE COMPLETE DATA LIFECYCLE MANAGEMENT RUN CONTAINERIZED APPLICATIONS CONCURRENTLY EDGECLOUD H O L I S T I C M A N A G E M E N T , G O V E R N A N C E A N D S E C U R I T Y ON-PREMISES MULTI-WORKLOADS MULTI-TYPE MULTI-TIER Data Science SQL Query Engine
  • 7. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Hortonworks Value: Platform Flexibility CloudSensors/Sources  Constrained  High-latency  Localized context  On-premise and cloud  Low-latency  Global context Data Centers
  • 8. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Hortonworks DataFlow and Analytics Reference Platform Applications Edge/Sensor/3rd Party Data Flow and Streaming Analytics and Data Science Field Data Capture Office, Datacenter or Cloud Industrial Protocols such as OPC Files / Other Unstructured Data Video IoT Gateways PLC / RTU SCADA, DCS, Historians Hortonworks Data Platform SQL Hortonworks DataFlow Data Flow Managemen t Message Queues Stream Processing In-stream Analytics NoSQL Machine Learning Resource Management Distributed File StorageStructured Data Sets Location 1 Time Series Storage Data Acquisitio n Event Processin g Location N Time Series Storage Data Acquisitio n Event Processin g
  • 9. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Complementing Attunity and IBM Ecosystem Applications Edge/Sensor/3rd Party Data Flow and Streaming Analytics and Data Science Field Data Capture Office, Datacenter or Cloud Industrial Protocols such as OPC Files / Other Unstructured Data Video IoT Gateways PLC / RTU SCADA, DCS, Historians Hortonworks Data Platform SQL Hortonworks DataFlow Data Flow Managemen t Message Queues IBM Stream Computing In-stream Analytics NoSQL Machine Learning Resource Management Distributed File StorageStructured Data Sets Location 1 Time Series Storage Data Acquisitio n Event Processin g Location N Time Series Storage Data Acquisitio n Event Processin g IBM Bluemix IBM Bluemix IBM Spectrum Scale IBM Watson IBM Watson IBM also resells HDP and HDF IBM Big SQL DATA INGESTION
  • 10. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Hortonworks DataPlane Service A common set of services that: ⬢ Supports enterprise deployment strategy and move to the cloud ⬢ Addresses compliance and regulatory requirements for enterprise ⬢ Eliminates policy silos and ensures security & governance moves with data ⬢ Simplifies data asset management and provides access for analysts and data scientists ⬢ Extensible to new services: Services enablement layer brings new offerings to market rapidly Next Chapter: Announcing Hortonworks DataPlane Service Enabling the Modern Data Architecture
  • 11. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Enterprise Data Science at Scale Enterprise- Grade Leverage enterprise- grade security, governance and operations Tools Enhance productivity by enabling data scientists to use their favorite tools, technologies and libraries Deployment Compress the time to insight by deploying models into production faster Data Build more robust models by using all the data in the data lake The Power of Data Science for your Enterprise
  • 12. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation DATA AT REST DATA-IN-MOTION HDP® HORTONWORKS DATA PLATFORM Powered by Apache Hadoop® HDF™ HORTONWORKS DATAFLOW Powered by Apache™ NiFi DATA-AT-REST Powering Modern Data Applications IBM Analytics  Hortonworks Resell IBM DSX IBM BigSQL IBM Analytics  Re-sell BigInsights’ existing customers migrated to HDP IBM resells HDP & HDF IBM Systems  Co-Sell • IBM Power Systems (Compute) • IBM Spectrum Scale (Storage) + Bringing it all Together DATA INGESTION
  • 13. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Centralized Mainframes Cognitive Era E-Business Distributed Computing Smarter Planet Office Productivity Client/ Server Personal Computer Data Warehousing Big Data & Predictive Analytics Cognitive A new era of computing has emerged Data InsightContext Transactional Database Business Intelligence Big Data & Analytics Actionable Insight in context Reporting Cloud
  • 14. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Accelerated compute and storage delivered on prem, in the cloud or via Watson Power Systems is now part of Cognitive Systems REINVENTING COMPUTING FOR DATA-INTENSIVE AND COGNITIVE WORKLOADS
  • 15. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Open to the core for true differentiation in performance & cost 315+ OpenPOWER members across 31 countries Ecosystem-driven Customer Choice Growing ecosystem of OpenPOWER Servers Growing ecosystem of OpenPOWER Innovation
  • 16. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Power Systems S822LC for Big Data - Not Just Another Intel Server Linux by Redhat: Redhat 7.2 Linux OS Mellanox: InfiniBand/Ethernet Connectivity in and out of server HGST: Optional NVMe Adapters Alpha Data with Xilinx FPGA: Optional CAPI Accelerator Broadcom: Optional PCIe Adapters QLogic: Optional Fiber Channel PCIe Samsung: SSDs & NVMe Hynix, Samsung, Micron: DDR4 NVIDIA: Tesla K80 GPU Accelerator IBM: POWER8 CPU 16
  • 17. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation 17 Available until Dec 31, 2017!
  • 18. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation TCO at Scale with HDP on Power Systems with Elastic Storage Server 18 • Up to 3X reduction of storage and compute infrastructure moving to Power Systems and Elastic Storage Server vs commodity scale out x86 • More flexible and scalable vs EMC Isilon using IBM Spectrum Scale • Position for future growth, avoid hitting the data center wall with cluster sprawl E E InfiniBand (RDMA) / 40 GigE / 10 GigE Scale Compute Nodes • IBM Power Systems • Only Hadoop services and HDFS client ESS HDP HDP HDP HDP HDP ESS Elastic Storage Server (Powered by Spectrum Scale and Power Systems) C C C C CC C Spectrum Scale Clientv HDP Hortonworks Data Platform Scale Storage as Required
  • 19. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Artificial Intelligence and Cognitive Applications Machine Learning Deep Learning (Neural Networks) The deeper you go, the more value you gain, and the more you know
  • 20. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation simple machine learning deep learning
  • 22. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation enterprise-ready software distribution built on open source tools for ease of development performance faster training times for data scientists +
  • 23. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation 9Days Acceleration training …. days become hours 4Hours Recognition Shape Attenuation Boundary Recognition Shape Attenuation Boundary 54x Learning runs with Power 8 4Hours 4Hours 4Hours 4Hours . . . . . . . . . . . . . . . 4Hours What will you do? Iterate more and create more accurate models? Create more models? Both? IBM S822LC for HPC
  • 25. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Data Integration for Modern Analytics
  • 26. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Modern Data Ingest METADATA HIVE OPTIMIZED STREAM OPTIMIZED CHANGE DATA CAPTURE CLOUD ON PREM WAREHOUSE MAINFRAME RDBMS SAP CDC (log-based) for high performance, low latency and low impact Single platform for all key enterprise systems Hive-optimized for HDP and Stream-optimized for HDF Point-and-Click with NO coding and NO agents
  • 27. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation In Memory and File Optimized Data Transport Real-Time Data Integration Streaming Change Data Capture (CDC) – Apply transactions sequentially – Stream batched changes – Integrate with DW native loaders to ingest and merge – Stream changes to Kafka message brokers R1 R1 R2 R1 R2 R1 R2Batch CDC Data Warehouse Ingest-Merge SQL n 2 1 SQL SQL Transactional CDC Message Encoded CDC Flexible Real-Time Options
  • 28. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Simplify Data Integration Zero Footprint Architecture – CDC identifies source updates by scanning change logs – No software agents required on sources or targets – Minimal administrative tasks • Log based CDC • Source specific optimization Hadoop File s RDBMS EDW Mainframe Hadoop Files RDBMS EDW Kafka Streamlined Process
  • 29. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Simplify Data Integration Go Agile with Automation – No manual coding – Automated end-to-end – Optimized and configurable • Target schema creation • Heterogeneous data type mapping • Batch to CDC transition • DDL change propagation • Filtering • Transformations Hadoop File s RDBMS Mainframe Hadoop Files RDBMS Kafka EDW EDW
  • 30. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation – Intuitive web-based GUI – Drag and drop, wizard-assisted configuration steps – Consistent process for all sources and targets Simplify Data Integration Guided User Experience
  • 31. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation zzzz zz RDBMS Oracle SQL Server DB2 iSeries DB2 z/OS DB2 LUW MySQL PostgeSQL Sybase ASE Informix DW Exadata Teradata Netezza Vertica Hortonworks Cloudera MapR HADOOP DB2 for z/OS IMS/DB VSAM SQL/MP Enscribe RMS MAINFRAME AWS RDS Salesforce Snowflake CLOUD RDBMS Oracle SQL Server DB2 LUW MySQL PostgreSQL Sybase ASE Informix DW Microsoft PDW Exadata Teradata Netezza Vertica Sybase IQ Amazon Redshift Actian Vector SAP HANA Hortonworks Cloudera MapR Pivotal Amazon EMR HADOOP MongoDB NOSQL Amazon RDS Amazon Redshift Amazon EMR Google Cloud SQL Google Cloud Dataproc Azure SQL DW Azure SQL DB CLOUD Azure Event Hubs Kafka MapR STREAMING TARGETS SOURCES SAP ECC on Oracle ECC on SQL ECC on DB2 SAP HANA 12 Universal Data Integration
  • 32. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Feeding the Data Lake with Attunity Replicate Results 4500 applications DB2 MF SQL Oracle • Consolidating massive data volumes for global analytics • Hadoop Data Lake with Kafka • Minimizing labor and cost • Realizing faster insights and competitive advantage Fortune 100 auto maker
  • 33. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation The results are impressive! 3x Faster! + + 3 x faster than alternative solutions
  • 34. Q&A
  • 35. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation REFERENCE CHARTS 35
  • 36. © 2015 IBM Corporation Welcome to the Waitless World © 2017 IBM Corporation Hortonworks HDP 3X POWER8 Price-Performance Guarantee 36 IBM Power Systems guarantees the Power S822LC for Big Data system built with POWER8 delivers at least a 3X price-performance advantage vs. x86 based results when running a customer application/workload with Tez/Hive LLAP on Hortonworks HDP under the conditions noted below. A Worker Node is a server carrying out the HDP query functions, with one Worker Node per server. 3X price-performance means that the customer's documented throughput performance on the cluster of S822LC for Big Data Worker Nodes divided by the price of the cluster of Worker Nodes will be at least 3 times higher than the customer's documented throughput performance on the cluster of x86 based Worker Nodes divided by the price of the cluster of x86 Worker Nodes. EX: If queries per second on the cluster of S822LC Worker Nodes are 30,000 and 10,000 on the cluster of x86 based Worker Nodes, while the price of the S822LC Worker Node cluster is $10,000, and the price of the x86 based Worker Node cluster is $10,000, then the Throughput Performance Per Price would be exactly 3 times higher and the guarantee would be met." Notes: 1. Client’s Power S822LC for BD Worker Nodes and the x86 Worker Nodes must be running at similar utilization rates of at least 50% or higher, using the same software stack as described in Note #4, and which are configured similarly. 2. Client’s Power S822LC for BD performance cannot be constrained by I/O subsystem. Specifically, the I/O subsystem on the Power S822LC for BD Worker Node must achieve greater than or equal I/O bandwidth and operations per second than the x86 Worker Node. 3. Client’s Power S822LC for BD Worker Node’s physical memory must be the same or greater than the physical memory on the x86 Worker Node. 4. Applicable software stack is Tez/Hive LLAP on HDP 2.6 or later for both the Power S822LC and x86-based Worker Nodes. 5. Client is responsible for demonstrating comparable real-world representative workload between the Power S822LC for BD Worker Node and the x86 Worker Node through the use of the IBM provided tools and comparable tools on x86 systems. 6. 3X guarantee is based on a list price for x86 servers from Dell, Cisco, HP or Lenovo based on E5-2600 v4 or earlier processor technology and the IBM S822LC for Big Data. The IBM Power S822LC for Big Data servers (22-core/2.89 GHz) used as Worker Nodes must be purchased from IBM or an authorized IBM Business Partner prior to September 30, 2017. The guarantee period is valid for three (3) months from the date of purchase. The x86-based Worker Nodes must be comparably configured branded servers from Cisco, Dell, HP, or Lenovo and the client is responsible for all Hortonworks licenses. 3X throughput performance per price means that the customer's documented throughput performance on the cluster of Power S822LC for BD Worker Nodes based on either queries, operations or transactions per second divided by the price of the cluster of Worker Nodes will be at least 3 times higher than the customer's same documented throughput performance on the cluster of x86 Worker Nodes divided by the price of said cluster of x86 Worker Nodes. Remediation: IBM will provide additional performance optimization and tuning services consistent with IBM Best Practices, at no charge. If unable to reach the guaranteed level of price-performance, IBM will provide additional equally configured Worker Nodes to those already purchased to reach the guaranteed level of price- performance. Only Available until Dec 31, 2017!