SlideShare uma empresa Scribd logo
1 de 40
Baixar para ler offline
1 © Hortonworks Inc. 2011–2018. All rights reserved.
© Hortonworks, Inc. 2011-2018. All rights reserved. | Hortonworks confidential and proprietary information.
Hortonworks Data Flow 3.1
Timothy Spann, Solutions Engineer
Hortonworks @PaaSDev
2 © Hortonworks Inc. 2011–2018. All rights reserved.
Disclaimer
• This document may contain product features and technology directions that are under
development, may be under development in the future or may ultimately not be
developed.
• Technical feasibility, market demand, user feedback, and the Apache Software
Foundation community development process can all effect timing and final delivery.
• This document’s description of these features and technology directions does not
represent a contractual commitment, promise or obligation from Hortonworks to deliver
these features in any generally available product.
• Product features and technology directions are subject to change, and must not be
included in contracts, purchase orders, or sales agreements of any kind.
• Since this document contains an outline of general product development plans,
customers should not rely upon it when making a purchase decision.
3 © Hortonworks Inc. 2011–2018. All rights reserved.
MULTIPLE CLUSTERS AND SOURCES
MULTIHYBRID
DATAPLANE SERVICE (DPS)
MANAGE, GOVERN, SECURE
DATA
LIFECYCLE
MANAGER
DATA STEWARD
STUDIO*
ISV
SERVICES
*not yet available, coming soon
EXTENSIBLE SERVICES
IBM DSX*CLOUD-
BREAK*
DATA
ANALYTICS
STUDIO*
CONNECTED DATA PLATFORMS
HORTONWORKS
DATA PLATFORM (HDP®)
DATA-AT-REST
HORTONWORKS
DATAFLOW (HDF™)
DATA-IN-MOTION
MODERN DATA USE CASES
EDW
OPTIMIZATION
CYBER SECURITY DATA SCIENCE
ADVANCED
ANALYTICS
PARTNER
SOLUTIONS
IOT/ STREAMING
ANALYTICS
HORTONWORKS
CONNECTION
ENTERPRISE SUPPORT
PREMIER SUPPORT
EDUCATIONAL SERVICES
PROFESSIONAL SERVICES
COMMUNITY CONNECTION
HORTONWORKS
PLATFORM SERVICES
OPERATIONAL SERVICES
SMARTSENSE™
Global Data Management With Hortonworks
4 © Hortonworks Inc. 2011–2018. All rights reserved.
HDF Data-In-Motion Platform – with HDF 3.1 GA Release
5 © Hortonworks Inc. 2011–2018. All rights reserved.
HDF 3.1 New and Enhanced Features
Ease of Use
Core
Enhancements
Cross-Product
Integration
Flow
Management
Stream
Processing
• NiFi-Atlas, -SmartSense, and
-Knox integration
(HDF on HDP scenario only)
• NiFi-Ranger: Group based
policy support for NiFi
resources
• New SAM operations
module
• SAM ”Test Mode”
• Kafka 1.0 Support
• Schema Registry
• Schema Version
Lifecycle Mgmt.
• SAM extensibility
improvements
• Ambari and Ranger
support for Kafka 1.0
• Improved Ambari
experience: Automate
adding NiFi nodes to
existing cluster
• Apache NiFi Registry (new)
• Flow migration and
version control
• MiNiFi C++, Java, Android/IOS
libraries GA
• Containerized deployment
(Docker)
6 © Hortonworks Inc. 2011–2018. All rights reserved.
Improved Operational Efficiency
MiNiFi C++ Agent
C++ Agent
C++ Agent
C++ Agent
There are many configuration options for MiNiFi
C++, all dependent on the use case, they may
help with:
• Minimizing memory footprint
• Lowering CPU consumption
• Reducing size on disk
https://community.hortonworks.com/articles/167193/building-and-
running-minifi-cpp-in-orangepi-zero.html
7 © Hortonworks Inc. 2011–2018. All rights reserved.
Integrated Provisioning and Security
Kafka 1.0 Support
To enhance data governance and lineage, users can
now manage access control policies using resource or
tag-based security in Ranger for Kafka 1.0 clusters.
Users can now install, configure, manage, upgrade,
monitor, and secure Kafka 1.0 clusters with Ambari.
New processors in NiFi and Streaming Analytics
Manager support Kafka 1.0 features including message
headers and transactions.
8 © Hortonworks Inc. 2011–2018. All rights reserved.
When HDF is co-located with HDP…
Integrations with Atlas, Knox and SmartSense
SmartSense
9 © Hortonworks Inc. 2011–2018. All rights reserved.
220+ Processors for Deeper Ecosystem Integration
Hash
Extract
Merge
Duplicate
Scan
GeoEnrich
Replace
ConvertSplit
Translate
Route Content
Route Context
Route Text
Control Rate
Distribute Load
Generate Table Fetch
Jolt Transform JSON
Prioritized Delivery
Encrypt
Tail
Evaluate
Execute
All Apache project logos are trademarks of the ASF and the respective projects.
Fetch
HTTP
Syslog
Email
HTML
Image
HL7
FTP
UDP
XML
SFTP
AMQP
WebSocket
10 © Hortonworks Inc. 2011–2018. All rights reserved.
HDF 3.1 for Big Data Engineers
Multiple users, frameworks, languages, data sources & clusters
BIG DATA ENGINEER
• Experience in ETL
• Coding skills in Scala,
Python, Java
• Experience with Apache
Hadoop
• Knowledge of tools such
Hive, Flume or Pig
• Knowledge of SQL
• Expert in ETL (Eating, Ties
and Laziness)
• Social Media Maven
• Deep SME in Buzzwords
• No Coding skills
• Interest in Pig and Falcon
CAT AI
• Will Drive your Car
• Will Fix Your Code
• Will Not Be Discussed
Today
• Will Not Finish This Talk For
Me, This Time
11 © Hortonworks Inc. 2011–2018. All rights reserved.
Aggregate all data from sensors, drones, logs, geo-location devices,
machines and social feeds
Collect: Bring Together
Mediate point-to-point and bi-directional data flows, delivering data
reliably to Apache HBase, Apache Hive, HDFS, Slack and Email.
Conduct: Mediate the Data Flow
Parse, filter, join, transform, fork, query, sort, dissect, enrich with weather,
location, Apache OpenNLP and Apache MXNet.
Curate: Gain Insights
12 © Hortonworks Inc. 2011–2018. All rights reserved.
NiFi (PROD)
MiNiFi
MiNiFi
MiNiFi
MiNiFi
MiNiFi
MiNiFi
MiNiFi
Flow Registry API
Persistence Other services Other services
NiFi (QA)
NiFi (Dev)
Register DeployDeploy
DataFlow Registry
• NiFi Flow Registry
• Standalone application/service
(URL)
• Standard API with pluggable
components
• Design and deploy mechanism for
flow migration (SDLC) use cases
13 © Hortonworks Inc. 2011–2018. All rights reserved.
Kafka
Powerful Pattern with Kafka Headers: Pass Schema Key in Kafka Header
Truck Geo
Sensor
Truck Speed
Sensor
Kafka Topic
(raw-all_truck_events_csv)
Centralized Schema
Repository
Publish CSV Events with
Schema metadata from SR
stored in Kafka Header
Data Movement and
Processing by NiFi using
new Record-Based
processing
Kafka Event with Header Published by the Sensor Producing App
Kafka Header Kafka Payload
header with key schema.name
that has metadata info to lookup
the schema in HWX SR
CSV Binary Event
14 © Hortonworks Inc. 2011–2018. All rights reserved.
Nifi and Kafka 1.0 – Use Case for Kafka Message Headers
Kafka
15 © Hortonworks Inc. 2011–2018. All rights reserved.
Grafana & Kafka 1.0 Integration: Monitoring
Topic
Level
KPIs
Broker
Level
KPIs
Kafka
16 © Hortonworks Inc. 2011–2018. All rights reserved.
Apache Spark Integration
17 © Hortonworks Inc. 2011–2018. All rights reserved.
Apache Spark Integration
18 © Hortonworks Inc. 2011–2018. All rights reserved.
New: Integrated Registry Service
• Integrated Flow Registry Service
• Sharable between NiFi
environments for Dev/UAT/Prod
promotion
• API or GUI driven
• Can be integrated with Enterprise
Version Control e.g. GitLab
• ‘Buckets’ of Flows for security and
access control
SDLC
19 © Hortonworks Inc. 2011–2018. All rights reserved.
New: Integrated Variable Registry Service
• Integrated Variable Registry
• Sets of key:value pairs available on
every Process Group
• Referenced with NiFi Expression
Language
• Dynamically changeable at runtime
• Use within Versioned Flows to set
Environment Variables
• GUI or API driven
SDLC
20 © Hortonworks Inc. 2011–2018. All rights reserved.
• Wrap atomic functions in harnesses
for regression testing
• Integrate via the Rest-API to
automate testing through Jenkins
etc.
• Automate triggering tests when
new Versions are pushed to the
Flow Registry
SDLC
Regression test with Golden Datasets
21 © Hortonworks Inc. 2011–2018. All rights reserved.
• Nest Versioned Process Groups to
test composite functions
• Wrap in test harnesses to validate
functionality
• Flow Versioning provides visibility
as components of Composites are
updated
SDLC
Build & Test Composite DataFlows
22 © Hortonworks Inc. 2011–2018. All rights reserved.
New: Design & Deploy complementing Command & Control
• SDLC Dev: Place Process Groups
under Version Control
• Make changes and commit to new
version
• Roll Versions back or forward
SDLC
23 © Hortonworks Inc. 2011–2018. All rights reserved.
• Get Notifications of local changes
or new versions available in
Repository
• Revert or Commit local changes via
the GUI or Rest-API
• Use Rest-API to integrate with
Jenkins, etc.
SDLC
New: Design & Deploy complementing Command & Control
24 © Hortonworks Inc. 2011–2018. All rights reserved.
Administration
25 © Hortonworks Inc. 2011–2018. All rights reserved.
Administration
26 © Hortonworks Inc. 2011–2018. All rights reserved.
Schema Registry
27 © Hortonworks Inc. 2011–2018. All rights reserved.
Schema Registry
28 © Hortonworks Inc. 2011–2018. All rights reserved.
Schema Registry
29 © Hortonworks Inc. 2011–2018. All rights reserved.
Lifecycle Action 1 - Action: Fork Schema Version to Branch called Dev
Schema Registry
30 © Hortonworks Inc. 2011–2018. All rights reserved.
More Data Set Coverage
AtlasNiFiFlowLineage
(ReportingTask)
NiFi Flow
NiFi Data Provenance
Kafka topic
1. static flow lineage from NiFi flow def
2. Add DataSet entities from NiFi Data
Provenance events
Atlas Integration
31 © Hortonworks Inc. 2011–2018. All rights reserved.
sensor-data
tweets
default.sensor_data
path1
path0
path2
Atlas Integration
32 © Hortonworks Inc. 2011–2018. All rights reserved.
Registry
33 © Hortonworks Inc. 2011–2018. All rights reserved.
Registry
34 © Hortonworks Inc. 2011–2018. All rights reserved.
Registry
35 © Hortonworks Inc. 2011–2018. All rights reserved.
Registry
36 © Hortonworks Inc. 2011–2018. All rights reserved.
Questions?
37 © Hortonworks Inc. 2011–2018. All rights reserved.
https://community.hortonworks.com/articles/161761/new-features-in-apache-nifi-
15-apache-nifi-registr.html
https://community.hortonworks.com/articles/171787/hdf-31-executing-apache-
spark-via-executesparkinte.html
https://community.hortonworks.com/articles/171960/using-apache-mxnet-on-an-
apache-nifi-15-instance-w.html
https://community.hortonworks.com/articles/171893/hdf-31-executing-apache-
spark-via-executesparkinte-1.html
Resources
38 © Hortonworks Inc. 2011–2018. All rights reserved.
Contact
https://github.com/tspannhw/ApacheBigData101/tree/master
https://community.hortonworks.com/users/9304/tspann.html
https://dzone.com/users/297029/bunkertor.html
https://www.meetup.com/futureofdata-princeton/
https://twitter.com/PaaSDev
https://community.hortonworks.com/articles/155435/using-the-new-mxnet-model-server.html
39 © Hortonworks Inc. 2011–2018. All rights reserved.
Hortonworks Community Connection
Read access for everyone, join to participate and be recognized
• Full Q&A Platform (like StackOverflow)
• Knowledge Base Articles
• Code Samples and Repositories
40 © Hortonworks Inc. 2011–2018. All rights reserved.
Community Engagement
Participate now at: community.hortonworks.com© Hortonworks Inc. 2011 – 2015. All Rights Reserved
4,000+
Registered Users
10,000+
Answers
15,000+
Technical Assets
One Website!

Mais conteúdo relacionado

Mais procurados

Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterpriseUsing Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
DataWorks Summit
 
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017 Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Hortonworks
 
The Implacable advance of the data
The Implacable advance of the dataThe Implacable advance of the data
The Implacable advance of the data
DataWorks Summit
 

Mais procurados (20)

Overcoming the AI hype — and what enterprises should really focus on
Overcoming the AI hype — and what enterprises should really focus onOvercoming the AI hype — and what enterprises should really focus on
Overcoming the AI hype — and what enterprises should really focus on
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
 
Introduction to Streaming Analytics Manager
Introduction to Streaming Analytics ManagerIntroduction to Streaming Analytics Manager
Introduction to Streaming Analytics Manager
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
 
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterpriseUsing Spark Streaming and NiFi for the next generation of ETL in the enterprise
Using Spark Streaming and NiFi for the next generation of ETL in the enterprise
 
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017 Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
Enterprise Data Science at Scale Meetup - IBM and Hortonworks - Oct 2017
 
Real-time Twitter Sentiment Analysis and Image Recognition with Apache NiFi
Real-time Twitter Sentiment Analysis and Image Recognition with Apache NiFiReal-time Twitter Sentiment Analysis and Image Recognition with Apache NiFi
Real-time Twitter Sentiment Analysis and Image Recognition with Apache NiFi
 
Pivotal - Advanced Analytics for Telecommunications
Pivotal - Advanced Analytics for Telecommunications Pivotal - Advanced Analytics for Telecommunications
Pivotal - Advanced Analytics for Telecommunications
 
Spark and Hadoop Perfect Togeher by Arun Murthy
Spark and Hadoop Perfect Togeher by Arun MurthySpark and Hadoop Perfect Togeher by Arun Murthy
Spark and Hadoop Perfect Togeher by Arun Murthy
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at Scale
 
IoT Crash Course Hadoop Summit SJ
IoT Crash Course Hadoop Summit SJIoT Crash Course Hadoop Summit SJ
IoT Crash Course Hadoop Summit SJ
 
Apache Hadoop Crash Course - HS16SJ
Apache Hadoop Crash Course - HS16SJApache Hadoop Crash Course - HS16SJ
Apache Hadoop Crash Course - HS16SJ
 
Hadoop Summit Tokyo HDP Sandbox Workshop
Hadoop Summit Tokyo HDP Sandbox Workshop Hadoop Summit Tokyo HDP Sandbox Workshop
Hadoop Summit Tokyo HDP Sandbox Workshop
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
 
Data Science with Apache Spark - Crash Course - HS16SJ
Data Science with Apache Spark - Crash Course - HS16SJData Science with Apache Spark - Crash Course - HS16SJ
Data Science with Apache Spark - Crash Course - HS16SJ
 
The Implacable advance of the data
The Implacable advance of the dataThe Implacable advance of the data
The Implacable advance of the data
 
Social Media Monitoring with NiFi, Druid and Superset
Social Media Monitoring with NiFi, Druid and SupersetSocial Media Monitoring with NiFi, Druid and Superset
Social Media Monitoring with NiFi, Druid and Superset
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_final
 

Semelhante a HDF 3.1 : An Introduction to New Features

Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
DataWorks Summit
 
IoT with Apache MXNet and Apache NiFi and MiniFi
IoT with Apache MXNet and Apache NiFi and MiniFiIoT with Apache MXNet and Apache NiFi and MiniFi
IoT with Apache MXNet and Apache NiFi and MiniFi
DataWorks Summit
 

Semelhante a HDF 3.1 : An Introduction to New Features (20)

Data Con LA 2018 - Streaming and IoT by Pat Alwell
Data Con LA 2018 - Streaming and IoT by Pat AlwellData Con LA 2018 - Streaming and IoT by Pat Alwell
Data Con LA 2018 - Streaming and IoT by Pat Alwell
 
State of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & CommunityState of the Apache NiFi Ecosystem & Community
State of the Apache NiFi Ecosystem & Community
 
Curing the Kafka blindness—Streams Messaging Manager
Curing the Kafka blindness—Streams Messaging ManagerCuring the Kafka blindness—Streams Messaging Manager
Curing the Kafka blindness—Streams Messaging Manager
 
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
 
SoCal BigData Day
SoCal BigData DaySoCal BigData Day
SoCal BigData Day
 
Apache Deep Learning 101 - DWS Berlin 2018
Apache Deep Learning 101 - DWS Berlin 2018Apache Deep Learning 101 - DWS Berlin 2018
Apache Deep Learning 101 - DWS Berlin 2018
 
Running Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration OptionsRunning Apache NiFi with Apache Spark : Integration Options
Running Apache NiFi with Apache Spark : Integration Options
 
Deep learning on HDP 2018 Prague
Deep learning on HDP 2018 PragueDeep learning on HDP 2018 Prague
Deep learning on HDP 2018 Prague
 
Future of Data New Jersey - HDF 3.0 Deep Dive
Future of Data New Jersey - HDF 3.0 Deep DiveFuture of Data New Jersey - HDF 3.0 Deep Dive
Future of Data New Jersey - HDF 3.0 Deep Dive
 
Enterprise IIoT Edge Processing with Apache NiFi
Enterprise IIoT Edge Processing with Apache NiFiEnterprise IIoT Edge Processing with Apache NiFi
Enterprise IIoT Edge Processing with Apache NiFi
 
IoT Edge Processing with Apache NiFi and MiniFi and Apache MXNet for IoT NY 2018
IoT Edge Processing with Apache NiFi and MiniFi and Apache MXNet for IoT NY 2018IoT Edge Processing with Apache NiFi and MiniFi and Apache MXNet for IoT NY 2018
IoT Edge Processing with Apache NiFi and MiniFi and Apache MXNet for IoT NY 2018
 
Apache deep learning 101
Apache deep learning 101Apache deep learning 101
Apache deep learning 101
 
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the EnterpriseUsing Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
Using Spark Streaming and NiFi for the Next Generation of ETL in the Enterprise
 
IoT with Apache MXNet and Apache NiFi and MiniFi
IoT with Apache MXNet and Apache NiFi and MiniFiIoT with Apache MXNet and Apache NiFi and MiniFi
IoT with Apache MXNet and Apache NiFi and MiniFi
 
Apache MXNet for IoT with Apache NiFi
Apache MXNet for IoT with Apache NiFiApache MXNet for IoT with Apache NiFi
Apache MXNet for IoT with Apache NiFi
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive
 
Apache Atlas: Why Big Data Management Requires Hierarchical Taxonomies
Apache Atlas: Why Big Data Management Requires Hierarchical Taxonomies Apache Atlas: Why Big Data Management Requires Hierarchical Taxonomies
Apache Atlas: Why Big Data Management Requires Hierarchical Taxonomies
 
Is your Enterprise Data lake Metadata Driven AND Secure?
Is your Enterprise Data lake Metadata Driven AND Secure?Is your Enterprise Data lake Metadata Driven AND Secure?
Is your Enterprise Data lake Metadata Driven AND Secure?
 
Classification based security in Hadoop
Classification based security in HadoopClassification based security in Hadoop
Classification based security in Hadoop
 
Containers and Big Data
Containers and Big DataContainers and Big Data
Containers and Big Data
 

Mais de Timothy Spann

Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Timothy Spann
 
28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-Pipelines28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-Pipelines
Timothy Spann
 
TCFPro24 Building Real-Time Generative AI Pipelines
TCFPro24 Building Real-Time Generative AI PipelinesTCFPro24 Building Real-Time Generative AI Pipelines
TCFPro24 Building Real-Time Generative AI Pipelines
Timothy Spann
 
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
Timothy Spann
 
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
Timothy Spann
 
DBA Fundamentals Group: Continuous SQL with Kafka and Flink
DBA Fundamentals Group: Continuous SQL with Kafka and FlinkDBA Fundamentals Group: Continuous SQL with Kafka and Flink
DBA Fundamentals Group: Continuous SQL with Kafka and Flink
Timothy Spann
 
OSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
OSACon 2023_ Unlocking Financial Data with Real-Time PipelinesOSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
OSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
Timothy Spann
 
JConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and FlinkJConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and Flink
Timothy Spann
 

Mais de Timothy Spann (20)

DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
2024 XTREMEJ_ Building Real-time Pipelines with FLaNK_ A Case Study with Tra...
2024 XTREMEJ_  Building Real-time Pipelines with FLaNK_ A Case Study with Tra...2024 XTREMEJ_  Building Real-time Pipelines with FLaNK_ A Case Study with Tra...
2024 XTREMEJ_ Building Real-time Pipelines with FLaNK_ A Case Study with Tra...
 
28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-Pipelines28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-Pipelines
 
TCFPro24 Building Real-Time Generative AI Pipelines
TCFPro24 Building Real-Time Generative AI PipelinesTCFPro24 Building Real-Time Generative AI Pipelines
TCFPro24 Building Real-Time Generative AI Pipelines
 
2024 Build Generative AI for Non-Profits
2024 Build Generative AI for Non-Profits2024 Build Generative AI for Non-Profits
2024 Build Generative AI for Non-Profits
 
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
2024 February 28 - NYC - Meetup Unlocking Financial Data with Real-Time Pipel...
 
Conf42-Python-Building Apache NiFi 2.0 Python Processors
Conf42-Python-Building Apache NiFi 2.0 Python ProcessorsConf42-Python-Building Apache NiFi 2.0 Python Processors
Conf42-Python-Building Apache NiFi 2.0 Python Processors
 
Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...
Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...
Conf42Python -Using Apache NiFi, Apache Kafka, RisingWave, and Apache Iceberg...
 
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
 
DBA Fundamentals Group: Continuous SQL with Kafka and Flink
DBA Fundamentals Group: Continuous SQL with Kafka and FlinkDBA Fundamentals Group: Continuous SQL with Kafka and Flink
DBA Fundamentals Group: Continuous SQL with Kafka and Flink
 
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...
NY Open Source Data Meetup Feb 8 2024 Building Real-time Pipelines with FLaNK...
 
OSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
OSACon 2023_ Unlocking Financial Data with Real-Time PipelinesOSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
OSACon 2023_ Unlocking Financial Data with Real-Time Pipelines
 
Building Real-Time Travel Alerts
Building Real-Time Travel AlertsBuilding Real-Time Travel Alerts
Building Real-Time Travel Alerts
 
JConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and FlinkJConWorld_ Continuous SQL with Kafka and Flink
JConWorld_ Continuous SQL with Kafka and Flink
 
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
[EN]DSS23_tspann_Integrating LLM with Streaming Data Pipelines
 
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines DemoEvolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
Evolve 2023 NYC - Integrating AI Into Realtime Data Pipelines Demo
 

Último

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Último (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 

HDF 3.1 : An Introduction to New Features

  • 1. 1 © Hortonworks Inc. 2011–2018. All rights reserved. © Hortonworks, Inc. 2011-2018. All rights reserved. | Hortonworks confidential and proprietary information. Hortonworks Data Flow 3.1 Timothy Spann, Solutions Engineer Hortonworks @PaaSDev
  • 2. 2 © Hortonworks Inc. 2011–2018. All rights reserved. Disclaimer • This document may contain product features and technology directions that are under development, may be under development in the future or may ultimately not be developed. • Technical feasibility, market demand, user feedback, and the Apache Software Foundation community development process can all effect timing and final delivery. • This document’s description of these features and technology directions does not represent a contractual commitment, promise or obligation from Hortonworks to deliver these features in any generally available product. • Product features and technology directions are subject to change, and must not be included in contracts, purchase orders, or sales agreements of any kind. • Since this document contains an outline of general product development plans, customers should not rely upon it when making a purchase decision.
  • 3. 3 © Hortonworks Inc. 2011–2018. All rights reserved. MULTIPLE CLUSTERS AND SOURCES MULTIHYBRID DATAPLANE SERVICE (DPS) MANAGE, GOVERN, SECURE DATA LIFECYCLE MANAGER DATA STEWARD STUDIO* ISV SERVICES *not yet available, coming soon EXTENSIBLE SERVICES IBM DSX*CLOUD- BREAK* DATA ANALYTICS STUDIO* CONNECTED DATA PLATFORMS HORTONWORKS DATA PLATFORM (HDP®) DATA-AT-REST HORTONWORKS DATAFLOW (HDF™) DATA-IN-MOTION MODERN DATA USE CASES EDW OPTIMIZATION CYBER SECURITY DATA SCIENCE ADVANCED ANALYTICS PARTNER SOLUTIONS IOT/ STREAMING ANALYTICS HORTONWORKS CONNECTION ENTERPRISE SUPPORT PREMIER SUPPORT EDUCATIONAL SERVICES PROFESSIONAL SERVICES COMMUNITY CONNECTION HORTONWORKS PLATFORM SERVICES OPERATIONAL SERVICES SMARTSENSE™ Global Data Management With Hortonworks
  • 4. 4 © Hortonworks Inc. 2011–2018. All rights reserved. HDF Data-In-Motion Platform – with HDF 3.1 GA Release
  • 5. 5 © Hortonworks Inc. 2011–2018. All rights reserved. HDF 3.1 New and Enhanced Features Ease of Use Core Enhancements Cross-Product Integration Flow Management Stream Processing • NiFi-Atlas, -SmartSense, and -Knox integration (HDF on HDP scenario only) • NiFi-Ranger: Group based policy support for NiFi resources • New SAM operations module • SAM ”Test Mode” • Kafka 1.0 Support • Schema Registry • Schema Version Lifecycle Mgmt. • SAM extensibility improvements • Ambari and Ranger support for Kafka 1.0 • Improved Ambari experience: Automate adding NiFi nodes to existing cluster • Apache NiFi Registry (new) • Flow migration and version control • MiNiFi C++, Java, Android/IOS libraries GA • Containerized deployment (Docker)
  • 6. 6 © Hortonworks Inc. 2011–2018. All rights reserved. Improved Operational Efficiency MiNiFi C++ Agent C++ Agent C++ Agent C++ Agent There are many configuration options for MiNiFi C++, all dependent on the use case, they may help with: • Minimizing memory footprint • Lowering CPU consumption • Reducing size on disk https://community.hortonworks.com/articles/167193/building-and- running-minifi-cpp-in-orangepi-zero.html
  • 7. 7 © Hortonworks Inc. 2011–2018. All rights reserved. Integrated Provisioning and Security Kafka 1.0 Support To enhance data governance and lineage, users can now manage access control policies using resource or tag-based security in Ranger for Kafka 1.0 clusters. Users can now install, configure, manage, upgrade, monitor, and secure Kafka 1.0 clusters with Ambari. New processors in NiFi and Streaming Analytics Manager support Kafka 1.0 features including message headers and transactions.
  • 8. 8 © Hortonworks Inc. 2011–2018. All rights reserved. When HDF is co-located with HDP… Integrations with Atlas, Knox and SmartSense SmartSense
  • 9. 9 © Hortonworks Inc. 2011–2018. All rights reserved. 220+ Processors for Deeper Ecosystem Integration Hash Extract Merge Duplicate Scan GeoEnrich Replace ConvertSplit Translate Route Content Route Context Route Text Control Rate Distribute Load Generate Table Fetch Jolt Transform JSON Prioritized Delivery Encrypt Tail Evaluate Execute All Apache project logos are trademarks of the ASF and the respective projects. Fetch HTTP Syslog Email HTML Image HL7 FTP UDP XML SFTP AMQP WebSocket
  • 10. 10 © Hortonworks Inc. 2011–2018. All rights reserved. HDF 3.1 for Big Data Engineers Multiple users, frameworks, languages, data sources & clusters BIG DATA ENGINEER • Experience in ETL • Coding skills in Scala, Python, Java • Experience with Apache Hadoop • Knowledge of tools such Hive, Flume or Pig • Knowledge of SQL • Expert in ETL (Eating, Ties and Laziness) • Social Media Maven • Deep SME in Buzzwords • No Coding skills • Interest in Pig and Falcon CAT AI • Will Drive your Car • Will Fix Your Code • Will Not Be Discussed Today • Will Not Finish This Talk For Me, This Time
  • 11. 11 © Hortonworks Inc. 2011–2018. All rights reserved. Aggregate all data from sensors, drones, logs, geo-location devices, machines and social feeds Collect: Bring Together Mediate point-to-point and bi-directional data flows, delivering data reliably to Apache HBase, Apache Hive, HDFS, Slack and Email. Conduct: Mediate the Data Flow Parse, filter, join, transform, fork, query, sort, dissect, enrich with weather, location, Apache OpenNLP and Apache MXNet. Curate: Gain Insights
  • 12. 12 © Hortonworks Inc. 2011–2018. All rights reserved. NiFi (PROD) MiNiFi MiNiFi MiNiFi MiNiFi MiNiFi MiNiFi MiNiFi Flow Registry API Persistence Other services Other services NiFi (QA) NiFi (Dev) Register DeployDeploy DataFlow Registry • NiFi Flow Registry • Standalone application/service (URL) • Standard API with pluggable components • Design and deploy mechanism for flow migration (SDLC) use cases
  • 13. 13 © Hortonworks Inc. 2011–2018. All rights reserved. Kafka Powerful Pattern with Kafka Headers: Pass Schema Key in Kafka Header Truck Geo Sensor Truck Speed Sensor Kafka Topic (raw-all_truck_events_csv) Centralized Schema Repository Publish CSV Events with Schema metadata from SR stored in Kafka Header Data Movement and Processing by NiFi using new Record-Based processing Kafka Event with Header Published by the Sensor Producing App Kafka Header Kafka Payload header with key schema.name that has metadata info to lookup the schema in HWX SR CSV Binary Event
  • 14. 14 © Hortonworks Inc. 2011–2018. All rights reserved. Nifi and Kafka 1.0 – Use Case for Kafka Message Headers Kafka
  • 15. 15 © Hortonworks Inc. 2011–2018. All rights reserved. Grafana & Kafka 1.0 Integration: Monitoring Topic Level KPIs Broker Level KPIs Kafka
  • 16. 16 © Hortonworks Inc. 2011–2018. All rights reserved. Apache Spark Integration
  • 17. 17 © Hortonworks Inc. 2011–2018. All rights reserved. Apache Spark Integration
  • 18. 18 © Hortonworks Inc. 2011–2018. All rights reserved. New: Integrated Registry Service • Integrated Flow Registry Service • Sharable between NiFi environments for Dev/UAT/Prod promotion • API or GUI driven • Can be integrated with Enterprise Version Control e.g. GitLab • ‘Buckets’ of Flows for security and access control SDLC
  • 19. 19 © Hortonworks Inc. 2011–2018. All rights reserved. New: Integrated Variable Registry Service • Integrated Variable Registry • Sets of key:value pairs available on every Process Group • Referenced with NiFi Expression Language • Dynamically changeable at runtime • Use within Versioned Flows to set Environment Variables • GUI or API driven SDLC
  • 20. 20 © Hortonworks Inc. 2011–2018. All rights reserved. • Wrap atomic functions in harnesses for regression testing • Integrate via the Rest-API to automate testing through Jenkins etc. • Automate triggering tests when new Versions are pushed to the Flow Registry SDLC Regression test with Golden Datasets
  • 21. 21 © Hortonworks Inc. 2011–2018. All rights reserved. • Nest Versioned Process Groups to test composite functions • Wrap in test harnesses to validate functionality • Flow Versioning provides visibility as components of Composites are updated SDLC Build & Test Composite DataFlows
  • 22. 22 © Hortonworks Inc. 2011–2018. All rights reserved. New: Design & Deploy complementing Command & Control • SDLC Dev: Place Process Groups under Version Control • Make changes and commit to new version • Roll Versions back or forward SDLC
  • 23. 23 © Hortonworks Inc. 2011–2018. All rights reserved. • Get Notifications of local changes or new versions available in Repository • Revert or Commit local changes via the GUI or Rest-API • Use Rest-API to integrate with Jenkins, etc. SDLC New: Design & Deploy complementing Command & Control
  • 24. 24 © Hortonworks Inc. 2011–2018. All rights reserved. Administration
  • 25. 25 © Hortonworks Inc. 2011–2018. All rights reserved. Administration
  • 26. 26 © Hortonworks Inc. 2011–2018. All rights reserved. Schema Registry
  • 27. 27 © Hortonworks Inc. 2011–2018. All rights reserved. Schema Registry
  • 28. 28 © Hortonworks Inc. 2011–2018. All rights reserved. Schema Registry
  • 29. 29 © Hortonworks Inc. 2011–2018. All rights reserved. Lifecycle Action 1 - Action: Fork Schema Version to Branch called Dev Schema Registry
  • 30. 30 © Hortonworks Inc. 2011–2018. All rights reserved. More Data Set Coverage AtlasNiFiFlowLineage (ReportingTask) NiFi Flow NiFi Data Provenance Kafka topic 1. static flow lineage from NiFi flow def 2. Add DataSet entities from NiFi Data Provenance events Atlas Integration
  • 31. 31 © Hortonworks Inc. 2011–2018. All rights reserved. sensor-data tweets default.sensor_data path1 path0 path2 Atlas Integration
  • 32. 32 © Hortonworks Inc. 2011–2018. All rights reserved. Registry
  • 33. 33 © Hortonworks Inc. 2011–2018. All rights reserved. Registry
  • 34. 34 © Hortonworks Inc. 2011–2018. All rights reserved. Registry
  • 35. 35 © Hortonworks Inc. 2011–2018. All rights reserved. Registry
  • 36. 36 © Hortonworks Inc. 2011–2018. All rights reserved. Questions?
  • 37. 37 © Hortonworks Inc. 2011–2018. All rights reserved. https://community.hortonworks.com/articles/161761/new-features-in-apache-nifi- 15-apache-nifi-registr.html https://community.hortonworks.com/articles/171787/hdf-31-executing-apache- spark-via-executesparkinte.html https://community.hortonworks.com/articles/171960/using-apache-mxnet-on-an- apache-nifi-15-instance-w.html https://community.hortonworks.com/articles/171893/hdf-31-executing-apache- spark-via-executesparkinte-1.html Resources
  • 38. 38 © Hortonworks Inc. 2011–2018. All rights reserved. Contact https://github.com/tspannhw/ApacheBigData101/tree/master https://community.hortonworks.com/users/9304/tspann.html https://dzone.com/users/297029/bunkertor.html https://www.meetup.com/futureofdata-princeton/ https://twitter.com/PaaSDev https://community.hortonworks.com/articles/155435/using-the-new-mxnet-model-server.html
  • 39. 39 © Hortonworks Inc. 2011–2018. All rights reserved. Hortonworks Community Connection Read access for everyone, join to participate and be recognized • Full Q&A Platform (like StackOverflow) • Knowledge Base Articles • Code Samples and Repositories
  • 40. 40 © Hortonworks Inc. 2011–2018. All rights reserved. Community Engagement Participate now at: community.hortonworks.com© Hortonworks Inc. 2011 – 2015. All Rights Reserved 4,000+ Registered Users 10,000+ Answers 15,000+ Technical Assets One Website!