Continuously improving factory operations is of critical importance to manufacturers. Consider the facts: the total cost of poor quality amounts to a staggering 20% of sales (American Society for Quality) and unplanned downtime costs plants approximately $50 billion per year (Deloitte).
The most pressing questions are: which process variables affect quality and yield, and which process variables predict equipment failure? Getting to those answers is giving forward-thinking manufacturers a leg up over competitors.
The speakers address the data management challenges facing today's manufacturers, including proprietary systems and siloed data sources, as well as an inability to make sensor-based data usable.
Integrating enterprise data from ERP, MES, maintenance systems and other sources with real-time operations data from sensors, PLCs, SCADA systems and historians represents a major first step. But how do you get started? What is the value of a data lake? How are AI/ML being applied to enable real-time action?
Join us for this educational session, which includes a rare view from one of our SWAT team experts into our roadmap for an open source industrial IoT data management platform.
Key Takeaways:
• How to choose an initial project from which to quickly demonstrate high value returns
• Understand the value of multivariate data sources, as opposed to a single sensor on a piece of equipment
• Understand advances in big data management and streaming analytics that are paving the way to next-generation factory performance
MICHAEL GER, General Manager, Manufacturing and Automotive, Hortonworks and RYAN TEMPLETON, Senior Solutions Engineer, Hortonworks
The American Society for Quality (http://ASQ.org) estimates that the total cost of poor quality (COPQ) amounts to a staggering 20 percent of sales.
According to Deloitte, poor maintenance strategies can reduce a plant’s overall productive capacity by 5 to 20 percent and unplanned downtime is costing industrial manufacturers an estimated $50 billion each year.
On average, companies spend 2-3% of their annual revenue on warranty costs. And perhaps one of the biggest issues: consumers are informed and have choices, and 91% of unhappy customers will never purchase from that company again.
Market Intelligence indicates that time series is a rapidly growing area of interest for IT departments, as shown by this chart from DB-Engines.com
This is confirmed by our own experience in the field working with customers
Capturing and managing time series data sets is consistently among the top few use cases for our customers in many market segments
From a market perspective, it’s important to understand and appreciate the intersection of the big data & analytics market and the Internet of Things market. Modern customer-centric data applications are fueled by both data-in-motion and data-at-rest.
The result is actionable intelligence derived from ALL available data that aligns the business with its customers and drives next generation business models.
[NEXT]
Broadly, our role at Hortonworks is putting good data and high-quality tools into the hands of the right people
Like so many things, this too is easier said than done, and many organizations have navigated the "organizational tangle" with point-to-point connections of data sources to specific data consumers
Every new consumer or data source is a new end-to-end project
Unfortunately, this is very resource-intensive to maintain and doesn't support "self-service" or "one-stop shopping" information delivery models
Step 1 for many organizations was to begin building central repositories or “Data Lakes”
This approach simplified architectures from the unmanageable many-to-many scenario to a many-to-one scenario
Data democratization occurred because it became simple to manage different consumption preferences from a single place
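For a concrete sense of the simplification, consider a hypothetical plant with 20 data sources and 15 data consumers; the counts below are purely illustrative, but the arithmetic is the point:

```python
# Point-to-point: every source is wired to every consumer it feeds.
sources, consumers = 20, 15                          # illustrative counts, not from any case study
point_to_point_integrations = sources * consumers    # 300 pipelines to build and maintain

# Data lake: each system connects once, to the lake.
data_lake_integrations = sources + consumers         # 35 pipelines

print(point_to_point_integrations, data_lake_integrations)
```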
However, data lakes did not make "getting the data" any easier for ICS data sources
Because:
• There is a large variety of data sources:
  • Simple embedded systems such as instrumentation
  • Software installed on commodity hardware
  • Integrated hardware/software suites
• Further, there are no widespread standards in:
  • Communications media or protocols
  • Data types
  • Information models
The data sources live at the tattered and ragged edge of most organizations' networks
It is common to find devices that are remote, old, have limited power, and even more limited bandwidth
Finally, due to the potential for malicious misuse of ICS devices, they must operate on secured networks to ensure:
• Safety of people and property
• Security of sensitive information (process knowledge)
Enter HDF and its robust data collection capabilities, which aid our customers in acquiring data from ICS devices and delivering it to the access engines best suited to the associated data consumers
NiFi provides:
• Access to a large variety of sources and sinks through its built-in library of processors
• Easy assembly of custom processors when custom protocols are encountered
When devices are near that tattered and ragged edge of a network, NiFi components such as MiNiFi can enable store-and-forward, compression, and batch delivery capabilities to help overcome limitations imposed by the devices and networks
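As a rough illustration of what a MiNiFi-style agent handles at the edge, here is a minimal Python sketch of the store-and-forward pattern: buffer readings locally, compress them, and ship them in batches when the uplink cooperates. The class, thresholds, and `send` callable are hypothetical, not NiFi's actual implementation.

```python
import gzip
import json
from collections import deque

class EdgeBuffer:
    """Store-and-forward sketch for a constrained edge device:
    buffer readings locally, compress them, deliver them in batches."""

    def __init__(self, send, batch_size=500):
        self.send = send                # callable that ships bytes to the central flow
        self.batch_size = batch_size
        self.queue = deque()

    def record(self, reading: dict):
        self.queue.append(reading)
        if len(self.queue) >= self.batch_size:
            self.flush()

    def flush(self):
        if not self.queue:
            return
        batch = list(self.queue)
        payload = gzip.compress(json.dumps(batch).encode())  # shrink for limited bandwidth
        try:
            self.send(payload)
        except OSError:
            return                      # uplink down: keep everything and retry later
        for _ in batch:                 # delivered: drop the local copy of what was sent
            self.queue.popleft()
```

A real deployment would get durability, back-pressure, and provenance from MiNiFi/NiFi themselves; the sketch only shows why buffering and compression matter on limited links.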
A comprehensive suite of security tools is available to help meet the demands of ICS security requirements when data must be extracted from secured networks: Kerberos, SSL, 20+ encryption ciphers for data, and standardized protocols among the HDF components
Challenges and Opportunities
Experienced higher-than-usual discard rates on certain vaccines
Investigation of causes hampered by huge data volumes and spreadsheet-based analytics
Data sources include process-historian systems on the shop floor that tag and track each batch. Maintenance systems detail plant equipment service dates and calibration settings. Building-management systems capture air pressure, temperature, and other readings in multiple locations at each plant, sampling by the minute.
Aligning all this data from disparate systems and spotting abnormalities took months using the spreadsheet-based approach, and storage and memory limits meant researchers could only look at a batch or two at a time.
Process
In the first month, the team loaded the data onto a partition of the cloud-based platform, and used MapReduce, Hive, and advanced dynamic time-warping techniques to aggregate and align the data sets around common metadata dimensions such as batch IDs, plant equipment IDs, and time stamps.
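The dynamic time-warping step is what lets sensor traces recorded at different rates or with lags be compared batch to batch. Merck's implementation ran on MapReduce and Hive; the following is only a minimal, single-machine Python sketch of the core DTW idea, with made-up series:

```python
import numpy as np

def dtw_distance(a, b):
    """Classic dynamic time warping: cost of the best alignment of series a to series b."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])                      # local mismatch
            cost[i, j] = d + min(cost[i - 1, j],              # stretch a
                                 cost[i, j - 1],              # stretch b
                                 cost[i - 1, j - 1])          # advance both
    return cost[n, m]

# Two fermentation temperature traces from two batches, sampled on different grids (synthetic)
batch_a = np.sin(np.linspace(0, 3, 60)) * 2 + 35
batch_b = np.sin(np.linspace(0, 3, 90)) * 2 + 35
print(dtw_distance(batch_a, batch_b))   # small value = the batches track each other closely
```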
In the second month, analysts used R-based analytics to chart and cluster every batch of the vaccine ever made on a heat map. Spotting notable patterns, the team then used R to produce investigative histograms and scatter plots, and it drilled down with Hive to explore hypotheses about the factors tied to low-yield production runs. Using an Agile development approach, the team set up daily data-exploration goals, but it could change course by that afternoon if it failed to find solid data backing up a particular hypothesis.
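The team did this in R; a rough Python analogue of "cluster every batch and look at it as a heat map" might look like the sketch below. The feature matrix is synthetic and the variable names are assumptions, but the workflow (scale, cluster, reorder, plot) is the point.

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in: rows = production batches, columns = aligned process variables
# (e.g., fermentation temperature, pH, dissolved oxygen summaries) -- illustrative only
rng = np.random.default_rng(0)
batch_features = rng.random((500, 20))

scaled = StandardScaler().fit_transform(batch_features)
labels = KMeans(n_clusters=4, n_init=10, random_state=0).fit_predict(scaled)

# Heat map of batches reordered by cluster, so similar batches show up as bands
order = np.argsort(labels)
plt.imshow(scaled[order], aspect="auto", cmap="viridis")
plt.xlabel("process variable")
plt.ylabel("batch (grouped by cluster)")
plt.title("Batch-by-variable heat map")
plt.colorbar(label="standardized value")
plt.show()
```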
In the third month, the team developed models, testing against the trove of historical data to prove and disprove leading theories about yield factors.
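Step three amounts to testing candidate yield factors against the historical record. A minimal sketch of such a test, with synthetic data standing in for a fermentation-phase characteristic and the final purification yield:

```python
import numpy as np
from scipy import stats

# Synthetic stand-ins, one value per historical batch (illustrative, not Merck data)
rng = np.random.default_rng(1)
fermentation_metric = rng.random(1000)                                # a fermentation-phase characteristic
final_yield = 0.6 * fermentation_metric + rng.normal(0, 0.1, 1000)   # purification-step yield

# Theory: the fermentation characteristic is tied to final yield
r, p = stats.pearsonr(fermentation_metric, final_yield)
print(f"Pearson r = {r:.2f}, p = {p:.3g}")

# Compare yield for batches below vs. above the median of the metric
cut = np.median(fermentation_metric)
low, high = final_yield[fermentation_metric < cut], final_yield[fermentation_metric >= cut]
t, p2 = stats.ttest_ind(high, low, equal_var=False)
print(f"High-vs-low yield difference: t = {t:.2f}, p = {p2:.3g}")
```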
Benefits/Results
Hadoop enables the pharmaceutical company to crunch huge amounts of data, resulting in the ability to develop and bring vaccines to market faster and at lower cost.
The team was able to come up with conclusive answers about production yield variance within just three months
Through 15 billion calculations and more than 5.5 million batch-to-batch comparisons, Merck discovered that certain characteristics in the fermentation phase of vaccine production were closely tied to yield in a final purification step.
Merck intends to optimize the production of other vaccines now in development. They're all potentially lifesaving products, according to Merck, and it's clear that the new data analysis approach marks a huge advance in ensuring efficient manufacturing and a more plentiful supply.
NOTES
Yield optimization is a high-value Big Data use case relevant to all forms of complex process manufacturing, from semiconductors to storage to biotech, and Merck is a great example. It is common for biotech manufacturers like Merck to monitor more than 200 variables in the complex fermentation process used to produce vaccines. Merck was experiencing far higher discard rates on some of its vaccines and needed to determine the root cause of its costly yield variances.
Solution: Manufacturing Data Lake and Yield Optimization Analytics at Merck
Month 1: data loaded into Hadoop and aggregated and aligned the data sets around common metadata dimensions
Month 2: analytics to chart and cluster every batch of the vaccine ever made on a heat map, spotting notable patterns and investigating further
Month 3: team developed models, testing against trove of historical data to prove/disprove theories regarding yield factors
After 15 billion calculations and 5.5 million batch-to-batch comparisons, discovered characteristics closely correlated to yield