Big Data to AI
Analytics Trends and Directions:
Cedrine Madera, PhD
Executive Information Architect
Member of IBM Academy Of Technology
Unleashing your data and making the shift to a
Data-Driven Organization
Value
Uses of Data
Efficiency Modernization Data Decision Monetization
Operations Reporting &
Data
Warehousing
Self-Service
Analytics
New
Business
Models
Data Science
Analytics maturity level
From information driven to data driven
BIG DATA, MACHINE LEARNING AND COGNITIVE/AI
Cognitive
BUSINESS
VALUE
1990’s
DATA WAREHOUSE
2012
BIG DATA
2014
Data Lake
Store and analyse growing volumes of data to meet analytics requirements: information-driven systems
Integrate unstructured data, Apache Hadoop experimentation: hybrid information- and data-driven systems
Support digital transformation with a data-driven model
Strong analytics foundations to move to AI
Information
systems
Velocity/ Variety / Volume
of Data
2017
Cognitive Information
System
2018
Infuse AI
Semantic
• Artificial Intelligence (AI)
• Intelligence exhibited by machines or software
• Machine Learning (ML)
• Type of AI that enables computers to learn without
being explicitly programmed
• Deep Learning (DL)
• Type of ML, based on neural networks loosely
modeled after the brain
• learns features and representations of data
• Training
• neural “inspired”, fed by millions of data points
• repetition drives weighting and connections
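The idea that repetition drives weighting can be sketched with a single weight trained by repeated passes over a few data points. This is a minimal illustration, not a neural network: the data, the learning rate, and the function name train_weight are assumptions made up for the example.

```python
# Illustrative only: one "neuron" with a single weight, trained by repetition.
def train_weight(samples, epochs=200, learning_rate=0.01):
    """Fit y ~ w * x by repeatedly nudging w to reduce the squared error."""
    w = 0.0
    for _ in range(epochs):              # each pass over the data is one repetition
        for x, y in samples:
            error = w * x - y
            w -= learning_rate * error * x   # gradient of 0.5 * error**2 w.r.t. w
    return w


samples = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]  # hypothetical data following y = 2x
learned_w = train_weight(samples)
print(round(learned_w, 3))
```

With more repetitions the weight settles closer to the value that explains the data; real deep learning applies the same principle to millions of weights.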
Cognitive Systems : A category of technologies that uses natural language
processing and machine learning to enable people and machines to interact more
naturally to extend and magnify human expertise and cognition.
These systems will learn and interact to provide expert assistance to scientists,
engineers, lawyers, and other professionals in a fraction of the time it now takes.
Machine Learning
Deep Learning
Break tasks into Artificial
Neural Networks
Advanced
Analytics:
NoSQL,
Hadoop &
Analytics
Human Intelligence Exhibited by Machines
Cognitive / AI
“Trained” using large amounts of data &
ability to learn how to perform the task
What the market is
saying…
https://www.forbes.com/sites/brentdykes/2017/01/11/crawl-with-analytics-before-running-with-artificial-intelligence/#61efd2f8299c
Ovum : 2017 Trends to Watch: Analytics
Machine learning and automation
is the enterprise reality of AI science fiction
“A market for algorithms will emerge..”
Upgrading data architectures must balance
new capabilities with existing investments
IDC
Crawl With Analytics Before Running With Artificial Intelligence
No Artificial Intelligence
without Information Architecture
The descriptive Analytics challenges
Functional
• Regulation & compliance (GDPR)
• Silos
• All data types
Non functional
• Scalability
• Reliability
• Security
• Data governance
• Data Gravity
Descriptive analytics can be classified into three areas that answer certain kinds of questions:
• Standard reporting and dashboards: What happened? How does it compare to our plan? What is happening now?
• Ad-hoc reporting: How many? How often? Where?
• Analysis/query/drill-down: What exactly is the problem? Why is it happening?
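The three kinds of descriptive questions above can be sketched as three queries over one small table. This is a minimal sketch using Python's built-in sqlite3 module as a stand-in for a data warehouse; the sales table, its columns, and its values are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, month TEXT, amount REAL, plan REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?, ?)",
    [
        ("North", "2017-01", 120.0, 100.0),
        ("North", "2017-02", 90.0, 100.0),
        ("South", "2017-01", 60.0, 80.0),
        ("South", "2017-02", 70.0, 80.0),
    ],
)

# Standard reporting: what happened, and how does it compare to plan?
report = conn.execute(
    "SELECT region, SUM(amount), SUM(plan) FROM sales GROUP BY region ORDER BY region"
).fetchall()

# Ad hoc reporting: how many months missed plan, and where?
misses = conn.execute(
    "SELECT region, COUNT(*) FROM sales WHERE amount < plan GROUP BY region ORDER BY region"
).fetchall()

# Drill-down: which exact rows explain the problem in one region?
detail = conn.execute(
    "SELECT month, amount - plan FROM sales WHERE region = ? AND amount < plan ORDER BY month",
    ("South",),
).fetchall()

print(report, misses, detail)
```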
The Predictive Analytics challenges
Functional
• Information system
coverage extension
• Skills- open technologies
• Machine Learning
Non functional
• Volume
• Security
• transparency
Predictive analytics can be classified into six categories:
•Data mining: What data is correlated with other data?
•Pattern recognition and alerts: When should I take action to correct or adjust a process or piece of equipment?
•Monte-Carlo simulation: What could happen?
•Forecasting: What if these trends continue?
•Root cause analysis: Why did something happen?
•Predictive modeling: What will happen next if?
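As an illustration of the Monte-Carlo category ("What could happen?"), the sketch below samples many possible outcomes and summarizes their spread. The demand distribution, the promotion effect, and the function name are assumptions chosen only for the example.

```python
import random


def simulate_monthly_demand(n_runs=10_000, seed=42):
    """Monte Carlo sketch: sample many possible values of next month's demand."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    outcomes = []
    for _ in range(n_runs):
        base = rng.gauss(1000, 100)         # hypothetical baseline demand
        promo_lift = rng.uniform(0.0, 0.2)  # hypothetical promotion effect
        outcomes.append(base * (1 + promo_lift))
    outcomes.sort()
    return {
        "p05": outcomes[int(0.05 * n_runs)],
        "median": outcomes[n_runs // 2],
        "p95": outcomes[int(0.95 * n_runs)],
    }


summary = simulate_monthly_demand()
print(summary)
```

The answer is a range of plausible outcomes rather than a single number, which is what distinguishes simulation from a point forecast.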
The Prescriptive Analytics challenges
Functional
•Business rules
automation
Non functional
•Real time
•Historical data volume
Prescriptive analytics, which is part of “advanced analytics,” is based on the concept of optimization, which can be
divided into two areas:
•Optimization: How can we achieve the best outcome?
•Stochastic optimization: How can we achieve the best outcome and address uncertainty in the data to make better
decisions?
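The difference between the two areas can be sketched with a small stocking decision. Plain optimization plans for a single expected demand value, while stochastic optimization maximizes expected profit across uncertain demand scenarios. All prices, costs, scenarios, and probabilities below are hypothetical.

```python
def profit(stock, demand, unit_cost=6.0, unit_price=10.0):
    """Profit when `stock` units are bought and at most `demand` can be sold."""
    return unit_price * min(stock, demand) - unit_cost * stock


# Hypothetical demand scenarios with their probabilities (they sum to 1).
scenarios = {60: 0.5, 100: 0.2, 140: 0.3}
candidates = range(0, 201, 10)

# Optimization: plan for the single expected demand value.
expected_demand = sum(d * p for d, p in scenarios.items())
deterministic_stock = max(candidates, key=lambda s: profit(s, expected_demand))


# Stochastic optimization: maximize expected profit over all scenarios.
def expected_profit(stock):
    return sum(p * profit(stock, d) for d, p in scenarios.items())


stochastic_stock = max(candidates, key=expected_profit)
print(deterministic_stock, stochastic_stock)
```

In this sketch the two approaches pick different stock levels, because planning for the average ignores how costly overstocking is when the low-demand scenario occurs.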
The Data governance challenges
Functional
CDO- CPO
Ethics & Analytics
Regulations
Non functional
•Data Life cycle
•Data Security
•Data quality
Data governance (DG) refers to the overall management of the availability, usability, integrity, and security of
the data employed in an enterprise.
The Data Architecture challenges
Functional
HTAP*
Data Lake
IoT
Non functional
• Volume
• Cost
• Data Security
• Data quality
• Real time
Data architecture is a set of rules, policies, standards and models that govern and define the type of data
collected and how it is used, stored, managed and integrated within an organization and its database systems.
It provides a formal approach to creating and managing the flow of data and how it is processed across an
organization’s IT systems and applications.
*Hybrid Transactional Analytical Processing
How can z Systems help to solve
those challenges?
Analytics- Machine Learning-Data governance-Data architecture
The descriptive Analytics challenges
Accelerators
IBM DB2 Analytics Accelerator
DB2 BLU
DASHDB
SIMD
SMT
• Data movement – ETL
• INZA-predictive modelling
• Queries
• Open language R-Scala(Spark)
• Archives
• Federation
• DB2 z/OS- IMS-VSAM-Oracle
Technology breadth: to simplify, to alleviate, to secure
Data gravity : volume-sensitivity-cost
HTAP enablement
The Predictive Analytics challenges
Open Framework
Machine Learning
IBM SPSS
Apache Spark
IBM Machine Learning on
z/OS
R
Technology breadth: to simplify, to alleviate, to secure
Data gravity : volume-sensitivity-cost
HTAP enablement
Machine Learning Basics
Identifies patterns in
historical data
Builds/trains
behavioral models
from patterns
Makes
recommendations
Machine learning is everywhere, influencing nearly
everything we do…
Netflix personalized movie
recommendations
Waze personalized
driving experience
7 out of 10 financial customers would take
recommendations from a robot advisor
Machine Learning - Process
Data
Ingestion
Data Cleaning
and
Transformation
Model
Training
Testing and
Validation
Deployment
Model Selection
From experimentation to production… the real data science challenge
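The process steps above can be sketched end to end in a few functions. This is a minimal sketch with synthetic data and a one-variable least-squares model; the record layout, the function names, and the 80/20 split are assumptions, and model selection is left out for brevity.

```python
import random


def ingest():
    """Data ingestion: hypothetical (hours_used, wear) records, one incomplete."""
    rng = random.Random(0)
    rows = [(float(h), 0.5 * h + 1.0 + rng.uniform(-0.2, 0.2)) for h in range(1, 41)]
    rows[5] = (None, 3.0)  # simulate a missing value
    return rows


def clean(rows):
    """Data cleaning and transformation: drop records with missing fields."""
    return [(x, y) for x, y in rows if x is not None and y is not None]


def train(rows):
    """Model training: ordinary least squares for y = a * x + b."""
    n = len(rows)
    mean_x = sum(x for x, _ in rows) / n
    mean_y = sum(y for _, y in rows) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    var = sum((x - mean_x) ** 2 for x, _ in rows)
    a = cov / var
    return a, mean_y - a * mean_x


def validate(model, rows):
    """Testing and validation: mean absolute error on held-out records."""
    a, b = model
    return sum(abs(a * x + b - y) for x, y in rows) / len(rows)


def predict(model, x):
    """Deployment: the trained model exposed as a simple scoring function."""
    a, b = model
    return a * x + b


data = clean(ingest())
random.Random(1).shuffle(data)
split = int(0.8 * len(data))
train_rows, test_rows = data[:split], data[split:]
model = train(train_rows)
mae = validate(model, test_rows)
print(model, mae, predict(model, 10.0))
```

The experimentation part is short; most of the production effort usually sits in the ingestion, cleaning, and deployment steps around it.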
Machine
Learning can be
applied to a
Variety of Use
Cases
Across Problem
Types and
Industries
Machine learning can help the IT department: batch optimization, predictive maintenance/failure, …
It can be embedded into any expert system.
The Data governance challenges
Move analytics power &
security to data
Ethics framework into
Analytics project
HW accelerator
Memory extended
zIIP eligibility
Zero cost – Zero latency for IDAA
Apache Spark
Pervasive encryption
MDM
Machine Learning
Privacy by design and by default
Technology breadth: to simplify, to alleviate, to secure
Data gravity : volume-sensitivity-cost
HTAP enablement
The analytics ethics dilemma with personal data: how GDPR could slow down analytics projects
New analytics or machine learning projects will require
ethical policies by design and by default.
The importance of Ethical
dimension with Analytics and
Machine Learning projects
Recommendations for GDPR readiness with
Analytics and Machine learning projects
• Check whether personal data is processed in big data analytics, and consider using
appropriate techniques to anonymize the personal data in the dataset(s) before
analysis...
• Become transparent about their processing of personal data by using a combination of
innovative approaches in order to provide meaningful privacy notices at appropriate stages
throughout a big data project.
• Embed a privacy impact assessment framework into their big data processing activities to
help identify privacy risks and assess the necessity and proportionality of a given project.
• Adopt a privacy by design approach in the development and application of their big data
analytics. This should include implementing technical and organizational measures to
address matters including data security, data minimization and data segregation...
• Develop ethical principles to help reinforce key data protection principles. Organizations
should create ethics boards to help scrutinize projects and assess complex issues arising
from big data analytics...
• Implement innovative techniques to develop auditable machine learning algorithms.
Internal and external audits should be undertaken with a view to explaining the rationale
behind algorithmic decisions and checking for bias, discrimination and errors...
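One possible technical measure behind the first and fourth recommendations is to strip direct identifiers and replace the customer key with a keyed hash before analysis. The sketch below assumes hypothetical field names and a placeholder key. Note that this is pseudonymization rather than full anonymization: the data can still be linked to a person by whoever holds the key, so it remains personal data under GDPR.

```python
import hashlib
import hmac

SECRET_KEY = b"replace-with-a-managed-secret"  # hypothetical; store it apart from the dataset
DIRECT_IDENTIFIERS = {"name", "email"}          # hypothetical field names


def pseudonymize(record, key=SECRET_KEY):
    """Replace the customer id with a keyed hash and drop direct identifiers."""
    token = hmac.new(key, record["customer_id"].encode("utf-8"), hashlib.sha256).hexdigest()
    kept = {
        k: v
        for k, v in record.items()
        if k not in DIRECT_IDENTIFIERS and k != "customer_id"
    }
    kept["customer_token"] = token  # stable token, so records can still be joined
    return kept


record = {
    "customer_id": "C-1001",
    "name": "Jane Doe",
    "email": "jane@example.com",
    "basket_total": 42.5,
}
safe = pseudonymize(record)
print(sorted(safe))
```

Keeping only the fields the analysis needs also serves the data minimization point in the same list.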
The Data Architecture challenges
Federated data lake
Hybrid cloud integration
IDAA
Apache Spark
DashDB
Linux on z
Technology breadth: to simplify, to alleviate, to secure
Data gravity : volume-sensitivity-cost
HTAP enablement
Reasons to limit data movement to build a
physical data lake
Data gravity – analytics
processing moves to where the
data resides
Data sensitivity – encrypt data
to limit exposure in case of a data breach
Real time analytics
requirements
Data governance high
requirements :
•Data quality : reduce data copy
•Data security : regulations ( such as
GDPR)
•Data life cycle management : alleviate
and optimize data management
The hybrid data lake federated approach
To alleviate data
movement
To use
federated data
approach
To respect
data gravity
To leverage
existing data
set
To limit data
discrepancy
Use z Systems as
one of the physical repositories
Leave z Systems data
in place
Show your data scientists
how easy it is to access z data
Imperatives to implement Data Lake hybrid
scenario
Reduce complexity of
information supply chain, e.g.
• Avoid data movement
• Simplify data transformation
• Use in-DB transformation
• Use temporary tables structures
Adhere to innovative and
novel Analytics concepts, e.g.
• Limit number of data marts and data
cubes
• Use aggregation on the fly
• Allow for agile usage patterns
• Leverage HTAP* architecture
Technologies to use for hybrid data lake
approach
Leverage state-of-the-art technology,
e.g.
HW accelerators
Special-purpose appliances
In-memory processing
Use federation technique whenever
possible, e.g.
Federated SQL queries, leaving data in
place
Federated analytical processing,
leaving data in place
Open Framework (e.g Apache Spark)
*Hybrid Transactional Analytical Processing
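The federated SQL idea, one query spanning several sources while the data stays in place, can be sketched with two separate SQLite databases attached to one connection. This is only an analogy under stated assumptions: SQLite stands in for the real systems, and the table names and rows are hypothetical. It does not show how DB2 or IDAA federation is configured.

```python
import sqlite3

# Two separate in-memory databases stand in for two systems of record.
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS history")

conn.execute("CREATE TABLE main.accounts (id INTEGER, owner TEXT)")
conn.execute("CREATE TABLE history.transactions (account_id INTEGER, amount REAL)")
conn.executemany("INSERT INTO main.accounts VALUES (?, ?)", [(1, "Ana"), (2, "Bo")])
conn.executemany(
    "INSERT INTO history.transactions VALUES (?, ?)",
    [(1, 10.0), (1, 15.0), (2, 7.5)],
)

# One SQL statement joins both sources; no rows are copied between them.
totals = conn.execute(
    """
    SELECT a.owner, SUM(t.amount)
    FROM main.accounts AS a
    JOIN history.transactions AS t ON t.account_id = a.id
    GROUP BY a.owner
    ORDER BY a.owner
    """
).fetchall()
print(totals)
```

The analyst writes a single query against logical names, and the engine decides where each part runs, which is the property the federated data lake approach relies on.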
Data in IBM DB2 Analytics Accelerator
• An extension of a DB2 for z/OS system
• ETL process acceleration and alleviation
• Accelerating SQL access to z/OS data, including
IMS, VSAM ... loaded by IDAA Loader
• Managing large volumes of historical data (HPSS)
• R queries accelerator
• Apache Spark on z/OS queries accelerator
Transparent and easy access for data
scientists
• Through JDBC or an API from Spark on distributed platforms,
including Linux on z
• With Spark on z/OS as well as Machine
Learning on z/OS
z Systems as a data lake repository in a
hybrid approach: make z data simple
Descriptive
Predictive
Prescriptive
Data architecture
Data governance
Technology breadth with IBM Z
Ask your Information Architect
to leverage them!
Wrap up of the presentation
Analytics
From information-driven to data-driven: IBM Z can help you meet the challenge!
Thank you
Cedrine Madera, PhD
Executive Information Architect
Member of IBM Academy Of Technology

Mais conteúdo relacionado

Mais procurados

How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...DATAVERSITY
 
Predictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupPredictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupCaserta
 
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...Vasu S
 
Big Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingBig Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingNitesh Khilwani
 
Big Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data AnalyticsBig Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data AnalyticsSystems Limited
 
What are the 6 elements of a project
What are the 6 elements of a projectWhat are the 6 elements of a project
What are the 6 elements of a projectRichardPierce28
 
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...Kevin Pledge
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino Data Lab
 
Analytics for actuaries cia
Analytics for actuaries ciaAnalytics for actuaries cia
Analytics for actuaries ciaKevin Pledge
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
Modern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleModern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleVasu S
 
Data Leaders Summit Barcelona 2018
Data Leaders Summit Barcelona 2018Data Leaders Summit Barcelona 2018
Data Leaders Summit Barcelona 2018Harvinder Atwal
 
Big Data in Manufacturing Final PPT
Big Data in Manufacturing Final PPTBig Data in Manufacturing Final PPT
Big Data in Manufacturing Final PPTNikhil Atkuri
 
000 introduction to big data analytics 2021
000   introduction to big data analytics  2021000   introduction to big data analytics  2021
000 introduction to big data analytics 2021Dendej Sawarnkatat
 
Gartner Business Intelligence & Analytics Summit Brochure
Gartner Business Intelligence & Analytics Summit BrochureGartner Business Intelligence & Analytics Summit Brochure
Gartner Business Intelligence & Analytics Summit BrochureNadia Smith
 
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Caserta
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalHarvinder Atwal
 
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIReal-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIibi
 
Evaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics PlatformsEvaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics PlatformsTeradata Aster
 

Mais procurados (20)

How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
 
Predictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupPredictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing Meetup
 
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...
 
Big Data and Semantic Web in Manufacturing
Big Data and Semantic Web in ManufacturingBig Data and Semantic Web in Manufacturing
Big Data and Semantic Web in Manufacturing
 
Big Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data AnalyticsBig Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data Analytics
 
Sgcp14dunlea
Sgcp14dunleaSgcp14dunlea
Sgcp14dunlea
 
What are the 6 elements of a project
What are the 6 elements of a projectWhat are the 6 elements of a project
What are the 6 elements of a project
 
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
Advanced Business Analytics for Actuaries - Canadian Institute of Actuaries J...
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...
 
Analytics for actuaries cia
Analytics for actuaries ciaAnalytics for actuaries cia
Analytics for actuaries cia
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
Modern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | QuboleModern Integrated Data Environment - Whitepaper | Qubole
Modern Integrated Data Environment - Whitepaper | Qubole
 
Data Leaders Summit Barcelona 2018
Data Leaders Summit Barcelona 2018Data Leaders Summit Barcelona 2018
Data Leaders Summit Barcelona 2018
 
Big Data in Manufacturing Final PPT
Big Data in Manufacturing Final PPTBig Data in Manufacturing Final PPT
Big Data in Manufacturing Final PPT
 
000 introduction to big data analytics 2021
000   introduction to big data analytics  2021000   introduction to big data analytics  2021
000 introduction to big data analytics 2021
 
Gartner Business Intelligence & Analytics Summit Brochure
Gartner Business Intelligence & Analytics Summit BrochureGartner Business Intelligence & Analytics Summit Brochure
Gartner Business Intelligence & Analytics Summit Brochure
 
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
Creating a DevOps Practice for Analytics -- Strata Data, September 28, 2017
 
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder AtwalDataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
DataOps - Big Data and AI World London - March 2020 - Harvinder Atwal
 
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIReal-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BI
 
Evaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics PlatformsEvaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics Platforms
 

Semelhante a Big Data to AI Analytics Trends and Directions

ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...DATAVERSITY
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseDatabricks
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Denodo
 
Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise deteo
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyNeo4j
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2RojaT4
 
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityDATAVERSITY
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentationPriyesh Patel
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data PlatformVikas Manoria
 
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav MisraFrom Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav MisraMolly Alexander
 
Building enterprise advance analytics platform
Building enterprise advance analytics platformBuilding enterprise advance analytics platform
Building enterprise advance analytics platformHaoran Du
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AIGary Allemann
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big DataIndu Khemchandani
 
An Introduction to Advanced analytics and data mining
An Introduction to Advanced analytics and data miningAn Introduction to Advanced analytics and data mining
An Introduction to Advanced analytics and data miningBarry Leventhal
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...Santiago Cabrera-Naranjo
 
IBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big DataIBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big DataIBM Software India
 

Semelhante a Big Data to AI Analytics Trends and Directions (20)

AI in the Enterprise at Scale
AI in the Enterprise at ScaleAI in the Enterprise at Scale
AI in the Enterprise at Scale
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
 
Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph Technology
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture MaturityADV Slides: How to Improve Your Analytic Data Architecture Maturity
ADV Slides: How to Improve Your Analytic Data Architecture Maturity
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentation
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
 
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav MisraFrom Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
From Foundation to Mastery – Building a Mature Analytics Roadmap - Manav Misra
 
Building enterprise advance analytics platform
Building enterprise advance analytics platformBuilding enterprise advance analytics platform
Building enterprise advance analytics platform
 
Deliveinrg explainable AI
Deliveinrg explainable AIDeliveinrg explainable AI
Deliveinrg explainable AI
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big Data
 
An Introduction to Advanced analytics and data mining
An Introduction to Advanced analytics and data miningAn Introduction to Advanced analytics and data mining
An Introduction to Advanced analytics and data mining
 
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
CTO Radshow Hamburg17 - Keynote - The CxO responsibilities in Big Data and AI...
 
IBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big DataIBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big Data
 

Último

专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxdolaknnilon
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Big Data to AI Analytics Trends and Directions

  • 1. Big Data to AI Analytics Trends and Directions: Cedrine Madera, PhD Executive Information Architect Member of IBM Academy Of Technology
  • 2. Unleashing your data and making the shift to a Data-Driven Organization Value Uses of Data Efficiency Modernization Data Decision Monetization Operations Reporting & Data Warehousing Self-Service Analytics New Business Models Data Science Analytics maturity level From information driven to data driven
  • 3. BIG DATA, MACHINE LEARNING AND COGNITIVE/AI
  • 4. Business value versus velocity / variety / volume of data. 1990s, Data Warehouse: store and analyse growing volumes of data to meet analytics requirements (information-driven systems). 2012, Big Data: integrate unstructured data, Apache Hadoop experimentation (hybrid information- and data-driven systems). 2014, Data Lake: support digital transformation (data-driven model). 2017, Cognitive Information System. 2018, Infuse AI. Strong analytics foundations are needed to go to AI.
  • 5. Semantic • Artificial Intelligence (AI) • Intelligence exhibited by machines or software • Machine Learning (ML) • Type of AI that enables computers to learn without being explicitly programmed • Deep Learning (DL) • Type of ML, based on neural networks loosely modeled after the brain • learns features and representations of data • Training • neural “inspired”, fed by millions of data points • repetition drives weighting and connections Cognitive Systems : A category of technologies that uses natural language processing and machine learning to enable people and machines to interact more naturally to extend and magnify human expertise and cognition. These systems will learn and interact to provide expert assistance to scientists, engineers, lawyers, and other professionals in a fraction of the time it now takes. Machine Learning Deep Learning Break tasks into Artificial Neural Networks Advanced Analytics: NoSQL, Hadoop & Analytics Human Intelligence Exhibited by Machines Cognitive / AI “Trained” using large amounts of data & ability to learn how to perform the task
  • 6. What the market is saying… https://www.forbes.com/sites/brentdykes/2017/01/11/crawl-with-analytics-before-running-with-artificial-intelligence/#61efd2f8299c Ovum : 2017 Trends to Watch: Analytics Machine learning and automation is the enterprise reality of AI science fiction “A market for algorithms will emerge..” Upgrading data architectures must balance new capabilities with existing investments IDC Crawl With Analytics Before Running With Artificial Intelligence
  • 7. No Artificial Intelligence without Information Architecture
  • 8. The descriptive Analytics challenges Functional • Regulation & compliance (GDPR) • Silos • All data types Non functional • Scalability • Reliability • Security • Data governance • Data Gravity Descriptive analytics can be classified into three areas that answer certain kinds of questions: • Standard reporting and dashboards: What happened? How does it compare to our plan? What is happening now? • Ad-hoc reporting: How many? How often? Where? • Analysis/query/drill-down: What exactly is the problem? Why is it happening?
  • 9. The Predictive Analytics challenges Functional • Information system coverage extension • Skills- open technologies • Machine Learning Non functional • Volume • Security • transparency Predictive analytics can be classified into six categories: •Data mining: What data is correlated with other data? •Pattern recognition and alerts: When should I take action to correct or adjust a process or piece of equipment? •Monte-Carlo simulation: What could happen? •Forecasting: What if these trends continue? •Root cause analysis: Why did something happen? •Predictive modeling: What will happen next if?
  • 10. The Prescriptive Analytics challenges Functional •Business rules automation Non functional •Real time •Historical data volume Prescriptive analytics, which is part of “advanced analytics,” is based on the concept of optimization, which can be divided into two areas: •Optimization: How can we achieve the best outcome? •Stochastic optimization: How can we achieve the best outcome and address uncertainty in the data to make better decisions?
  • 11. The Data governance challenges Functional CDO- CPO Ethics & Analytics Regulations Non functional •Data Life cycle •Data Security •Data quality Data governance (DG) refers to the overall management of the availability, usability, integrity, and security of the data employed in an enterprise.
  • 12. The Data Architecture challenges Functional HTAP* Data Lake IoT Non functional • Volume • Cost • Data Security • Data quality • Real time Data architecture is a set of rules, policies, standards and models that govern and define the type of data collected and how it is used, stored, managed and integrated within an organization and its database systems. It provides a formal approach to creating and managing the flow of data and how it is processed across an organization’s IT systems and applications. *Hybrid Transactional Analytical Processing
  • 13. How can z Systems help solve these challenges? Analytics, Machine Learning, Data governance, Data architecture
  • 14. The descriptive Analytics challenges Accelerators: IBM DB2 Analytics Accelerator, DB2 BLU, DASHDB, SIMD, SMT • Data movement – ETL • INZA: predictive modelling • Queries • Open languages: R, Scala (Spark) • Archives • Federation • DB2 z/OS, IMS, VSAM, Oracle Technology breadth: to simplify, to alleviate, to secure Data gravity: volume, sensitivity, cost HTAP enablement
  • 15. The Predictive Analytics challenges Open Framework Machine Learning: IBM SPSS, Apache Spark, IBM Machine Learning on z/OS, R Technology breadth: to simplify, to alleviate, to secure Data gravity: volume, sensitivity, cost HTAP enablement
  • 16. Machine Learning Basics Identifies patterns in historical data Builds/trains behavioral models from patterns Makes recommendations Machine learning is everywhere, influencing nearly everything we do… Netflix personalized movie recommendations Waze personalized driving experience 7 out of 10 financial customers would take recommendations from a robot advisor
  • 17. Machine Learning - Process Data Ingestion Data Cleaning and Transformation Model Training Testing and Validation Deployment Model Selection From experimentation to production… the real data science challenge
  • 18. Machine Learning can be applied to a Variety of Use Cases Across Problem Types and Industries Machine learning can help the IT department (batch optimization, predictive maintenance/failure, ...) and be embedded into any expert system.
  • 19. The Data governance challenges Move analytics power & security to data Ethics framework into Analytics projects HW accelerator Memory extended zIIP eligibility Zero cost – Zero latency for IDAA Apache Spark Pervasive encryption MDM Machine Learning Privacy by design and by default Technology breadth: to simplify, to alleviate, to secure Data gravity: volume, sensitivity, cost HTAP enablement
  • 20. The Analytics ethics dilemma with personal data: how GDPR could slow down Analytics projects New Analytics or Machine Learning projects will require ethical policies by design and by default.
  • 21. The importance of the ethical dimension in Analytics and Machine Learning projects
  • 22. Recommendations for GDPR readiness with Analytics and Machine Learning projects. Organizations should: • Check whether personal data is processed in their big data analytics, and consider using appropriate techniques to anonymize the personal data in their dataset(s) before analysis... • Become transparent about their processing of personal data by using a combination of innovative approaches to provide meaningful privacy notices at appropriate stages throughout a big data project. • Embed a privacy impact assessment framework into their big data processing activities to help identify privacy risks and assess the necessity and proportionality of a given project. • Adopt a privacy by design approach in the development and application of their big data analytics. This should include implementing technical and organizational measures to address matters including data security, data minimization and data segregation... • Develop ethical principles to help reinforce key data protection principles. Organizations should create ethics boards to help scrutinize projects and assess complex issues arising from big data analytics... • Implement innovative techniques to develop auditable machine learning algorithms. Internal and external audits should be undertaken with a view to explaining the rationale behind algorithmic decisions and checking for bias, discrimination and errors...
  • 23. The Data Architecture challenges Federated data lake Hybrid cloud integration IDAA Apache Spark DashDB Linux on z Technology breadth: to simplify, to alleviate, to secure Data gravity: volume, sensitivity, cost HTAP enablement
  • 24. Reasons to limit data movement when building a physical data lake Data gravity: analytic processing moves to where the data resides Data sensitivity: encrypt data to protect it in case of a data breach Real-time analytics requirements High data governance requirements: • Data quality: reduce data copies • Data security: regulations (such as GDPR) • Data life cycle management: alleviate and optimize data management
  • 25. The hybrid, federated data lake approach Alleviate data movement Use a federated data approach Respect data gravity Leverage existing data sets Limit data discrepancy Use z Systems as one of the physical repositories Leave z Systems data in place Show your data scientists how easy it is to access z data
  • 26. Imperatives to implement a hybrid Data Lake scenario Reduce complexity of the information supply chain, e.g. • Avoid data movement • Simplify data transformation • Use in-DB transformation • Use temporary table structures Adhere to innovative and novel Analytics concepts, e.g. • Limit number of data marts and data cubes • Use aggregation on the fly • Allow for agile usage patterns • Leverage HTAP* architecture
  • 27. Technologies to use for the hybrid data lake approach Leverage state-of-the-art technology, e.g. HW accelerators Special-purpose appliances In-memory processing Use federation techniques whenever possible, e.g. Federated SQL queries, leaving data in place Federated analytical processing, leaving data in place Open Framework (e.g. Apache Spark) *Hybrid Transactional Analytical Processing
  • 28. Data in IBM DB2 Analytics Accelerator • An extension of a DB2 for z/OS system • ETL process acceleration and alleviation • Accelerating SQL access to z/OS data, including IMS, VSAM ... loaded by IDAA Loader • Managing huge volumes of history data (HPSS ) • R queries accelerator • Apache Spark on z/OS queries accelerator Transparent and easy access for data scientists • Through JDBC or API from Spark on distributed platforms, including Linux on z With Spark on z/OS as well as Machine Learning on z/OS z Systems as a Data Lake Repository in a hybrid approach: make z data simple
  • 29. Wrap-up of the presentation Analytics: Descriptive, Predictive, Prescriptive Data architecture Data governance Technology breadth with IBM Z: ask your Information Architect to leverage them! From information driven to data driven, IBM Z can help meet the challenge!
  • 30. Thank you Cedrine Madera, PhD Executive Information Architect Member of IBM Academy Of Technology
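
Slide 17 lists the machine learning process as data ingestion, cleaning and transformation, model training, testing and validation, and deployment. The steps can be sketched end to end in plain Python. This is a minimal illustration, not the tooling named in the deck: the data values, the function names and the choice of a one-variable least-squares model are all assumptions made for the example, and the model selection step is left out.

```python
from statistics import mean

def ingest():
    """Data ingestion: hypothetical (x, y) observations; None marks a missing value."""
    return [(1, 2.0), (2, 4.1), (3, None), (4, 8.2),
            (5, 9.9), (6, 12.1), (7, 13.8), (8, 16.2)]

def clean(rows):
    """Cleaning and transformation: drop incomplete rows, cast values to float."""
    return [(float(x), float(y)) for x, y in rows if x is not None and y is not None]

def train(rows):
    """Model training: fit y = slope * x + intercept by ordinary least squares."""
    xs = [x for x, _ in rows]
    ys = [y for _, y in rows]
    x_bar, y_bar = mean(xs), mean(ys)
    slope = (sum((x - x_bar) * (y - y_bar) for x, y in rows)
             / sum((x - x_bar) ** 2 for x in xs))
    intercept = y_bar - slope * x_bar
    return slope, intercept

def validate(model, rows):
    """Testing and validation: mean absolute error on held-out rows."""
    slope, intercept = model
    return mean(abs(slope * x + intercept - y) for x, y in rows)

rows = clean(ingest())
train_rows, test_rows = rows[:5], rows[5:]   # simple hold-out split
model = train(train_rows)
error = validate(model, test_rows)

def predict(x):
    """Deployment stand-in: the trained model exposed as a callable."""
    slope, intercept = model
    return slope * x + intercept
```

Calling `predict(10)` returns the fitted line's estimate for a new input. In a production setting each function would be replaced by the corresponding platform component, which is the gap between experimentation and production that the slide points to.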
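
Slide 22 recommends treating personal data before analysis and applying data minimization. One common building block is to replace direct identifiers with keyed hashes and to drop fields the analysis does not need. The sketch below uses only the Python standard library; the key, the field names and the sample records are hypothetical. Keyed hashing is pseudonymization, which is a weaker guarantee than the anonymization the slide asks for, because whoever holds the key can re-link the records.

```python
import hashlib
import hmac

# Hypothetical secret key; in practice it would be stored outside the dataset.
SECRET_KEY = b"example-key-not-for-production"

def pseudonymize(value: str, key: bytes = SECRET_KEY) -> str:
    """Return a keyed SHA-256 digest that stands in for a direct identifier."""
    return hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()

def prepare_for_analysis(records, identifier_fields, dropped_fields):
    """Pseudonymize identifier fields and drop fields the analysis does not need."""
    cleaned = []
    for record in records:
        row = {}
        for field, value in record.items():
            if field in dropped_fields:
                continue  # data minimization: leave the field out entirely
            if field in identifier_fields:
                row[field] = pseudonymize(str(value))
            else:
                row[field] = value
        cleaned.append(row)
    return cleaned

# Hypothetical sample records.
customers = [
    {"customer_id": "C-1001", "email": "a@example.com", "balance": 2500},
    {"customer_id": "C-1002", "email": "b@example.com", "balance": 310},
]
safe_rows = prepare_for_analysis(customers, {"customer_id"}, {"email"})
```

Because the digest is deterministic for a given key, the same customer maps to the same token across datasets, so joins and aggregations still work on the pseudonymized column.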