Self-Service Analytics – For Enterprise
Audience
• Sreejith Madhavan
– msreejith@yahoo.com
– https://www.linkedin.com/in/msreejith
Enterprise Analytics Portfolio – Lay of the Land
Data Analytics – Basic Concepts
• Business Intelligence
o Using the available data to make factual business decisions
o “WHAT” is happening to your business right now?
• Business Analytics
o Steps that lead up to a business decision
o Data Mining - process of looking for trends, patterns, or other useful
information within a dataset
o Diagnostic analytics - “WHY” something is happening right now
o Predictive analytics - “WHAT Will” happen in the future
o Prescriptive analytics - “WHAT Should be Done next”
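To make the four questions concrete, here is a minimal sketch on toy data. The revenue figures, region names, and function names are all hypothetical, and the straight-line forecast is only one simple stand-in for predictive analytics, not a method the deck prescribes.

```python
from statistics import mean

# Hypothetical monthly revenue per region (illustrative numbers only).
revenue = {
    "north": [100.0, 110.0, 120.0, 130.0],
    "south": [90.0, 85.0, 80.0, 75.0],
}


def descriptive(data):
    """BI: WHAT is happening now? Total revenue in the latest month."""
    return sum(series[-1] for series in data.values())


def diagnostic(data):
    """WHY: month-over-month change per region for the latest month."""
    return {region: series[-1] - series[-2] for region, series in data.items()}


def predictive(series):
    """WHAT WILL happen: next value on a least-squares straight line."""
    n = len(series)
    xs = range(n)
    x_bar, y_bar = mean(xs), mean(series)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, series)) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    intercept = y_bar - slope * x_bar
    return intercept + slope * n


def prescriptive(data):
    """WHAT SHOULD be done: flag regions whose forecast is below the latest value."""
    return sorted(region for region, s in data.items() if predictive(s) < s[-1])
```

Calling `prescriptive(revenue)` returns the regions that would warrant attention under this toy rule; a real prescriptive step would weigh business constraints that this sketch ignores.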
Enterprise Analytics Landscape
• Enterprises typically have Users categorized broadly as -
o Business users – most interested in current metrics, fiscal trends, dashboards
o Engineering users – most interested in diagnostics (find needle-in-haystack),
deep-analytics
o An enterprise analytics solution stack should cover the self-service needs of
this broad user base
• Existing Data-stores Have Varying Use-cases
o Representing specialized data (application specific)
o Organizational units having independent solutions (IT, Engineering, Support, etc.)
o Data architecture demands (BI tool backend, Datamarts, OLTP/OLAP, etc.)
• Enter Hadoop Datalake…
o Answering “Why” you need a Hadoop Datalake in your Analytics landscape is critical
o Which short- and long-term goals need to be met
o Not meant to be a one-stop-shop solution to replace existing Databases and
workflows
o An enterprise has several types of Users (by broad skill level); a self-service solution
stack should cater to this broad User base with a mix of several tools
Understanding Existing Data-Stores
• Analytical Cubes
o Data characteristic: Structured data of pre-computed measures
o Access interface/tool: Currently SQL Server Business Analytics system
o Good for standard Biz Metrics of current and fiscal trend
• Datamart
o Data characteristic: Structured data as Star schema with Dims and Facts
o Access interface/tool: Currently Oracle Decision Support system/Datamart
o Good for interactive Adhoc reporting
• Big Data system (Datalake)
o Data characteristic: Structured, Semi-structured data per Event granularity
o Access interface/tool: Hive, M/R, Datameer
o Good for diagnostic mining and general Adhoc reports at scale
• Raw Data
o Data characteristic: Original data persisted in its incoming form
o Access interface/tool: HDFS (M/R), NFS (Scripts), REST
o Useful to do ELT to feed into other data sources
• Granularity spectrum across these stores: highly granular and complete dataset
at one end, lower granularity and subset of source data at the other
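The difference between a star-schema datamart (facts joined to dimensions at query time) and a cube of pre-computed measures can be sketched with an in-memory SQLite database. This is only an illustration: the table names, columns, and values are hypothetical, and SQLite stands in for the Oracle and SQL Server systems named in the deck.

```python
import sqlite3


def build_star_schema():
    """Create a tiny star schema: one fact table and two dimension tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE dim_system (system_id INTEGER PRIMARY KEY, model TEXT);
        CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, month TEXT);
        CREATE TABLE fact_events (
            system_id INTEGER REFERENCES dim_system(system_id),
            date_id INTEGER REFERENCES dim_date(date_id),
            error_count INTEGER
        );
        INSERT INTO dim_system VALUES (1, 'A100'), (2, 'B200');
        INSERT INTO dim_date VALUES (1, '2016-01'), (2, '2016-02');
        INSERT INTO fact_events VALUES (1, 1, 5), (1, 2, 7), (2, 1, 2), (2, 2, 1);
    """)
    return conn


def adhoc_errors_by_model(conn):
    """Datamart style: join facts to a dimension and aggregate at query time."""
    return conn.execute("""
        SELECT s.model, SUM(f.error_count)
        FROM fact_events f
        JOIN dim_system s ON s.system_id = f.system_id
        GROUP BY s.model
        ORDER BY s.model
    """).fetchall()


def precompute_cube(conn):
    """Cube style: materialize the measure once per (model, month) cell."""
    conn.execute("""
        CREATE TABLE cube_errors AS
        SELECT s.model AS model, d.month AS month, SUM(f.error_count) AS errors
        FROM fact_events f
        JOIN dim_system s ON s.system_id = f.system_id
        JOIN dim_date d ON d.date_id = f.date_id
        GROUP BY s.model, d.month
    """)


def cube_lookup(conn, model, month):
    """Read a pre-computed cell; no join or aggregation happens here."""
    row = conn.execute(
        "SELECT errors FROM cube_errors WHERE model = ? AND month = ?",
        (model, month),
    ).fetchone()
    return row[0] if row else None
```

The ad hoc query can answer questions nobody planned for, while the cube lookup is cheap but limited to the cells that were materialized, which mirrors the "interactive Adhoc reporting" versus "standard Biz Metrics" split above.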
End User Categories and Expectations
• Advanced Users (Data Engineers/Scientists)
o Usage characteristics: Enhance and persist data-model, develop deep-insights workflows
o Interface characteristics: Frameworks, APIs
o Sample tools: Map-reduce, Hive, Pig, Spark, R, Programmatic (JDBC..)
• Technical Analysts
o Usage characteristics: Generate Adhoc and canned reports
o Interface characteristics: SQL and Transformation-workflow based Tools
o Sample tools: Oracle, SQL-Server, Hive, R, Vertica, Teradata, Datameer, Tableau, PDI
• Exec-users (Non-Technical)
o Usage characteristics: Consume predefined metrics, Dashboards, drag-n-drop what-if analysis
o Interface characteristics: Visual, Natural language based tools
o Sample tools: Tableau, OBIEE, PBA, Excel, Microstrategy, Search UI
User and Use-case Requirement Considerations
• Demarcate target Users – Provision right Tool to right Users/Use-cases
– Not all users can or should be given a Hadoop Datalake interface in a self-service model
– No single tool can fit all Use-cases
• Get to a Consolidated view of existing Data Sources to cover most
common domain objects to target “BI” based self-service model
• Data architecture - Data-layout and Data-model for the above
“Consolidated view”
– Star-schema vs Analytic Cube vs Flat OLTP schema
– MPP Analytic Database vs OLAP Cube vs DSS
– Traversing and Finding Metadata - Search interface to find entities, attributes and data
– Documentation covering data-model and data-dictionary
• Performance considerations
– High-performance, high-concurrency backend for interfacing BI Tools
– Scalable environment for batch and mining use-cases
– Interactive programmatic platform for data engineering
• Miscellaneous Operational Considerations (slide 7)
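The "search interface to find entities, attributes and data" mentioned above can be pictured as a lookup over a data dictionary. The sketch below is a deliberately small assumption: the `Attribute` record, the dictionary entries, and the substring matching are hypothetical; the deck itself names Cloudera Search for this role elsewhere and does not describe an implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Attribute:
    entity: str
    name: str
    description: str


# Hypothetical data-dictionary entries, for illustration only.
DATA_DICTIONARY = [
    Attribute("system", "serial_number", "Unique identifier of a deployed system"),
    Attribute("system", "model", "Hardware model name"),
    Attribute("event", "event_time", "Timestamp at which the event was logged"),
    Attribute("event", "severity", "Severity level reported by the system"),
]


def search(term, dictionary=DATA_DICTIONARY):
    """Case-insensitive substring match over entity, attribute, and description."""
    needle = term.lower()
    return [
        attr
        for attr in dictionary
        if needle in attr.entity.lower()
        or needle in attr.name.lower()
        or needle in attr.description.lower()
    ]
```

For example, `search("timestamp")` returns the `event_time` attribute because the term appears in its description, which is the kind of discovery a documented data dictionary makes possible for non-expert users.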
Holistic View For Building E2E Analytics Platform
Objectives For Holistic Analytics Platform
• Establish a self-service Analytics platform to cover BI and
Analytics use-cases for Internal users
• Support 3Vs of User types and Access patterns
o Volume of data
o Variety of Users (Programmatic and Non-technical)
o Variety of Queries (Adhoc, Not pre-defined)
o Velocity (Interactive query response, Dashboarding)
• Design Principles
o Embrace ideology of “one-tool doesn’t fit all use-cases and user preferences”
o Ease of Use (Front-end interface and Backend Data-model)
o Improved query response times
Datalake Analytics Platform – Conceptual View
[Diagram: components arranged in three layers and two self-service tracks; labels recovered from the figure are listed below]
• Layers: DataStore Layer, Processing Engine Layer, Viz. and Data Access Layer
• Components shown: MPP/Analytic Database, PUAT Datamart, Hive, HDFS, Extended Datamodel,
Cloudera Search, Spark, Datameer (Non-Programmatic), Spark CLI and Hive JDBC (Programmatic
Access), Hue UI (Hive, Search), BI Tool Front-End, Search Front-End
• Engineering focused Self-serve Reporting (Analysts & Data engineers, Data scientists)
o Focus on Data Processing & Integration frameworks
o Adhoc Data mining, complex data transformations, Machine learning
o 25-50 Concurrent users
• Business focused Self-serve Reporting (Analysts, Execs, non-technical Audience)
o Focus on Visualization & Metrics (not Data Processing)
o Support Adhoc and Canned Self-service Reports
o 100+ Concurrent users
Datalake Analytics Platform – Technology View
[Architecture diagram: labels recovered from the figure are listed below; the connections between components are not recoverable from the extracted text]
• Data stores: HDFS (Orig Source), HDFS (Transformed), Hive/Impala, Vertica MPP Analytic DB
(12 month window), Datamart, SSAS, Other Sources…
• Processing workflows: Spark Data Prep FW, M/R Daily HDFS Transforms, Cloudera Search
Indexing Prep FW, Data-Prep/Transform (SnapLogic/Datameer), Data-Prep/Filter & Import (SnapLogic)
• Data labels: Time based SeqFile Layout, System based PARQUET Layout, On-demand Parsed
content, Structured Config Feed, Latest System Snapshot raw, Latest Week Raw & Structured,
Flattened Star-schema, Raw Data Export, Published Extended schema
• Access tools: Hue UI/Edge Node CLI (Adhoc Query), Cloudera Search Hue UI, Tableau/Pentaho BA,
Spark CLI/MLLib, DistributedR, ZoomData
• Use-cases: Text search & Search Analytics, Self-serve BI Reporting, Statistical Analytics,
Adhoc SQL Queries, On-demand Data Transformations
• Legend: Existing Components, Processing Workflows, New Components, Other
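The diagram mentions a time based layout, a system based layout, and a 12 month window on the analytic database. A rough sketch of what those ideas could mean in practice follows; the directory patterns, file names, and the whole-month window rule are assumptions made for illustration and are not taken from the deck.

```python
from datetime import date


def time_based_path(root, day, system_id):
    """Hypothetical time-first layout: partition by date, then by system."""
    return f"{root}/year={day.year}/month={day.month:02d}/day={day.day:02d}/{system_id}.seq"


def system_based_path(root, day, system_id):
    """Hypothetical system-first layout: partition by system, then by month."""
    return f"{root}/system={system_id}/month={day.year}-{day.month:02d}/part.parquet"


def in_window(day, today, months=12):
    """True if day falls in the trailing window of whole months ending at today's month."""
    age = (today.year - day.year) * 12 + (today.month - day.month)
    return 0 <= age < months
```

A time-first layout suits daily batch transforms that scan everything for a date, while a system-first layout suits diagnostics on one system across time; `in_window` shows one way a loader might decide which partitions to keep in the analytic database.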
Evolving Other Operational Requirements
Agility and Productivity for End users
Monitoring and Governance
- Monitor & recover user, system jobs/service failures
- Analytics on Analytics – user and system behaviour
- Data quality, security, etc.
Ease of access to Data
- Abstracting data complexities, provisioning prepared data to cover standard use-cases
- Query response times, data mobility (transfer) issues
Understanding the Dataset
- Documentation, Catalog, Data Dictionary, Data Exploration
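Two of the operational requirements above, recovering from job failures and watching data quality, can be sketched in a few lines. Both helpers are hypothetical illustrations (names, retry policy, and the null-rate metric are assumptions), not a description of any tool in the deck.

```python
def run_with_recovery(job, max_attempts=3):
    """Run job(); on failure retry up to max_attempts and keep a failure log."""
    failures = []
    for attempt in range(1, max_attempts + 1):
        try:
            return job(), failures
        except Exception as exc:  # broad on purpose: any job error is recorded
            failures.append(f"attempt {attempt}: {exc}")
    raise RuntimeError(f"job failed after {max_attempts} attempts: {failures}")


def null_rate(rows, column):
    """Data-quality metric: fraction of rows where column is missing or None."""
    if not rows:
        return 0.0
    missing = sum(1 for row in rows if row.get(column) is None)
    return missing / len(rows)
```

The failure log returned by `run_with_recovery` is the raw material for the "Analytics on Analytics" idea: the platform's own job history becomes a dataset that can be monitored and reported on.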

 
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Rabindra Nagar  (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Rabindra Nagar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men  🔝mahisagar🔝   Esc...
➥🔝 7737669865 🔝▻ mahisagar Call-girls in Women Seeking Men 🔝mahisagar🔝 Esc...
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 

Self Service Reporting & Analytics For an Enterprise

  • 1. Self-Service Analytics – For Enterprise Audience
      • Sreejith Madhavan
          – msreejith@yahoo.com
          – https://www.linkedin.com/in/msreejith
  • 2. Enterprise Analytics Portfolio – Lay of The Land
  • 3. Data Analytics – Basic Concepts
      • Business Intelligence
          o Using the available data to make factual business decisions
          o “WHAT” is happening to your business right now?
      • Business Analytics
          o Steps that lead up to business decision
          o Data Mining - process of looking for trends, patterns, or other useful information within dataset
          o Diagnostic analytics - “WHY” something is happening right now
          o Predictive analytics - “WHAT Will” happen in future
          o Prescriptive analytics - “WHAT Should be Done next”
  • 4. Enterprise Analytics Landscape
      • Enterprises typically have Users categorized broadly as -
          o Business users – most interested in current metrics, fiscal trends, dashboards
          o Engineering users – most interested in diagnostics (find needle-in-haystack), deep-analytics
          o An enterprise analytics solution stack should cover the self-service needs of the above broad user base
      • Existing Data-stores Have Varying Use-cases
          o Representing specialized data (application specific)
          o Organizational units having independent solutions (IT, Engineering, Support etc.)
          o Data architecture demands (BI tool backend, Datamarts, OLTP/OLAP etc.)
      • Enter Hadoop Datalake…
          o Answering “Why” you need a Hadoop Datalake in your Analytics landscape is critical
          o What short and long term goals need to be met
          o Not meant to be a one-stop-shop solution to replace existing Databases and workflows
          o An Enterprise has several types of Users (by broad skill level) – a self-service solution stack should cater to the broad User base by having a mix of several tools
  • 5. Understanding Existing Data-Stores
      • Business Analytics system
          o Data Characteristic: Structured data of Pre-Computed measures
          o Access Interface/Tool: Analytical Cubes (currently SQL Server)
          o Good for standard Biz Metrics of current and fiscal trend
      • Decision Support system/Datamart
          o Data Characteristic: Structured data as Star schema with Dims and Facts
          o Access Interface/Tool: Datamart (currently Oracle)
          o Good for interactive Adhoc reporting
      • Big Data system (Datalake)
          o Data Characteristic: Structured, Semi-structured data per Event granularity
          o Access Interface/Tool: Hive, M/R, Datameer
          o Good for diagnostic mining and general Adhoc reports at scale
      • Raw Data
          o Data Characteristic: Original data persisted in its incoming form
          o Access Interface/Tool: HDFS (M/R), NFS (Scripts), REST
          o Useful to do ELT to feed into other data sources
      • Granularity axis across the stores: “Highly granular and complete dataset” versus “Lower granularity and subset of source data”
  • 6. End User Categories and Expectations
      • Advanced Users (Data Engineers/Scientists)
          o Usage Characteristics: Enhance and persist data-model, Develop Deep insights workflows
          o Interface Characteristics: Frameworks, APIs
          o Sample Tools: Map-reduce, Hive, Pig, Spark, R, Programmatic (JDBC..)
      • Technical Analysts
          o Usage Characteristics: Generate Adhoc and canned reports
          o Interface Characteristics: SQL and Transformation-workflow based Tools
          o Sample Tools: Oracle, SQL-Server, Hive, R, Vertica, Teradata, Datameer, Tableau, PDI
      • Exec-users (Non-Technical)
          o Usage Characteristics: Consume predefined metrics, Dashboards, drag-n-drop what-if analysis
          o Interface Characteristics: Visual, Natural language based tools
          o Sample Tools: Tableau, OBIEE, PBA, Excel, Microstrategy, Search UI
  • 7. User and Use-case Requirement Considerations
      • Demarcate target Users
          – Provision the right Tool to the right Users/Use-cases
          – Not all users can or should be given a Hadoop Datalake interface in a self-service model
          – No one tool can fit all Use-cases
      • Get to a Consolidated view of existing Data Sources to cover the most common domain objects to target a “BI” based self-service model
      • Data architecture - Data-layout and Data-model for the above “Consolidated view”
          – Star-schema vs Analytic Cube vs Flat OLTP schema
          – MPP Analytic Database vs OLAP Cube vs DSS
          – Traversing and Finding Metadata - Search interface to find entities, attributes and data
          – Documentation covering data-model and data-dictionary
      • Performance considerations
          – High Performance and Concurrency support backend for interfacing BI Tools
          – Scalable environment for batch, mining use-cases
          – Interactive programmatic platform for data engineering
      • Miscellaneous Operational Considerations (slide7)
  • 8. Holistic View For Building E2E Analytics Platform
  • 9. Objectives For Holistic Analytics Platform
      • Establish a self-service Analytics platform to cover BI and Analytics use-cases for Internal users
      • Support 3Vs of User types and Access patterns
          o Volume of data
          o Variety of Users (Programmatic and Non-technical)
          o Variety of Queries (Adhoc, Not pre-defined)
          o Velocity (Interactive query response, Dashboarding)
      • Design Principles
          o Embrace ideology of “one-tool doesn’t fit all use-cases and user preferences”
          o Ease of Use (Front-end interface and Backend Data-model)
          o Improved Performance to query response times
  • 10. Datalake Analytics Platform – Conceptual View
      • Layers: DataStore Layer, Processing Engine Layer, Viz. and Data Access Layer
      • Components shown in the diagram: HDFS, Hive, Spark, MPP/Analytic Database, PUAT Datamart, Extended Datamodel, Cloudera Search, BI Tool Front-End, Search Front-End, Hue UI (Hive, Search), Spark CLI, Hive Jdbc (Programmatic Access), Datameer (Non-Programmatic)
      • Engineering focused Self-serve Reporting (Analysts & Data engineers, Data scientists)
          o Focus on Data Processing & Integration frameworks
          o Adhoc Data mining, complex data transformations, Machine learning
          o 25-50 Concurrent users
      • Business focused Self-serve Reporting (Analysts, Execs, non-technical Audience)
          o Focus on Visualization & Metrics (not Data Processing)
          o Support Adhoc and Canned Self-service Reports
          o 100+ Concurrent users
  • 11. Datalake Analytics Platform – Technology View
      • Legend: Existing Components, Processing Workflows, New Components, Other
      • Diagram labels (in extraction order): HDFS (Orig Source), Spark Data Prep FW, M/R Daily HDFS Transforms, HDFS (Transformed), Hive/Impala, Time based SeqFile Layout, System based PARQUET Layout, Adhoc Query, Hue UI/Edge Node CLI, Vertica MPP Analytic DB (12 month window), On-demand Parsed content, Datamart, Structured Config Feed, Cloudera Search Indexing Prep FW, SSAS, Latest System Snapshot raw, Latest Week Raw & Structured, Data-Prep/Transform (SnapLogic/Datameer), Cloudera Search, Hue UI, Tableau/Pentaho BA, Spark CLI/MLLib, Data-Prep/Filter & Import (SnapLogic), DistributedR, Flattened Star-schema, ZoomData, Raw Data Export, Published Extended schema, Other Sources…
      • Use-cases served: Text search & Search Analytics, Self-serve BI Reporting, Statistical Analytics, Adhoc SQL Queries, On-demand Data Transformations
  • 12. Evolving Other Operational Requirements
      • Agility and Productivity for End users
      • Monitoring and Governance
          - Monitor & recover user, system jobs/service failures
          - Analytics on Analytics – user and system behaviour
          - Data quality, security etc
      • Ease of access to Data
          - Abstracting data complexities, Provisioning prep’ed data to cover standard use-cases
          - Query response times, Data mobility (transfer) issues
      • Understanding the Dataset
          - Documentation, Catalog, Data Dictionary, Data Exploration
  • 13. External References
      • https://www.vertica.com/2014/04/18/facebook-and-vertica-a-case-for-mpp-databases/
      • https://practicalanalytics.wordpress.com/2015/06/11/databianalytics-evolution-netflix/
      • http://www.thebigdatainsightgroup.com/site/sites/default/files/Teradata's%20-%20Big%20Data%20Architecture%20-%20Putting%20all%20your%20eggs%20in%20one%20basket.pdf
      • http://www.slideshare.net/Dataconomy/hp-vertica-dataconomy
      • http://www.bryanbrandow.com/2014/05/microstrategy-vs-tableau.html
      • http://www.experfy.com/blog/pentaho-vs-tableau-comparison-visualization-dashboards/

Editor's Notes

  1. Business users (typically from Sales, Product management, Other execs)
     Engineering users (Developers, QA, Technical support engineers, Analysts, Data scientists)
  2. User Types:
      - Semi/non-technical users – easy to use drag-n-drop interface
      - Advanced users – Programmatic and SQL based interfaces
     Improved Performance considerations:
      - High Performance and Concurrent platform for user interactions via BI Tools
      - Scalable environment for batch, mining use-cases
      - Interactive programmatic platform for data engineering
  3. Business users workflows:
      - Self-service
      - Answer “What” questions
      - Analytic Database – consolidated data model supporting quick visualization, performance and a lower learning curve
     Engineering users workflows:
      - Self-service – Answer “Why” and “What next” questions
  4. Abbreviations:
      - CLI – Command-line Interface
      - MLLib – Machine learning Lib
      - Data Prep FW – Data Preparation framework
      - MPP – Massively Parallel Processing
      - BI – Business Intelligence
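
The "right Tool to right Users" idea from slides 6 and 7 could be captured as a small lookup table. The following is only a sketch: the tool lists are transcribed from slide 6, while the `UserCategory` class, the `CATEGORIES` keys and the helper functions `tools_for` and `categories_using` are hypothetical names introduced for illustration and are not part of the original deck.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UserCategory:
    """One row of the slide 6 matrix (hypothetical structure)."""
    name: str
    interface: str
    sample_tools: tuple


# Tool lists copied from slide 6; the short keys are assumptions.
CATEGORIES = {
    "advanced": UserCategory(
        name="Advanced Users (Data Engineers/Scientists)",
        interface="Frameworks, APIs",
        sample_tools=("Map-reduce", "Hive", "Pig", "Spark", "R", "JDBC"),
    ),
    "analyst": UserCategory(
        name="Technical Analysts",
        interface="SQL and Transformation-workflow based Tools",
        sample_tools=("Oracle", "SQL-Server", "Hive", "R", "Vertica",
                      "Teradata", "Datameer", "Tableau", "PDI"),
    ),
    "exec": UserCategory(
        name="Exec-users (Non-Technical)",
        interface="Visual, Natural language based tools",
        sample_tools=("Tableau", "OBIEE", "PBA", "Excel",
                      "Microstrategy", "Search UI"),
    ),
}


def tools_for(category_key: str) -> tuple:
    """Return the sample tools listed for one user category."""
    return CATEGORIES[category_key].sample_tools


def categories_using(tool: str) -> list:
    """Return the keys of every category whose sample tools include `tool`."""
    return [key for key, cat in CATEGORIES.items() if tool in cat.sample_tools]


if __name__ == "__main__":
    print(tools_for("exec"))
    print(categories_using("Hive"))
```

A table like this makes the overlap visible: tools such as Hive and Tableau appear in more than one category, which is consistent with the deck's point that no single tool fits all use-cases.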