SlideShare a Scribd company logo
1 of 50
Download to read offline
WSO2 Data Analytics
Server 3.0.0
Product Release Webinar
Inosh Goonewardena
Associate Technical Lead
WSO2 Analytics Platform
WSO2 Analytics Platform
Data Processing Pipeline
Introducing WSO2 Data Analytics Server
● Fully-open source solution with the ability to build systems and
applications that collect and analyze data and communicate the
results.
● Embodies the WSO2 Analytics Platform by combining batch, real-
time, interactive and predictive analytics capabilities
● High performance data capture framework
● Highly available and scalable by design
Advantages of DAS 3.0 over WSO2 BAM 2.5.0
● Complete rewrite from the ground up, with performance and extensibility as
core values
● Faster analytics powered by Apache Spark, 10x - 100x speedup
● Rich indexing support, with near real-time text search
● Pluggable data store support, from lightweight embedded RDBMS to highly
scalable HBase/HDFS
● Revamped Analytics Dashboard with wizard-based gadget generation
WSO2 DAS Architecture
Collecting Data
Data Model
{
'name': 'stream.name',
'version': '1.0.0',
'nickName': 'stream nickname',
'description': 'description of the stream',
'metaData':[
{'name':'meta_data_1','type':'STRING'},
],
'correlationData':[
{'name':'correlation_data_1','type':'STRING'}
],
'payloadData':[
{'name':'payload_data_1','type':'BOOL'},
{'name':'payload_data_2','type':'LONG'}
]
}
● Published data conforms to a strongly typed data stream
● One API for Batch and Real-time
Analytics.
● Asynchronous and non-blocking
nature enables extremely fast writes.
● Supports multiple transport adapters
for data collection
Data Receiver
Highly Pluggable Event Receiver Architecture
Data Persistence
● Data Abstraction Layer to enable pluggable data connectors
○ RDBMS, Cassandra and HBase/HDFS offered. Custom connectors could be easily written
● Analytics Table
○ The data persistence entity in WSO2 Data Analytics Server
○ Provides a backend data source agnostic way of storing and retrieving data
○ Allows applications to be written in a way that it does not depend on a specific data source, e.
g. JDBC (RDBMS), Cassandra APIs etc.
○ WSO2 DAS gives a standard REST API in accessing the Analytics Tables
Data Persistence
● Analytics Record Stores
○ An Analytics Record Store houses a specific set of Analytics Tables
○ The Analytics Record Stores to be used for storing incoming events and storing query
processing output are configurable
○ Single Analytics Table namespace, the target record store only given at the time of table
creation
○ Useful in creating Analytics Tables where data will be stored in multiple target databases
● Analytics File System
○ The location where the indexing data is stored
○ Multiple implementations provided OOTB, or custom implementations can be written
Analyzing Data
Batch Analytics
Batch Analytics - Overview
● Powered by Apache Spark for 10x-100x higher performance than Hadoop
● Parallel, distributed with optimized in-memory processing
● Scalable script-based analytics written using an easy-to-learn, SQL-like
query language powered by Spark SQL
● Interactive built in web interface for ad-hoc query execution
● Scheduled query script execution support with high-availability and failover
● Run Spark on a single node, Spark embedded Carbon server cluster or
connect to external Spark cluster
create temporary table product_data using CarbonAnalytics
options (schema …)
create temporary table products using CarbonAnalytics
options (schema …)
insert into products select product_name from product_data
group by …
Batch Analytics - Spark SQL
Batch Analytics - Interactive Console
Batch Analytics - Spark Scripts
Interactive Analytics
● Full text data indexing support powered by Apache Lucene
● Drill down search support
● Distributed data indexing
○ Designed to support scalability
● Near real-time data indexing and retrieval
○ Data indexed immediately as received
Interactive Analytics
Interactive Analytics
Real-time Analytics
What is Real-time Analytics?
Real-time Analytics in
→
Real-time Analytics in
→
● Gather data from multiple sources
● Correlate data streams over time
● Find interesting occurrences
● And Notify
● All in real-time
What is Real-time Analytics?
Predictive Analytics (upcoming)
Predictive Analytics in
→
What is Predictive Analytics?
Predictive Analytics in
→
● Extract, pre-process, and explore data
● Create models, tune algorithms and make predictions
● Integrate for better intelligence
What is Predictive Analytics?
Communicating Results
Dashboards
● “Overall idea” in a glance (e.g. car
dashboard)
● Support for personalization, you can
build your own dashboard.
● The entry point for Drill-down
● Building a custom dashboard
○ Dashboard via Google Gadgets and content
via HTML5 + JavaScript
○ Leverages WSO2 User Engagement Server to
build a dashboard.
○ Uses charting libraries like Vega, D3.js
Dashboards: Gadget Generation Wizard
● Start with data in tabular format
● Map each column to dimension in your plot like X,Y,
color, point size, etc
● Also do drill downs
● Create a chart with few clicks
Alerts
● Detecting conditions can be
done via CEP Queries
● “Last Mile” is key
○ Email
○ SMS
○ Push notifications to a UI
○ Pager
○ Trigger physical Alarm
APIs
● With mobile Apps, most data are
exposed and shared as APIs
(REST/JSON ) to end users.
● Analytics results can be exposed
through APIs
○ REST API
○ JavaScript API
What can WSO2 DAS do for you?
Common Use Cases of WSO2 DAS
● KPI Statistics
○ Application Statistics Monitoring
○ Network / Service Statistics
○ Sensor Data Aggregation
● Solving Optimization Problems
○ Urban Planning
○ Revenue Distribution Analysis
● Activity Monitoring
○ Tracking Message Flows
● HL7 Data Exploration
○ ESB HL7 Transport Interfaced with
DAS
● Log Analysis
○ Application / System Logs
● Sports
○ Real-time Analysis of Player
Performance
○ Real-time Match Analysis
● Geo-Spatial
○ Traffic Monitoring and Alerting
○ Geo-fencing
● Anomaly Detection
○ Fraud Detection
○ Network Intrusion Detection
○ Server Health Monitoring
API Statistics
API Statistics
HTTP Monitoring
Activity Monitoring
Activity monitoring is for tracking events from multiple nodes in a
flow to understand a specific activity
● Example:
○ A client initiating a web services request which travels through multiple ESBs, application
servers and returns back. This flow will be uniquely identified and visualized in DAS
● Used for tracing messages, finding performance hotspots in the flow
● Implemented based on a correlation id based mechanism using
Interactive Analytics
Activity Monitoring
Activity Monitoring
Activity Monitoring
Activity Monitoring
Activity Monitoring
Fraud Detection
● Built for detecting credit card fraud
● The rules are extensible with
customized Siddhi execution plans
for any type of fraud detection
● Currently leverages Real-time and
Interactive Analytics features
Source: multichannelmerchant.com
Log Analysis
● Distributed indexing and searching
of any type of logs stored in the
system
● Notifications support with Real-time
event processing features
● Application / Server health prediction
with Machine Learning
● Utilizes Interactive + Real-time
Analytics + Machine Learning
features
Source: www.retrospective.centeractive.com
Urban Route Planning
Urban Route Planning
Product Demonstration
Questions?

More Related Content

What's hot

EAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using HadoopEAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using Hadoop
DataWorks Summit
 
Big data – can it deliver speed and accuracy v1
Big data – can it deliver speed and accuracy v1Big data – can it deliver speed and accuracy v1
Big data – can it deliver speed and accuracy v1
GurinderG
 
MongoDB as a Data Warehouse: Time Series and Device History Data (Medtronic)
MongoDB as a Data Warehouse: Time Series and Device History Data (Medtronic)MongoDB as a Data Warehouse: Time Series and Device History Data (Medtronic)
MongoDB as a Data Warehouse: Time Series and Device History Data (Medtronic)
MongoDB
 
Building a Hadoop Powered Commerce Data Pipeline
Building a Hadoop Powered Commerce Data PipelineBuilding a Hadoop Powered Commerce Data Pipeline
Building a Hadoop Powered Commerce Data Pipeline
DataWorks Summit
 

What's hot (20)

Azure Stream Analytics
Azure Stream AnalyticsAzure Stream Analytics
Azure Stream Analytics
 
Building a data pipeline to ingest data into Hadoop in minutes using Streamse...
Building a data pipeline to ingest data into Hadoop in minutes using Streamse...Building a data pipeline to ingest data into Hadoop in minutes using Streamse...
Building a data pipeline to ingest data into Hadoop in minutes using Streamse...
 
Azure Stream Analytics : Analyse Data in Motion
Azure Stream Analytics  : Analyse Data in MotionAzure Stream Analytics  : Analyse Data in Motion
Azure Stream Analytics : Analyse Data in Motion
 
[WSO2Con EU 2018] Patterns for Building Streaming Apps
[WSO2Con EU 2018] Patterns for Building Streaming Apps[WSO2Con EU 2018] Patterns for Building Streaming Apps
[WSO2Con EU 2018] Patterns for Building Streaming Apps
 
Real-time user profiling based on Spark streaming and HBase by Arkadiusz Jach...
Real-time user profiling based on Spark streaming and HBase by Arkadiusz Jach...Real-time user profiling based on Spark streaming and HBase by Arkadiusz Jach...
Real-time user profiling based on Spark streaming and HBase by Arkadiusz Jach...
 
Monitoring @ scale over diverse data sources @ PayPal - Druid, TSDB, Hadoop
Monitoring @ scale over diverse data sources @ PayPal  - Druid, TSDB, HadoopMonitoring @ scale over diverse data sources @ PayPal  - Druid, TSDB, Hadoop
Monitoring @ scale over diverse data sources @ PayPal - Druid, TSDB, Hadoop
 
Obfuscating LinkedIn Member Data
Obfuscating LinkedIn Member DataObfuscating LinkedIn Member Data
Obfuscating LinkedIn Member Data
 
EAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using HadoopEAP - Accelerating behavorial analytics at PayPal using Hadoop
EAP - Accelerating behavorial analytics at PayPal using Hadoop
 
The Stream is the Database - Revolutionizing Healthcare Data Architecture
The Stream is the Database - Revolutionizing Healthcare Data ArchitectureThe Stream is the Database - Revolutionizing Healthcare Data Architecture
The Stream is the Database - Revolutionizing Healthcare Data Architecture
 
Introduction to basic data analytics tools
Introduction to basic data analytics toolsIntroduction to basic data analytics tools
Introduction to basic data analytics tools
 
Big data – can it deliver speed and accuracy v1
Big data – can it deliver speed and accuracy v1Big data – can it deliver speed and accuracy v1
Big data – can it deliver speed and accuracy v1
 
Azure Stream Analytics
Azure Stream AnalyticsAzure Stream Analytics
Azure Stream Analytics
 
MongoDB as a Data Warehouse: Time Series and Device History Data (Medtronic)
MongoDB as a Data Warehouse: Time Series and Device History Data (Medtronic)MongoDB as a Data Warehouse: Time Series and Device History Data (Medtronic)
MongoDB as a Data Warehouse: Time Series and Device History Data (Medtronic)
 
Building a Hadoop Powered Commerce Data Pipeline
Building a Hadoop Powered Commerce Data PipelineBuilding a Hadoop Powered Commerce Data Pipeline
Building a Hadoop Powered Commerce Data Pipeline
 
MongoDB .local Houston 2019: Wide Ranging Analytical Solutions on MongoDB
MongoDB .local Houston 2019: Wide Ranging Analytical Solutions on MongoDBMongoDB .local Houston 2019: Wide Ranging Analytical Solutions on MongoDB
MongoDB .local Houston 2019: Wide Ranging Analytical Solutions on MongoDB
 
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...
 
Turning Events and Big Data into Insight with WSO2 CEP and WSO2 BAM
Turning Events and Big Data into Insight with WSO2 CEP and WSO2 BAMTurning Events and Big Data into Insight with WSO2 CEP and WSO2 BAM
Turning Events and Big Data into Insight with WSO2 CEP and WSO2 BAM
 
Snowplow Analytics and Looker at Oyster.com
Snowplow Analytics and Looker at Oyster.comSnowplow Analytics and Looker at Oyster.com
Snowplow Analytics and Looker at Oyster.com
 
NoSQL, which way to go?
NoSQL, which way to go?NoSQL, which way to go?
NoSQL, which way to go?
 
Integration Monday - Analysing StackExchange data with Azure Data Lake
Integration Monday - Analysing StackExchange data with Azure Data LakeIntegration Monday - Analysing StackExchange data with Azure Data Lake
Integration Monday - Analysing StackExchange data with Azure Data Lake
 

Viewers also liked

Storing and processing data with the wso2 platform
Storing and processing data with the wso2 platformStoring and processing data with the wso2 platform
Storing and processing data with the wso2 platform
WSO2
 
Wso2datasciencesummerschool20151 150714180825-lva1-app6892
Wso2datasciencesummerschool20151 150714180825-lva1-app6892Wso2datasciencesummerschool20151 150714180825-lva1-app6892
Wso2datasciencesummerschool20151 150714180825-lva1-app6892
WSO2
 

Viewers also liked (20)

WSO2 Data Analytics Server - Product Overview
WSO2 Data Analytics Server - Product OverviewWSO2 Data Analytics Server - Product Overview
WSO2 Data Analytics Server - Product Overview
 
WSO2Con USA 2015: An Introduction to the WSO2 Analytics Platform
WSO2Con USA 2015: An Introduction to the WSO2 Analytics PlatformWSO2Con USA 2015: An Introduction to the WSO2 Analytics Platform
WSO2Con USA 2015: An Introduction to the WSO2 Analytics Platform
 
Storing and processing data with the wso2 platform
Storing and processing data with the wso2 platformStoring and processing data with the wso2 platform
Storing and processing data with the wso2 platform
 
Hadoop as an Analytic Platform: Why Not?
Hadoop as an Analytic Platform: Why Not?Hadoop as an Analytic Platform: Why Not?
Hadoop as an Analytic Platform: Why Not?
 
WSO2 Complex Event Processor - Product Overview
WSO2 Complex Event Processor - Product OverviewWSO2 Complex Event Processor - Product Overview
WSO2 Complex Event Processor - Product Overview
 
Extending WSO2 Analytics Platform
Extending WSO2 Analytics PlatformExtending WSO2 Analytics Platform
Extending WSO2 Analytics Platform
 
Webinar: Real Time BI is Open and Anywhere with SpagoBI
Webinar: Real Time BI is Open and Anywhere with SpagoBIWebinar: Real Time BI is Open and Anywhere with SpagoBI
Webinar: Real Time BI is Open and Anywhere with SpagoBI
 
WSO2 Business Activity Monitor
WSO2 Business Activity MonitorWSO2 Business Activity Monitor
WSO2 Business Activity Monitor
 
Eclipse Internet of Things in Action EclipseCON 2014
Eclipse Internet of Things in Action EclipseCON 2014Eclipse Internet of Things in Action EclipseCON 2014
Eclipse Internet of Things in Action EclipseCON 2014
 
Webinar: BI Mobile with SpagoBI: be aware everywhere!
Webinar: BI Mobile with SpagoBI: be aware everywhere!Webinar: BI Mobile with SpagoBI: be aware everywhere!
Webinar: BI Mobile with SpagoBI: be aware everywhere!
 
Wso2datasciencesummerschool20151 150714180825-lva1-app6892
Wso2datasciencesummerschool20151 150714180825-lva1-app6892Wso2datasciencesummerschool20151 150714180825-lva1-app6892
Wso2datasciencesummerschool20151 150714180825-lva1-app6892
 
WSO2 - Portfólio de Produtos, Soluções e Suportes
WSO2 - Portfólio de Produtos, Soluções e SuportesWSO2 - Portfólio de Produtos, Soluções e Suportes
WSO2 - Portfólio de Produtos, Soluções e Suportes
 
WSO2 Identity Server - Product Overview
WSO2 Identity Server - Product OverviewWSO2 Identity Server - Product Overview
WSO2 Identity Server - Product Overview
 
WSO2 Business Process Server - Product Overview
WSO2 Business Process Server - Product OverviewWSO2 Business Process Server - Product Overview
WSO2 Business Process Server - Product Overview
 
CMMN in Camunda BPM 7.6
CMMN in Camunda BPM 7.6CMMN in Camunda BPM 7.6
CMMN in Camunda BPM 7.6
 
On making standards organizations & open source communities work hand in hand
On making standards organizations & open source communities work hand in handOn making standards organizations & open source communities work hand in hand
On making standards organizations & open source communities work hand in hand
 
WSO2 Data Services Server - Product Overview
WSO2 Data Services Server - Product OverviewWSO2 Data Services Server - Product Overview
WSO2 Data Services Server - Product Overview
 
Munching & crunching - Lucene index post-processing
Munching & crunching - Lucene index post-processingMunching & crunching - Lucene index post-processing
Munching & crunching - Lucene index post-processing
 
WSO2 Platform Overview - WSO2 Meetup 01 - 16th Oct 2014
WSO2 Platform Overview - WSO2 Meetup 01 - 16th Oct 2014WSO2 Platform Overview - WSO2 Meetup 01 - 16th Oct 2014
WSO2 Platform Overview - WSO2 Meetup 01 - 16th Oct 2014
 
WSO2 Application Server - Product Overview
WSO2 Application Server - Product OverviewWSO2 Application Server - Product Overview
WSO2 Application Server - Product Overview
 

Similar to WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0

Similar to WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0 (20)

Data to Insight: Introduction to WSO2 Business Activity Monitor
Data to Insight: Introduction to WSO2 Business Activity MonitorData to Insight: Introduction to WSO2 Business Activity Monitor
Data to Insight: Introduction to WSO2 Business Activity Monitor
 
WSO2Con ASIA 2016: WSO2 Analytics Platform: The One Stop Shop for All Your Da...
WSO2Con ASIA 2016: WSO2 Analytics Platform: The One Stop Shop for All Your Da...WSO2Con ASIA 2016: WSO2 Analytics Platform: The One Stop Shop for All Your Da...
WSO2Con ASIA 2016: WSO2 Analytics Platform: The One Stop Shop for All Your Da...
 
An introduction to the WSO2 Analytics Platform
An introduction to the WSO2 Analytics Platform   An introduction to the WSO2 Analytics Platform
An introduction to the WSO2 Analytics Platform
 
Monitoring Your Business with WSO2 BAM
Monitoring Your Business with WSO2 BAMMonitoring Your Business with WSO2 BAM
Monitoring Your Business with WSO2 BAM
 
Apache Cassandra Lunch 119: Desktop GUI Tools for Apache Cassandra
Apache Cassandra Lunch 119: Desktop GUI Tools for Apache CassandraApache Cassandra Lunch 119: Desktop GUI Tools for Apache Cassandra
Apache Cassandra Lunch 119: Desktop GUI Tools for Apache Cassandra
 
Streaming Visualization
Streaming VisualizationStreaming Visualization
Streaming Visualization
 
WSO2Con USA 2015: WSO2 Analytics Platform - The One Stop Shop for All Your Da...
WSO2Con USA 2015: WSO2 Analytics Platform - The One Stop Shop for All Your Da...WSO2Con USA 2015: WSO2 Analytics Platform - The One Stop Shop for All Your Da...
WSO2Con USA 2015: WSO2 Analytics Platform - The One Stop Shop for All Your Da...
 
Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020
 
Machine Data 101 Workshop
Machine Data 101 Workshop Machine Data 101 Workshop
Machine Data 101 Workshop
 
WSO2 Analytics Platform: The one stop shop for all your data needs
WSO2 Analytics Platform: The one stop shop for all your data needsWSO2 Analytics Platform: The one stop shop for all your data needs
WSO2 Analytics Platform: The one stop shop for all your data needs
 
Data saturday malta - ADX Azure Data Explorer overview
Data saturday malta - ADX Azure Data Explorer overviewData saturday malta - ADX Azure Data Explorer overview
Data saturday malta - ADX Azure Data Explorer overview
 
WSO2 Stream Processor: Graphical Editor, HTTP & Message Trace Analytics and m...
WSO2 Stream Processor: Graphical Editor, HTTP & Message Trace Analytics and m...WSO2 Stream Processor: Graphical Editor, HTTP & Message Trace Analytics and m...
WSO2 Stream Processor: Graphical Editor, HTTP & Message Trace Analytics and m...
 
WSO2Con USA 2015: An Introduction to the WSO2 Analytics Platform
WSO2Con USA 2015: An Introduction to the WSO2 Analytics PlatformWSO2Con USA 2015: An Introduction to the WSO2 Analytics Platform
WSO2Con USA 2015: An Introduction to the WSO2 Analytics Platform
 
Machine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into InsightMachine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into Insight
 
Machine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into InsightMachine Data 101: Turning Data Into Insight
Machine Data 101: Turning Data Into Insight
 
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
 
USQL Trivadis Azure Data Lake Event
USQL Trivadis Azure Data Lake EventUSQL Trivadis Azure Data Lake Event
USQL Trivadis Azure Data Lake Event
 
IBM IoT Architecture and Capabilities at the Edge and Cloud
IBM IoT Architecture and Capabilities at the Edge and Cloud IBM IoT Architecture and Capabilities at the Edge and Cloud
IBM IoT Architecture and Capabilities at the Edge and Cloud
 
Building a Real-Time IoT monitoring application with Azure
Building a Real-Time IoT monitoring application with AzureBuilding a Real-Time IoT monitoring application with Azure
Building a Real-Time IoT monitoring application with Azure
 
Using Elasticsearch for Analytics
Using Elasticsearch for AnalyticsUsing Elasticsearch for Analytics
Using Elasticsearch for Analytics
 

More from WSO2

More from WSO2 (20)

Driving Innovation: Scania's API Revolution with WSO2
Driving Innovation: Scania's API Revolution with WSO2Driving Innovation: Scania's API Revolution with WSO2
Driving Innovation: Scania's API Revolution with WSO2
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
WSO2CON 2024 - Unlocking the Identity: Embracing CIAM 2.0 for a Competitive A...
 
WSO2CON 2024 Slides - Unlocking Value with AI
WSO2CON 2024 Slides - Unlocking Value with AIWSO2CON 2024 Slides - Unlocking Value with AI
WSO2CON 2024 Slides - Unlocking Value with AI
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation Computing
 
WSO2CON 2024 - Elevating the Integration Game to the Cloud
WSO2CON 2024 - Elevating the Integration Game to the CloudWSO2CON 2024 - Elevating the Integration Game to the Cloud
WSO2CON 2024 - Elevating the Integration Game to the Cloud
 
WSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & Innovation
WSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & InnovationWSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & Innovation
WSO2CON 2024 - OSU & WSO2: A Decade Journey in Integration & Innovation
 
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open SourceWSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
WSO2CON 2024 - Freedom First—Unleashing Developer Potential with Open Source
 
WSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaSWSO2CON 2024 Slides - Open Source to SaaS
WSO2CON 2024 Slides - Open Source to SaaS
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
WSO2CON 2024 - IoT Needs CIAM: The Importance of Centralized IAM in a Growing...
 
WSO2CON 2024 - Architecting AI in the Enterprise: APIs and Applications
WSO2CON 2024 - Architecting AI in the Enterprise: APIs and ApplicationsWSO2CON 2024 - Architecting AI in the Enterprise: APIs and Applications
WSO2CON 2024 - Architecting AI in the Enterprise: APIs and Applications
 
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
WSO2CON 2024 - WSO2's Digital Transformation Journey with Choreo: A Platforml...
 
WSO2CON 2024 - Software Engineering for Digital Businesses
WSO2CON 2024 - Software Engineering for Digital BusinessesWSO2CON 2024 - Software Engineering for Digital Businesses
WSO2CON 2024 - Software Engineering for Digital Businesses
 
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
WSO2CON 2024 - Navigating API Complexity: REST, GraphQL, gRPC, Websocket, Web...
 
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of TransformationWSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
WSO2CON 2024 - Designing Event-Driven Enterprises: Stories of Transformation
 
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
WSO2CON 2024 - Not Just Microservices: Rightsize Your Services!
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 

WSO2 Product Release Webinar: WSO2 Data Analytics Server 3.0

  • 1. WSO2 Data Analytics Server 3.0.0 Product Release Webinar Inosh Goonewardena Associate Technical Lead
  • 5. Introducing WSO2 Data Analytics Server ● Fully-open source solution with the ability to build systems and applications that collect and analyze data and communicate the results. ● Embodies the WSO2 Analytics Platform by combining batch, real- time, interactive and predictive analytics capabilities ● High performance data capture framework ● Highly available and scalable by design
  • 6. Advantages of DAS 3.0 over WSO2 BAM 2.5.0 ● Complete rewrite from the ground up, with performance and extensibility as core values ● Faster analytics powered by Apache Spark, 10x - 100x speedup ● Rich indexing support, with near real-time text search ● Pluggable data store support, from lightweight embedded RDBMS to highly scalable HBase/HDFS ● Revamped Analytics Dashboard with wizard-based gadget generation
  • 9. Data Model { 'name': 'stream.name', 'version': '1.0.0', 'nickName': 'stream nickname', 'description': 'description of the stream', 'metaData':[ {'name':'meta_data_1','type':'STRING'}, ], 'correlationData':[ {'name':'correlation_data_1','type':'STRING'} ], 'payloadData':[ {'name':'payload_data_1','type':'BOOL'}, {'name':'payload_data_2','type':'LONG'} ] } ● Published data conforms to a strongly typed data stream
  • 10. ● One API for Batch and Real-time Analytics. ● Asynchronous and non-blocking nature enables extremely fast writes. ● Supports multiple transport adapters for data collection Data Receiver
  • 11. Highly Pluggable Event Receiver Architecture
  • 12. Data Persistence ● Data Abstraction Layer to enable pluggable data connectors ○ RDBMS, Cassandra and HBase/HDFS offered. Custom connectors could be easily written ● Analytics Table ○ The data persistence entity in WSO2 Data Analytics Server ○ Provides a backend data source agnostic way of storing and retrieving data ○ Allows applications to be written in a way that it does not depend on a specific data source, e. g. JDBC (RDBMS), Cassandra APIs etc. ○ WSO2 DAS gives a standard REST API in accessing the Analytics Tables
  • 13. Data Persistence ● Analytics Record Stores ○ An Analytics Record Store houses a specific set of Analytics Tables ○ The Analytics Record Stores to be used for storing incoming events and storing query processing output are configurable ○ Single Analytics Table namespace, the target record store only given at the time of table creation ○ Useful in creating Analytics Tables where data will be stored in multiple target databases ● Analytics File System ○ The location where the indexing data is stored ○ Multiple implementations provided OOTB, or custom implementations can be written
  • 16. Batch Analytics - Overview ● Powered by Apache Spark for 10x-100x higher performance than Hadoop ● Parallel, distributed with optimized in-memory processing ● Scalable script-based analytics written using an easy-to-learn, SQL-like query language powered by Spark SQL ● Interactive built in web interface for ad-hoc query execution ● Scheduled query script execution support with high-availability and failover ● Run Spark on a single node, Spark embedded Carbon server cluster or connect to external Spark cluster
  • 17. create temporary table product_data using CarbonAnalytics options (schema …) create temporary table products using CarbonAnalytics options (schema …) insert into products select product_name from product_data group by … Batch Analytics - Spark SQL
  • 18. Batch Analytics - Interactive Console
  • 19. Batch Analytics - Spark Scripts
  • 21. ● Full text data indexing support powered by Apache Lucene ● Drill down search support ● Distributed data indexing ○ Designed to support scalability ● Near real-time data indexing and retrieval ○ Data indexed immediately as received Interactive Analytics
  • 24. What is Real-time Analytics? Real-time Analytics in →
  • 25. Real-time Analytics in → ● Gather data from multiple sources ● Correlate data streams over time ● Find interesting occurrences ● And Notify ● All in real-time What is Real-time Analytics?
  • 27. Predictive Analytics in → What is Predictive Analytics?
  • 28. Predictive Analytics in → ● Extract, pre-process, and explore data ● Create models, tune algorithms and make predictions ● Integrate for better intelligence What is Predictive Analytics?
  • 30. Dashboards ● “Overall idea” in a glance (e.g. car dashboard) ● Support for personalization, you can build your own dashboard. ● The entry point for Drill-down ● Building a custom dashboard ○ Dashboard via Google Gadgets and content via HTML5 + JavaScript ○ Leverages WSO2 User Engagement Server to build a dashboard. ○ Uses charting libraries like Vega, D3.js
  • 31. Dashboards: Gadget Generation Wizard ● Start with data in tabular format ● Map each column to dimension in your plot like X,Y, color, point size, etc ● Also do drill downs ● Create a chart with few clicks
  • 32. Alerts ● Detecting conditions can be done via CEP Queries ● “Last Mile” is key ○ Email ○ SMS ○ Push notifications to a UI ○ Pager ○ Trigger physical Alarm
  • 33. APIs ● With mobile Apps, most data are exposed and shared as APIs (REST/JSON ) to end users. ● Analytics results can be exposed through APIs ○ REST API ○ JavaScript API
  • 34. What can WSO2 DAS do for you?
  • 35. Common Use Cases of WSO2 DAS ● KPI Statistics ○ Application Statistics Monitoring ○ Network / Service Statistics ○ Sensor Data Aggregation ● Solving Optimization Problems ○ Urban Planning ○ Revenue Distribution Analysis ● Activity Monitoring ○ Tracking Message Flows ● HL7 Data Exploration ○ ESB HL7 Transport Interfaced with DAS ● Log Analysis ○ Application / System Logs ● Sports ○ Real-time Analysis of Player Performance ○ Real-time Match Analysis ● Geo-Spatial ○ Traffic Monitoring and Alerting ○ Geo-fencing ● Anomaly Detection ○ Fraud Detection ○ Network Intrusion Detection ○ Server Health Monitoring
  • 39. Activity Monitoring Activity monitoring is for tracking events from multiple nodes in a flow to understand a specific activity ● Example: ○ A client initiating a web services request which travels through multiple ESBs, application servers and returns back. This flow will be uniquely identified and visualized in DAS ● Used for tracing messages, finding performance hotspots in the flow ● Implemented based on a correlation id based mechanism using Interactive Analytics
  • 45. Fraud Detection ● Built for detecting credit card fraud ● The rules are extensible with customized Siddhi execution plans for any type of fraud detection ● Currently leverages Real-time and Interactive Analytics features Source: multichannelmerchant.com
  • 46. Log Analysis ● Distributed indexing and searching of any type of logs stored in the system ● Notifications support with Real-time event processing features ● Application / Server health prediction with Machine Learning ● Utilizes Interactive + Real-time Analytics + Machine Learning features Source: www.retrospective.centeractive.com