SlideShare uma empresa Scribd logo
1 de 47
Baixar para ler offline
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Our Superpower
100000#
10000#
250#
0# 10000# 20000# 30000# 40000# 50000# 60000# 70000# 80000# 90000# 100000#
Bananas#
Spark#Streaming#
(latency#<#3s)#
Spark#Streaming#
(op@mized#for#latency)#
Throughput)(packets/s))
10x
Throughput!
24,000x
Lower Latency!
400x
Throughput !
when Spark Streaming
optimized for latency!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
The Benchmark:
Pattern Detection in Unstructured Streaming Text Data
Spark Streaming Setup#
Bananas Setup#
Text Stream
Generator!
Throughput Regulator#
Throughput
Regulator#
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
The Benchmark:
Platform and Setup
•  Dell 815 Servers!
!
•  48 Text Classification
Pipelines!
!
•  10 Gbit Connection!
!
Spark Streaming Configurations:!
•  Receiver-Based Model!
•  12 Kafka Topic
Partitions!
•  Block Size: 200 ms!
•  Batch Size: 1.5 – 20 s!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Why does it matter?
•  Reliability – May add 2 Nines!
•  Hardware Cost – Potentially 100x Less Cost In Hardware!
•  Energy – Potentially 100x Less Energy !
•  Data Center Footprint – Potentially 100x Less Racks !
•  Manageability – 10 machines versus 1000 machines!!
•  Network BW – Potentially 100x less network BW!
•  Total Cost of Ownership – Potentially < 1000x !!!!
•  Greater Peace of Mind!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Who will pay for Real-Time Solutions?
Real-Time: Expected Latency < 1ms
•  Online Marketers!
–  Process over 100k events per second for thousands
of social media websites!
–  Expected revenue > $2.1 Trillion!
•  IoT Businesses!
–  Process thousands of events per second from
millions of connected devices!
–  Expected revenue > $100 Billion!
•  Spam and Fraud Detection!
–  Detect multiple complex patterns in millions of
transactions and documents per second!
–  Expected revenue > $40 Billion!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
The Akuda Quest
•  To enable truly-real time classification of
extremely high rate data streams !
•  To enable subject matter experts who
possess extensive knowledge of the domain
the data belongs to, and who are often non-
programmers, to directly create classifiers!
•  To enable the fast development and
refinement of data classifiers!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
The Real-Time Classification Challenge
Latency < 1ms
ocuments/second
ytes/packet
Ultra Fast Classification & Correlation
0.001 seconds (max latency)
1,000,000
Distinct Possible
Events/Trigger/Results
K1
K5
K4
K3
K2
K6
K7 K8
K9
K10
Actionable
Information
Previous Knowledge Previous Knowledge Previous Knowledge
100 !
events/s #
10,000 !
Devices#
1,000,000 !
packets/s #
10,000!
Classifiers#
10 Billion
Classification
Operations/s#
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Pulsar Analyst Workbench
Quick, Intuitive Classifier Development Sandbox
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Quick Model Optimization
Specialized Compiler, Data Analysis Tools
RESOLVED FILTERING NETWORK
Optimizing Parallelizing Compiler
Cycle
Detection
Reordering
DFA
Pruning
Platform
Targeting
TARGET
PLATFORM
TOPOLOGY
Execution
Engine
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
AKUDA Technology Delivery
•  SaaS turn-key solution, with a model
development system that allows for
deployment of complete solutions in hours,
without any coding requirements.!
•  Privately deployable enterprise solution on
a Cloud Infrastructure. !
•  Software Development Infrastructure for
developing highly specific and targeted
solutions.!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
The SaaS Platform: Pulsar
High Level View
INBOUND
DATA
HUB
DATA
AUGMENTATION
&
CORRELATION
CLASSIFICATION
INDEXING
CLUSTER
ANALYSIS
OUTBOUND
DATA
HUB
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Pulsar
System View
Optimizing Parallelizing Compiler
for Classification, Analysis and Action
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Massively Parallel RT
Classification Engine
Social Media Data Sources
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
General Data Sources
Real-time
Stream
Aggregator
RT Classification Pipeline
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Author Attribute Store
Image Universal
Searchable
Index
Image Store
Massively Parallel RT
Classification Engine
AKUDA
Broadcaster
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
AuthorAtributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
(LGM)
AuthorAtributeDetector
AuthorAtributeDetector
AuthorAtributeDetector LDA
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Delivery Integration
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Pulsar
Inbound Data Hub
Optimizing Parallelizing Compiler
for Classification, Analysis and Action
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Massively Parallel RT
Classification Engine
Social Media Data Sources
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
General Data Sources
Real-time
Stream
Aggregator
RT Classification Pipeline
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Author Attribute Store
Image Universal
Searchable
Index
Image Store
Massively Parallel RT
Classification Engine
AKUDA
Broadcaster
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
AuthorAtributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
(LGM)
AuthorAtributeDetector
AuthorAtributeDetector
AuthorAtributeDetector LDA
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Delivery Integration
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Pulsar
LGM: Data Augmentation and Correlation
Optimizing Parallelizing Compiler
for Classification, Analysis and Action
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Massively Parallel RT
Classification Engine
Social Media Data Sources
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
General Data Sources
Real-time
Stream
Aggregator
RT Classification Pipeline
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Author Attribute Store
Image Universal
Searchable
Index
Image Store
Massively Parallel RT
Classification Engine
AKUDA
Broadcaster
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
AuthorAtributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
(LGM)
AuthorAtributeDetector
AuthorAtributeDetector
AuthorAtributeDetector LDA
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Delivery Integration
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Pulsar
Bananas: Data Classification
Optimizing Parallelizing Compiler
for Classification, Analysis and Action
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Massively Parallel RT
Classification Engine
Social Media Data Sources
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
General Data Sources
Real-time
Stream
Aggregator
RT Classification Pipeline
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Author Attribute Store
Image Universal
Searchable
Index
Image Store
Massively Parallel RT
Classification Engine
AKUDA
Broadcaster
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
AuthorAtributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
(LGM)
AuthorAtributeDetector
AuthorAtributeDetector
AuthorAtributeDetector LDA
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Delivery Integration
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Pulsar
Corona: Cluster Analysis
Optimizing Parallelizing Compiler
for Classification, Analysis and Action
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Massively Parallel RT
Classification Engine
Social Media Data Sources
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
General Data Sources
Real-time
Stream
Aggregator
RT Classification Pipeline
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Author Attribute Store
Image Universal
Searchable
Index
Image Store
Massively Parallel RT
Classification Engine
AKUDA
Broadcaster
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
AuthorAtributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
(LGM)
AuthorAtributeDetector
AuthorAtributeDetector
AuthorAtributeDetector LDA
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Delivery Integration
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Pulsar
Outbound Data Hub
Optimizing Parallelizing Compiler
for Classification, Analysis and Action
Network
LDA
Cluster
Generator
LDA
Cluster
Refinement
Massively Parallel RT
Classification Engine
Social Media Data Sources
Universal Store
Social
Media
Harvester
General
Data
Integration
Hub
Data Source
Akuda
Agent
Universal
Searchable
Index
Data Source
Direct
Feed
Author
[G,A,E]
Image Analyzer
(LGM)
Author Info
Analyzer
(LGM)
General Data Sources
Real-time
Stream
Aggregator
RT Classification Pipeline
Author
Geolocation
Analyzer
(LGM)
Image
Data
Sources
Image
Harvester
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Author Attribute Store
Image Universal
Searchable
Index
Image Store
Massively Parallel RT
Classification Engine
AKUDA
Broadcaster
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
RT Classification Pipeline
AuthorAtributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
(LGM)
AuthorAtributeDetector
AuthorAtributeDetector
AuthorAtributeDetector LDA
Feature
Generator
(Proximity NGRAMS)
MISSION EDITOR
DFA
Ta
p
DFA
Ta
p
Ta
p
DFA
DFA
Classifier
Refinement
Pipeline Deep
Inspection Store
Metrics And Alarms
RT Stream
Indexer
Delivery Integration
Hub
Target
Systems
Dashboard
Editor
Visualization
RT DASHBOARD
[Corona]
PIPELINE STUDIO
[Pulsar]
DEEP INSPECTION
Query UI
AUTHOR
ATTRIBUTE
Query UI
UNIVERSAL
STREAM
Query UI
LDA
Classifier
Generator
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
THE AKUDA CORE!
MASSIVELY PARALLEL STREAMING
CLASSIFICATION INFRASTRUCTURE!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Possible Solution 1
NOT THIS - GTS: Scalability & Latency Problems
Feed BC
Rx
Rx
Rx
Rx
Indexer
Broadcaster
GTS
Indexing
System
Query With Frequency 2 q/s
Indexer
Indexer
Indexer
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Query With Frequency 2 q/s
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Analytics
Visualization
Index Storage
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Possible Solution 2
NOT THIS - HADOOP: Latency Problems
Feed BC
Brodcaster
HADOOP
Broadcaster#
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Possible Solution 3
Not Quite There: Spark Streaming Pipeline of RDDs
Source
1,000,000 documents/second
1,024 bytes/packet
MicroBatcher
1,000,000 Sequential Stages
Doc 01
Doc 02
Doc 03
Doc 04
Doc 05
Doc 06
Doc 07
Doc 08
Doc 09
Doc 10
Doc 11
Doc 12
Doc 13
Doc 14
Doc 15
Doc 16
Latency of minutes, hours??
Network Transfers and/or Data
Copying Across Host Nodes or
Pipeline Stages
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Possible Solution 4
Almost There: Data Flow Pipelines, Data Replication
Source
1,000,000 documents/second
1,000 bytes/packet
Broadcaster
Bisection Bandwidth
1,000,000,000,000,000 bytes/second
~10,000,000 GBits/second
~10,000 TBits/second !!!
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
Doc 01
1,000,000
Stages Running
Simultaneously
Low Lat
Broadcasting Doc Replicas
becomes extreme bottleneck
PCIe 3.0 lane BW: ~ 1GByte/second
10Gbps Ethernet: ~ 1GB/second
Infiniband: Mellanox 56Gb/s FDR IB:
6.8GB/s
Cisco Catalyst 2960G-49TC-L Switching
Fabric: 40mpps. At 1000 bytes/
packet: 40,000 MBytes/second
==> 40 GBytes/second
Intel-Xeon-Processor-E7-8890 (15 cores)
Max Mem BW:85GBytes/second
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Possible Solution 5
Cost & Latency Issues: Data Broadcasting Tree
Feed BC
Rx
Rx
Rx
Rx
Broadcaster
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
1
10
10
10
R
x
R
x
R
x
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
Rx
R
x
R
x
R
x
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Indexi
ng /
Analyti
cs
Visuali
zation
Rx
Rx
Rx
Rx
Rx
Rx
10
10
10
1 + 10 x 10 x 10 x 10 x 10 x 10 = 1,000,001 Nodes
Worst Case Cost = 1,000,001 * $1000/month:
~ $ 1 Billion / Month !!!!
Latency Goes back to hours or days!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Honey, I Shrunk the Trees!
AKUDA Core Topology
Indexing / Analytics
Feed
Rx Tx
Visualization
Broadcaster
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx
Model Pipelines
Tx
Visualization
1,000,000 documents/second
1,000 bytes/packet
100,000 Short Pipelines * 10 Stages each= 1,000,000 Stages
Akuda Queue Technology
using on-chip inter-core
networks
Akuda Buffer Technology
using on-chip inter-core
networks
Akuda Correlator Technology
using on-chip inter-core
networks
0.001 seconds typical latency
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
The Solution
Utilize Inter-Core Communication Channel
Data Communication Hardware! Typical Bandwidth! Typical Cost!
10 Gbps Ethernet! 1 GB/s! $ 1,000!
PCIe 3.0 Lane! 1 GB/s! $10,000!
Infiniband, Mellanox 56Gb/s FDR IB! 6.8 GB/s! $1,000,000!
Cisco Catalyst Switching Fabric! 40 GB/s! $10,000,000!
Inter-core/Inter-processor Fabric
Bisection Bandwidth!
1000 GB/s !
(for IA64 Chips)!
$500!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Data Broadcasting
Use The Best Broadcasting Network
340GB/s
> 1000GB/s
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
The Solution
AKUDA Core Differentiating Factors
Lockfree Queue, Pipeline
Control!
Lockfree Correlator!
Lockfree Multithreaded
Processing!
Feed BC
Broadcaster
Indexing /
AnalyticsRx Tx
Visualization
1
10
1000
Akuda
Core
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Akuda
Core
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Akuda
Core
Akuda
Core
Akuda
Core
Zero-replication
Data Broadcasting!
On-chip-
network
Communication
Control!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Adaptive Topology
Continuous Optimization of Data Comm & Pipeline Execution
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Akuda Core Scalability
Bisection Bandwidth
10 20 30 40 50 60 70 80
80
70
60
50
40
30
20
10
BBW
Processors
Akuda Lock-free
Algorithms
Standard Algorithms
Processing Latency
10 20 30 40 50 60 70 80
80
70
60
50
40
30
20
10
Time
Processors
Akuda Lock-free
Algorithms
Standard Algorithms
Processing Cost
200 400 600 800 1000 1200 1400 1600
800
700
600
500
400
300
200
100
1000 $ / Month
MILLION [Stream Rate * Pipelines * Patterns]
Akuda Lock-free
Algorithms
Standard
Algorithms
Parallelization Speedup
10 20 30 40 50 60 70 80
80
70
60
50
40
30
20
10
Speedup
Processors
Akuda Lock-free
Algorithms
Standard Algorithms
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
AKUDA Core in Action
Election2016.io: Real-Time Online Polls
“The problem is that when polls are wrong, they tend to
be wrong in the same direction. If they miss in New
Hampshire, for instance, they all miss on the same
mistake.” -- Nate Silver!
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Akuda Core in Action
Election2016.io Backend
Feed
Indexing / Analytics
Rx
Model Pipelines
Tx
Visualization
50,000 documents/second (peak)
1,000 bytes/document
3000 Models (Author Classification + App Classification)
Akuda Broadcasting
Technology
using on-chip inter-core fabric
Akuda Buffer Technology
using on-chip inter-core
fabric
Akuda Correlator Technology
Sub-second Latency
150 GigaBytes Bisection Bandwidth
(Over 1 TERAbit/second)
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Indexing / Analytics
Rx Tx
Visualization
Learned People
Attributes
Akuda Classification
Technology
Correlator
Akuda Data Analysis
Technology
100 Patterns/Model
15 BILLION
Patterns/second
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
STANDALONE#USE#OF#
BANANAS##
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
General Statistical Classification
K-MEANS, LDA, NN
Feed
Rx Tx
Broadcaster
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx
k-means Model SubMatrix
Tx
1,000,000 documents/second
1,000 bytes/packet
10,000 Nodes * 100 k-means Centroid Vectors
Akuda Queue Technology
using on-chip inter-core
networks
Akuda Buffer Technology
using on-chip inter-core
networks
0.001 seconds typical latency
Aggregator
Akuda Queue Technology
using on-chip inter-core
networks
k-means
cluster
label
for data
item
Akuda Lockless
Matrix Ops
Akuda Lockless
Correlator
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
IOT Classification POC
K-MEANS
Feed
Rx Tx
Broadcaster
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
Rx Tx
1,000,000 sensor-vectors/second
1,000 bytes/vector
Classification Using DFA, K-MEANS, LDA, or NN Models
Akuda Queue Technology
using on-chip inter-core
networks
Akuda Buffer Technology
using on-chip inter-core
networks
0.001 seconds typical latency
Aggregator
Akuda Queue Technology
using on-chip inter-core
networks
Sensor
Warnings
Akuda Lockless
Matrix Ops
Sensor State
Classification
Akuda Lockless
Correlator
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
IOT Classification POC
K-MEANS
LINEAR
ALGEBRA
ENGINE - 1
LINEAR
ALGEBRA
ENGINE - 2
LINEAR
ALGEBRA
ENGINE - N
LINEAR
ALGEBRA
ENGINE - 100
DATA
RECEIVER
INPUT DATA
CHANNEL
L2 NORM
CHANNEL
AGGREGATOR
LOCKLESS HASH
UNSORTED
CHANNEL
MIN FINDER
INPUT DATA
STREAM
OUTPUT DATA
STREAM
Packet ID
Input Packet: D
Packet ID
Transformed Packet: D’
Packet ID
Minimum Elements Vector
Minimum Distance
from Classifier: Pn
Packet ID
Classified Packet
For, K = 100,000 (number of clusters)
       N = 100 (number of processors)
       P = 1000 (cardinality of feature set)
       D : Input Vector to be classified
       A : Model matrix representing trained
values for classification centroids
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
AKUDA#LABS#PATENTS#
Pending#&#Provisional#
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
PATENT LIST (1/3)#
1 HIERARCHICAL, PARALLEL MODELS FOR EXTRACTING IN REAL TIME HIGH-VALUE INFORMATION FROM DATA STREAMS AND SYSTEM AND METHOD FOR
CREATION OF SAME
2 HIERARCHICAL, PARALLEL MODELS FOR EXTRACTING IN REAL-TIME HIGH-VALUE INFORMATION FROM DATA STREAMS AND SYSTEM AND METHOD FOR
CREATION OF SAME
3
MASSIVELY-PARALLEL SYSTEM ARCHITECTURE AND METHOD FOR REAL-TIME EXTRACTION OF HIGH-VALUE INFORMATION FROM DATA STREAMS
4
OPTIMIZATION FOR REAL-TIME, PARALLEL EXECUTION OF MODELS FOR EXTRACTING HIGH-VALUE INFORMATION FROM DATA STREAMS
5
EXTRACTION OF HIGH VALUE INFORMATION FROM UNSTRUCTURED IMAGES IN MASSIVELY PARALLEL PROCESSING SYSTEM
6
REAL-TIME MASSIVELY PARALLEL PIPELINE PROCESSING SYSTEM
7
ADDITIONAL APPLICATIONS DIRECTED TO SPECIFIC ASPECTS/IMPROVEMENTS OF REAL-TIME MASSIVELY PARALLEL PIPELINE PROCESSING SYSTEM
8
AUTOMATIC TOPIC DISCOVERY IN STREAMS OF SOCIAL MEDIA POSTS
9
TOPIC AND TREND DISCOVERY WITHIN REAL-TIME ONLINE CONTENT STREAMS
10
SYSTEM AND METHOD FOR IMPLEMENTING ENTERPRISE RISK MODELS BASED ON INFORMATION POSTS
11
ADDITIONAL APPLICATIONS DIRECTED TO SPECIFIC MODELS OTHER THAN RISK MODELS
12
LAZY PARSER FOR INFERENCE IN UNSTRUCTURED DATA STREAMS
13
REALTIME DATA STREAM CLUSTER SUMMARIZATION AND LABELING SYSTEM
14
DATA BROADCASTING TECHNOLOGY FOR REAL TIME ANALYTICS FROM UNSTRUCTURED DATA
15
REAL-TIME STREAM CORRELATION WITH PRE-EXISTING KNOWLEDGE (STATE)
16
LOCKLESS KEY-VALUE STORE AND MEMORY CACHING SYSTEM
17
DYNAMIC RESOURCE ALLOCATOR FOR REAL-TIME PARALLEL PIPELINE PROCESSING SYSTEM
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
PATENT LIST (2/3)#
18
REALTIME LOW LATENCY DATA STREAM DFA CLASSIFICATION ENGINE
19 PARALLEL PROCESSING ARCHITECTURE AND DATA BROADCASTING TECHNOLOGY FOR SOCIAL MEDIA AUTHOR CLASSIFICATION AND
ANALYSIS STREAM
20
ATTRIBUTE VECTOR COMPRESSION FOR STREAM PROCESSING
21
REATIME IOT PARALLEL VECTOR CLASSIFICATION
22
REALTIME IMAGE HARVESTING AND STORAGE SYSTEM
23
DATA STREAM HISTORIC REPLAY VERSIONING (SKYLINE)
24
DATA STREAM HISTORIC REPLAY SYSTEM AND STORAGE
25
EXTRACTION OF AUTHOR(PEOPLE) ATTRIBUTES THROUGH COMPLEX DFA MODELS
26
REALTIME IMAGE HARVESTING AND STORAGE SYSTEM
27
NEURAL NETWORK-BASED SYSTEM FOR EXTRACTION OF DEMOGRAPHICS FROM SOCIAL MEDIA IMAGES
28
METHODFORSOCIALMEDIAEVENTDETECTIONANDCAUSEANALYSIS
29
METHOD FOR REAL-TIME TAGGING OF DATA STREAM DOCUMENTS
30
PEOPLE ATTRIBUTE QUERY AND VISUALIZATION TOOL
31
WORD SET VISUAL NORMALIZED WEIGHT DAMPENING
32 PARALLEL PROCESSING ARCHITECTURE AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME ANALYTICS FROM UNSTRUCTURED
ELECTION DATA
33 PARALLEL PROCESSING ARCHITECTURE AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME ANALYTICS FROM UNSTRUCTURED
RETAIL DATA
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
PATENT LIST (3/3)#
34
SYSTEMS AND METHODS FOR ANALYZING UNSOLICITED PRODUCT/SERVICE CUSTOMER REVIEWS
35
SYSTEM FOR CREDIT/INSURANCE PROCESSING USING UNSTRUCTURED DATA
36
SYSTEM AND METHOD FOR CORRELATING SOCIAL MEDIA DATA AND COMPANY FINANCIAL DATA
37
SYSTEMS AND METHODS FOR IDENTIFYING AN ILLNESS AND COURSE OF TREATMENT FOR A PATIENT
38
SYSTEM AND METHOD FOR IDENTIFYING FACIAL EXPRESSIONS FROM SOCIAL MEDIA IMAGES
39
SYSTEM AND METHOD FOR DETECTING HEALTH MALADIES IN A PATIENT USING UNSTRUCTURED IMAGES
40 SYSTEM AND METHOD FOR DETECTING POLITICAL DESTABILIZATION AT A SPECIFIC GEOGRAPHIC LOCATION BASED ON SOCIAL
MEDIA DATA
41
SYSTEM AND METHOD FOR IDENTIFYING CORRELATIONS BETWEEN SOCIAL MEDIA IMAGES USING NEURAL NETWORKS
42
SYSTEM AND METHOD FOR SCALABLE PROCESSING OF DATA PIPELINES USING A LOCKLESS SHARED MEMORY SYSTEM
43
ASYNCHRONOUS WEB PAGE DATA AGGREGATOR
44
APPLICATIONS OF DISTIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY TO REAL TIME NEWS SERVICE
45
DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME THREAT ANALYSIS
46
DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME EMERGENCY RESPONSE
47
DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR CLIMATE ANALYTICS
48
DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR INSURANCE RISK ASSESSMENT
49
DISTRIBUTED PARALLEL ARCHITECTURES FOR REAL TIME PROCESSING OF STREAMS OF STRUCTURED AND UNSTRUCTURED DATA
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
THE#AKUDA#SYSTEM#
Addi@onal#Informa@on#
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
The Solution
Akuda Core Topology with Kafka
UU#OnUchipUnetwork#Comm#Control#
UU#ZeroUcopy#Data#Broadcas@ng#
UU#Lockfree#queue,#pipeline#control#
UU#Lockfree#correlator#
UU#Lockfree#Mul@threaded#Processing#
Feed BC
Kafka
Indexing /
AnalyticsRx Tx
Visualization
1
10
1000
Akuda
Core
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Akuda
Core
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Indexing /
AnalyticsRx Tx
Visualization
Akuda
Core
Akuda
Core
Akuda
Core
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Pulsar
Functional View
Unstructured
Data Source
Streams
Unstructured
Data Source
Batch
Unstructured
Data Source
Images
MILLIONS OF DOCUMENTS
PER SECOND
LDA
CONTROL
AKUDA
DEEP INSPECTION
THIRD-PARTY
DATA ANALYTICS
HADOOP
BASED ANALYTICS
THIRD-PARTY
VISUALIZATION
AKUDA
DASHBOARD
RT
Content
Classification
(DFA/LDA/VEC)
RT
Author
Classification
(DFA/LDA)
Optimizing Parallelizing
Compiler
Normalization
RT
Author
Image Analysis
(NEURAL NETS)
Universal
Indexing
P-GRAM GEN
Indexer
STATS /
ANALYTICS
Author ATTR
Author GEO
Author DEM
LDA PROC
P-GRAM GEN LDA PROC
10+ BILLIONS OF CLASSIFICATIONS
PER SECOND
MISSION
EDITOR
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Automatic Cluster Discovery
P-GRAMS, LDA, CONVERGENCE
Mission Deep
Inspection Store
Summarizer
p-GRAM
Generator
Mission Stream
Concept
Extractor
LDA
Solver
Convergence
Monitor
p-GRAMS
Corpus
Summary
Corpus
Concept
Cloud
Labeled
Corpus
Clusters
Classification
Model
Library
LDA
Cluster Generation & Labeling
LDA
Cluster Refinement
DFA
Classifier Refinement
LDA
Classifier Generator
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Author Attribute Discovery
Neural Networks, Bayesian Models, DFAs
Ethnicity
Image Analyzer
Author Info
Analyzer
(LGM)
Real-time
Stream
Aggregator
Author
Geolocation
Analyzer
(LGM)
Author
Attribute
Processor
(LGM)
Real-time
Stream
Correlator
Massively Parallel RT
Classification Engine
AKU
DA
Broad
caster
AuthorAtributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
AuthorAttributeDetector
(LGM)
AuthorAtributeDetector
AuthorAtributeDetector
AuthorAtributeDetector
Unstructured
Data Source
A
Unstructured
Data Source
B
Unstructured
Data Source
C
Normalization
Age
Image Analyzer
Gender
Image Analyzer
Labeled
Image
Generator
Neural Network
Trainer
Author Bayesian
Classification
Model Trainer
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Generalized Image Classification
Neural Networks, Bayesian Models, DFAs
Ethnicity
Image Analyzer
Age
Image Analyzer
Gender
Image Analyzer
Labeled
Image
Generator
Neural Network
Trainer
Image
Data
Sources
Image
Harvester
Logo
Identification
Face
Detector Glasses
Image Analyzer
Weight
Image Analyzer
Hair-style
Image Analyzer
Shape
Identification
Emotion
Image Analyzer
Image
Label
Classifier
Image DB
AKUDA LABS PROPRIETARY AND CONFIDENTIAL
Pipeline Editor
Automatic LDA Models, User-specified DFAs
RT
Content
Classification
(DFA/LDA/VEC)
Optimizing Parallelizing
Compiler
PIPELINE EDITOR
Filtering, Analysis And Action Network
LDA
Classifier
Vector
String
CMP
Vector
INT/
FP
CMP
DFA
Counter
Tap
Action
Block
DFA
Counter
Tap
Counter
Tap
DFA
Action
Block
Outp
utInou
t
LDA
Classifier
Vector
String
CMP
Vector
INT/FP
CMP
DFA Action
Block
Counter
Tap
Model Library
Airlines
Auto
Auto Insurance
Cable
Beverages
Fast Food
Finance
Housing
Legal
Pharma/Health
Most Used Detectors
Tech
Advertisement
Inquiry
Customer Service
Irate Customers
Thankful Customers
Consumers
STATE
MANAGEMENT
P-GRAM GEN
Indexer
LDA PROC

Mais conteúdo relacionado

Mais procurados

HadoopCon2015 Multi-Cluster Live Synchronization with Kerberos Federated Hadoop
HadoopCon2015 Multi-Cluster Live Synchronization with Kerberos Federated HadoopHadoopCon2015 Multi-Cluster Live Synchronization with Kerberos Federated Hadoop
HadoopCon2015 Multi-Cluster Live Synchronization with Kerberos Federated HadoopYafang Chang
 
HadoopCon- Trend Micro SPN Hadoop Overview
HadoopCon- Trend Micro SPN Hadoop OverviewHadoopCon- Trend Micro SPN Hadoop Overview
HadoopCon- Trend Micro SPN Hadoop OverviewYafang Chang
 
Apache Spark Best Practices Meetup Talk
Apache Spark Best Practices Meetup TalkApache Spark Best Practices Meetup Talk
Apache Spark Best Practices Meetup TalkEren Avşaroğulları
 
sudoers: Benchmarking Hadoop with ALOJA
sudoers: Benchmarking Hadoop with ALOJAsudoers: Benchmarking Hadoop with ALOJA
sudoers: Benchmarking Hadoop with ALOJANicolas Poggi
 
Keep your Hadoop cluster at its best!
Keep your Hadoop cluster at its best!Keep your Hadoop cluster at its best!
Keep your Hadoop cluster at its best!Sheetal Dolas
 
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the CloudThe state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the CloudNicolas Poggi
 
Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!DataWorks Summit
 
Apache Kafka Women Who Code Meetup
Apache Kafka Women Who Code MeetupApache Kafka Women Who Code Meetup
Apache Kafka Women Who Code MeetupSnehal Nagmote
 
Deep Learning with Spark and GPUs
Deep Learning with Spark and GPUsDeep Learning with Spark and GPUs
Deep Learning with Spark and GPUsDataWorks Summit
 
Elastify Cloud-Native Spark Application with Persistent Memory
Elastify Cloud-Native Spark Application with Persistent MemoryElastify Cloud-Native Spark Application with Persistent Memory
Elastify Cloud-Native Spark Application with Persistent MemoryDatabricks
 
Unleashing Data Intelligence with Intel and Apache Spark with Michael Greene
Unleashing Data Intelligence with Intel and Apache Spark with Michael GreeneUnleashing Data Intelligence with Intel and Apache Spark with Michael Greene
Unleashing Data Intelligence with Intel and Apache Spark with Michael GreeneDatabricks
 
Ceph Performance Profiling and Reporting
Ceph Performance Profiling and ReportingCeph Performance Profiling and Reporting
Ceph Performance Profiling and ReportingCeph Community
 
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...Databricks
 
Track A-2 基於 Spark 的數據分析
Track A-2 基於 Spark 的數據分析Track A-2 基於 Spark 的數據分析
Track A-2 基於 Spark 的數據分析Etu Solution
 
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and PluginsMonitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and PluginsDatabricks
 
Akka 2.4 plus new commercial features in Typesafe Reactive Platform
Akka 2.4 plus new commercial features in Typesafe Reactive PlatformAkka 2.4 plus new commercial features in Typesafe Reactive Platform
Akka 2.4 plus new commercial features in Typesafe Reactive PlatformLegacy Typesafe (now Lightbend)
 

Mais procurados (20)

HadoopCon2015 Multi-Cluster Live Synchronization with Kerberos Federated Hadoop
HadoopCon2015 Multi-Cluster Live Synchronization with Kerberos Federated HadoopHadoopCon2015 Multi-Cluster Live Synchronization with Kerberos Federated Hadoop
HadoopCon2015 Multi-Cluster Live Synchronization with Kerberos Federated Hadoop
 
HadoopCon- Trend Micro SPN Hadoop Overview
HadoopCon- Trend Micro SPN Hadoop OverviewHadoopCon- Trend Micro SPN Hadoop Overview
HadoopCon- Trend Micro SPN Hadoop Overview
 
Apache Spark Best Practices Meetup Talk
Apache Spark Best Practices Meetup TalkApache Spark Best Practices Meetup Talk
Apache Spark Best Practices Meetup Talk
 
sudoers: Benchmarking Hadoop with ALOJA
sudoers: Benchmarking Hadoop with ALOJAsudoers: Benchmarking Hadoop with ALOJA
sudoers: Benchmarking Hadoop with ALOJA
 
Keep your Hadoop cluster at its best!
Keep your Hadoop cluster at its best!Keep your Hadoop cluster at its best!
Keep your Hadoop cluster at its best!
 
HDFS Selective Wire Encryption
HDFS Selective Wire EncryptionHDFS Selective Wire Encryption
HDFS Selective Wire Encryption
 
The state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the CloudThe state of SQL-on-Hadoop in the Cloud
The state of SQL-on-Hadoop in the Cloud
 
Have your cake and eat it too
Have your cake and eat it tooHave your cake and eat it too
Have your cake and eat it too
 
Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!Hadoop Hardware @Twitter: Size does matter!
Hadoop Hardware @Twitter: Size does matter!
 
Apache Kafka Women Who Code Meetup
Apache Kafka Women Who Code MeetupApache Kafka Women Who Code Meetup
Apache Kafka Women Who Code Meetup
 
Deep Learning with Spark and GPUs
Deep Learning with Spark and GPUsDeep Learning with Spark and GPUs
Deep Learning with Spark and GPUs
 
Elastify Cloud-Native Spark Application with Persistent Memory
Elastify Cloud-Native Spark Application with Persistent MemoryElastify Cloud-Native Spark Application with Persistent Memory
Elastify Cloud-Native Spark Application with Persistent Memory
 
Unleashing Data Intelligence with Intel and Apache Spark with Michael Greene
Unleashing Data Intelligence with Intel and Apache Spark with Michael GreeneUnleashing Data Intelligence with Intel and Apache Spark with Michael Greene
Unleashing Data Intelligence with Intel and Apache Spark with Michael Greene
 
Enterprise Grade Streaming under 2ms on Hadoop
Enterprise Grade Streaming under 2ms on HadoopEnterprise Grade Streaming under 2ms on Hadoop
Enterprise Grade Streaming under 2ms on Hadoop
 
Ceph Performance Profiling and Reporting
Ceph Performance Profiling and ReportingCeph Performance Profiling and Reporting
Ceph Performance Profiling and Reporting
 
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...
Serverless Machine Learning on Modern Hardware Using Apache Spark with Patric...
 
Spark+flume seattle
Spark+flume seattleSpark+flume seattle
Spark+flume seattle
 
Track A-2 基於 Spark 的數據分析
Track A-2 基於 Spark 的數據分析Track A-2 基於 Spark 的數據分析
Track A-2 基於 Spark 的數據分析
 
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and PluginsMonitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins
 
Akka 2.4 plus new commercial features in Typesafe Reactive Platform
Akka 2.4 plus new commercial features in Typesafe Reactive PlatformAkka 2.4 plus new commercial features in Typesafe Reactive Platform
Akka 2.4 plus new commercial features in Typesafe Reactive Platform
 

Semelhante a AKUDA Labs: Pulsar

What is Apache Kafka and What is an Event Streaming Platform?
What is Apache Kafka and What is an Event Streaming Platform?What is Apache Kafka and What is an Event Streaming Platform?
What is Apache Kafka and What is an Event Streaming Platform?confluent
 
AKKA and Scala @ Inneractive
AKKA and Scala @ InneractiveAKKA and Scala @ Inneractive
AKKA and Scala @ InneractiveGal Aviv
 
Westpac Bank Tech Talk 1: Dive into Apache Kafka
Westpac Bank Tech Talk 1: Dive into Apache KafkaWestpac Bank Tech Talk 1: Dive into Apache Kafka
Westpac Bank Tech Talk 1: Dive into Apache Kafkaconfluent
 
Running Presto and Spark on the Netflix Big Data Platform
Running Presto and Spark on the Netflix Big Data PlatformRunning Presto and Spark on the Netflix Big Data Platform
Running Presto and Spark on the Netflix Big Data PlatformEva Tse
 
(BDT303) Running Spark and Presto on the Netflix Big Data Platform
(BDT303) Running Spark and Presto on the Netflix Big Data Platform(BDT303) Running Spark and Presto on the Netflix Big Data Platform
(BDT303) Running Spark and Presto on the Netflix Big Data PlatformAmazon Web Services
 
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...HostedbyConfluent
 
Web Scale Reasoning and the LarKC Project
Web Scale Reasoning and the LarKC ProjectWeb Scale Reasoning and the LarKC Project
Web Scale Reasoning and the LarKC ProjectSaltlux Inc.
 
(BDT318) How Netflix Handles Up To 8 Million Events Per Second
(BDT318) How Netflix Handles Up To 8 Million Events Per Second(BDT318) How Netflix Handles Up To 8 Million Events Per Second
(BDT318) How Netflix Handles Up To 8 Million Events Per SecondAmazon Web Services
 
Delta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Delta Lake OSS: Create reliable and performant Data Lake by Quentin AmbardDelta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Delta Lake OSS: Create reliable and performant Data Lake by Quentin AmbardParis Data Engineers !
 
The Developer Data Scientist – Creating New Analytics Driven Applications usi...
The Developer Data Scientist – Creating New Analytics Driven Applications usi...The Developer Data Scientist – Creating New Analytics Driven Applications usi...
The Developer Data Scientist – Creating New Analytics Driven Applications usi...Microsoft Tech Community
 
Betting On Data Grids
Betting On Data GridsBetting On Data Grids
Betting On Data Gridsgojkoadzic
 
Spark and Spark Streaming at Netfix-(Kedar Sedekar and Monal Daxini, Netflix)
Spark and Spark Streaming at Netfix-(Kedar Sedekar and Monal Daxini, Netflix)Spark and Spark Streaming at Netfix-(Kedar Sedekar and Monal Daxini, Netflix)
Spark and Spark Streaming at Netfix-(Kedar Sedekar and Monal Daxini, Netflix)Spark Summit
 
Concepts and Patterns for Streaming Services with Kafka
Concepts and Patterns for Streaming Services with KafkaConcepts and Patterns for Streaming Services with Kafka
Concepts and Patterns for Streaming Services with KafkaQAware GmbH
 
Real world Scala hAkking NLJUG JFall 2011
Real world Scala hAkking NLJUG JFall 2011Real world Scala hAkking NLJUG JFall 2011
Real world Scala hAkking NLJUG JFall 2011Raymond Roestenburg
 
Iron.io Technical Overview
Iron.io Technical OverviewIron.io Technical Overview
Iron.io Technical OverviewChad Arimura
 
Squeak DBX
Squeak DBXSqueak DBX
Squeak DBXESUG
 
MongoDB World 2018: Enterprise Security in the Cloud
MongoDB World 2018: Enterprise Security in the CloudMongoDB World 2018: Enterprise Security in the Cloud
MongoDB World 2018: Enterprise Security in the CloudMongoDB
 
MongoDB World 2018: Enterprise Cloud Security
MongoDB World 2018: Enterprise Cloud SecurityMongoDB World 2018: Enterprise Cloud Security
MongoDB World 2018: Enterprise Cloud SecurityMongoDB
 

Semelhante a AKUDA Labs: Pulsar (20)

What is Apache Kafka and What is an Event Streaming Platform?
What is Apache Kafka and What is an Event Streaming Platform?What is Apache Kafka and What is an Event Streaming Platform?
What is Apache Kafka and What is an Event Streaming Platform?
 
AKKA and Scala @ Inneractive
AKKA and Scala @ InneractiveAKKA and Scala @ Inneractive
AKKA and Scala @ Inneractive
 
Spark streaming + kafka 0.10
Spark streaming + kafka 0.10Spark streaming + kafka 0.10
Spark streaming + kafka 0.10
 
Westpac Bank Tech Talk 1: Dive into Apache Kafka
Westpac Bank Tech Talk 1: Dive into Apache KafkaWestpac Bank Tech Talk 1: Dive into Apache Kafka
Westpac Bank Tech Talk 1: Dive into Apache Kafka
 
Running Presto and Spark on the Netflix Big Data Platform
Running Presto and Spark on the Netflix Big Data PlatformRunning Presto and Spark on the Netflix Big Data Platform
Running Presto and Spark on the Netflix Big Data Platform
 
(BDT303) Running Spark and Presto on the Netflix Big Data Platform
(BDT303) Running Spark and Presto on the Netflix Big Data Platform(BDT303) Running Spark and Presto on the Netflix Big Data Platform
(BDT303) Running Spark and Presto on the Netflix Big Data Platform
 
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
Applying ML on your Data in Motion with AWS and Confluent | Joseph Morais, Co...
 
Web Scale Reasoning and the LarKC Project
Web Scale Reasoning and the LarKC ProjectWeb Scale Reasoning and the LarKC Project
Web Scale Reasoning and the LarKC Project
 
(BDT318) How Netflix Handles Up To 8 Million Events Per Second
(BDT318) How Netflix Handles Up To 8 Million Events Per Second(BDT318) How Netflix Handles Up To 8 Million Events Per Second
(BDT318) How Netflix Handles Up To 8 Million Events Per Second
 
Delta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Delta Lake OSS: Create reliable and performant Data Lake by Quentin AmbardDelta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
Delta Lake OSS: Create reliable and performant Data Lake by Quentin Ambard
 
The Developer Data Scientist – Creating New Analytics Driven Applications usi...
The Developer Data Scientist – Creating New Analytics Driven Applications usi...The Developer Data Scientist – Creating New Analytics Driven Applications usi...
The Developer Data Scientist – Creating New Analytics Driven Applications usi...
 
Betting On Data Grids
Betting On Data GridsBetting On Data Grids
Betting On Data Grids
 
Spark and Spark Streaming at Netfix-(Kedar Sedekar and Monal Daxini, Netflix)
Spark and Spark Streaming at Netfix-(Kedar Sedekar and Monal Daxini, Netflix)Spark and Spark Streaming at Netfix-(Kedar Sedekar and Monal Daxini, Netflix)
Spark and Spark Streaming at Netfix-(Kedar Sedekar and Monal Daxini, Netflix)
 
Concepts and Patterns for Streaming Services with Kafka
Concepts and Patterns for Streaming Services with KafkaConcepts and Patterns for Streaming Services with Kafka
Concepts and Patterns for Streaming Services with Kafka
 
Real world Scala hAkking NLJUG JFall 2011
Real world Scala hAkking NLJUG JFall 2011Real world Scala hAkking NLJUG JFall 2011
Real world Scala hAkking NLJUG JFall 2011
 
Iron.io Technical Overview
Iron.io Technical OverviewIron.io Technical Overview
Iron.io Technical Overview
 
Squeak DBX
Squeak DBXSqueak DBX
Squeak DBX
 
MongoDB World 2018: Enterprise Security in the Cloud
MongoDB World 2018: Enterprise Security in the CloudMongoDB World 2018: Enterprise Security in the Cloud
MongoDB World 2018: Enterprise Security in the Cloud
 
MongoDB World 2018: Enterprise Cloud Security
MongoDB World 2018: Enterprise Cloud SecurityMongoDB World 2018: Enterprise Cloud Security
MongoDB World 2018: Enterprise Cloud Security
 
Aplicaciones distribuidas con Dapr
Aplicaciones distribuidas con DaprAplicaciones distribuidas con Dapr
Aplicaciones distribuidas con Dapr
 

Último

Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制vexqp
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...Bertram Ludäscher
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabiaahmedjiabur940
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...nirzagarg
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...gajnagarg
 
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
SR-101-01012024-EN.docx  Federal Constitution  of the Swiss ConfederationSR-101-01012024-EN.docx  Federal Constitution  of the Swiss Confederation
SR-101-01012024-EN.docx Federal Constitution of the Swiss ConfederationEfruzAsilolu
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...nirzagarg
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangeThinkInnovation
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........EfruzAsilolu
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATIONCapstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATIONLakpaYanziSherpa
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubaikojalkojal131
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...gajnagarg
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 

Último (20)

Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
怎样办理伦敦大学毕业证(UoL毕业证书)成绩单学校原版复制
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
SR-101-01012024-EN.docx  Federal Constitution  of the Swiss ConfederationSR-101-01012024-EN.docx  Federal Constitution  of the Swiss Confederation
SR-101-01012024-EN.docx Federal Constitution of the Swiss Confederation
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATIONCapstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
 
Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 

AKUDA Labs: Pulsar

  • 1. AKUDA LABS PROPRIETARY AND CONFIDENTIAL
  • 2. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Our Superpower 100000# 10000# 250# 0# 10000# 20000# 30000# 40000# 50000# 60000# 70000# 80000# 90000# 100000# Bananas# Spark#Streaming# (latency#<#3s)# Spark#Streaming# (op@mized#for#latency)# Throughput)(packets/s)) 10x Throughput! 24,000x Lower Latency! 400x Throughput ! when Spark Streaming optimized for latency!
  • 3. AKUDA LABS PROPRIETARY AND CONFIDENTIAL The Benchmark: Pattern Detection in Unstructured Streaming Text Data Spark Streaming Setup# Bananas Setup# Text Stream Generator! Throughput Regulator# Throughput Regulator#
  • 4. AKUDA LABS PROPRIETARY AND CONFIDENTIAL The Benchmark: Platform and Setup •  Dell 815 Servers! ! •  48 Text Classification Pipelines! ! •  10 Gbit Connection! ! Spark Streaming Configurations:! •  Receiver-Based Model! •  12 Kafka Topic Partitions! •  Block Size: 200 ms! •  Batch Size: 1.5 – 20 s!
  • 5. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Why does it matter? •  Reliability – May add 2 Nines! •  Hardware Cost – Potentially 100x Less Cost In Hardware! •  Energy – Potentially 100x Less Energy ! •  Data Center Footprint – Potentially 100x Less Racks ! •  Manageability – 10 machines versus 1000 machines!! •  Network BW – Potentially 100x less network BW! •  Total Cost of Ownership – Potentially < 1000x !!!! •  Greater Peace of Mind!
  • 6. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Who will pay for Real-Time Solutions? Real-Time: Expected Latency < 1ms •  Online Marketers! –  Process over 100k events per second for thousands of social media websites! –  Expected revenue > $2.1 Trillion! •  IoT Businesses! –  Process thousands of events per second from millions of connected devices! –  Expected revenue > $100 Billion! •  Spam and Fraud Detection! –  Detect multiple complex patterns in millions of transactions and documents per second! –  Expected revenue > $40 Billion!
  • 7. AKUDA LABS PROPRIETARY AND CONFIDENTIAL The Akuda Quest •  To enable truly-real time classification of extremely high rate data streams ! •  To enable subject matter experts who possess extensive knowledge of the domain the data belongs to, and who are often non- programmers, to directly create classifiers! •  To enable the fast development and refinement of data classifiers!
  • 8. AKUDA LABS PROPRIETARY AND CONFIDENTIAL The Real-Time Classification Challenge Latency < 1ms ocuments/second ytes/packet Ultra Fast Classification & Correlation 0.001 seconds (max latency) 1,000,000 Distinct Possible Events/Trigger/Results K1 K5 K4 K3 K2 K6 K7 K8 K9 K10 Actionable Information Previous Knowledge Previous Knowledge Previous Knowledge 100 ! events/s # 10,000 ! Devices# 1,000,000 ! packets/s # 10,000! Classifiers# 10 Billion Classification Operations/s#
  • 9. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Pulsar Analyst Workbench Quick, Intuitive Classifier Development Sandbox
  • 10. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Quick Model Optimization Specialized Compiler, Data Analysis Tools RESOLVED FILTERING NETWORK Optimizing Parallelizing Compiler Cycle Detection Reordering DFA Pruning Platform Targeting TARGET PLATFORM TOPOLOGY Execution Engine
  • 11. AKUDA LABS PROPRIETARY AND CONFIDENTIAL AKUDA Technology Delivery •  SaaS turn-key solution, with a model development system that allows for deployment of complete solutions in hours, without any coding requirements.! •  Privately deployable enterprise solution on a Cloud Infrastructure. ! •  Software Development Infrastructure for developing highly specific and targeted solutions.!
  • 12. AKUDA LABS PROPRIETARY AND CONFIDENTIAL The SaaS Platform: Pulsar High Level View INBOUND DATA HUB DATA AUGMENTATION & CORRELATION CLASSIFICATION INDEXING CLUSTER ANALYSIS OUTBOUND DATA HUB
  • 13. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Pulsar System View Optimizing Parallelizing Compiler for Classification, Analysis and Action Network LDA Cluster Generator LDA Cluster Refinement Massively Parallel RT Classification Engine Social Media Data Sources Universal Store Social Media Harvester General Data Integration Hub Data Source Akuda Agent Universal Searchable Index Data Source Direct Feed Author [G,A,E] Image Analyzer (LGM) Author Info Analyzer (LGM) General Data Sources Real-time Stream Aggregator RT Classification Pipeline Author Geolocation Analyzer (LGM) Image Data Sources Image Harvester Author Attribute Processor (LGM) Real-time Stream Correlator Author Attribute Store Image Universal Searchable Index Image Store Massively Parallel RT Classification Engine AKUDA Broadcaster RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline AuthorAtributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector (LGM) AuthorAtributeDetector AuthorAtributeDetector AuthorAtributeDetector LDA Feature Generator (Proximity NGRAMS) MISSION EDITOR DFA Ta p DFA Ta p Ta p DFA DFA Classifier Refinement Pipeline Deep Inspection Store Metrics And Alarms RT Stream Indexer Delivery Integration Hub Target Systems Dashboard Editor Visualization RT DASHBOARD [Corona] PIPELINE STUDIO [Pulsar] DEEP INSPECTION Query UI AUTHOR ATTRIBUTE Query UI UNIVERSAL STREAM Query UI LDA Classifier Generator
  • 14. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Pulsar Inbound Data Hub Optimizing Parallelizing Compiler for Classification, Analysis and Action Network LDA Cluster Generator LDA Cluster Refinement Massively Parallel RT Classification Engine Social Media Data Sources Universal Store Social Media Harvester General Data Integration Hub Data Source Akuda Agent Universal Searchable Index Data Source Direct Feed Author [G,A,E] Image Analyzer (LGM) Author Info Analyzer (LGM) General Data Sources Real-time Stream Aggregator RT Classification Pipeline Author Geolocation Analyzer (LGM) Image Data Sources Image Harvester Author Attribute Processor (LGM) Real-time Stream Correlator Author Attribute Store Image Universal Searchable Index Image Store Massively Parallel RT Classification Engine AKUDA Broadcaster RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline AuthorAtributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector (LGM) AuthorAtributeDetector AuthorAtributeDetector AuthorAtributeDetector LDA Feature Generator (Proximity NGRAMS) MISSION EDITOR DFA Ta p DFA Ta p Ta p DFA DFA Classifier Refinement Pipeline Deep Inspection Store Metrics And Alarms RT Stream Indexer Delivery Integration Hub Target Systems Dashboard Editor Visualization RT DASHBOARD [Corona] PIPELINE STUDIO [Pulsar] DEEP INSPECTION Query UI AUTHOR ATTRIBUTE Query UI UNIVERSAL STREAM Query UI LDA Classifier Generator
  • 15. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Pulsar LGM: Data Augmentation and Correlation Optimizing Parallelizing Compiler for Classification, Analysis and Action Network LDA Cluster Generator LDA Cluster Refinement Massively Parallel RT Classification Engine Social Media Data Sources Universal Store Social Media Harvester General Data Integration Hub Data Source Akuda Agent Universal Searchable Index Data Source Direct Feed Author [G,A,E] Image Analyzer (LGM) Author Info Analyzer (LGM) General Data Sources Real-time Stream Aggregator RT Classification Pipeline Author Geolocation Analyzer (LGM) Image Data Sources Image Harvester Author Attribute Processor (LGM) Real-time Stream Correlator Author Attribute Store Image Universal Searchable Index Image Store Massively Parallel RT Classification Engine AKUDA Broadcaster RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline AuthorAtributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector (LGM) AuthorAtributeDetector AuthorAtributeDetector AuthorAtributeDetector LDA Feature Generator (Proximity NGRAMS) MISSION EDITOR DFA Ta p DFA Ta p Ta p DFA DFA Classifier Refinement Pipeline Deep Inspection Store Metrics And Alarms RT Stream Indexer Delivery Integration Hub Target Systems Dashboard Editor Visualization RT DASHBOARD [Corona] PIPELINE STUDIO [Pulsar] DEEP INSPECTION Query UI AUTHOR ATTRIBUTE Query UI UNIVERSAL STREAM Query UI LDA Classifier Generator
  • 16. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Pulsar Bananas: Data Classification Optimizing Parallelizing Compiler for Classification, Analysis and Action Network LDA Cluster Generator LDA Cluster Refinement Massively Parallel RT Classification Engine Social Media Data Sources Universal Store Social Media Harvester General Data Integration Hub Data Source Akuda Agent Universal Searchable Index Data Source Direct Feed Author [G,A,E] Image Analyzer (LGM) Author Info Analyzer (LGM) General Data Sources Real-time Stream Aggregator RT Classification Pipeline Author Geolocation Analyzer (LGM) Image Data Sources Image Harvester Author Attribute Processor (LGM) Real-time Stream Correlator Author Attribute Store Image Universal Searchable Index Image Store Massively Parallel RT Classification Engine AKUDA Broadcaster RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline AuthorAtributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector (LGM) AuthorAtributeDetector AuthorAtributeDetector AuthorAtributeDetector LDA Feature Generator (Proximity NGRAMS) MISSION EDITOR DFA Ta p DFA Ta p Ta p DFA DFA Classifier Refinement Pipeline Deep Inspection Store Metrics And Alarms RT Stream Indexer Delivery Integration Hub Target Systems Dashboard Editor Visualization RT DASHBOARD [Corona] PIPELINE STUDIO [Pulsar] DEEP INSPECTION Query UI AUTHOR ATTRIBUTE Query UI UNIVERSAL STREAM Query UI LDA Classifier Generator
  • 17. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Pulsar Corona: Cluster Analysis Optimizing Parallelizing Compiler for Classification, Analysis and Action Network LDA Cluster Generator LDA Cluster Refinement Massively Parallel RT Classification Engine Social Media Data Sources Universal Store Social Media Harvester General Data Integration Hub Data Source Akuda Agent Universal Searchable Index Data Source Direct Feed Author [G,A,E] Image Analyzer (LGM) Author Info Analyzer (LGM) General Data Sources Real-time Stream Aggregator RT Classification Pipeline Author Geolocation Analyzer (LGM) Image Data Sources Image Harvester Author Attribute Processor (LGM) Real-time Stream Correlator Author Attribute Store Image Universal Searchable Index Image Store Massively Parallel RT Classification Engine AKUDA Broadcaster RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline AuthorAtributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector (LGM) AuthorAtributeDetector AuthorAtributeDetector AuthorAtributeDetector LDA Feature Generator (Proximity NGRAMS) MISSION EDITOR DFA Ta p DFA Ta p Ta p DFA DFA Classifier Refinement Pipeline Deep Inspection Store Metrics And Alarms RT Stream Indexer Delivery Integration Hub Target Systems Dashboard Editor Visualization RT DASHBOARD [Corona] PIPELINE STUDIO [Pulsar] DEEP INSPECTION Query UI AUTHOR ATTRIBUTE Query UI UNIVERSAL STREAM Query UI LDA Classifier Generator
  • 18. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Pulsar Outbound Data Hub Optimizing Parallelizing Compiler for Classification, Analysis and Action Network LDA Cluster Generator LDA Cluster Refinement Massively Parallel RT Classification Engine Social Media Data Sources Universal Store Social Media Harvester General Data Integration Hub Data Source Akuda Agent Universal Searchable Index Data Source Direct Feed Author [G,A,E] Image Analyzer (LGM) Author Info Analyzer (LGM) General Data Sources Real-time Stream Aggregator RT Classification Pipeline Author Geolocation Analyzer (LGM) Image Data Sources Image Harvester Author Attribute Processor (LGM) Real-time Stream Correlator Author Attribute Store Image Universal Searchable Index Image Store Massively Parallel RT Classification Engine AKUDA Broadcaster RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline RT Classification Pipeline AuthorAtributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector (LGM) AuthorAtributeDetector AuthorAtributeDetector AuthorAtributeDetector LDA Feature Generator (Proximity NGRAMS) MISSION EDITOR DFA Ta p DFA Ta p Ta p DFA DFA Classifier Refinement Pipeline Deep Inspection Store Metrics And Alarms RT Stream Indexer Delivery Integration Hub Target Systems Dashboard Editor Visualization RT DASHBOARD [Corona] PIPELINE STUDIO [Pulsar] DEEP INSPECTION Query UI AUTHOR ATTRIBUTE Query UI UNIVERSAL STREAM Query UI LDA Classifier Generator
  • 19. AKUDA LABS PROPRIETARY AND CONFIDENTIAL THE AKUDA CORE! MASSIVELY PARALLEL STREAMING CLASSIFICATION INFRASTRUCTURE!
  • 20. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Possible Solution 1 NOT THIS - GTS: Scalability & Latency Problems Feed BC Rx Rx Rx Rx Indexer Broadcaster GTS Indexing System Query With Frequency 2 q/s Indexer Indexer Indexer Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Query With Frequency 2 q/s Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Analytics Visualization Index Storage
  • 21. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Possible Solution 2 NOT THIS - HADOOP: Latency Problems Feed BC Brodcaster HADOOP Broadcaster#
  • 22. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Possible Solution 3 Not Quite There: Spark Streaming Pipeline of RDDs Source 1,000,000 documents/second 1,024 bytes/packet MicroBatcher 1,000,000 Sequential Stages Doc 01 Doc 02 Doc 03 Doc 04 Doc 05 Doc 06 Doc 07 Doc 08 Doc 09 Doc 10 Doc 11 Doc 12 Doc 13 Doc 14 Doc 15 Doc 16 Latency of minutes, hours?? Network Transfers and/or Data Copying Across Host Nodes or Pipeline Stages
  • 23. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Possible Solution 4 Almost There: Data Flow Pipelines, Data Replication Source 1,000,000 documents/second 1,000 bytes/packet Broadcaster Bisection Bandwidth 1,000,000,000,000,000 bytes/second ~10,000,000 GBits/second ~10,000 TBits/second !!! Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 Doc 01 1,000,000 Stages Running Simultaneously Low Lat Broadcasting Doc Replicas becomes extreme bottleneck PCIe 3.0 lane BW: ~ 1GByte/second 10Gbps Ethernet: ~ 1GB/second Infiniband: Mellanox 56Gb/s FDR IB: 6.8GB/s Cisco Catalyst 2960G-49TC-L Switching Fabric: 40mpps. At 1000 bytes/ packet: 40,000 MBytes/second ==> 40 GBytes/second Intel-Xeon-Processor-E7-8890 (15 cores) Max Mem BW:85GBytes/second
  • 24. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Possible Solution 5 Cost & Latency Issues: Data Broadcasting Tree Feed BC Rx Rx Rx Rx Broadcaster Rx Rx Rx Rx Rx Rx Rx Rx Rx Rx Rx Rx 1 10 10 10 R x R x R x Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Rx Rx Rx Rx Rx Rx Rx Rx Rx R x R x R x Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Indexi ng / Analyti cs Visuali zation Rx Rx Rx Rx Rx Rx 10 10 10 1 + 10 x 10 x 10 x 10 x 10 x 10 = 1,000,001 Nodes Worst Case Cost = 1,000,001 * $1000/month: ~ $ 1 Billion / Month !!!! Latency Goes back to hours or days!
  • 25. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Honey, I Shrunk the Trees! AKUDA Core Topology Indexing / Analytics Feed Rx Tx Visualization Broadcaster Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Model Pipelines Tx Visualization 1,000,000 documents/second 1,000 bytes/packet 100,000 Short Pipelines * 10 Stages each= 1,000,000 Stages Akuda Queue Technology using on-chip inter-core networks Akuda Buffer Technology using on-chip inter-core networks Akuda Correlator Technology using on-chip inter-core networks 0.001 seconds typical latency
  • 26. AKUDA LABS PROPRIETARY AND CONFIDENTIAL The Solution Utilize Inter-Core Communication Channel Data Communication Hardware! Typical Bandwidth! Typical Cost! 10 Gbps Ethernet! 1 GB/s! $ 1,000! PCIe 3.0 Lane! 1 GB/s! $10,000! Infiniband, Mellanox 56Gb/s FDR IB! 6.8 GB/s! $1,000,000! Cisco Catalyst Switching Fabric! 40 GB/s! $10,000,000! Inter-core/Inter-processor Fabric Bisection Bandwidth! 1000 GB/s ! (for IA64 Chips)! $500!
  • 27. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Data Broadcasting Use The Best Broadcasting Network 340GB/s > 1000GB/s
  • 28. AKUDA LABS PROPRIETARY AND CONFIDENTIAL The Solution AKUDA Core Differentiating Factors Lockfree Queue, Pipeline Control! Lockfree Correlator! Lockfree Multithreaded Processing! Feed BC Broadcaster Indexing / AnalyticsRx Tx Visualization 1 10 1000 Akuda Core Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Akuda Core Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Akuda Core Akuda Core Akuda Core Zero-replication Data Broadcasting! On-chip- network Communication Control!
  • 29. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Adaptive Topology Continuous Optimization of Data Comm & Pipeline Execution
  • 30. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Akuda Core Scalability Bisection Bandwidth 10 20 30 40 50 60 70 80 80 70 60 50 40 30 20 10 BBW Processors Akuda Lock-free Algorithms Standard Algorithms Processing Latency 10 20 30 40 50 60 70 80 80 70 60 50 40 30 20 10 Time Processors Akuda Lock-free Algorithms Standard Algorithms Processing Cost 200 400 600 800 1000 1200 1400 1600 800 700 600 500 400 300 200 100 1000 $ / Month MILLION [Stream Rate * Pipelines * Patterns] Akuda Lock-free Algorithms Standard Algorithms Parallelization Speedup 10 20 30 40 50 60 70 80 80 70 60 50 40 30 20 10 Speedup Processors Akuda Lock-free Algorithms Standard Algorithms
  • 31. AKUDA LABS PROPRIETARY AND CONFIDENTIAL AKUDA Core in Action Election2016.io: Real-Time Online Polls “The problem is that when polls are wrong, they tend to be wrong in the same direction. If they miss in New Hampshire, for instance, they all miss on the same mistake.” -- Nate Silver!
  • 32. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Akuda Core in Action Election2016.io Backend Feed Indexing / Analytics Rx Model Pipelines Tx Visualization 50,000 documents/second (peak) 1,000 bytes/document 3000 Models (Author Classification + App Classification) Akuda Broadcasting Technology using on-chip inter-core fabric Akuda Buffer Technology using on-chip inter-core fabric Akuda Correlator Technology Sub-second Latency 150 GigaBytes Bisection Bandwidth (Over 1 TERAbit/second) Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Indexing / Analytics Rx Tx Visualization Learned People Attributes Akuda Classification Technology Correlator Akuda Data Analysis Technology 100 Patterns/Model 15 BILLION Patterns/second
  • 33. AKUDA LABS PROPRIETARY AND CONFIDENTIAL STANDALONE#USE#OF# BANANAS##
  • 34. AKUDA LABS PROPRIETARY AND CONFIDENTIAL General Statistical Classification K-MEANS, LDA, NN Feed Rx Tx Broadcaster Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx k-means Model SubMatrix Tx 1,000,000 documents/second 1,000 bytes/packet 10,000 Nodes * 100 k-means Centroid Vectors Akuda Queue Technology using on-chip inter-core networks Akuda Buffer Technology using on-chip inter-core networks 0.001 seconds typical latency Aggregator Akuda Queue Technology using on-chip inter-core networks k-means cluster label for data item Akuda Lockless Matrix Ops Akuda Lockless Correlator
  • 35. AKUDA LABS PROPRIETARY AND CONFIDENTIAL IOT Classification POC K-MEANS Feed Rx Tx Broadcaster Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx Rx Tx 1,000,000 sensor-vectors/second 1,000 bytes/vector Classification Using DFA, K-MEANS, LDA, or NN Models Akuda Queue Technology using on-chip inter-core networks Akuda Buffer Technology using on-chip inter-core networks 0.001 seconds typical latency Aggregator Akuda Queue Technology using on-chip inter-core networks Sensor Warnings Akuda Lockless Matrix Ops Sensor State Classification Akuda Lockless Correlator
  • 36. AKUDA LABS PROPRIETARY AND CONFIDENTIAL IOT Classification POC K-MEANS LINEAR ALGEBRA ENGINE - 1 LINEAR ALGEBRA ENGINE - 2 LINEAR ALGEBRA ENGINE - N LINEAR ALGEBRA ENGINE - 100 DATA RECEIVER INPUT DATA CHANNEL L2 NORM CHANNEL AGGREGATOR LOCKLESS HASH UNSORTED CHANNEL MIN FINDER INPUT DATA STREAM OUTPUT DATA STREAM Packet ID Input Packet: D Packet ID Transformed Packet: D’ Packet ID Minimum Elements Vector Minimum Distance from Classifier: Pn Packet ID Classified Packet For, K = 100,000 (number of clusters)        N = 100 (number of processors)        P = 1000 (cardinality of feature set)        D : Input Vector to be classified        A : Model matrix representing trained values for classification centroids
  • 37. AKUDA LABS PROPRIETARY AND CONFIDENTIAL AKUDA#LABS#PATENTS# Pending#&#Provisional#
  • 38. AKUDA LABS PROPRIETARY AND CONFIDENTIAL PATENT LIST (1/3)# 1 HIERARCHICAL, PARALLEL MODELS FOR EXTRACTING IN REAL TIME HIGH-VALUE INFORMATION FROM DATA STREAMS AND SYSTEM AND METHOD FOR CREATION OF SAME 2 HIERARCHICAL, PARALLEL MODELS FOR EXTRACTING IN REAL-TIME HIGH-VALUE INFORMATION FROM DATA STREAMS AND SYSTEM AND METHOD FOR CREATION OF SAME 3 MASSIVELY-PARALLEL SYSTEM ARCHITECTURE AND METHOD FOR REAL-TIME EXTRACTION OF HIGH-VALUE INFORMATION FROM DATA STREAMS 4 OPTIMIZATION FOR REAL-TIME, PARALLEL EXECUTION OF MODELS FOR EXTRACTING HIGH-VALUE INFORMATION FROM DATA STREAMS 5 EXTRACTION OF HIGH VALUE INFORMATION FROM UNSTRUCTURED IMAGES IN MASSIVELY PARALLEL PROCESSING SYSTEM 6 REAL-TIME MASSIVELY PARALLEL PIPELINE PROCESSING SYSTEM 7 ADDITIONAL APPLICATIONS DIRECTED TO SPECIFIC ASPECTS/IMPROVEMENTS OF REAL-TIME MASSIVELY PARALLEL PIPELINE PROCESSING SYSTEM 8 AUTOMATIC TOPIC DISCOVERY IN STREAMS OF SOCIAL MEDIA POSTS 9 TOPIC AND TREND DISCOVERY WITHIN REAL-TIME ONLINE CONTENT STREAMS 10 SYSTEM AND METHOD FOR IMPLEMENTING ENTERPRISE RISK MODELS BASED ON INFORMATION POSTS 11 ADDITIONAL APPLICATIONS DIRECTED TO SPECIFIC MODELS OTHER THAN RISK MODELS 12 LAZY PARSER FOR INFERENCE IN UNSTRUCTURED DATA STREAMS 13 REALTIME DATA STREAM CLUSTER SUMMARIZATION AND LABELING SYSTEM 14 DATA BROADCASTING TECHNOLOGY FOR REAL TIME ANALYTICS FROM UNSTRUCTURED DATA 15 REAL-TIME STREAM CORRELATION WITH PRE-EXISTING KNOWLEDGE (STATE) 16 LOCKLESS KEY-VALUE STORE AND MEMORY CACHING SYSTEM 17 DYNAMIC RESOURCE ALLOCATOR FOR REAL-TIME PARALLEL PIPELINE PROCESSING SYSTEM
  • 39. AKUDA LABS PROPRIETARY AND CONFIDENTIAL PATENT LIST (2/3)# 18 REALTIME LOW LATENCY DATA STREAM DFA CLASSIFICATION ENGINE 19 PARALLEL PROCESSING ARCHITECTURE AND DATA BROADCASTING TECHNOLOGY FOR SOCIAL MEDIA AUTHOR CLASSIFICATION AND ANALYSIS STREAM 20 ATTRIBUTE VECTOR COMPRESSION FOR STREAM PROCESSING 21 REATIME IOT PARALLEL VECTOR CLASSIFICATION 22 REALTIME IMAGE HARVESTING AND STORAGE SYSTEM 23 DATA STREAM HISTORIC REPLAY VERSIONING (SKYLINE) 24 DATA STREAM HISTORIC REPLAY SYSTEM AND STORAGE 25 EXTRACTION OF AUTHOR(PEOPLE) ATTRIBUTES THROUGH COMPLEX DFA MODELS 26 REALTIME IMAGE HARVESTING AND STORAGE SYSTEM 27 NEURAL NETWORK-BASED SYSTEM FOR EXTRACTION OF DEMOGRAPHICS FROM SOCIAL MEDIA IMAGES 28 METHODFORSOCIALMEDIAEVENTDETECTIONANDCAUSEANALYSIS 29 METHOD FOR REAL-TIME TAGGING OF DATA STREAM DOCUMENTS 30 PEOPLE ATTRIBUTE QUERY AND VISUALIZATION TOOL 31 WORD SET VISUAL NORMALIZED WEIGHT DAMPENING 32 PARALLEL PROCESSING ARCHITECTURE AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME ANALYTICS FROM UNSTRUCTURED ELECTION DATA 33 PARALLEL PROCESSING ARCHITECTURE AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME ANALYTICS FROM UNSTRUCTURED RETAIL DATA
  • 40. AKUDA LABS PROPRIETARY AND CONFIDENTIAL PATENT LIST (3/3)# 34 SYSTEMS AND METHODS FOR ANALYZING UNSOLICITED PRODUCT/SERVICE CUSTOMER REVIEWS 35 SYSTEM FOR CREDIT/INSURANCE PROCESSING USING UNSTRUCTURED DATA 36 SYSTEM AND METHOD FOR CORRELATING SOCIAL MEDIA DATA AND COMPANY FINANCIAL DATA 37 SYSTEMS AND METHODS FOR IDENTIFYING AN ILLNESS AND COURSE OF TREATMENT FOR A PATIENT 38 SYSTEM AND METHOD FOR IDENTIFYING FACIAL EXPRESSIONS FROM SOCIAL MEDIA IMAGES 39 SYSTEM AND METHOD FOR DETECTING HEALTH MALADIES IN A PATIENT USING UNSTRUCTURED IMAGES 40 SYSTEM AND METHOD FOR DETECTING POLITICAL DESTABILIZATION AT A SPECIFIC GEOGRAPHIC LOCATION BASED ON SOCIAL MEDIA DATA 41 SYSTEM AND METHOD FOR IDENTIFYING CORRELATIONS BETWEEN SOCIAL MEDIA IMAGES USING NEURAL NETWORKS 42 SYSTEM AND METHOD FOR SCALABLE PROCESSING OF DATA PIPELINES USING A LOCKLESS SHARED MEMORY SYSTEM 43 ASYNCHRONOUS WEB PAGE DATA AGGREGATOR 44 APPLICATIONS OF DISTIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY TO REAL TIME NEWS SERVICE 45 DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME THREAT ANALYSIS 46 DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR REAL TIME EMERGENCY RESPONSE 47 DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR CLIMATE ANALYTICS 48 DISTRIBUTED PROCESSING AND DATA BROADCASTING TECHNOLOGY FOR INSURANCE RISK ASSESSMENT 49 DISTRIBUTED PARALLEL ARCHITECTURES FOR REAL TIME PROCESSING OF STREAMS OF STRUCTURED AND UNSTRUCTURED DATA
  • 41. AKUDA LABS PROPRIETARY AND CONFIDENTIAL THE#AKUDA#SYSTEM# Addi@onal#Informa@on#
  • 42. AKUDA LABS PROPRIETARY AND CONFIDENTIAL The Solution Akuda Core Topology with Kafka UU#OnUchipUnetwork#Comm#Control# UU#ZeroUcopy#Data#Broadcas@ng# UU#Lockfree#queue,#pipeline#control# UU#Lockfree#correlator# UU#Lockfree#Mul@threaded#Processing# Feed BC Kafka Indexing / AnalyticsRx Tx Visualization 1 10 1000 Akuda Core Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Akuda Core Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Indexing / AnalyticsRx Tx Visualization Akuda Core Akuda Core Akuda Core
  • 43. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Pulsar Functional View Unstructured Data Source Streams Unstructured Data Source Batch Unstructured Data Source Images MILLIONS OF DOCUMENTS PER SECOND LDA CONTROL AKUDA DEEP INSPECTION THIRD-PARTY DATA ANALYTICS HADOOP BASED ANALYTICS THIRD-PARTY VISUALIZATION AKUDA DASHBOARD RT Content Classification (DFA/LDA/VEC) RT Author Classification (DFA/LDA) Optimizing Parallelizing Compiler Normalization RT Author Image Analysis (NEURAL NETS) Universal Indexing P-GRAM GEN Indexer STATS / ANALYTICS Author ATTR Author GEO Author DEM LDA PROC P-GRAM GEN LDA PROC 10+ BILLIONS OF CLASSIFICATIONS PER SECOND MISSION EDITOR
  • 44. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Automatic Cluster Discovery P-GRAMS, LDA, CONVERGENCE Mission Deep Inspection Store Summarizer p-GRAM Generator Mission Stream Concept Extractor LDA Solver Convergence Monitor p-GRAMS Corpus Summary Corpus Concept Cloud Labeled Corpus Clusters Classification Model Library LDA Cluster Generation & Labeling LDA Cluster Refinement DFA Classifier Refinement LDA Classifier Generator
  • 45. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Author Attribute Discovery Neural Networks, Bayesian Models, DFAs Ethnicity Image Analyzer Author Info Analyzer (LGM) Real-time Stream Aggregator Author Geolocation Analyzer (LGM) Author Attribute Processor (LGM) Real-time Stream Correlator Massively Parallel RT Classification Engine AKU DA Broad caster AuthorAtributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector AuthorAttributeDetector (LGM) AuthorAtributeDetector AuthorAtributeDetector AuthorAtributeDetector Unstructured Data Source A Unstructured Data Source B Unstructured Data Source C Normalization Age Image Analyzer Gender Image Analyzer Labeled Image Generator Neural Network Trainer Author Bayesian Classification Model Trainer
  • 46. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Generalized Image Classification Neural Networks, Bayesian Models, DFAs Ethnicity Image Analyzer Age Image Analyzer Gender Image Analyzer Labeled Image Generator Neural Network Trainer Image Data Sources Image Harvester Logo Identification Face Detector Glasses Image Analyzer Weight Image Analyzer Hair-style Image Analyzer Shape Identification Emotion Image Analyzer Image Label Classifier Image DB
  • 47. AKUDA LABS PROPRIETARY AND CONFIDENTIAL Pipeline Editor Automatic LDA Models, User-specified DFAs RT Content Classification (DFA/LDA/VEC) Optimizing Parallelizing Compiler PIPELINE EDITOR Filtering, Analysis And Action Network LDA Classifier Vector String CMP Vector INT/ FP CMP DFA Counter Tap Action Block DFA Counter Tap Counter Tap DFA Action Block Outp utInou t LDA Classifier Vector String CMP Vector INT/FP CMP DFA Action Block Counter Tap Model Library Airlines Auto Auto Insurance Cable Beverages Fast Food Finance Housing Legal Pharma/Health Most Used Detectors Tech Advertisement Inquiry Customer Service Irate Customers Thankful Customers Consumers STATE MANAGEMENT P-GRAM GEN Indexer LDA PROC