SlideShare uma empresa Scribd logo
1 de 134
Baixar para ler offline
A Benchmark Suite for
Distributed Stream Processing
Systems
Maycon Viana Bordin
Claudio Geyer
Advisor
April, 2017
1
2
HUGE amounts of data
are being generated in
real-time
3
4
500M tweets
are sent per day
5
6
4.75B shares
4.5B likes
420M status updates
300M photos
EVERY DAY.
7
8
They need to process…
9
They need to process…
large volumes of data
10
They need to process…
large volumes of data
in real-time
11
They need to process…
large volumes of data
in real-time
continuously
12
They need to process…
large volumes of data
in real-time
continuously
producing actionable information
13
14
Stream Processing
15
16
17
B
18
B
19
20
Data Stream
21
B
22
B
23
B 1234567
24
Data from the stream source may or
may not be structured
25
The amount of data is usually
unbounded in size
26
The input rate is variable and
typically unpredictable
27
There are many platforms
on the market
28
Problem:
How to know which platform
is better for an specific type
of application?
29
Problem:
Current stream processing
benchmarks are composed
mostly of synthetic
applications.
30
Problem:
Benchmarks for other Big
Data platforms use more real
world applications, e.g.
BigDataBench and HiBench.
31
Goals:
32
Specific Goals:
•
•
33
34
Benchmarks for
Stream Processing
35
Linear Road Benchmark [Ara04]
•
•
•
36
StreamBench [Lu14]
•
•
•
•
•
37
Yahoo Streaming Benchmark
•
•
•
38
BigDataBench[Wan14]
•
•
•
39
StreamBench[Wan16]
•
•
•
•
•
40
RIoTBench[Wan17]
•
•
•
•
41
HiBench[Hua10]
•
•
•
42
Comparison
43
44
Benchmark
Architecture
45
46
47
48
49
API
•
•
•
•
50
Metrics
•
•
•
51
Scripts for automation…
•
•
•
•
•
•
•
52
Benchmark
Applications
53
54
55
56
•
•
•
•
57
58
59
60
61
62
63
64
65
66
Benchmark
Metrics
67
68
69
𝐿𝑎𝑡𝑒𝑛𝑐𝑦 = 𝑇𝑒𝑛𝑑 − 𝑇𝑒𝑛𝑑
70
71
72
73
𝑇ℎ𝑟𝑜𝑢𝑔ℎ𝑝𝑢𝑡 =
𝑁𝑢𝑚. 𝑃𝑟𝑜𝑐𝑒𝑠𝑠𝑒𝑑 𝑇𝑢𝑝𝑙𝑒𝑠
𝑅𝑢𝑛𝑡𝑖𝑚𝑒
74
75
76
77
Comparison with the other
Benchmarks
78
79
80
Results
Set-up
81
82
•
•
•
•
83
•
•
•
84
85
86
87
n1_x1_x5_x6_x3 n1_x1_x2_x1_x4_x2 n1_x4_x2_x2
n1_x2_x5_x6_x3 n1_x2_x2_x1_x4_x2 n4_x2_x2_x2
n1_x3_x5_x6_x3 n1_x4_x2_x1_x4_x2 n4_x8_x2_x2
n2_x1_x5_x6_x3 n1_x8_x2_x1_x4_x2
n2_x2_x5_x6_x3 n2_x1_x2_x1_x4_x2
n2_x3_x5_x6_x3 n2_x2_x2_x1_x4_x2
n4_x1_x5_x6_x3 n2_x4_x2_x1_x4_x2
n4_x2_x5_x6_x3 n2_x8_x2_x1_x4_x2
n4_x3_x5_x6_x3 n4_x1_x2_x1_x4_x2
n8_x1_x5_x6_x3 n4_x2_x2_x1_x4_x2
n8_x2_x5_x6_x3 n4_x4_x2_x1_x4_x2
n8_x3_x5_x6_x3 n4_x8_x2_x1_x4_x2
n8_x1_x2_x1_x4_x2
n8_x2_x2_x1_x4_x2
n8_x4_x2_x1_x4_x2
n8_x8_x2_x1_x4_x2 88
Results
Word Count: Storm
89
90
91
92
93
n8_x4
n4_x2
n4_x2_x10_x12_x6
n2_x1_x5_x6_x3
n1_x2_x5_x6_x3
94
Results
Word Count: Spark
95
96
97
98
99
Results
Log Processing: Storm
100
101
102
103
n8_x3
n4_x1_x2_x1_x4_x2
n2_x1_x2_x1_x4_x2
n1_x4_x2_x1_x4_x2
104
Results
Log Processing: Spark
105
106
107
108
109
Results
Traffic Monitoring: Storm
110
111
112
113
Results
Traffic Monitoring: Spark
114
115
116
117
118
119
120
121
Conclusion
122
•
•
•
123
•
•
124
Future Work
125
•
•
•
•
126
•
•
Publications
127
A Benchmark Suite for
Distributed Stream Processing
Systems
Maycon Viana Bordin
Claudio Geyer
Advisor
April, 2017
128
129
130
131
132
133
134

Mais conteúdo relacionado

Semelhante a A Benchmark Suite for Distributed Stream Processing Systems

Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
Vamshikrishna Goud
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Simplilearn
 

Semelhante a A Benchmark Suite for Distributed Stream Processing Systems (20)

BigData-Challenges.pptx
BigData-Challenges.pptxBigData-Challenges.pptx
BigData-Challenges.pptx
 
bigdata.pptx
bigdata.pptxbigdata.pptx
bigdata.pptx
 
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
 
Big Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalBig Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar Semwal
 
Inauguration lecture Martin Pinzger, University of Klagenfurt, Austria
Inauguration lecture Martin Pinzger, University of Klagenfurt, AustriaInauguration lecture Martin Pinzger, University of Klagenfurt, Austria
Inauguration lecture Martin Pinzger, University of Klagenfurt, Austria
 
Kartikey tripathi
Kartikey tripathiKartikey tripathi
Kartikey tripathi
 
Understanding big data
Understanding big dataUnderstanding big data
Understanding big data
 
Big Data Fabric: A Recipe for Big Data Initiatives
Big Data Fabric: A Recipe for Big Data InitiativesBig Data Fabric: A Recipe for Big Data Initiatives
Big Data Fabric: A Recipe for Big Data Initiatives
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Big-Data-AryaTadbirNetworkDesigners
Big-Data-AryaTadbirNetworkDesignersBig-Data-AryaTadbirNetworkDesigners
Big-Data-AryaTadbirNetworkDesigners
 
Big Data for One Big Family
Big Data for One Big FamilyBig Data for One Big Family
Big Data for One Big Family
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
 
Applying Big Data
Applying Big DataApplying Big Data
Applying Big Data
 
Matthew Johnston - Big Data Futures Outlook BCM
Matthew Johnston - Big Data Futures Outlook BCMMatthew Johnston - Big Data Futures Outlook BCM
Matthew Johnston - Big Data Futures Outlook BCM
 
Analyses and processing of big data in financial services
Analyses and processing of big data in financial servicesAnalyses and processing of big data in financial services
Analyses and processing of big data in financial services
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 
Sl nashville major Hospital company
Sl nashville major Hospital companySl nashville major Hospital company
Sl nashville major Hospital company
 
Big data Ppt
Big data PptBig data Ppt
Big data Ppt
 

Mais de Maycon Viana Bordin

A Benchmark Suite for Distributed Stream Processing Systems
A Benchmark Suite for Distributed Stream Processing SystemsA Benchmark Suite for Distributed Stream Processing Systems
A Benchmark Suite for Distributed Stream Processing Systems
Maycon Viana Bordin
 
Algoritmos de Consenso: Paxos vs RAFT
Algoritmos de Consenso: Paxos vs RAFTAlgoritmos de Consenso: Paxos vs RAFT
Algoritmos de Consenso: Paxos vs RAFT
Maycon Viana Bordin
 
Portal De Relacionamento entre Parceiros de Negócios para uma Cooperativa Agr...
Portal De Relacionamento entre Parceiros de Negócios para uma Cooperativa Agr...Portal De Relacionamento entre Parceiros de Negócios para uma Cooperativa Agr...
Portal De Relacionamento entre Parceiros de Negócios para uma Cooperativa Agr...
Maycon Viana Bordin
 
Arrependimento de adolescentes proprietários de celulares
Arrependimento de adolescentes proprietários de celularesArrependimento de adolescentes proprietários de celulares
Arrependimento de adolescentes proprietários de celulares
Maycon Viana Bordin
 
OrganicPM - People Management System
OrganicPM - People Management SystemOrganicPM - People Management System
OrganicPM - People Management System
Maycon Viana Bordin
 
Sistema para Gestão de Pessoas - OrganicPM
Sistema para Gestão de Pessoas - OrganicPMSistema para Gestão de Pessoas - OrganicPM
Sistema para Gestão de Pessoas - OrganicPM
Maycon Viana Bordin
 
Características dos Integrantes das Equipes
Características dos Integrantes das EquipesCaracterísticas dos Integrantes das Equipes
Características dos Integrantes das Equipes
Maycon Viana Bordin
 
Propostas de Aplicação de IA no Software Proschedule
Propostas de Aplicação de IA no Software ProschedulePropostas de Aplicação de IA no Software Proschedule
Propostas de Aplicação de IA no Software Proschedule
Maycon Viana Bordin
 

Mais de Maycon Viana Bordin (20)

Stream Processing Overview
Stream Processing OverviewStream Processing Overview
Stream Processing Overview
 
Uma Arquitetura de Stream Processing e ETL Serverless na AWS
Uma Arquitetura de Stream Processing e ETL Serverless na AWSUma Arquitetura de Stream Processing e ETL Serverless na AWS
Uma Arquitetura de Stream Processing e ETL Serverless na AWS
 
Introdução a Arquitetura Android
Introdução a Arquitetura AndroidIntrodução a Arquitetura Android
Introdução a Arquitetura Android
 
Desenvolvimento de uma Rede Social Baseada em Geolocalização
Desenvolvimento de uma Rede Social Baseada em GeolocalizaçãoDesenvolvimento de uma Rede Social Baseada em Geolocalização
Desenvolvimento de uma Rede Social Baseada em Geolocalização
 
A Benchmark Suite for Distributed Stream Processing Systems
A Benchmark Suite for Distributed Stream Processing SystemsA Benchmark Suite for Distributed Stream Processing Systems
A Benchmark Suite for Distributed Stream Processing Systems
 
Introdução a Programação Android - Aula 5
Introdução a Programação Android - Aula 5Introdução a Programação Android - Aula 5
Introdução a Programação Android - Aula 5
 
Introdução a Programação Android - Aula 4
Introdução a Programação Android - Aula 4Introdução a Programação Android - Aula 4
Introdução a Programação Android - Aula 4
 
Introdução a Programação Android - Aula 3
Introdução a Programação Android - Aula 3Introdução a Programação Android - Aula 3
Introdução a Programação Android - Aula 3
 
Introdução a Programação Android - Aula 2
Introdução a Programação Android - Aula 2Introdução a Programação Android - Aula 2
Introdução a Programação Android - Aula 2
 
Introdução a Programação Android - Aula 1
Introdução a Programação Android - Aula 1Introdução a Programação Android - Aula 1
Introdução a Programação Android - Aula 1
 
Development of a Distributed Stream Processing System
Development of a Distributed Stream Processing SystemDevelopment of a Distributed Stream Processing System
Development of a Distributed Stream Processing System
 
Algoritmos de Consenso: Paxos vs RAFT
Algoritmos de Consenso: Paxos vs RAFTAlgoritmos de Consenso: Paxos vs RAFT
Algoritmos de Consenso: Paxos vs RAFT
 
Introdução ao Android
Introdução ao AndroidIntrodução ao Android
Introdução ao Android
 
Portal De Relacionamento entre Parceiros de Negócios para uma Cooperativa Agr...
Portal De Relacionamento entre Parceiros de Negócios para uma Cooperativa Agr...Portal De Relacionamento entre Parceiros de Negócios para uma Cooperativa Agr...
Portal De Relacionamento entre Parceiros de Negócios para uma Cooperativa Agr...
 
Como Vencer no Caos
Como Vencer no CaosComo Vencer no Caos
Como Vencer no Caos
 
Arrependimento de adolescentes proprietários de celulares
Arrependimento de adolescentes proprietários de celularesArrependimento de adolescentes proprietários de celulares
Arrependimento de adolescentes proprietários de celulares
 
OrganicPM - People Management System
OrganicPM - People Management SystemOrganicPM - People Management System
OrganicPM - People Management System
 
Sistema para Gestão de Pessoas - OrganicPM
Sistema para Gestão de Pessoas - OrganicPMSistema para Gestão de Pessoas - OrganicPM
Sistema para Gestão de Pessoas - OrganicPM
 
Características dos Integrantes das Equipes
Características dos Integrantes das EquipesCaracterísticas dos Integrantes das Equipes
Características dos Integrantes das Equipes
 
Propostas de Aplicação de IA no Software Proschedule
Propostas de Aplicação de IA no Software ProschedulePropostas de Aplicação de IA no Software Proschedule
Propostas de Aplicação de IA no Software Proschedule
 

Último

Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
VictoriaMetrics
 
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
chiefasafspells
 
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
masabamasaba
 
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Medical / Health Care (+971588192166) Mifepristone and Misoprostol tablets 200mg
 
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Medical / Health Care (+971588192166) Mifepristone and Misoprostol tablets 200mg
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
masabamasaba
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
masabamasaba
 

Último (20)

AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
 
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
Love witchcraft +27768521739 Binding love spell in Sandy Springs, GA |psychic...
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
MarTech Trend 2024 Book : Marketing Technology Trends (2024 Edition) How Data...
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
 
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 
%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
 

A Benchmark Suite for Distributed Stream Processing Systems