SlideShare a Scribd company logo
1 of 37
Download to read offline
2nd BMBF Big Data All Hands Meeting and
2nd Smart Data Innovation Conference
Karlsruhe , October 11.-12., 2017
Presenting at
Efficiently Handling Streams
from Millions of Sensors
Jonas Traub – TU Berlin / DFKI
1
The Growth of the Internet of Things
Gartner says 6.4 billion connected
"Things" will be in use in 2016 and
more than 20 billion in 2020.
Year
# Devices (in billions)
Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 2
Goal
Provide real-time insights based on IoT data.
Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 3
Problem
• Billions of devices provide real-time data
• Result: Vast amount of data streams
Heavy Network Utilization Scalability Challenges Increasing Latencies
Financial Costs
Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 4
Solution
Produce and process data streams
based on the data demand of applications.
Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 5
State of the Art Approach
Data Stream Production with Periodic Sampling
Major Challenges:
• Oversampling
• Missing Adaptivity
Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 6
Solution
On-Demand Data Streaming from Sensor Nodes
Optimized On-Demand Data
Streaming from Sensor Nodes
Jonas Traub – TU Berlin / DFKI – Efficently Handling Streams from Millions of Sensors 7
State of the Art Approach
Provide all Data to Front-End Applications
Optimized On-Demand Data
Streaming from Sensor Nodes
Major Challenge:
• Front End Overload
Jonas Traub – TU Berlin / DFKI – Efficently Handling Streams from Millions of Sensors 8
Solution
Adaptive Data Reduction with Streaming Engines
Optimized On-Demand Data
Streaming from Sensor Nodes
I²: Interactive Real-Time Visualization
for Streaming Data
Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 9
Solution
Adaptive Data Reduction with Streaming
Engines
Optimized On-Demand Data
Streaming from Sensor Nodes
I²: Interactive Real-Time Visualization
for Streaming Data
Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 10
Solution
Efficient Processing of user-defined Windows
Optimized On-Demand Data
Streaming from Sensor Nodes
I²: Interactive Real-Time Visualization
for Streaming Data
Cutty: Aggregate Sharing for
User-Defined Windows
Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 11
Publications
Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 12
Optimized On-Demand Data Streaming
from Sensor Nodes
Jonas Traub, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, Volker Markl
Santa Clara, California,
September 25-27, 2017
13
Architecture Overview
14
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
Architecture Overview
14
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
Architecture Overview
14
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
Architecture Overview
14
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
Architecture Overview
14
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
User-Defined Sampling Functions
19
• Provide an abstraction to define the data demand of applications.
• Upon a sensor read, request the next sensor read.
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
User-Defined Sampling Functions
20
• Provide an abstraction to define the data demand of applications.
• Upon a sensor read, request the next sensor read.
• Make read time tolerances explicit.
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
User-Defined Sampling Functions
21
Enable adaptive sampling techniques to reduce data transmission
e.g., Adam [Trihinas ‘15], FAST [Fan ‘14], L-SIP [Gaura ’13]
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
Sensor Read Fusion
22
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
Sensor Read Fusion
23
1) Minimize Sensor Reads and Data Transfer:
Latest possible read time
2) Optimize Sensor Read Times:
● Check the paper for all details on the read time optimizer!
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
24
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
Local Filtering
25
Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
Optimized On-Demand Data Streaming
from Sensor Nodes
Wrap-Up:
Tailor Data Streams to the Demand of Applications
• Define data demand: User-Defined Sampling Functions
• Schedule sensor reads and data transfer on-demand
• Optimize read times globally - for all users and queries
Jonas Traub, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, Volker Markl
26
Cutty: Aggregate Sharing
for User-Defined Windows
Paris Cabone, Jonas Traub, Asterios Katsifodimos, Seif Haridi, Volker Markl
27
Streaming Window Aggragation
Paris Carbone et al. – Cutty: Aggregate Sharing for User-Defined Windows – CIKM 2017 28
Stream Slicing
Paris Carbone et al. – Cutty: Aggregate Sharing for User-Defined Windows – CIKM 2017 29
Applicability of Stream Slicing
Paris Carbone et al. – Cutty: Aggregate Sharing for User-Defined Windows – CIKM 2017 30
Yes, we can do better!
Paris Carbone et al. – Cutty: Aggregate Sharing for User-Defined Windows – CIKM 2017 31
Cutty Overview
Paris Carbone et al. – Cutty: Aggregate Sharing for User-Defined Windows – CIKM 2017 32
Cutty: Aggregate Sharing
for User-Defined Windows
Wrap-Up:
Enable Stream Slicing beyond Simple Tumbling and Sliding Windows
• Cutty enables Stream Slicing for a broad class of windows
• Cutty combines Stream Slicing, On-the-fly Aggregation,
Aggregate Sharing, and Aggregate Trees
Paris Cabone, Jonas Traub, Asterios Katsifodimos, Seif Haridi, Volker Markl
33
I²: Interactive Real-Time Visualization
for Streaming Data
Jonas Traub, Nikolaas Steenbergen, Philipp Grulich, Tilmann Rabl, Volker Markl
34
Architecture Overview
Jonas Traub et al. – I²: Interactive Real-Time Visualization for Streaming Data – EDBT 2017 35
Check out our Flink Forward Talk
youtube.com/watch?v=JNbq239JkK4
36
The Big Picture
Optimized On-Demand Data
Streaming from Sensor Nodes
Traub et al.; ACM SoCC’17
I²: Interactive Real-Time Visualization
for Streaming Data
Traub et al.; EDBT’17
Cutty: Aggregate Sharing for
User-Defined Windows
Carbone et al.; CIKM’16
Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors

More Related Content

Viewers also liked

Viewers also liked (7)

In Memory Analytics with Apache Spark
In Memory Analytics with Apache SparkIn Memory Analytics with Apache Spark
In Memory Analytics with Apache Spark
 
JT@UCSB - On-Demand Data Streaming from Sensor Nodes and A quick overview of ...
JT@UCSB - On-Demand Data Streaming from Sensor Nodes and A quick overview of ...JT@UCSB - On-Demand Data Streaming from Sensor Nodes and A quick overview of ...
JT@UCSB - On-Demand Data Streaming from Sensor Nodes and A quick overview of ...
 
An Introduction to Evaluation in Medical Visualization
An Introduction to Evaluation in Medical VisualizationAn Introduction to Evaluation in Medical Visualization
An Introduction to Evaluation in Medical Visualization
 
Processing Twitter Events in Real-Time with Oracle Event Processing (OEP) 12c
Processing Twitter Events in Real-Time with Oracle Event Processing (OEP) 12cProcessing Twitter Events in Real-Time with Oracle Event Processing (OEP) 12c
Processing Twitter Events in Real-Time with Oracle Event Processing (OEP) 12c
 
Web 2 0 Projects Elementary
Web 2 0 Projects ElementaryWeb 2 0 Projects Elementary
Web 2 0 Projects Elementary
 
What Is Visualization?
What Is Visualization?What Is Visualization?
What Is Visualization?
 
Stream Processing with Kafka in Uber, Danny Yuan
Stream Processing with Kafka in Uber, Danny Yuan Stream Processing with Kafka in Uber, Danny Yuan
Stream Processing with Kafka in Uber, Danny Yuan
 

More from Jonas Traub

Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
Jonas Traub
 
code.talks 2019 - Scotty: Efficient Window Aggregation for your Stream Proces...
code.talks 2019 - Scotty: Efficient Window Aggregation for your Stream Proces...code.talks 2019 - Scotty: Efficient Window Aggregation for your Stream Proces...
code.talks 2019 - Scotty: Efficient Window Aggregation for your Stream Proces...
Jonas Traub
 
FlinkForward Berlin 2019 - Scotty: Efficient Window Aggregation with General ...
FlinkForward Berlin 2019 - Scotty: Efficient Window Aggregation with General ...FlinkForward Berlin 2019 - Scotty: Efficient Window Aggregation with General ...
FlinkForward Berlin 2019 - Scotty: Efficient Window Aggregation with General ...
Jonas Traub
 
Efficient Window Aggregation with General Stream Slicing (EDBT 2019, Best Paper)
Efficient Window Aggregation with General Stream Slicing (EDBT 2019, Best Paper)Efficient Window Aggregation with General Stream Slicing (EDBT 2019, Best Paper)
Efficient Window Aggregation with General Stream Slicing (EDBT 2019, Best Paper)
Jonas Traub
 
Resense: Transparent Record and Replay of Sensor Data in the Internet of Thin...
Resense: Transparent Record and Replay of Sensor Data in the Internet of Thin...Resense: Transparent Record and Replay of Sensor Data in the Internet of Thin...
Resense: Transparent Record and Replay of Sensor Data in the Internet of Thin...
Jonas Traub
 
Flink Forward 2018: Efficient Window Aggregation with Stream Slicing
Flink Forward 2018: Efficient Window Aggregation with Stream SlicingFlink Forward 2018: Efficient Window Aggregation with Stream Slicing
Flink Forward 2018: Efficient Window Aggregation with Stream Slicing
Jonas Traub
 
Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing
Scotty: Efficient Window Aggregation for Out-of-Order Stream ProcessingScotty: Efficient Window Aggregation for Out-of-Order Stream Processing
Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing
Jonas Traub
 
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
Jonas Traub
 

More from Jonas Traub (17)

Definitely not Java! A Hands-on Introduction to Efficient Functional Programm...
Definitely not Java! A Hands-on Introduction to Efficient Functional Programm...Definitely not Java! A Hands-on Introduction to Efficient Functional Programm...
Definitely not Java! A Hands-on Introduction to Efficient Functional Programm...
 
Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
Efficient Data Stream Processing in the Internet of Things - SoftwareCampus A...
 
code.talks 2019 - Scotty: Efficient Window Aggregation for your Stream Proces...
code.talks 2019 - Scotty: Efficient Window Aggregation for your Stream Proces...code.talks 2019 - Scotty: Efficient Window Aggregation for your Stream Proces...
code.talks 2019 - Scotty: Efficient Window Aggregation for your Stream Proces...
 
FlinkForward Berlin 2019 - Scotty: Efficient Window Aggregation with General ...
FlinkForward Berlin 2019 - Scotty: Efficient Window Aggregation with General ...FlinkForward Berlin 2019 - Scotty: Efficient Window Aggregation with General ...
FlinkForward Berlin 2019 - Scotty: Efficient Window Aggregation with General ...
 
Analyzing Efficient Stream Processing on Modern Hardware (VLDB 2019 Presentat...
Analyzing Efficient Stream Processing on Modern Hardware (VLDB 2019 Presentat...Analyzing Efficient Stream Processing on Modern Hardware (VLDB 2019 Presentat...
Analyzing Efficient Stream Processing on Modern Hardware (VLDB 2019 Presentat...
 
Database Research at TU Berlin DIMA and DFKI IAM - USA Excursion Slides 2019
Database Research at TU Berlin DIMA and DFKI IAM - USA Excursion Slides 2019Database Research at TU Berlin DIMA and DFKI IAM - USA Excursion Slides 2019
Database Research at TU Berlin DIMA and DFKI IAM - USA Excursion Slides 2019
 
Efficient Window Aggregation with General Stream Slicing (EDBT 2019, Best Paper)
Efficient Window Aggregation with General Stream Slicing (EDBT 2019, Best Paper)Efficient Window Aggregation with General Stream Slicing (EDBT 2019, Best Paper)
Efficient Window Aggregation with General Stream Slicing (EDBT 2019, Best Paper)
 
Resense: Transparent Record and Replay of Sensor Data in the Internet of Thin...
Resense: Transparent Record and Replay of Sensor Data in the Internet of Thin...Resense: Transparent Record and Replay of Sensor Data in the Internet of Thin...
Resense: Transparent Record and Replay of Sensor Data in the Internet of Thin...
 
Flink Forward 2018: Efficient Window Aggregation with Stream Slicing
Flink Forward 2018: Efficient Window Aggregation with Stream SlicingFlink Forward 2018: Efficient Window Aggregation with Stream Slicing
Flink Forward 2018: Efficient Window Aggregation with Stream Slicing
 
Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing
Scotty: Efficient Window Aggregation for Out-of-Order Stream ProcessingScotty: Efficient Window Aggregation for Out-of-Order Stream Processing
Scotty: Efficient Window Aggregation for Out-of-Order Stream Processing
 
Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive W...
Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive W...Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive W...
Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive W...
 
Efficient SIMD Vectorization for Hashing in OpenCL
Efficient SIMD Vectorization for Hashing in OpenCLEfficient SIMD Vectorization for Hashing in OpenCL
Efficient SIMD Vectorization for Hashing in OpenCL
 
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
UZH Stream Reasoning Workshop 2018: Optimized On-Demand Data Streaming from S...
 
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
I²: Interactive Real-Time Visualization for Streaming Data with Apache Flink ...
 
I²: Interactive Real-Time Visualization for Streaming Data
I²: Interactive Real-Time Visualization for Streaming DataI²: Interactive Real-Time Visualization for Streaming Data
I²: Interactive Real-Time Visualization for Streaming Data
 
LWA 2015: The Apache Flink Platform (Poster)
LWA 2015: The Apache Flink Platform (Poster)LWA 2015: The Apache Flink Platform (Poster)
LWA 2015: The Apache Flink Platform (Poster)
 
LWA 2015: The Apache Flink Platform for Parallel Batch and Stream Analysis
LWA 2015: The Apache Flink Platform for Parallel Batch and Stream AnalysisLWA 2015: The Apache Flink Platform for Parallel Batch and Stream Analysis
LWA 2015: The Apache Flink Platform for Parallel Batch and Stream Analysis
 

Recently uploaded

FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 

Recently uploaded (20)

FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 

Efficiently Handling Streams from Millions of Sesors (@KIT - 2nd BMBF Big Data All Hands Meeting and 2nd Smart Data Innovation Conference)

  • 1. 2nd BMBF Big Data All Hands Meeting and 2nd Smart Data Innovation Conference Karlsruhe , October 11.-12., 2017 Presenting at Efficiently Handling Streams from Millions of Sensors Jonas Traub – TU Berlin / DFKI 1
  • 2. The Growth of the Internet of Things Gartner says 6.4 billion connected "Things" will be in use in 2016 and more than 20 billion in 2020. Year # Devices (in billions) Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 2
  • 3. Goal Provide real-time insights based on IoT data. Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 3
  • 4. Problem • Billions of devices provide real-time data • Result: Vast amount of data streams Heavy Network Utilization Scalability Challenges Increasing Latencies Financial Costs Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 4
  • 5. Solution Produce and process data streams based on the data demand of applications. Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 5
  • 6. State of the Art Approach Data Stream Production with Periodic Sampling Major Challenges: • Oversampling • Missing Adaptivity Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 6
  • 7. Solution On-Demand Data Streaming from Sensor Nodes Optimized On-Demand Data Streaming from Sensor Nodes Jonas Traub – TU Berlin / DFKI – Efficently Handling Streams from Millions of Sensors 7
  • 8. State of the Art Approach Provide all Data to Front-End Applications Optimized On-Demand Data Streaming from Sensor Nodes Major Challenge: • Front End Overload Jonas Traub – TU Berlin / DFKI – Efficently Handling Streams from Millions of Sensors 8
  • 9. Solution Adaptive Data Reduction with Streaming Engines Optimized On-Demand Data Streaming from Sensor Nodes I²: Interactive Real-Time Visualization for Streaming Data Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 9
  • 10. Solution Adaptive Data Reduction with Streaming Engines Optimized On-Demand Data Streaming from Sensor Nodes I²: Interactive Real-Time Visualization for Streaming Data Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 10
  • 11. Solution Efficient Processing of user-defined Windows Optimized On-Demand Data Streaming from Sensor Nodes I²: Interactive Real-Time Visualization for Streaming Data Cutty: Aggregate Sharing for User-Defined Windows Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 11
  • 12. Publications Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors 12
  • 13. Optimized On-Demand Data Streaming from Sensor Nodes Jonas Traub, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, Volker Markl Santa Clara, California, September 25-27, 2017 13
  • 14. Architecture Overview 14 Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 15. Architecture Overview 14 Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 16. Architecture Overview 14 Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 17. Architecture Overview 14 Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 18. Architecture Overview 14 Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 19. User-Defined Sampling Functions 19 • Provide an abstraction to define the data demand of applications. • Upon a sensor read, request the next sensor read. Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 20. User-Defined Sampling Functions 20 • Provide an abstraction to define the data demand of applications. • Upon a sensor read, request the next sensor read. • Make read time tolerances explicit. Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 21. User-Defined Sampling Functions 21 Enable adaptive sampling techniques to reduce data transmission e.g., Adam [Trihinas ‘15], FAST [Fan ‘14], L-SIP [Gaura ’13] Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 22. Sensor Read Fusion 22 Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 23. Sensor Read Fusion 23 1) Minimize Sensor Reads and Data Transfer: Latest possible read time 2) Optimize Sensor Read Times: ● Check the paper for all details on the read time optimizer! Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 24. 24 Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 25. Local Filtering 25 Jonas Traub et al. – Optimized On-Demand Data Streaming from Sensor Nodes – ACM SoCC 2017
  • 26. Optimized On-Demand Data Streaming from Sensor Nodes Wrap-Up: Tailor Data Streams to the Demand of Applications • Define data demand: User-Defined Sampling Functions • Schedule sensor reads and data transfer on-demand • Optimize read times globally - for all users and queries Jonas Traub, Sebastian Breß, Asterios Katsifodimos, Tilmann Rabl, Volker Markl 26
  • 27. Cutty: Aggregate Sharing for User-Defined Windows Paris Cabone, Jonas Traub, Asterios Katsifodimos, Seif Haridi, Volker Markl 27
  • 28. Streaming Window Aggragation Paris Carbone et al. – Cutty: Aggregate Sharing for User-Defined Windows – CIKM 2017 28
  • 29. Stream Slicing Paris Carbone et al. – Cutty: Aggregate Sharing for User-Defined Windows – CIKM 2017 29
  • 30. Applicability of Stream Slicing Paris Carbone et al. – Cutty: Aggregate Sharing for User-Defined Windows – CIKM 2017 30
  • 31. Yes, we can do better! Paris Carbone et al. – Cutty: Aggregate Sharing for User-Defined Windows – CIKM 2017 31
  • 32. Cutty Overview Paris Carbone et al. – Cutty: Aggregate Sharing for User-Defined Windows – CIKM 2017 32
  • 33. Cutty: Aggregate Sharing for User-Defined Windows Wrap-Up: Enable Stream Slicing beyond Simple Tumbling and Sliding Windows • Cutty enables Stream Slicing for a broad class of windows • Cutty combines Stream Slicing, On-the-fly Aggregation, Aggregate Sharing, and Aggregate Trees Paris Cabone, Jonas Traub, Asterios Katsifodimos, Seif Haridi, Volker Markl 33
  • 34. I²: Interactive Real-Time Visualization for Streaming Data Jonas Traub, Nikolaas Steenbergen, Philipp Grulich, Tilmann Rabl, Volker Markl 34
  • 35. Architecture Overview Jonas Traub et al. – I²: Interactive Real-Time Visualization for Streaming Data – EDBT 2017 35
  • 36. Check out our Flink Forward Talk youtube.com/watch?v=JNbq239JkK4 36
  • 37. The Big Picture Optimized On-Demand Data Streaming from Sensor Nodes Traub et al.; ACM SoCC’17 I²: Interactive Real-Time Visualization for Streaming Data Traub et al.; EDBT’17 Cutty: Aggregate Sharing for User-Defined Windows Carbone et al.; CIKM’16 Jonas Traub – TU Berlin / DFKI – Efficiently Handling Streams from Millions of Sensors