SlideShare uma empresa Scribd logo
1 de 42
Baixar para ler offline
IoT NY - Cloud services for IoT
James Chittenden
Google Cloud Platform Solutions Engineer
jameschi@google.com
+James Chittenden
(Big Data Cloud Engineer)
jameschi@google.com
Google confidential │ Do not
distribute
Agenda
Big Data the Cloud Way - Why would you ?
Fully Managed: NoOps Ingest, Process & Analyse
Hands On Demo: Building an Event Streaming Pipeline
1
2
3
Big Data at Google
aka. Data at Google
20-?? BILLION
devices will be
connected by 2020
$4-11 Trillion
Economic Impact
54% of top performer
companies will invest
more in sensors this yr
Sources: Gartner, PwC, McKinsey
20-?? BILLION
devices will be
connected by 2020
$4-11 Trillion
Economic Impact
54% of top performer
companies will invest
more in sensors this yr
Sources: Gartner, PwC, McKinsey
What is IoT?
IoT is a period of transformation
Phone IoT Phone
Wearables
Watches
Phones
Cars
Home
Appliances
Existing Business Owned
Equipment
Connected
IoT is a transition to connected
Not Connected
Back in the 70s ….
The PC
The Result
A datacenter is not a collection of computers,
a datacenter is a computer.
The same is happening in the Cloud today
State of the art Data Centers.
For the past 17 years, Google has been building out the
world’s fastest, most powerful, highest quality cloud
infrastructure on the planet.
2002 2004 2006 2008 2010 2012
Dremel ColossusMapReduce
GFS Bigtable Spanner
2014
Dataflow
Google’s Big Data Innovations go far back Flumejava
BigQuery
Millwheel
Bigtable
Extends the Android platform to IoT devices
Weave - IoT Protocol and Schema
Google Glass at Work
Nest - solutions for the connected home
Health and Wearables
Confidential & ProprietaryGoogle Cloud Platform 21
Management
Mobile
Services
Compute
Big Data
Networking
Storage
Developer Tools
Fully Managed:
NoOps Ingest, Process & Analyze
StoreCapture Analyze
BigQuery
Process
DataflowCloud Storage
DatastoreCloud SQL
Hadoop/SparkKafka
Pub/Sub
Hadoop/Spark
Manage the Entire Lifecycle of Big Data
Dataflow
BigQuery
Fast ETL
Regex
JSON
UDFs
Spreadsheets
BI Tools
Coworkers
Applications + Reports
PubSub
Cloud Storage
BigTable
Your
Data
GCS-Hadoop
Connector
Hadoop on
Compute Engine Cloud Dataproc
unmanaged
managed
Big Data Architecture with Google managed services
Building what’s next 25
Scales automatically
No setup or administration
Stream up to 100,000 rows p/sec
Easily integrates with third-party software
Google BigQuery
makes complex data analysis simple
Question:
Find root cause why ad was or was
not delivered in the last 30 days.
select date,
rejection_reason, count(*)
from line_item_table.
last30days
where line_item_id=56781234
1.2B Rows scanned
Result in ~5 seconds!
BigQuery Use @Google: DoubleClick Support
BigQuery scales “Google scale”
Streaming ingest at peak
Largest Data Lake on BigQuery
Largest query by data size
Largest query by rows 10.5 Trillion rows
2.3 Million rows per second
38 Petabytes
2.1 Petabytes
What is BigQuery?
Externalization of Google Dremel
Convenience of SQL
Petabyte-Scale and Fast
Fully Managed, No-Ops Data Warehouse
Building what’s next 29
Merges batch and stream processing
Data processing pipelines
Monitoring interface
Significantly lower cost
Runs on Google or Cloudera Spark (Github)
Google Cloud Dataflow
makes complex data analysis simple
What is Cloud Dataflow?
Cloud Dataflow is a
collection of SDKs for
building batch or
streaming parallelized
data processing pipelines.
Cloud Dataflow is a fully
managed service for
executing optimized
parallelized data processing
pipelines.
Cloud Pub/Sub
• Globally redundant
• Low latency (sub sec.)
• Batched read/write
• Custom labels
• Push & Pull
• Auto expiration
Publisher A Publisher B Publisher C
Message 1
Topic A Topic B Topic C
Subscription XA Subscription XB
Subscription
YC
Subscription
ZC
Cloud
Pub/Sub
Subscriber X Subscriber Y
Message 2 Message 3
Subscriber Z
Message 1
Message 2
Message 3
Message 3
Dataflow goodies
Autoscaling mid-job
Fully managed - No-Ops
Intuitive Data Processing Framework
Batch and Stream Processing in one
Liquid sharding mid-job
1
2
3
4
5
Pipeline p = Pipeline.create();
p.begin()
.apply(TextIO.Read.from(“gs://…”))
.apply(ParDo.of(new ExtractTags())
.apply(Count.create())
.apply(ParDo.of(new ExpandPrefixes())
.apply(Top.largestPerKey(3))
.apply(TextIO.Write.to(“gs://…”));
p.run();
Dataflow goodies
Autoscaling mid-job
Fully managed - No-Ops
Intuitive Data Processing Framework
Batch and Stream Processing in one
Liquid sharding mid-job
1
2
3
4
5
Deploy
Schedule & Monitor
Autoscaling mid-job
Fully managed - No-Ops
Intuitive Data Processing Framework
Batch and Stream Processing in one
Liquid sharding mid-job
1
2
3
4
5
Dataflow goodies
800 RPS 1200 RPS 5000 RPS 50 RPS
Autoscaling mid-job
Fully managed - No-Ops
Intuitive Data Processing Framework
Batch and Stream Processing in one
Liquid sharding mid-job
1
2
3
4
5
Dataflow goodies
Autoscaling mid-job
Fully managed - No-Ops
Intuitive Data Processing Framework
Batch and Stream Processing in one
Liquid sharding mid-job
1
2
3
4
5
Dataflow goodies
Pipeline p = Pipeline.create();
p.begin()
.apply(TextIO.Read.from(“gs://…”))
.apply(ParDo.of(new ExtractTags())
.apply(Count.create())
.apply(ParDo.of(new ExpandPrefixes())
.apply(Top.largestPerKey(3))
.apply(TextIO.Write.to(“gs://…”));
p.run();
.apply(PubsubIO.Read.from(“input_topic”))
.apply(Window.<Integer>by(FixedWindows.of(5, MINUTES))
.apply(PubsubIO.Write.to(“output_topic”));
Autoscaling mid-job
Fully managed - No-Ops
Intuitive Data Processing Framework
Batch and Stream Processing in one
Liquid sharding mid-job
1
2
3
4
5
Dataflow goodies
Nighttime Mid-Day Nighttime
Demo Time
Pub/Sub
Ingest Process Analyse
Cloud Dataflow BigQuery
Git: https://github.com/james-google/event-streams-dataflow
Demo Time
Pub/Sub
Ingest Process Analyse
Cloud Dataflow BigQuery
Git: https://github.com/james-google/event-streams-dataflow
Questions?
Thank You
James Chittenden
jameschi@google.com

Mais conteúdo relacionado

Mais procurados

Come costruire una soluzione Digital Twin con AWS IoT e AI-ML
Come costruire una soluzione Digital Twin con AWS IoT e AI-MLCome costruire una soluzione Digital Twin con AWS IoT e AI-ML
Come costruire una soluzione Digital Twin con AWS IoT e AI-MLAmazon Web Services
 
AI as Driver of Transformation - Didier Ongena @CONNECT19
AI as Driver of Transformation - Didier Ongena @CONNECT19AI as Driver of Transformation - Didier Ongena @CONNECT19
AI as Driver of Transformation - Didier Ongena @CONNECT19Codit
 
Embedded digital twin
Embedded digital twinEmbedded digital twin
Embedded digital twinIRS srl
 
IT Technology Trends 2014
IT Technology Trends 2014IT Technology Trends 2014
IT Technology Trends 2014IMC Institute
 
Digital twin - Internet of Things
Digital twin - Internet of ThingsDigital twin - Internet of Things
Digital twin - Internet of ThingsAhmed Sayed
 
Forecast on Cloud Computing Trends 2015
Forecast on  Cloud Computing  Trends 2015Forecast on  Cloud Computing  Trends 2015
Forecast on Cloud Computing Trends 2015IMC Institute
 
Digital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoTDigital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoTDimitri Volkmann
 
Internet of Things and Azure - DevTeach 2016
Internet of Things and Azure - DevTeach 2016Internet of Things and Azure - DevTeach 2016
Internet of Things and Azure - DevTeach 2016Guy Barrette
 
Pushing the boundaries with IoT - Glenn Colpaert @CONNECT19
Pushing the boundaries with IoT - Glenn Colpaert @CONNECT19Pushing the boundaries with IoT - Glenn Colpaert @CONNECT19
Pushing the boundaries with IoT - Glenn Colpaert @CONNECT19Codit
 
Xanadu for Big Data + Deep Learning + Cloud + IoT Integration Strategy
Xanadu for Big Data + Deep Learning + Cloud + IoT Integration StrategyXanadu for Big Data + Deep Learning + Cloud + IoT Integration Strategy
Xanadu for Big Data + Deep Learning + Cloud + IoT Integration StrategyAlex G. Lee, Ph.D. Esq. CLP
 
Big Data LDN 2017: Creating ROI from Big Data Investments - Monetizing your B...
Big Data LDN 2017: Creating ROI from Big Data Investments - Monetizing your B...Big Data LDN 2017: Creating ROI from Big Data Investments - Monetizing your B...
Big Data LDN 2017: Creating ROI from Big Data Investments - Monetizing your B...Matt Stubbs
 
2 pc enterprise summit cronin newfinal aug 18
2 pc enterprise summit cronin newfinal aug 182 pc enterprise summit cronin newfinal aug 18
2 pc enterprise summit cronin newfinal aug 18IntelAPAC
 
Alleantia - internet of things for enterprises - enabling data-driven organiz...
Alleantia - internet of things for enterprises - enabling data-driven organiz...Alleantia - internet of things for enterprises - enabling data-driven organiz...
Alleantia - internet of things for enterprises - enabling data-driven organiz...Antonio Conati Barbaro
 
Digital twins - Technology that is Changing Industry
Digital twins - Technology that is Changing IndustryDigital twins - Technology that is Changing Industry
Digital twins - Technology that is Changing IndustryWg Cdr Jayesh C S PAI
 
Big Data LDN 2017: Big Impact with Big Data
Big Data LDN 2017: Big Impact with Big DataBig Data LDN 2017: Big Impact with Big Data
Big Data LDN 2017: Big Impact with Big DataMatt Stubbs
 
Unicaseed Demo Days - Alleantia
Unicaseed Demo Days - AlleantiaUnicaseed Demo Days - Alleantia
Unicaseed Demo Days - AlleantiaUnicaseed
 

Mais procurados (20)

Come costruire una soluzione Digital Twin con AWS IoT e AI-ML
Come costruire una soluzione Digital Twin con AWS IoT e AI-MLCome costruire una soluzione Digital Twin con AWS IoT e AI-ML
Come costruire una soluzione Digital Twin con AWS IoT e AI-ML
 
AI as Driver of Transformation - Didier Ongena @CONNECT19
AI as Driver of Transformation - Didier Ongena @CONNECT19AI as Driver of Transformation - Didier Ongena @CONNECT19
AI as Driver of Transformation - Didier Ongena @CONNECT19
 
Embedded digital twin
Embedded digital twinEmbedded digital twin
Embedded digital twin
 
IT Technology Trends 2014
IT Technology Trends 2014IT Technology Trends 2014
IT Technology Trends 2014
 
Digital twin - Internet of Things
Digital twin - Internet of ThingsDigital twin - Internet of Things
Digital twin - Internet of Things
 
Forecast on Cloud Computing Trends 2015
Forecast on  Cloud Computing  Trends 2015Forecast on  Cloud Computing  Trends 2015
Forecast on Cloud Computing Trends 2015
 
Internet of Things and Big Data
Internet of Things and Big DataInternet of Things and Big Data
Internet of Things and Big Data
 
The digital twin story
The digital twin storyThe digital twin story
The digital twin story
 
Digital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoTDigital Twin: A radical new approach to IoT
Digital Twin: A radical new approach to IoT
 
Internet of Things and Azure - DevTeach 2016
Internet of Things and Azure - DevTeach 2016Internet of Things and Azure - DevTeach 2016
Internet of Things and Azure - DevTeach 2016
 
Pushing the boundaries with IoT - Glenn Colpaert @CONNECT19
Pushing the boundaries with IoT - Glenn Colpaert @CONNECT19Pushing the boundaries with IoT - Glenn Colpaert @CONNECT19
Pushing the boundaries with IoT - Glenn Colpaert @CONNECT19
 
Xanadu for Big Data + Deep Learning + Cloud + IoT Integration Strategy
Xanadu for Big Data + Deep Learning + Cloud + IoT Integration StrategyXanadu for Big Data + Deep Learning + Cloud + IoT Integration Strategy
Xanadu for Big Data + Deep Learning + Cloud + IoT Integration Strategy
 
Big Data LDN 2017: Creating ROI from Big Data Investments - Monetizing your B...
Big Data LDN 2017: Creating ROI from Big Data Investments - Monetizing your B...Big Data LDN 2017: Creating ROI from Big Data Investments - Monetizing your B...
Big Data LDN 2017: Creating ROI from Big Data Investments - Monetizing your B...
 
2 pc enterprise summit cronin newfinal aug 18
2 pc enterprise summit cronin newfinal aug 182 pc enterprise summit cronin newfinal aug 18
2 pc enterprise summit cronin newfinal aug 18
 
AI at the Edge
AI at the EdgeAI at the Edge
AI at the Edge
 
Digital Twin: Starting the journey
Digital Twin: Starting the journeyDigital Twin: Starting the journey
Digital Twin: Starting the journey
 
Alleantia - internet of things for enterprises - enabling data-driven organiz...
Alleantia - internet of things for enterprises - enabling data-driven organiz...Alleantia - internet of things for enterprises - enabling data-driven organiz...
Alleantia - internet of things for enterprises - enabling data-driven organiz...
 
Digital twins - Technology that is Changing Industry
Digital twins - Technology that is Changing IndustryDigital twins - Technology that is Changing Industry
Digital twins - Technology that is Changing Industry
 
Big Data LDN 2017: Big Impact with Big Data
Big Data LDN 2017: Big Impact with Big DataBig Data LDN 2017: Big Impact with Big Data
Big Data LDN 2017: Big Impact with Big Data
 
Unicaseed Demo Days - Alleantia
Unicaseed Demo Days - AlleantiaUnicaseed Demo Days - Alleantia
Unicaseed Demo Days - Alleantia
 

Destaque

Firebase para se divertir com Internet das Coisas
Firebase para se divertir com Internet das CoisasFirebase para se divertir com Internet das Coisas
Firebase para se divertir com Internet das CoisasLuís Leão
 
Pub/Sub for the masses- Ein Einführungsworkshop in MQTT [GERMAN]
Pub/Sub for the masses- Ein Einführungsworkshop in MQTT [GERMAN]Pub/Sub for the masses- Ein Einführungsworkshop in MQTT [GERMAN]
Pub/Sub for the masses- Ein Einführungsworkshop in MQTT [GERMAN]Dominik Obermaier
 
DWS16 - Connected things forum - Samuel Ropert, IDATE DigiWorld
DWS16 - Connected things forum - Samuel Ropert, IDATE DigiWorldDWS16 - Connected things forum - Samuel Ropert, IDATE DigiWorld
DWS16 - Connected things forum - Samuel Ropert, IDATE DigiWorldIDATE DigiWorld
 
JCConf 2015 - 輕鬆學google的雲端開發 - Google App Engine入門(下)
JCConf 2015  - 輕鬆學google的雲端開發 - Google App Engine入門(下)JCConf 2015  - 輕鬆學google的雲端開發 - Google App Engine入門(下)
JCConf 2015 - 輕鬆學google的雲端開發 - Google App Engine入門(下)Simon Su
 
Cloudera for Internet of Things
Cloudera for Internet of ThingsCloudera for Internet of Things
Cloudera for Internet of ThingsCloudera, Inc.
 
Google Cloud Monitoring
Google Cloud MonitoringGoogle Cloud Monitoring
Google Cloud MonitoringSimon Su
 
自治体における最新IT活用事例について
自治体における最新IT活用事例について自治体における最新IT活用事例について
自治体における最新IT活用事例についてYasunobu Fukasawa
 
Watson による性格分析 API を使ってみた
Watson による性格分析 API を使ってみたWatson による性格分析 API を使ってみた
Watson による性格分析 API を使ってみたK Kimura
 
Cassandra and Riak at BestBuy.com
Cassandra and Riak at BestBuy.comCassandra and Riak at BestBuy.com
Cassandra and Riak at BestBuy.comjoelcrabb
 
IBM Watson Visual Recognition を紹介するよ
IBM Watson Visual Recognition を紹介するよIBM Watson Visual Recognition を紹介するよ
IBM Watson Visual Recognition を紹介するよTohru Kobayashi
 
The BestBuy.com Cloud Architecture
The BestBuy.com Cloud ArchitectureThe BestBuy.com Cloud Architecture
The BestBuy.com Cloud Architecturejoelcrabb
 
Get more from Analytics 360 with BigQuery and the Google Cloud Platform
Get more from Analytics 360 with BigQuery and the Google Cloud PlatformGet more from Analytics 360 with BigQuery and the Google Cloud Platform
Get more from Analytics 360 with BigQuery and the Google Cloud Platformjavier ramirez
 
S06 Azure バックアップを利用した Microsoft Azure 仮想マシンのバックアップ
S06 Azure バックアップを利用した Microsoft Azure 仮想マシンのバックアップS06 Azure バックアップを利用した Microsoft Azure 仮想マシンのバックアップ
S06 Azure バックアップを利用した Microsoft Azure 仮想マシンのバックアップMicrosoft Azure Japan
 
Watsonでメール監視
Watsonでメール監視Watsonでメール監視
Watsonでメール監視BMXUG
 
PythonとYAMLでGCPをDeploy!「Google Cloud Deployment Manager」
PythonとYAMLでGCPをDeploy!「Google Cloud Deployment Manager」PythonとYAMLでGCPをDeploy!「Google Cloud Deployment Manager」
PythonとYAMLでGCPをDeploy!「Google Cloud Deployment Manager」Terui Masashi
 

Destaque (20)

IoT at Google Scale
IoT at Google ScaleIoT at Google Scale
IoT at Google Scale
 
Firebase para se divertir com Internet das Coisas
Firebase para se divertir com Internet das CoisasFirebase para se divertir com Internet das Coisas
Firebase para se divertir com Internet das Coisas
 
Pub/Sub for the masses- Ein Einführungsworkshop in MQTT [GERMAN]
Pub/Sub for the masses- Ein Einführungsworkshop in MQTT [GERMAN]Pub/Sub for the masses- Ein Einführungsworkshop in MQTT [GERMAN]
Pub/Sub for the masses- Ein Einführungsworkshop in MQTT [GERMAN]
 
DWS16 - Connected things forum - Samuel Ropert, IDATE DigiWorld
DWS16 - Connected things forum - Samuel Ropert, IDATE DigiWorldDWS16 - Connected things forum - Samuel Ropert, IDATE DigiWorld
DWS16 - Connected things forum - Samuel Ropert, IDATE DigiWorld
 
Multi Screen Hell
Multi Screen HellMulti Screen Hell
Multi Screen Hell
 
Google Spanner
Google SpannerGoogle Spanner
Google Spanner
 
JCConf 2015 - 輕鬆學google的雲端開發 - Google App Engine入門(下)
JCConf 2015  - 輕鬆學google的雲端開發 - Google App Engine入門(下)JCConf 2015  - 輕鬆學google的雲端開發 - Google App Engine入門(下)
JCConf 2015 - 輕鬆學google的雲端開發 - Google App Engine入門(下)
 
Cloudera for Internet of Things
Cloudera for Internet of ThingsCloudera for Internet of Things
Cloudera for Internet of Things
 
Google Cloud Monitoring
Google Cloud MonitoringGoogle Cloud Monitoring
Google Cloud Monitoring
 
自治体における最新IT活用事例について
自治体における最新IT活用事例について自治体における最新IT活用事例について
自治体における最新IT活用事例について
 
Spanner
SpannerSpanner
Spanner
 
Watson による性格分析 API を使ってみた
Watson による性格分析 API を使ってみたWatson による性格分析 API を使ってみた
Watson による性格分析 API を使ってみた
 
Cassandra and Riak at BestBuy.com
Cassandra and Riak at BestBuy.comCassandra and Riak at BestBuy.com
Cassandra and Riak at BestBuy.com
 
IBM Watson Visual Recognition を紹介するよ
IBM Watson Visual Recognition を紹介するよIBM Watson Visual Recognition を紹介するよ
IBM Watson Visual Recognition を紹介するよ
 
The BestBuy.com Cloud Architecture
The BestBuy.com Cloud ArchitectureThe BestBuy.com Cloud Architecture
The BestBuy.com Cloud Architecture
 
Big Data Landscape 2016
Big Data Landscape 2016Big Data Landscape 2016
Big Data Landscape 2016
 
Get more from Analytics 360 with BigQuery and the Google Cloud Platform
Get more from Analytics 360 with BigQuery and the Google Cloud PlatformGet more from Analytics 360 with BigQuery and the Google Cloud Platform
Get more from Analytics 360 with BigQuery and the Google Cloud Platform
 
S06 Azure バックアップを利用した Microsoft Azure 仮想マシンのバックアップ
S06 Azure バックアップを利用した Microsoft Azure 仮想マシンのバックアップS06 Azure バックアップを利用した Microsoft Azure 仮想マシンのバックアップ
S06 Azure バックアップを利用した Microsoft Azure 仮想マシンのバックアップ
 
Watsonでメール監視
Watsonでメール監視Watsonでメール監視
Watsonでメール監視
 
PythonとYAMLでGCPをDeploy!「Google Cloud Deployment Manager」
PythonとYAMLでGCPをDeploy!「Google Cloud Deployment Manager」PythonとYAMLでGCPをDeploy!「Google Cloud Deployment Manager」
PythonとYAMLでGCPをDeploy!「Google Cloud Deployment Manager」
 

Semelhante a IoT NY - Google Cloud Services for IoT

Critical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsCritical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsData Driven Innovation
 
Data Ingestion in Big Data and IoT platforms
Data Ingestion in Big Data and IoT platformsData Ingestion in Big Data and IoT platforms
Data Ingestion in Big Data and IoT platformsGuido Schmutz
 
Eric Andersen Keynote
Eric Andersen KeynoteEric Andersen Keynote
Eric Andersen KeynoteData Con LA
 
Predix Builder Roadshow
Predix Builder RoadshowPredix Builder Roadshow
Predix Builder RoadshowPredix
 
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...Alluxio, Inc.
 
How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud DataflowHow to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud DataflowLucas Arruda
 
TDC2017 | São Paulo - Trilha BigData How we figured out we had a SRE team at ...
TDC2017 | São Paulo - Trilha BigData How we figured out we had a SRE team at ...TDC2017 | São Paulo - Trilha BigData How we figured out we had a SRE team at ...
TDC2017 | São Paulo - Trilha BigData How we figured out we had a SRE team at ...tdc-globalcode
 
Analyzing petabytes of smartmeter data using Cloud Bigtable, Cloud Dataflow, ...
Analyzing petabytes of smartmeter data using Cloud Bigtable, Cloud Dataflow, ...Analyzing petabytes of smartmeter data using Cloud Bigtable, Cloud Dataflow, ...
Analyzing petabytes of smartmeter data using Cloud Bigtable, Cloud Dataflow, ...Edwin Poot
 
Voxxed Days Cluj - Powering interactive data analysis with Google BigQuery
Voxxed Days Cluj - Powering interactive data analysis with Google BigQueryVoxxed Days Cluj - Powering interactive data analysis with Google BigQuery
Voxxed Days Cluj - Powering interactive data analysis with Google BigQueryMárton Kodok
 
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQueryCodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQueryMárton Kodok
 
Bridge to Cloud: Using Apache Kafka to Migrate to GCP
Bridge to Cloud: Using Apache Kafka to Migrate to GCPBridge to Cloud: Using Apache Kafka to Migrate to GCP
Bridge to Cloud: Using Apache Kafka to Migrate to GCPconfluent
 
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and HadoopGoogle Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoophuguk
 
Cloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and FastCloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and FastDatabricks
 
Workshop on Google Cloud Data Platform
Workshop on Google Cloud Data PlatformWorkshop on Google Cloud Data Platform
Workshop on Google Cloud Data PlatformGoDataDriven
 
GDG DevFest Ukraine - Powering Interactive Data Analysis with Google BigQuery
GDG DevFest Ukraine - Powering Interactive Data Analysis with Google BigQueryGDG DevFest Ukraine - Powering Interactive Data Analysis with Google BigQuery
GDG DevFest Ukraine - Powering Interactive Data Analysis with Google BigQueryMárton Kodok
 
Google Cloud Dataflow Two Worlds Become a Much Better One
Google Cloud Dataflow Two Worlds Become a Much Better OneGoogle Cloud Dataflow Two Worlds Become a Much Better One
Google Cloud Dataflow Two Worlds Become a Much Better OneDataWorks Summit
 
Making advanced analytics accessible to more companies
Making advanced analytics accessible to more companiesMaking advanced analytics accessible to more companies
Making advanced analytics accessible to more companiesMárton Kodok
 
VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...
VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...
VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...Márton Kodok
 
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...Alex Liu
 

Semelhante a IoT NY - Google Cloud Services for IoT (20)

Critical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsCritical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and Analytics
 
Data Ingestion in Big Data and IoT platforms
Data Ingestion in Big Data and IoT platformsData Ingestion in Big Data and IoT platforms
Data Ingestion in Big Data and IoT platforms
 
Eric Andersen Keynote
Eric Andersen KeynoteEric Andersen Keynote
Eric Andersen Keynote
 
Predix Builder Roadshow
Predix Builder RoadshowPredix Builder Roadshow
Predix Builder Roadshow
 
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
Integrating Google Cloud Dataproc with Alluxio for faster performance in the ...
 
How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud DataflowHow to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
How to build an ETL pipeline with Apache Beam on Google Cloud Dataflow
 
TDC2017 | São Paulo - Trilha BigData How we figured out we had a SRE team at ...
TDC2017 | São Paulo - Trilha BigData How we figured out we had a SRE team at ...TDC2017 | São Paulo - Trilha BigData How we figured out we had a SRE team at ...
TDC2017 | São Paulo - Trilha BigData How we figured out we had a SRE team at ...
 
Analyzing petabytes of smartmeter data using Cloud Bigtable, Cloud Dataflow, ...
Analyzing petabytes of smartmeter data using Cloud Bigtable, Cloud Dataflow, ...Analyzing petabytes of smartmeter data using Cloud Bigtable, Cloud Dataflow, ...
Analyzing petabytes of smartmeter data using Cloud Bigtable, Cloud Dataflow, ...
 
Voxxed Days Cluj - Powering interactive data analysis with Google BigQuery
Voxxed Days Cluj - Powering interactive data analysis with Google BigQueryVoxxed Days Cluj - Powering interactive data analysis with Google BigQuery
Voxxed Days Cluj - Powering interactive data analysis with Google BigQuery
 
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQueryCodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
CodeCamp Iasi - Creating serverless data analytics system on GCP using BigQuery
 
Bridge to Cloud: Using Apache Kafka to Migrate to GCP
Bridge to Cloud: Using Apache Kafka to Migrate to GCPBridge to Cloud: Using Apache Kafka to Migrate to GCP
Bridge to Cloud: Using Apache Kafka to Migrate to GCP
 
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and HadoopGoogle Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
Google Cloud Dataproc - Easier, faster, more cost-effective Spark and Hadoop
 
Cloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and FastCloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and Fast
 
Workshop on Google Cloud Data Platform
Workshop on Google Cloud Data PlatformWorkshop on Google Cloud Data Platform
Workshop on Google Cloud Data Platform
 
GDG DevFest Ukraine - Powering Interactive Data Analysis with Google BigQuery
GDG DevFest Ukraine - Powering Interactive Data Analysis with Google BigQueryGDG DevFest Ukraine - Powering Interactive Data Analysis with Google BigQuery
GDG DevFest Ukraine - Powering Interactive Data Analysis with Google BigQuery
 
Google Cloud Dataflow Two Worlds Become a Much Better One
Google Cloud Dataflow Two Worlds Become a Much Better OneGoogle Cloud Dataflow Two Worlds Become a Much Better One
Google Cloud Dataflow Two Worlds Become a Much Better One
 
Big Data Ready Enterprise
Big Data Ready Enterprise Big Data Ready Enterprise
Big Data Ready Enterprise
 
Making advanced analytics accessible to more companies
Making advanced analytics accessible to more companiesMaking advanced analytics accessible to more companies
Making advanced analytics accessible to more companies
 
VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...
VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...
VoxxedDays Bucharest 2017 - Powering interactive data analysis with Google Bi...
 
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
BUILDING BETTER PREDICTIVE MODELS WITH COGNITIVE ASSISTANCE IN A DATA SCIENCE...
 

Último

Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 

Último (20)

Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 

IoT NY - Google Cloud Services for IoT

  • 1. IoT NY - Cloud services for IoT James Chittenden Google Cloud Platform Solutions Engineer jameschi@google.com
  • 2. +James Chittenden (Big Data Cloud Engineer) jameschi@google.com
  • 3. Google confidential │ Do not distribute Agenda Big Data the Cloud Way - Why would you ? Fully Managed: NoOps Ingest, Process & Analyse Hands On Demo: Building an Event Streaming Pipeline 1 2 3
  • 4. Big Data at Google aka. Data at Google
  • 5. 20-?? BILLION devices will be connected by 2020 $4-11 Trillion Economic Impact 54% of top performer companies will invest more in sensors this yr Sources: Gartner, PwC, McKinsey
  • 6. 20-?? BILLION devices will be connected by 2020 $4-11 Trillion Economic Impact 54% of top performer companies will invest more in sensors this yr Sources: Gartner, PwC, McKinsey
  • 7. What is IoT? IoT is a period of transformation Phone IoT Phone
  • 9. Back in the 70s ….
  • 12. A datacenter is not a collection of computers, a datacenter is a computer. The same is happening in the Cloud today
  • 13. State of the art Data Centers. For the past 17 years, Google has been building out the world’s fastest, most powerful, highest quality cloud infrastructure on the planet.
  • 14. 2002 2004 2006 2008 2010 2012 Dremel ColossusMapReduce GFS Bigtable Spanner 2014 Dataflow Google’s Big Data Innovations go far back Flumejava BigQuery Millwheel Bigtable
  • 15. Extends the Android platform to IoT devices
  • 16. Weave - IoT Protocol and Schema
  • 18. Nest - solutions for the connected home
  • 19.
  • 21. Confidential & ProprietaryGoogle Cloud Platform 21 Management Mobile Services Compute Big Data Networking Storage Developer Tools
  • 22. Fully Managed: NoOps Ingest, Process & Analyze
  • 23. StoreCapture Analyze BigQuery Process DataflowCloud Storage DatastoreCloud SQL Hadoop/SparkKafka Pub/Sub Hadoop/Spark Manage the Entire Lifecycle of Big Data
  • 24. Dataflow BigQuery Fast ETL Regex JSON UDFs Spreadsheets BI Tools Coworkers Applications + Reports PubSub Cloud Storage BigTable Your Data GCS-Hadoop Connector Hadoop on Compute Engine Cloud Dataproc unmanaged managed Big Data Architecture with Google managed services
  • 25. Building what’s next 25 Scales automatically No setup or administration Stream up to 100,000 rows p/sec Easily integrates with third-party software Google BigQuery makes complex data analysis simple
  • 26. Question: Find root cause why ad was or was not delivered in the last 30 days. select date, rejection_reason, count(*) from line_item_table. last30days where line_item_id=56781234 1.2B Rows scanned Result in ~5 seconds! BigQuery Use @Google: DoubleClick Support
  • 27. BigQuery scales “Google scale” Streaming ingest at peak Largest Data Lake on BigQuery Largest query by data size Largest query by rows 10.5 Trillion rows 2.3 Million rows per second 38 Petabytes 2.1 Petabytes
  • 28. What is BigQuery? Externalization of Google Dremel Convenience of SQL Petabyte-Scale and Fast Fully Managed, No-Ops Data Warehouse
  • 29. Building what’s next 29 Merges batch and stream processing Data processing pipelines Monitoring interface Significantly lower cost Runs on Google or Cloudera Spark (Github) Google Cloud Dataflow makes complex data analysis simple
  • 30. What is Cloud Dataflow? Cloud Dataflow is a collection of SDKs for building batch or streaming parallelized data processing pipelines. Cloud Dataflow is a fully managed service for executing optimized parallelized data processing pipelines.
  • 31. Cloud Pub/Sub • Globally redundant • Low latency (sub sec.) • Batched read/write • Custom labels • Push & Pull • Auto expiration Publisher A Publisher B Publisher C Message 1 Topic A Topic B Topic C Subscription XA Subscription XB Subscription YC Subscription ZC Cloud Pub/Sub Subscriber X Subscriber Y Message 2 Message 3 Subscriber Z Message 1 Message 2 Message 3 Message 3
  • 32. Dataflow goodies Autoscaling mid-job Fully managed - No-Ops Intuitive Data Processing Framework Batch and Stream Processing in one Liquid sharding mid-job 1 2 3 4 5 Pipeline p = Pipeline.create(); p.begin() .apply(TextIO.Read.from(“gs://…”)) .apply(ParDo.of(new ExtractTags()) .apply(Count.create()) .apply(ParDo.of(new ExpandPrefixes()) .apply(Top.largestPerKey(3)) .apply(TextIO.Write.to(“gs://…”)); p.run();
  • 33. Dataflow goodies Autoscaling mid-job Fully managed - No-Ops Intuitive Data Processing Framework Batch and Stream Processing in one Liquid sharding mid-job 1 2 3 4 5 Deploy Schedule & Monitor
  • 34. Autoscaling mid-job Fully managed - No-Ops Intuitive Data Processing Framework Batch and Stream Processing in one Liquid sharding mid-job 1 2 3 4 5 Dataflow goodies 800 RPS 1200 RPS 5000 RPS 50 RPS
  • 35. Autoscaling mid-job Fully managed - No-Ops Intuitive Data Processing Framework Batch and Stream Processing in one Liquid sharding mid-job 1 2 3 4 5 Dataflow goodies
  • 36. Autoscaling mid-job Fully managed - No-Ops Intuitive Data Processing Framework Batch and Stream Processing in one Liquid sharding mid-job 1 2 3 4 5 Dataflow goodies Pipeline p = Pipeline.create(); p.begin() .apply(TextIO.Read.from(“gs://…”)) .apply(ParDo.of(new ExtractTags()) .apply(Count.create()) .apply(ParDo.of(new ExpandPrefixes()) .apply(Top.largestPerKey(3)) .apply(TextIO.Write.to(“gs://…”)); p.run(); .apply(PubsubIO.Read.from(“input_topic”)) .apply(Window.<Integer>by(FixedWindows.of(5, MINUTES)) .apply(PubsubIO.Write.to(“output_topic”));
  • 37. Autoscaling mid-job Fully managed - No-Ops Intuitive Data Processing Framework Batch and Stream Processing in one Liquid sharding mid-job 1 2 3 4 5 Dataflow goodies Nighttime Mid-Day Nighttime
  • 38.
  • 39. Demo Time Pub/Sub Ingest Process Analyse Cloud Dataflow BigQuery Git: https://github.com/james-google/event-streams-dataflow
  • 40. Demo Time Pub/Sub Ingest Process Analyse Cloud Dataflow BigQuery Git: https://github.com/james-google/event-streams-dataflow