SlideShare uma empresa Scribd logo
1 de 27
Baixar para ler offline
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Market Prediction Using ML
Experiment with Amazon SageMaker and the Deutsche Börse Public Dataset
Binoy Das
Partner Solution Architect
Amazon Web Services
F S V 3 0 9
Dave Rocamora
Partner Solution Architect
Amazon Web Service
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Our mission
Put machine learning
in the hands of every
developer,
data scientist, and
architect
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda
 Overview of Amazon SageMaker
 Overview of Deutsche Börse Public Data Set
 Overview of Amazon Athena and Amazon QuickSight
 Hands-on—data preparation
 Hands-on—data analysis
 Hands-on—custom container on Amazon Elastic Container
Registry (Amazon ECR) to implement RNN model
 Hands-on—Use DeepAR on Amazon SageMaker
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The Amazon Machine Learning stack
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon SageMaker
1
I
Notebook instances
2
I
Algorithms
3
I
ML training service
4
I
ML hosting service
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
End-to-end machine
learning platform
Zero setup Flexible model
training
Pay by the second
Amazon SageMaker—Value proposition
The quickest and easiest way to get ML models from idea to production
$
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Stock market data maintained by Deutsche Börse
• Opening, closing, highest and lowest price, and trading
volume for every security traded
Published on Registry of Open Data on AWS
https://registry.opendata.aws/deutsche-boerse-pds/
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Deutsche Börse Public Data Set
Source:
s3://deutsche-boerse-xetra-pds (for Xetra data)
s3://deutsche-boerse-eurex-pds (for Eurex data)
Data dictionary:
Field Description Type
Mnemonic Stock exchange ticker symbol Text
Date Date of trading period Date
Time Minute of trading to which this entry relates Time
StartPrice Trading price at the start of period Decimal
MaxPrice Maximum price over the period Decimal
MinPrice Minimum price over the period Decimal
EndPrice Trading price at the end of the period Decimal
TradedVolume Total value traded Decimal
NumberOfTrades Number of distinct trades during the period Integer
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Serverless data lake & analytics with AWS
Crawlers scan your data sets and populate the AWS Glue Data Catalog
The AWS Glue Data Catalog serves as a central metadata repository
Once catalogued in AWS Glue, your data is immediately available for analytics
1
2
3
Amazon
Redshift
Spectrum
Amazon
EMR
Amazon
Athena
AWS Glue
Data Catalog
AWS Glue
crawler
Amazon S3 Amazon
QuickSight
1 2
3
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Visualization for a trading analyst
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Begin with a notebook
• First step: create lifecycle configurations
• Clone from GitHub
• Create folder structures
• Obtain cleaned up data
https://github.com/aws-samples/db-reinvent-workshop
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Explore stock behavior
• Generate database
• Create AWS Glue crawler
• Create tables
• Queries for introspection
• Create Athena queries
• Create custom views
• Stock visualization
• Create Amazon QuickSight dashboard
• Explore similarities in behavior
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Clustering of similar stocks
• Clustering algorithm
• Run custom clustering algorithm
• Find correlation
• Group clustered stocks
• Retrieve metrices for clustered stocks
• Create separate data files for clusters
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Containerize custom MLP code
• Data channels
• Peek into data
• Load to Amazon SageMaker S3 bucket
• Container image
• Bundle code and libraries
• Create Amazon ECR repository
• Model training
• Create Amazon SageMaker estimator
• Fit estimator to the data
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Deploy custom MLP model
• Model deployment
• Create Amazon SageMaker predictor
• Deploy to Amazon SageMaker endpoint
• Inference generation
• Generate prediction with test data
• Visualize prediction against observed
• Performance evaluation
• Measure accuracy
• Scope for improvement
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Train DeepAR model
• DeepAR data set
• Format clustered data for DeepAR
• Create training and test data
• Model training
• Create Amazon SageMaker training job
• Observe training behavior
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Deploy DeepAR model
• Model deployment
• Create endpoint configuration
• Create Amazon SageMaker endpoint
• Inference generation
• Generate prediction with test data
• Visualize prediction against observed
• Performance evaluation
• Measure accuracy
• Repeat with different cluster
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Binoy Das
binoyd@amazon.com
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Mais conteúdo relacionado

Mais procurados

M&E Leadership Session: The State of the Industry, What's New from AWS for M&...
M&E Leadership Session: The State of the Industry, What's New from AWS for M&...M&E Leadership Session: The State of the Industry, What's New from AWS for M&...
M&E Leadership Session: The State of the Industry, What's New from AWS for M&...Amazon Web Services
 
Deep Learning Applications Using PyTorch, Featuring Facebook (AIM402-R) - AWS...
Deep Learning Applications Using PyTorch, Featuring Facebook (AIM402-R) - AWS...Deep Learning Applications Using PyTorch, Featuring Facebook (AIM402-R) - AWS...
Deep Learning Applications Using PyTorch, Featuring Facebook (AIM402-R) - AWS...Amazon Web Services
 
Build a Searchable Media Library & Moderate Content at Scale Using Machine Le...
Build a Searchable Media Library & Moderate Content at Scale Using Machine Le...Build a Searchable Media Library & Moderate Content at Scale Using Machine Le...
Build a Searchable Media Library & Moderate Content at Scale Using Machine Le...Amazon Web Services
 
Accelerate ML Training on Amazon SageMaker Using GPU-Based EC2 P3 Instances (...
Accelerate ML Training on Amazon SageMaker Using GPU-Based EC2 P3 Instances (...Accelerate ML Training on Amazon SageMaker Using GPU-Based EC2 P3 Instances (...
Accelerate ML Training on Amazon SageMaker Using GPU-Based EC2 P3 Instances (...Amazon Web Services
 
Build, Train, and Deploy ML Models Quickly and Easily with Amazon SageMaker, ...
Build, Train, and Deploy ML Models Quickly and Easily with Amazon SageMaker, ...Build, Train, and Deploy ML Models Quickly and Easily with Amazon SageMaker, ...
Build, Train, and Deploy ML Models Quickly and Easily with Amazon SageMaker, ...Amazon Web Services
 
Using Amazon SageMaker and AWS DeepLens with Teams New to Machine Learning (G...
Using Amazon SageMaker and AWS DeepLens with Teams New to Machine Learning (G...Using Amazon SageMaker and AWS DeepLens with Teams New to Machine Learning (G...
Using Amazon SageMaker and AWS DeepLens with Teams New to Machine Learning (G...Amazon Web Services
 
How Verizon is Accelerating Cloud Adoption and Migration with the AWS Service...
How Verizon is Accelerating Cloud Adoption and Migration with the AWS Service...How Verizon is Accelerating Cloud Adoption and Migration with the AWS Service...
How Verizon is Accelerating Cloud Adoption and Migration with the AWS Service...Amazon Web Services
 
Capture Voice of Customer Insights with NLP & Analytics (AIM415-R1) - AWS re:...
Capture Voice of Customer Insights with NLP & Analytics (AIM415-R1) - AWS re:...Capture Voice of Customer Insights with NLP & Analytics (AIM415-R1) - AWS re:...
Capture Voice of Customer Insights with NLP & Analytics (AIM415-R1) - AWS re:...Amazon Web Services
 
Machine Learning for Improving Disaster Management and Response (WPS313) - AW...
Machine Learning for Improving Disaster Management and Response (WPS313) - AW...Machine Learning for Improving Disaster Management and Response (WPS313) - AW...
Machine Learning for Improving Disaster Management and Response (WPS313) - AW...Amazon Web Services
 
Three Lessons from “Escape the Room” That Apply to Making Money with Your Ale...
Three Lessons from “Escape the Room” That Apply to Making Money with Your Ale...Three Lessons from “Escape the Room” That Apply to Making Money with Your Ale...
Three Lessons from “Escape the Room” That Apply to Making Money with Your Ale...Amazon Web Services
 
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...Amazon Web Services
 
Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018
Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018
Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018Amazon Web Services
 
Configuration Management and Service Discovery with AWS Lambda (SRV338-R1) - ...
Configuration Management and Service Discovery with AWS Lambda (SRV338-R1) - ...Configuration Management and Service Discovery with AWS Lambda (SRV338-R1) - ...
Configuration Management and Service Discovery with AWS Lambda (SRV338-R1) - ...Amazon Web Services
 
Go Global with Cloud-Native Architecture: Deploy AdTech Services Across Four ...
Go Global with Cloud-Native Architecture: Deploy AdTech Services Across Four ...Go Global with Cloud-Native Architecture: Deploy AdTech Services Across Four ...
Go Global with Cloud-Native Architecture: Deploy AdTech Services Across Four ...Amazon Web Services
 
AWS Startup Day Kyiv - AI/ML services for developers
AWS Startup Day Kyiv - AI/ML services for developersAWS Startup Day Kyiv - AI/ML services for developers
AWS Startup Day Kyiv - AI/ML services for developersAmazon Web Services
 
Five New Security Automations Using AWS Security Services & Open Source (SEC4...
Five New Security Automations Using AWS Security Services & Open Source (SEC4...Five New Security Automations Using AWS Security Services & Open Source (SEC4...
Five New Security Automations Using AWS Security Services & Open Source (SEC4...Amazon Web Services
 
Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...
Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...
Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...Amazon Web Services
 
Real-Time Personalized Customer Experiences at Bonobos (RET203) - AWS re:Inve...
Real-Time Personalized Customer Experiences at Bonobos (RET203) - AWS re:Inve...Real-Time Personalized Customer Experiences at Bonobos (RET203) - AWS re:Inve...
Real-Time Personalized Customer Experiences at Bonobos (RET203) - AWS re:Inve...Amazon Web Services
 
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...Amazon Web Services
 
Scale Your Studio: Rendering with Spot and Deadline on AWS (CMP202) - AWS re:...
Scale Your Studio: Rendering with Spot and Deadline on AWS (CMP202) - AWS re:...Scale Your Studio: Rendering with Spot and Deadline on AWS (CMP202) - AWS re:...
Scale Your Studio: Rendering with Spot and Deadline on AWS (CMP202) - AWS re:...Amazon Web Services
 

Mais procurados (20)

M&E Leadership Session: The State of the Industry, What's New from AWS for M&...
M&E Leadership Session: The State of the Industry, What's New from AWS for M&...M&E Leadership Session: The State of the Industry, What's New from AWS for M&...
M&E Leadership Session: The State of the Industry, What's New from AWS for M&...
 
Deep Learning Applications Using PyTorch, Featuring Facebook (AIM402-R) - AWS...
Deep Learning Applications Using PyTorch, Featuring Facebook (AIM402-R) - AWS...Deep Learning Applications Using PyTorch, Featuring Facebook (AIM402-R) - AWS...
Deep Learning Applications Using PyTorch, Featuring Facebook (AIM402-R) - AWS...
 
Build a Searchable Media Library & Moderate Content at Scale Using Machine Le...
Build a Searchable Media Library & Moderate Content at Scale Using Machine Le...Build a Searchable Media Library & Moderate Content at Scale Using Machine Le...
Build a Searchable Media Library & Moderate Content at Scale Using Machine Le...
 
Accelerate ML Training on Amazon SageMaker Using GPU-Based EC2 P3 Instances (...
Accelerate ML Training on Amazon SageMaker Using GPU-Based EC2 P3 Instances (...Accelerate ML Training on Amazon SageMaker Using GPU-Based EC2 P3 Instances (...
Accelerate ML Training on Amazon SageMaker Using GPU-Based EC2 P3 Instances (...
 
Build, Train, and Deploy ML Models Quickly and Easily with Amazon SageMaker, ...
Build, Train, and Deploy ML Models Quickly and Easily with Amazon SageMaker, ...Build, Train, and Deploy ML Models Quickly and Easily with Amazon SageMaker, ...
Build, Train, and Deploy ML Models Quickly and Easily with Amazon SageMaker, ...
 
Using Amazon SageMaker and AWS DeepLens with Teams New to Machine Learning (G...
Using Amazon SageMaker and AWS DeepLens with Teams New to Machine Learning (G...Using Amazon SageMaker and AWS DeepLens with Teams New to Machine Learning (G...
Using Amazon SageMaker and AWS DeepLens with Teams New to Machine Learning (G...
 
How Verizon is Accelerating Cloud Adoption and Migration with the AWS Service...
How Verizon is Accelerating Cloud Adoption and Migration with the AWS Service...How Verizon is Accelerating Cloud Adoption and Migration with the AWS Service...
How Verizon is Accelerating Cloud Adoption and Migration with the AWS Service...
 
Capture Voice of Customer Insights with NLP & Analytics (AIM415-R1) - AWS re:...
Capture Voice of Customer Insights with NLP & Analytics (AIM415-R1) - AWS re:...Capture Voice of Customer Insights with NLP & Analytics (AIM415-R1) - AWS re:...
Capture Voice of Customer Insights with NLP & Analytics (AIM415-R1) - AWS re:...
 
Machine Learning for Improving Disaster Management and Response (WPS313) - AW...
Machine Learning for Improving Disaster Management and Response (WPS313) - AW...Machine Learning for Improving Disaster Management and Response (WPS313) - AW...
Machine Learning for Improving Disaster Management and Response (WPS313) - AW...
 
Three Lessons from “Escape the Room” That Apply to Making Money with Your Ale...
Three Lessons from “Escape the Room” That Apply to Making Money with Your Ale...Three Lessons from “Escape the Room” That Apply to Making Money with Your Ale...
Three Lessons from “Escape the Room” That Apply to Making Money with Your Ale...
 
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
 
Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018
Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018
Detect Anomalies Using Amazon SageMaker (AIM420) - AWS re:Invent 2018
 
Configuration Management and Service Discovery with AWS Lambda (SRV338-R1) - ...
Configuration Management and Service Discovery with AWS Lambda (SRV338-R1) - ...Configuration Management and Service Discovery with AWS Lambda (SRV338-R1) - ...
Configuration Management and Service Discovery with AWS Lambda (SRV338-R1) - ...
 
Go Global with Cloud-Native Architecture: Deploy AdTech Services Across Four ...
Go Global with Cloud-Native Architecture: Deploy AdTech Services Across Four ...Go Global with Cloud-Native Architecture: Deploy AdTech Services Across Four ...
Go Global with Cloud-Native Architecture: Deploy AdTech Services Across Four ...
 
AWS Startup Day Kyiv - AI/ML services for developers
AWS Startup Day Kyiv - AI/ML services for developersAWS Startup Day Kyiv - AI/ML services for developers
AWS Startup Day Kyiv - AI/ML services for developers
 
Five New Security Automations Using AWS Security Services & Open Source (SEC4...
Five New Security Automations Using AWS Security Services & Open Source (SEC4...Five New Security Automations Using AWS Security Services & Open Source (SEC4...
Five New Security Automations Using AWS Security Services & Open Source (SEC4...
 
Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...
Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...
Leadership Session: Digital Advertising - Customer Learning & the Road Ahead ...
 
Real-Time Personalized Customer Experiences at Bonobos (RET203) - AWS re:Inve...
Real-Time Personalized Customer Experiences at Bonobos (RET203) - AWS re:Inve...Real-Time Personalized Customer Experiences at Bonobos (RET203) - AWS re:Inve...
Real-Time Personalized Customer Experiences at Bonobos (RET203) - AWS re:Inve...
 
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...
 
Scale Your Studio: Rendering with Spot and Deadline on AWS (CMP202) - AWS re:...
Scale Your Studio: Rendering with Spot and Deadline on AWS (CMP202) - AWS re:...Scale Your Studio: Rendering with Spot and Deadline on AWS (CMP202) - AWS re:...
Scale Your Studio: Rendering with Spot and Deadline on AWS (CMP202) - AWS re:...
 

Semelhante a Market Prediction Using ML: Experiment with Amazon SageMaker and the Deutsche Börse Public Dataset (FSV309) - AWS re:Invent 2018

Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018
Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018
Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018Amazon Web Services
 
AWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAmazon Web Services
 
Building a Modern Data Platform in the Cloud
Building a Modern Data Platform in the CloudBuilding a Modern Data Platform in the Cloud
Building a Modern Data Platform in the CloudAmazon Web Services
 
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdfBuilding+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdfSasikumarPalanivel3
 
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdfBuilding+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdfsaidbilgen
 
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018Amazon Web Services
 
Implementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdfImplementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdfAmazon Web Services
 
Wild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
Wild Rydes with Big Data/Kinesis focus: AWS Serverless WorkshopWild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
Wild Rydes with Big Data/Kinesis focus: AWS Serverless WorkshopAWS Germany
 
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...Amazon Web Services
 
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...Amazon Web Services
 
Data Transformation Patterns in AWS - AWS Online Tech Talks
Data Transformation Patterns in AWS - AWS Online Tech TalksData Transformation Patterns in AWS - AWS Online Tech Talks
Data Transformation Patterns in AWS - AWS Online Tech TalksAmazon Web Services
 
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...AWS Riyadh User Group
 
AWS and Symantec: Cyber Defense at Scale (SEC311-S) - AWS re:Invent 2018
AWS and Symantec: Cyber Defense at Scale (SEC311-S) - AWS re:Invent 2018AWS and Symantec: Cyber Defense at Scale (SEC311-S) - AWS re:Invent 2018
AWS and Symantec: Cyber Defense at Scale (SEC311-S) - AWS re:Invent 2018Amazon Web Services
 
Build your own log analytics solution on AWS - ADB301 - Atlanta AWS Summit
Build your own log analytics solution on AWS - ADB301 - Atlanta AWS SummitBuild your own log analytics solution on AWS - ADB301 - Atlanta AWS Summit
Build your own log analytics solution on AWS - ADB301 - Atlanta AWS SummitAmazon Web Services
 
Build, Deploy, and Serve Machine-Learning Models on Streaming Data Using Amaz...
Build, Deploy, and Serve Machine-Learning Models on Streaming Data Using Amaz...Build, Deploy, and Serve Machine-Learning Models on Streaming Data Using Amaz...
Build, Deploy, and Serve Machine-Learning Models on Streaming Data Using Amaz...Amazon Web Services
 
Customizing Data Lakes to Work for Your Enterprise with Sysco (STG340) - AWS ...
Customizing Data Lakes to Work for Your Enterprise with Sysco (STG340) - AWS ...Customizing Data Lakes to Work for Your Enterprise with Sysco (STG340) - AWS ...
Customizing Data Lakes to Work for Your Enterprise with Sysco (STG340) - AWS ...Amazon Web Services
 
Building the Organization of the Future: Leveraging AI & ML
Building the Organization of the Future: Leveraging AI & ML Building the Organization of the Future: Leveraging AI & ML
Building the Organization of the Future: Leveraging AI & ML Amazon Web Services
 
Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Amazon Web Services
 

Semelhante a Market Prediction Using ML: Experiment with Amazon SageMaker and the Deutsche Börse Public Dataset (FSV309) - AWS re:Invent 2018 (20)

Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018
Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018
Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018
 
AWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scale
 
Building a Modern Data Platform in the Cloud
Building a Modern Data Platform in the CloudBuilding a Modern Data Platform in the Cloud
Building a Modern Data Platform in the Cloud
 
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdfBuilding+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
 
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdfBuilding+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
Building+your+Data+Project+on+AWS+-+Luke+Anderson.pdf
 
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018
Serverless Data Prep with AWS Glue (ANT313) - AWS re:Invent 2018
 
Implementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdfImplementazione di una soluzione Data Lake.pdf
Implementazione di una soluzione Data Lake.pdf
 
Wild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
Wild Rydes with Big Data/Kinesis focus: AWS Serverless WorkshopWild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
Wild Rydes with Big Data/Kinesis focus: AWS Serverless Workshop
 
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
Building Data Lakes That Cost Less and Deliver Results Faster - AWS Online Te...
 
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...
Data Lake Implementation: Processing and Querying Data in Place (STG204-R1) -...
 
Data Transformation Patterns in AWS - AWS Online Tech Talks
Data Transformation Patterns in AWS - AWS Online Tech TalksData Transformation Patterns in AWS - AWS Online Tech Talks
Data Transformation Patterns in AWS - AWS Online Tech Talks
 
Log Analytics with AWS
Log Analytics with AWSLog Analytics with AWS
Log Analytics with AWS
 
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
Cutting to the chase for Machine Learning Analytics Ecosystem & AWS Lake Form...
 
AWS and Symantec: Cyber Defense at Scale (SEC311-S) - AWS re:Invent 2018
AWS and Symantec: Cyber Defense at Scale (SEC311-S) - AWS re:Invent 2018AWS and Symantec: Cyber Defense at Scale (SEC311-S) - AWS re:Invent 2018
AWS and Symantec: Cyber Defense at Scale (SEC311-S) - AWS re:Invent 2018
 
Build your own log analytics solution on AWS - ADB301 - Atlanta AWS Summit
Build your own log analytics solution on AWS - ADB301 - Atlanta AWS SummitBuild your own log analytics solution on AWS - ADB301 - Atlanta AWS Summit
Build your own log analytics solution on AWS - ADB301 - Atlanta AWS Summit
 
Build, Deploy, and Serve Machine-Learning Models on Streaming Data Using Amaz...
Build, Deploy, and Serve Machine-Learning Models on Streaming Data Using Amaz...Build, Deploy, and Serve Machine-Learning Models on Streaming Data Using Amaz...
Build, Deploy, and Serve Machine-Learning Models on Streaming Data Using Amaz...
 
Customizing Data Lakes to Work for Your Enterprise with Sysco (STG340) - AWS ...
Customizing Data Lakes to Work for Your Enterprise with Sysco (STG340) - AWS ...Customizing Data Lakes to Work for Your Enterprise with Sysco (STG340) - AWS ...
Customizing Data Lakes to Work for Your Enterprise with Sysco (STG340) - AWS ...
 
Building the Organization of the Future: Leveraging AI & ML
Building the Organization of the Future: Leveraging AI & ML Building the Organization of the Future: Leveraging AI & ML
Building the Organization of the Future: Leveraging AI & ML
 
Log Analytics with AWS
Log Analytics with AWSLog Analytics with AWS
Log Analytics with AWS
 
Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28
 

Mais de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Mais de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Market Prediction Using ML: Experiment with Amazon SageMaker and the Deutsche Börse Public Dataset (FSV309) - AWS re:Invent 2018

  • 1.
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Market Prediction Using ML Experiment with Amazon SageMaker and the Deutsche Börse Public Dataset Binoy Das Partner Solution Architect Amazon Web Services F S V 3 0 9 Dave Rocamora Partner Solution Architect Amazon Web Service
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Our mission Put machine learning in the hands of every developer, data scientist, and architect
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Agenda  Overview of Amazon SageMaker  Overview of Deutsche Börse Public Data Set  Overview of Amazon Athena and Amazon QuickSight  Hands-on—data preparation  Hands-on—data analysis  Hands-on—custom container on Amazon Elastic Container Registry (Amazon ECR) to implement RNN model  Hands-on—Use DeepAR on Amazon SageMaker
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. The Amazon Machine Learning stack
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon SageMaker 1 I Notebook instances 2 I Algorithms 3 I ML training service 4 I ML hosting service
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. End-to-end machine learning platform Zero setup Flexible model training Pay by the second Amazon SageMaker—Value proposition The quickest and easiest way to get ML models from idea to production $
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Stock market data maintained by Deutsche Börse • Opening, closing, highest and lowest price, and trading volume for every security traded Published on Registry of Open Data on AWS https://registry.opendata.aws/deutsche-boerse-pds/
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Deutsche Börse Public Data Set Source: s3://deutsche-boerse-xetra-pds (for Xetra data) s3://deutsche-boerse-eurex-pds (for Eurex data) Data dictionary: Field Description Type Mnemonic Stock exchange ticker symbol Text Date Date of trading period Date Time Minute of trading to which this entry relates Time StartPrice Trading price at the start of period Decimal MaxPrice Maximum price over the period Decimal MinPrice Minimum price over the period Decimal EndPrice Trading price at the end of the period Decimal TradedVolume Total value traded Decimal NumberOfTrades Number of distinct trades during the period Integer
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Serverless data lake & analytics with AWS Crawlers scan your data sets and populate the AWS Glue Data Catalog The AWS Glue Data Catalog serves as a central metadata repository Once catalogued in AWS Glue, your data is immediately available for analytics 1 2 3 Amazon Redshift Spectrum Amazon EMR Amazon Athena AWS Glue Data Catalog AWS Glue crawler Amazon S3 Amazon QuickSight 1 2 3
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Visualization for a trading analyst
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Begin with a notebook • First step: create lifecycle configurations • Clone from GitHub • Create folder structures • Obtain cleaned up data https://github.com/aws-samples/db-reinvent-workshop
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Explore stock behavior • Generate database • Create AWS Glue crawler • Create tables • Queries for introspection • Create Athena queries • Create custom views • Stock visualization • Create Amazon QuickSight dashboard • Explore similarities in behavior
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Clustering of similar stocks • Clustering algorithm • Run custom clustering algorithm • Find correlation • Group clustered stocks • Retrieve metrices for clustered stocks • Create separate data files for clusters
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Containerize custom MLP code • Data channels • Peek into data • Load to Amazon SageMaker S3 bucket • Container image • Bundle code and libraries • Create Amazon ECR repository • Model training • Create Amazon SageMaker estimator • Fit estimator to the data
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Deploy custom MLP model • Model deployment • Create Amazon SageMaker predictor • Deploy to Amazon SageMaker endpoint • Inference generation • Generate prediction with test data • Visualize prediction against observed • Performance evaluation • Measure accuracy • Scope for improvement
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Train DeepAR model • DeepAR data set • Format clustered data for DeepAR • Create training and test data • Model training • Create Amazon SageMaker training job • Observe training behavior
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Deploy DeepAR model • Model deployment • Create endpoint configuration • Create Amazon SageMaker endpoint • Inference generation • Generate prediction with test data • Visualize prediction against observed • Performance evaluation • Measure accuracy • Repeat with different cluster
  • 26. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Binoy Das binoyd@amazon.com
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.