SlideShare a Scribd company logo
1 of 31
Download to read offline
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under a Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
SPRINGONE2GX
WASHINGTON, DC
Implementing a highly scalable
Stock prediction system with R,
Apache Geode and Spring XD
Fred Meloā€Ø
@fredmelo_br
William Markitoā€Ø
@william_markito
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
About us
Fred Melo
Technical Director for Data
fmelo@pivotal.io
@fredmelo_br
2
William Markito
Enterprise Architect for GemFire
wmarkito@pivotal.io
@william_markito
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 3
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 4
It's all about DATA
Data Sources
Look for patterns
Prediction
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
What do we want to build?
5
"Smart System"
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
ā€¦ in our specific case
6
Trading Data
"Smart System"
Historical Data
Repository
Learns with historical trends
"How were the medium average
price and relative strength
reading when the latest failures
happened? "
Live data
becomes
historical
over time
Real-Time
Evaluates live data
ā€œAccording to historical trends,
thereā€™s an 80% chance this stock
prices might go downhill within
the next hour"
Historical
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
ā€¦ in our specific case
7
Trading Data
"Smart System"
Historical Data
Repository
Learns with historical trends
"How were the medium average
price and relative strength reading
when the latest failures happened? "
Live data
becomes
historical
over time
Real-Time
Evaluates live data
ā€œAccording to historical trends,
thereā€™s an 80% chance this stock
prices might go downhill within the
next hour"
Historical
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 8
Live Data
Data
Temperature
Hot
Cold
Greenplum DB
Apache Geode / GemFire
1- Live data is ingested into the grid
3 - Results are pushed
immediately to deployed
applications
4 - ā€œHot" data ages,
becoming part of the
historical dataset
Machine Learning
model 5 - Re-training is triggered,
updating the model with
the latest historical data
Spring XD
Spring XD
The ML pipeline data flow
2 - Trained ML model compares
new data to historical patterns
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 9
Live Data
Apache Geode / GemFire
1- Live data is ingested into the grid
2 - Trained ML model compares
new data to historical patterns
3 - Results are pushed
immediately to deployed
applications
Machine Learning
model
4 - Re-training is triggered, updating
the model with the latest historical data
Spring XD
Spring XD
Simplified demo model Data
Temperature
Hot
Warm
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 10
Transform Sink
SpringXD
Extensible
Open-Source
Fault-Tolerant
Horizontally Scalable
Cloud-Native
Machine Learning
Enrich Filter
Split
Dashboard
Indicators
1
2
Predict
3
Real data
Simulator
/Stocks
/TechIndicators
/Predictions
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 11
Eating it in small bitesā€¦
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 12
SpringXD GemFire
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
ā€¢ Cache
ā€¢ Configurable through XML, ,Java
ā€¢ Region
ā€¢ Distributed j.u.Map on steroids
ā€¢ Highly available, redundant
ā€¢ Member
ā€¢ Locator, Server, Client
ā€¢ Callbacks
ā€¢ Listener, Writer, AsyncEventListener, Parallel/Serial
Apache Geode & GemFire Concepts
13
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
Apache Geode & GemFire, why ?
ā€¢ Performance
ā€¢ Consistency
ā€¢ Resiliency
14
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
Apache Geode & GemFire, why ?
15
Ā© Copyright 2014 Pivotal. All rights reserved.
Pivotal GemFire High Availability and Fault Tolerance in 6 acts
Failing data copies are
replaced transparently
Data is replicated to other
clusters and sites (WAN)
Network segmentations are
identiļ¬ed and ļ¬xed automatically
Client and cluster disconnections
are handled gracefully
Data is persisted on local
disk for ultimate durability
ā€œsplit brainā€
Failed function executions
are restarted automatically
restart
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
Some interesting casesā€¦
16
China Railwayā€Ø
Corporation
5,700 train stations
4.5 million tickets per day
20 million daily users
1.4 billion page views per day
40,000 visits per second
* http://pivotal.io/big-data/pivotal-gemfire
Indian Railways
7,000 stations
72,000 miles of track
23 million passengers daily
120,000 concurrent users
10,000 transactions per minute
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
Use cases and industries
17
Indian RailwaysChina Railway Corporation
World: ~7,349,000,000
~36% of the world population
Population: 1,251,695,6161,401,586,609
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
ā€¢ Commercial product available since 2004
ā€¢ Native clients in Java, C++, C#, REST
ā€¢ Event Subscriptions and Continuous
Queries
ā€¢ Configurable WAN Gateway between
clusters
ā€¢ Enterprise Support, commercial features
Apache Geode & Pivotal GemFire
ā€¢ Open Sourced in April/2015
ā€¢ Java Native Client, REST
ā€¢ 98% of GemFire API
ā€¢ Event subscriptions
ā€¢ ~30 contributors
ā€¢ Under Incubation
18
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 19
SpringXD GemFire
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
SpringXD Basic Concepts
ā€¢ Streams
ā€¢ Pipelines
ā€¢ Sources
ā€¢ Sinks
ā€¢ Filters
ā€¢ Taps
20
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
SpringXD Basic Concepts
21
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
A simple example
22
twittersearch --consumerKey=XXX ā€”consumerSecret=XXX --
query=SpringOne2GX --outputType=application/json | gemfire-json-
server --useLocator=true --host=localhost --port=10334 --
regionName=tweets --keyExpression=payload.getField('id_str')
twittersearch --query=SpringOne2GX | gemfire-json-server --host=localhost--regionName=tweets
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 23
SpringXD GemFire
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
Apache Spark Concepts
ā€¢ RDD
ā€¢ Dataframe
ā€¢ Driver
ā€¢ Worker
24
"An RDD in Spark is simply an immutable distributed collection of objects.
Each RDD is split into multiple partitions, which may be computed on different nodes
of the cluster. RDDs can contain any type of Python, Java, or Scala objects,
including user-deļ¬ned classes."
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
Apache Spark Concepts
ā€¢ RDD
ā€¢ Dataframe
ā€¢ Driver
ā€¢ Worker
25
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 26
medium
avg (x+1)
relative
strength (x)
medium avg (x)
price(x)
Machine Learning Model
(e.g. Linear Regression)
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 27
medium
avg (x+1)
relative
strength (x)
medium avg (x)
price(x)
Machine Learning Model
(e.g. Linear Regression)
Features Label
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 28
Transform Sink
SpringXD
Extensible
Open-Source
Fault-Tolerant
Horizontally Scalable
Cloud-Native
Machine Learning
Enrich Filter
Split
Dashboard
Indicators
1
2
Predict
3
Real data
Simulator
/Stocks
/TechIndicators
/Predictions
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 29
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
Learn more!
30
https://github.com/Pivotal-Open-Source-Hub/geode-security-samples
https://github.com/Pivotal-Open-Source-Hub/WifiAnalyticsIoT
https://github.com/Pivotal-Open-Source-Hub/geode-social-demo
http://pivotal-open-source-hub.github.io/StockInference-Spark/
Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø
Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/
Thank you
31
@william_markito @fredmelo_br
Related: Building Highly-Scalable Spring Applications
with In-Memory, Distributed Data Grids
by John Blum & Luke Shannon
September 15, 2015 -10:30 - Salon M
http://pivotal-open-source-hub.github.io/StockInference-Spark/

More Related Content

What's hot

Spark sql meetup
Spark sql meetupSpark sql meetup
Spark sql meetupMichael Zhang
Ā 
Pivotal HAWQ į„‰į…©į„€į…¢
Pivotal HAWQ į„‰į…©į„€į…¢Pivotal HAWQ į„‰į…©į„€į…¢
Pivotal HAWQ į„‰į…©į„€į…¢Seungdon Choi
Ā 
Analyzing the World's Largest Security Data Lake!
Analyzing the World's Largest Security Data Lake!Analyzing the World's Largest Security Data Lake!
Analyzing the World's Largest Security Data Lake!DataWorks Summit
Ā 
Building Audiā€™s enterprise big data platform
Building Audiā€™s enterprise big data platformBuilding Audiā€™s enterprise big data platform
Building Audiā€™s enterprise big data platformDataWorks Summit
Ā 
Empowering Zillowā€™s Developers with Self-Service ETL
Empowering Zillowā€™s Developers with Self-Service ETLEmpowering Zillowā€™s Developers with Self-Service ETL
Empowering Zillowā€™s Developers with Self-Service ETLDatabricks
Ā 
Introduction to Big Data Analytics using Apache Spark and Zeppelin on HDInsig...
Introduction to Big Data Analytics using Apache Spark and Zeppelin on HDInsig...Introduction to Big Data Analytics using Apache Spark and Zeppelin on HDInsig...
Introduction to Big Data Analytics using Apache Spark and Zeppelin on HDInsig...Alex Zeltov
Ā 
EDB's Migration Portal - Migrate from Oracle to Postgres
EDB's Migration Portal - Migrate from Oracle to PostgresEDB's Migration Portal - Migrate from Oracle to Postgres
EDB's Migration Portal - Migrate from Oracle to PostgresEDB
Ā 
Spark Development Lifecycle at Workday - ApacheCon 2020
Spark Development Lifecycle at Workday - ApacheCon 2020Spark Development Lifecycle at Workday - ApacheCon 2020
Spark Development Lifecycle at Workday - ApacheCon 2020Pavel Hardak
Ā 
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...DataWorks Summit/Hadoop Summit
Ā 
SAMā€”streaming analytics made easy
SAMā€”streaming analytics made easySAMā€”streaming analytics made easy
SAMā€”streaming analytics made easyDataWorks Summit
Ā 
Big Data Meets Learning Science: Keynote by Al Essa
Big Data Meets Learning Science: Keynote by Al EssaBig Data Meets Learning Science: Keynote by Al Essa
Big Data Meets Learning Science: Keynote by Al EssaSpark Summit
Ā 
Improving Python and Spark Performance and Interoperability with Apache Arrow...
Improving Python and Spark Performance and Interoperability with Apache Arrow...Improving Python and Spark Performance and Interoperability with Apache Arrow...
Improving Python and Spark Performance and Interoperability with Apache Arrow...Databricks
Ā 
Uber's data science workbench
Uber's data science workbenchUber's data science workbench
Uber's data science workbenchRan Wei
Ā 
ML, Statistics, and Spark with Databricks for Maximizing Revenue in a Delayed...
ML, Statistics, and Spark with Databricks for Maximizing Revenue in a Delayed...ML, Statistics, and Spark with Databricks for Maximizing Revenue in a Delayed...
ML, Statistics, and Spark with Databricks for Maximizing Revenue in a Delayed...Databricks
Ā 
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...Dremio Corporation
Ā 
Oracle Data Integration CON9737 at OpenWorld
Oracle Data Integration CON9737 at OpenWorldOracle Data Integration CON9737 at OpenWorld
Oracle Data Integration CON9737 at OpenWorldJeffrey T. Pollock
Ā 
Spark SQL: Another 16x Faster After Tungsten: Spark Summit East talk by Brad ...
Spark SQL: Another 16x Faster After Tungsten: Spark Summit East talk by Brad ...Spark SQL: Another 16x Faster After Tungsten: Spark Summit East talk by Brad ...
Spark SQL: Another 16x Faster After Tungsten: Spark Summit East talk by Brad ...Spark Summit
Ā 
DbyDx Software Corporate Presentation
DbyDx Software Corporate PresentationDbyDx Software Corporate Presentation
DbyDx Software Corporate PresentationDbyDx Software
Ā 

What's hot (19)

Spark sql meetup
Spark sql meetupSpark sql meetup
Spark sql meetup
Ā 
Pivotal HAWQ į„‰į…©į„€į…¢
Pivotal HAWQ į„‰į…©į„€į…¢Pivotal HAWQ į„‰į…©į„€į…¢
Pivotal HAWQ į„‰į…©į„€į…¢
Ā 
Analyzing the World's Largest Security Data Lake!
Analyzing the World's Largest Security Data Lake!Analyzing the World's Largest Security Data Lake!
Analyzing the World's Largest Security Data Lake!
Ā 
Building Audiā€™s enterprise big data platform
Building Audiā€™s enterprise big data platformBuilding Audiā€™s enterprise big data platform
Building Audiā€™s enterprise big data platform
Ā 
Empowering Zillowā€™s Developers with Self-Service ETL
Empowering Zillowā€™s Developers with Self-Service ETLEmpowering Zillowā€™s Developers with Self-Service ETL
Empowering Zillowā€™s Developers with Self-Service ETL
Ā 
Introduction to Big Data Analytics using Apache Spark and Zeppelin on HDInsig...
Introduction to Big Data Analytics using Apache Spark and Zeppelin on HDInsig...Introduction to Big Data Analytics using Apache Spark and Zeppelin on HDInsig...
Introduction to Big Data Analytics using Apache Spark and Zeppelin on HDInsig...
Ā 
EDB's Migration Portal - Migrate from Oracle to Postgres
EDB's Migration Portal - Migrate from Oracle to PostgresEDB's Migration Portal - Migrate from Oracle to Postgres
EDB's Migration Portal - Migrate from Oracle to Postgres
Ā 
Spark Development Lifecycle at Workday - ApacheCon 2020
Spark Development Lifecycle at Workday - ApacheCon 2020Spark Development Lifecycle at Workday - ApacheCon 2020
Spark Development Lifecycle at Workday - ApacheCon 2020
Ā 
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Ā 
SAMā€”streaming analytics made easy
SAMā€”streaming analytics made easySAMā€”streaming analytics made easy
SAMā€”streaming analytics made easy
Ā 
Big Data Meets Learning Science: Keynote by Al Essa
Big Data Meets Learning Science: Keynote by Al EssaBig Data Meets Learning Science: Keynote by Al Essa
Big Data Meets Learning Science: Keynote by Al Essa
Ā 
Improving Python and Spark Performance and Interoperability with Apache Arrow...
Improving Python and Spark Performance and Interoperability with Apache Arrow...Improving Python and Spark Performance and Interoperability with Apache Arrow...
Improving Python and Spark Performance and Interoperability with Apache Arrow...
Ā 
Uber's data science workbench
Uber's data science workbenchUber's data science workbench
Uber's data science workbench
Ā 
ML, Statistics, and Spark with Databricks for Maximizing Revenue in a Delayed...
ML, Statistics, and Spark with Databricks for Maximizing Revenue in a Delayed...ML, Statistics, and Spark with Databricks for Maximizing Revenue in a Delayed...
ML, Statistics, and Spark with Databricks for Maximizing Revenue in a Delayed...
Ā 
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
The Future of Column-Oriented Data Processing With Apache Arrow and Apache Pa...
Ā 
Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data Lake Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data Lake
Ā 
Oracle Data Integration CON9737 at OpenWorld
Oracle Data Integration CON9737 at OpenWorldOracle Data Integration CON9737 at OpenWorld
Oracle Data Integration CON9737 at OpenWorld
Ā 
Spark SQL: Another 16x Faster After Tungsten: Spark Summit East talk by Brad ...
Spark SQL: Another 16x Faster After Tungsten: Spark Summit East talk by Brad ...Spark SQL: Another 16x Faster After Tungsten: Spark Summit East talk by Brad ...
Spark SQL: Another 16x Faster After Tungsten: Spark Summit East talk by Brad ...
Ā 
DbyDx Software Corporate Presentation
DbyDx Software Corporate PresentationDbyDx Software Corporate Presentation
DbyDx Software Corporate Presentation
Ā 

Viewers also liked

Apache Geode (incubating) Introduction with Docker
Apache Geode (incubating) Introduction with DockerApache Geode (incubating) Introduction with Docker
Apache Geode (incubating) Introduction with DockerWilliam Markito Oliveira
Ā 
Big and Fast Data - Building Infinitely Scalable Systems
Big and Fast Data - Building Infinitely Scalable SystemsBig and Fast Data - Building Infinitely Scalable Systems
Big and Fast Data - Building Infinitely Scalable SystemsFred Melo
Ā 
OSGeoģ™€ Open Data
OSGeoģ™€ Open DataOSGeoģ™€ Open Data
OSGeoģ™€ Open Datar-kor
Ā 
ķ™©ģ„±ģˆ˜ ź³µź³µė°ģ“ķ„° ź°œė°©ź³¼ ź³µź³µģ“ģŠˆ ķ•“ź²°
ķ™©ģ„±ģˆ˜ ź³µź³µė°ģ“ķ„° ź°œė°©ź³¼ ź³µź³µģ“ģŠˆ ķ•“ź²°ķ™©ģ„±ģˆ˜ ź³µź³µė°ģ“ķ„° ź°œė°©ź³¼ ź³µź³µģ“ģŠˆ ķ•“ź²°
ķ™©ģ„±ģˆ˜ ź³µź³µė°ģ“ķ„° ź°œė°©ź³¼ ź³µź³µģ“ģŠˆ ķ•“ź²°r-kor
Ā 
Deciphering voice of customer through speech analytics
Deciphering voice of customer through speech analyticsDeciphering voice of customer through speech analytics
Deciphering voice of customer through speech analyticsR Systems International
Ā 
Distributed R: The Next Generation Platform for Predictive Analytics
Distributed R: The Next Generation Platform for Predictive AnalyticsDistributed R: The Next Generation Platform for Predictive Analytics
Distributed R: The Next Generation Platform for Predictive AnalyticsJorge Martinez de Salinas
Ā 
Optimizing Facebook Campaigns with R
Optimizing Facebook Campaigns with ROptimizing Facebook Campaigns with R
Optimizing Facebook Campaigns with RDomino Data Lab
Ā 
Cloud Conf 2015 - Develop and Deploy IOT Applications
Cloud Conf 2015 - Develop and Deploy IOT ApplicationsCloud Conf 2015 - Develop and Deploy IOT Applications
Cloud Conf 2015 - Develop and Deploy IOT ApplicationsCorley S.r.l.
Ā 
The Next List: R&D Breakthroughs that are Changing the World
The Next List: R&D Breakthroughs that are Changing the WorldThe Next List: R&D Breakthroughs that are Changing the World
The Next List: R&D Breakthroughs that are Changing the WorldGE
Ā 
IMCSummit 2015 - Day 2 Developer Track - Implementing a Highly Scalable In-Me...
IMCSummit 2015 - Day 2 Developer Track - Implementing a Highly Scalable In-Me...IMCSummit 2015 - Day 2 Developer Track - Implementing a Highly Scalable In-Me...
IMCSummit 2015 - Day 2 Developer Track - Implementing a Highly Scalable In-Me...In-Memory Computing Summit
Ā 
ģ˜¤ķ”ˆė°ģ“ķ„°ģ™€ ģ˜¤ķ”ˆģ†ŒģŠ¤ ģ†Œķ”„ķŠøģ›Øģ–“ė„¼ ģ“ģš©ķ•œ ģ˜ė£Œģ“ģš©ģ •ė³“ģ˜ ģ‹œź°ķ™”
ģ˜¤ķ”ˆė°ģ“ķ„°ģ™€ ģ˜¤ķ”ˆģ†ŒģŠ¤ ģ†Œķ”„ķŠøģ›Øģ–“ė„¼ ģ“ģš©ķ•œ ģ˜ė£Œģ“ģš©ģ •ė³“ģ˜ ģ‹œź°ķ™”ģ˜¤ķ”ˆė°ģ“ķ„°ģ™€ ģ˜¤ķ”ˆģ†ŒģŠ¤ ģ†Œķ”„ķŠøģ›Øģ–“ė„¼ ģ“ģš©ķ•œ ģ˜ė£Œģ“ģš©ģ •ė³“ģ˜ ģ‹œź°ķ™”
ģ˜¤ķ”ˆė°ģ“ķ„°ģ™€ ģ˜¤ķ”ˆģ†ŒģŠ¤ ģ†Œķ”„ķŠøģ›Øģ–“ė„¼ ģ“ģš©ķ•œ ģ˜ė£Œģ“ģš©ģ •ė³“ģ˜ ģ‹œź°ķ™”r-kor
Ā 
Trading System Design
Trading System DesignTrading System Design
Trading System DesignMarketcalls
Ā 
źµ¬ģ”°ķ™”ėœ ė°ģ“ķ„°: Schema.orgģ™€ Microdata, RDFa, JSON-LD
źµ¬ģ”°ķ™”ėœ ė°ģ“ķ„°: Schema.orgģ™€ Microdata, RDFa, JSON-LDźµ¬ģ”°ķ™”ėœ ė°ģ“ķ„°: Schema.orgģ™€ Microdata, RDFa, JSON-LD
źµ¬ģ”°ķ™”ėœ ė°ģ“ķ„°: Schema.orgģ™€ Microdata, RDFa, JSON-LDr-kor
Ā 
Trading sentimental analysis
Trading sentimental analysisTrading sentimental analysis
Trading sentimental analysisMarketcalls
Ā 
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla Air
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla AirII-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla Air
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla AirDr. Haxel Consult
Ā 
Taking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudTaking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudRevolution Analytics
Ā 
H2O World - Intro to R, Python, and Flow - Amy Wang
H2O World - Intro to R, Python, and Flow - Amy WangH2O World - Intro to R, Python, and Flow - Amy Wang
H2O World - Intro to R, Python, and Flow - Amy WangSri Ambati
Ā 

Viewers also liked (20)

How to Contribute to Apache Geode
How to Contribute to Apache GeodeHow to Contribute to Apache Geode
How to Contribute to Apache Geode
Ā 
Apache Geode (incubating) Introduction with Docker
Apache Geode (incubating) Introduction with DockerApache Geode (incubating) Introduction with Docker
Apache Geode (incubating) Introduction with Docker
Ā 
Big and Fast Data - Building Infinitely Scalable Systems
Big and Fast Data - Building Infinitely Scalable SystemsBig and Fast Data - Building Infinitely Scalable Systems
Big and Fast Data - Building Infinitely Scalable Systems
Ā 
OSGeoģ™€ Open Data
OSGeoģ™€ Open DataOSGeoģ™€ Open Data
OSGeoģ™€ Open Data
Ā 
ķ™©ģ„±ģˆ˜ ź³µź³µė°ģ“ķ„° ź°œė°©ź³¼ ź³µź³µģ“ģŠˆ ķ•“ź²°
ķ™©ģ„±ģˆ˜ ź³µź³µė°ģ“ķ„° ź°œė°©ź³¼ ź³µź³µģ“ģŠˆ ķ•“ź²°ķ™©ģ„±ģˆ˜ ź³µź³µė°ģ“ķ„° ź°œė°©ź³¼ ź³µź³µģ“ģŠˆ ķ•“ź²°
ķ™©ģ„±ģˆ˜ ź³µź³µė°ģ“ķ„° ź°œė°©ź³¼ ź³µź³µģ“ģŠˆ ķ•“ź²°
Ā 
Deciphering voice of customer through speech analytics
Deciphering voice of customer through speech analyticsDeciphering voice of customer through speech analytics
Deciphering voice of customer through speech analytics
Ā 
Distributed R: The Next Generation Platform for Predictive Analytics
Distributed R: The Next Generation Platform for Predictive AnalyticsDistributed R: The Next Generation Platform for Predictive Analytics
Distributed R: The Next Generation Platform for Predictive Analytics
Ā 
Optimizing Facebook Campaigns with R
Optimizing Facebook Campaigns with ROptimizing Facebook Campaigns with R
Optimizing Facebook Campaigns with R
Ā 
Cloud Conf 2015 - Develop and Deploy IOT Applications
Cloud Conf 2015 - Develop and Deploy IOT ApplicationsCloud Conf 2015 - Develop and Deploy IOT Applications
Cloud Conf 2015 - Develop and Deploy IOT Applications
Ā 
R lecture oga
R lecture ogaR lecture oga
R lecture oga
Ā 
The Next List: R&D Breakthroughs that are Changing the World
The Next List: R&D Breakthroughs that are Changing the WorldThe Next List: R&D Breakthroughs that are Changing the World
The Next List: R&D Breakthroughs that are Changing the World
Ā 
IMCSummit 2015 - Day 2 Developer Track - Implementing a Highly Scalable In-Me...
IMCSummit 2015 - Day 2 Developer Track - Implementing a Highly Scalable In-Me...IMCSummit 2015 - Day 2 Developer Track - Implementing a Highly Scalable In-Me...
IMCSummit 2015 - Day 2 Developer Track - Implementing a Highly Scalable In-Me...
Ā 
ģ˜¤ķ”ˆė°ģ“ķ„°ģ™€ ģ˜¤ķ”ˆģ†ŒģŠ¤ ģ†Œķ”„ķŠøģ›Øģ–“ė„¼ ģ“ģš©ķ•œ ģ˜ė£Œģ“ģš©ģ •ė³“ģ˜ ģ‹œź°ķ™”
ģ˜¤ķ”ˆė°ģ“ķ„°ģ™€ ģ˜¤ķ”ˆģ†ŒģŠ¤ ģ†Œķ”„ķŠøģ›Øģ–“ė„¼ ģ“ģš©ķ•œ ģ˜ė£Œģ“ģš©ģ •ė³“ģ˜ ģ‹œź°ķ™”ģ˜¤ķ”ˆė°ģ“ķ„°ģ™€ ģ˜¤ķ”ˆģ†ŒģŠ¤ ģ†Œķ”„ķŠøģ›Øģ–“ė„¼ ģ“ģš©ķ•œ ģ˜ė£Œģ“ģš©ģ •ė³“ģ˜ ģ‹œź°ķ™”
ģ˜¤ķ”ˆė°ģ“ķ„°ģ™€ ģ˜¤ķ”ˆģ†ŒģŠ¤ ģ†Œķ”„ķŠøģ›Øģ–“ė„¼ ģ“ģš©ķ•œ ģ˜ė£Œģ“ģš©ģ •ė³“ģ˜ ģ‹œź°ķ™”
Ā 
Trading System Design
Trading System DesignTrading System Design
Trading System Design
Ā 
źµ¬ģ”°ķ™”ėœ ė°ģ“ķ„°: Schema.orgģ™€ Microdata, RDFa, JSON-LD
źµ¬ģ”°ķ™”ėœ ė°ģ“ķ„°: Schema.orgģ™€ Microdata, RDFa, JSON-LDźµ¬ģ”°ķ™”ėœ ė°ģ“ķ„°: Schema.orgģ™€ Microdata, RDFa, JSON-LD
źµ¬ģ”°ķ™”ėœ ė°ģ“ķ„°: Schema.orgģ™€ Microdata, RDFa, JSON-LD
Ā 
Trading sentimental analysis
Trading sentimental analysisTrading sentimental analysis
Trading sentimental analysis
Ā 
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla Air
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla AirII-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla Air
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla Air
Ā 
Language R
Language RLanguage R
Language R
Ā 
Taking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the CloudTaking R Analytics to SQL and the Cloud
Taking R Analytics to SQL and the Cloud
Ā 
H2O World - Intro to R, Python, and Flow - Amy Wang
H2O World - Intro to R, Python, and Flow - Amy WangH2O World - Intro to R, Python, and Flow - Amy Wang
H2O World - Intro to R, Python, and Flow - Amy Wang
Ā 

Similar to Implementing a highly scalable stock prediction system with R, Geode, SpringXD and Spark

Building Highly Scalable Spring Applications using In-Memory Data Grids
Building Highly Scalable Spring Applications using In-Memory Data GridsBuilding Highly Scalable Spring Applications using In-Memory Data Grids
Building Highly Scalable Spring Applications using In-Memory Data GridsJohn Blum
Ā 
Migrating from Big Data Architecture to Spring Cloud
Migrating from Big Data Architecture to Spring CloudMigrating from Big Data Architecture to Spring Cloud
Migrating from Big Data Architecture to Spring CloudVMware Tanzu
Ā 
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerceWhat We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerceVMware Tanzu
Ā 
Federated Queries with HAWQ - SQL on Hadoop and Beyond
Federated Queries with HAWQ - SQL on Hadoop and BeyondFederated Queries with HAWQ - SQL on Hadoop and Beyond
Federated Queries with HAWQ - SQL on Hadoop and BeyondChristian Tzolov
Ā 
YugaByte DBā€”A Planet-Scale Database for Low Latency Transactional Apps
YugaByte DBā€”A Planet-Scale Database for Low Latency Transactional AppsYugaByte DBā€”A Planet-Scale Database for Low Latency Transactional Apps
YugaByte DBā€”A Planet-Scale Database for Low Latency Transactional AppsVMware Tanzu
Ā 
Simple Data Movement Patterns: Legacy Application to Cloud-Native Environment...
Simple Data Movement Patterns: Legacy Application to Cloud-Native Environment...Simple Data Movement Patterns: Legacy Application to Cloud-Native Environment...
Simple Data Movement Patterns: Legacy Application to Cloud-Native Environment...VMware Tanzu
Ā 
Containerizing a Data Warehouse for Kubernetes
Containerizing a Data Warehouse for KubernetesContainerizing a Data Warehouse for Kubernetes
Containerizing a Data Warehouse for KubernetesVMware Tanzu
Ā 
Spring Cloud Gateway - Ryan Baxter
Spring Cloud Gateway - Ryan BaxterSpring Cloud Gateway - Ryan Baxter
Spring Cloud Gateway - Ryan BaxterVMware Tanzu
Ā 
Itā€™s a Multi-Cloud World, But What About The Data?
Itā€™s a Multi-Cloud World, But What About The Data?Itā€™s a Multi-Cloud World, But What About The Data?
Itā€™s a Multi-Cloud World, But What About The Data?VMware Tanzu
Ā 
Developing Real-Time Data Pipelines with Apache Kafka
Developing Real-Time Data Pipelines with Apache KafkaDeveloping Real-Time Data Pipelines with Apache Kafka
Developing Real-Time Data Pipelines with Apache KafkaJoe Stein
Ā 
Cross-Platform Observability for Cloud Foundry
Cross-Platform Observability for Cloud FoundryCross-Platform Observability for Cloud Foundry
Cross-Platform Observability for Cloud FoundryVMware Tanzu
Ā 
Developer Secure Containers for the Cyberspace Battlefield
Developer Secure Containers for the Cyberspace BattlefieldDeveloper Secure Containers for the Cyberspace Battlefield
Developer Secure Containers for the Cyberspace BattlefieldVMware Tanzu
Ā 
Achieving High Throughput With Reliability In Transactional Systems
Achieving High Throughput With Reliability In Transactional SystemsAchieving High Throughput With Reliability In Transactional Systems
Achieving High Throughput With Reliability In Transactional SystemsVMware Tanzu
Ā 
Beyond Caching: Extending Redis Enterprise for Real-Time Streams Processing
Beyond Caching: Extending Redis Enterprise for Real-Time Streams ProcessingBeyond Caching: Extending Redis Enterprise for Real-Time Streams Processing
Beyond Caching: Extending Redis Enterprise for Real-Time Streams ProcessingVMware Tanzu
Ā 
Ratpack - SpringOne2GX 2015
Ratpack - SpringOne2GX 2015Ratpack - SpringOne2GX 2015
Ratpack - SpringOne2GX 2015Daniel Woods
Ā 
Building a Secure App with Google Polymer and Java / Spring
Building a Secure App with Google Polymer and Java / SpringBuilding a Secure App with Google Polymer and Java / Spring
Building a Secure App with Google Polymer and Java / Springsdeeg
Ā 
Living on the Edge With Spring Cloud Gateway - Cora Iberkleid
Living on the Edge With Spring Cloud Gateway - Cora IberkleidLiving on the Edge With Spring Cloud Gateway - Cora Iberkleid
Living on the Edge With Spring Cloud Gateway - Cora IberkleidVMware Tanzu
Ā 
Living on the Edge With Spring Cloud Gateway - Cora Iberkleid
Living on the Edge With Spring Cloud Gateway - Cora IberkleidLiving on the Edge With Spring Cloud Gateway - Cora Iberkleid
Living on the Edge With Spring Cloud Gateway - Cora IberkleidVMware Tanzu
Ā 
Building Data Environments for Production Microservices with Geode
Building Data Environments for Production Microservices with GeodeBuilding Data Environments for Production Microservices with Geode
Building Data Environments for Production Microservices with GeodeVMware Tanzu
Ā 

Similar to Implementing a highly scalable stock prediction system with R, Geode, SpringXD and Spark (20)

Building Highly Scalable Spring Applications using In-Memory Data Grids
Building Highly Scalable Spring Applications using In-Memory Data GridsBuilding Highly Scalable Spring Applications using In-Memory Data Grids
Building Highly Scalable Spring Applications using In-Memory Data Grids
Ā 
Migrating from Big Data Architecture to Spring Cloud
Migrating from Big Data Architecture to Spring CloudMigrating from Big Data Architecture to Spring Cloud
Migrating from Big Data Architecture to Spring Cloud
Ā 
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerceWhat We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
What We're Learning Adopting Spring Boot and PCF for Dell.com's eCommerce
Ā 
Federated Queries with HAWQ - SQL on Hadoop and Beyond
Federated Queries with HAWQ - SQL on Hadoop and BeyondFederated Queries with HAWQ - SQL on Hadoop and Beyond
Federated Queries with HAWQ - SQL on Hadoop and Beyond
Ā 
YugaByte DBā€”A Planet-Scale Database for Low Latency Transactional Apps
YugaByte DBā€”A Planet-Scale Database for Low Latency Transactional AppsYugaByte DBā€”A Planet-Scale Database for Low Latency Transactional Apps
YugaByte DBā€”A Planet-Scale Database for Low Latency Transactional Apps
Ā 
Simple Data Movement Patterns: Legacy Application to Cloud-Native Environment...
Simple Data Movement Patterns: Legacy Application to Cloud-Native Environment...Simple Data Movement Patterns: Legacy Application to Cloud-Native Environment...
Simple Data Movement Patterns: Legacy Application to Cloud-Native Environment...
Ā 
S1P: Spring Cloud on PKS
S1P: Spring Cloud on PKSS1P: Spring Cloud on PKS
S1P: Spring Cloud on PKS
Ā 
Containerizing a Data Warehouse for Kubernetes
Containerizing a Data Warehouse for KubernetesContainerizing a Data Warehouse for Kubernetes
Containerizing a Data Warehouse for Kubernetes
Ā 
Spring Cloud Gateway - Ryan Baxter
Spring Cloud Gateway - Ryan BaxterSpring Cloud Gateway - Ryan Baxter
Spring Cloud Gateway - Ryan Baxter
Ā 
Itā€™s a Multi-Cloud World, But What About The Data?
Itā€™s a Multi-Cloud World, But What About The Data?Itā€™s a Multi-Cloud World, But What About The Data?
Itā€™s a Multi-Cloud World, But What About The Data?
Ā 
Developing Real-Time Data Pipelines with Apache Kafka
Developing Real-Time Data Pipelines with Apache KafkaDeveloping Real-Time Data Pipelines with Apache Kafka
Developing Real-Time Data Pipelines with Apache Kafka
Ā 
Cross-Platform Observability for Cloud Foundry
Cross-Platform Observability for Cloud FoundryCross-Platform Observability for Cloud Foundry
Cross-Platform Observability for Cloud Foundry
Ā 
Developer Secure Containers for the Cyberspace Battlefield
Developer Secure Containers for the Cyberspace BattlefieldDeveloper Secure Containers for the Cyberspace Battlefield
Developer Secure Containers for the Cyberspace Battlefield
Ā 
Achieving High Throughput With Reliability In Transactional Systems
Achieving High Throughput With Reliability In Transactional SystemsAchieving High Throughput With Reliability In Transactional Systems
Achieving High Throughput With Reliability In Transactional Systems
Ā 
Beyond Caching: Extending Redis Enterprise for Real-Time Streams Processing
Beyond Caching: Extending Redis Enterprise for Real-Time Streams ProcessingBeyond Caching: Extending Redis Enterprise for Real-Time Streams Processing
Beyond Caching: Extending Redis Enterprise for Real-Time Streams Processing
Ā 
Ratpack - SpringOne2GX 2015
Ratpack - SpringOne2GX 2015Ratpack - SpringOne2GX 2015
Ratpack - SpringOne2GX 2015
Ā 
Building a Secure App with Google Polymer and Java / Spring
Building a Secure App with Google Polymer and Java / SpringBuilding a Secure App with Google Polymer and Java / Spring
Building a Secure App with Google Polymer and Java / Spring
Ā 
Living on the Edge With Spring Cloud Gateway - Cora Iberkleid
Living on the Edge With Spring Cloud Gateway - Cora IberkleidLiving on the Edge With Spring Cloud Gateway - Cora Iberkleid
Living on the Edge With Spring Cloud Gateway - Cora Iberkleid
Ā 
Living on the Edge With Spring Cloud Gateway - Cora Iberkleid
Living on the Edge With Spring Cloud Gateway - Cora IberkleidLiving on the Edge With Spring Cloud Gateway - Cora Iberkleid
Living on the Edge With Spring Cloud Gateway - Cora Iberkleid
Ā 
Building Data Environments for Production Microservices with Geode
Building Data Environments for Production Microservices with GeodeBuilding Data Environments for Production Microservices with Geode
Building Data Environments for Production Microservices with Geode
Ā 

Recently uploaded

%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Hararemasabamasaba
Ā 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...SelfMade bd
Ā 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyviewmasabamasaba
Ā 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is insideshinachiaurasa2
Ā 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in sowetomasabamasaba
Ā 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastPapp KrisztiƔn
Ā 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...masabamasaba
Ā 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfkalichargn70th171
Ā 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareJim McKeeth
Ā 
Introducing Microsoftā€™s new Enterprise Work Management (EWM) Solution
Introducing Microsoftā€™s new Enterprise Work Management (EWM) SolutionIntroducing Microsoftā€™s new Enterprise Work Management (EWM) Solution
Introducing Microsoftā€™s new Enterprise Work Management (EWM) SolutionOnePlan Solutions
Ā 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesVictorSzoltysek
Ā 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2
Ā 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2
Ā 
Shapes for Sharing between Graph Data SpacesĀ - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data SpacesĀ - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data SpacesĀ - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data SpacesĀ - and Epistemic Querying of RDF-...Steffen Staab
Ā 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrainmasabamasaba
Ā 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...Shane Coughlan
Ā 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
Ā 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfWilly Marroquin (WillyDevNET)
Ā 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
Ā 

Recently uploaded (20)

%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare
Ā 
Abortion Pill Prices Tembisa [(+27832195400*)] šŸ„ Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] šŸ„ Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] šŸ„ Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] šŸ„ Women's Abortion Clinic in T...
Ā 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Ā 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
Ā 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
Ā 
%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto%in Soweto+277-882-255-28 abortion pills for sale in soweto
%in Soweto+277-882-255-28 abortion pills for sale in soweto
Ā 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
Ā 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
Ā 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Ā 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
Ā 
Introducing Microsoftā€™s new Enterprise Work Management (EWM) Solution
Introducing Microsoftā€™s new Enterprise Work Management (EWM) SolutionIntroducing Microsoftā€™s new Enterprise Work Management (EWM) Solution
Introducing Microsoftā€™s new Enterprise Work Management (EWM) Solution
Ā 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
Ā 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
Ā 
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
WSO2Con2024 - From Code To Cloud: Fast Track Your Cloud Native Journey with C...
Ā 
Shapes for Sharing between Graph Data SpacesĀ - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data SpacesĀ - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data SpacesĀ - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data SpacesĀ - and Epistemic Querying of RDF-...
Ā 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
Ā 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
Ā 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
Ā 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
Ā 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
Ā 

Implementing a highly scalable stock prediction system with R, Geode, SpringXD and Spark

  • 1. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under a Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ SPRINGONE2GX WASHINGTON, DC Implementing a highly scalable Stock prediction system with R, Apache Geode and Spring XD Fred Meloā€Ø @fredmelo_br William Markitoā€Ø @william_markito
  • 2. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ About us Fred Melo Technical Director for Data fmelo@pivotal.io @fredmelo_br 2 William Markito Enterprise Architect for GemFire wmarkito@pivotal.io @william_markito
  • 3. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 3
  • 4. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 4 It's all about DATA Data Sources Look for patterns Prediction
  • 5. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ What do we want to build? 5 "Smart System"
  • 6. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ ā€¦ in our specific case 6 Trading Data "Smart System" Historical Data Repository Learns with historical trends "How were the medium average price and relative strength reading when the latest failures happened? " Live data becomes historical over time Real-Time Evaluates live data ā€œAccording to historical trends, thereā€™s an 80% chance this stock prices might go downhill within the next hour" Historical
  • 7. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ ā€¦ in our specific case 7 Trading Data "Smart System" Historical Data Repository Learns with historical trends "How were the medium average price and relative strength reading when the latest failures happened? " Live data becomes historical over time Real-Time Evaluates live data ā€œAccording to historical trends, thereā€™s an 80% chance this stock prices might go downhill within the next hour" Historical
  • 8. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 8 Live Data Data Temperature Hot Cold Greenplum DB Apache Geode / GemFire 1- Live data is ingested into the grid 3 - Results are pushed immediately to deployed applications 4 - ā€œHot" data ages, becoming part of the historical dataset Machine Learning model 5 - Re-training is triggered, updating the model with the latest historical data Spring XD Spring XD The ML pipeline data flow 2 - Trained ML model compares new data to historical patterns
  • 9. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 9 Live Data Apache Geode / GemFire 1- Live data is ingested into the grid 2 - Trained ML model compares new data to historical patterns 3 - Results are pushed immediately to deployed applications Machine Learning model 4 - Re-training is triggered, updating the model with the latest historical data Spring XD Spring XD Simplified demo model Data Temperature Hot Warm
  • 10. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 10 Transform Sink SpringXD Extensible Open-Source Fault-Tolerant Horizontally Scalable Cloud-Native Machine Learning Enrich Filter Split Dashboard Indicators 1 2 Predict 3 Real data Simulator /Stocks /TechIndicators /Predictions
  • 11. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 11 Eating it in small bitesā€¦
  • 12. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 12 SpringXD GemFire
  • 13. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ ā€¢ Cache ā€¢ Configurable through XML, ,Java ā€¢ Region ā€¢ Distributed j.u.Map on steroids ā€¢ Highly available, redundant ā€¢ Member ā€¢ Locator, Server, Client ā€¢ Callbacks ā€¢ Listener, Writer, AsyncEventListener, Parallel/Serial Apache Geode & GemFire Concepts 13
  • 14. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ Apache Geode & GemFire, why ? ā€¢ Performance ā€¢ Consistency ā€¢ Resiliency 14
  • 15. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ Apache Geode & GemFire, why ? 15 Ā© Copyright 2014 Pivotal. All rights reserved. Pivotal GemFire High Availability and Fault Tolerance in 6 acts Failing data copies are replaced transparently Data is replicated to other clusters and sites (WAN) Network segmentations are identiļ¬ed and ļ¬xed automatically Client and cluster disconnections are handled gracefully Data is persisted on local disk for ultimate durability ā€œsplit brainā€ Failed function executions are restarted automatically restart
  • 16. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ Some interesting casesā€¦ 16 China Railwayā€Ø Corporation 5,700 train stations 4.5 million tickets per day 20 million daily users 1.4 billion page views per day 40,000 visits per second * http://pivotal.io/big-data/pivotal-gemfire Indian Railways 7,000 stations 72,000 miles of track 23 million passengers daily 120,000 concurrent users 10,000 transactions per minute
  • 17. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ Use cases and industries 17 Indian RailwaysChina Railway Corporation World: ~7,349,000,000 ~36% of the world population Population: 1,251,695,6161,401,586,609
  • 18. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ ā€¢ Commercial product available since 2004 ā€¢ Native clients in Java, C++, C#, REST ā€¢ Event Subscriptions and Continuous Queries ā€¢ Configurable WAN Gateway between clusters ā€¢ Enterprise Support, commercial features Apache Geode & Pivotal GemFire ā€¢ Open Sourced in April/2015 ā€¢ Java Native Client, REST ā€¢ 98% of GemFire API ā€¢ Event subscriptions ā€¢ ~30 contributors ā€¢ Under Incubation 18
  • 19. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 19 SpringXD GemFire
  • 20. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ SpringXD Basic Concepts ā€¢ Streams ā€¢ Pipelines ā€¢ Sources ā€¢ Sinks ā€¢ Filters ā€¢ Taps 20
  • 21. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ SpringXD Basic Concepts 21
  • 22. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ A simple example 22 twittersearch --consumerKey=XXX ā€”consumerSecret=XXX -- query=SpringOne2GX --outputType=application/json | gemfire-json- server --useLocator=true --host=localhost --port=10334 -- regionName=tweets --keyExpression=payload.getField('id_str') twittersearch --query=SpringOne2GX | gemfire-json-server --host=localhost--regionName=tweets
  • 23. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 23 SpringXD GemFire
  • 24. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ Apache Spark Concepts ā€¢ RDD ā€¢ Dataframe ā€¢ Driver ā€¢ Worker 24 "An RDD in Spark is simply an immutable distributed collection of objects. Each RDD is split into multiple partitions, which may be computed on different nodes of the cluster. RDDs can contain any type of Python, Java, or Scala objects, including user-deļ¬ned classes."
  • 25. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ Apache Spark Concepts ā€¢ RDD ā€¢ Dataframe ā€¢ Driver ā€¢ Worker 25
  • 26. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 26 medium avg (x+1) relative strength (x) medium avg (x) price(x) Machine Learning Model (e.g. Linear Regression)
  • 27. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 27 medium avg (x+1) relative strength (x) medium avg (x) price(x) Machine Learning Model (e.g. Linear Regression) Features Label
  • 28. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 28 Transform Sink SpringXD Extensible Open-Source Fault-Tolerant Horizontally Scalable Cloud-Native Machine Learning Enrich Filter Split Dashboard Indicators 1 2 Predict 3 Real data Simulator /Stocks /TechIndicators /Predictions
  • 29. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ 29
  • 30. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ Learn more! 30 https://github.com/Pivotal-Open-Source-Hub/geode-security-samples https://github.com/Pivotal-Open-Source-Hub/WifiAnalyticsIoT https://github.com/Pivotal-Open-Source-Hub/geode-social-demo http://pivotal-open-source-hub.github.io/StockInference-Spark/
  • 31. Unless otherwise indicated, these slides are Ā© 2013-2015 Pivotal Software, Inc. and licensed under aā€Ø Creative Commons Attribution-NonCommercial license: http://creativecommons.org/licenses/by-nc/3.0/ Thank you 31 @william_markito @fredmelo_br Related: Building Highly-Scalable Spring Applications with In-Memory, Distributed Data Grids by John Blum & Luke Shannon September 15, 2015 -10:30 - Salon M http://pivotal-open-source-hub.github.io/StockInference-Spark/