SlideShare uma empresa Scribd logo
1 de 30
Information Classification: GENERAL
BIG DATA
(on Data you don’t have)
1
Information Classification: GENERAL
HOW DO WE DEAL WITH INFINITE DIMENSIONAL DATA…..
BY GENERALIZING TRADITIONAL MAP REDUCE
PARADIGM…….
Information Classification: GENERAL
DISCLAIMER
Information Classification: GENERAL
THERE ARE FOUR SOURCES OF DATA
4
Data I have
(traditional
“Big Data”)
Data I can model
Data I can
acquire
Data someone
else can acquire
or model
Information Classification: GENERAL
HOW WE REPRESENT THESE ITEMS
5
Pre-Calculated
Data
Formulas you
have
Services
Things People
can Share with
Me
Information Classification: GENERAL
6
Pre-Calculated
Data
Formulas
Services
Things People
can Share with
Me
MSCI Beon™
Information Classification: GENERAL
7
Jim Burns
David Clark
Information Classification: GENERAL
MSCI PLATFORM – A NEXT GENERATION LEAP
8
Big
Data
Repository
Hadoop / Cloudera etc
Slice/Dice
Traditional Big Data “Data you Have” Paradigm
Beon
New Front End
NEW Big Data Paradigm
Calculation and Data Services
On
Demand
Data
Expressions
The
Morning
Load
Virtual
fields
Dynamic
new data
Information Classification: GENERAL
COMPLEX QUESTIONS
9
Information Classification: GENERAL
WHAT IS A COMPLEX QUESTION VERSUS A SPECIFIC QUESTION?
10
Specific questions can be hard, for example:
• What happens to sea level if the temperature goes up 1.5 degrees by 2035?
• What properties are on the beach and over x meters above sea level in Marbella?
• What are the biggest real estate bargains in a portfolio.
Complex questions are combinations of specific questions.
• What should I buy if I believe that temperatures are going to raise 1.5 degrees by 2035 and I
only want property that will be at least 1 meter above sea level in 2035 but still on the beach.
Information Classification: GENERAL
HOW TO ANSWER A COMPLEX QUESTION
11
So to answer a complex question you need something that can answer this
Let Portfolio = All the houses in Marbella
safeHouses = Filter( SeaLevel >= 1.0 + seaLevelRise(1.5 c)) Portfolio
BestBargains = BargainFinder safeHouses
It does this by calling the services below for certain calculations.
Platform
Marbella
Houses
Planet
Simulator
Sea Level RaiseHouse Database
Execute the question
above, Filtering, etc..
Bargain
Finder
Information Classification: GENERAL
GENERALIZING MAP-REDUCE
UH OH – SOME MATH……
12
Information Classification: GENERAL
𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ → ℝ
Just for simplicity, lets assume we only care about real numbers (obviously, we could have tuples, strings,
dictionaries, any valid type honestly…)
Standard map reduce, Gamma is your class object/structure/thing
𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, ℂ → {ℝ, ℂ}
First things first, we need a context.
Information Classification: GENERAL
?
Yesterday Today
My Portfolio is worth $43 My Portfolio is worth $40
Result
I lost $3 
I lost $3/1.1 = € 2.72
My Portfolio is worth €
35.83
My Portfolio is worth €
36.36
I made € .53 
The reason for the error is that this is a lie. You DID NOT LOSE $3.
The answer is “I have made or lost ($40 in todays context - $43 in yesterdays context)”
Information Classification: GENERAL
Now we also toss in some services…….
𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, ℂ → {ℝ, ℂ}
Becomes
𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, 𝑆, ℂ → {ℝ, ℂ}
𝑤ℎ𝑒𝑟𝑒 𝑆 = 𝑆1, 𝑆2, … , 𝑆𝑛 𝑜𝑢𝑟 𝑠𝑒𝑟𝑣𝑖𝑐𝑒𝑠
But what our services??? This is a functional language conference soooo, we use functions
to access services.
𝑙𝑒𝑡 Ϝ = Ϝ𝑖, 𝑗 𝑎𝑙𝑙 𝑖, 𝑗 𝑤𝑖𝑡ℎ Ϝ𝑖, 𝑗: {Γ, 𝑆1,𝑆2,…., 𝑆𝑖,ℂ} → {ℝ, ℂ}
𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, Ϝ, ℂ → {ℝ, ℂ}
So new services
can leverage old
services
Information Classification: GENERAL
𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, Ϝ, ℂ → {ℝ, ℂ}
𝓏: ⊕𝑖=1…𝑚 Γ, Ϝ, ℂ →
𝑘=1…𝑛
{ℝ, ℂ}
Data You Have Data You Can
Acquire
Data You Can
Model
Obvious Extensions…
Information Classification: GENERAL
𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, Ϝ, ℂ → {ℝ, ℂ}
𝑎𝑏𝑠𝑡𝑟𝑎𝑐𝑡 𝑜𝑏𝑗𝑒𝑐𝑡 𝑠𝑝𝑎𝑐𝑒 Γ, Ϝ, ℂ , 𝑏𝑢𝑡 𝑡ℎ𝑒𝑟𝑒 𝑎𝑟𝑒 𝑜𝑡ℎ𝑒𝑟𝑠
Example:
• Γ = Customer Records
• F = purchasesOfWine(tenor)
• ℂ = Date
Example:
• Γ = CountryList +
wineSales
• F =
• weather()
• totalWineSales(tenor)
• ℂ = Date, weather
Customer Space Country Space
TRANSFORM
Information Classification: GENERAL
18
Customer Location Wine purchasesOfWine(tenor)
Bob Spain 1/1/2019 – 3btl
15/3/2019 – 2btl
Mary France 15/1/2019 – 2btl
Juan Spain 12/5/2019 –
6 btl
Edward England 13/4/2019 –
8 btl
TRANSFORM
Country Purchases totalWineSales(tenor) Weather()
Spain 11 bottles
France 2 bottles
England 8 bottles
Γ1, Ϝ1, ℂ1
Γ2, Ϝ2, ℂ2
𝓣 𝟏 Γ2, Ϝ2, ℂ2 = 𝒯1 ∘ Γ1, Ϝ1, ℂ1
Information Classification: GENERAL
𝓏: ⊕𝑖=1…𝑚 Γ, Ϝ, ℂ →
𝑘=1…𝑛
{ℝ, ℂ}
Step 1:
Step 2:
𝒯𝑘: ⊕𝑖=1…𝑚 Γ 𝑘, Ϝ 𝑘, ℂ 𝑘 →⊕𝑖=1…𝑛 Γ 𝑘 + 1, Ϝ 𝑘 + 1, ℂ 𝑘 + 1
Step 1:
Step 2:
⊕𝑖=1…𝑚 Γ 𝑘, Ϝ 𝑘, ℂ 𝑘
= 𝒯𝑘 ∘ 𝒯𝑘−1 ∘ ⋯ ∘ 𝒯1 ∘⊕𝑖=1…𝑛 Γ1, Ϝ1, ℂ1
𝓏: ⊕𝑖=1…𝑚 Γ 𝑘, Ϝ 𝑘, ℂ 𝑘 →
𝑖=1…𝑛
{ℝ, ℂ}
THE FINAL FORMULA
𝑖=1…𝑚
{ℝ, ℂ} = 𝓏 ∘ 𝒯𝑘 ∘ 𝒯𝑘−1 ∘ ⋯ ∘ 𝒯1 ∘⊕𝑖=1…𝑛 Γ1, Ϝ1, ℂ1
Information Classification: GENERAL
WEBSMACK FRAMEWORK
20
Information Classification: GENERAL
21
Information Classification: GENERAL
𝑥 = 𝓏 ∘ 𝒯𝑘 ∘ 𝒯𝑘−1 ∘ ⋯ ∘ 𝒯1: ⊕𝑖=1…𝑛 Γ1, Ϝ1, ℂ1 →
𝑖=1…𝑚
{ℝ, ℂ}
𝑡𝑟𝑎𝑑𝑖𝑡𝑖𝑜𝑛𝑎𝑙 … . . 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ → ℝ
𝑁𝐸𝑊 𝐴𝑁𝐷 𝐼𝑀𝑃𝑅𝑂𝑉𝐸𝐷 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ → ℝ
Evaluation
Evaluation
Information Classification: GENERAL
rootVFP
|> scenario (asOf(15-05-2019))
|> Load “position” (filter(“MSCI USA – Daily”))
|> filter (instrument.ESG.WomenOnBoard = true)
THIS IS NOT AN IMPERATIVE ORDERING!!!!!!!!!!!!
Companies with Women on
Board
MSCI
IBM
Apple
|> scenario (timeseries(Date(1,1,2019),Date(15,5,2019) ) )
Companies with Women on
Board
1/1/2019 – {list of companies}
2/1/2019 – (list of companies)
3/1/2019 – (list of companies)
Information Classification: GENERAL
THIS NATURALLY LETS YOU MAKE A 5TH GENERATION FRONT END
Information Classification: GENERAL
25
Information Classification: GENERAL
HOW THE MACHINE WORKS
26
Information Classification: GENERAL
Service API layer
MSCI BEON – A NEW PARADIGM
27
Framework based on the Beon Engine
Functions Library
Process X
I’m Process X
and I can
provide x
Process Y
I’m Process Y
and I can
provide y
Process S Process T Process C
x -> ProcessX
y -> ProcessY
s -> ProcessS
t -> ProcessT
c -> ProcessC
Beon Engine
a = x + y
b = s / t
Information Classification: GENERAL
Service API layer
MSCI BEON – A NEW PARADIGM
28
Everything starts with a question …
Functions Library
Process X Process Y Process S Process T Process C
x -> ProcessX
y -> ProcessY
s -> ProcessS
t -> ProcessT
c -> ProcessC
Beon Engine
a = x + y
b = s / t
Query API
ResultSpec request
Information Classification: GENERAL
Service API layer
MSCI BEON – A NEW PARADIGM
29
The question is then expanded, compiled into byte code, and then parametrized with a context …
Functions Library
Process X Process Y Process S Process T Process C
x -> ProcessX
y -> ProcessY
s -> ProcessS
t -> ProcessT
c -> ProcessC
Beon Engine
a = x + y
b = s / t
Query API
ResultSpec request
Compiler Execution Enginea
s
w d
t
m o u
c
h
p
a
s
w d
m o
c
h
p
a
s
w d
c
a
s c
Context
Compiler
Information Classification: GENERAL
Service API layer
MSCI BEON – A NEW PARADIGM
30
Then executed against the various data services. Results are then recombined and presented back.
Functions Library
Process X Process Y Process S Process T Process C
x -> ProcessX
y -> ProcessY
s -> ProcessS
t -> ProcessT
c -> ProcessC
Beon Engine
a = x + y
b = s / t
Query API
ResultSpec request
Compiler Execution Enginea
s
w d
t
m o u
c
h
p
a
s
w d
m o
c
h
p
a
s
w d
c
a
s c
Conte
xt
Processing …

Mais conteúdo relacionado

Semelhante a Big Data On Data You Don’t Have

Improving Findability through Site Search Analytics
Improving Findability through Site Search AnalyticsImproving Findability through Site Search Analytics
Improving Findability through Site Search AnalyticsLouis Rosenfeld
 
5.01 database-fundamentals
5.01 database-fundamentals5.01 database-fundamentals
5.01 database-fundamentalsTammy Carter
 
Data Science: The Product Manager's Primer
Data Science: The Product Manager's PrimerData Science: The Product Manager's Primer
Data Science: The Product Manager's PrimerProduct School
 
Learning content - Data Science Basics
Learning content - Data Science Basics Learning content - Data Science Basics
Learning content - Data Science Basics PredicSis
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessingkayathri02
 
Working With Big Data - Nov 2016
Working With Big Data - Nov 2016Working With Big Data - Nov 2016
Working With Big Data - Nov 2016Seth Familian
 
ML-ChapterTwo-Data Preprocessing.ppt
ML-ChapterTwo-Data Preprocessing.pptML-ChapterTwo-Data Preprocessing.ppt
ML-ChapterTwo-Data Preprocessing.pptbelay41
 
Unit 1-Data Science Process Overview.pptx
Unit 1-Data Science Process Overview.pptxUnit 1-Data Science Process Overview.pptx
Unit 1-Data Science Process Overview.pptxAnusuya123
 
In:Confidence 2019 - Balancing the conflicting objectives of data access and ...
In:Confidence 2019 - Balancing the conflicting objectives of data access and ...In:Confidence 2019 - Balancing the conflicting objectives of data access and ...
In:Confidence 2019 - Balancing the conflicting objectives of data access and ...Privitar
 
MLconf NYC Edo Liberty
MLconf NYC Edo LibertyMLconf NYC Edo Liberty
MLconf NYC Edo LibertyMLconf
 
Visualize data using the split-apply-combine approach
Visualize data using the split-apply-combine approachVisualize data using the split-apply-combine approach
Visualize data using the split-apply-combine approachLuca Candela
 
Data Mining Concepts 15061
Data Mining Concepts 15061Data Mining Concepts 15061
Data Mining Concepts 15061badirh
 

Semelhante a Big Data On Data You Don’t Have (20)

3prep
3prep3prep
3prep
 
UNIT 1_2.ppt
UNIT 1_2.pptUNIT 1_2.ppt
UNIT 1_2.ppt
 
4 preprocess
4 preprocess4 preprocess
4 preprocess
 
Improving Findability through Site Search Analytics
Improving Findability through Site Search AnalyticsImproving Findability through Site Search Analytics
Improving Findability through Site Search Analytics
 
5.01 database-fundamentals
5.01 database-fundamentals5.01 database-fundamentals
5.01 database-fundamentals
 
Preprocessing
PreprocessingPreprocessing
Preprocessing
 
Data Science: The Product Manager's Primer
Data Science: The Product Manager's PrimerData Science: The Product Manager's Primer
Data Science: The Product Manager's Primer
 
Learning content - Data Science Basics
Learning content - Data Science Basics Learning content - Data Science Basics
Learning content - Data Science Basics
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Working With Big Data - Nov 2016
Working With Big Data - Nov 2016Working With Big Data - Nov 2016
Working With Big Data - Nov 2016
 
ML-ChapterTwo-Data Preprocessing.ppt
ML-ChapterTwo-Data Preprocessing.pptML-ChapterTwo-Data Preprocessing.ppt
ML-ChapterTwo-Data Preprocessing.ppt
 
Datapreprocessing
DatapreprocessingDatapreprocessing
Datapreprocessing
 
Unit 1-Data Science Process Overview.pptx
Unit 1-Data Science Process Overview.pptxUnit 1-Data Science Process Overview.pptx
Unit 1-Data Science Process Overview.pptx
 
Sales Forecasting as a Data Product by Francesca Iannuzzi
Sales Forecasting as a Data Product by Francesca IannuzziSales Forecasting as a Data Product by Francesca Iannuzzi
Sales Forecasting as a Data Product by Francesca Iannuzzi
 
In:Confidence 2019 - Balancing the conflicting objectives of data access and ...
In:Confidence 2019 - Balancing the conflicting objectives of data access and ...In:Confidence 2019 - Balancing the conflicting objectives of data access and ...
In:Confidence 2019 - Balancing the conflicting objectives of data access and ...
 
Lesson07
Lesson07Lesson07
Lesson07
 
MLconf NYC Edo Liberty
MLconf NYC Edo LibertyMLconf NYC Edo Liberty
MLconf NYC Edo Liberty
 
Visualize data using the split-apply-combine approach
Visualize data using the split-apply-combine approachVisualize data using the split-apply-combine approach
Visualize data using the split-apply-combine approach
 
Data Mining.ppt
Data Mining.pptData Mining.ppt
Data Mining.ppt
 
Data Mining Concepts 15061
Data Mining Concepts 15061Data Mining Concepts 15061
Data Mining Concepts 15061
 

Mais de J On The Beach

Massively scalable ETL in real world applications: the hard way
Massively scalable ETL in real world applications: the hard wayMassively scalable ETL in real world applications: the hard way
Massively scalable ETL in real world applications: the hard wayJ On The Beach
 
Acoustic Time Series in Industry 4.0: Improved Reliability and Cyber-Security...
Acoustic Time Series in Industry 4.0: Improved Reliability and Cyber-Security...Acoustic Time Series in Industry 4.0: Improved Reliability and Cyber-Security...
Acoustic Time Series in Industry 4.0: Improved Reliability and Cyber-Security...J On The Beach
 
Pushing it to the edge in IoT
Pushing it to the edge in IoTPushing it to the edge in IoT
Pushing it to the edge in IoTJ On The Beach
 
Drinking from the firehose, with virtual streams and virtual actors
Drinking from the firehose, with virtual streams and virtual actorsDrinking from the firehose, with virtual streams and virtual actors
Drinking from the firehose, with virtual streams and virtual actorsJ On The Beach
 
How do we deploy? From Punched cards to Immutable server pattern
How do we deploy? From Punched cards to Immutable server patternHow do we deploy? From Punched cards to Immutable server pattern
How do we deploy? From Punched cards to Immutable server patternJ On The Beach
 
When Cloud Native meets the Financial Sector
When Cloud Native meets the Financial SectorWhen Cloud Native meets the Financial Sector
When Cloud Native meets the Financial SectorJ On The Beach
 
The big data Universe. Literally.
The big data Universe. Literally.The big data Universe. Literally.
The big data Universe. Literally.J On The Beach
 
Streaming to a New Jakarta EE
Streaming to a New Jakarta EEStreaming to a New Jakarta EE
Streaming to a New Jakarta EEJ On The Beach
 
The TIPPSS Imperative for IoT - Ensuring Trust, Identity, Privacy, Protection...
The TIPPSS Imperative for IoT - Ensuring Trust, Identity, Privacy, Protection...The TIPPSS Imperative for IoT - Ensuring Trust, Identity, Privacy, Protection...
The TIPPSS Imperative for IoT - Ensuring Trust, Identity, Privacy, Protection...J On The Beach
 
Pushing AI to the Client with WebAssembly and Blazor
Pushing AI to the Client with WebAssembly and BlazorPushing AI to the Client with WebAssembly and Blazor
Pushing AI to the Client with WebAssembly and BlazorJ On The Beach
 
Axon Server went RAFTing
Axon Server went RAFTingAxon Server went RAFTing
Axon Server went RAFTingJ On The Beach
 
The Six Pitfalls of building a Microservices Architecture (and how to avoid t...
The Six Pitfalls of building a Microservices Architecture (and how to avoid t...The Six Pitfalls of building a Microservices Architecture (and how to avoid t...
The Six Pitfalls of building a Microservices Architecture (and how to avoid t...J On The Beach
 
Madaari : Ordering For The Monkeys
Madaari : Ordering For The MonkeysMadaari : Ordering For The Monkeys
Madaari : Ordering For The MonkeysJ On The Beach
 
Servers are doomed to fail
Servers are doomed to failServers are doomed to fail
Servers are doomed to failJ On The Beach
 
Interaction Protocols: It's all about good manners
Interaction Protocols: It's all about good mannersInteraction Protocols: It's all about good manners
Interaction Protocols: It's all about good mannersJ On The Beach
 
A race of two compilers: GraalVM JIT versus HotSpot JIT C2. Which one offers ...
A race of two compilers: GraalVM JIT versus HotSpot JIT C2. Which one offers ...A race of two compilers: GraalVM JIT versus HotSpot JIT C2. Which one offers ...
A race of two compilers: GraalVM JIT versus HotSpot JIT C2. Which one offers ...J On The Beach
 
Leadership at every level
Leadership at every levelLeadership at every level
Leadership at every levelJ On The Beach
 
Machine Learning: The Bare Math Behind Libraries
Machine Learning: The Bare Math Behind LibrariesMachine Learning: The Bare Math Behind Libraries
Machine Learning: The Bare Math Behind LibrariesJ On The Beach
 
Getting started with Deep Reinforcement Learning
Getting started with Deep Reinforcement LearningGetting started with Deep Reinforcement Learning
Getting started with Deep Reinforcement LearningJ On The Beach
 

Mais de J On The Beach (20)

Massively scalable ETL in real world applications: the hard way
Massively scalable ETL in real world applications: the hard wayMassively scalable ETL in real world applications: the hard way
Massively scalable ETL in real world applications: the hard way
 
Acoustic Time Series in Industry 4.0: Improved Reliability and Cyber-Security...
Acoustic Time Series in Industry 4.0: Improved Reliability and Cyber-Security...Acoustic Time Series in Industry 4.0: Improved Reliability and Cyber-Security...
Acoustic Time Series in Industry 4.0: Improved Reliability and Cyber-Security...
 
Pushing it to the edge in IoT
Pushing it to the edge in IoTPushing it to the edge in IoT
Pushing it to the edge in IoT
 
Drinking from the firehose, with virtual streams and virtual actors
Drinking from the firehose, with virtual streams and virtual actorsDrinking from the firehose, with virtual streams and virtual actors
Drinking from the firehose, with virtual streams and virtual actors
 
How do we deploy? From Punched cards to Immutable server pattern
How do we deploy? From Punched cards to Immutable server patternHow do we deploy? From Punched cards to Immutable server pattern
How do we deploy? From Punched cards to Immutable server pattern
 
Java, Turbocharged
Java, TurbochargedJava, Turbocharged
Java, Turbocharged
 
When Cloud Native meets the Financial Sector
When Cloud Native meets the Financial SectorWhen Cloud Native meets the Financial Sector
When Cloud Native meets the Financial Sector
 
The big data Universe. Literally.
The big data Universe. Literally.The big data Universe. Literally.
The big data Universe. Literally.
 
Streaming to a New Jakarta EE
Streaming to a New Jakarta EEStreaming to a New Jakarta EE
Streaming to a New Jakarta EE
 
The TIPPSS Imperative for IoT - Ensuring Trust, Identity, Privacy, Protection...
The TIPPSS Imperative for IoT - Ensuring Trust, Identity, Privacy, Protection...The TIPPSS Imperative for IoT - Ensuring Trust, Identity, Privacy, Protection...
The TIPPSS Imperative for IoT - Ensuring Trust, Identity, Privacy, Protection...
 
Pushing AI to the Client with WebAssembly and Blazor
Pushing AI to the Client with WebAssembly and BlazorPushing AI to the Client with WebAssembly and Blazor
Pushing AI to the Client with WebAssembly and Blazor
 
Axon Server went RAFTing
Axon Server went RAFTingAxon Server went RAFTing
Axon Server went RAFTing
 
The Six Pitfalls of building a Microservices Architecture (and how to avoid t...
The Six Pitfalls of building a Microservices Architecture (and how to avoid t...The Six Pitfalls of building a Microservices Architecture (and how to avoid t...
The Six Pitfalls of building a Microservices Architecture (and how to avoid t...
 
Madaari : Ordering For The Monkeys
Madaari : Ordering For The MonkeysMadaari : Ordering For The Monkeys
Madaari : Ordering For The Monkeys
 
Servers are doomed to fail
Servers are doomed to failServers are doomed to fail
Servers are doomed to fail
 
Interaction Protocols: It's all about good manners
Interaction Protocols: It's all about good mannersInteraction Protocols: It's all about good manners
Interaction Protocols: It's all about good manners
 
A race of two compilers: GraalVM JIT versus HotSpot JIT C2. Which one offers ...
A race of two compilers: GraalVM JIT versus HotSpot JIT C2. Which one offers ...A race of two compilers: GraalVM JIT versus HotSpot JIT C2. Which one offers ...
A race of two compilers: GraalVM JIT versus HotSpot JIT C2. Which one offers ...
 
Leadership at every level
Leadership at every levelLeadership at every level
Leadership at every level
 
Machine Learning: The Bare Math Behind Libraries
Machine Learning: The Bare Math Behind LibrariesMachine Learning: The Bare Math Behind Libraries
Machine Learning: The Bare Math Behind Libraries
 
Getting started with Deep Reinforcement Learning
Getting started with Deep Reinforcement LearningGetting started with Deep Reinforcement Learning
Getting started with Deep Reinforcement Learning
 

Último

%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrandmasabamasaba
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfonteinmasabamasaba
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...Nitya salvi
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is insideshinachiaurasa2
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...Shane Coughlan
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension AidPhilip Schwarz
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...SelfMade bd
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrainmasabamasaba
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfVishalKumarJha10
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfryanfarris8
 
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
Pharm-D Biostatistics and Research methodology
Pharm-D Biostatistics and Research methodologyPharm-D Biostatistics and Research methodology
Pharm-D Biostatistics and Research methodologyAnusha Are
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale
 

Último (20)

%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
 
The title is not connected to what is inside
The title is not connected to what is insideThe title is not connected to what is inside
The title is not connected to what is inside
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfintroduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdf
 
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdfAzure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
Azure_Native_Qumulo_High_Performance_Compute_Benchmarks.pdf
 
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 18, Noida Call girls :8448380779 Model Escorts | 100% verified
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Pharm-D Biostatistics and Research methodology
Pharm-D Biostatistics and Research methodologyPharm-D Biostatistics and Research methodology
Pharm-D Biostatistics and Research methodology
 
VTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learnVTU technical seminar 8Th Sem on Scikit-learn
VTU technical seminar 8Th Sem on Scikit-learn
 

Big Data On Data You Don’t Have

  • 1. Information Classification: GENERAL BIG DATA (on Data you don’t have) 1
  • 2. Information Classification: GENERAL HOW DO WE DEAL WITH INFINITE DIMENSIONAL DATA….. BY GENERALIZING TRADITIONAL MAP REDUCE PARADIGM…….
  • 4. Information Classification: GENERAL THERE ARE FOUR SOURCES OF DATA 4 Data I have (traditional “Big Data”) Data I can model Data I can acquire Data someone else can acquire or model
  • 5. Information Classification: GENERAL HOW WE REPRESENT THESE ITEMS 5 Pre-Calculated Data Formulas you have Services Things People can Share with Me
  • 8. Information Classification: GENERAL MSCI PLATFORM – A NEXT GENERATION LEAP 8 Big Data Repository Hadoop / Cloudera etc Slice/Dice Traditional Big Data “Data you Have” Paradigm Beon New Front End NEW Big Data Paradigm Calculation and Data Services On Demand Data Expressions The Morning Load Virtual fields Dynamic new data
  • 10. Information Classification: GENERAL WHAT IS A COMPLEX QUESTION VERSUS A SPECIFIC QUESTION? 10 Specific questions can be hard, for example: • What happens to sea level if the temperature goes up 1.5 degrees by 2035? • What properties are on the beach and over x meters above sea level in Marbella? • What are the biggest real estate bargains in a portfolio. Complex questions are combinations of specific questions. • What should I buy if I believe that temperatures are going to raise 1.5 degrees by 2035 and I only want property that will be at least 1 meter above sea level in 2035 but still on the beach.
  • 11. Information Classification: GENERAL HOW TO ANSWER A COMPLEX QUESTION 11 So to answer a complex question you need something that can answer this Let Portfolio = All the houses in Marbella safeHouses = Filter( SeaLevel >= 1.0 + seaLevelRise(1.5 c)) Portfolio BestBargains = BargainFinder safeHouses It does this by calling the services below for certain calculations. Platform Marbella Houses Planet Simulator Sea Level RaiseHouse Database Execute the question above, Filtering, etc.. Bargain Finder
  • 12. Information Classification: GENERAL GENERALIZING MAP-REDUCE UH OH – SOME MATH…… 12
  • 13. Information Classification: GENERAL 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ → ℝ Just for simplicity, lets assume we only care about real numbers (obviously, we could have tuples, strings, dictionaries, any valid type honestly…) Standard map reduce, Gamma is your class object/structure/thing 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, ℂ → {ℝ, ℂ} First things first, we need a context.
  • 14. Information Classification: GENERAL ? Yesterday Today My Portfolio is worth $43 My Portfolio is worth $40 Result I lost $3  I lost $3/1.1 = € 2.72 My Portfolio is worth € 35.83 My Portfolio is worth € 36.36 I made € .53  The reason for the error is that this is a lie. You DID NOT LOSE $3. The answer is “I have made or lost ($40 in todays context - $43 in yesterdays context)”
  • 15. Information Classification: GENERAL Now we also toss in some services……. 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, ℂ → {ℝ, ℂ} Becomes 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, 𝑆, ℂ → {ℝ, ℂ} 𝑤ℎ𝑒𝑟𝑒 𝑆 = 𝑆1, 𝑆2, … , 𝑆𝑛 𝑜𝑢𝑟 𝑠𝑒𝑟𝑣𝑖𝑐𝑒𝑠 But what our services??? This is a functional language conference soooo, we use functions to access services. 𝑙𝑒𝑡 Ϝ = Ϝ𝑖, 𝑗 𝑎𝑙𝑙 𝑖, 𝑗 𝑤𝑖𝑡ℎ Ϝ𝑖, 𝑗: {Γ, 𝑆1,𝑆2,…., 𝑆𝑖,ℂ} → {ℝ, ℂ} 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, Ϝ, ℂ → {ℝ, ℂ} So new services can leverage old services
  • 16. Information Classification: GENERAL 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, Ϝ, ℂ → {ℝ, ℂ} 𝓏: ⊕𝑖=1…𝑚 Γ, Ϝ, ℂ → 𝑘=1…𝑛 {ℝ, ℂ} Data You Have Data You Can Acquire Data You Can Model Obvious Extensions…
  • 17. Information Classification: GENERAL 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ, Ϝ, ℂ → {ℝ, ℂ} 𝑎𝑏𝑠𝑡𝑟𝑎𝑐𝑡 𝑜𝑏𝑗𝑒𝑐𝑡 𝑠𝑝𝑎𝑐𝑒 Γ, Ϝ, ℂ , 𝑏𝑢𝑡 𝑡ℎ𝑒𝑟𝑒 𝑎𝑟𝑒 𝑜𝑡ℎ𝑒𝑟𝑠 Example: • Γ = Customer Records • F = purchasesOfWine(tenor) • ℂ = Date Example: • Γ = CountryList + wineSales • F = • weather() • totalWineSales(tenor) • ℂ = Date, weather Customer Space Country Space TRANSFORM
  • 18. Information Classification: GENERAL 18 Customer Location Wine purchasesOfWine(tenor) Bob Spain 1/1/2019 – 3btl 15/3/2019 – 2btl Mary France 15/1/2019 – 2btl Juan Spain 12/5/2019 – 6 btl Edward England 13/4/2019 – 8 btl TRANSFORM Country Purchases totalWineSales(tenor) Weather() Spain 11 bottles France 2 bottles England 8 bottles Γ1, Ϝ1, ℂ1 Γ2, Ϝ2, ℂ2 𝓣 𝟏 Γ2, Ϝ2, ℂ2 = 𝒯1 ∘ Γ1, Ϝ1, ℂ1
  • 19. Information Classification: GENERAL 𝓏: ⊕𝑖=1…𝑚 Γ, Ϝ, ℂ → 𝑘=1…𝑛 {ℝ, ℂ} Step 1: Step 2: 𝒯𝑘: ⊕𝑖=1…𝑚 Γ 𝑘, Ϝ 𝑘, ℂ 𝑘 →⊕𝑖=1…𝑛 Γ 𝑘 + 1, Ϝ 𝑘 + 1, ℂ 𝑘 + 1 Step 1: Step 2: ⊕𝑖=1…𝑚 Γ 𝑘, Ϝ 𝑘, ℂ 𝑘 = 𝒯𝑘 ∘ 𝒯𝑘−1 ∘ ⋯ ∘ 𝒯1 ∘⊕𝑖=1…𝑛 Γ1, Ϝ1, ℂ1 𝓏: ⊕𝑖=1…𝑚 Γ 𝑘, Ϝ 𝑘, ℂ 𝑘 → 𝑖=1…𝑛 {ℝ, ℂ} THE FINAL FORMULA 𝑖=1…𝑚 {ℝ, ℂ} = 𝓏 ∘ 𝒯𝑘 ∘ 𝒯𝑘−1 ∘ ⋯ ∘ 𝒯1 ∘⊕𝑖=1…𝑛 Γ1, Ϝ1, ℂ1
  • 22. Information Classification: GENERAL 𝑥 = 𝓏 ∘ 𝒯𝑘 ∘ 𝒯𝑘−1 ∘ ⋯ ∘ 𝒯1: ⊕𝑖=1…𝑛 Γ1, Ϝ1, ℂ1 → 𝑖=1…𝑚 {ℝ, ℂ} 𝑡𝑟𝑎𝑑𝑖𝑡𝑖𝑜𝑛𝑎𝑙 … . . 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ → ℝ 𝑁𝐸𝑊 𝐴𝑁𝐷 𝐼𝑀𝑃𝑅𝑂𝑉𝐸𝐷 𝑚𝑎𝑝 𝑥 𝑤ℎ𝑒𝑟𝑒 𝑥: Γ → ℝ Evaluation Evaluation
  • 23. Information Classification: GENERAL rootVFP |> scenario (asOf(15-05-2019)) |> Load “position” (filter(“MSCI USA – Daily”)) |> filter (instrument.ESG.WomenOnBoard = true) THIS IS NOT AN IMPERATIVE ORDERING!!!!!!!!!!!! Companies with Women on Board MSCI IBM Apple |> scenario (timeseries(Date(1,1,2019),Date(15,5,2019) ) ) Companies with Women on Board 1/1/2019 – {list of companies} 2/1/2019 – (list of companies) 3/1/2019 – (list of companies)
  • 24. Information Classification: GENERAL THIS NATURALLY LETS YOU MAKE A 5TH GENERATION FRONT END
  • 27. Information Classification: GENERAL Service API layer MSCI BEON – A NEW PARADIGM 27 Framework based on the Beon Engine Functions Library Process X I’m Process X and I can provide x Process Y I’m Process Y and I can provide y Process S Process T Process C x -> ProcessX y -> ProcessY s -> ProcessS t -> ProcessT c -> ProcessC Beon Engine a = x + y b = s / t
  • 28. Information Classification: GENERAL Service API layer MSCI BEON – A NEW PARADIGM 28 Everything starts with a question … Functions Library Process X Process Y Process S Process T Process C x -> ProcessX y -> ProcessY s -> ProcessS t -> ProcessT c -> ProcessC Beon Engine a = x + y b = s / t Query API ResultSpec request
  • 29. Information Classification: GENERAL Service API layer MSCI BEON – A NEW PARADIGM 29 The question is then expanded, compiled into byte code, and then parametrized with a context … Functions Library Process X Process Y Process S Process T Process C x -> ProcessX y -> ProcessY s -> ProcessS t -> ProcessT c -> ProcessC Beon Engine a = x + y b = s / t Query API ResultSpec request Compiler Execution Enginea s w d t m o u c h p a s w d m o c h p a s w d c a s c Context Compiler
  • 30. Information Classification: GENERAL Service API layer MSCI BEON – A NEW PARADIGM 30 Then executed against the various data services. Results are then recombined and presented back. Functions Library Process X Process Y Process S Process T Process C x -> ProcessX y -> ProcessY s -> ProcessS t -> ProcessT c -> ProcessC Beon Engine a = x + y b = s / t Query API ResultSpec request Compiler Execution Enginea s w d t m o u c h p a s w d m o c h p a s w d c a s c Conte xt Processing …