SlideShare uma empresa Scribd logo
1 de 36
1© Cloudera, Inc. All rights reserved.
Advanced Analytics for
Investment Firms
Faster Time to Value and Higher Returns
for Your Analytics Projects
Dr. Richard Harmon
Sean Anderson
2© Cloudera, Inc. All rights reserved.
Types of Analytics
Dr. Richard Harmon
Financial Services Industry Leader
Cloudera
Machine learning and advanced analytics for
higher investment returns
Develop better models faster with Data
Science Workbench
Sean Anderson
Sr. Product Marketing Manager, Data
Science and Machine Learning
Cloudera
3© Cloudera, Inc. All rights reserved.
Our relationship with data is changing
Data is now a strategic asset, how you use it is a key differentiator
4© Cloudera, Inc. All rights reserved.
• What makes “big data” big?
⎼ Volume?
⎼ Variety?
⎼ Velocity?
• Data becomes big when we take one or
more large data sets and start to analyze
relationships between observations
Big Data and Data Products
5© Cloudera, Inc. All rights reserved.
Wrangling Big Data is Time Consuming
Data preparation accounts for about 80% of the work of data scientists
(Source: CrowdFlower)
6© Cloudera, Inc. All rights reserved.
METHODS
DATA
Data Mining
Machine Learning
Large Scale
Data Analysis
OPTIMIZATION
Optimization
Techniques
SIMULATION
High Performance
Computing
Spatial
Computation
Run-Time
Monitoring
Numerical
Simulation
NATURE INSPIRED
Evolutionary
Algorithms
Swarm Insects
Gene Regulatory
Networks
Reasoning
Compositional
ReasoningStrategic
Reasoning
Artificial
Intelligence
FORMAL METHODS
Model
Validation
Program
Analysis
DECISION MAKING
New Institutional
Economics
Incentive
Schemes
Game
Theory
MODELING
Agent-Based
Modeling
Complex System
Methods
Multi-Scale
Modeling Dynamical
Modeling
Social Aspects
Social
Computing
Human Systems
An Analytical Methods Viewpoint based upon the Approach Used
7© Cloudera, Inc. All rights reserved.
An Overview of the Analytics Space from a AI Perspective
8© Cloudera, Inc. All rights reserved.
A Machine Learning Viewpoint with a Use Case Mapping
9© Cloudera, Inc. All rights reserved.
Storage
• Archival
• Traceability
Batch
• ETL
• Data
Validation
• Reg Reporting
Interactive
• Risk
Aggregation
• Stress Testing
HPC
• Risk Modeling
• Backtesting
Streaming &
Real Time
• Mkt
Surveillance
• Best execution
Evolution
Risk & Regulatory Compliance Use Cases on Big Data
Cloudera Data
Science
Workbench
10© Cloudera, Inc. All rights reserved.
Why is Hadoop Ideal for Data Science?
• High volume, low-cost shared storage = more data, more kinds of data
• Parallel compute, local to the data = more experiments, better results
• Scalable, fault tolerant = easy scale out, not just scale up
• Flexible, multipurpose data platform = easier path to production
• Superior flexibility and price/performance to any other data platform
11© Cloudera, Inc. All rights reserved.
Cloudera Enterprise Data Hub
Handle real-time
data ingest from
diverse sources
Governance and
Security
Data Streams
Deployment Flexibility
Machine Learning
Capabilities
Diverse Analytical
Options
Combine Data from Different Sources
Data Mgmt. Hub
Scale easily & Cost
effectively
Batch or Real- time
Data Streams
Data Sources
Data Sources
Data Storage &
Processing
Reporting, Analytics &
Auditing
Data Ingest
Other
Data Governance (Data Lineage, Data Protection)
12© Cloudera, Inc. All rights reserved.
Why Cloudera Enterprise
Making Hadoop Fast, Easy, and Secure
Hadoop:
• One place for unlimited data
• Unified data access
Cloudera makes it:
• Fast for business
• Easy to manage
• Secure without compromise
13© Cloudera, Inc. All rights reserved.
Machine Learning in
Investment Management
14© Cloudera, Inc. All rights reserved.
Advanced Analytics Use Cases
Patterns in Market Data
Satellite Image Analysis
Simulation for
Strategy Modeling
News Feeds for
Economic Forecasting
Market Sentiment Localized Pricing
Risk Assessment Compliance
15© Cloudera, Inc. All rights reserved.
Interconnected Risk Indicator: CoVaR (Conditional Value at Risk)
CoVaR: Conditional Value at Risk
• Dr. Tobias Adrian (NY FED) & Prof. Markus Brunnermeier (Princeton Univ) –
American Economic Review, Vol 106, no. 7, July 2016
• Measures “tail-event” linkages
• It is the VaR (Value at Risk) of the whole financial sector conditional on one or more
institutions being in distress.
• CoVaR - measures interconnectedness, spillover effects, risk transmission,
deleveraging, etc.
16© Cloudera, Inc. All rights reserved.
CoVaR Estimation: Application of ML for Improved Model Fit
Quantile Regression vs. Kernel Quantile Regression
Kernel Quantile Regression (KQR-CoVaR)
Robust
Quantile Regression
Noisy
Adrian (NY FED) & Brunnermeier (Princeton Univ) Harmon (Cloudera), Bittman, Ruttenberg (SAP)
17© Cloudera, Inc. All rights reserved.
Improved Economic Forecasting:
Predicting US Non-Farm Payroll (NFP)
Abby Levenberg, Stephen Pulman, Karo Moilanen, Edwin Simpson, and Stephen Roberts, “Predicting Economic Indicators from Web
Text Using Sentiment Composition”, International Journal of Computer and Communication Engineering, Vol. 3, No. 2, March 2014.
(http://www.ijcce.org/papers/302-E3007.pdf)
18© Cloudera, Inc. All rights reserved.
Modeling The Real World – Simulation & Optimization
Agent-Based Models (ABMs)
• Agent-based models (ABM) consist of heterogeneous agents
that are allowed to freely and randomly interact in the
interests of maximizing their own goals.
• ABMs are uniquely able to include evolving intelligence that
allows agents to learn and adjust as new information comes
into the system and how the system evolves in response to
this new information.
• ABM’s are ideally suited to a model behavioral heterogeneity,
and the dynamics that result from that heterogeneity.
19© Cloudera, Inc. All rights reserved.
Mortgage Prepayment Reduced Sensitivity: (Burnout)
In US RMBS market, the phenomenon known as interest rate burnout
arises from the changing composition of homeowners in a fixed
mortgage pool over time. Many prepayment models applied various
techniques to track heterogeneous borrower behavior within mortgage
pools.
JPM created multiple dimensions of homeowner incentives - essentially
the homeowner must be "ready”, "willing”, and "able” to refinance.
This early form of ABM was a key way to model heterogeneous
behavior when individual loan data was not access for a pool of
mortgages.
Option Adjusted Spread (OAS) Analysis
OAS is a widely-used computational simulation valuation model for
RMBS. This requires a Monte Carlo based simulation of the distribution
of future cash flows under thousands of different interest rate paths.
Residential MBS Investment: Mortgage Prepayment Modeling
20© Cloudera, Inc. All rights reserved.
What is Endogenous Risk?
- The conventional view was that markets were huge and anybody trading in the market was
small by comparison – that you could neglect the impact of what the firm was doing.
- In most empirical risk models, risk is determined by exposure to a group of factors – markets,
industries, countries, exchange rates, ... and using volatility as a metric for uncertainty.
- While these models allow for uncertainty about those exposures, but often ignore
endogenous influences such as how market participants react to each other.
- Endogenous risk refers to the risk that comes from market participants reacting to each other
rather than reacting to outside forces.
Endogenous Risk – A New Challenge for Investment Managers
Source: “Avoiding crowds: BlackRock leads push to model 'endogenous' risk”, Risk.Net, 2016.
”A market becoming more correlated more quickly is a market becoming increasingly hard to predict.”
21© Cloudera, Inc. All rights reserved.
Feed-Back Loops & Tipping Points:
Jean-Philippe Bouchaud of Capital Fund Management (CFM) thinks investors with even relatively
small positions can be trapped in a feedback loop where they cause asset prices to fall by selling,
and are forced to liquidate more assets as a result.
The logic here is that firms should look at holdings relative to securities available in the market,
rather than compared with the whole universe of issued securities.
CFM uses an agent-based modeling approach that takes into account the size of holdings by
market participants to help evaluate the tipping point at which they risk creating their own
deleveraging vortex.
Endogenous Risk – New Challenge for Investment Managers (Con’t)
Source: “Avoiding crowds: BlackRock leads push to model 'endogenous' risk”, Risk.Net, 2016.
22© Cloudera, Inc. All rights reserved.
Polling Questions #1
What kind of advanced analytics do you use for your investment strategy?
1. Machine Learning
2. Natural Language Processing
3. Simulations/Agent-Based
4. Multiple Approaches
5. None
23© Cloudera, Inc. All rights reserved.
Alternative Data Sources
& “Quantamental Investing”
24© Cloudera, Inc. All rights reserved.
What is Alternative Data?
"Alternative data draws from non-traditional data sources,
so that when you apply analytics to the data, they yield
additional insights that complement the information you
receive from traditional sources."
S&P Global CIO Krishna Nathan
(CIO, January 3, 2017)
25© Cloudera, Inc. All rights reserved.
“Quantamental” Investing – “Big Data” driven investment approach
- Quantamental managers combine the bottom-up stock-picking skills of fundamental
investors with the use of computing power and big-data sets to test their hypotheses.
- Quantamental investing has been driven in part by the rise of so-called alternative
data. Fund managers can now study everything from social-media data (to predict
footfall in a location or sentiment around a new movie) to the number of cars in a mall
parking lot.
Merging Quantitative Analysis, AI, and Fundamental Research
Some $30 billion in assets (about 11 percent of active equity funds) will be targeted,
with $6 billion rebranded BlackRock Advantage funds. These funds focus on quantitative
and other strategies that adopt a more rules-based approach to investing.
“At BlackRock, Machines Are Rising Over Managers to Pick Stocks” – NY Times, March 28, 2017
26© Cloudera, Inc. All rights reserved.
Alternative Data Sources - Examples
Satellite Data - These are companies that
utilize image data from orbiting satellites to
do things like measure the number of cars in
Walmart parking lots or farm health based on
the color of crops.
Web/App/Social Media Data – These are
companies which mine social media or use data
firehoses from the web/mobile to understand
what’s happening in the world or how people are
interacting with their devices.
Weather Data – These are companies which are
developing weather models and utilizing more
sensors to get better localized data or improve
weather forecasting.
Location/Foot Traffic Data – Companies
that use different means to understand
where consumers are going by measuring
foot traffic via check-ins, video analysis, etc.
Local Prices – These companies can see what’s
happening to prices and inflation by
aggregating data from ground-level sources.
Alternative Credit - Companies developing new
credit models that utilize sources of alternative
data (like mobile usage).
Credit Card Transactions – These are
companies that use anonymous
aggregate transaction data to understand
trends in consumer purchasing habits.
Alternative Data Monetizers/Aggregators –
These are companies who pay for access to
individual data streams which become more
valuable in a bundle, and then sell those
packages to investors.
27© Cloudera, Inc. All rights reserved.
Alternative Data Source Vendors
28© Cloudera, Inc. All rights reserved.
Alternative Data Example: Best Predictor of Chipotle Sales
On April 12, Jeff Glueck, the
CEO of Foursquare, published
a post predicting that
Chipotle's first-quarter sales
would be down nearly 30%.
That was based on foot-traffic
statistics built from explicit
check-ins and implicit visits
from Foursquare and Swarm
app users who enable
background location.
The chart below shows the share of visits to Chipotle restaurants in comparison to visits to ALL restaurants in the
United States. In the 2015–2016 winter, visits to Chipotle restaurants declined more significantly than in 2014–2015.
29© Cloudera, Inc. All rights reserved.
Advanced Text Analytics: Asset Management Use Case
Partners
Big Data Enabled Investment Management Process:
Goal: To provide new data sources and analytic tools to streamline and
improve the portfolio investment process.
Benefits:
- Centralized, secure & scalable investment analysis data lake
- Streamlined information management and analysis process
- Flexibility for portfolio managers to utilize any tools and any data sources
- New insights from sentiment/emotional analysis of sell-side analyst
reports and news feeds
- Expanded usage of ML-based analytics for macroeconomic, risk and
market analysis.
- Automated market, portfolio and event monitoring and alerts
- Part of a wider digital transformation initiative
30© Cloudera, Inc. All rights reserved.
Open data science in the enterprise
IT
drive adoption while maintaining compliance
Data Scientist
explore, experiment, iterate
31© Cloudera, Inc. All rights reserved.
Our goal: An open platform for data science at scale
Help more data scientists
use the power of Hadoop
Use a powerful, familiar
environment with direct access to
Hadoop data and compute
Data Scientist
Data Engineer
Make it easy and secure to
add new users, use cases
Offer secure self-service analytics
and a faster path to production on
common, affordable infrastructure
Enterprise Architect
Hadoop Admin
32© Cloudera, Inc. All rights reserved.
Introducing Cloudera Data Science Workbench
Self-service data science for the enterprise
Accelerates data science from
development to production with:
• Secure self-service environments
for data scientists to work against
Cloudera clusters
• Support for Python, R, and Scala,
plus project dependency isolation
for multiple library versions
• Workflow automation, version
control, collaboration and sharing
33© Cloudera, Inc. All rights reserved.
Polling Question #2
How fast do you need to ingest your data?
1. sec
2. ms
3. hours
4. min
5. don’t know
34© Cloudera, Inc. All rights reserved.
Demo
35© Cloudera, Inc. All rights reserved.
Q&A
36© Cloudera, Inc. All rights reserved.
Thank you

Mais conteúdo relacionado

Mais procurados

Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud WorldPart 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud WorldCloudera, Inc.
 
How to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data PlatformHow to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data PlatformCloudera, Inc.
 
Big data journey to the cloud maz chaudhri 5.30.18
Big data journey to the cloud   maz chaudhri 5.30.18Big data journey to the cloud   maz chaudhri 5.30.18
Big data journey to the cloud maz chaudhri 5.30.18Cloudera, Inc.
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Cloudera, Inc.
 
From Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationFrom Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationCloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Cloudera, Inc.
 
Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr
Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr
Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
Introducing Cloudera Navigator Optimizer: Offload Assessments and Active Data...
Introducing Cloudera Navigator Optimizer: Offload Assessments and Active Data...Introducing Cloudera Navigator Optimizer: Offload Assessments and Active Data...
Introducing Cloudera Navigator Optimizer: Offload Assessments and Active Data...Cloudera, Inc.
 
Turning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformTurning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformCloudera, Inc.
 
Cloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made EasyCloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made EasyCloudera, Inc.
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Cloudera, Inc.
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...Cloudera, Inc.
 
Preparing for the Cybersecurity Renaissance
Preparing for the Cybersecurity RenaissancePreparing for the Cybersecurity Renaissance
Preparing for the Cybersecurity RenaissanceCloudera, Inc.
 
Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Cloudera, Inc.
 
The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)Cloudera, Inc.
 
Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...
Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...
Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...Cloudera, Inc.
 
Intuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with SearchIntuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with SearchCloudera, Inc.
 

Mais procurados (20)

Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud WorldPart 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
Part 1: Cloudera’s Analytic Database: BI & SQL Analytics in a Hybrid Cloud World
 
How to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data PlatformHow to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
How to Build Multi-disciplinary Analytics Applications on a Shared Data Platform
 
Big data journey to the cloud maz chaudhri 5.30.18
Big data journey to the cloud   maz chaudhri 5.30.18Big data journey to the cloud   maz chaudhri 5.30.18
Big data journey to the cloud maz chaudhri 5.30.18
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
From Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationFrom Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your Organization
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
Multidisziplinäre Analyseanwendungen auf einer gemeinsamen Datenplattform ers...
 
Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr
Analyzing Hadoop Data Using Sparklyr

Analyzing Hadoop Data Using Sparklyr

 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Introducing Cloudera Navigator Optimizer: Offload Assessments and Active Data...
Introducing Cloudera Navigator Optimizer: Offload Assessments and Active Data...Introducing Cloudera Navigator Optimizer: Offload Assessments and Active Data...
Introducing Cloudera Navigator Optimizer: Offload Assessments and Active Data...
 
Turning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformTurning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data Platform
 
Cloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made EasyCloudera Altus: Big Data in the Cloud Made Easy
Cloudera Altus: Big Data in the Cloud Made Easy
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
 
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
New Performance Benchmarks: Apache Impala (incubating) Leads Traditional Anal...
 
Preparing for the Cybersecurity Renaissance
Preparing for the Cybersecurity RenaissancePreparing for the Cybersecurity Renaissance
Preparing for the Cybersecurity Renaissance
 
Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18Consolidate your data marts for fast, flexible analytics 5.24.18
Consolidate your data marts for fast, flexible analytics 5.24.18
 
The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)
 
Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...
Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...
Optimized Data Management with Cloudera 5.7: Understanding data value with Cl...
 
Intuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with SearchIntuitive Real-Time Analytics with Search
Intuitive Real-Time Analytics with Search
 

Semelhante a Advanced Analytics for Investment Firms and Machine Learning

Accenture 2017 Global Risk Study: Capital Markets Summary
Accenture 2017 Global Risk Study: Capital Markets SummaryAccenture 2017 Global Risk Study: Capital Markets Summary
Accenture 2017 Global Risk Study: Capital Markets Summaryaccenture
 
ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty ComputationISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty ComputationUlfMattsson7
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyCloudera, Inc.
 
GSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-EditionGSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-EditionGang Li
 
From information to intelligence
From information to intelligence From information to intelligence
From information to intelligence Srini Koushik
 
New technologies for data protection
New technologies for data protectionNew technologies for data protection
New technologies for data protectionUlf Mattsson
 
Energy Central Webinar on June 14, 2016
Energy Central Webinar on June 14, 2016Energy Central Webinar on June 14, 2016
Energy Central Webinar on June 14, 2016OMNETRIC
 
Gene Villeneuve - Moving from descriptive to cognitive analytics
Gene Villeneuve - Moving from descriptive to cognitive analyticsGene Villeneuve - Moving from descriptive to cognitive analytics
Gene Villeneuve - Moving from descriptive to cognitive analyticsIBM Sverige
 
Bringing AI into the Enterprise: A Machine Learning Primer
Bringing AI into the Enterprise: A Machine Learning PrimerBringing AI into the Enterprise: A Machine Learning Primer
Bringing AI into the Enterprise: A Machine Learning Primermercatoradvisory
 
Elastic's recommendation on keeping services up and running with real-time vi...
Elastic's recommendation on keeping services up and running with real-time vi...Elastic's recommendation on keeping services up and running with real-time vi...
Elastic's recommendation on keeping services up and running with real-time vi...FaithWestdorp
 
yellowibm
yellowibmyellowibm
yellowibmKay Orr
 
Better business outcomes with Big Data Analytics
Better business outcomes with Big Data AnalyticsBetter business outcomes with Big Data Analytics
Better business outcomes with Big Data AnalyticsBillington K
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...IT Support Engineer
 
CS309A Final Paper_KM_DD
CS309A Final Paper_KM_DDCS309A Final Paper_KM_DD
CS309A Final Paper_KM_DDDavid Darrough
 
Putting data science into perspective
Putting data science into perspectivePutting data science into perspective
Putting data science into perspectiveSravan Ankaraju
 
Better Business Outcomes with Big Data Analytics
Better Business Outcomes with Big Data AnalyticsBetter Business Outcomes with Big Data Analytics
Better Business Outcomes with Big Data AnalyticsIBM Software India
 

Semelhante a Advanced Analytics for Investment Firms and Machine Learning (20)

Accenture 2017 Global Risk Study: Capital Markets Summary
Accenture 2017 Global Risk Study: Capital Markets SummaryAccenture 2017 Global Risk Study: Capital Markets Summary
Accenture 2017 Global Risk Study: Capital Markets Summary
 
ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty ComputationISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
 
Modernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data StrategyModernizing Architecture for a Complete Data Strategy
Modernizing Architecture for a Complete Data Strategy
 
MACHINE LEARNING – THE WHY, WHAT AND HOW
MACHINE LEARNING –  THE WHY, WHAT AND HOWMACHINE LEARNING –  THE WHY, WHAT AND HOW
MACHINE LEARNING – THE WHY, WHAT AND HOW
 
GSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-EditionGSAMPerspectives7-BigData-Edition
GSAMPerspectives7-BigData-Edition
 
From information to intelligence
From information to intelligence From information to intelligence
From information to intelligence
 
New technologies for data protection
New technologies for data protectionNew technologies for data protection
New technologies for data protection
 
Energy Central Webinar on June 14, 2016
Energy Central Webinar on June 14, 2016Energy Central Webinar on June 14, 2016
Energy Central Webinar on June 14, 2016
 
Gene Villeneuve - Moving from descriptive to cognitive analytics
Gene Villeneuve - Moving from descriptive to cognitive analyticsGene Villeneuve - Moving from descriptive to cognitive analytics
Gene Villeneuve - Moving from descriptive to cognitive analytics
 
Bringing AI into the Enterprise: A Machine Learning Primer
Bringing AI into the Enterprise: A Machine Learning PrimerBringing AI into the Enterprise: A Machine Learning Primer
Bringing AI into the Enterprise: A Machine Learning Primer
 
Elastic's recommendation on keeping services up and running with real-time vi...
Elastic's recommendation on keeping services up and running with real-time vi...Elastic's recommendation on keeping services up and running with real-time vi...
Elastic's recommendation on keeping services up and running with real-time vi...
 
yellowibm
yellowibmyellowibm
yellowibm
 
yellowibm
yellowibmyellowibm
yellowibm
 
Better business outcomes with Big Data Analytics
Better business outcomes with Big Data AnalyticsBetter business outcomes with Big Data Analytics
Better business outcomes with Big Data Analytics
 
yellowibm
yellowibmyellowibm
yellowibm
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
 
CS309A Final Paper_KM_DD
CS309A Final Paper_KM_DDCS309A Final Paper_KM_DD
CS309A Final Paper_KM_DD
 
Putting data science into perspective
Putting data science into perspectivePutting data science into perspective
Putting data science into perspective
 
ForresterPredictiveWave
ForresterPredictiveWaveForresterPredictiveWave
ForresterPredictiveWave
 
Better Business Outcomes with Big Data Analytics
Better Business Outcomes with Big Data AnalyticsBetter Business Outcomes with Big Data Analytics
Better Business Outcomes with Big Data Analytics
 

Mais de Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 
Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Cloudera, Inc.
 

Mais de Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 
Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18Introducing Workload XM 8.7.18
Introducing Workload XM 8.7.18
 

Último

Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Cizo Technology Services
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
Best Web Development Agency- Idiosys USA.pdf
Best Web Development Agency- Idiosys USA.pdfBest Web Development Agency- Idiosys USA.pdf
Best Web Development Agency- Idiosys USA.pdfIdiosysTechnologies1
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceBrainSell Technologies
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commercemanigoyal112
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...OnePlan Solutions
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odishasmiwainfosol
 

Último (20)

Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
Global Identity Enrolment and Verification Pro Solution - Cizo Technology Ser...
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
Best Web Development Agency- Idiosys USA.pdf
Best Web Development Agency- Idiosys USA.pdfBest Web Development Agency- Idiosys USA.pdf
Best Web Development Agency- Idiosys USA.pdf
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
CRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. SalesforceCRM Contender Series: HubSpot vs. Salesforce
CRM Contender Series: HubSpot vs. Salesforce
 
Cyber security and its impact on E commerce
Cyber security and its impact on E commerceCyber security and its impact on E commerce
Cyber security and its impact on E commerce
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
Tech Tuesday - Mastering Time Management Unlock the Power of OnePlan's Timesh...
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
 
2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva2.pdf Ejercicios de programación competitiva
2.pdf Ejercicios de programación competitiva
 

Advanced Analytics for Investment Firms and Machine Learning

  • 1. 1© Cloudera, Inc. All rights reserved. Advanced Analytics for Investment Firms Faster Time to Value and Higher Returns for Your Analytics Projects Dr. Richard Harmon Sean Anderson
  • 2. 2© Cloudera, Inc. All rights reserved. Types of Analytics Dr. Richard Harmon Financial Services Industry Leader Cloudera Machine learning and advanced analytics for higher investment returns Develop better models faster with Data Science Workbench Sean Anderson Sr. Product Marketing Manager, Data Science and Machine Learning Cloudera
  • 3. 3© Cloudera, Inc. All rights reserved. Our relationship with data is changing Data is now a strategic asset, how you use it is a key differentiator
  • 4. 4© Cloudera, Inc. All rights reserved. • What makes “big data” big? ⎼ Volume? ⎼ Variety? ⎼ Velocity? • Data becomes big when we take one or more large data sets and start to analyze relationships between observations Big Data and Data Products
  • 5. 5© Cloudera, Inc. All rights reserved. Wrangling Big Data is Time Consuming Data preparation accounts for about 80% of the work of data scientists (Source: CrowdFlower)
  • 6. 6© Cloudera, Inc. All rights reserved. METHODS DATA Data Mining Machine Learning Large Scale Data Analysis OPTIMIZATION Optimization Techniques SIMULATION High Performance Computing Spatial Computation Run-Time Monitoring Numerical Simulation NATURE INSPIRED Evolutionary Algorithms Swarm Insects Gene Regulatory Networks Reasoning Compositional ReasoningStrategic Reasoning Artificial Intelligence FORMAL METHODS Model Validation Program Analysis DECISION MAKING New Institutional Economics Incentive Schemes Game Theory MODELING Agent-Based Modeling Complex System Methods Multi-Scale Modeling Dynamical Modeling Social Aspects Social Computing Human Systems An Analytical Methods Viewpoint based upon the Approach Used
  • 7. 7© Cloudera, Inc. All rights reserved. An Overview of the Analytics Space from a AI Perspective
  • 8. 8© Cloudera, Inc. All rights reserved. A Machine Learning Viewpoint with a Use Case Mapping
  • 9. 9© Cloudera, Inc. All rights reserved. Storage • Archival • Traceability Batch • ETL • Data Validation • Reg Reporting Interactive • Risk Aggregation • Stress Testing HPC • Risk Modeling • Backtesting Streaming & Real Time • Mkt Surveillance • Best execution Evolution Risk & Regulatory Compliance Use Cases on Big Data Cloudera Data Science Workbench
  • 10. 10© Cloudera, Inc. All rights reserved. Why is Hadoop Ideal for Data Science? • High volume, low-cost shared storage = more data, more kinds of data • Parallel compute, local to the data = more experiments, better results • Scalable, fault tolerant = easy scale out, not just scale up • Flexible, multipurpose data platform = easier path to production • Superior flexibility and price/performance to any other data platform
  • 11. 11© Cloudera, Inc. All rights reserved. Cloudera Enterprise Data Hub Handle real-time data ingest from diverse sources Governance and Security Data Streams Deployment Flexibility Machine Learning Capabilities Diverse Analytical Options Combine Data from Different Sources Data Mgmt. Hub Scale easily & Cost effectively Batch or Real- time Data Streams Data Sources Data Sources Data Storage & Processing Reporting, Analytics & Auditing Data Ingest Other Data Governance (Data Lineage, Data Protection)
  • 12. 12© Cloudera, Inc. All rights reserved. Why Cloudera Enterprise Making Hadoop Fast, Easy, and Secure Hadoop: • One place for unlimited data • Unified data access Cloudera makes it: • Fast for business • Easy to manage • Secure without compromise
  • 13. 13© Cloudera, Inc. All rights reserved. Machine Learning in Investment Management
  • 14. 14© Cloudera, Inc. All rights reserved. Advanced Analytics Use Cases Patterns in Market Data Satellite Image Analysis Simulation for Strategy Modeling News Feeds for Economic Forecasting Market Sentiment Localized Pricing Risk Assessment Compliance
  • 15. 15© Cloudera, Inc. All rights reserved. Interconnected Risk Indicator: CoVaR (Conditional Value at Risk) CoVaR: Conditional Value at Risk • Dr. Tobias Adrian (NY FED) & Prof. Markus Brunnermeier (Princeton Univ) – American Economic Review, Vol 106, no. 7, July 2016 • Measures “tail-event” linkages • It is the VaR (Value at Risk) of the whole financial sector conditional on one or more institutions being in distress. • CoVaR - measures interconnectedness, spillover effects, risk transmission, deleveraging, etc.
  • 16. 16© Cloudera, Inc. All rights reserved. CoVaR Estimation: Application of ML for Improved Model Fit Quantile Regression vs. Kernel Quantile Regression Kernel Quantile Regression (KQR-CoVaR) Robust Quantile Regression Noisy Adrian (NY FED) & Brunnermeier (Princeton Univ) Harmon (Cloudera), Bittman, Ruttenberg (SAP)
  • 17. 17© Cloudera, Inc. All rights reserved. Improved Economic Forecasting: Predicting US Non-Farm Payroll (NFP) Abby Levenberg, Stephen Pulman, Karo Moilanen, Edwin Simpson, and Stephen Roberts, “Predicting Economic Indicators from Web Text Using Sentiment Composition”, International Journal of Computer and Communication Engineering, Vol. 3, No. 2, March 2014. (http://www.ijcce.org/papers/302-E3007.pdf)
  • 18. 18© Cloudera, Inc. All rights reserved. Modeling The Real World – Simulation & Optimization Agent-Based Models (ABMs) • Agent-based models (ABM) consist of heterogeneous agents that are allowed to freely and randomly interact in the interests of maximizing their own goals. • ABMs are uniquely able to include evolving intelligence that allows agents to learn and adjust as new information comes into the system and how the system evolves in response to this new information. • ABM’s are ideally suited to a model behavioral heterogeneity, and the dynamics that result from that heterogeneity.
  • 19. 19© Cloudera, Inc. All rights reserved. Mortgage Prepayment Reduced Sensitivity: (Burnout) In US RMBS market, the phenomenon known as interest rate burnout arises from the changing composition of homeowners in a fixed mortgage pool over time. Many prepayment models applied various techniques to track heterogeneous borrower behavior within mortgage pools. JPM created multiple dimensions of homeowner incentives - essentially the homeowner must be "ready”, "willing”, and "able” to refinance. This early form of ABM was a key way to model heterogeneous behavior when individual loan data was not access for a pool of mortgages. Option Adjusted Spread (OAS) Analysis OAS is a widely-used computational simulation valuation model for RMBS. This requires a Monte Carlo based simulation of the distribution of future cash flows under thousands of different interest rate paths. Residential MBS Investment: Mortgage Prepayment Modeling
  • 20. 20© Cloudera, Inc. All rights reserved. What is Endogenous Risk? - The conventional view was that markets were huge and anybody trading in the market was small by comparison – that you could neglect the impact of what the firm was doing. - In most empirical risk models, risk is determined by exposure to a group of factors – markets, industries, countries, exchange rates, ... and using volatility as a metric for uncertainty. - While these models allow for uncertainty about those exposures, but often ignore endogenous influences such as how market participants react to each other. - Endogenous risk refers to the risk that comes from market participants reacting to each other rather than reacting to outside forces. Endogenous Risk – A New Challenge for Investment Managers Source: “Avoiding crowds: BlackRock leads push to model 'endogenous' risk”, Risk.Net, 2016. ”A market becoming more correlated more quickly is a market becoming increasingly hard to predict.”
  • 21. 21© Cloudera, Inc. All rights reserved. Feed-Back Loops & Tipping Points: Jean-Philippe Bouchaud of Capital Fund Management (CFM) thinks investors with even relatively small positions can be trapped in a feedback loop where they cause asset prices to fall by selling, and are forced to liquidate more assets as a result. The logic here is that firms should look at holdings relative to securities available in the market, rather than compared with the whole universe of issued securities. CFM uses an agent-based modeling approach that takes into account the size of holdings by market participants to help evaluate the tipping point at which they risk creating their own deleveraging vortex. Endogenous Risk – New Challenge for Investment Managers (Con’t) Source: “Avoiding crowds: BlackRock leads push to model 'endogenous' risk”, Risk.Net, 2016.
  • 22. 22© Cloudera, Inc. All rights reserved. Polling Questions #1 What kind of advanced analytics do you use for your investment strategy? 1. Machine Learning 2. Natural Language Processing 3. Simulations/Agent-Based 4. Multiple Approaches 5. None
  • 23. 23© Cloudera, Inc. All rights reserved. Alternative Data Sources & “Quantamental Investing”
  • 24. 24© Cloudera, Inc. All rights reserved. What is Alternative Data? "Alternative data draws from non-traditional data sources, so that when you apply analytics to the data, they yield additional insights that complement the information you receive from traditional sources." S&P Global CIO Krishna Nathan (CIO, January 3, 2017)
  • 25. 25© Cloudera, Inc. All rights reserved. “Quantamental” Investing – “Big Data” driven investment approach - Quantamental managers combine the bottom-up stock-picking skills of fundamental investors with the use of computing power and big-data sets to test their hypotheses. - Quantamental investing has been driven in part by the rise of so-called alternative data. Fund managers can now study everything from social-media data (to predict footfall in a location or sentiment around a new movie) to the number of cars in a mall parking lot. Merging Quantitative Analysis, AI, and Fundamental Research Some $30 billion in assets (about 11 percent of active equity funds) will be targeted, with $6 billion rebranded BlackRock Advantage funds. These funds focus on quantitative and other strategies that adopt a more rules-based approach to investing. “At BlackRock, Machines Are Rising Over Managers to Pick Stocks” – NY Times, March 28, 2017
  • 26. 26© Cloudera, Inc. All rights reserved. Alternative Data Sources - Examples Satellite Data - These are companies that utilize image data from orbiting satellites to do things like measure the number of cars in Walmart parking lots or farm health based on the color of crops. Web/App/Social Media Data – These are companies which mine social media or use data firehoses from the web/mobile to understand what’s happening in the world or how people are interacting with their devices. Weather Data – These are companies which are developing weather models and utilizing more sensors to get better localized data or improve weather forecasting. Location/Foot Traffic Data – Companies that use different means to understand where consumers are going by measuring foot traffic via check-ins, video analysis, etc. Local Prices – These companies can see what’s happening to prices and inflation by aggregating data from ground-level sources. Alternative Credit - Companies developing new credit models that utilize sources of alternative data (like mobile usage). Credit Card Transactions – These are companies that use anonymous aggregate transaction data to understand trends in consumer purchasing habits. Alternative Data Monetizers/Aggregators – These are companies who pay for access to individual data streams which become more valuable in a bundle, and then sell those packages to investors.
  • 27. 27© Cloudera, Inc. All rights reserved. Alternative Data Source Vendors
  • 28. 28© Cloudera, Inc. All rights reserved. Alternative Data Example: Best Predictor of Chipotle Sales On April 12, Jeff Glueck, the CEO of Foursquare, published a post predicting that Chipotle's first-quarter sales would be down nearly 30%. That was based on foot-traffic statistics built from explicit check-ins and implicit visits from Foursquare and Swarm app users who enable background location. The chart below shows the share of visits to Chipotle restaurants in comparison to visits to ALL restaurants in the United States. In the 2015–2016 winter, visits to Chipotle restaurants declined more significantly than in 2014–2015.
  • 29. 29© Cloudera, Inc. All rights reserved. Advanced Text Analytics: Asset Management Use Case Partners Big Data Enabled Investment Management Process: Goal: To provide new data sources and analytic tools to streamline and improve the portfolio investment process. Benefits: - Centralized, secure & scalable investment analysis data lake - Streamlined information management and analysis process - Flexibility for portfolio managers to utilize any tools and any data sources - New insights from sentiment/emotional analysis of sell-side analyst reports and news feeds - Expanded usage of ML-based analytics for macroeconomic, risk and market analysis. - Automated market, portfolio and event monitoring and alerts - Part of a wider digital transformation initiative
  • 30. 30© Cloudera, Inc. All rights reserved. Open data science in the enterprise IT drive adoption while maintaining compliance Data Scientist explore, experiment, iterate
  • 31. 31© Cloudera, Inc. All rights reserved. Our goal: An open platform for data science at scale Help more data scientists use the power of Hadoop Use a powerful, familiar environment with direct access to Hadoop data and compute Data Scientist Data Engineer Make it easy and secure to add new users, use cases Offer secure self-service analytics and a faster path to production on common, affordable infrastructure Enterprise Architect Hadoop Admin
  • 32. 32© Cloudera, Inc. All rights reserved. Introducing Cloudera Data Science Workbench Self-service data science for the enterprise Accelerates data science from development to production with: • Secure self-service environments for data scientists to work against Cloudera clusters • Support for Python, R, and Scala, plus project dependency isolation for multiple library versions • Workflow automation, version control, collaboration and sharing
  • 33. 33© Cloudera, Inc. All rights reserved. Polling Question #2 How fast do you need to ingest your data? 1. sec 2. ms 3. hours 4. min 5. don’t know
  • 34. 34© Cloudera, Inc. All rights reserved. Demo
  • 35. 35© Cloudera, Inc. All rights reserved. Q&A
  • 36. 36© Cloudera, Inc. All rights reserved. Thank you