SlideShare a Scribd company logo
1 of 13
THINK BREADTH,
NOT DEPTH
Seong Lee
O P E N
D A T A
S C I E N C E
C O N F E R E N C E_
BOSTON 2015
@opendatasci
THINK
BREADTH,
NOT DEPTH
HOW NEW DATA GENERATES NEW ALPHA
OUR AGENDA
• My Story With Quantitative Investing
• The Problem Of Small Fish Big Pond
• The Solution: Big Fish Small Pond
• How This Applies To Quantitative Finance
• How You Can Do This Too
• Conclusion
MY STORY
• I work with algorithms, clients, and ideas
• My job is to combine all three into one finished product
• Everyone has different ideas but wants the same thing
• Alpha but everyone uses the same data
• Wall Street Analyst Estimates – PEAD
• Build something that beats the market?
THE PROBLEM
• Too many people compete in the same arena
• Small fish in a big pond
• Finance: Wall Street
• Business: Samsung, Apple
THE SOLUTION
• Where no one is, is the best place for you to succeed
• Wall Street: Don’t go to MS, go to the boutique firm next
door
• Business: A smartphone for 3rd world toxin detection in
water systems
• AKA new data sources
REAL LIFE EXAMPLE
• EVERYONE uses wall street analyst earnings estimates
• Wall Street vs. Crowd (Estimize)
• Estimize crowd predictions are 67% more accurate
• Algorithm perform 2x better on Crowd vs. Wall Street
WALLSTREET
CROWDSOURCED
AVOIDING THE TWITTER LEAK
• April 28, 2015 Twitter earnings were leaked
• 20% drop in market cap
• $5 billion drop in market cap
• Accern (News Sentiment) analyzed +100,000 articles
• 1 day before leak, Accern shorted Twitter
• SunTrust downgraded Twitter -> Strong Negative
sentiment
AVOIDING THE TWITTER LEAK
HOW YOU CAN DO THIS
TOO
• Kimono Labs
• API for scraping websites
• Quandl
• Thousands and thousands of datasets
• Social Media
• CO Everywhere, Ground Signal
• Location based social media information
CONCLUSION
• Think breadth and diversity
• Email: slee@quantopian.com

More Related Content

Similar to Think Breadth, Not Depth

Social Media Simplified: The Benefits and Getting Started
Social Media Simplified: The Benefits and Getting StartedSocial Media Simplified: The Benefits and Getting Started
Social Media Simplified: The Benefits and Getting Started
Emily Miethner
 
Lean Startup Challenge Boston
Lean Startup Challenge BostonLean Startup Challenge Boston
Lean Startup Challenge Boston
Brant Cooper
 
Disruption, Collaboration & Love
Disruption, Collaboration & LoveDisruption, Collaboration & Love
Disruption, Collaboration & Love
Pure360
 

Similar to Think Breadth, Not Depth (20)

Social Media Simplified: The Benefits and Getting Started
Social Media Simplified: The Benefits and Getting StartedSocial Media Simplified: The Benefits and Getting Started
Social Media Simplified: The Benefits and Getting Started
 
Event Marketing Strategies - 8 Social Media Tactics to Ignite Ticket Sales
Event Marketing Strategies - 8 Social Media Tactics to Ignite Ticket SalesEvent Marketing Strategies - 8 Social Media Tactics to Ignite Ticket Sales
Event Marketing Strategies - 8 Social Media Tactics to Ignite Ticket Sales
 
Different types of startups, markets and whys
Different types of startups, markets and whysDifferent types of startups, markets and whys
Different types of startups, markets and whys
 
Lean Startup Challenge Boston
Lean Startup Challenge BostonLean Startup Challenge Boston
Lean Startup Challenge Boston
 
Sustaining open data innovations
Sustaining open data innovationsSustaining open data innovations
Sustaining open data innovations
 
Disruption, Collaboration & Love
Disruption, Collaboration & LoveDisruption, Collaboration & Love
Disruption, Collaboration & Love
 
Alternative Sources of Funding - Entrepreneurship 101
Alternative Sources of Funding - Entrepreneurship 101Alternative Sources of Funding - Entrepreneurship 101
Alternative Sources of Funding - Entrepreneurship 101
 
Ten Inventions that will Revolutionize Retail
Ten Inventions that will Revolutionize RetailTen Inventions that will Revolutionize Retail
Ten Inventions that will Revolutionize Retail
 
Alternative Business Models: open-source, crowd funding and tokenisation
Alternative Business Models: open-source, crowd funding and tokenisationAlternative Business Models: open-source, crowd funding and tokenisation
Alternative Business Models: open-source, crowd funding and tokenisation
 
Changes in Venture Capital & Building 500 Startups (Sao Paulo, Sept 2013)
Changes in Venture Capital & Building 500 Startups (Sao Paulo, Sept 2013)Changes in Venture Capital & Building 500 Startups (Sao Paulo, Sept 2013)
Changes in Venture Capital & Building 500 Startups (Sao Paulo, Sept 2013)
 
Good Idea, Bad Startup (UCLA ECON 106E)
Good Idea, Bad Startup (UCLA ECON 106E)Good Idea, Bad Startup (UCLA ECON 106E)
Good Idea, Bad Startup (UCLA ECON 106E)
 
Market Study workshop for Startup Weekend Liege
Market Study workshop for Startup Weekend LiegeMarket Study workshop for Startup Weekend Liege
Market Study workshop for Startup Weekend Liege
 
Tcm concept discovery stage introduction
Tcm concept discovery stage introductionTcm concept discovery stage introduction
Tcm concept discovery stage introduction
 
Product Market Fit
Product Market FitProduct Market Fit
Product Market Fit
 
Building 500 Startups: #500STRONG
Building 500 Startups: #500STRONGBuilding 500 Startups: #500STRONG
Building 500 Startups: #500STRONG
 
Ver 1.10 the venture capital ecosystem feb 2015
Ver 1.10   the venture capital ecosystem feb 2015Ver 1.10   the venture capital ecosystem feb 2015
Ver 1.10 the venture capital ecosystem feb 2015
 
Level Seven - Big data in interactive marketing
Level Seven - Big data in interactive marketingLevel Seven - Big data in interactive marketing
Level Seven - Big data in interactive marketing
 
Startup Next Seattle - Product Market Fit by Joanna Lord
Startup Next Seattle - Product Market Fit by Joanna LordStartup Next Seattle - Product Market Fit by Joanna Lord
Startup Next Seattle - Product Market Fit by Joanna Lord
 
From an idea to a Startup
From an idea to a StartupFrom an idea to a Startup
From an idea to a Startup
 
Investing in Tech Startups & Building Startup Ecosystems (PreMoney Miami, Mar...
Investing in Tech Startups & Building Startup Ecosystems (PreMoney Miami, Mar...Investing in Tech Startups & Building Startup Ecosystems (PreMoney Miami, Mar...
Investing in Tech Startups & Building Startup Ecosystems (PreMoney Miami, Mar...
 

More from odsc

Kaggle The Home of Data Science
Kaggle The Home of Data ScienceKaggle The Home of Data Science
Kaggle The Home of Data Science
odsc
 
The Art of Data Science
The Art of Data Science The Art of Data Science
The Art of Data Science
odsc
 
Frontiers of Open Data Science Research
Frontiers of Open Data Science ResearchFrontiers of Open Data Science Research
Frontiers of Open Data Science Research
odsc
 

More from odsc (20)

Understanding the Chief Data Officer
Understanding the Chief Data Officer Understanding the Chief Data Officer
Understanding the Chief Data Officer
 
Machine-In-The-Loop for Knowledge Discovery
Machine-In-The-Loop for Knowledge DiscoveryMachine-In-The-Loop for Knowledge Discovery
Machine-In-The-Loop for Knowledge Discovery
 
API Driven Development
API Driven Development API Driven Development
API Driven Development
 
Mobile technology Usage by Humanitarian Programs: A Metadata Analysis
Mobile technology Usage by Humanitarian Programs: A Metadata AnalysisMobile technology Usage by Humanitarian Programs: A Metadata Analysis
Mobile technology Usage by Humanitarian Programs: A Metadata Analysis
 
Productionizing Deep Learning From the Ground Up
Productionizing Deep Learning From the Ground UpProductionizing Deep Learning From the Ground Up
Productionizing Deep Learning From the Ground Up
 
Big Data Infrastructure: Introduction to Hadoop with MapReduce, Pig, and Hive
Big Data Infrastructure: Introduction to Hadoop with MapReduce, Pig, and HiveBig Data Infrastructure: Introduction to Hadoop with MapReduce, Pig, and Hive
Big Data Infrastructure: Introduction to Hadoop with MapReduce, Pig, and Hive
 
Data Science at Dow Jones: Monetizing Data, News and Information
Data Science at Dow Jones: Monetizing Data, News and InformationData Science at Dow Jones: Monetizing Data, News and Information
Data Science at Dow Jones: Monetizing Data, News and Information
 
Spark, Python and Parquet
Spark, Python and Parquet Spark, Python and Parquet
Spark, Python and Parquet
 
Building a Predictive Analytics Solution with Azure ML
Building a Predictive Analytics Solution with Azure MLBuilding a Predictive Analytics Solution with Azure ML
Building a Predictive Analytics Solution with Azure ML
 
Beyond Names
Beyond NamesBeyond Names
Beyond Names
 
How Woman are Conquering the S&P 500
How Woman are Conquering the S&P 500How Woman are Conquering the S&P 500
How Woman are Conquering the S&P 500
 
Domain Expertise and Unstructured Data
Domain Expertise and Unstructured DataDomain Expertise and Unstructured Data
Domain Expertise and Unstructured Data
 
Kaggle The Home of Data Science
Kaggle The Home of Data ScienceKaggle The Home of Data Science
Kaggle The Home of Data Science
 
Open Source Tools & Data Science Competitions
Open Source Tools & Data Science Competitions Open Source Tools & Data Science Competitions
Open Source Tools & Data Science Competitions
 
Machine Learning with scikit-learn
Machine Learning with scikit-learnMachine Learning with scikit-learn
Machine Learning with scikit-learn
 
Bridging the Gap Between Data and Insight using Open-Source Tools
Bridging the Gap Between Data and Insight using Open-Source ToolsBridging the Gap Between Data and Insight using Open-Source Tools
Bridging the Gap Between Data and Insight using Open-Source Tools
 
Top 10 Signs of the Textpocalypse
Top 10 Signs of the TextpocalypseTop 10 Signs of the Textpocalypse
Top 10 Signs of the Textpocalypse
 
The Art of Data Science
The Art of Data Science The Art of Data Science
The Art of Data Science
 
Frontiers of Open Data Science Research
Frontiers of Open Data Science ResearchFrontiers of Open Data Science Research
Frontiers of Open Data Science Research
 
Feature Engineering
Feature Engineering Feature Engineering
Feature Engineering
 

Recently uploaded

Recently uploaded (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 

Think Breadth, Not Depth

  • 1. THINK BREADTH, NOT DEPTH Seong Lee O P E N D A T A S C I E N C E C O N F E R E N C E_ BOSTON 2015 @opendatasci
  • 2. THINK BREADTH, NOT DEPTH HOW NEW DATA GENERATES NEW ALPHA
  • 3. OUR AGENDA • My Story With Quantitative Investing • The Problem Of Small Fish Big Pond • The Solution: Big Fish Small Pond • How This Applies To Quantitative Finance • How You Can Do This Too • Conclusion
  • 4. MY STORY • I work with algorithms, clients, and ideas • My job is to combine all three into one finished product • Everyone has different ideas but wants the same thing • Alpha but everyone uses the same data • Wall Street Analyst Estimates – PEAD • Build something that beats the market?
  • 5. THE PROBLEM • Too many people compete in the same arena • Small fish in a big pond • Finance: Wall Street • Business: Samsung, Apple
  • 6. THE SOLUTION • Where no one is, is the best place for you to succeed • Wall Street: Don’t go to MS, go to the boutique firm next door • Business: A smartphone for 3rd world toxin detection in water systems • AKA new data sources
  • 7. REAL LIFE EXAMPLE • EVERYONE uses wall street analyst earnings estimates • Wall Street vs. Crowd (Estimize) • Estimize crowd predictions are 67% more accurate • Algorithm perform 2x better on Crowd vs. Wall Street
  • 10. AVOIDING THE TWITTER LEAK • April 28, 2015 Twitter earnings were leaked • 20% drop in market cap • $5 billion drop in market cap • Accern (News Sentiment) analyzed +100,000 articles • 1 day before leak, Accern shorted Twitter • SunTrust downgraded Twitter -> Strong Negative sentiment
  • 12. HOW YOU CAN DO THIS TOO • Kimono Labs • API for scraping websites • Quandl • Thousands and thousands of datasets • Social Media • CO Everywhere, Ground Signal • Location based social media information
  • 13. CONCLUSION • Think breadth and diversity • Email: slee@quantopian.com

Editor's Notes

  1. And from working with all of these different clients, I’ve realized that everyone has different ideas. Most of these different ideas are generally small tweaks on existing methods The question is not a yes/no answer, but what will give me the highest probability of beating JP Morgan? Morgan Stanley? Renaissance Establish the PEAD first An extremely well documented phenomon that happens on the earnings announcement date So a companuy like apple will announe earnings about four times a year When I first started my job at Quantopian, Jess Stauth, our VP of Quant Strategy, came to me with the challenge of writing a PEAD algorithm
  2. Too many people are competing in the same arena Take Wall Street -> Boston College graduate competing against Harvard Graduate Take Business -> HTC or Nokia competing against AAPL and Samsung Finally, something that actually relevant -> let’s look at a bar If you’re a small business, you need to compete in a different arena
  3. The world of quantitative finance, there are two different ways to compete in this arena: mathematics, data Does it seem weird that we have the solution in different slides -> can we make them all congruent? E.g. put this slide towards the end where wall street, business, and life are afterwards?
  4. Explain WHY the crowd is unconventional Sticking point – Need to talk about the algorithm itself briefly? Looking at an earnings surprise, buy/sell and hold position for 5 days.
  5. This should be extremely specific for different types of life situations