The impact of data science on business is undeniable, and the value it provides is growing without signs of slowing. To keep up with this rapidly evolving technology landscape, data scientists must adapt and specialize through continuous learning. This talk focuses on how they can do that in a way that maximizes the positive impact data science will have on their organization.
Big Data Day LA 2016/ Data Science Track - Intuit's Payments Risk Platform, D...
Semelhante a Big Data Day LA 2016/ Data Science Track - The Evolving Data Science Landscape, Kyle Polich - Principal Consulting Engineer, Datascience Inc
How to expand the Galaxy from genes to Earth in six simple steps (and live sm...Raffaele Montella
Semelhante a Big Data Day LA 2016/ Data Science Track - The Evolving Data Science Landscape, Kyle Polich - Principal Consulting Engineer, Datascience Inc (20)
2. LIGO
One of the most
advanced metrology
projects; one of the
more precise
instruments ever
created
2
Measures changes
1 / 10,000th the
width of a proton
4km interferometer
to measure
gravitational
fluctuations from
cosmic explosions
3. LIGO
According to Scientific American, cost $1.1 billion over last 40 years
3
Turned on in 2002Construction took
8 years
Managed by ~1k
scientists
Gravity waves
detected 2016
4. Value Delivery
“Despite the hype of big data, a majority of the business value
produced by data still happens in this more traditional setting, and
we would like to support these communities.”
- Szilard Pafka (Dec, 2014) announcing new
- DW/BI/Analytics Meetup
4
5. Bias, Variance, Heterogeneity
“Up until late last year, tracking would be done unpredictably after
almost every release.”
“We changed the way we capture that last April and again this
January.”
“We have four divisions that all do their analytics differently.”
5
7. Excitement hierarchy
7
Report generation
ML on 10k observations, 20 features
ML on 1 billion observations, 1500 features
ML on 1 million observations, 100 features
1000 node clustered computing
A/B testing
High performance computing
Econometric modeling for adtech
Deep
learning
SQL queriesLogistic regression
Off the shelf OpenCV implementation
Online multi-armed bandit
Online streaming algorithms
Commercial opportunities for quantum computing
9. Measure of effectiveness
Return on Investment (ROI)
Revenue savings from automation
Lift
Impact Factor*
Causal Impact
Value of information
9
11. Value of Information
11
Expected Revenue
if information know=Value
(information)
Expected revenue if
Information NOT
know- - Cost of
Information
12. Iteration and precision
Early objectives
• Maximize conversion rate
• Send / don’t send offer
• Raise / lower budget
• Predict number of machine failures
• Find available service provider
Late objectives
• Maximize lifetime value
• Personalized offer
• Real time bid optimization
• Optimize factory environmental
controls
• Global service pairing optimization
12
19. DataForward Event Series
DataForward is a gathering of professionals across industries who are passionate about data
science, big data technologies, and data driven businesses. The group meets once a month at
keynote events featuring talks and presentations by industry leaders. The DataForward events
are hosted and organized by DataScience Inc, and livestreamed to audiences all over the
world.
The monthly events are dedicated to key topics facing data-driven organizations- disruptive
technologies, data-driven culture, investment trends, and insights into how existing
organizations can unlock the value from their data. To signup for our first keynote event in
August, please visit meetup.com/DataForward.
19
The impact of data science on business is undeniable, and the value it provides is growing without signs of slowing. To keep up with this rapidly evolving methodology and technology landscape, data scientists must adapt and specialize through continuous learning. This talk focuses on how they can do that in a way that maximizes the positive impact data science will have on their organization.
This is equivalent to measuring the distance to the nearest star to an accuracy smaller than the width of a human hair!
The truest big data problem
DataScience takes on ambitious problems, but not this ambitious of costly
LIGO – scientific value
Business impact and value
In the real world, it’s a metrologist’s worst nightmare
80% of time cleaning -> business understanding
How many of the top problems does a data scientist get to solve in their career?
YM mistake
Impact factor – used for citations; choice differs from previous solution
Lemonade stand
Reference LIGO again
Recommender engine vs. social network integration
I’m looking forward to the day when my title is no longer data scientist
How many new techniques since you got your degree?
I’ll accept PR on this list, but it’s going to end up looking like next slide
A need for specialization; students Astar
A focus on business impact; problem and person match (soft pitch here)
A need for community and continuous learning
“Like TED, but for data professionals”
DataForward is a gathering of professionals across industries who are passionate about data science, big data technologies, and data driven businesses. The group meets once a month at keynote events featuring talks and presentations by industry leaders. The DataForward events are hosted and organized by DataScience Inc, and livestreamed to audiences all over the world.
The monthly events are dedicated to key topics facing data-driven organizations- disruptive technologies, data-driven culture, investment trends, and insights into how existing organizations can unlock the value from their data. To signup for our first keynote event in August, please visit meetup.com/DataForward.