4. * Original Question “There is a scarcity for Data
Scientists. Isn‟t Data Science a Fad which will
go away like other Fad‟s?”
* The Correct Question - “Our company is too
small to exhaust the Data Scientist supply and
affect the Price – Demand – Supply equilibrium.
Data Scientists come at a premium and we are
offering commodity wages. Lets understand
what's Data Science and how we can utilize it.”
*
6. * Big Data is mostly about Engineering
* Data Science is the science behind leveraging
data
*
7. * Wikipedia - Machine Learning is used to
automatically learn from data i.e Identify/quantify
complex relationships thought to be features of the
underlying mechanism that generated the data, and
employ these identified patterns to make
predictions based on new data. – This is most
common in companies like Google, Amazon, Yahoo,
Linkedin etc. And extends logically to Big Data.
* Wikipedia – Statistics is the study of the collection,
organization, analysis, interpretation, and
presentation of data. Its about Hypothesis Testing
and may or maynot need large datasets. The
approach companies like GE, Toyota etc. take.
*
8. * Today every Data Center sells its services by
calling itself a Cloud (WTH!!!)
* 10,000 people DW/BI/Java-Developer Divisions
and basically everyone else on the planet now
call themselves „Data Scientists‟ (WTH!!!)
* Millions of Java/Python/SQL „Application
Developers‟ call themselves Big Data Engineers.
(WTH!!!) Do you understand the difference
between an „Application Developer‟ vs an
„Engineer‟? Or, do you?
*
9. * Data Science is something new. Reality – Data Science is as
old as Statistics and Maths and Algorithms in general.
* Data Science is a Fad that will fade away. Reality – All
avenues that offer Competitive Leverage have been
explored… Data Science is what does and will differentiate
the best from the others.
* Data Scientists are Geeks who know nothing about the
Business. Reality – Most Top Data Scientists „Run Top
Businesses‟ and „Drive Strategy‟
* Data Science is Magic. Reality – It‟s a Science based on Maths
and other converged disciplines. There is no magic and the
most effective methods can be conceptually very simple to
understand too.
*
12. Education Level
50%
44%
45%
40% 38%
35%
30%
25%
20% Education Level
15% 13%
10% 6%
5% 0%
0%
School Bachelors Masters Ph.D. Post
Doctoral
* A field today is extremely dominated by
extremely educated individuals.
*
13. Roles & Responsibilities
20% 18%
15%
15% 12% 12% 12%
7% 7% 9% 7%
10%
5% 1%
0% Roles & Responsibilities
* In highest probability most Data Scientists are
doing Big Data or Machine Learning.
*
14. Age
8
7
6
5
4
Age
3
2
1
0
18-29 30-36 37-44 45+
* 94% of Top Data Scientists are Male
* Most of them are in Late 30‟s or older [Did you
say Data Science was new? ;-) ]
*
15. * Statistics
* Machine Learning
* Big Data
* Data Mining
* Operations Research & Decision Sciences
* Distributed Algorithms
* Clustered Systems Engineering
* Artificial Intelligence
*
16. * Descriptive Data Science – The Vision
* Inferential Data Science – The Intelligence
*
17. * Impacts Business
* Impacts Government
* Impacts Society
Wait a minute. Those are all the 3 Facets of
Strategy. ;-)
*