SlideShare uma empresa Scribd logo
1 de 24
Baixar para ler offline
$
€ £
Salary & Skills Report
Learn More To Earn More
$
Data Science & BI
www.packtpub.com/skillup
2
Data Science & Bi Salary & Skills Report
‘What you need to know to earn more in
Data Science and Business Intelligence’
The most comprehensive global IT skills and salary survey ever.
3
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
Contents
What is Skill Up?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
Respondents. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Which Industries are best for Data jobs – and which roles are most valuable?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
Industry Breakdown. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
Career Development and Data-Oriented Roles. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Technology Usage Analysis. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
What exactly are people using?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
What else are data Pythonistas using on a daily basis?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
Tech Stacks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1. Data Visualizers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2. Programmatic Data Wranglers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3. Big Data Experts. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
4. Data Architects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
What comes next?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
What Trends Are Emerging?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
Hot Topics. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
Do you think Julia will replace R and Python as the data science language of choice in the next 12 months?. . . . . . . . . . . . . . . . 20
Do you think Apache Spark is likely to Replace Hadoop in the next 12 months?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Is the line between data analysis and data retrieval being blurred?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
Is your company planning to implement a big data project over the next 12 months?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
And finally.... . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
Does Excel still hold a place in your heart?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
Summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
4
Data Science & Bi Salary & Skills Report
The world of Data Science is rapidly growing,
with data becoming increasingly vital to a huge
range of organizations. Undoubtedly tied to
the much-discussed rise of ‘Big Data’ over the
past decade, data-oriented roles are today
some of the most prominent technical roles in
the economy.
This report, focusing on the data science
respondents to Packt’s Skill Up survey explores
where data science is most valuable, what
tools are being used, and what the trends and
challenges will be in the future.
ll Data science is immensely valuable to
SMEs evidenced by their investment in
young talent.
ll Finance is still a lucrative sector for
data science.
ll R and Python are still neck and neck as
the key data science languages.
ll Distributed Computing and machine
learning are still on the ascendancy.
ll IOT is one of the hottest trends for
data scientists that promises to bring
new challenges and opportunities.
ll Excel will never die!
The need to answer these questions led us to
look at the community as a whole, and so we
decided to launch our Skill Up campaign.
What is Skill Up?
With our Skill Up survey we wanted to look
at the tech community as a whole to identify
upcoming trends over the next few years and
share what you can do to ensure you get the
most out of your career and skills. We divided
our survey into 4 segments, Web Development
& Design, Application Development, Security
& System Administration, and Data Science &
Business Intelligence, making this one of the
most comprehensive surveys in recent years.
Specifically we asked:
ll What skills lead to a higher salary?
ll What skills/technologies are most
highly valued by different industries?
ll What cutting edge technologies are
really worth you spending your
time learning?
To get a better idea of the community’s
thoughts we asked you all to fill in our survey,
the results of which you can find compiled
here in this report, giving you the facts, the
figures, and more importantly – the knowledge
and skills you need to make the best career
decisions.
Let’s look at the results in more detail.
5
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
Respondents
Respondents by Country
United States
United
Kingdom
GermanyCanada
India
Australia
Italy
Spain
Brazil
France
Poland
Netherlands
Mexico
Sweden
Belgium
Switzerland
Russia
Denmark
Japan
Ireland
South Africa
Finland
Czech Republic
Portugal
Singapore
Norway
Hungary
New Zealand
Romania
Colombia
Argentina
Austria
Thailand
Greece
Malaysia
Indonesia
Israel
Croatia
6
Data Science & Bi Salary & Skills Report
Job Sector
The Data Science stream, from which the data in this report is drawn, received over 3,800 responses
from individuals with a wide range of experience levels, working in a diverse set of industries.
Experience Level
More than
20 years
Less than 1 year
5-10 years
3-5 years
10-20 years
1-3 years
0
500
1000
1500
2000
2500
3000
3500
4000
Telecom
m
unications
Health/Biotech/Science
M
edia/Advertising
/Entertainm
ent&
Gam
ing
Governm
ent
Finance/Banking
Education/Academ
ia/Research
Consulting
W
eb
Services/Internet
Softw
are
Products
7
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
$0
$20,000
$40,000
$60,000
$80,000
$100,000
More than 20 yearsLess than 1 year
Self employedStart-upEnterpriseSME
There’s a consensus that data-oriented
roles are some of the most valuable around,
especially in industries where data is so vital
(there are very few industries where you
could say it isn’t). But there are trends that
indicate where it’s having the biggest impact,
and what’s going to become more and more
important for anyone working in data.
ll SMEs pay inexperienced people the
best.
ll In terms of industries, Finance still
offers the best pay for inexperienced
employees.
ll Data Architect is one of the most
important roles in dynamic and fast-
paced industries such as Media and
Entertainment. This proves that the
ability to build and implement business
critical solutions is essential.
Our research shows that SMEs are a great place
for inexperienced data scientists and analysts
tobegintheircareers,offeringahigherstarting
salary than Enterprise organizations. Even
those working for start-ups earn only slightly
less than Enterprise. With the opportunities
available in small, rapid growth organizations,
Start-ups are a great option for anyone
ambitious and eager to prove themselves!
Which Industries are best for Data jobs – and which roles are
most valuable?
Salary by Company Type and Experience
8
Data Science & Bi Salary & Skills Report
Industry Breakdown
Our research also investigated how data science is faring in each sector. The responses provided a
good indication of where data science is most critical.
The largest number of respondents with
more than 20 years experience are working in
consulting. Moreover, the commonality of this
suggests that external expertise is something
that is very much in demand.
Those respondents with least experience are
predominantly working in Education/Academia
and Research. However, as the graph overleaf
shows, it does not appear to be one of the best
paid industries.
0% 20% 40% 60% 80% 100%
More than
20 years
10-20 years
5-10 years
3-5 years
1-3 years
Less than
1 year
Web Services
/Internet
Telecoms
Software
Products
Media/Advertising
/Entertainment
and Gaming
Health
/Biotech
/Science
Government
Finance/Banking
Education
/Academia
/Research
Consulting
9
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
Well paid Industries for Less Experienced people
Finance and Banking comes out as the best
paid industry according to our data. With
the rise of algorithmic trading, and Big Data
in general playing a large part in just about
every component of Finance, there are huge
opportunities for data fluent people to
command high salaries, even without a great
deal of experience.
Want to get to grips with quantitative
finance? Pick up this bundle today.
ll Mastering R for Quantitative
Finance
ll Introduction to R for Quantitative
Finance
ll Python for Finance
ll Mastering Python for Finance
ll Advanced Quantitative Finance
with C++
Media / Advertising / Entertainment and
Gaming, while certainly not as lucrative as
the Financial sector, appears to be a sector
offering a substantial salary to inexperienced
people. If we consider the fact that this
sector, taken generally, is very competitive
for inexperienced people and known for low
salaries, the data provides a clear indication
that these industries are willing to invest
in inexperienced people with technical and
numerical skills.
This is an indication that these industries are
relying on data-driven strategies to remain
competitive in tough areas of the economy.
While they may lack the funds and cashflow
to invest in established data professionals,
inexperienced people with the right skills
might well fill this role perfectly, without
commanding such high salaries.
$0 $1,0000 $20,000 $30,000 $40,000 $50,000 $60,000 $70,000 $80,000
Telecommunications
Software Products
Web Services/Internet
Education/Academia/Research
Health/Biotech/Science
Government
Media/Advertising
/Entertainment and Gaming
Consulting
Finance/Banking
10
Data Science & Bi Salary & Skills Report
Career Development and Data-Oriented Roles
We’ve seen indicators of what’s best for inexperienced data professionals, but what about
career development?
We also wanted to see how the different job types fare in different industries. The graph below
shows which specific roles command the highest salaries, and in which industry these roles are
most valuable.
ll Data Architect is one of the most
important roles within Media/
Advertising/Entertainment and
Gaming, commanding a higher salary in
other areas.
ll Comparing the roles of Data Architect
to Statistician, we can infer some key
differences about how data science
and data-driven strategies are playing
out between industries. In Finance, the
statistician earns slightly more than
the data architect whilst in the Media
category, the Data Architect earns
significantly more.
ll In Media and Entertainment, where
agility and organizational change
is essential for rapid responses
to change, there is a high value
placed on someone who is able to
develop a solution, such as a Data
Architect. Whereas, in Finance these
architectures are already in place.
Average Salary by Industry & Job Type
$0 $20,000 $40,000 $60,000 $80,000 $100,000 $120,000
Statistician
Data Scientist/Analyst
Business Intelligence Expert
(includes Devs, analysts)
Data Architect
Assistant Director
Business Intelligence Expert
(includes Devs, analysts)
Data Scientist/Analyst
Data Architect
Statistician
Intermediate IT
/Regulation/Business
Data Scientist/Analyst
Business Intelligence Expert
(includes Devs, analysts)
Statistician
Data Architect
Media/Advertising
/Entertainment
andGaming
Finance/BankinGCONSULTING
11
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
Technology Usage Analysis
What exactly are people using?
ll More than 25% of respondents use
Python on a daily basis, but almost the
same number use R.
ll Distributed computing and machine
learning tools are becoming more and
more important.
ll Augmented Reality and the Internet of
Things are poised to change how we
think about data.
The last two points spell out the next 5
years in data. The volume of data available
from all sources will only continue to grow
exponentially. The technology to deal with
that data is still being developed, so don’t let
yourself get left behind!
We asked what tools people use on a daily basis, and here’s what they said:
Python is definitely top dog when it comes to data. Its versatility and wealth of easy to use third-
party libraries for everything from machine learning to web scraping, combined with its low barrier
to entry, make it the ideal choice.
0% 5% 10% 15% 20% 25% 30%
gis
c++
mysql
html
c#
php
oracle
excel
linux
javascript
java
r
sql
python
12
Data Science & Bi Salary & Skills Report
What else are data Pythonistas using on a daily basis?
A large number of respondents are using
Python and R, Python and Java, or Python
and C++.
Get to grips with both Python and R to
broaden your fluency and become a more
flexible data scientist:
ll Practical Data Science Cookbook
ll R Data Analysis Cookbook
ll Python Data Analysis
ll R for Data Science
ll Python Data Science Essentials
0% 5% 10% 15% 20% 25% 30%
bash
html
mysql
gis
c
data analysis
spark
hadoop
machine learning
c++
linux
javascript
java
sql
r
13
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
Tech Stacks
To get a better idea of what people are commonly using together, let’s do some clustering. We put
all the responses into a graph database, and ran a clustering algorithm across the techs people use
every day, and from this we’ve identified some coherent clusters of people based on their stack:
These are people working at the design end of
the data science spectrum. As you can see from
the cluster of tools below, they have as much
in common with a front end web developer
as any of our groups. JavaScript comes top
here – unsurprising when you consider how
much it dominates web development today
– and other design tools and plugins such as
CSS, HTML5 and jQuery serve to underline its
dominance.
The very presence of this group of tools
in our data emphasises the importance of
communicating insight via the web, and
highlights just how important design is when
trying to understand and interpret data.
1. Data Visualizers
1. javascript
2. html
3. css
4. php
5. html5
6. jquery
7. java
8. xml
14
Data Science & Bi Salary & Skills Report
This group of tools are used by those people
that squeeze insight out of data. They are
primarily responsible for mining, cleaning and
manipulating data very quickly, in order to
answer specific questions about everything
from customer behaviour to financial planning.
Python here comes out as the most important
tool – perhaps unsurprising given how easy it is
to prototype and its much-vaunted flexibility.
The presence of pandas serves to underline
Python’s dominance – indeed, it might
consolidate it precisely because of the way in
which pandas improve Python’s data analysis
capabilities.
But it’s also interesting to see C++ here –
while Python offers flexibility and ease,
the impressive speed that C++ offers is
still unrivalled.
2. Programmatic Data Wranglers
1. python
2. c++
3. linux
4. bash
5. pandas
6. matlab
7. machine learning
8. postgresql
15
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
This cluster of tools is used by Big Data
specialists, interested in scalability and
robustness.
Hadoop here dominates the Big Data
world – however, Scala and Spark are also
growing in prominence, as their presence in
this cluster indicates. As demand grows for
faster processing (and by extension real-time
analytics), it’s likely that we’ll see more from
them.
It’s also interesting to see web tools such as
JavaScript and Spring included here. As with
the first cluster, this indicates the need to
communicate effectively and quickly through
web based applications.
3. Big Data Experts
1. java
2. hadoop
3. mysql
4. javascript
5. scala
6. spring
7. spark
8. maven
16
Data Science & Bi Salary & Skills Report
This provides a useful insight into how the
world of data breaks down, and how different
roles appear to be built around ‘ecosystems’
of tools. One of the interesting questions that
comes out of this is how this might change
over the coming years. Is it possible that these
clusters will become more fluid?
Thisgroupoftoolsreflectstheneedtoorganize
and communicate data insights in effective
and intelligent ways – the main challenge of a
Data Architect. Clearly, Enterprise-Ready tools
dominate this cluster, indicating that Microsoft
and Oracle are still regarded as go-to brands
when it comes to these business-critical tools.
4. Data Architects
1. sql
2. ms
3. oracle
4. server
5. ssis
6. ssas
7. database
8. ssrs
17
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
There are a few key points our research
suggests here:
ll Spark and Hadoop are well
represented, suggesting the growth of
cluster computing.
ll Web Based technologies are also
appearing here indicating people are
seeing a value in sharing data analysis
on the web.
ll NoSQL databases are going to keep
rising in the data world.
What comes next?
We asked people what tools they were
planning on learning over the next 6 months.
We wanted to know what’s hot, and what
people in the know (those earning the money)
are using, to help you decide in which area
you’d like to enhance your skills, or learn
new ones.
In the tag cloud below word frequencies
are weighted by salary. So what are people
looking to learn?
What Trends Are Emerging?
We asked respondents what they think is the most important trend emerging in their field in the
next 12 months:
18
Data Science & Bi Salary & Skills Report
There are some clear messages here:
ll Machine learning is going to become
one of the focal points for everyone
working in data, driven by a demand
for predictive insights and statistical
analysis in a range of sectors and
industries.
ll Augmented Reality and Internet of
Things are going to be key challenges
for data scientists over the next few
years.
ll The prominence of mobile in the tag
cloud suggests an increased emphasis
on mobile analytics as users move
further away from desktop.
ll Distributed Computing (both on
clusters and on the Cloud) is going to
change the way we even think about
data, suggesting increasing anxiety
about how to manage resources for
Big Data projects. This is possibly
symptomatic of the two outcomes
above. As IoT and mobile become more
dominant, managing larger datasets is
going to become a greater challenge.
If you’re involved with a Big Data project,
you’re going to need Machine Learning. Pick
up this bundle and start exploring machine
learning and predictive analytics today.
ll Machine Learning with Spark
ll Scala for Machine Learning
ll Machine Learning with R
ll Machine Learning with R Cookbook
ll Building Machine Learning Systems
with Python - Second Edition
19
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
Hot Topics
According to the highest earning respondents,
Julia is on the ascent. It’s easy to see why as it is
designed specifically for technical computing,
and boasting interesting features such as
multiple dispatch, useful libraries for graphing,
and impressive JIT compiler benchmarks. Julia
is one to watch!
Stay ahead of the trend and start learning
Julia with this essential selection of
Julia books.
ll Getting started with Julia
Programming Language
ll Mastering Julia
ll Getting Started with LLVM Core
Libraries
ll Python High Performance
Programming
ll R High Performance Programming
To complete our survey, we asked respondents some simple questions about hot topics, trends and
challenges in the data.
DoyouthinkJuliawillreplaceRandPythonasthedatasciencelanguage
of choice in the next 12 months?
$81,000
$82,000
$84,000
$83,000
$85,000
$86,000
YesNo
20
Data Science & Bi Salary & Skills Report
Our survey says you’re safe for now!
However, if you’re not already on the Hadoop train, there’s never been a better time to get
on it…
ll Learning Hadoop 2
ll Mastering Hadoop
ll Big Data Analytics with R and Hadoop
ll Fast Data Processing with Spark - Second Edition
ll Apache Mesos Essentials
Do you think Apache Spark is likely to Replace Hadoop in the next
12 months?
$81,000
$82,000
$84,000
$83,000
$85,000
$86,000
YesNo
21
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
Is the line between data analysis and data retrieval being blurred?
Across the board, this was contentious.
ll 64% agree that it is.
This suggests tools such as BigQuery could
become more prominent. It’s certainly a tool
to watch over the next 12 months.
As Big Data becomes ubiquitous, the aim of
the game is no longer to simply have the most
effective strategy, but also the most efficient
and fast.
Learn how to master the art of data analysis
and retrieval with this bundle of popular
books:
ll Clean Data
ll Practical Data Analysis	
ll Mastering Predictive Analytics with R
ll Learning Data Mining with R
ll Learning Pandas
0% 20% 40% 60% 80% 100%
YesNo
YesNo
YesNo
YesNo
YesNo
More than 20 years
10-20 years
5-10 years
3-5 years
1-3 years
22
Data Science & Bi Salary & Skills Report
Excel is eternal, Excel 2013 is awesome…all of you know it deep down!
And finally...
Does Excel still hold a place in your heart?
Is your company planning to implement a big data project over the next
12 months?
It appears that most respondents are working
in organizations looking to implement Big Data
projects. The significant anomaly to that are
those with very little experience. The reasons
for this aren’t immediately obvious, but there
are a number of possible explanations.
Itcouldbethatthosewithverylittleexperience
simply aren’t privy to organizational strategy
and decision-making. Conversely, it could be
that those people who are just starting out
have been employed precisely because a Big
Data project has been implemented.
0% 20% 40% 60% 80% 100%
More than 20 years
10-20 years
5-10 years
3-5 years
1-3 years
Less than 1 year YesNo
YesNo
YesNo
YesNo
YesNo
YesNo
Yes
No
23
Data Science & Bi Salary & Skills ReportLearn More To Earn More
$
Summary
It may be a truism, but it’s clear, and perhaps
it has been for years, that Data Science and
Big Data are not simply trends, but are instead
symptomatic of a wider social, cultural and
economic change.
It’s time we stopped talking about the ‘Big Data
revolution’ or how ‘data scientist’ is the ‘sexiest
job of the twenty-first century’, and instead
look at the different ways data is being used
in different areas. For SMEs, data is crucial for
making companies more responsive and open
to changes in the market. The prominence
of machine learning underlines this further,
making it clear that there is a real onus on
delivering rapid insight and fast! For larger
organizations there is a drive towards creating
faster Big Data solutions. The apparent rise of
distributed and cluster computing is evidence
of this, as data scientists and analysts look for
new ways to put tools such as Hadoop and
Spark to work.
But even more interesting is how our
understanding of data looks set to change,
thanks to emerging trends such as the Internet
of Things and Augmented Reality. It’s possible
that IOT will become the buzzword to replace
Big Data. How Data Scientists, analysts and
architects tackle it day to day isn’t clear yet, but
it will almost certainly be a challenge that will
offer exciting opportunities for data literate
people everywhere.
What you should be doing if you’re a data
scientist:
ll Broadening the range of languages
you know is essential. It will help you
become more flexible when working
on a range of different projects and
also provides you with more solutions.
If you know R, why not learn Python?
ll You need to get to grips with Machine
Learning. If you want to get started or
investigate it further, grab our Machine
Learning bundle!
ll If you’re just starting your career, you
could do a lot worse than working
in Finance or for an SME. You might
command a higher salary working for
an established enterprise organization,
but the difference is likely to be
small with more opportunities and
responsibility at an SME.
ll If you’re interested in working in
popular industries such as Media, work
towards becoming a data architect,
and learn how to develop and
implement large-scale data solutions
that can deliver benefits across an
organization.
ll Getting to grips with Big Data tools
such as Hadoop and Spark will be
valuable, but learning how to use
them in the context of distributed
networks will be even more valuable as
resources become stretched.
ll Pay attention to IoT – we still don’t
quite know where it will lead the data
world!
Helping IT professionals to put
software to work in new ways
www.packtpub.com Tel: +44 (0)121 265 6484 Published 2015
Data Analyst
Greg Roberts
Project Manager
Sarah Cullington
Editor
Richard Gall
Technical Advisor
Akram Hussain
Design
Chris Murray
Founded in 2004 in Birmingham, UK, Packt’s
mission is to help the world put software to
work in new ways, through the delivery of
effective learning and information services to
IT professionals.
Workingtowardsthatvision,wehavepublished
over 3000 books and videos so far, providing IT
professionals with the actionable knowledge
they need to get the job done –whether that’s
specific learning on an emerging technology or
optimizing key skills in more established tools.
As part of our mission, we have also awarded
over $1,000,000 through our Open Source
Project Royalty scheme, helping numerous
projects become household names along
the way.

Mais conteúdo relacionado

Mais procurados

Hcad mass appraisal report 2016
Hcad mass appraisal report 2016Hcad mass appraisal report 2016
Hcad mass appraisal report 2016cutmytaxes
 
Platform Monitoring and Alert
Platform Monitoring and AlertPlatform Monitoring and Alert
Platform Monitoring and AlertBraja Krishna Das
 
Planning a Microsoft Virtual Server infrastructure with HP ...
Planning a Microsoft Virtual Server infrastructure with HP ...Planning a Microsoft Virtual Server infrastructure with HP ...
Planning a Microsoft Virtual Server infrastructure with HP ...webhostingguy
 
Georgia Annual state IT report 2016
Georgia Annual state IT report 2016Georgia Annual state IT report 2016
Georgia Annual state IT report 2016State of Georgia
 
It policy-2011-english
It policy-2011-englishIt policy-2011-english
It policy-2011-englishPiyush Gaur
 
HPE Information Governance
HPE Information GovernanceHPE Information Governance
HPE Information GovernanceAndrey Karpov
 

Mais procurados (6)

Hcad mass appraisal report 2016
Hcad mass appraisal report 2016Hcad mass appraisal report 2016
Hcad mass appraisal report 2016
 
Platform Monitoring and Alert
Platform Monitoring and AlertPlatform Monitoring and Alert
Platform Monitoring and Alert
 
Planning a Microsoft Virtual Server infrastructure with HP ...
Planning a Microsoft Virtual Server infrastructure with HP ...Planning a Microsoft Virtual Server infrastructure with HP ...
Planning a Microsoft Virtual Server infrastructure with HP ...
 
Georgia Annual state IT report 2016
Georgia Annual state IT report 2016Georgia Annual state IT report 2016
Georgia Annual state IT report 2016
 
It policy-2011-english
It policy-2011-englishIt policy-2011-english
It policy-2011-english
 
HPE Information Governance
HPE Information GovernanceHPE Information Governance
HPE Information Governance
 

Destaque

Destaque (19)

resume updatefall2016option
resume updatefall2016optionresume updatefall2016option
resume updatefall2016option
 
heli dhillon Resume
heli dhillon Resumeheli dhillon Resume
heli dhillon Resume
 
Wellness Weekend Brochure
Wellness Weekend BrochureWellness Weekend Brochure
Wellness Weekend Brochure
 
CV Zaskia S.
CV Zaskia S.CV Zaskia S.
CV Zaskia S.
 
Fizz fuzz and the tiger
Fizz fuzz and the tigerFizz fuzz and the tiger
Fizz fuzz and the tiger
 
Door knocking guidelines
Door knocking guidelinesDoor knocking guidelines
Door knocking guidelines
 
Bridal Store Knysna area
Bridal Store Knysna areaBridal Store Knysna area
Bridal Store Knysna area
 
Jessica El-rami
Jessica El-ramiJessica El-rami
Jessica El-rami
 
VET FEE-HELP Presentation
VET FEE-HELP PresentationVET FEE-HELP Presentation
VET FEE-HELP Presentation
 
Hogeschool v amsterdam dr. jesse weltevreden dr. tibert verhagen
Hogeschool v amsterdam dr. jesse weltevreden  dr. tibert verhagenHogeschool v amsterdam dr. jesse weltevreden  dr. tibert verhagen
Hogeschool v amsterdam dr. jesse weltevreden dr. tibert verhagen
 
CIM DIploma001
CIM DIploma001CIM DIploma001
CIM DIploma001
 
Ingram maarten van der tas
Ingram maarten van der tasIngram maarten van der tas
Ingram maarten van der tas
 
UZ DEGREE
UZ DEGREEUZ DEGREE
UZ DEGREE
 
CIM Professional Certificate in Marketing
CIM Professional Certificate in MarketingCIM Professional Certificate in Marketing
CIM Professional Certificate in Marketing
 
Schools in the past
Schools in the pastSchools in the past
Schools in the past
 
Poriferos
PoriferosPoriferos
Poriferos
 
Chem 2 - Free Energy and the Equilbrium Constant K VIII
Chem 2 - Free Energy and the Equilbrium Constant K VIIIChem 2 - Free Energy and the Equilbrium Constant K VIII
Chem 2 - Free Energy and the Equilbrium Constant K VIII
 
FBrown_Resume_3
FBrown_Resume_3FBrown_Resume_3
FBrown_Resume_3
 
Logo fail
Logo failLogo fail
Logo fail
 

Semelhante a Data Science & BI Salary & Skills Report

Big Data, Little Data, and Everything in Between
Big Data, Little Data, and Everything in BetweenBig Data, Little Data, and Everything in Between
Big Data, Little Data, and Everything in Betweenxband
 
Benefits of Modern Cloud Data Lake Platform Qubole GCP - Whitepaper
Benefits of Modern Cloud Data Lake Platform Qubole GCP - WhitepaperBenefits of Modern Cloud Data Lake Platform Qubole GCP - Whitepaper
Benefits of Modern Cloud Data Lake Platform Qubole GCP - WhitepaperVasu S
 
Unstructured Data and the Enterprise
Unstructured Data and the EnterpriseUnstructured Data and the Enterprise
Unstructured Data and the EnterpriseDATAVERSITY
 
R@D 4 - Digital Activism Survey Report 2009
R@D 4 - Digital Activism Survey Report 2009R@D 4 - Digital Activism Survey Report 2009
R@D 4 - Digital Activism Survey Report 2009DigiActive
 
LafargeHolcim good application tips
LafargeHolcim good application tipsLafargeHolcim good application tips
LafargeHolcim good application tipsClaudia Balan
 
2012 Grantmakers Information Technology Survey Report
2012 Grantmakers Information Technology Survey Report2012 Grantmakers Information Technology Survey Report
2012 Grantmakers Information Technology Survey ReportJSA Consultants (Jill M S)
 
Insights from the VMware 2013 Journey to IT as a Service Survey
Insights from the VMware 2013 Journey to IT as a Service SurveyInsights from the VMware 2013 Journey to IT as a Service Survey
Insights from the VMware 2013 Journey to IT as a Service SurveyVMware
 
Concorde_TechBooklet_6.1.16
Concorde_TechBooklet_6.1.16Concorde_TechBooklet_6.1.16
Concorde_TechBooklet_6.1.16Kelly Knight
 
The Role of Analytics In Defining The Art Of The Possible
The Role of Analytics In Defining The Art Of The PossibleThe Role of Analytics In Defining The Art Of The Possible
The Role of Analytics In Defining The Art Of The PossibleLora Cecere
 
Manufacturing-Tech-Survey-Results-US50602523-IB.pdf
Manufacturing-Tech-Survey-Results-US50602523-IB.pdfManufacturing-Tech-Survey-Results-US50602523-IB.pdf
Manufacturing-Tech-Survey-Results-US50602523-IB.pdfAhmedMahamedMustafaG
 
ia revoluciona el mundo gracias a copilot
ia revoluciona el mundo gracias a copilotia revoluciona el mundo gracias a copilot
ia revoluciona el mundo gracias a copilotCade Soluciones
 
01 content analytics-iw2015
01 content analytics-iw201501 content analytics-iw2015
01 content analytics-iw2015Kaizenlogcom
 
Digital Asset Management Whitepaper by KeyFruit Inc.
Digital Asset Management Whitepaper by KeyFruit Inc.Digital Asset Management Whitepaper by KeyFruit Inc.
Digital Asset Management Whitepaper by KeyFruit Inc.KeyFruit Inc.
 
Understanding the travel consumers path to purchase
Understanding the travel consumers path to purchaseUnderstanding the travel consumers path to purchase
Understanding the travel consumers path to purchaseGabriela Otto
 
Placement Portfolio
Placement PortfolioPlacement Portfolio
Placement PortfolioJPC Hanson
 

Semelhante a Data Science & BI Salary & Skills Report (20)

Big Data, Little Data, and Everything in Between
Big Data, Little Data, and Everything in BetweenBig Data, Little Data, and Everything in Between
Big Data, Little Data, and Everything in Between
 
Benefits of Modern Cloud Data Lake Platform Qubole GCP - Whitepaper
Benefits of Modern Cloud Data Lake Platform Qubole GCP - WhitepaperBenefits of Modern Cloud Data Lake Platform Qubole GCP - Whitepaper
Benefits of Modern Cloud Data Lake Platform Qubole GCP - Whitepaper
 
Unstructured Data and the Enterprise
Unstructured Data and the EnterpriseUnstructured Data and the Enterprise
Unstructured Data and the Enterprise
 
R@D 4 - Digital Activism Survey Report 2009
R@D 4 - Digital Activism Survey Report 2009R@D 4 - Digital Activism Survey Report 2009
R@D 4 - Digital Activism Survey Report 2009
 
Customer Experience Management
Customer  Experience ManagementCustomer  Experience Management
Customer Experience Management
 
Ercis wp 18new (1)
Ercis wp 18new (1)Ercis wp 18new (1)
Ercis wp 18new (1)
 
LafargeHolcim good application tips
LafargeHolcim good application tipsLafargeHolcim good application tips
LafargeHolcim good application tips
 
Wp cost transparency
Wp cost transparencyWp cost transparency
Wp cost transparency
 
Blockchain in HCM
Blockchain in HCM Blockchain in HCM
Blockchain in HCM
 
2012 Grantmakers Information Technology Survey Report
2012 Grantmakers Information Technology Survey Report2012 Grantmakers Information Technology Survey Report
2012 Grantmakers Information Technology Survey Report
 
Insights from the VMware 2013 Journey to IT as a Service Survey
Insights from the VMware 2013 Journey to IT as a Service SurveyInsights from the VMware 2013 Journey to IT as a Service Survey
Insights from the VMware 2013 Journey to IT as a Service Survey
 
Concorde_TechBooklet_6.1.16
Concorde_TechBooklet_6.1.16Concorde_TechBooklet_6.1.16
Concorde_TechBooklet_6.1.16
 
The Role of Analytics In Defining The Art Of The Possible
The Role of Analytics In Defining The Art Of The PossibleThe Role of Analytics In Defining The Art Of The Possible
The Role of Analytics In Defining The Art Of The Possible
 
Manufacturing-Tech-Survey-Results-US50602523-IB.pdf
Manufacturing-Tech-Survey-Results-US50602523-IB.pdfManufacturing-Tech-Survey-Results-US50602523-IB.pdf
Manufacturing-Tech-Survey-Results-US50602523-IB.pdf
 
ia revoluciona el mundo gracias a copilot
ia revoluciona el mundo gracias a copilotia revoluciona el mundo gracias a copilot
ia revoluciona el mundo gracias a copilot
 
01 content analytics-iw2015
01 content analytics-iw201501 content analytics-iw2015
01 content analytics-iw2015
 
Digital Asset Management Whitepaper by KeyFruit Inc.
Digital Asset Management Whitepaper by KeyFruit Inc.Digital Asset Management Whitepaper by KeyFruit Inc.
Digital Asset Management Whitepaper by KeyFruit Inc.
 
Ytl03375 usen
Ytl03375 usenYtl03375 usen
Ytl03375 usen
 
Understanding the travel consumers path to purchase
Understanding the travel consumers path to purchaseUnderstanding the travel consumers path to purchase
Understanding the travel consumers path to purchase
 
Placement Portfolio
Placement PortfolioPlacement Portfolio
Placement Portfolio
 

Data Science & BI Salary & Skills Report

  • 1. $ € £ Salary & Skills Report Learn More To Earn More $ Data Science & BI www.packtpub.com/skillup
  • 2. 2 Data Science & Bi Salary & Skills Report ‘What you need to know to earn more in Data Science and Business Intelligence’ The most comprehensive global IT skills and salary survey ever.
  • 3. 3 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ Contents What is Skill Up?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Respondents. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Which Industries are best for Data jobs – and which roles are most valuable?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Industry Breakdown. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 Career Development and Data-Oriented Roles. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Technology Usage Analysis. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 What exactly are people using?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 What else are data Pythonistas using on a daily basis?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 Tech Stacks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 1. Data Visualizers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 2. Programmatic Data Wranglers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 3. Big Data Experts. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 4. Data Architects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 What comes next?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 What Trends Are Emerging?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 Hot Topics. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20 Do you think Julia will replace R and Python as the data science language of choice in the next 12 months?. . . . . . . . . . . . . . . . 20 Do you think Apache Spark is likely to Replace Hadoop in the next 12 months?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 Is the line between data analysis and data retrieval being blurred?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 Is your company planning to implement a big data project over the next 12 months?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 And finally.... . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 Does Excel still hold a place in your heart?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 Summary. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
  • 4. 4 Data Science & Bi Salary & Skills Report The world of Data Science is rapidly growing, with data becoming increasingly vital to a huge range of organizations. Undoubtedly tied to the much-discussed rise of ‘Big Data’ over the past decade, data-oriented roles are today some of the most prominent technical roles in the economy. This report, focusing on the data science respondents to Packt’s Skill Up survey explores where data science is most valuable, what tools are being used, and what the trends and challenges will be in the future. ll Data science is immensely valuable to SMEs evidenced by their investment in young talent. ll Finance is still a lucrative sector for data science. ll R and Python are still neck and neck as the key data science languages. ll Distributed Computing and machine learning are still on the ascendancy. ll IOT is one of the hottest trends for data scientists that promises to bring new challenges and opportunities. ll Excel will never die! The need to answer these questions led us to look at the community as a whole, and so we decided to launch our Skill Up campaign. What is Skill Up? With our Skill Up survey we wanted to look at the tech community as a whole to identify upcoming trends over the next few years and share what you can do to ensure you get the most out of your career and skills. We divided our survey into 4 segments, Web Development & Design, Application Development, Security & System Administration, and Data Science & Business Intelligence, making this one of the most comprehensive surveys in recent years. Specifically we asked: ll What skills lead to a higher salary? ll What skills/technologies are most highly valued by different industries? ll What cutting edge technologies are really worth you spending your time learning? To get a better idea of the community’s thoughts we asked you all to fill in our survey, the results of which you can find compiled here in this report, giving you the facts, the figures, and more importantly – the knowledge and skills you need to make the best career decisions. Let’s look at the results in more detail.
  • 5. 5 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ Respondents Respondents by Country United States United Kingdom GermanyCanada India Australia Italy Spain Brazil France Poland Netherlands Mexico Sweden Belgium Switzerland Russia Denmark Japan Ireland South Africa Finland Czech Republic Portugal Singapore Norway Hungary New Zealand Romania Colombia Argentina Austria Thailand Greece Malaysia Indonesia Israel Croatia
  • 6. 6 Data Science & Bi Salary & Skills Report Job Sector The Data Science stream, from which the data in this report is drawn, received over 3,800 responses from individuals with a wide range of experience levels, working in a diverse set of industries. Experience Level More than 20 years Less than 1 year 5-10 years 3-5 years 10-20 years 1-3 years 0 500 1000 1500 2000 2500 3000 3500 4000 Telecom m unications Health/Biotech/Science M edia/Advertising /Entertainm ent& Gam ing Governm ent Finance/Banking Education/Academ ia/Research Consulting W eb Services/Internet Softw are Products
  • 7. 7 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ $0 $20,000 $40,000 $60,000 $80,000 $100,000 More than 20 yearsLess than 1 year Self employedStart-upEnterpriseSME There’s a consensus that data-oriented roles are some of the most valuable around, especially in industries where data is so vital (there are very few industries where you could say it isn’t). But there are trends that indicate where it’s having the biggest impact, and what’s going to become more and more important for anyone working in data. ll SMEs pay inexperienced people the best. ll In terms of industries, Finance still offers the best pay for inexperienced employees. ll Data Architect is one of the most important roles in dynamic and fast- paced industries such as Media and Entertainment. This proves that the ability to build and implement business critical solutions is essential. Our research shows that SMEs are a great place for inexperienced data scientists and analysts tobegintheircareers,offeringahigherstarting salary than Enterprise organizations. Even those working for start-ups earn only slightly less than Enterprise. With the opportunities available in small, rapid growth organizations, Start-ups are a great option for anyone ambitious and eager to prove themselves! Which Industries are best for Data jobs – and which roles are most valuable? Salary by Company Type and Experience
  • 8. 8 Data Science & Bi Salary & Skills Report Industry Breakdown Our research also investigated how data science is faring in each sector. The responses provided a good indication of where data science is most critical. The largest number of respondents with more than 20 years experience are working in consulting. Moreover, the commonality of this suggests that external expertise is something that is very much in demand. Those respondents with least experience are predominantly working in Education/Academia and Research. However, as the graph overleaf shows, it does not appear to be one of the best paid industries. 0% 20% 40% 60% 80% 100% More than 20 years 10-20 years 5-10 years 3-5 years 1-3 years Less than 1 year Web Services /Internet Telecoms Software Products Media/Advertising /Entertainment and Gaming Health /Biotech /Science Government Finance/Banking Education /Academia /Research Consulting
  • 9. 9 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ Well paid Industries for Less Experienced people Finance and Banking comes out as the best paid industry according to our data. With the rise of algorithmic trading, and Big Data in general playing a large part in just about every component of Finance, there are huge opportunities for data fluent people to command high salaries, even without a great deal of experience. Want to get to grips with quantitative finance? Pick up this bundle today. ll Mastering R for Quantitative Finance ll Introduction to R for Quantitative Finance ll Python for Finance ll Mastering Python for Finance ll Advanced Quantitative Finance with C++ Media / Advertising / Entertainment and Gaming, while certainly not as lucrative as the Financial sector, appears to be a sector offering a substantial salary to inexperienced people. If we consider the fact that this sector, taken generally, is very competitive for inexperienced people and known for low salaries, the data provides a clear indication that these industries are willing to invest in inexperienced people with technical and numerical skills. This is an indication that these industries are relying on data-driven strategies to remain competitive in tough areas of the economy. While they may lack the funds and cashflow to invest in established data professionals, inexperienced people with the right skills might well fill this role perfectly, without commanding such high salaries. $0 $1,0000 $20,000 $30,000 $40,000 $50,000 $60,000 $70,000 $80,000 Telecommunications Software Products Web Services/Internet Education/Academia/Research Health/Biotech/Science Government Media/Advertising /Entertainment and Gaming Consulting Finance/Banking
  • 10. 10 Data Science & Bi Salary & Skills Report Career Development and Data-Oriented Roles We’ve seen indicators of what’s best for inexperienced data professionals, but what about career development? We also wanted to see how the different job types fare in different industries. The graph below shows which specific roles command the highest salaries, and in which industry these roles are most valuable. ll Data Architect is one of the most important roles within Media/ Advertising/Entertainment and Gaming, commanding a higher salary in other areas. ll Comparing the roles of Data Architect to Statistician, we can infer some key differences about how data science and data-driven strategies are playing out between industries. In Finance, the statistician earns slightly more than the data architect whilst in the Media category, the Data Architect earns significantly more. ll In Media and Entertainment, where agility and organizational change is essential for rapid responses to change, there is a high value placed on someone who is able to develop a solution, such as a Data Architect. Whereas, in Finance these architectures are already in place. Average Salary by Industry & Job Type $0 $20,000 $40,000 $60,000 $80,000 $100,000 $120,000 Statistician Data Scientist/Analyst Business Intelligence Expert (includes Devs, analysts) Data Architect Assistant Director Business Intelligence Expert (includes Devs, analysts) Data Scientist/Analyst Data Architect Statistician Intermediate IT /Regulation/Business Data Scientist/Analyst Business Intelligence Expert (includes Devs, analysts) Statistician Data Architect Media/Advertising /Entertainment andGaming Finance/BankinGCONSULTING
  • 11. 11 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ Technology Usage Analysis What exactly are people using? ll More than 25% of respondents use Python on a daily basis, but almost the same number use R. ll Distributed computing and machine learning tools are becoming more and more important. ll Augmented Reality and the Internet of Things are poised to change how we think about data. The last two points spell out the next 5 years in data. The volume of data available from all sources will only continue to grow exponentially. The technology to deal with that data is still being developed, so don’t let yourself get left behind! We asked what tools people use on a daily basis, and here’s what they said: Python is definitely top dog when it comes to data. Its versatility and wealth of easy to use third- party libraries for everything from machine learning to web scraping, combined with its low barrier to entry, make it the ideal choice. 0% 5% 10% 15% 20% 25% 30% gis c++ mysql html c# php oracle excel linux javascript java r sql python
  • 12. 12 Data Science & Bi Salary & Skills Report What else are data Pythonistas using on a daily basis? A large number of respondents are using Python and R, Python and Java, or Python and C++. Get to grips with both Python and R to broaden your fluency and become a more flexible data scientist: ll Practical Data Science Cookbook ll R Data Analysis Cookbook ll Python Data Analysis ll R for Data Science ll Python Data Science Essentials 0% 5% 10% 15% 20% 25% 30% bash html mysql gis c data analysis spark hadoop machine learning c++ linux javascript java sql r
  • 13. 13 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ Tech Stacks To get a better idea of what people are commonly using together, let’s do some clustering. We put all the responses into a graph database, and ran a clustering algorithm across the techs people use every day, and from this we’ve identified some coherent clusters of people based on their stack: These are people working at the design end of the data science spectrum. As you can see from the cluster of tools below, they have as much in common with a front end web developer as any of our groups. JavaScript comes top here – unsurprising when you consider how much it dominates web development today – and other design tools and plugins such as CSS, HTML5 and jQuery serve to underline its dominance. The very presence of this group of tools in our data emphasises the importance of communicating insight via the web, and highlights just how important design is when trying to understand and interpret data. 1. Data Visualizers 1. javascript 2. html 3. css 4. php 5. html5 6. jquery 7. java 8. xml
  • 14. 14 Data Science & Bi Salary & Skills Report This group of tools are used by those people that squeeze insight out of data. They are primarily responsible for mining, cleaning and manipulating data very quickly, in order to answer specific questions about everything from customer behaviour to financial planning. Python here comes out as the most important tool – perhaps unsurprising given how easy it is to prototype and its much-vaunted flexibility. The presence of pandas serves to underline Python’s dominance – indeed, it might consolidate it precisely because of the way in which pandas improve Python’s data analysis capabilities. But it’s also interesting to see C++ here – while Python offers flexibility and ease, the impressive speed that C++ offers is still unrivalled. 2. Programmatic Data Wranglers 1. python 2. c++ 3. linux 4. bash 5. pandas 6. matlab 7. machine learning 8. postgresql
  • 15. 15 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ This cluster of tools is used by Big Data specialists, interested in scalability and robustness. Hadoop here dominates the Big Data world – however, Scala and Spark are also growing in prominence, as their presence in this cluster indicates. As demand grows for faster processing (and by extension real-time analytics), it’s likely that we’ll see more from them. It’s also interesting to see web tools such as JavaScript and Spring included here. As with the first cluster, this indicates the need to communicate effectively and quickly through web based applications. 3. Big Data Experts 1. java 2. hadoop 3. mysql 4. javascript 5. scala 6. spring 7. spark 8. maven
  • 16. 16 Data Science & Bi Salary & Skills Report This provides a useful insight into how the world of data breaks down, and how different roles appear to be built around ‘ecosystems’ of tools. One of the interesting questions that comes out of this is how this might change over the coming years. Is it possible that these clusters will become more fluid? Thisgroupoftoolsreflectstheneedtoorganize and communicate data insights in effective and intelligent ways – the main challenge of a Data Architect. Clearly, Enterprise-Ready tools dominate this cluster, indicating that Microsoft and Oracle are still regarded as go-to brands when it comes to these business-critical tools. 4. Data Architects 1. sql 2. ms 3. oracle 4. server 5. ssis 6. ssas 7. database 8. ssrs
  • 17. 17 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ There are a few key points our research suggests here: ll Spark and Hadoop are well represented, suggesting the growth of cluster computing. ll Web Based technologies are also appearing here indicating people are seeing a value in sharing data analysis on the web. ll NoSQL databases are going to keep rising in the data world. What comes next? We asked people what tools they were planning on learning over the next 6 months. We wanted to know what’s hot, and what people in the know (those earning the money) are using, to help you decide in which area you’d like to enhance your skills, or learn new ones. In the tag cloud below word frequencies are weighted by salary. So what are people looking to learn? What Trends Are Emerging? We asked respondents what they think is the most important trend emerging in their field in the next 12 months:
  • 18. 18 Data Science & Bi Salary & Skills Report There are some clear messages here: ll Machine learning is going to become one of the focal points for everyone working in data, driven by a demand for predictive insights and statistical analysis in a range of sectors and industries. ll Augmented Reality and Internet of Things are going to be key challenges for data scientists over the next few years. ll The prominence of mobile in the tag cloud suggests an increased emphasis on mobile analytics as users move further away from desktop. ll Distributed Computing (both on clusters and on the Cloud) is going to change the way we even think about data, suggesting increasing anxiety about how to manage resources for Big Data projects. This is possibly symptomatic of the two outcomes above. As IoT and mobile become more dominant, managing larger datasets is going to become a greater challenge. If you’re involved with a Big Data project, you’re going to need Machine Learning. Pick up this bundle and start exploring machine learning and predictive analytics today. ll Machine Learning with Spark ll Scala for Machine Learning ll Machine Learning with R ll Machine Learning with R Cookbook ll Building Machine Learning Systems with Python - Second Edition
  • 19. 19 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ Hot Topics According to the highest earning respondents, Julia is on the ascent. It’s easy to see why as it is designed specifically for technical computing, and boasting interesting features such as multiple dispatch, useful libraries for graphing, and impressive JIT compiler benchmarks. Julia is one to watch! Stay ahead of the trend and start learning Julia with this essential selection of Julia books. ll Getting started with Julia Programming Language ll Mastering Julia ll Getting Started with LLVM Core Libraries ll Python High Performance Programming ll R High Performance Programming To complete our survey, we asked respondents some simple questions about hot topics, trends and challenges in the data. DoyouthinkJuliawillreplaceRandPythonasthedatasciencelanguage of choice in the next 12 months? $81,000 $82,000 $84,000 $83,000 $85,000 $86,000 YesNo
  • 20. 20 Data Science & Bi Salary & Skills Report Our survey says you’re safe for now! However, if you’re not already on the Hadoop train, there’s never been a better time to get on it… ll Learning Hadoop 2 ll Mastering Hadoop ll Big Data Analytics with R and Hadoop ll Fast Data Processing with Spark - Second Edition ll Apache Mesos Essentials Do you think Apache Spark is likely to Replace Hadoop in the next 12 months? $81,000 $82,000 $84,000 $83,000 $85,000 $86,000 YesNo
  • 21. 21 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ Is the line between data analysis and data retrieval being blurred? Across the board, this was contentious. ll 64% agree that it is. This suggests tools such as BigQuery could become more prominent. It’s certainly a tool to watch over the next 12 months. As Big Data becomes ubiquitous, the aim of the game is no longer to simply have the most effective strategy, but also the most efficient and fast. Learn how to master the art of data analysis and retrieval with this bundle of popular books: ll Clean Data ll Practical Data Analysis ll Mastering Predictive Analytics with R ll Learning Data Mining with R ll Learning Pandas 0% 20% 40% 60% 80% 100% YesNo YesNo YesNo YesNo YesNo More than 20 years 10-20 years 5-10 years 3-5 years 1-3 years
  • 22. 22 Data Science & Bi Salary & Skills Report Excel is eternal, Excel 2013 is awesome…all of you know it deep down! And finally... Does Excel still hold a place in your heart? Is your company planning to implement a big data project over the next 12 months? It appears that most respondents are working in organizations looking to implement Big Data projects. The significant anomaly to that are those with very little experience. The reasons for this aren’t immediately obvious, but there are a number of possible explanations. Itcouldbethatthosewithverylittleexperience simply aren’t privy to organizational strategy and decision-making. Conversely, it could be that those people who are just starting out have been employed precisely because a Big Data project has been implemented. 0% 20% 40% 60% 80% 100% More than 20 years 10-20 years 5-10 years 3-5 years 1-3 years Less than 1 year YesNo YesNo YesNo YesNo YesNo YesNo Yes No
  • 23. 23 Data Science & Bi Salary & Skills ReportLearn More To Earn More $ Summary It may be a truism, but it’s clear, and perhaps it has been for years, that Data Science and Big Data are not simply trends, but are instead symptomatic of a wider social, cultural and economic change. It’s time we stopped talking about the ‘Big Data revolution’ or how ‘data scientist’ is the ‘sexiest job of the twenty-first century’, and instead look at the different ways data is being used in different areas. For SMEs, data is crucial for making companies more responsive and open to changes in the market. The prominence of machine learning underlines this further, making it clear that there is a real onus on delivering rapid insight and fast! For larger organizations there is a drive towards creating faster Big Data solutions. The apparent rise of distributed and cluster computing is evidence of this, as data scientists and analysts look for new ways to put tools such as Hadoop and Spark to work. But even more interesting is how our understanding of data looks set to change, thanks to emerging trends such as the Internet of Things and Augmented Reality. It’s possible that IOT will become the buzzword to replace Big Data. How Data Scientists, analysts and architects tackle it day to day isn’t clear yet, but it will almost certainly be a challenge that will offer exciting opportunities for data literate people everywhere. What you should be doing if you’re a data scientist: ll Broadening the range of languages you know is essential. It will help you become more flexible when working on a range of different projects and also provides you with more solutions. If you know R, why not learn Python? ll You need to get to grips with Machine Learning. If you want to get started or investigate it further, grab our Machine Learning bundle! ll If you’re just starting your career, you could do a lot worse than working in Finance or for an SME. You might command a higher salary working for an established enterprise organization, but the difference is likely to be small with more opportunities and responsibility at an SME. ll If you’re interested in working in popular industries such as Media, work towards becoming a data architect, and learn how to develop and implement large-scale data solutions that can deliver benefits across an organization. ll Getting to grips with Big Data tools such as Hadoop and Spark will be valuable, but learning how to use them in the context of distributed networks will be even more valuable as resources become stretched. ll Pay attention to IoT – we still don’t quite know where it will lead the data world!
  • 24. Helping IT professionals to put software to work in new ways www.packtpub.com Tel: +44 (0)121 265 6484 Published 2015 Data Analyst Greg Roberts Project Manager Sarah Cullington Editor Richard Gall Technical Advisor Akram Hussain Design Chris Murray Founded in 2004 in Birmingham, UK, Packt’s mission is to help the world put software to work in new ways, through the delivery of effective learning and information services to IT professionals. Workingtowardsthatvision,wehavepublished over 3000 books and videos so far, providing IT professionals with the actionable knowledge they need to get the job done –whether that’s specific learning on an emerging technology or optimizing key skills in more established tools. As part of our mission, we have also awarded over $1,000,000 through our Open Source Project Royalty scheme, helping numerous projects become household names along the way.