SlideShare uma empresa Scribd logo
1 de 30
Demystifying Data Science
A Realistic Perspective
By
R Venkat Raman
• So Many Definitions
• So Many Assumptions
• So Many Expectations
• So Much Hype
WHY THE NEED TO DEMYSTIFY ?
R Venkat Raman
So Many Definitions
R Venkat Raman
WHAT IS DATA SCIENCE?
“Data science is the field of study that combines domain expertise, programming
skills, and knowledge of math and statistics to extract meaningful insights from data.”
“Data science is the discipline of making data useful.”
“Data Science as a multi-disciplinary subject encompasses the use of mathematics, statistics, and
computer science to study and evaluate data. The key objective of Data Science is to extract valuable
information for use in strategic decision making, product development, trend analysis and forecasting.”
“Data science is a ‘concept to unify statistics, data analysis, machine learning and their
related methods’ in order to ‘understand and analyze actual phenomena’ with data.”
R Venkat Raman
WHAT IS DATA SCIENCE?
THE VENN DIAGRAMS
R Venkat Raman
WHO IS A DATA SCIENTIST?
“An ideal data scientist is someone who has the both the engineering skills to acquire and manage
large data sets, and also has the statistician’s skills to extract value from the large data sets and
present that data to a large audience”
“A data scientist is someone who blends, math, algorithms, and an understanding of human
behaviour with the ability to hack systems together to get answers to interesting human questions
from data”
“A Data Scientist is a person who does Data Science”
“Person who is better at statistics than any software engineer and
better at software engineering than any statistician.”
R Venkat Raman
So Many Assumptions
R Venkat Raman
HOW PEOPLE PERCEIVE DATA SCIENCE
R Venkat Raman
So Many Expectations
R Venkat Raman
BECOMING DATA SCIENTIST – QUICKLY !!
R Venkat Raman
GETTING RICH QUICKLY !!
R Venkat Raman
So Much Hype
R Venkat Raman
CASE OF OLD WINE IN NEW BOTTLE ?
R Venkat Raman
ARTIFICIAL GENERAL INTELLIGENCE ?
R Venkat Raman
THE HYPE CYCLE – WHERE ARE WE ?
We are here
R Venkat Raman
Why This Buzz Now ?
R Venkat Raman
INCREASED STORAGE AND COMPUTING POWER
THE STATISTICS – MACHINE LEARNING DIVERGENCE
• In the 20th century, the computing and storage power was less. This required statisticians to infer a lot of things from a
sample. Hence inferential statistics was heavily used and relied upon.
• Fast forward now, the computing and storage power has increased substantially. This enabled machine learning and Deep
learning to blossom. In Machine/Deep Learning, more data the better as the prediction improves with more quality training
data. This thinking is divergent from a 20th century statistical thinking.
R Venkat Raman
EXPLOSION OF DATA
• 2.5 quintillion bytes of data created each day1
• 90% of the data in the world today has been created in the last two
years alone1
• More than 3.7 Billion humans use the internet 1
• Every minute Snapchat users share 527,760 photos, Users watch
4,146,600 YouTube videos, 456,000 tweets are sent on Twitter,
Instagram users post 46,740 photos
• Close to 3 Billion smartphone users in the world
1:Report as of 2018
There is tremendous scope to extract insights out of these data !
Hence the demand for Data Scientists.
R Venkat Raman
Let’s Demystify
R Venkat Raman
THE VARIOUS FACETS OF DATA SCIENCE?
R Venkat Raman
DATA SCIENCE – A TEAM EFFORT
Data Engineers Data Scientists Data Storyteller/TranslatorsSoftware Engineers
What They Do
Skill Set
Tools Used
• Create Data pipelines.
• Evaluate Databases
• Design Schemas
• Perform ETL
• Knowledge of Databases
• Scripting skills (Linux
commands)
• Knowledge of Cloud
technologies
• SQL commands
• Apply statistical/Machine
learning techniques to
solve business problems
• Perform R&D
• Innovate new solutions
• Develop Data science
products
• Knowledge of statistical
and mathematical
concepts
• Knowledge of various
statistical/ML algorithms
• Scripting skills
(R/Python)
• SQL commands
• Help design UI (front end
coding)
• Do backend coding
• Help deploy data science
solution in production
• Automate the entire
process
• Knowledge of
Programming concepts
• Programming languages
• Knowledge of Databases
• Knowledge of Restful
APIs
• Scripting skills (Linux
commands)
• Communicate Data Science
solutions in Business friendly/ non
technical terms
• Understand business requirements
and translate them to Data science
problems
• Design persuasive Data
visualizations
• High level understanding of
statistics and ML concepts
• Business acumen
• Good soft skills
• Creativity
• Persuasion and articulation
R Venkat Raman
WHY DATA SCIENTISTS ARE VALUED?
R Venkat Raman
THE DATA SCIENTIST TALENT STACK
IDEA INSPIRED BY SCOTT ADAM’S TALENT STACK THEORY
Knowledge of Inner
workings of Algorithms
Statistics/Maths Skills
Coding/ Technical Skills
Persuasion /Storytelling
R Venkat Raman
THE PATH TO BECOME A DATA SCIENTIST
• Can anyone become a Data Scientist ?
Yes
• Can a person become a Data Scientist just by doing some Moocs/short courses for a duration of 3-6 months ?
No
R Venkat Raman
HOW GOOD ARE THE MOOCS AND KAGGLE COMPETITIONS?
TOO MUCH SIGNALING
• There are thousands of courses available online now.
• While the courses may be useful to build knowledge or act as a
repository for revising concepts, the course certificates by
themselves does not guarantee to a person a Data Science Job
• Millions of people take the same courses and the solutions to the
questions of these Moocs are easily hackable or available
• Kaggle competitions are a competition more for showcasing processing
speed or ensemble techniques than intellectual rigor.
• The data is never clean in real life as given in Kaggle competitions
• But Kaggle kernels are useful
MOOCs
Kaggle Competitions
R Venkat Raman
GETTING HIRED AS A DATA SCIENTIST
HOW TO IMPROVE VISIBILITY AND BECOME EMPLOYABLE
• Focus on a specific area like NLP, Computer Vision,
Marketing Analytics, Classical Statistical applications. Try to
be specialist than a generalist.
• This strategy will work to gain entry into the field of
Data Science. But as one gains more experience, it
becomes harder to stay a specialist unless one is in
an academic framework.
• Write technical and non technical blogs
• Try the Feynman technique of learning things
• Do pet projects, develop small products, put the code on
GitHub
• Learn niche and complimentary skills like putting the code
in production or how to dockerize codes.
• Network with Data Scientists in Industry and Academia
• Follow the Data Scientists on Twitter or LinkedIn
• As an Institution or Individual, start Data Science podcasts
R Venkat Raman
BLUE OCEAN STRATEGY – BECOME A DATA SCIENCE TRANSLATOR
R Venkat Raman
REFERENCES & RESOURCES
Slide 4:
https://www.datarobot.com/wiki/data-science/
https://www.kdnuggets.com/2018/09/what-is-data-science.html
https://en.wikipedia.org/wiki/Data_science
https://www.digitalvidya.com/blog/what-is-data-science/
Slide 5:
https://www.datasciencecentral.com/profiles/blogs/difference-of-data-science-machine-learning-and-data-mining
https://towardsdatascience.com/introduction-to-statistics-e9d72d818745
https://towardsdatascience.com/introduction-to-statistics-e9d72d818745
Slide 6:
https://bigdata-madesimple.com/what-is-a-data-scientist-14-definitions-of-a-data-scientist/
https://twitter.com/josh_wills/status/198093512149958656?lang=en
Slide 8:
https://me.me/i/data-scientist-31-1-120-0-what-my-friends-think-15a983c0fbc54a91a76d8b25d1c5daaa
Slide 11:
http://blog.fusemachines.com/data-scientist-sexiest-job-21st-century/
Slide 14:
https://www.cnbc.com/2018/03/13/elon-musk-at-sxsw-a-i-is-more-dangerous-than-nuclear-weapons.html
https://www.newyorker.com/magazine/2018/05/14/how-frightened-should-we-be-of-ai
https://www.forbes.com/sites/forbestechcouncil/2017/12/04/why-we-should-be-afraid-of-intelligent-machines/#74fbc13f6be1
R Venkat Raman
REFERENCES & RESOURCES
Slide 15:
https://www.botxo.co/2018/09/03/our-take-on-the-gartner-hype-cycle/
Slide 17:
https://ourworldindata.org/technological-progress
Slide 18:
https://www.socialmediatoday.com/news/how-much-data-is-generated-every-minute-infographic-1/525692/
https://www.forbes.com/sites/bernardmarr/2018/05/21/how-much-data-do-we-create-every-day-the-mind-blowing-stats-everyone-
should-read/#7cfa86f460ba
https://blog.microfocus.com/how-much-data-is-created-on-the-internet-each-day/
Slide 20:
https://blog.jedox.com/artificial-intelligence-business-intelligence-fpa-part-2/
Slide 23:
https://www.amazon.com/Win-Bigly-Persuasion-World-Matter/dp/0735219710
Slide 27:
https://www.forbes.com/sites/bernardmarr/2018/03/12/forget-data-scientists-and-hire-a-data-translator-instead/#4b209212848a
https://www.mckinsey.com/business-functions/mckinsey-analytics/our-insights/analytics-translator
https://sloanreview.mit.edu/article/why-your-company-needs-data-translators/
R Venkat Raman
Thank You !!
R Venkat Raman

Mais conteúdo relacionado

Mais procurados

Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First CourseArnab Majumdar
 
How to Become a Data Scientist
How to Become a Data ScientistHow to Become a Data Scientist
How to Become a Data Scientistryanorban
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceNiko Vuokko
 
Data Scientists Are Analysts Are Also Software Engineers
Data Scientists Are Analysts Are Also Software EngineersData Scientists Are Analysts Are Also Software Engineers
Data Scientists Are Analysts Are Also Software EngineersDomino Data Lab
 
Data Science 101
Data Science 101Data Science 101
Data Science 101odsc
 
Data Scientist Toolbox
Data Scientist ToolboxData Scientist Toolbox
Data Scientist ToolboxAndrei Savu
 
Course - Machine Learning Basics with R
Course - Machine Learning Basics with R Course - Machine Learning Basics with R
Course - Machine Learning Basics with R Persontyle
 
Datascienceindia article
Datascienceindia articleDatascienceindia article
Datascienceindia articleHimanshuPise1
 
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data ScienceGeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data ScienceMark West
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsSri Ambati
 
The Other 99% of a Data Science Project
The Other 99% of a Data Science ProjectThe Other 99% of a Data Science Project
The Other 99% of a Data Science ProjectEugene Mandel
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...Edureka!
 
Training in Analytics and Data Science
Training in Analytics and Data ScienceTraining in Analytics and Data Science
Training in Analytics and Data ScienceAjay Ohri
 
Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...Edureka!
 

Mais procurados (20)

Data science 101
Data science 101Data science 101
Data science 101
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First Course
 
How to Become a Data Scientist
How to Become a Data ScientistHow to Become a Data Scientist
How to Become a Data Scientist
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data Scientists Are Analysts Are Also Software Engineers
Data Scientists Are Analysts Are Also Software EngineersData Scientists Are Analysts Are Also Software Engineers
Data Scientists Are Analysts Are Also Software Engineers
 
Data Science 101
Data Science 101Data Science 101
Data Science 101
 
Data Scientist Toolbox
Data Scientist ToolboxData Scientist Toolbox
Data Scientist Toolbox
 
Data science
Data scienceData science
Data science
 
Course - Machine Learning Basics with R
Course - Machine Learning Basics with R Course - Machine Learning Basics with R
Course - Machine Learning Basics with R
 
Evaluation of big data analysis
Evaluation of big data analysisEvaluation of big data analysis
Evaluation of big data analysis
 
Datascienceindia article
Datascienceindia articleDatascienceindia article
Datascienceindia article
 
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data ScienceGeeCon Prague 2018 - A Practical-ish Introduction to Data Science
GeeCon Prague 2018 - A Practical-ish Introduction to Data Science
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
Data science Big Data
Data science Big DataData science Big Data
Data science Big Data
 
The Big Data Dream Team
The Big Data Dream TeamThe Big Data Dream Team
The Big Data Dream Team
 
The Other 99% of a Data Science Project
The Other 99% of a Data Science ProjectThe Other 99% of a Data Science Project
The Other 99% of a Data Science Project
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
 
Training in Analytics and Data Science
Training in Analytics and Data ScienceTraining in Analytics and Data Science
Training in Analytics and Data Science
 
Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...
 
Data Science
Data ScienceData Science
Data Science
 

Semelhante a Demystifying Data Science

Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)SayyedYusufali
 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)SayyedYusufali
 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)SayyedYusufali
 
Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and PlacementAkhilGGM
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?DIGITALSAI1
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification courseKumarNaik21
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabadVamsiNihal
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabadsaitejavella
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)SayyedYusufali
 
data science training and placement
data science training and placementdata science training and placement
data science training and placementSaiprasadVella
 
online data science training
online data science trainingonline data science training
online data science trainingDIGITALSAI1
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabadVamsiNihal
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabadVamsiNihal
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in HyderabadKumarNaik21
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training HyderabadNithinsunil1
 
DATA SCIENCE.pptx.pdf
DATA SCIENCE.pptx.pdfDATA SCIENCE.pptx.pdf
DATA SCIENCE.pptx.pdfRahulTr22
 

Semelhante a Demystifying Data Science (20)

LSESU a Taste of R Language Workshop
LSESU a Taste of R Language WorkshopLSESU a Taste of R Language Workshop
LSESU a Taste of R Language Workshop
 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)
 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)
 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)
 
Data Science Training and Placement
Data Science Training and PlacementData Science Training and Placement
Data Science Training and Placement
 
Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
data science training and placement
data science training and placementdata science training and placement
data science training and placement
 
online data science training
online data science trainingonline data science training
online data science training
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabad
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
DATA SCIENCE.pptx.pdf
DATA SCIENCE.pptx.pdfDATA SCIENCE.pptx.pdf
DATA SCIENCE.pptx.pdf
 

Último

定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 

Último (20)

定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 

Demystifying Data Science

  • 1. Demystifying Data Science A Realistic Perspective By R Venkat Raman
  • 2. • So Many Definitions • So Many Assumptions • So Many Expectations • So Much Hype WHY THE NEED TO DEMYSTIFY ? R Venkat Raman
  • 3. So Many Definitions R Venkat Raman
  • 4. WHAT IS DATA SCIENCE? “Data science is the field of study that combines domain expertise, programming skills, and knowledge of math and statistics to extract meaningful insights from data.” “Data science is the discipline of making data useful.” “Data Science as a multi-disciplinary subject encompasses the use of mathematics, statistics, and computer science to study and evaluate data. The key objective of Data Science is to extract valuable information for use in strategic decision making, product development, trend analysis and forecasting.” “Data science is a ‘concept to unify statistics, data analysis, machine learning and their related methods’ in order to ‘understand and analyze actual phenomena’ with data.” R Venkat Raman
  • 5. WHAT IS DATA SCIENCE? THE VENN DIAGRAMS R Venkat Raman
  • 6. WHO IS A DATA SCIENTIST? “An ideal data scientist is someone who has the both the engineering skills to acquire and manage large data sets, and also has the statistician’s skills to extract value from the large data sets and present that data to a large audience” “A data scientist is someone who blends, math, algorithms, and an understanding of human behaviour with the ability to hack systems together to get answers to interesting human questions from data” “A Data Scientist is a person who does Data Science” “Person who is better at statistics than any software engineer and better at software engineering than any statistician.” R Venkat Raman
  • 7. So Many Assumptions R Venkat Raman
  • 8. HOW PEOPLE PERCEIVE DATA SCIENCE R Venkat Raman
  • 9. So Many Expectations R Venkat Raman
  • 10. BECOMING DATA SCIENTIST – QUICKLY !! R Venkat Raman
  • 11. GETTING RICH QUICKLY !! R Venkat Raman
  • 12. So Much Hype R Venkat Raman
  • 13. CASE OF OLD WINE IN NEW BOTTLE ? R Venkat Raman
  • 15. THE HYPE CYCLE – WHERE ARE WE ? We are here R Venkat Raman
  • 16. Why This Buzz Now ? R Venkat Raman
  • 17. INCREASED STORAGE AND COMPUTING POWER THE STATISTICS – MACHINE LEARNING DIVERGENCE • In the 20th century, the computing and storage power was less. This required statisticians to infer a lot of things from a sample. Hence inferential statistics was heavily used and relied upon. • Fast forward now, the computing and storage power has increased substantially. This enabled machine learning and Deep learning to blossom. In Machine/Deep Learning, more data the better as the prediction improves with more quality training data. This thinking is divergent from a 20th century statistical thinking. R Venkat Raman
  • 18. EXPLOSION OF DATA • 2.5 quintillion bytes of data created each day1 • 90% of the data in the world today has been created in the last two years alone1 • More than 3.7 Billion humans use the internet 1 • Every minute Snapchat users share 527,760 photos, Users watch 4,146,600 YouTube videos, 456,000 tweets are sent on Twitter, Instagram users post 46,740 photos • Close to 3 Billion smartphone users in the world 1:Report as of 2018 There is tremendous scope to extract insights out of these data ! Hence the demand for Data Scientists. R Venkat Raman
  • 20. THE VARIOUS FACETS OF DATA SCIENCE? R Venkat Raman
  • 21. DATA SCIENCE – A TEAM EFFORT Data Engineers Data Scientists Data Storyteller/TranslatorsSoftware Engineers What They Do Skill Set Tools Used • Create Data pipelines. • Evaluate Databases • Design Schemas • Perform ETL • Knowledge of Databases • Scripting skills (Linux commands) • Knowledge of Cloud technologies • SQL commands • Apply statistical/Machine learning techniques to solve business problems • Perform R&D • Innovate new solutions • Develop Data science products • Knowledge of statistical and mathematical concepts • Knowledge of various statistical/ML algorithms • Scripting skills (R/Python) • SQL commands • Help design UI (front end coding) • Do backend coding • Help deploy data science solution in production • Automate the entire process • Knowledge of Programming concepts • Programming languages • Knowledge of Databases • Knowledge of Restful APIs • Scripting skills (Linux commands) • Communicate Data Science solutions in Business friendly/ non technical terms • Understand business requirements and translate them to Data science problems • Design persuasive Data visualizations • High level understanding of statistics and ML concepts • Business acumen • Good soft skills • Creativity • Persuasion and articulation R Venkat Raman
  • 22. WHY DATA SCIENTISTS ARE VALUED? R Venkat Raman
  • 23. THE DATA SCIENTIST TALENT STACK IDEA INSPIRED BY SCOTT ADAM’S TALENT STACK THEORY Knowledge of Inner workings of Algorithms Statistics/Maths Skills Coding/ Technical Skills Persuasion /Storytelling R Venkat Raman
  • 24. THE PATH TO BECOME A DATA SCIENTIST • Can anyone become a Data Scientist ? Yes • Can a person become a Data Scientist just by doing some Moocs/short courses for a duration of 3-6 months ? No R Venkat Raman
  • 25. HOW GOOD ARE THE MOOCS AND KAGGLE COMPETITIONS? TOO MUCH SIGNALING • There are thousands of courses available online now. • While the courses may be useful to build knowledge or act as a repository for revising concepts, the course certificates by themselves does not guarantee to a person a Data Science Job • Millions of people take the same courses and the solutions to the questions of these Moocs are easily hackable or available • Kaggle competitions are a competition more for showcasing processing speed or ensemble techniques than intellectual rigor. • The data is never clean in real life as given in Kaggle competitions • But Kaggle kernels are useful MOOCs Kaggle Competitions R Venkat Raman
  • 26. GETTING HIRED AS A DATA SCIENTIST HOW TO IMPROVE VISIBILITY AND BECOME EMPLOYABLE • Focus on a specific area like NLP, Computer Vision, Marketing Analytics, Classical Statistical applications. Try to be specialist than a generalist. • This strategy will work to gain entry into the field of Data Science. But as one gains more experience, it becomes harder to stay a specialist unless one is in an academic framework. • Write technical and non technical blogs • Try the Feynman technique of learning things • Do pet projects, develop small products, put the code on GitHub • Learn niche and complimentary skills like putting the code in production or how to dockerize codes. • Network with Data Scientists in Industry and Academia • Follow the Data Scientists on Twitter or LinkedIn • As an Institution or Individual, start Data Science podcasts R Venkat Raman
  • 27. BLUE OCEAN STRATEGY – BECOME A DATA SCIENCE TRANSLATOR R Venkat Raman
  • 28. REFERENCES & RESOURCES Slide 4: https://www.datarobot.com/wiki/data-science/ https://www.kdnuggets.com/2018/09/what-is-data-science.html https://en.wikipedia.org/wiki/Data_science https://www.digitalvidya.com/blog/what-is-data-science/ Slide 5: https://www.datasciencecentral.com/profiles/blogs/difference-of-data-science-machine-learning-and-data-mining https://towardsdatascience.com/introduction-to-statistics-e9d72d818745 https://towardsdatascience.com/introduction-to-statistics-e9d72d818745 Slide 6: https://bigdata-madesimple.com/what-is-a-data-scientist-14-definitions-of-a-data-scientist/ https://twitter.com/josh_wills/status/198093512149958656?lang=en Slide 8: https://me.me/i/data-scientist-31-1-120-0-what-my-friends-think-15a983c0fbc54a91a76d8b25d1c5daaa Slide 11: http://blog.fusemachines.com/data-scientist-sexiest-job-21st-century/ Slide 14: https://www.cnbc.com/2018/03/13/elon-musk-at-sxsw-a-i-is-more-dangerous-than-nuclear-weapons.html https://www.newyorker.com/magazine/2018/05/14/how-frightened-should-we-be-of-ai https://www.forbes.com/sites/forbestechcouncil/2017/12/04/why-we-should-be-afraid-of-intelligent-machines/#74fbc13f6be1 R Venkat Raman
  • 29. REFERENCES & RESOURCES Slide 15: https://www.botxo.co/2018/09/03/our-take-on-the-gartner-hype-cycle/ Slide 17: https://ourworldindata.org/technological-progress Slide 18: https://www.socialmediatoday.com/news/how-much-data-is-generated-every-minute-infographic-1/525692/ https://www.forbes.com/sites/bernardmarr/2018/05/21/how-much-data-do-we-create-every-day-the-mind-blowing-stats-everyone- should-read/#7cfa86f460ba https://blog.microfocus.com/how-much-data-is-created-on-the-internet-each-day/ Slide 20: https://blog.jedox.com/artificial-intelligence-business-intelligence-fpa-part-2/ Slide 23: https://www.amazon.com/Win-Bigly-Persuasion-World-Matter/dp/0735219710 Slide 27: https://www.forbes.com/sites/bernardmarr/2018/03/12/forget-data-scientists-and-hire-a-data-translator-instead/#4b209212848a https://www.mckinsey.com/business-functions/mckinsey-analytics/our-insights/analytics-translator https://sloanreview.mit.edu/article/why-your-company-needs-data-translators/ R Venkat Raman
  • 30. Thank You !! R Venkat Raman

Notas do Editor

  1. Sources : https://www.datarobot.com/wiki/data-science/ https://www.kdnuggets.com/2018/09/what-is-data-science.html https://en.wikipedia.org/wiki/Data_science https://www.digitalvidya.com/blog/what-is-data-science/
  2. Source : https://www.datasciencecentral.com/profiles/blogs/difference-of-data-science-machine-learning-and-data-mining https://towardsdatascience.com/introduction-to-statistics-e9d72d818745 https://towardsdatascience.com/introduction-to-statistics-e9d72d818745
  3. Sources : https://bigdata-madesimple.com/what-is-a-data-scientist-14-definitions-of-a-data-scientist/ https://twitter.com/josh_wills/status/198093512149958656?lang=en
  4. Sources: https://me.me/i/data-scientist-31-1-120-0-what-my-friends-think-15a983c0fbc54a91a76d8b25d1c5daaa
  5. Source : http://blog.fusemachines.com/data-scientist-sexiest-job-21st-century/
  6. Source : https://www.cnbc.com/2018/03/13/elon-musk-at-sxsw-a-i-is-more-dangerous-than-nuclear-weapons.html https://www.newyorker.com/magazine/2018/05/14/how-frightened-should-we-be-of-ai https://www.forbes.com/sites/forbestechcouncil/2017/12/04/why-we-should-be-afraid-of-intelligent-machines/#74fbc13f6be1
  7. Source : https://www.botxo.co/2018/09/03/our-take-on-the-gartner-hype-cycle/
  8. Source: https://ourworldindata.org/technological-progress
  9. Source: https://www.socialmediatoday.com/news/how-much-data-is-generated-every-minute-infographic-1/525692/ https://www.forbes.com/sites/bernardmarr/2018/05/21/how-much-data-do-we-create-every-day-the-mind-blowing-stats-everyone-should-read/#7cfa86f460ba https://blog.microfocus.com/how-much-data-is-created-on-the-internet-each-day/
  10. Source : https://blog.jedox.com/artificial-intelligence-business-intelligence-fpa-part-2/
  11. Source : https://blog.jedox.com/artificial-intelligence-business-intelligence-fpa-part-2/
  12. Source: Inspired by Scott Adams Talent stack idea from his book – Win Bigly https://www.amazon.com/Win-Bigly-Persuasion-World-Matter/dp/0735219710
  13. Sources: https://www.forbes.com/sites/bernardmarr/2018/03/12/forget-data-scientists-and-hire-a-data-translator-instead/#4b209212848a https://www.mckinsey.com/business-functions/mckinsey-analytics/our-insights/analytics-translator https://sloanreview.mit.edu/article/why-your-company-needs-data-translators/