Data Skills for Digital Era-مهارت های داده ای

Data Skills for Digital Era
The Top Data Skills You Need To Get Hired
Main Focus
Data Science Business Intelligence
Big Data Data Engineering
Mohtat@ut.ac.ir 2
Data Skills for Digital Era-مهارت های داده ای
Data Science
Math & Statistics
Computer Science
Subject Matter Expertise
Mohtat@ut.ac.ir 4
Data Science is an
interdisciplinary field about
processes and systems to
extract knowledge or
insights from data, which is
a continuation of some of
the data analysis fields such
as statistics, data mining,
and predictive analytics,
similar to Knowledge
Discovery in
Databases (KDD).
Types of Analytics
Descriptive
Diagnostic
Prescriptive
Predictive
Mohtat@ut.ac.ir 6
Data
Science
Technology
Application
Mohtat@ut.ac.ir 8
Critical Skills for Data Scientists
Python
R
SQL
Data Mining Tools
Knime , ReapidMiner,
IBM SPSS Modeler
Excel
BI Tools
Tableau, Power BI, Qlik
Mohtat@ut.ac.ir 9
Top Python Libraries in Data Science
TensorFlow
“TensorFlow is an open source
software library for numerical
computation using data flow graphs.
PyTorch
“PyTorch is a Python package that
provides Deep neural networks built
on a tape-based autograd system
Numpy
“NumPy is the fundamental
package needed for scientific
computing with Python.
Scikit-Learn
“scikit-learn is a Python module for
machine learning built on NumPy,
SciPy and matplotlib.
Keras
“Keras is a high-level neural networks
API, written in Python and capable of
running on top of TensorFlow, CNTK,
or Theano.
Scipy
“SciPy is open-source software for
mathematics, science, and engineering.
Pandas
“pandas is a Python package providing
fast, flexible, and expressive data
structures designed to make working
with "relational" or "labeled" data both
easy and intuitive
Matplotlib
“Matplotlib is a Python 2D plotting
library which produces publication-
quality figures in a variety of
hardcopy formats and interactive
environments across platforms.
Scrapy
“Scrapy is a fast high-level web crawling
and web scraping framework, used to
crawl websites and extract structured
data from their pages.
Mohtat@ut.ac.ir 10
Top Skills every Data Scientist needs to Master
TensorFlow Keras Hadoop Spark Hive Java Matlab
Mohtat@ut.ac.ir 11
Most Essential Skills for Data Scientists
Complex Problem Solving
Team Working
Emotional Intelligence
Creativity
Critical Thinking
Negotiation
Mohtat@ut.ac.ir 12
Applied Data Science with Python
Michigan University(Coursera)
Basic Data Visualization Machine Learning Text Mining SNA
Applied Text Mining in Python
Introduction to Data Science in Python
Applied Plotting, Charting & Data
Representation in Python
Applied Machine Learning in Python Applied Social Network Analysis in
Python
Mohtat@ut.ac.ir 13LOGO HERE
Data Science Books
14
Data Skills for Digital Era-مهارت های داده ای
Business Intelligence
encompasses a wide variety of
tools, applications and
methodologies that enable
organizations to collect data
from internal systems and
external sources; prepare it for
analysis; develop and run
queries against that data; and
create reports, dashboards and
data visualizations to make the
analytical results available to
corporate decision-makers, as
well as operational workers.
BI
Mohtat@ut.ac.ir 17
Business Skills
Link to Business Strategy
Define Priorities
Define BI Vision
Lead Organization / BPR
Analytics Skills
Data Mining
Social BI
IT Skills
Infrastructure
Build Technology
Data Integration & Quality
Business
Intelligence
Architect
Simple is what it needs in business
Top Business Intelligence Skills
SQL
Data Warehousing
Data Analysis
Tableau
ETL
23%
85%
28%
41%
65%
Mohtat@ut.ac.ir 20
28%
Top Business Intelligence Skills
Business Analyst
Oracle
SQL Server BI
Business Process
Data Modeling 17%
85%
19%
21%
22%
Mohtat@ut.ac.ir 21
19%
Top Business Intelligence Tools
Tableau Power BI Qlik
Your Choice Is Clear
Mohtat@ut.ac.ir 22
Data Skills for Digital Era-مهارت های داده ای
Data Skills for Digital Era-مهارت های داده ای
Data Skills for Digital Era-مهارت های داده ای
Data Skills for Digital Era-مهارت های داده ای
Big Data
Volume
Terabyte
Distribute
Big Table
Velocity
Real-time
Stream Processing
Variety
Structured
Unstructured
Text, Image, Video
Mohtat@ut.ac.ir 27
Big data is a term used to
refer to data sets that are
too large or complex for
traditional data-processing
application software to
adequately deal with.
It’s what organizations do
with the data that matters.
Big data can be analyzed
for insights that lead to
better decisions and
strategic business moves.
Hadoop Ecosystem
3 Types of Big Data Jobs
1 2
3
Big Data Developer
Big Data Administration
Big Data Analytics
Mohtat@ut.ac.ir 29
Top Big Data Programming Languages
Not only Hadoop, many other big data analysis tools like Storm,
Spark, and Kafka are written in Java and run on the JVM
Java
Python is a simple, open-source, general-purpose language.
Hence, it is easy to learn Python for anyone.. With its rich set
of utilities and libraries and easy-to-use features, it works
wonder for big data processing and analysis.
Python
Scala is a rival of Java and Python in the world of Data Science
and becoming more and more popular due to extensive use of
Apache Spark in Big data Hadoop industry.
Scala
Mohtat@ut.ac.ir 30
Pathway to Success
Success
Apache Hadoop
Apache Spark
Start
NoSQL Database
Data Analytics
Data Visualization
Mohtat@ut.ac.ir 31
Big Data Companies & Vendors
Cloudera, Inc. is a US-based
software company that
provides a software platform
for data engineering, data
warehousing, machine
learning and analytics that
runs in the cloud or on
premises
Cloudera
MapR is a business software
company headquartered in
Santa Clara, California. MapR
provides access to a variety of
data sources from a single
computer cluster, including big
data workloads
MapR
Hortonworks is a data software
company based in Santa Clara,
California that develops,
supports, and provides expertise
on a set of open-source software
designed to manage data and
processing for things such as IOT,
single view of X, and advanced
analytics and machine learning
Hortonworks
34
‫داده‬‫کالن‬ ‫زیرساخت‬ ‫اجرا‬ ‫و‬ ‫نصب‬
Mohtat@ut.ac.ir
35
‫داده‬‫کالن‬ ‫زیرساخت‬ ‫اجرا‬ ‫و‬ ‫نصب‬
Mohtat@ut.ac.ir
Big Data Specialization
Michigan University(Coursera)
Introduction to Big Data
Big Data Modeling and
Management Systems
Big Data Integration and Processing
Machine Learning With Big Data
Graph Analytics for Big Data
Mohtat@ut.ac.ir 36LOGO HERE
Apache Spark
Berkeley University
Mohtat@ut.ac.ir 37LOGO HERE
Big Data Book
38
Data Skills for Digital Era-مهارت های داده ای
Data Scientist VS Data Engineer
Mohtat@ut.ac.ir 40
Dolor sit ametis
Data Engineering
Data Scientist
Data Pipelines
Visualization & Storytelling
Programming
Modeling & Advance Analytics
Math & Statistics
System Implementation
How To Become A Data Engineer
Linux
NoSQL & SQL
Python / Java / Scala
Agile Development
Data Ingestion
Processing Frameworks
Mohtat@ut.ac.ir 42
Best Data Processing Frameworks
MapReduce is a programming model
and an associated implementation for
processing and generating big data
sets with a parallel, distributed
algorithm on a cluster
Apache Spark is an open-
source distributed
general-purpose cluster-
computing framework.
Apache Storm is a free
and open source
distributed realtime
computation system.
The core of Apache Flink
is a distributed streaming
dataflow engine written in
Java and Scala
43
Cassandra
Best NoSQL Database
Mohtat@ut.ac.ir 44
Data Ingestion Tools
Apache Kafka
SSIS & ODI
Apache NiFi
Logstash
Mohtat@ut.ac.ir 45
Data Skills for Digital Era-مهارت های داده ای
Mohtat@ut.ac.ir
https://www.linkedin.com/in/mohtat
https://www.t.me/DataAnalysis
Contact Us
Thank You
1 de 41

Recomendados

Introduction to Data Mining, Business Intelligence and Data Science por
Introduction to Data Mining, Business Intelligence and Data ScienceIntroduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data ScienceIMC Institute
3.3K visualizações33 slides
Big data analysis por
Big data analysisBig data analysis
Big data analysisSAishwaryaDinesh
285 visualizações15 slides
BigData Analysis por
BigData AnalysisBigData Analysis
BigData AnalysisInnfinision Cloud and BigData Solutions
1.6K visualizações21 slides
The book of elephant tattoo por
The book of elephant tattooThe book of elephant tattoo
The book of elephant tattooMohamed Magdy
171 visualizações30 slides
Business case for Big Data Analytics por
Business case for Big Data AnalyticsBusiness case for Big Data Analytics
Business case for Big Data AnalyticsVijay Rao
4.7K visualizações21 slides
Big data analytics por
Big data analyticsBig data analytics
Big data analyticsRavi Teja
531 visualizações19 slides

Mais conteúdo relacionado

Mais procurados

Data Mining and Business Intelligence Tools por
Data Mining and Business Intelligence ToolsData Mining and Business Intelligence Tools
Data Mining and Business Intelligence ToolsMotaz Saad
17.3K visualizações55 slides
Accelerating Insight - Smart Data Lake Customer Success Stories por
Accelerating Insight - Smart Data Lake Customer Success StoriesAccelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success StoriesCambridge Semantics
1.9K visualizações24 slides
Introduction to Data Science (Data Summit, 2017) por
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Caserta
3.2K visualizações55 slides
Big data course | big data training | big data classes por
Big data course | big data training | big data classesBig data course | big data training | big data classes
Big data course | big data training | big data classesNaviWalker
131 visualizações5 slides
Evaluating Big Data Predictive Analytics Platforms por
Evaluating Big Data Predictive Analytics PlatformsEvaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics PlatformsTeradata Aster
8.3K visualizações42 slides
Big data and Predictive Analytics By : Professor Lili Saghafi por
Big data and Predictive Analytics By : Professor Lili SaghafiBig data and Predictive Analytics By : Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili SaghafiProfessor Lili Saghafi
33.1K visualizações65 slides

Mais procurados(20)

Data Mining and Business Intelligence Tools por Motaz Saad
Data Mining and Business Intelligence ToolsData Mining and Business Intelligence Tools
Data Mining and Business Intelligence Tools
Motaz Saad17.3K visualizações
Accelerating Insight - Smart Data Lake Customer Success Stories por Cambridge Semantics
Accelerating Insight - Smart Data Lake Customer Success StoriesAccelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success Stories
Cambridge Semantics1.9K visualizações
Introduction to Data Science (Data Summit, 2017) por Caserta
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)
Caserta 3.2K visualizações
Big data course | big data training | big data classes por NaviWalker
Big data course | big data training | big data classesBig data course | big data training | big data classes
Big data course | big data training | big data classes
NaviWalker131 visualizações
Evaluating Big Data Predictive Analytics Platforms por Teradata Aster
Evaluating Big Data Predictive Analytics PlatformsEvaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics Platforms
Teradata Aster8.3K visualizações
Big data and Predictive Analytics By : Professor Lili Saghafi por Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili SaghafiBig data and Predictive Analytics By : Professor Lili Saghafi
Big data and Predictive Analytics By : Professor Lili Saghafi
Professor Lili Saghafi33.1K visualizações
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed... por Edureka!
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
Edureka!1.6K visualizações
Brochure_Big-Data_Offerings por Anisha Lamba
Brochure_Big-Data_OfferingsBrochure_Big-Data_Offerings
Brochure_Big-Data_Offerings
Anisha Lamba365 visualizações
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy por Cambridge Semantics
From Data Lakes to the Data Fabric: Our Vision for Digital StrategyFrom Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
Cambridge Semantics721 visualizações
Unit i big data introduction por SujaMaryD
Unit  i big data introductionUnit  i big data introduction
Unit i big data introduction
SujaMaryD100 visualizações
Datascienceindia article por HimanshuPise1
Datascienceindia articleDatascienceindia article
Datascienceindia article
HimanshuPise195 visualizações
Future of Data - Big Data por shankar_radhakrishnan
Future of Data - Big DataFuture of Data - Big Data
Future of Data - Big Data
shankar_radhakrishnan4.1K visualizações
Data analytics & its Trends por Dr.K.Sreenivas Rao
Data analytics & its TrendsData analytics & its Trends
Data analytics & its Trends
Dr.K.Sreenivas Rao 230 visualizações
Ehr challenges [bigdata] por Nesma Almoazamy
Ehr challenges [bigdata]Ehr challenges [bigdata]
Ehr challenges [bigdata]
Nesma Almoazamy547 visualizações
Big data ppt por Nasrin Hussain
Big  data pptBig  data ppt
Big data ppt
Nasrin Hussain560.7K visualizações
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag... por Experfy
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
March Towards Big Data - Big Data Implementation, Migration, Ingestion, Manag...
Experfy131 visualizações
The Year of the Graph por Cambridge Semantics
The Year of the GraphThe Year of the Graph
The Year of the Graph
Cambridge Semantics6.5K visualizações

Similar a Data Skills for Digital Era-مهارت های داده ای

Data Skills for Digital Era por
Data Skills for Digital EraData Skills for Digital Era
Data Skills for Digital EraMohamadreza Mohtat
319 visualizações43 slides
Python para Manual de Ciência de Dados por
Python para Manual de Ciência de DadosPython para Manual de Ciência de Dados
Python para Manual de Ciência de DadosRafael Oliveira Bitcoin
3 visualizações7 slides
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc... por
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...Denodo
73 visualizações29 slides
Ch1IntroductiontoDataScience.pptx por
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptxAbderrahmanABID2
14 visualizações27 slides
Advanced Analytics and Machine Learning with Data Virtualization por
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
144 visualizações32 slides
Data science Nagarajan and madhav.pptx por
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxNagarajanG35
6 visualizações20 slides

Similar a Data Skills for Digital Era-مهارت های داده ای(20)

Data Skills for Digital Era por Mohamadreza Mohtat
Data Skills for Digital EraData Skills for Digital Era
Data Skills for Digital Era
Mohamadreza Mohtat319 visualizações
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc... por Denodo
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
Denodo 73 visualizações
Ch1IntroductiontoDataScience.pptx por AbderrahmanABID2
Ch1IntroductiontoDataScience.pptxCh1IntroductiontoDataScience.pptx
Ch1IntroductiontoDataScience.pptx
AbderrahmanABID214 visualizações
Advanced Analytics and Machine Learning with Data Virtualization por Denodo
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
Denodo 144 visualizações
Data science Nagarajan and madhav.pptx por NagarajanG35
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
NagarajanG356 visualizações
Advanced Analytics and Artificial Intelligence - Transforming Your Business T... por David J Rosenthal
Advanced Analytics and Artificial Intelligence - Transforming Your Business T...Advanced Analytics and Artificial Intelligence - Transforming Your Business T...
Advanced Analytics and Artificial Intelligence - Transforming Your Business T...
David J Rosenthal969 visualizações
Bhadale group of companies our technology ecosystem por Vijayananda Mohire
Bhadale group of companies our technology ecosystemBhadale group of companies our technology ecosystem
Bhadale group of companies our technology ecosystem
Vijayananda Mohire55 visualizações
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data... por Simplilearn
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Data Scientist Salary, Skills, Jobs And Resume | Data Scientist Career | Data...
Simplilearn1.4K visualizações
BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics Dell Statisti... por Big Data Week
BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics  Dell Statisti...BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics  Dell Statisti...
BDW Chicago 2016 - John K. Thompson, GM for Advanced Analytics Dell Statisti...
Big Data Week91 visualizações
Data Analytics in your IoT Solution Fukiat Julnual, Technical Evangelist, Mic... por BAINIDA
Data Analytics in your IoT SolutionFukiat Julnual, Technical Evangelist, Mic...Data Analytics in your IoT SolutionFukiat Julnual, Technical Evangelist, Mic...
Data Analytics in your IoT Solution Fukiat Julnual, Technical Evangelist, Mic...
BAINIDA979 visualizações
Big Data Analytics por Sreedhar Chowdam
Big Data AnalyticsBig Data Analytics
Big Data Analytics
Sreedhar Chowdam2.8K visualizações
Data science presentation por MSDEVMTL
Data science presentationData science presentation
Data science presentation
MSDEVMTL38.2K visualizações
Just ask Watson Seminar por Certus Solutions
Just ask Watson SeminarJust ask Watson Seminar
Just ask Watson Seminar
Certus Solutions423 visualizações
Ανδρέας Τσαγκάρης, 5th Digital Banking Forum por Starttech Ventures
Ανδρέας Τσαγκάρης, 5th Digital Banking ForumΑνδρέας Τσαγκάρης, 5th Digital Banking Forum
Ανδρέας Τσαγκάρης, 5th Digital Banking Forum
Starttech Ventures469 visualizações
2019 DSA 105 Introduction to Data Science Week 4 por Ferdin Joe John Joseph PhD
2019 DSA 105 Introduction to Data Science Week 42019 DSA 105 Introduction to Data Science Week 4
2019 DSA 105 Introduction to Data Science Week 4
Ferdin Joe John Joseph PhD229 visualizações
Artificial Intelligence As a Service por John Liu
Artificial Intelligence As a ServiceArtificial Intelligence As a Service
Artificial Intelligence As a Service
John Liu327 visualizações
The Maturity Model: Taking the Growing Pains Out of Hadoop por Inside Analysis
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
Inside Analysis1K visualizações
[DSC Europe 22] Overview of the Databricks Platform - Petar Zecevic por DataScienceConferenc1
[DSC Europe 22] Overview of the Databricks Platform - Petar Zecevic[DSC Europe 22] Overview of the Databricks Platform - Petar Zecevic
[DSC Europe 22] Overview of the Databricks Platform - Petar Zecevic
DataScienceConferenc174 visualizações
DATA SCIENCE 2024 KNOW TECHNOLOGIES GUIDING YOUR FUTURE.pdf por USDSI
DATA SCIENCE 2024 KNOW TECHNOLOGIES GUIDING YOUR FUTURE.pdfDATA SCIENCE 2024 KNOW TECHNOLOGIES GUIDING YOUR FUTURE.pdf
DATA SCIENCE 2024 KNOW TECHNOLOGIES GUIDING YOUR FUTURE.pdf
USDSI7 visualizações

Mais de Hosseinieh Ershad Public Library

تجربه مشتریان داده محور por
تجربه مشتریان داده محورتجربه مشتریان داده محور
تجربه مشتریان داده محورHosseinieh Ershad Public Library
594 visualizações43 slides
محصول داده محور por
محصول داده محورمحصول داده محور
محصول داده محورHosseinieh Ershad Public Library
210 visualizações10 slides
محصول داده محور por
محصول داده محورمحصول داده محور
محصول داده محورHosseinieh Ershad Public Library
206 visualizações18 slides
مباشرت داده: نقشی نوین فراتر از تخصص por
مباشرت داده: نقشی نوین فراتر از تخصصمباشرت داده: نقشی نوین فراتر از تخصص
مباشرت داده: نقشی نوین فراتر از تخصصHosseinieh Ershad Public Library
151 visualizações11 slides
از مباشرتِ داده‌ها تا حکمرانیِ داده‌ها por
از مباشرتِ داده‌ها تا حکمرانیِ داده‌هااز مباشرتِ داده‌ها تا حکمرانیِ داده‌ها
از مباشرتِ داده‌ها تا حکمرانیِ داده‌هاHosseinieh Ershad Public Library
416 visualizações59 slides
فرهنگِ داده‌محور در سازمان por
 فرهنگِ داده‌محور در سازمان فرهنگِ داده‌محور در سازمان
فرهنگِ داده‌محور در سازمانHosseinieh Ershad Public Library
179 visualizações20 slides

Mais de Hosseinieh Ershad Public Library(20)

از مباشرتِ داده‌ها تا حکمرانیِ داده‌ها por Hosseinieh Ershad Public Library
از مباشرتِ داده‌ها تا حکمرانیِ داده‌هااز مباشرتِ داده‌ها تا حکمرانیِ داده‌ها
از مباشرتِ داده‌ها تا حکمرانیِ داده‌ها
Hosseinieh Ershad Public Library416 visualizações
Business Data Alignment-همراستاییِ داده‌ها با اهداف سازمانی por Hosseinieh Ershad Public Library
Business Data Alignment-همراستاییِ داده‌ها با اهداف سازمانیBusiness Data Alignment-همراستاییِ داده‌ها با اهداف سازمانی
Business Data Alignment-همراستاییِ داده‌ها با اهداف سازمانی
Hosseinieh Ershad Public Library252 visualizações
Data driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محور por Hosseinieh Ershad Public Library
Data driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محورData driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محور
Data driven m arketing and design-بازاریابی داده محور و تأثیر طراحی داده محور
چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری por Hosseinieh Ershad Public Library
چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری
چارچوب سیاستی داده حکومتی باز در حوزه علم و فناوری
Hosseinieh Ershad Public Library186 visualizações
زنجیره تامین داده محور و انقلاب صنعتی چهارم por Hosseinieh Ershad Public Library
زنجیره تامین داده محور و انقلاب صنعتی چهارمزنجیره تامین داده محور و انقلاب صنعتی چهارم
زنجیره تامین داده محور و انقلاب صنعتی چهارم
Hosseinieh Ershad Public Library314 visualizações

Último

Advanced_Recommendation_Systems_Presentation.pptx por
Advanced_Recommendation_Systems_Presentation.pptxAdvanced_Recommendation_Systems_Presentation.pptx
Advanced_Recommendation_Systems_Presentation.pptxneeharikasingh29
5 visualizações9 slides
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx por
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptxDataScienceConferenc1
5 visualizações12 slides
3196 The Case of The East River por
3196 The Case of The East River3196 The Case of The East River
3196 The Case of The East RiverErickANDRADE90
12 visualizações4 slides
MOSORE_BRESCIA por
MOSORE_BRESCIAMOSORE_BRESCIA
MOSORE_BRESCIAFederico Karagulian
5 visualizações8 slides
SAP-TCodes.pdf por
SAP-TCodes.pdfSAP-TCodes.pdf
SAP-TCodes.pdfmustafaghulam8181
9 visualizações285 slides
Cross-network in Google Analytics 4.pdf por
Cross-network in Google Analytics 4.pdfCross-network in Google Analytics 4.pdf
Cross-network in Google Analytics 4.pdfGA4 Tutorials
6 visualizações7 slides

Último(20)

Advanced_Recommendation_Systems_Presentation.pptx por neeharikasingh29
Advanced_Recommendation_Systems_Presentation.pptxAdvanced_Recommendation_Systems_Presentation.pptx
Advanced_Recommendation_Systems_Presentation.pptx
neeharikasingh295 visualizações
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx por DataScienceConferenc1
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
DataScienceConferenc15 visualizações
3196 The Case of The East River por ErickANDRADE90
3196 The Case of The East River3196 The Case of The East River
3196 The Case of The East River
ErickANDRADE9012 visualizações
Cross-network in Google Analytics 4.pdf por GA4 Tutorials
Cross-network in Google Analytics 4.pdfCross-network in Google Analytics 4.pdf
Cross-network in Google Analytics 4.pdf
GA4 Tutorials6 visualizações
TGP 2.docx por sandi636490
TGP 2.docxTGP 2.docx
TGP 2.docx
sandi63649010 visualizações
Organic Shopping in Google Analytics 4.pdf por GA4 Tutorials
Organic Shopping in Google Analytics 4.pdfOrganic Shopping in Google Analytics 4.pdf
Organic Shopping in Google Analytics 4.pdf
GA4 Tutorials12 visualizações
Ukraine Infographic_22NOV2023_v2.pdf por AnastosiyaGurin
Ukraine Infographic_22NOV2023_v2.pdfUkraine Infographic_22NOV2023_v2.pdf
Ukraine Infographic_22NOV2023_v2.pdf
AnastosiyaGurin1.4K visualizações
PROGRAMME.pdf por HiNedHaJar
PROGRAMME.pdfPROGRAMME.pdf
PROGRAMME.pdf
HiNedHaJar19 visualizações
Chapter 3b- Process Communication (1) (1)(1) (1).pptx por ayeshabaig2004
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptx
ayeshabaig20045 visualizações
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation por DataScienceConferenc1
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
DataScienceConferenc111 visualizações
How Leaders See Data? (Level 1) por Narendra Narendra
How Leaders See Data? (Level 1)How Leaders See Data? (Level 1)
How Leaders See Data? (Level 1)
Narendra Narendra13 visualizações
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M... por DataScienceConferenc1
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
DataScienceConferenc15 visualizações
CRIJ4385_Death Penalty_F23.pptx por yvettemm100
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptx
yvettemm1006 visualizações
Short Story Assignment by Kelly Nguyen por kellynguyen01
Short Story Assignment by Kelly NguyenShort Story Assignment by Kelly Nguyen
Short Story Assignment by Kelly Nguyen
kellynguyen0119 visualizações
Binder1.pdf por EstherSita2
Binder1.pdfBinder1.pdf
Binder1.pdf
EstherSita210 visualizações
UNEP FI CRS Climate Risk Results.pptx por pekka28
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptx
pekka2811 visualizações
RIO GRANDE SUPPLY COMPANY INC, JAYSON.docx por JaysonGarabilesEspej
RIO GRANDE SUPPLY COMPANY INC, JAYSON.docxRIO GRANDE SUPPLY COMPANY INC, JAYSON.docx
RIO GRANDE SUPPLY COMPANY INC, JAYSON.docx
JaysonGarabilesEspej6 visualizações

Data Skills for Digital Era-مهارت های داده ای

  • 1. Data Skills for Digital Era The Top Data Skills You Need To Get Hired
  • 2. Main Focus Data Science Business Intelligence Big Data Data Engineering Mohtat@ut.ac.ir 2
  • 4. Data Science Math & Statistics Computer Science Subject Matter Expertise Mohtat@ut.ac.ir 4 Data Science is an interdisciplinary field about processes and systems to extract knowledge or insights from data, which is a continuation of some of the data analysis fields such as statistics, data mining, and predictive analytics, similar to Knowledge Discovery in Databases (KDD).
  • 7. Critical Skills for Data Scientists Python R SQL Data Mining Tools Knime , ReapidMiner, IBM SPSS Modeler Excel BI Tools Tableau, Power BI, Qlik Mohtat@ut.ac.ir 9
  • 8. Top Python Libraries in Data Science TensorFlow “TensorFlow is an open source software library for numerical computation using data flow graphs. PyTorch “PyTorch is a Python package that provides Deep neural networks built on a tape-based autograd system Numpy “NumPy is the fundamental package needed for scientific computing with Python. Scikit-Learn “scikit-learn is a Python module for machine learning built on NumPy, SciPy and matplotlib. Keras “Keras is a high-level neural networks API, written in Python and capable of running on top of TensorFlow, CNTK, or Theano. Scipy “SciPy is open-source software for mathematics, science, and engineering. Pandas “pandas is a Python package providing fast, flexible, and expressive data structures designed to make working with "relational" or "labeled" data both easy and intuitive Matplotlib “Matplotlib is a Python 2D plotting library which produces publication- quality figures in a variety of hardcopy formats and interactive environments across platforms. Scrapy “Scrapy is a fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. Mohtat@ut.ac.ir 10
  • 9. Top Skills every Data Scientist needs to Master TensorFlow Keras Hadoop Spark Hive Java Matlab Mohtat@ut.ac.ir 11
  • 10. Most Essential Skills for Data Scientists Complex Problem Solving Team Working Emotional Intelligence Creativity Critical Thinking Negotiation Mohtat@ut.ac.ir 12
  • 11. Applied Data Science with Python Michigan University(Coursera) Basic Data Visualization Machine Learning Text Mining SNA Applied Text Mining in Python Introduction to Data Science in Python Applied Plotting, Charting & Data Representation in Python Applied Machine Learning in Python Applied Social Network Analysis in Python Mohtat@ut.ac.ir 13LOGO HERE
  • 14. Business Intelligence encompasses a wide variety of tools, applications and methodologies that enable organizations to collect data from internal systems and external sources; prepare it for analysis; develop and run queries against that data; and create reports, dashboards and data visualizations to make the analytical results available to corporate decision-makers, as well as operational workers. BI Mohtat@ut.ac.ir 17 Business Skills Link to Business Strategy Define Priorities Define BI Vision Lead Organization / BPR Analytics Skills Data Mining Social BI IT Skills Infrastructure Build Technology Data Integration & Quality
  • 16. Top Business Intelligence Skills SQL Data Warehousing Data Analysis Tableau ETL 23% 85% 28% 41% 65% Mohtat@ut.ac.ir 20 28%
  • 17. Top Business Intelligence Skills Business Analyst Oracle SQL Server BI Business Process Data Modeling 17% 85% 19% 21% 22% Mohtat@ut.ac.ir 21 19%
  • 18. Top Business Intelligence Tools Tableau Power BI Qlik Your Choice Is Clear Mohtat@ut.ac.ir 22
  • 23. Big Data Volume Terabyte Distribute Big Table Velocity Real-time Stream Processing Variety Structured Unstructured Text, Image, Video Mohtat@ut.ac.ir 27 Big data is a term used to refer to data sets that are too large or complex for traditional data-processing application software to adequately deal with. It’s what organizations do with the data that matters. Big data can be analyzed for insights that lead to better decisions and strategic business moves.
  • 25. 3 Types of Big Data Jobs 1 2 3 Big Data Developer Big Data Administration Big Data Analytics Mohtat@ut.ac.ir 29
  • 26. Top Big Data Programming Languages Not only Hadoop, many other big data analysis tools like Storm, Spark, and Kafka are written in Java and run on the JVM Java Python is a simple, open-source, general-purpose language. Hence, it is easy to learn Python for anyone.. With its rich set of utilities and libraries and easy-to-use features, it works wonder for big data processing and analysis. Python Scala is a rival of Java and Python in the world of Data Science and becoming more and more popular due to extensive use of Apache Spark in Big data Hadoop industry. Scala Mohtat@ut.ac.ir 30
  • 27. Pathway to Success Success Apache Hadoop Apache Spark Start NoSQL Database Data Analytics Data Visualization Mohtat@ut.ac.ir 31
  • 28. Big Data Companies & Vendors Cloudera, Inc. is a US-based software company that provides a software platform for data engineering, data warehousing, machine learning and analytics that runs in the cloud or on premises Cloudera MapR is a business software company headquartered in Santa Clara, California. MapR provides access to a variety of data sources from a single computer cluster, including big data workloads MapR Hortonworks is a data software company based in Santa Clara, California that develops, supports, and provides expertise on a set of open-source software designed to manage data and processing for things such as IOT, single view of X, and advanced analytics and machine learning Hortonworks
  • 31. Big Data Specialization Michigan University(Coursera) Introduction to Big Data Big Data Modeling and Management Systems Big Data Integration and Processing Machine Learning With Big Data Graph Analytics for Big Data Mohtat@ut.ac.ir 36LOGO HERE
  • 35. Data Scientist VS Data Engineer Mohtat@ut.ac.ir 40 Dolor sit ametis Data Engineering Data Scientist Data Pipelines Visualization & Storytelling Programming Modeling & Advance Analytics Math & Statistics System Implementation
  • 36. How To Become A Data Engineer Linux NoSQL & SQL Python / Java / Scala Agile Development Data Ingestion Processing Frameworks Mohtat@ut.ac.ir 42
  • 37. Best Data Processing Frameworks MapReduce is a programming model and an associated implementation for processing and generating big data sets with a parallel, distributed algorithm on a cluster Apache Spark is an open- source distributed general-purpose cluster- computing framework. Apache Storm is a free and open source distributed realtime computation system. The core of Apache Flink is a distributed streaming dataflow engine written in Java and Scala 43
  • 39. Data Ingestion Tools Apache Kafka SSIS & ODI Apache NiFi Logstash Mohtat@ut.ac.ir 45