SlideShare uma empresa Scribd logo
1 de 25
Baixar para ler offline
Secrets of
Managing
Data
Science
Projects
Data Science For
Project Managers
P M I - W V P R O F E S S I O N A L D E V E LO P M E N T C O N F E R E N C E
O C T O B E R 20 1 5
T Z E - Y I U YO N G
Topic Coverage
Exploration of data science from various contexts
Data products: Output of data science
Implications for project managers
HELLO!
My name is Tze-Yiu Yong.
Technology Manager
https://www.linkedin.com/in/tzeyiuyong
More About Tze
Electrical engineer by training (semiconductor / statistics)
MBA from Duke University
Passionate about bridging the gap between technologists
and non-technologists.
Currently studying for PMP – advice appreciated!
Data Generated Every Minute
How Big Is The Digital Universe?
IBM conservatively
reports that we create
2.5 Exabytes of data per
day!
(1EB = 1MTBs)
(1ZB = 1K EB or 1BTBs)
Michael Lesk, the Bell Labs computer scientist known for the Lesk Algorithm, wrote in 1997 (!), “There may be a
few thousand petabytes (1KTBs) of information all told … the typical piece of information will never be looked at
by a human being.”
Believe WhatYou See On Television?
Dictionary Terms – Sum Of Parts
DATA
• a set of values of qualitative or quantitative variables.
• pieces of data are individual pieces of information.
SCIENCE
• organizes knowledge in the form of testable explanations and predictions.
• a way of pursuing knowledge, not only the knowledge itself.
www.thefreedictionary.com
Google Search Interest
Data Science evolved from BusinessAnalytics which evolved from Data Mining.
Job Growth
Job Skills
In 2009, Drew Conway published thisVenn Diagram of Data Science
skills.
Job Skills
Which was then expanded on by Natalia Bilenko.
Job Skills
Now crystallized by reality.
Joel Grus Profile Pic
The Government Knows!
The Wikipedia
Data Product
“Any tool created with the
help of data to make a more
informed decision.”
Examples
Dashboards
Spreadsheets
Emails
‘People AlsoViewed’
‘More ItemsTo Consider’
Many others
Data Science Workflow
Hadoop / Cassandra (FB) / HBase
DB Schema
DBMS
HW: Cluster / node
Big Data
Descriptive Predictive
Analytics Analytics
(Now)
Real-time
Analytics
TIME
Unstructured data
Data munging
Machine learning
Data mining
Tableau
Datawrapper
Time Check / Questions? [6 more slides]
(4 MEATY ONES)
Implications For Project Managers
“Develop a reporting portal for internal stakeholders and customers that
incorporates and aggregates machine data from a 30 unit installed base.”
“Implement a smart meter program for a major utility that enables increased
accuracy of customer billings, improved outage management and reduced costs of
usage collection information.”
“Analyze existing credit related customer data to develop a predictive credit score
and risk of default model. Implement a user interface that facilitates broad usability
across a wide range of internal business analysts.”
Implications For Project Managers.
“Develop a reporting portal for internal stakeholders and customers that
incorporates and aggregates machine data from a 30 unit installed base.”
“Implement a smart meter program for a major utility that enables increased
accuracy of customer billings, improved outage management and reduced costs of
usage collection information.”
“Analyze existing credit related customer data to develop a predictive credit score
and risk of default model. Implement a user interface that facilitates broad usability
across a wide range of internal business analysts.”
Implications For Project Managers
You already have a good foundation.
Implications For Project Managers
Know your stakeholders – they are numerous. [Initiating]
Don’t let tools drive your policy or process. [Initiating / Planning]
Think about boundaries – external vs internal, functional. [Executing]
Manage up. [Monitor and Control]
Unique Considerations
Seek agreement on the quality of the data this is required for the data product.
Understand and then communicate where the data comes from and where its going.
Internalize the privacy concerns, regulations and practices for the data in your
project scope.
Thanks for your interest and attention!
I AM HOPING FOR DISCUSSION …
CONNECT WITH ME ON LINKEDIN
HTTPS://WWW.LINKEDIN.COM/IN/TZEYIUYONG

Mais conteúdo relacionado

Mais procurados

1645 track 1 bress_using his laptop
1645 track 1 bress_using his laptop1645 track 1 bress_using his laptop
1645 track 1 bress_using his laptopRising Media, Inc.
 
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Science Salon: Introduction to Machine Learning - Marketing Use CaseData Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Science Salon: Introduction to Machine Learning - Marketing Use CaseFormulatedby
 
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)Swiss Big Data User Group
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Domino Data Lab
 
Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...Edureka!
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino Data Lab
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsSri Ambati
 
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...Edureka!
 
Planning Your Data Science Projects
Planning Your Data Science ProjectsPlanning Your Data Science Projects
Planning Your Data Science ProjectsSpotle.ai
 
Operationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the EnterpriseOperationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the Enterprisemark madsen
 
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Thoughtworks
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Caserta
 
A Hybrid Approach to Data Science Project Management
A Hybrid Approach to Data Science Project ManagementA Hybrid Approach to Data Science Project Management
A Hybrid Approach to Data Science Project ManagementElaine K. Lee
 
H2O World - What you need before doing predictive analysis - Keen.io
H2O World - What you need before doing predictive analysis - Keen.ioH2O World - What you need before doing predictive analysis - Keen.io
H2O World - What you need before doing predictive analysis - Keen.ioSri Ambati
 
BAS 150 Lesson 1 Lecture
BAS 150 Lesson 1 LectureBAS 150 Lesson 1 Lecture
BAS 150 Lesson 1 LectureWake Tech BAS
 
Barga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteBarga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteRoger Barga
 
H2O World - Machine Learning for non-data scientists
H2O World - Machine Learning for non-data scientistsH2O World - Machine Learning for non-data scientists
H2O World - Machine Learning for non-data scientistsSri Ambati
 
Leveraged Analytics at Scale
Leveraged Analytics at ScaleLeveraged Analytics at Scale
Leveraged Analytics at ScaleDomino Data Lab
 

Mais procurados (20)

1645 track 1 bress_using his laptop
1645 track 1 bress_using his laptop1645 track 1 bress_using his laptop
1645 track 1 bress_using his laptop
 
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Science Salon: Introduction to Machine Learning - Marketing Use CaseData Science Salon: Introduction to Machine Learning - Marketing Use Case
Data Science Salon: Introduction to Machine Learning - Marketing Use Case
 
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)Big Data Use-Cases across industries (Georg Polzer, Teralytics)
Big Data Use-Cases across industries (Georg Polzer, Teralytics)
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field
 
Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...Data Science Applications | Data Science For Beginners | Data Science Trainin...
Data Science Applications | Data Science For Beginners | Data Science Trainin...
 
Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...Domino and AWS: collaborative analytics and model governance at financial ser...
Domino and AWS: collaborative analytics and model governance at financial ser...
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
Vikrant data scientist
Vikrant data scientistVikrant data scientist
Vikrant data scientist
 
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
Data Scientist Roles and Responsibilities | Data Scientist Career | Data Scie...
 
Planning Your Data Science Projects
Planning Your Data Science ProjectsPlanning Your Data Science Projects
Planning Your Data Science Projects
 
Operationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the EnterpriseOperationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the Enterprise
 
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
Big Data Agile Analytics by Ken Collier - Director Agile Analytics, Thoughtwo...
 
Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)Introduction to Data Science (Data Summit, 2017)
Introduction to Data Science (Data Summit, 2017)
 
Agile Data Science
Agile Data ScienceAgile Data Science
Agile Data Science
 
A Hybrid Approach to Data Science Project Management
A Hybrid Approach to Data Science Project ManagementA Hybrid Approach to Data Science Project Management
A Hybrid Approach to Data Science Project Management
 
H2O World - What you need before doing predictive analysis - Keen.io
H2O World - What you need before doing predictive analysis - Keen.ioH2O World - What you need before doing predictive analysis - Keen.io
H2O World - What you need before doing predictive analysis - Keen.io
 
BAS 150 Lesson 1 Lecture
BAS 150 Lesson 1 LectureBAS 150 Lesson 1 Lecture
BAS 150 Lesson 1 Lecture
 
Barga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteBarga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 Keynote
 
H2O World - Machine Learning for non-data scientists
H2O World - Machine Learning for non-data scientistsH2O World - Machine Learning for non-data scientists
H2O World - Machine Learning for non-data scientists
 
Leveraged Analytics at Scale
Leveraged Analytics at ScaleLeveraged Analytics at Scale
Leveraged Analytics at Scale
 

Destaque

Metis data science_project_kiva_20150407
Metis data science_project_kiva_20150407Metis data science_project_kiva_20150407
Metis data science_project_kiva_20150407Frederik Durant
 
CRISP-DM: Data Mining e Modelos Preditivos
CRISP-DM: Data Mining e Modelos PreditivosCRISP-DM: Data Mining e Modelos Preditivos
CRISP-DM: Data Mining e Modelos PreditivosLeandro Guerra
 
Tutorial: Writing Sencha Touch Mobile Apps using ]project-open[
Tutorial: Writing Sencha Touch Mobile Apps using ]project-open[Tutorial: Writing Sencha Touch Mobile Apps using ]project-open[
Tutorial: Writing Sencha Touch Mobile Apps using ]project-open[Klaus Hofeditz
 
]project-open[ CVS+ACL Permission Configuration
]project-open[ CVS+ACL Permission Configuration]project-open[ CVS+ACL Permission Configuration
]project-open[ CVS+ACL Permission ConfigurationKlaus Hofeditz
 
Leading an open source project oscon2016
Leading an open source project oscon2016Leading an open source project oscon2016
Leading an open source project oscon2016Tessa Mero
 
The Top 10 Free and Open Source Project Management Software For Your Small Bu...
The Top 10 Free and Open Source Project Management Software For Your Small Bu...The Top 10 Free and Open Source Project Management Software For Your Small Bu...
The Top 10 Free and Open Source Project Management Software For Your Small Bu...Capterra
 
How to cover the whole Translation Project Workflow with one open-source syst...
How to cover the whole Translation Project Workflow with one open-source syst...How to cover the whole Translation Project Workflow with one open-source syst...
How to cover the whole Translation Project Workflow with one open-source syst...Qabiria
 
Open Source Project Management Part 2
Open Source Project Management Part 2Open Source Project Management Part 2
Open Source Project Management Part 2Semen Arslan
 
Eclipse Mylyn Integration with ]project-open[
Eclipse Mylyn Integration with ]project-open[Eclipse Mylyn Integration with ]project-open[
Eclipse Mylyn Integration with ]project-open[Klaus Hofeditz
 
BFBM(7-2016) Productivity : Smarter Faster Better ေဟာေျပာပြဲ (မံုရြာ)
 BFBM(7-2016) Productivity : Smarter Faster Better ေဟာေျပာပြဲ (မံုရြာ) BFBM(7-2016) Productivity : Smarter Faster Better ေဟာေျပာပြဲ (မံုရြာ)
BFBM(7-2016) Productivity : Smarter Faster Better ေဟာေျပာပြဲ (မံုရြာ)Hub Myanmar Company Limited
 
]project-open[ Budget Planning and Tracking
]project-open[ Budget Planning and Tracking]project-open[ Budget Planning and Tracking
]project-open[ Budget Planning and TrackingKlaus Hofeditz
 
]project-open[ Timesheet Project Invoicing
]project-open[ Timesheet Project Invoicing]project-open[ Timesheet Project Invoicing
]project-open[ Timesheet Project InvoicingKlaus Hofeditz
 
]project-open[ on Amazon AWS
]project-open[ on Amazon AWS]project-open[ on Amazon AWS
]project-open[ on Amazon AWSKlaus Hofeditz
 
]project-open[ Workflow Developer Tutorial Part 4
]project-open[ Workflow Developer Tutorial Part 4]project-open[ Workflow Developer Tutorial Part 4
]project-open[ Workflow Developer Tutorial Part 4Klaus Hofeditz
 
Empxtrack The Complete HR Package
Empxtrack The Complete HR PackageEmpxtrack The Complete HR Package
Empxtrack The Complete HR PackageEmpxtrack Inc.
 
Data Science Project Lifecycle
Data Science Project LifecycleData Science Project Lifecycle
Data Science Project LifecycleJason Geng
 
]project-open[ Roll Out Plan
]project-open[ Roll Out Plan]project-open[ Roll Out Plan
]project-open[ Roll Out PlanKlaus Hofeditz
 
CRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining ProjectsCRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining ProjectsMichał Łopuszyński
 
]project-open[ OSS Project Mangement
]project-open[ OSS Project Mangement]project-open[ OSS Project Mangement
]project-open[ OSS Project MangementKlaus Hofeditz
 

Destaque (20)

Metis data science_project_kiva_20150407
Metis data science_project_kiva_20150407Metis data science_project_kiva_20150407
Metis data science_project_kiva_20150407
 
CRISP-DM: Data Mining e Modelos Preditivos
CRISP-DM: Data Mining e Modelos PreditivosCRISP-DM: Data Mining e Modelos Preditivos
CRISP-DM: Data Mining e Modelos Preditivos
 
Tutorial: Writing Sencha Touch Mobile Apps using ]project-open[
Tutorial: Writing Sencha Touch Mobile Apps using ]project-open[Tutorial: Writing Sencha Touch Mobile Apps using ]project-open[
Tutorial: Writing Sencha Touch Mobile Apps using ]project-open[
 
]project-open[ CVS+ACL Permission Configuration
]project-open[ CVS+ACL Permission Configuration]project-open[ CVS+ACL Permission Configuration
]project-open[ CVS+ACL Permission Configuration
 
Leading an open source project oscon2016
Leading an open source project oscon2016Leading an open source project oscon2016
Leading an open source project oscon2016
 
BFBM(12-2016) Business to business marketing
BFBM(12-2016) Business to business marketingBFBM(12-2016) Business to business marketing
BFBM(12-2016) Business to business marketing
 
The Top 10 Free and Open Source Project Management Software For Your Small Bu...
The Top 10 Free and Open Source Project Management Software For Your Small Bu...The Top 10 Free and Open Source Project Management Software For Your Small Bu...
The Top 10 Free and Open Source Project Management Software For Your Small Bu...
 
How to cover the whole Translation Project Workflow with one open-source syst...
How to cover the whole Translation Project Workflow with one open-source syst...How to cover the whole Translation Project Workflow with one open-source syst...
How to cover the whole Translation Project Workflow with one open-source syst...
 
Open Source Project Management Part 2
Open Source Project Management Part 2Open Source Project Management Part 2
Open Source Project Management Part 2
 
Eclipse Mylyn Integration with ]project-open[
Eclipse Mylyn Integration with ]project-open[Eclipse Mylyn Integration with ]project-open[
Eclipse Mylyn Integration with ]project-open[
 
BFBM(7-2016) Productivity : Smarter Faster Better ေဟာေျပာပြဲ (မံုရြာ)
 BFBM(7-2016) Productivity : Smarter Faster Better ေဟာေျပာပြဲ (မံုရြာ) BFBM(7-2016) Productivity : Smarter Faster Better ေဟာေျပာပြဲ (မံုရြာ)
BFBM(7-2016) Productivity : Smarter Faster Better ေဟာေျပာပြဲ (မံုရြာ)
 
]project-open[ Budget Planning and Tracking
]project-open[ Budget Planning and Tracking]project-open[ Budget Planning and Tracking
]project-open[ Budget Planning and Tracking
 
]project-open[ Timesheet Project Invoicing
]project-open[ Timesheet Project Invoicing]project-open[ Timesheet Project Invoicing
]project-open[ Timesheet Project Invoicing
 
]project-open[ on Amazon AWS
]project-open[ on Amazon AWS]project-open[ on Amazon AWS
]project-open[ on Amazon AWS
 
]project-open[ Workflow Developer Tutorial Part 4
]project-open[ Workflow Developer Tutorial Part 4]project-open[ Workflow Developer Tutorial Part 4
]project-open[ Workflow Developer Tutorial Part 4
 
Empxtrack The Complete HR Package
Empxtrack The Complete HR PackageEmpxtrack The Complete HR Package
Empxtrack The Complete HR Package
 
Data Science Project Lifecycle
Data Science Project LifecycleData Science Project Lifecycle
Data Science Project Lifecycle
 
]project-open[ Roll Out Plan
]project-open[ Roll Out Plan]project-open[ Roll Out Plan
]project-open[ Roll Out Plan
 
CRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining ProjectsCRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining Projects
 
]project-open[ OSS Project Mangement
]project-open[ OSS Project Mangement]project-open[ OSS Project Mangement
]project-open[ OSS Project Mangement
 

Semelhante a 20151016 Data Science For Project Managers

Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementTony Bain
 
Department of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data DashboardsDepartment of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data DashboardsBrand Niemann
 
Data Science towards the Digital Enterprise
Data Science towards the Digital EnterpriseData Science towards the Digital Enterprise
Data Science towards the Digital EnterpriseJake Bouma
 
The Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value ThereafterThe Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value ThereafterInside Analysis
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsCaserta
 
How Celtra Optimizes its Advertising Platform with Databricks
How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with DatabricksGrega Kespret
 
Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudInside Analysis
 
Business in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for IntegrationBusiness in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for IntegrationInside Analysis
 
Trends in Data Modeling
Trends in Data ModelingTrends in Data Modeling
Trends in Data ModelingDATAVERSITY
 
Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19Prof.Balakrishnan S
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?Inside Analysis
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactDr. Sunil Kr. Pandey
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)Denodo
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Denodo
 
Big Data Analytics with Microsoft
Big Data Analytics with MicrosoftBig Data Analytics with Microsoft
Big Data Analytics with MicrosoftCaserta
 
5. big data vs it stki - pini cohen
5. big data vs  it    stki - pini cohen5. big data vs  it    stki - pini cohen
5. big data vs it stki - pini cohenTaldor Group
 
Hot Technologies of 2013: Investigative Analytics
Hot Technologies of 2013: Investigative AnalyticsHot Technologies of 2013: Investigative Analytics
Hot Technologies of 2013: Investigative AnalyticsInside Analysis
 
Understanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyUnderstanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyInside Analysis
 

Semelhante a 20151016 Data Science For Project Managers (20)

Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 
Department of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data DashboardsDepartment of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data Dashboards
 
Data Science towards the Digital Enterprise
Data Science towards the Digital EnterpriseData Science towards the Digital Enterprise
Data Science towards the Digital Enterprise
 
The Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value ThereafterThe Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value Thereafter
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
 
How Celtra Optimizes its Advertising Platform with Databricks
How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with Databricks
 
Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the Cloud
 
Business in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for IntegrationBusiness in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for Integration
 
Trends in Data Modeling
Trends in Data ModelingTrends in Data Modeling
Trends in Data Modeling
 
The Data Asset
The Data AssetThe Data Asset
The Data Asset
 
The Data Asset
The Data AssetThe Data Asset
The Data Asset
 
Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19Big Data Driven Solutions to Combat Covid' 19
Big Data Driven Solutions to Combat Covid' 19
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
 
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & ImpactData Science - An emerging Stream of Science with its Spreading Reach & Impact
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
 
How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)How Data Virtualization Puts Machine Learning into Production (APAC)
How Data Virtualization Puts Machine Learning into Production (APAC)
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
 
Big Data Analytics with Microsoft
Big Data Analytics with MicrosoftBig Data Analytics with Microsoft
Big Data Analytics with Microsoft
 
5. big data vs it stki - pini cohen
5. big data vs  it    stki - pini cohen5. big data vs  it    stki - pini cohen
5. big data vs it stki - pini cohen
 
Hot Technologies of 2013: Investigative Analytics
Hot Technologies of 2013: Investigative AnalyticsHot Technologies of 2013: Investigative Analytics
Hot Technologies of 2013: Investigative Analytics
 
Understanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyUnderstanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data Quickly
 

20151016 Data Science For Project Managers

  • 2. Data Science For Project Managers P M I - W V P R O F E S S I O N A L D E V E LO P M E N T C O N F E R E N C E O C T O B E R 20 1 5 T Z E - Y I U YO N G
  • 3. Topic Coverage Exploration of data science from various contexts Data products: Output of data science Implications for project managers
  • 4. HELLO! My name is Tze-Yiu Yong. Technology Manager https://www.linkedin.com/in/tzeyiuyong
  • 5. More About Tze Electrical engineer by training (semiconductor / statistics) MBA from Duke University Passionate about bridging the gap between technologists and non-technologists. Currently studying for PMP – advice appreciated!
  • 7. How Big Is The Digital Universe? IBM conservatively reports that we create 2.5 Exabytes of data per day! (1EB = 1MTBs) (1ZB = 1K EB or 1BTBs) Michael Lesk, the Bell Labs computer scientist known for the Lesk Algorithm, wrote in 1997 (!), “There may be a few thousand petabytes (1KTBs) of information all told … the typical piece of information will never be looked at by a human being.”
  • 8. Believe WhatYou See On Television?
  • 9. Dictionary Terms – Sum Of Parts DATA • a set of values of qualitative or quantitative variables. • pieces of data are individual pieces of information. SCIENCE • organizes knowledge in the form of testable explanations and predictions. • a way of pursuing knowledge, not only the knowledge itself. www.thefreedictionary.com
  • 10. Google Search Interest Data Science evolved from BusinessAnalytics which evolved from Data Mining.
  • 12. Job Skills In 2009, Drew Conway published thisVenn Diagram of Data Science skills.
  • 13. Job Skills Which was then expanded on by Natalia Bilenko.
  • 14. Job Skills Now crystallized by reality. Joel Grus Profile Pic
  • 17. Data Product “Any tool created with the help of data to make a more informed decision.” Examples Dashboards Spreadsheets Emails ‘People AlsoViewed’ ‘More ItemsTo Consider’ Many others
  • 18. Data Science Workflow Hadoop / Cassandra (FB) / HBase DB Schema DBMS HW: Cluster / node Big Data Descriptive Predictive Analytics Analytics (Now) Real-time Analytics TIME Unstructured data Data munging Machine learning Data mining Tableau Datawrapper
  • 19. Time Check / Questions? [6 more slides] (4 MEATY ONES)
  • 20. Implications For Project Managers “Develop a reporting portal for internal stakeholders and customers that incorporates and aggregates machine data from a 30 unit installed base.” “Implement a smart meter program for a major utility that enables increased accuracy of customer billings, improved outage management and reduced costs of usage collection information.” “Analyze existing credit related customer data to develop a predictive credit score and risk of default model. Implement a user interface that facilitates broad usability across a wide range of internal business analysts.”
  • 21. Implications For Project Managers. “Develop a reporting portal for internal stakeholders and customers that incorporates and aggregates machine data from a 30 unit installed base.” “Implement a smart meter program for a major utility that enables increased accuracy of customer billings, improved outage management and reduced costs of usage collection information.” “Analyze existing credit related customer data to develop a predictive credit score and risk of default model. Implement a user interface that facilitates broad usability across a wide range of internal business analysts.”
  • 22. Implications For Project Managers You already have a good foundation.
  • 23. Implications For Project Managers Know your stakeholders – they are numerous. [Initiating] Don’t let tools drive your policy or process. [Initiating / Planning] Think about boundaries – external vs internal, functional. [Executing] Manage up. [Monitor and Control]
  • 24. Unique Considerations Seek agreement on the quality of the data this is required for the data product. Understand and then communicate where the data comes from and where its going. Internalize the privacy concerns, regulations and practices for the data in your project scope.
  • 25. Thanks for your interest and attention! I AM HOPING FOR DISCUSSION … CONNECT WITH ME ON LINKEDIN HTTPS://WWW.LINKEDIN.COM/IN/TZEYIUYONG