SlideShare uma empresa Scribd logo
1 de 23
Baixar para ler offline
Data Science and Visualization
@iHubResearch
Data in Africa
Common challenges –
Accessibility/ Availability and Quality
How can we discover data and surface
information?
– How does Facebook and Amazon suggest
friends you may know and recommended
items to purchase?
– How can we solve the never ending traffic
problems in the city?
– How does NYC determine the frequency of
subway trains all day; all weekend?
Photo courtesy of #kenya365
What is Data Science?
The process of using data to
surface information and tell
stories
Data Science includes
collecting data, cleaning and
managing the data, making it
tell its story, and presenting
that story to others
An ideal Data Scientist is:
1/3 part Mathematician
1/3 part Computer Scientist
1/3 part Artist
How can we innovate in each of
these processes?
Data Collection
•  Survey management
processes:
– Mobile and web data
collections tools
– Open Data Portals
– Crowdsourcing tools
KODI
Data Storage
•  Data Storage/Warehousing:
–  MySQL; NoSQL; SparQL; Linked Data; Azure; Amazon
Cloud Services; Dropbox
–  What formats is the data stored in?
–  Data Cleaning processes – Need tools such as Google
Refine?
Data Analysis
Analytical Tools:
Excel; SPSS; S; R; Python; Stata; Pivot Tables
Data Mining Processes:
Hadoop; Weka
Data Visualization
•  Data Visualization:
Charts; graphs; pictures; maps
•  Infographic tools:
Illustrator; Infogr.am; ManyEyes; GIS mapping;
Tableau
•  HOW NOT TO VISUALIZE YOUR DATA!!!
*iHub Research_ Data Science and
Visualization Lab
Data Visualization
We are surfacing new data - Latest stats in
African tech. sector.
Data Visualization
We are providing a melting pot for different
industries/sectors to utilize technology to
discover and use data for decision making.
Data Visualization
We are setting local industry standards on
the use of data and influencing critical data
focusing policies:
Data Protection laws; FOIs; Privacy and
Security.
Data Visualization
We are innovating on new ways to
effectively discover and use data in our local
settings
Gsma
On-going Projects
•  Umati II – automation of hate-speech monitoring process
•  3Vs Crowdsourcing Framework– viability, validation and
verification
•  Investment Research– Mapping the Tech. Investment
Landscape in Kenya
•  Infographics– Africa infographics; Tech statistics and data
•  Data Warehousing solutions - Tools to archive and discover
our own research data and information
How can you be part of this?
Training
Business Support
Consultancy
Use Cases for iHub Cluster
Data Challenges 
Example of sites developed by Code4Kenya
Data science and visualization lab presentation

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Data Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future JobsData Science: An Emerging Field for Future Jobs
Data Science: An Emerging Field for Future Jobs
 
data science
data sciencedata science
data science
 
Data science
Data scienceData science
Data science
 
Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science  Data Science Innovations : Democratisation of Data and Data Science
Data Science Innovations : Democratisation of Data and Data Science
 
Understand the Demand of Analyst Opportunity in U.S
Understand the Demand of Analyst Opportunity in U.SUnderstand the Demand of Analyst Opportunity in U.S
Understand the Demand of Analyst Opportunity in U.S
 
Top career opportunities in data science
Top career opportunities in data scienceTop career opportunities in data science
Top career opportunities in data science
 
Data science
Data scienceData science
Data science
 
Introduction to data science club
Introduction to data science clubIntroduction to data science club
Introduction to data science club
 
Data science Big Data
Data science Big DataData science Big Data
Data science Big Data
 
IoT and Big Data
IoT and Big DataIoT and Big Data
IoT and Big Data
 
Big data Introduction
Big data IntroductionBig data Introduction
Big data Introduction
 
Big data analysis
Big data analysisBig data analysis
Big data analysis
 
Big Data
Big DataBig Data
Big Data
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Big Data and Computer Science Education
Big Data and Computer Science EducationBig Data and Computer Science Education
Big Data and Computer Science Education
 
Data science applications and usecases
Data science applications and usecasesData science applications and usecases
Data science applications and usecases
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Big Data for Ag (2019)
Big Data for Ag (2019)Big Data for Ag (2019)
Big Data for Ag (2019)
 
AI & Big Data Analytics : Innovation trends and use cases
AI & Big Data Analytics : Innovation trends and use casesAI & Big Data Analytics : Innovation trends and use cases
AI & Big Data Analytics : Innovation trends and use cases
 
Big Data
Big DataBig Data
Big Data
 

Semelhante a Data science and visualization lab presentation

Semelhante a Data science and visualization lab presentation (20)

Data science and business analytics
Data  science and business analyticsData  science and business analytics
Data science and business analytics
 
Data sciences and marketing analytics
Data sciences and marketing analyticsData sciences and marketing analytics
Data sciences and marketing analytics
 
Intro to Data Science Big Data
Intro to Data Science Big DataIntro to Data Science Big Data
Intro to Data Science Big Data
 
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...Towards a Community-driven Data Science Body of Knowledge – Data Management S...
Towards a Community-driven Data Science Body of Knowledge – Data Management S...
 
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
 
Advanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseAdvanced Analytics and Data Science Expertise
Advanced Analytics and Data Science Expertise
 
SKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSISSKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSIS
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)Getting started in Data Science (April 2017, Los Angeles)
Getting started in Data Science (April 2017, Los Angeles)
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptx
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Unit 1 (DSBDA) PD.pptx
Unit 1 (DSBDA)  PD.pptxUnit 1 (DSBDA)  PD.pptx
Unit 1 (DSBDA) PD.pptx
 
Big data
Big dataBig data
Big data
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

Data science and visualization lab presentation

  • 1. Data Science and Visualization @iHubResearch
  • 2. Data in Africa Common challenges – Accessibility/ Availability and Quality How can we discover data and surface information?
  • 3. – How does Facebook and Amazon suggest friends you may know and recommended items to purchase? – How can we solve the never ending traffic problems in the city? – How does NYC determine the frequency of subway trains all day; all weekend? Photo courtesy of #kenya365
  • 4. What is Data Science? The process of using data to surface information and tell stories Data Science includes collecting data, cleaning and managing the data, making it tell its story, and presenting that story to others
  • 5. An ideal Data Scientist is: 1/3 part Mathematician 1/3 part Computer Scientist 1/3 part Artist
  • 6. How can we innovate in each of these processes?
  • 7. Data Collection •  Survey management processes: – Mobile and web data collections tools – Open Data Portals – Crowdsourcing tools
  • 9. Data Storage •  Data Storage/Warehousing: –  MySQL; NoSQL; SparQL; Linked Data; Azure; Amazon Cloud Services; Dropbox –  What formats is the data stored in? –  Data Cleaning processes – Need tools such as Google Refine?
  • 10. Data Analysis Analytical Tools: Excel; SPSS; S; R; Python; Stata; Pivot Tables Data Mining Processes: Hadoop; Weka
  • 11. Data Visualization •  Data Visualization: Charts; graphs; pictures; maps •  Infographic tools: Illustrator; Infogr.am; ManyEyes; GIS mapping; Tableau •  HOW NOT TO VISUALIZE YOUR DATA!!!
  • 12. *iHub Research_ Data Science and Visualization Lab
  • 13. Data Visualization We are surfacing new data - Latest stats in African tech. sector.
  • 14. Data Visualization We are providing a melting pot for different industries/sectors to utilize technology to discover and use data for decision making.
  • 15. Data Visualization We are setting local industry standards on the use of data and influencing critical data focusing policies: Data Protection laws; FOIs; Privacy and Security.
  • 16. Data Visualization We are innovating on new ways to effectively discover and use data in our local settings
  • 17.
  • 18.
  • 19. Gsma
  • 20. On-going Projects •  Umati II – automation of hate-speech monitoring process •  3Vs Crowdsourcing Framework– viability, validation and verification •  Investment Research– Mapping the Tech. Investment Landscape in Kenya •  Infographics– Africa infographics; Tech statistics and data •  Data Warehousing solutions - Tools to archive and discover our own research data and information
  • 21. How can you be part of this? Training Business Support Consultancy Use Cases for iHub Cluster Data Challenges 
  • 22. Example of sites developed by Code4Kenya