SlideShare uma empresa Scribd logo
1 de 37
Baixar para ler offline
LEARNTO BE A	

DATA SCIENTIST FOR $1
Hack Kid Conference - April 2014	

by Adrian Cockcroft	

BatteryVentures
A BIG new problem	

for a new generation
Now
A BIG new problem	

for a new generation
Now
A BIG new problem	

for a new generation
Your future job as a Data Scientist
WHAT DOES A DATA SCIENTIST DO?
The hive mind map shows popular twitter hashtags
for the last 7 days and how they are connected
http://hivemindmap.com/?#
HIVE MIND MAP
A mind-map of what’s happening onTwitter
Thanks to Mark Harwood for these slides and the Hive Mind Map
http://www.infoq.com/presentations/elasticsearch-revealing-uncommonly-common
Connections
The thickness of a line between hashtags is
based on the strength of connection
Tip:!
Strength of connection
is the number of tweets
with both tags vs the
number with only one -
see “Jaccard similarity
coefficient”
Top tweets
The most popular tweets for a tag are sorted
based on the number of “retweets”
When?
The rise and fall of each hashtag’s popularity
can be shown over time
Calendar summary
Tags that “peak” together are grouped into
events on a calendar
Tip:!
Peaks are detected
using standard
deviations. Only tags
with a single peak are
chosen as events
Tip:!
Tags that rise and
fall in popularity at
the same time are
detected using
Pearson’s
Correlation
What makes this possible?
• Free software (Lucene, Java, Eclipse, Gephi, Tomcat, d3, Google analytics…)
• Free data (millions of users’ tweets from Twitter’s 1% sample feed)
• “Cloud” computing (rented server)
• Smarter web browsers (visualizations using HTML5’s SVG/Canvas)
• All the friendly folks on the internet (e.g. http://stackoverflow.com/
questions/14799842)
• Some imagination…
Opportunities in Data Science
• We are all generating volumes of data never seen before
• You can recycle the behaviors of billions of people into
more intelligent systems
• customer purchases can be used for product recommendations
• user searches can be used for spelling corrections,
• Reader clicks can influence the trending news
• Spotify activity is used to make music recommendations)
• The tools have never been cheaper
• It has never been easier to find help in developing systems
…one more thing..
I’m writing these slides for you
while on my annual snowboarding
trip to Canada.
Data science pays well ;-)
Wish you were here…
HOW CAN A KID
LEARN BIG DATA
FOR $1?
BIG DATA INTHE CLOUD WITH AMAZON EMR
https://www.youtube.com/watch?v=S6Ja55n-o0M
LESSTHAN $1
After running two of the EMR examples, creating 6 computers in the cloud
to do the analysis for up to an hour each
GOOGLE BIGQUERY
https://demobigquery.appspot.com/
BAY AREA WEATHER
https://demobigquery.appspot.com/
WHYTHE FLINTSTONES?
https://demobigquery.appspot.com/
MEASURING KIDS
How good are you at Math and Science, is it getting better or worse?
SCHOOL DATA
https://www.data.gov/	

http://eddataexpress.ed.gov/state-report.cfm/state/CA/
ACHIEVEMENT SCORES
Download results into Excel to analyze and draw graphs
DOWNLOADED DATA
Needed some clean-up. Made sure grade was consistent (4, 8, HS) for all
results, and created a short Subject column
SCORES 2004-2012
Elementary - 4th Grade, Middle School - 8th Grade, High School
SCORES 2004-2012
Elementary - 4th Grade, Middle School - 8th Grade, High School
About half of	

high school	

students in	

California are	

proficient at	

Math and	

Science
CALIFORNIA SCHOOLS
Science and Math Scores at Elementary, Middle and High School Level
CALIFORNIA SCHOOLS
Science and Math Scores at Elementary, Middle and High School Level
Scores have	

been getting	

better. Good!
CALIFORNIA SCHOOLS
Science and Math Scores at Elementary, Middle and High School Level
Scores have	

been getting	

better. Good!
Maybe the	

Math tests	

were harder	

for everyone	

that year?
CALIFORNIA SCHOOLS
Science and Math Scores at Elementary, Middle and High School Level
Scores have	

been getting	

better. Good!4th Grade	

“cohort” in	

2004 was 8th	

Grade in 2008
Maybe the	

Math tests	

were harder	

for everyone	

that year?
DATA SCIENCE WITH EXCEL
Pivot tables let you rearrange data and trend lines measure the slope
LEARNTO BE A DATA SCIENTIST FOR $1
• Everything is being measured	

• The latest data science tools are
available to anyone for pennies	

• There is lots of freely available data	

• Pay attention in math and science class,
play around with EMR and Bigquery
and get an interesting and well paid job
as a data scientist!

Mais conteúdo relacionado

Destaque

Data Scientist: The Sexiest Job in the 21st Century
Data Scientist: The Sexiest Job in the 21st CenturyData Scientist: The Sexiest Job in the 21st Century
Data Scientist: The Sexiest Job in the 21st CenturyLyn Fenex
 
What is a Data Scientist
What is a Data Scientist What is a Data Scientist
What is a Data Scientist Experian_US
 
Data science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyData science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyPeter Kua
 
Be a Data Scientist in 8 steps!
Be a Data Scientist in 8 steps! Be a Data Scientist in 8 steps!
Be a Data Scientist in 8 steps! PromptCloud
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientistPoo Kuan Hoong
 
A Data Scientist Experiment
A Data Scientist ExperimentA Data Scientist Experiment
A Data Scientist ExperimentJan Chipchase
 
Data Scientist 101 BI Dutch
Data Scientist 101 BI DutchData Scientist 101 BI Dutch
Data Scientist 101 BI DutchJos van Dongen
 
Вебинар: Инструменты для работы Data Scientist
Вебинар: Инструменты для работы Data ScientistВебинар: Инструменты для работы Data Scientist
Вебинар: Инструменты для работы Data ScientistFlyElephant
 
Data Science Day New York: Data Scientist - The New Data Analyst
Data Science Day New York: Data Scientist - The New Data AnalystData Science Day New York: Data Scientist - The New Data Analyst
Data Science Day New York: Data Scientist - The New Data AnalystCloudera, Inc.
 
Girish Sathyanarayana, Senior Data Scientist at AppLift, " Business Value Thr...
Girish Sathyanarayana, Senior Data Scientist at AppLift, " Business Value Thr...Girish Sathyanarayana, Senior Data Scientist at AppLift, " Business Value Thr...
Girish Sathyanarayana, Senior Data Scientist at AppLift, " Business Value Thr...Dataconomy Media
 
Is Data Scientist still the sexiest job of 21st century? Find Out!
Is Data Scientist still the sexiest job of 21st century? Find Out!Is Data Scientist still the sexiest job of 21st century? Find Out!
Is Data Scientist still the sexiest job of 21st century? Find Out!Edureka!
 
How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist? HackerEarth
 
How Will AI Change the Role of the Data Scientist?
How Will AI Change the Role of the Data Scientist?How Will AI Change the Role of the Data Scientist?
How Will AI Change the Role of the Data Scientist?Hugo Gävert
 
How to Become a Data Scientist – By Ryan Orban, VP of Operations and Expansio...
How to Become a Data Scientist – By Ryan Orban, VP of Operations and Expansio...How to Become a Data Scientist – By Ryan Orban, VP of Operations and Expansio...
How to Become a Data Scientist – By Ryan Orban, VP of Operations and Expansio...Galvanize
 
Life of a data scientist (pub)
Life of a data scientist (pub)Life of a data scientist (pub)
Life of a data scientist (pub)Buhwan Jeong
 
Göteborg university(condensed)
Göteborg university(condensed)Göteborg university(condensed)
Göteborg university(condensed)Zenodia Charpy
 

Destaque (17)

Data Scientist: The Sexiest Job in the 21st Century
Data Scientist: The Sexiest Job in the 21st CenturyData Scientist: The Sexiest Job in the 21st Century
Data Scientist: The Sexiest Job in the 21st Century
 
What is a Data Scientist
What is a Data Scientist What is a Data Scientist
What is a Data Scientist
 
Data science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi PeriasamyData science vs. Data scientist by Jothi Periasamy
Data science vs. Data scientist by Jothi Periasamy
 
Be a Data Scientist in 8 steps!
Be a Data Scientist in 8 steps! Be a Data Scientist in 8 steps!
Be a Data Scientist in 8 steps!
 
Data Scientist Why now?
Data Scientist Why now?Data Scientist Why now?
Data Scientist Why now?
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientist
 
A Data Scientist Experiment
A Data Scientist ExperimentA Data Scientist Experiment
A Data Scientist Experiment
 
Data Scientist 101 BI Dutch
Data Scientist 101 BI DutchData Scientist 101 BI Dutch
Data Scientist 101 BI Dutch
 
Вебинар: Инструменты для работы Data Scientist
Вебинар: Инструменты для работы Data ScientistВебинар: Инструменты для работы Data Scientist
Вебинар: Инструменты для работы Data Scientist
 
Data Science Day New York: Data Scientist - The New Data Analyst
Data Science Day New York: Data Scientist - The New Data AnalystData Science Day New York: Data Scientist - The New Data Analyst
Data Science Day New York: Data Scientist - The New Data Analyst
 
Girish Sathyanarayana, Senior Data Scientist at AppLift, " Business Value Thr...
Girish Sathyanarayana, Senior Data Scientist at AppLift, " Business Value Thr...Girish Sathyanarayana, Senior Data Scientist at AppLift, " Business Value Thr...
Girish Sathyanarayana, Senior Data Scientist at AppLift, " Business Value Thr...
 
Is Data Scientist still the sexiest job of 21st century? Find Out!
Is Data Scientist still the sexiest job of 21st century? Find Out!Is Data Scientist still the sexiest job of 21st century? Find Out!
Is Data Scientist still the sexiest job of 21st century? Find Out!
 
How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist?
 
How Will AI Change the Role of the Data Scientist?
How Will AI Change the Role of the Data Scientist?How Will AI Change the Role of the Data Scientist?
How Will AI Change the Role of the Data Scientist?
 
How to Become a Data Scientist – By Ryan Orban, VP of Operations and Expansio...
How to Become a Data Scientist – By Ryan Orban, VP of Operations and Expansio...How to Become a Data Scientist – By Ryan Orban, VP of Operations and Expansio...
How to Become a Data Scientist – By Ryan Orban, VP of Operations and Expansio...
 
Life of a data scientist (pub)
Life of a data scientist (pub)Life of a data scientist (pub)
Life of a data scientist (pub)
 
Göteborg university(condensed)
Göteborg university(condensed)Göteborg university(condensed)
Göteborg university(condensed)
 

Semelhante a Hack Kid Con - Learn to be a Data Scientist for $1

WLMA 14 Conference Keynote PPT - Paige Jaeger: Connecting Creatively with the CC
WLMA 14 Conference Keynote PPT - Paige Jaeger: Connecting Creatively with the CCWLMA 14 Conference Keynote PPT - Paige Jaeger: Connecting Creatively with the CC
WLMA 14 Conference Keynote PPT - Paige Jaeger: Connecting Creatively with the CCPaige Jaeger
 
Epub compass 2012 ace_conference
Epub compass 2012 ace_conferenceEpub compass 2012 ace_conference
Epub compass 2012 ace_conferencelindarg
 
Open Source Data Visualization for Resource Sharing: An Ivy Plus Libraries Pr...
Open Source Data Visualization for Resource Sharing: An Ivy Plus Libraries Pr...Open Source Data Visualization for Resource Sharing: An Ivy Plus Libraries Pr...
Open Source Data Visualization for Resource Sharing: An Ivy Plus Libraries Pr...Heidi Nance
 
Melbourne officeevent
Melbourne officeeventMelbourne officeevent
Melbourne officeeventStephen Abram
 
Wow resource roundup
Wow resource roundupWow resource roundup
Wow resource roundupjdanielian
 
Data matters-bournemouth-2015
Data matters-bournemouth-2015Data matters-bournemouth-2015
Data matters-bournemouth-2015Alan Dix
 
From Digital Literacy to Digital Fluency
From Digital Literacy to Digital FluencyFrom Digital Literacy to Digital Fluency
From Digital Literacy to Digital FluencyDavid Cain
 
Evaluating Electronic Resources
Evaluating Electronic ResourcesEvaluating Electronic Resources
Evaluating Electronic ResourcesRichard Bernier
 
Bondurant-Farrar
Bondurant-FarrarBondurant-Farrar
Bondurant-FarrarEvan Abbey
 
Bondurant-Farrar
Bondurant-FarrarBondurant-Farrar
Bondurant-FarrarEvan Abbey
 
How Does Reading & Learning Change on the Internet: Responding to New Literacies
How Does Reading & Learning Change on the Internet: Responding to New LiteraciesHow Does Reading & Learning Change on the Internet: Responding to New Literacies
How Does Reading & Learning Change on the Internet: Responding to New LiteraciesJulie Coiro
 
Getting to Know Your Data with R
Getting to Know Your Data with RGetting to Know Your Data with R
Getting to Know Your Data with RStephen Withington
 
Fys presentation 12_aug_2010
Fys presentation 12_aug_2010Fys presentation 12_aug_2010
Fys presentation 12_aug_2010Bruce Gilbert
 
Professional Information Research
Professional Information ResearchProfessional Information Research
Professional Information ResearchEric Kokke
 

Semelhante a Hack Kid Con - Learn to be a Data Scientist for $1 (20)

Delaware2011
Delaware2011Delaware2011
Delaware2011
 
WLMA 14 Conference Keynote PPT - Paige Jaeger: Connecting Creatively with the CC
WLMA 14 Conference Keynote PPT - Paige Jaeger: Connecting Creatively with the CCWLMA 14 Conference Keynote PPT - Paige Jaeger: Connecting Creatively with the CC
WLMA 14 Conference Keynote PPT - Paige Jaeger: Connecting Creatively with the CC
 
Epub compass 2012 ace_conference
Epub compass 2012 ace_conferenceEpub compass 2012 ace_conference
Epub compass 2012 ace_conference
 
Open Source Data Visualization for Resource Sharing: An Ivy Plus Libraries Pr...
Open Source Data Visualization for Resource Sharing: An Ivy Plus Libraries Pr...Open Source Data Visualization for Resource Sharing: An Ivy Plus Libraries Pr...
Open Source Data Visualization for Resource Sharing: An Ivy Plus Libraries Pr...
 
Melbourne officeevent
Melbourne officeeventMelbourne officeevent
Melbourne officeevent
 
Mich la april 2011
Mich la april 2011Mich la april 2011
Mich la april 2011
 
Wow resource roundup
Wow resource roundupWow resource roundup
Wow resource roundup
 
Get Ready For Abundance Culture At High School
Get Ready For Abundance Culture At High SchoolGet Ready For Abundance Culture At High School
Get Ready For Abundance Culture At High School
 
Data matters-bournemouth-2015
Data matters-bournemouth-2015Data matters-bournemouth-2015
Data matters-bournemouth-2015
 
From Digital Literacy to Digital Fluency
From Digital Literacy to Digital FluencyFrom Digital Literacy to Digital Fluency
From Digital Literacy to Digital Fluency
 
Maine Libraries
Maine LibrariesMaine Libraries
Maine Libraries
 
Evaluating Electronic Resources
Evaluating Electronic ResourcesEvaluating Electronic Resources
Evaluating Electronic Resources
 
Ma sla
Ma slaMa sla
Ma sla
 
Bondurant-Farrar
Bondurant-FarrarBondurant-Farrar
Bondurant-Farrar
 
Bondurant-Farrar
Bondurant-FarrarBondurant-Farrar
Bondurant-Farrar
 
How Does Reading & Learning Change on the Internet: Responding to New Literacies
How Does Reading & Learning Change on the Internet: Responding to New LiteraciesHow Does Reading & Learning Change on the Internet: Responding to New Literacies
How Does Reading & Learning Change on the Internet: Responding to New Literacies
 
Getting to Know Your Data with R
Getting to Know Your Data with RGetting to Know Your Data with R
Getting to Know Your Data with R
 
Fys presentation 12_aug_2010
Fys presentation 12_aug_2010Fys presentation 12_aug_2010
Fys presentation 12_aug_2010
 
Why Be Open?
Why Be Open?Why Be Open?
Why Be Open?
 
Professional Information Research
Professional Information ResearchProfessional Information Research
Professional Information Research
 

Mais de Adrian Cockcroft

Microservices Workshop All Topics Deck 2016
Microservices Workshop All Topics Deck 2016Microservices Workshop All Topics Deck 2016
Microservices Workshop All Topics Deck 2016Adrian Cockcroft
 
Gophercon 2016 Communicating Sequential Goroutines
Gophercon 2016 Communicating Sequential GoroutinesGophercon 2016 Communicating Sequential Goroutines
Gophercon 2016 Communicating Sequential GoroutinesAdrian Cockcroft
 
Monitoring Challenges - Monitorama 2016 - Monitoringless
Monitoring Challenges - Monitorama 2016 - MonitoringlessMonitoring Challenges - Monitorama 2016 - Monitoringless
Monitoring Challenges - Monitorama 2016 - MonitoringlessAdrian Cockcroft
 
Microservices Application Tracing Standards and Simulators - Adrians at OSCON
Microservices Application Tracing Standards and Simulators - Adrians at OSCONMicroservices Application Tracing Standards and Simulators - Adrians at OSCON
Microservices Application Tracing Standards and Simulators - Adrians at OSCONAdrian Cockcroft
 
Microservices Workshop - Craft Conference
Microservices Workshop - Craft ConferenceMicroservices Workshop - Craft Conference
Microservices Workshop - Craft ConferenceAdrian Cockcroft
 
Evolution of Microservices - Craft Conference
Evolution of Microservices - Craft ConferenceEvolution of Microservices - Craft Conference
Evolution of Microservices - Craft ConferenceAdrian Cockcroft
 
Microservices: What's Missing - O'Reilly Software Architecture New York
Microservices: What's Missing - O'Reilly Software Architecture New YorkMicroservices: What's Missing - O'Reilly Software Architecture New York
Microservices: What's Missing - O'Reilly Software Architecture New YorkAdrian Cockcroft
 
What's Missing? Microservices Meetup at Cisco
What's Missing? Microservices Meetup at CiscoWhat's Missing? Microservices Meetup at Cisco
What's Missing? Microservices Meetup at CiscoAdrian Cockcroft
 
Microxchg Analyzing Response Time Distributions for Microservices
Microxchg Analyzing Response Time Distributions for MicroservicesMicroxchg Analyzing Response Time Distributions for Microservices
Microxchg Analyzing Response Time Distributions for MicroservicesAdrian Cockcroft
 
Innovation and Architecture
Innovation and ArchitectureInnovation and Architecture
Innovation and ArchitectureAdrian Cockcroft
 
Cloud Trends Nov2015 Structure
Cloud Trends Nov2015 StructureCloud Trends Nov2015 Structure
Cloud Trends Nov2015 StructureAdrian Cockcroft
 
Openstack Silicon Valley - Vendor Lock In
Openstack Silicon Valley - Vendor Lock InOpenstack Silicon Valley - Vendor Lock In
Openstack Silicon Valley - Vendor Lock InAdrian Cockcroft
 
When Developers Operate and Operators Develop
When Developers Operate and Operators DevelopWhen Developers Operate and Operators Develop
When Developers Operate and Operators DevelopAdrian Cockcroft
 
Dockercon 2015 - Faster Cheaper Safer
Dockercon 2015 - Faster Cheaper SaferDockercon 2015 - Faster Cheaper Safer
Dockercon 2015 - Faster Cheaper SaferAdrian Cockcroft
 
Microservices the Good Bad and the Ugly
Microservices the Good Bad and the UglyMicroservices the Good Bad and the Ugly
Microservices the Good Bad and the UglyAdrian Cockcroft
 
Gluecon Monitoring Microservices and Containers: A Challenge
Gluecon Monitoring Microservices and Containers: A ChallengeGluecon Monitoring Microservices and Containers: A Challenge
Gluecon Monitoring Microservices and Containers: A ChallengeAdrian Cockcroft
 
Software Architecture Conference - Monitoring Microservices - A Challenge
Software Architecture Conference -  Monitoring Microservices - A ChallengeSoftware Architecture Conference -  Monitoring Microservices - A Challenge
Software Architecture Conference - Monitoring Microservices - A ChallengeAdrian Cockcroft
 
Cloud Native Cost Optimization UCC
Cloud Native Cost Optimization UCCCloud Native Cost Optimization UCC
Cloud Native Cost Optimization UCCAdrian Cockcroft
 

Mais de Adrian Cockcroft (20)

Microservices Workshop All Topics Deck 2016
Microservices Workshop All Topics Deck 2016Microservices Workshop All Topics Deck 2016
Microservices Workshop All Topics Deck 2016
 
Gophercon 2016 Communicating Sequential Goroutines
Gophercon 2016 Communicating Sequential GoroutinesGophercon 2016 Communicating Sequential Goroutines
Gophercon 2016 Communicating Sequential Goroutines
 
Monitoring Challenges - Monitorama 2016 - Monitoringless
Monitoring Challenges - Monitorama 2016 - MonitoringlessMonitoring Challenges - Monitorama 2016 - Monitoringless
Monitoring Challenges - Monitorama 2016 - Monitoringless
 
Microservices Application Tracing Standards and Simulators - Adrians at OSCON
Microservices Application Tracing Standards and Simulators - Adrians at OSCONMicroservices Application Tracing Standards and Simulators - Adrians at OSCON
Microservices Application Tracing Standards and Simulators - Adrians at OSCON
 
Microservices Workshop - Craft Conference
Microservices Workshop - Craft ConferenceMicroservices Workshop - Craft Conference
Microservices Workshop - Craft Conference
 
Evolution of Microservices - Craft Conference
Evolution of Microservices - Craft ConferenceEvolution of Microservices - Craft Conference
Evolution of Microservices - Craft Conference
 
Microservices: What's Missing - O'Reilly Software Architecture New York
Microservices: What's Missing - O'Reilly Software Architecture New YorkMicroservices: What's Missing - O'Reilly Software Architecture New York
Microservices: What's Missing - O'Reilly Software Architecture New York
 
What's Missing? Microservices Meetup at Cisco
What's Missing? Microservices Meetup at CiscoWhat's Missing? Microservices Meetup at Cisco
What's Missing? Microservices Meetup at Cisco
 
In Search of Segmentation
In Search of SegmentationIn Search of Segmentation
In Search of Segmentation
 
Microxchg Analyzing Response Time Distributions for Microservices
Microxchg Analyzing Response Time Distributions for MicroservicesMicroxchg Analyzing Response Time Distributions for Microservices
Microxchg Analyzing Response Time Distributions for Microservices
 
Innovation and Architecture
Innovation and ArchitectureInnovation and Architecture
Innovation and Architecture
 
Cloud Trends Nov2015 Structure
Cloud Trends Nov2015 StructureCloud Trends Nov2015 Structure
Cloud Trends Nov2015 Structure
 
Openstack Silicon Valley - Vendor Lock In
Openstack Silicon Valley - Vendor Lock InOpenstack Silicon Valley - Vendor Lock In
Openstack Silicon Valley - Vendor Lock In
 
When Developers Operate and Operators Develop
When Developers Operate and Operators DevelopWhen Developers Operate and Operators Develop
When Developers Operate and Operators Develop
 
Dockercon 2015 - Faster Cheaper Safer
Dockercon 2015 - Faster Cheaper SaferDockercon 2015 - Faster Cheaper Safer
Dockercon 2015 - Faster Cheaper Safer
 
Microservices the Good Bad and the Ugly
Microservices the Good Bad and the UglyMicroservices the Good Bad and the Ugly
Microservices the Good Bad and the Ugly
 
Gluecon Monitoring Microservices and Containers: A Challenge
Gluecon Monitoring Microservices and Containers: A ChallengeGluecon Monitoring Microservices and Containers: A Challenge
Gluecon Monitoring Microservices and Containers: A Challenge
 
Software Architecture Conference - Monitoring Microservices - A Challenge
Software Architecture Conference -  Monitoring Microservices - A ChallengeSoftware Architecture Conference -  Monitoring Microservices - A Challenge
Software Architecture Conference - Monitoring Microservices - A Challenge
 
Microxchg Microservices
Microxchg MicroservicesMicroxchg Microservices
Microxchg Microservices
 
Cloud Native Cost Optimization UCC
Cloud Native Cost Optimization UCCCloud Native Cost Optimization UCC
Cloud Native Cost Optimization UCC
 

Último

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 

Último (20)

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Hack Kid Con - Learn to be a Data Scientist for $1

  • 1. LEARNTO BE A DATA SCIENTIST FOR $1 Hack Kid Conference - April 2014 by Adrian Cockcroft BatteryVentures
  • 2.
  • 3.
  • 4.
  • 5. A BIG new problem for a new generation
  • 6. Now A BIG new problem for a new generation
  • 7. Now A BIG new problem for a new generation Your future job as a Data Scientist
  • 8.
  • 9. WHAT DOES A DATA SCIENTIST DO?
  • 10.
  • 11. The hive mind map shows popular twitter hashtags for the last 7 days and how they are connected http://hivemindmap.com/?#
  • 12. HIVE MIND MAP A mind-map of what’s happening onTwitter Thanks to Mark Harwood for these slides and the Hive Mind Map http://www.infoq.com/presentations/elasticsearch-revealing-uncommonly-common
  • 13. Connections The thickness of a line between hashtags is based on the strength of connection Tip:! Strength of connection is the number of tweets with both tags vs the number with only one - see “Jaccard similarity coefficient”
  • 14. Top tweets The most popular tweets for a tag are sorted based on the number of “retweets”
  • 15. When? The rise and fall of each hashtag’s popularity can be shown over time
  • 16. Calendar summary Tags that “peak” together are grouped into events on a calendar Tip:! Peaks are detected using standard deviations. Only tags with a single peak are chosen as events Tip:! Tags that rise and fall in popularity at the same time are detected using Pearson’s Correlation
  • 17. What makes this possible? • Free software (Lucene, Java, Eclipse, Gephi, Tomcat, d3, Google analytics…) • Free data (millions of users’ tweets from Twitter’s 1% sample feed) • “Cloud” computing (rented server) • Smarter web browsers (visualizations using HTML5’s SVG/Canvas) • All the friendly folks on the internet (e.g. http://stackoverflow.com/ questions/14799842) • Some imagination…
  • 18. Opportunities in Data Science • We are all generating volumes of data never seen before • You can recycle the behaviors of billions of people into more intelligent systems • customer purchases can be used for product recommendations • user searches can be used for spelling corrections, • Reader clicks can influence the trending news • Spotify activity is used to make music recommendations) • The tools have never been cheaper • It has never been easier to find help in developing systems
  • 19. …one more thing.. I’m writing these slides for you while on my annual snowboarding trip to Canada. Data science pays well ;-) Wish you were here…
  • 20. HOW CAN A KID LEARN BIG DATA FOR $1?
  • 21. BIG DATA INTHE CLOUD WITH AMAZON EMR https://www.youtube.com/watch?v=S6Ja55n-o0M
  • 22. LESSTHAN $1 After running two of the EMR examples, creating 6 computers in the cloud to do the analysis for up to an hour each
  • 26. MEASURING KIDS How good are you at Math and Science, is it getting better or worse?
  • 28. ACHIEVEMENT SCORES Download results into Excel to analyze and draw graphs
  • 29. DOWNLOADED DATA Needed some clean-up. Made sure grade was consistent (4, 8, HS) for all results, and created a short Subject column
  • 30. SCORES 2004-2012 Elementary - 4th Grade, Middle School - 8th Grade, High School
  • 31. SCORES 2004-2012 Elementary - 4th Grade, Middle School - 8th Grade, High School About half of high school students in California are proficient at Math and Science
  • 32. CALIFORNIA SCHOOLS Science and Math Scores at Elementary, Middle and High School Level
  • 33. CALIFORNIA SCHOOLS Science and Math Scores at Elementary, Middle and High School Level Scores have been getting better. Good!
  • 34. CALIFORNIA SCHOOLS Science and Math Scores at Elementary, Middle and High School Level Scores have been getting better. Good! Maybe the Math tests were harder for everyone that year?
  • 35. CALIFORNIA SCHOOLS Science and Math Scores at Elementary, Middle and High School Level Scores have been getting better. Good!4th Grade “cohort” in 2004 was 8th Grade in 2008 Maybe the Math tests were harder for everyone that year?
  • 36. DATA SCIENCE WITH EXCEL Pivot tables let you rearrange data and trend lines measure the slope
  • 37. LEARNTO BE A DATA SCIENTIST FOR $1 • Everything is being measured • The latest data science tools are available to anyone for pennies • There is lots of freely available data • Pay attention in math and science class, play around with EMR and Bigquery and get an interesting and well paid job as a data scientist!