SlideShare uma empresa Scribd logo
1 de 77
Baixar para ler offline
Full Stack Deep Learning - UC Berkeley Spring 2021
Week 8
Machine Learning Teams
Full Stack Deep Learning - UC Berkeley Spring 2021
Running ML teams is hard
2
ML Teams - overview
Running any technical team is hard…
• ML talent is expensive and scarce

• ML teams have a diverse set of roles

• Projects have unclear timelines and high
uncertainty

• The field is moving fast and ML is the “high-
interest credit card of technical debt”

• Leadership often doesn’t understand AI
• Hiring great people

• Managing and developing those people

• Managing your team’s output and making
sure your vectors are aligned

• Making good long-term technical choices &
managing technical debt

• Managing expectations from leadership
… And ML adds complexity
Full Stack Deep Learning - UC Berkeley Spring 2021
Goal of this module
• Give you some insight into how to think about building and managing ML
teams

• Help you get a job in ML
3
ML Teams - overview
Full Stack Deep Learning - UC Berkeley Spring 2021
Managing
Module overview
4
Hiring
• How to manage a ML team
• How to hire ML engineers. How to get hired.
ML Teams - overview
Orgs
• How ML teams are organized and how they
fit into the broader organization
Roles
• ML-related roles and the skills they require
Full Stack Deep Learning - UC Berkeley Spring 2021
Managing
Module overview
5
Hiring
• How to manage a ML team
• How to hire ML engineers. How to get hired.
Orgs
• How ML teams are organized and how they
fit into the broader organization
Roles
• ML-related roles and the skills they require
ML Teams - roles
Full Stack Deep Learning - UC Berkeley Spring 2021
Most common ML roles
• ML product manager

• DevOps

• Data engineer

• ML engineer

• ML researcher / ML scientist

• Data scientist
6
ML Teams - roles
Full Stack Deep Learning - UC Berkeley Spring 2021
Most common ML roles
• ML product manager

• DevOps

• Data engineer

• ML engineer

• ML researcher / ML scientist

• Data scientist
7
What’s the difference?
ML Teams - roles
Full Stack Deep Learning - UC Berkeley Spring 2021
Breakdown of job function by role
8
Role Job Function Work product
Commonly used
tools
ML product
manager
Work with ML team, business, users, data
owners to prioritize & execute projects
Design docs, wireframes,
work plans
Jira, etc
ML Teams - roles
Full Stack Deep Learning - UC Berkeley Spring 2021
Breakdown of job function by role
9
Role Job Function Work product
Commonly used
tools
ML product
manager
Work with ML team, business, users, data
owners to prioritize & execute projects
Design docs, wireframes,
work plans
Jira, etc
DevOps engineer Deploy & monitor production systems Deployed product AWS, etc.
ML Teams - roles
Full Stack Deep Learning - UC Berkeley Spring 2021
Breakdown of job function by role
10
Role Job Function Work product
Commonly used
tools
ML product
manager
Work with ML team, business, users, data
owners to prioritize & execute projects
Design docs, wireframes,
work plans
Jira, etc
DevOps engineer Deploy & monitor production systems Deployed product AWS, etc.
Data engineer
Build data pipelines, aggregation,
storage, monitoring
Distributed system
Hadoop, Kafka,
Airflow
ML Teams - roles
Full Stack Deep Learning - UC Berkeley Spring 2021
Breakdown of job function by role
11
Role Job Function Work product
Commonly used
tools
ML product
manager
Work with ML team, business, users, data
owners to prioritize & execute projects
Design docs, wireframes,
work plans
Jira, etc
DevOps engineer Deploy & monitor production systems Deployed product AWS, etc.
Data engineer
Build data pipelines, aggregation,
storage, monitoring
Distributed system
Hadoop, Kafka,
Airflow
ML engineer Train & deploy prediction models
Prediction system
running on real data
(often in production)
Tensorflow, Docker
ML Teams - roles
Full Stack Deep Learning - UC Berkeley Spring 2021
Breakdown of job function by role
12
Role Job Function Work product
Commonly used
tools
ML product
manager
Work with ML team, business, users, data
owners to prioritize & execute projects
Design docs, wireframes,
work plans
Jira, etc
DevOps engineer Deploy & monitor production systems Deployed product AWS, etc.
Data engineer
Build data pipelines, aggregation,
storage, monitoring
Distributed system
Hadoop, Kafka,
Airflow
ML engineer Train & deploy prediction models
Prediction system
running on real data
(often in production)
Tensorflow, Docker
ML researcher
Train prediction models (often forward
looking or not production-critical)
Prediction model & report
describing it
Tensorflow, pytorch,
Jupyter
ML Teams - roles
Full Stack Deep Learning - UC Berkeley Spring 2021
Breakdown of job function by role
13
Role Job Function Work product
Commonly used
tools
ML product
manager
Work with ML team, business, users, data
owners to prioritize & execute projects
Design docs, wireframes,
work plans
Jira, etc
DevOps engineer Deploy & monitor production systems Deployed product AWS, etc.
Data engineer
Build data pipelines, aggregation,
storage, monitoring
Distributed system
Hadoop, Kafka,
Airflow
ML engineer Train & deploy prediction models
Prediction system
running on real data
(often in production)
Tensorflow, Docker
ML researcher
Train prediction models (often forward
looking or not production-critical)
Prediction model & report
describing it
Tensorflow, pytorch,
Jupyter
Data scientist
Blanket term used to describe all of the
above. In some orgs, means answering
business questions using analytics
Prediction model or
report
SQL, Excel, Jupyter,
Pandas, SKLearn,
Tensorflow
ML Teams - roles
Full Stack Deep Learning - UC Berkeley Spring 2021
What skills are needed for the roles?
14
Machine learning
Low
High
Low High
Size of bubble =
communication /
technical writing
ML
Researcher
ML
Engineer
Data
scientist
Data
engineer
ML DevOps
ML Teams - roles
ML PM
Software
engineering
Full Stack Deep Learning - UC Berkeley Spring 2021
What skills are needed for the roles?
15
Machine learning
Low
High
Low High
Size of bubble =
communication /
technical writing
ML
Researcher
ML
Engineer
Data
scientist
Data
engineer
ML DevOps
ML Teams - roles
ML PM
Primarily a software engineering role.
Often from standard SWE pipeline.
Software
engineering
Full Stack Deep Learning - UC Berkeley Spring 2021
What skills are needed for the roles?
16
Machine learning
Low
High
Low High
Size of bubble =
communication /
technical writing
ML
Researcher
ML
Engineer
Data
scientist
Data
engineer
ML DevOps
ML Teams - roles
ML PM
SWE with ML team as an active
customer
Software
engineering
Full Stack Deep Learning - UC Berkeley Spring 2021
What skills are needed for the roles?
17
Machine learning
Low
High
Low High
Size of bubble =
communication /
technical writing
ML
Researcher
ML
Engineer
Data
scientist
Data
engineer
ML DevOps
ML Teams - roles
ML PM
Rare mix of ML skills and SWE skills.
Often SWEs with significant self-
teaching or science / engineering
PhDs who worked as traditional SWEs
after grad school
Software
engineering
Full Stack Deep Learning - UC Berkeley Spring 2021
What skills are needed for the roles?
18
Machine learning
Low
High
Low High
Size of bubble =
communication /
technical writing
ML
Researcher
ML
Engineer
Data
scientist
Data
engineer
ML DevOps
ML Teams - roles
ML PM
ML experts. Usually have MS/PhD in CS or
Stats or did an industrial fellowship program
Software
engineering
Full Stack Deep Learning - UC Berkeley Spring 2021
What skills are needed for the roles?
19
Machine learning
Low
High
Low High
Size of bubble =
communication /
technical writing
ML
Researcher
ML
Engineer
Data
scientist
Data
engineer
ML DevOps
ML Teams - roles
ML PM
Wide range of backgrounds from
undergrad-only to science PhD
Software
engineering
Full Stack Deep Learning - UC Berkeley Spring 2021
What skills are needed for the roles?
20
Machine learning
Low
High
Low High
Size of bubble =
communication /
technical writing
ML
Researcher
ML
Engineer
Data
scientist
Data
engineer
ML DevOps
ML Teams - roles
ML PM
Traditional PMs, but with a deep
understanding of the ML
development process & mindset
Software
engineering
Full Stack Deep Learning - UC Berkeley Spring 2021
Questions?
21
ML Teams - roles
Full Stack Deep Learning - UC Berkeley Spring 2021
Managing
Module overview
22
Hiring
• How to manage a ML team
• How to hire ML engineers. How to get hired.
Orgs
• How ML teams are organized and how they
fit into the broader organization
Roles
• ML-related roles and the skills they require
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
ML org structures - lessons learned
• No consensus yet on the right way to structure a ML
team

• This lecture: taxonomy of best practices for different
organizational maturity levels
23
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
ML organization archetypes
24
The ML Organization Mountain
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
ML organization archetypes
25
The ML Organization Mountain Nascent / Ad-Hoc ML
What it looks
like
• No one is doing ML, or ML is done on an ad-hoc basis

• Little ML expertise in-house
Example
organizations
• Most small-medium businesses

• Less technology-forward large companies (education,
logistics, etc)
Advantages
• Often low-hanging fruit for ML
Dis-
advantages
• Little support for ML projects, difficult to hire and retain
good talent
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
ML R&D
ML organization archetypes
26
What it looks
like
• ML efforts are centered in the R&D arm of the
organization

• Often hire researchers / PhDs & write papers
Example
organizations
• Larger Oil & gas, manufacturing, telecom companies
Advantages
• Often can hire experienced researchers

• Can work on long-term business priorities & big wins
Dis-
advantages
• Difficult to get data

• Rarely translates into actual business value, so usually
the amount of investment remains small
The ML Organization Mountain
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
ML embedded into business / product teams
ML organization archetypes
27
What it looks
like
• Certain product teams or business units have ML
expertise along-side their software or analytics talent

• ML reports up to the team’s engineering lead or tech
lead
Example
organizations
• Software / technology companies

• Financial services companies
Advantages
• ML improvements are likely to lead to business value

• Tight feedback cycle between idea and product
improvement
Dis-
advantages
• Hard to hire and develop top talent

• Access to resources (data / compute) can lag

• ML project cycles conflict with engineering mgmt

• Long-term projects can be hard to justify
The ML Organization Mountain
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
Independent ML Function
ML organization archetypes
28
What it looks
like
• ML division reporting to senior leadership (often CEO)

• ML PMs work with MLRs, MLEs, and customers to
build ML into products

• Teams sometimes publish long-term research
Example
organizations
• Large financial services companies
Advantages
• Talent density allows to hire & train top practitioners

• Senior leaders can marshal data / compute resources

• Can invest in tooling, practices, and culture around ML
development
Dis-
advantages
• Model handoffs to lines of business can be challenging
- users need to buy-in and be educated on model use

• Feedback cycles can be slow
The ML Organization Mountain
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
ML-First Organizations
ML organization archetypes
29
What it looks
like
• CEO buy-in 

• ML division working on challenging, long-term projects

• ML expertise in every line of business focusing on
quick wins and working with central ML division
Example
organizations
• Large tech companies

• ML-focused startups
Advantages
• Best data access: data thinking permeates the org

• Recruiting: ML team works on hardest problems

• Easiest deployment: product teams understand ML
Dis-
advantages
• Hard to implement

• Challenging & expensive to recruit enough talent

• Culturally difficult to embed ML thinking everywhere
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
ML team structures - design choices
30
Software
engineering
vs research
Data
ownership
Model
ownership
Key questions
• To what extent is the ML team responsible
for building or integrating with software?

• How important are SWE skills on the
team?
• How much control does the ML team
have over data collection, warehousing,
labeling, and pipelining?
• Is the ML team responsible for deploying
models into production?

• Who maintains deployed models?
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
ML team structures - design choices
31
Software
engineering
vs research
Data
ownership
Model
ownership
• Research prioritized
over SWE skills

• Researcher-SWE
collaboration lacking
ML R&D
• ML team has no control
over data

• ML team typically will
not have data
engineering component
• Models are rarely
deployed into
production
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
ML team structures - design choices
32
Software
engineering
vs research
Data
ownership
Model
ownership
Embedded ML
• SWE skills prioritized
over research skills

• Often, all researchers
need strong SWE as
everyone expected to
deploy
• ML team generally
does not own data
production / mgmt

• Work with data
engineers to build
pipelines
• ML engineers own the
models that they
deploy into production
ML Teams - orgs
• Research prioritized
over SWE skills

• Researcher-SWE
collaboration lacking
ML R&D
• ML team has no control
over data

• ML team typically will
not have data
engineering component
• Models are rarely
deployed into
production
Full Stack Deep Learning - UC Berkeley Spring 2021
ML team structures - design choices
33
Software
engineering
vs research
Data
ownership
Model
ownership
ML Function
• Each team has a strong
mix of SWE and
research skills

• SWE and researchers
work closely together
within team
• ML team has a voice in
data governance
discussions

• ML team has strong
internal data
engineering function
• ML team hands off
models to user, but is
responsible for
maintaining them
ML Teams - orgs
Embedded ML
• Research prioritized
over SWE skills

• Researcher-SWE
collaboration lacking
ML R&D
• ML team has no control
over data

• ML team typically will
not have data
engineering component
• Models are rarely
deployed into
production
• SWE skills prioritized
over research skills

• Often, all researchers
need strong SWE as
everyone expected to
deploy
• ML team generally
does not own data
production / mgmt

• Work with data
engineers to build
pipelines
• ML engineers own the
models that they
deploy into production
Full Stack Deep Learning - UC Berkeley Spring 2021
ML team structures - design choices
34
Software
engineering
vs research
Data
ownership
Model
ownership
Embedded ML
• Research prioritized
over SWE skills

• Researcher-SWE
collaboration lacking
ML R&D ML Function ML First
• ML team has no control
over data

• ML team typically will
not have data
engineering component
• Models are rarely
deployed into
production
• SWE skills prioritized
over research skills

• Often, all researchers
need strong SWE as
everyone expected to
deploy
• Each team has a strong
mix of SWE and
research skills

• SWE and researchers
work closely together
within team
• Different teams are
more or less research
oriented

• Research teams
collaborate closely with
SWE teams
• ML team generally
does not own data
production / mgmt

• Work with data
engineers to build
pipelines
• ML engineers own the
models that they
deploy into production
• ML team has a voice in
data governance
discussions

• ML team has strong
internal data
engineering function
• ML team hands off
models to user, but is
responsible for
maintaining them
• ML team often owns
company-wide data
infrastructure
• ML team hands off
models to user, who
operates and maintains
them
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
ML team structures - design choices
35
Software
engineering
vs research
Data
ownership
Model
ownership
Embedded ML
• Research prioritized
over SWE skills

• Researcher-SWE
collaboration lacking
ML R&D ML Function ML First
• ML team has no control
over data

• ML team typically will
not have data
engineering component
• Models are rarely
deployed into
production
• SWE skills prioritized
over research skills

• Often, all researchers
need strong SWE as
everyone expected to
deploy
• Each team has a strong
mix of SWE and
research skills

• SWE and researchers
work closely together
within team
• Different teams are
more or less research
oriented

• Research teams
collaborate closely with
SWE teams
• ML team generally
does not own data
production / mgmt

• Work with data
engineers to build
pipelines
• ML engineers own the
models that they
deploy into production
• ML team has a voice in
data governance
discussions

• ML team has strong
internal data
engineering function
• ML team hands off
models to user, but is
responsible for
maintaining them
• ML team often owns
company-wide data
infrastructure
• ML team hands off
models to user, who
operates and maintains
them
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
Questions?
36
ML Teams - orgs
Full Stack Deep Learning - UC Berkeley Spring 2021
Managing
Module overview
37
Hiring
• How to manage a ML team
• How to hire ML engineers. How to get hired.
Orgs
• How ML teams are organized and how they
fit into the broader organization
Roles
• ML-related roles and the skills they require
ML Teams - managing
Full Stack Deep Learning - UC Berkeley Spring 2021
Managing ML teams is challenging
• It’s hard to tell in advance how hard or easy something is
38
ML Teams - managing
Full Stack Deep Learning - UC Berkeley Spring 2021
Managing ML teams is challenging
39
https://medium.com/@l2k/why-are-machine-learning-projects-so-hard-to-manage-8e9b9cf49641
It’s hard to tell in advance how easy or hard something is
ML Teams - managing
Full Stack Deep Learning - UC Berkeley Spring 2021
Managing ML teams is challenging
• It’s hard to tell in advance how hard or easy something is

• ML progress is nonlinear

• Very common for projects to stall for weeks or longer

• In early stages, difficult to plan project because unclear what will work

• As a result, estimating project timelines is extremely difficult

• I.e., production ML is still somewhere between “research” and “engineering”
40
ML Teams - managing
Full Stack Deep Learning - UC Berkeley Spring 2021
Managing ML teams is challenging
• It’s hard to tell in advance how hard or easy something is

• ML progress is nonlinear

• There are cultural gaps between research and engineering

• Different values, backgrounds, goals, norms

• In toxic cultures, the two sides often don’t value one another
41
ML Teams - managing
Full Stack Deep Learning - UC Berkeley Spring 2021
Managing ML teams is challenging
• It’s hard to tell in advance how hard or easy something is

• ML progress is nonlinear

• There are cultural gaps between research and engineering

• Leaders often don’t understand it
42
ML Teams - managing
Full Stack Deep Learning - UC Berkeley Spring 2021
How to manage ML teams better
• Do ML Project planning probabilistically
43
ML Teams - managing
Full Stack Deep Learning - UC Berkeley Spring 2021
How to manage ML teams better
• Do ML project planning probabilistically 

• From:









• To:



44
ML Teams - managing
Task A Task C Task D
Task E Task F
Task G
Week 1 Week 2 Week 3 Week 4
Full Stack Deep Learning - UC Berkeley Spring 2021
How to manage ML teams better
• Do ML project planning probabilistically 

• From:









• To:



45
ML Teams - managing
Task A Task C Task D
Task E Task F
Task G
Week 1 Week 2 Week 3 Week 4
Task A (50%)
Task B (25%)
Task C (50%)
Week 1 Week 2 Week 3 Week 4
Task D (75%)
Full Stack Deep Learning - UC Berkeley Spring 2021
How to manage ML teams better
• Do ML project planning probabilistically 

• From:









• To:



46
ML Teams - managing
Task A Task C Task D
Task E Task F
Task G
Week 1 Week 2 Week 3 Week 4
Task A (50%)
Task B (25%)
Task C (50%)
Week 1 Week 2 Week 3 Week 4
Task D (75%)
Full Stack Deep Learning - UC Berkeley Spring 2021
How to manage ML teams better
• Do ML project planning probabilistically 

• From:









• To:



47
ML Teams - managing
Task A Task C Task D
Task E Task F
Task G
Week 1 Week 2 Week 3 Week 4
Task A
Task B
Task C
Week 1 Week 2 Week 3 Week 4
Task D
Full Stack Deep Learning - UC Berkeley Spring 2021
How to manage ML teams better
• Do ML project planning probabilistically 

• From:









• To:



48
ML Teams - managing
Task A Task C Task D
Task E Task F
Task G
Week 1 Week 2 Week 3 Week 4
Task A
Task B
Task C
Week 1 Week 2 Week 3 Week 4
Task E (50%)
Task F (50%)
Task G (10%)
Task D (75%)
Full Stack Deep Learning - UC Berkeley Spring 2021
How to manage ML teams better
• Do ML Project planning probabilistically 

• Attempt a portfolio of approaches

• Measure progress based on inputs, not results

• Have researchers and engineers work together

• Get end-to-end pipelines together quickly to demonstrate quick wins

• Educate leadership on ML timeline uncertainty
49
ML Teams - managing
Full Stack Deep Learning - UC Berkeley Spring 2021
Resources for educating execs
• https://a16z.com/2016/06/10/ai-deep-learning-machines/

• Pieter’s upcoming AI Strategy class:

https://emeritus-executive.berkeley.edu/artificial-intelligence/
50
ML Teams - managing
Full Stack Deep Learning - UC Berkeley Spring 2021
Questions?
51
ML Teams - managing
Full Stack Deep Learning - UC Berkeley Spring 2021
Managing
Module overview
52
Hiring
• How to manage a ML team
• How to hire ML engineers. How to get hired.
Orgs
• How ML teams are organized and how they
fit into the broader organization
Roles
• ML-related roles and the skills they require
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Hiring for ML - outline
• The AI Talent Gap

• Sourcing

• Interviewing

• Finding a job
53
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Hiring for ML - outline
• The AI Talent Gap
• Sourcing

• Interviewing

• Finding a job
54
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
The AI Talent Gap
55
How many people know how to build AI systems?
5,000 (actively publishing research [Element AI])

10,000 (estimated num people with the right skillset [Element AI])

22,000 (PhD-educated AI researchers [Bloomberg])

90,000 (upper bound on number of people [Element AI])

200,000 - 300,000 (Number of AI researcher / practitioners [Tencent])

3.6M (Number of software developers in the US)

18.2M (Number of software developers in the world)
Sources: The AI Talent Shortage (Nikolai Yakovenko) https://medium.com/@Moscow25/the-ai-talent-shortage-704d8cf0c4cc
Just How Shallow is the Artificial Intelligence Talent Pool (Jeremy Kahn) 

https://www.bloomberg.com/news/articles/2018-02-07/just-how-shallow-is-the-artificial-intelligence-talent-pool
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021 56
The AI talent gap
Fierce competition for AI talent
“Everyone agrees that the competition to hire people who know how
to build artificial intelligence systems is intense. It’s  turned once-
staid academic conferences into frenzied meet markets for corporate
recruiters and driven the salaries of the top researchers to seven-
figures.” 

(Bloomberg)
Sources: The AI Talent Shortage (Nikolai Yakovenko) https://medium.com/@Moscow25/the-ai-talent-shortage-704d8cf0c4cc
Just How Shallow is the Artificial Intelligence Talent Pool (Jeremy Kahn) 

https://www.bloomberg.com/news/articles/2018-02-07/just-how-shallow-is-the-artificial-intelligence-talent-pool
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021 57
The AI talent gap
Fierce competition for AI talent
“Hiring is crazy right now. ML is a young field that got popular very
quickly. There’s a ton of demand and not a lot of supply.” 

(Computer Vision Engineer at Series C startup)
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021 58
The AI talent gap
Fierce competition for AI talent
“Hiring for ML is really challenging and takes way more time and
effort than we expected. We have someone working on it full-time
and we’re still only able to get a few people per quarter”
(Startup Founder)
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Hiring for ML - outline
• The AI Talent Gap

• Sourcing
• Interviewing

• Finding a job
59
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Most common ML roles
• ML product manager

• DevOps

• Data engineer

• ML engineer

• ML researcher / ML scientist

• Data scientist
60
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Most common ML roles
• ML product manager

• DevOps

• Data engineer

• ML engineer

• ML researcher / ML scientist

• Data scientist
61
Slightly different mindset required. 

Helpful to look for demonstrated
interest in AI - courses, conferences,
re-implementations, etc
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Most common ML roles
• ML product manager

• DevOps

• Data engineer

• ML engineer

• ML researcher / ML scientist

• Data scientist
62
Our focus
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
How to hire MLEs - the wrong way
• Job Description (Unicorn Machine Learning Engineer)

• Duties

• Keep up with the state of the art

• Implement models from scratch

• Deep understanding of mathematics & ability to come up with new models

• Build tooling & infrastructure for the ML team

• Build data pipelines for the ML team

• Deploy & monitor models into production

• Requirements

• PhD

• At least 4 years tensorflow experience

• At least 4 years as a software engineer

• Publications in top ML conference

• Experience building large-scale distributed systems
63
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
How to hire MLEs - the right way
• Hire for software engineering skills, interest in ML, and desire to learn.
Train to do ML.

• Go more junior. Most undergrad computer science students graduate with
ML experience.

• Be more specific about what you need. Not every ML engineer needs to
do DevOps.
64
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
How to hire MLRs
• Look for quality of publications, not quantity (e.g., originality of ideas,
quality of execution)

• Look for researchers with an eye for working on important problems (many
researchers focus on trendy problems without considering why they matter)

• Look for researchers with experience outside of academia

• Consider hiring talented people from adjacent fields (physics, statistics,
math)

• Consider hiring people without PhDs (e.g., talented undergraduate /
masters students, graduates of Google/Facebook/OpenAI fellowship
programs, dedicated self-studiers)
65
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
How to find MLE/MLR candidates
• Standard sources: LinkedIn, recruiters, on-campus recruiting, etc

• Monitor arXiv and top conferences and flag first authors of papers you like

• Look for good reimplementations of papers you like

• Attend ML research conferences (NeurIPS, ICLR, ICML)
66
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
How to attract MLR / MLE candidates
67
What do machine learning practitioners want? How to make your company stand out?
• Work with cutting edge tools & techniques
• Build skills / knowledge in an exciting field
• Work with excellent people
• Work on interesting datasets
• Do work that matters
• Work on research-oriented projects. Publicize
them. Invest in tooling for your team & empower
employees to try new tools.
• Build team culture around learning
(reading groups, learning days, professional
development budget, conference budget)
• Hire high-profile people. Help your best people
build their profile through publishing blogs &
papers.
• Sell the uniqueness of your dataset in recruiting
materials.
• Sell the mission of your company and potential
impact of machine learning on that mission. Work
on projects that have a tangible impact today.
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Hiring for ML - outline
• The AI Talent Gap

• Sourcing

• Interviewing
• Finding a job
68
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
What to test in an ML interview?
• Hire for strengths 

• Meet a minimum bar for everything else
69
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
What to test in an ML interview?
• Validate your hypotheses of candidate’s strengths

• Researchers: make sure they can think creatively about new ML
problems, probe how thoughtful they were about previous projects

• Engineers: make sure they are great generalist SWEs

• Make sure candidates meet a minimum bar on weaker areas

• Researchers: test SWE knowledge and ability to write good code

• SWEs: test ML knowledge
70
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
What happens in a ML interview?
• Much less well-defined than software engineering interviews

• Common types of assessments:

• Background & culture fit

• Whiteboard coding (similar to SWE interviews)

• Pair coding (similar to SWE interviews)

• Pair debugging (often ML-specific code)

• Math puzzles (e.g., involving linear algebra)

• Take-home ML project

• Applied ML (e.g., explain how you’d solve this problem with ML)

• Previous ML projects (e.g., probing on what you tried, why things did / didn’t work)

• ML theory (e.g., bias-variance tradeoff, overfitting, underfitting, understanding of specific
algorithms)
71
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Hiring for ML - outline
• The AI Talent Gap

• Sourcing

• Interviewing

• Finding a job
72
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Where to look for a ML job?
• Standard sources: LinkedIn, recruiters, on-campus recruiting, etc

• ML research conferences (NeurIPS, ICLR, ICML)

• Apply directly (remember, there’s a talent gap!)

• This course

• Those who pass the exam will get access to our recruiting database
73
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
How to stand out for ML roles?
• Build software engineering skills (e.g., work at a well-known software
company)

• Exhibit interest in ML (e.g., conference attendance, online courses taken)

• Show you have broad knowledge of ML (e.g., write blog posts synthesizing a
research area)

• Demonstrate ability to get ML projects done (e.g., create side projects, re-
implement papers)

• Prove you can think creatively in ML (e.g., win Kaggle competitions, publish
papers)
74
More
impressive
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
How to prepare for the interview?
• Prepare for a general SWE interview (e.g., “Cracking the Coding
Interview”)

• Prepare to talk in detail about your past ML projects (remember details,
prepare to talk about tradeoffs and decisions you made)

• Review how basic ML algorithms work (linear / logistic regression, nearest
neighbor, decision trees, k-means, MLPs, ConvNets, recurrent nets, etc)

• Review ML theory

• Think about the problems the company you’re interviewing with may face
and what ML techniques may apply to them
75
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Conclusion
76
Orgs
• ML teams are becoming more standalone,
hence more interdisciplinary
Roles
• Lots of different skills involved in production
ML, so there’s an opportunity for many to
contribute
Hiring
• Talent is scarce, so be specific about what is
must-have. It can be hard to break in as an
outsider - use projects to build awareness.
Managing
• Managing ML teams is hard. There’s no
silver bullet, but shifting toward probabilistic
planning can help
ML Teams - hiring
Full Stack Deep Learning - UC Berkeley Spring 2021
Thank you!
77

Mais conteúdo relacionado

Mais procurados

Mais procurados (20)

Infrastructure and Tooling - Full Stack Deep Learning
Infrastructure and Tooling - Full Stack Deep LearningInfrastructure and Tooling - Full Stack Deep Learning
Infrastructure and Tooling - Full Stack Deep Learning
 
Lecture 8: Data Management (Full Stack Deep Learning - Spring 2021)
Lecture 8: Data Management (Full Stack Deep Learning - Spring 2021)Lecture 8: Data Management (Full Stack Deep Learning - Spring 2021)
Lecture 8: Data Management (Full Stack Deep Learning - Spring 2021)
 
Setting up Machine Learning Projects - Full Stack Deep Learning
Setting up Machine Learning Projects - Full Stack Deep LearningSetting up Machine Learning Projects - Full Stack Deep Learning
Setting up Machine Learning Projects - Full Stack Deep Learning
 
Lecture 1: Deep Learning Fundamentals - Full Stack Deep Learning - Spring 2021
Lecture 1: Deep Learning Fundamentals - Full Stack Deep Learning - Spring 2021Lecture 1: Deep Learning Fundamentals - Full Stack Deep Learning - Spring 2021
Lecture 1: Deep Learning Fundamentals - Full Stack Deep Learning - Spring 2021
 
Lecture 4: Transformers (Full Stack Deep Learning - Spring 2021)
Lecture 4: Transformers (Full Stack Deep Learning - Spring 2021)Lecture 4: Transformers (Full Stack Deep Learning - Spring 2021)
Lecture 4: Transformers (Full Stack Deep Learning - Spring 2021)
 
On Impact in Software Engineering Research (HU Berlin 2021)
On Impact in Software Engineering Research (HU Berlin 2021)On Impact in Software Engineering Research (HU Berlin 2021)
On Impact in Software Engineering Research (HU Berlin 2021)
 
On impact in Software Engineering Research (ICSE 2018 New Faculty Symposium)
On impact in Software Engineering Research (ICSE 2018 New Faculty Symposium)On impact in Software Engineering Research (ICSE 2018 New Faculty Symposium)
On impact in Software Engineering Research (ICSE 2018 New Faculty Symposium)
 
Lecture 3: RNNs - Full Stack Deep Learning - Spring 2021
Lecture 3: RNNs - Full Stack Deep Learning - Spring 2021Lecture 3: RNNs - Full Stack Deep Learning - Spring 2021
Lecture 3: RNNs - Full Stack Deep Learning - Spring 2021
 
On Impact in Software Engineering Research (Dagstuhl 2020)
On Impact in Software Engineering Research (Dagstuhl 2020)On Impact in Software Engineering Research (Dagstuhl 2020)
On Impact in Software Engineering Research (Dagstuhl 2020)
 
On Impact in Software Engineering Research
On Impact in Software Engineering ResearchOn Impact in Software Engineering Research
On Impact in Software Engineering Research
 
ACM Chicago March 2019 meeting: Software Engineering and AI - Prof. Tao Xie, ...
ACM Chicago March 2019 meeting: Software Engineering and AI - Prof. Tao Xie, ...ACM Chicago March 2019 meeting: Software Engineering and AI - Prof. Tao Xie, ...
ACM Chicago March 2019 meeting: Software Engineering and AI - Prof. Tao Xie, ...
 
ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...
ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...
ISEC'18 Keynote: Intelligent Software Engineering: Synergy between AI and Sof...
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAML
 
MSRA 2018: Intelligent Software Engineering: Synergy between AI and Software ...
MSRA 2018: Intelligent Software Engineering: Synergy between AI and Software ...MSRA 2018: Intelligent Software Engineering: Synergy between AI and Software ...
MSRA 2018: Intelligent Software Engineering: Synergy between AI and Software ...
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
 
GALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software EngineeringGALE: Geometric active learning for Search-Based Software Engineering
GALE: Geometric active learning for Search-Based Software Engineering
 
Intelligent Software Engineering: Synergy between AI and Software Engineering
Intelligent Software Engineering: Synergy between AI and Software EngineeringIntelligent Software Engineering: Synergy between AI and Software Engineering
Intelligent Software Engineering: Synergy between AI and Software Engineering
 
Ai use cases
Ai use casesAi use cases
Ai use cases
 
Data ops: Machine Learning in production
Data ops: Machine Learning in productionData ops: Machine Learning in production
Data ops: Machine Learning in production
 
Software Mining and Software Datasets
Software Mining and Software DatasetsSoftware Mining and Software Datasets
Software Mining and Software Datasets
 

Semelhante a Lecture 13: ML Teams (Full Stack Deep Learning - Spring 2021)

201407 MIT CDO IQ conceptual data modeling, big data, and information quality
201407 MIT CDO IQ conceptual data modeling, big data, and information quality201407 MIT CDO IQ conceptual data modeling, big data, and information quality
201407 MIT CDO IQ conceptual data modeling, big data, and information quality
Peter O'Kelly
 
A Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxA Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptx
RajSingh512965
 
Mark Grohman Resume with Recommendation
Mark Grohman Resume with RecommendationMark Grohman Resume with Recommendation
Mark Grohman Resume with Recommendation
Mark Grohman
 
Mis321 ch0
Mis321 ch0Mis321 ch0
Mis321 ch0
benkaro3
 

Semelhante a Lecture 13: ML Teams (Full Stack Deep Learning - Spring 2021) (20)

Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
Object-Oriented Analysis and Design
Object-Oriented Analysis and DesignObject-Oriented Analysis and Design
Object-Oriented Analysis and Design
 
Data Production Pipelines: Legacy, practices, and innovation
Data Production Pipelines: Legacy, practices, and innovationData Production Pipelines: Legacy, practices, and innovation
Data Production Pipelines: Legacy, practices, and innovation
 
201407 MIT CDO IQ conceptual data modeling, big data, and information quality
201407 MIT CDO IQ conceptual data modeling, big data, and information quality201407 MIT CDO IQ conceptual data modeling, big data, and information quality
201407 MIT CDO IQ conceptual data modeling, big data, and information quality
 
A Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptxA Comprehensive Learning Path to Become a Data Science 2021.pptx
A Comprehensive Learning Path to Become a Data Science 2021.pptx
 
5-CEN6016-Chapter1.ppt
5-CEN6016-Chapter1.ppt5-CEN6016-Chapter1.ppt
5-CEN6016-Chapter1.ppt
 
parthiban Loganathan
parthiban Loganathanparthiban Loganathan
parthiban Loganathan
 
Patterns for Successful Data Science Projects (Spark AI Summit)
Patterns for Successful Data Science Projects (Spark AI Summit)Patterns for Successful Data Science Projects (Spark AI Summit)
Patterns for Successful Data Science Projects (Spark AI Summit)
 
Experimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOpsExperimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOps
 
Gulshan_resume
Gulshan_resumeGulshan_resume
Gulshan_resume
 
DevOps as an emerging university discipline
DevOps as an emerging university disciplineDevOps as an emerging university discipline
DevOps as an emerging university discipline
 
ML platforms & auto ml - UEM annotated (2) - #digitalbusinessweek
ML platforms & auto ml - UEM annotated (2) - #digitalbusinessweekML platforms & auto ml - UEM annotated (2) - #digitalbusinessweek
ML platforms & auto ml - UEM annotated (2) - #digitalbusinessweek
 
CEN6016-Chapter1.ppt
CEN6016-Chapter1.pptCEN6016-Chapter1.ppt
CEN6016-Chapter1.ppt
 
CEN6016-Chapter1.ppt
CEN6016-Chapter1.pptCEN6016-Chapter1.ppt
CEN6016-Chapter1.ppt
 
Mark Grohman Resume with Recommendation
Mark Grohman Resume with RecommendationMark Grohman Resume with Recommendation
Mark Grohman Resume with Recommendation
 
Avin kotian u
Avin kotian uAvin kotian u
Avin kotian u
 
Data Engineering Course Syllabus - WeCloudData
Data Engineering Course Syllabus - WeCloudDataData Engineering Course Syllabus - WeCloudData
Data Engineering Course Syllabus - WeCloudData
 
What's Next for MLflow in 2019
What's Next for MLflow in 2019What's Next for MLflow in 2019
What's Next for MLflow in 2019
 
Studying Software Engineering Patterns for Designing Machine Learning Systems
Studying Software Engineering Patterns for Designing Machine Learning SystemsStudying Software Engineering Patterns for Designing Machine Learning Systems
Studying Software Engineering Patterns for Designing Machine Learning Systems
 
Mis321 ch0
Mis321 ch0Mis321 ch0
Mis321 ch0
 

Mais de Sergey Karayev

Attentional Object Detection - introductory slides.
Attentional Object Detection - introductory slides.Attentional Object Detection - introductory slides.
Attentional Object Detection - introductory slides.
Sergey Karayev
 

Mais de Sergey Karayev (9)

Lecture 12: Research Directions (Full Stack Deep Learning - Spring 2021)
Lecture 12: Research Directions (Full Stack Deep Learning - Spring 2021)Lecture 12: Research Directions (Full Stack Deep Learning - Spring 2021)
Lecture 12: Research Directions (Full Stack Deep Learning - Spring 2021)
 
Lecture 11: ML Deployment & Monitoring (Full Stack Deep Learning - Spring 2021)
Lecture 11: ML Deployment & Monitoring (Full Stack Deep Learning - Spring 2021)Lecture 11: ML Deployment & Monitoring (Full Stack Deep Learning - Spring 2021)
Lecture 11: ML Deployment & Monitoring (Full Stack Deep Learning - Spring 2021)
 
Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)
Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)
Lecture 6: Infrastructure & Tooling (Full Stack Deep Learning - Spring 2021)
 
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
 
Lecture 2.A: Convolutional Networks - Full Stack Deep Learning - Spring 2021
Lecture 2.A: Convolutional Networks - Full Stack Deep Learning - Spring 2021Lecture 2.A: Convolutional Networks - Full Stack Deep Learning - Spring 2021
Lecture 2.A: Convolutional Networks - Full Stack Deep Learning - Spring 2021
 
Lab 1: Intro and Setup - Full Stack Deep Learning - Spring 2021
Lab 1: Intro and Setup - Full Stack Deep Learning - Spring 2021Lab 1: Intro and Setup - Full Stack Deep Learning - Spring 2021
Lab 1: Intro and Setup - Full Stack Deep Learning - Spring 2021
 
Research Directions - Full Stack Deep Learning
Research Directions - Full Stack Deep LearningResearch Directions - Full Stack Deep Learning
Research Directions - Full Stack Deep Learning
 
AI Masterclass at ASU GSV 2019
AI Masterclass at ASU GSV 2019AI Masterclass at ASU GSV 2019
AI Masterclass at ASU GSV 2019
 
Attentional Object Detection - introductory slides.
Attentional Object Detection - introductory slides.Attentional Object Detection - introductory slides.
Attentional Object Detection - introductory slides.
 

Último

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 

Lecture 13: ML Teams (Full Stack Deep Learning - Spring 2021)

  • 1. Full Stack Deep Learning - UC Berkeley Spring 2021 Week 8 Machine Learning Teams
  • 2. Full Stack Deep Learning - UC Berkeley Spring 2021 Running ML teams is hard 2 ML Teams - overview Running any technical team is hard… • ML talent is expensive and scarce • ML teams have a diverse set of roles • Projects have unclear timelines and high uncertainty • The field is moving fast and ML is the “high- interest credit card of technical debt” • Leadership often doesn’t understand AI • Hiring great people • Managing and developing those people • Managing your team’s output and making sure your vectors are aligned • Making good long-term technical choices & managing technical debt • Managing expectations from leadership … And ML adds complexity
  • 3. Full Stack Deep Learning - UC Berkeley Spring 2021 Goal of this module • Give you some insight into how to think about building and managing ML teams • Help you get a job in ML 3 ML Teams - overview
  • 4. Full Stack Deep Learning - UC Berkeley Spring 2021 Managing Module overview 4 Hiring • How to manage a ML team • How to hire ML engineers. How to get hired. ML Teams - overview Orgs • How ML teams are organized and how they fit into the broader organization Roles • ML-related roles and the skills they require
  • 5. Full Stack Deep Learning - UC Berkeley Spring 2021 Managing Module overview 5 Hiring • How to manage a ML team • How to hire ML engineers. How to get hired. Orgs • How ML teams are organized and how they fit into the broader organization Roles • ML-related roles and the skills they require ML Teams - roles
  • 6. Full Stack Deep Learning - UC Berkeley Spring 2021 Most common ML roles • ML product manager • DevOps • Data engineer • ML engineer • ML researcher / ML scientist • Data scientist 6 ML Teams - roles
  • 7. Full Stack Deep Learning - UC Berkeley Spring 2021 Most common ML roles • ML product manager • DevOps • Data engineer • ML engineer • ML researcher / ML scientist • Data scientist 7 What’s the difference? ML Teams - roles
  • 8. Full Stack Deep Learning - UC Berkeley Spring 2021 Breakdown of job function by role 8 Role Job Function Work product Commonly used tools ML product manager Work with ML team, business, users, data owners to prioritize & execute projects Design docs, wireframes, work plans Jira, etc ML Teams - roles
  • 9. Full Stack Deep Learning - UC Berkeley Spring 2021 Breakdown of job function by role 9 Role Job Function Work product Commonly used tools ML product manager Work with ML team, business, users, data owners to prioritize & execute projects Design docs, wireframes, work plans Jira, etc DevOps engineer Deploy & monitor production systems Deployed product AWS, etc. ML Teams - roles
  • 10. Full Stack Deep Learning - UC Berkeley Spring 2021 Breakdown of job function by role 10 Role Job Function Work product Commonly used tools ML product manager Work with ML team, business, users, data owners to prioritize & execute projects Design docs, wireframes, work plans Jira, etc DevOps engineer Deploy & monitor production systems Deployed product AWS, etc. Data engineer Build data pipelines, aggregation, storage, monitoring Distributed system Hadoop, Kafka, Airflow ML Teams - roles
  • 11. Full Stack Deep Learning - UC Berkeley Spring 2021 Breakdown of job function by role 11 Role Job Function Work product Commonly used tools ML product manager Work with ML team, business, users, data owners to prioritize & execute projects Design docs, wireframes, work plans Jira, etc DevOps engineer Deploy & monitor production systems Deployed product AWS, etc. Data engineer Build data pipelines, aggregation, storage, monitoring Distributed system Hadoop, Kafka, Airflow ML engineer Train & deploy prediction models Prediction system running on real data (often in production) Tensorflow, Docker ML Teams - roles
  • 12. Full Stack Deep Learning - UC Berkeley Spring 2021 Breakdown of job function by role 12 Role Job Function Work product Commonly used tools ML product manager Work with ML team, business, users, data owners to prioritize & execute projects Design docs, wireframes, work plans Jira, etc DevOps engineer Deploy & monitor production systems Deployed product AWS, etc. Data engineer Build data pipelines, aggregation, storage, monitoring Distributed system Hadoop, Kafka, Airflow ML engineer Train & deploy prediction models Prediction system running on real data (often in production) Tensorflow, Docker ML researcher Train prediction models (often forward looking or not production-critical) Prediction model & report describing it Tensorflow, pytorch, Jupyter ML Teams - roles
  • 13. Full Stack Deep Learning - UC Berkeley Spring 2021 Breakdown of job function by role 13 Role Job Function Work product Commonly used tools ML product manager Work with ML team, business, users, data owners to prioritize & execute projects Design docs, wireframes, work plans Jira, etc DevOps engineer Deploy & monitor production systems Deployed product AWS, etc. Data engineer Build data pipelines, aggregation, storage, monitoring Distributed system Hadoop, Kafka, Airflow ML engineer Train & deploy prediction models Prediction system running on real data (often in production) Tensorflow, Docker ML researcher Train prediction models (often forward looking or not production-critical) Prediction model & report describing it Tensorflow, pytorch, Jupyter Data scientist Blanket term used to describe all of the above. In some orgs, means answering business questions using analytics Prediction model or report SQL, Excel, Jupyter, Pandas, SKLearn, Tensorflow ML Teams - roles
  • 14. Full Stack Deep Learning - UC Berkeley Spring 2021 What skills are needed for the roles? 14 Machine learning Low High Low High Size of bubble = communication / technical writing ML Researcher ML Engineer Data scientist Data engineer ML DevOps ML Teams - roles ML PM Software engineering
  • 15. Full Stack Deep Learning - UC Berkeley Spring 2021 What skills are needed for the roles? 15 Machine learning Low High Low High Size of bubble = communication / technical writing ML Researcher ML Engineer Data scientist Data engineer ML DevOps ML Teams - roles ML PM Primarily a software engineering role. Often from standard SWE pipeline. Software engineering
  • 16. Full Stack Deep Learning - UC Berkeley Spring 2021 What skills are needed for the roles? 16 Machine learning Low High Low High Size of bubble = communication / technical writing ML Researcher ML Engineer Data scientist Data engineer ML DevOps ML Teams - roles ML PM SWE with ML team as an active customer Software engineering
  • 17. Full Stack Deep Learning - UC Berkeley Spring 2021 What skills are needed for the roles? 17 Machine learning Low High Low High Size of bubble = communication / technical writing ML Researcher ML Engineer Data scientist Data engineer ML DevOps ML Teams - roles ML PM Rare mix of ML skills and SWE skills. Often SWEs with significant self- teaching or science / engineering PhDs who worked as traditional SWEs after grad school Software engineering
  • 18. Full Stack Deep Learning - UC Berkeley Spring 2021 What skills are needed for the roles? 18 Machine learning Low High Low High Size of bubble = communication / technical writing ML Researcher ML Engineer Data scientist Data engineer ML DevOps ML Teams - roles ML PM ML experts. Usually have MS/PhD in CS or Stats or did an industrial fellowship program Software engineering
  • 19. Full Stack Deep Learning - UC Berkeley Spring 2021 What skills are needed for the roles? 19 Machine learning Low High Low High Size of bubble = communication / technical writing ML Researcher ML Engineer Data scientist Data engineer ML DevOps ML Teams - roles ML PM Wide range of backgrounds from undergrad-only to science PhD Software engineering
  • 20. Full Stack Deep Learning - UC Berkeley Spring 2021 What skills are needed for the roles? 20 Machine learning Low High Low High Size of bubble = communication / technical writing ML Researcher ML Engineer Data scientist Data engineer ML DevOps ML Teams - roles ML PM Traditional PMs, but with a deep understanding of the ML development process & mindset Software engineering
  • 21. Full Stack Deep Learning - UC Berkeley Spring 2021 Questions? 21 ML Teams - roles
  • 22. Full Stack Deep Learning - UC Berkeley Spring 2021 Managing Module overview 22 Hiring • How to manage a ML team • How to hire ML engineers. How to get hired. Orgs • How ML teams are organized and how they fit into the broader organization Roles • ML-related roles and the skills they require ML Teams - orgs
  • 23. Full Stack Deep Learning - UC Berkeley Spring 2021 ML org structures - lessons learned • No consensus yet on the right way to structure a ML team • This lecture: taxonomy of best practices for different organizational maturity levels 23 ML Teams - orgs
  • 24. Full Stack Deep Learning - UC Berkeley Spring 2021 ML organization archetypes 24 The ML Organization Mountain ML Teams - orgs
  • 25. Full Stack Deep Learning - UC Berkeley Spring 2021 ML organization archetypes 25 The ML Organization Mountain Nascent / Ad-Hoc ML What it looks like • No one is doing ML, or ML is done on an ad-hoc basis • Little ML expertise in-house Example organizations • Most small-medium businesses • Less technology-forward large companies (education, logistics, etc) Advantages • Often low-hanging fruit for ML Dis- advantages • Little support for ML projects, difficult to hire and retain good talent ML Teams - orgs
  • 26. Full Stack Deep Learning - UC Berkeley Spring 2021 ML R&D ML organization archetypes 26 What it looks like • ML efforts are centered in the R&D arm of the organization • Often hire researchers / PhDs & write papers Example organizations • Larger Oil & gas, manufacturing, telecom companies Advantages • Often can hire experienced researchers • Can work on long-term business priorities & big wins Dis- advantages • Difficult to get data • Rarely translates into actual business value, so usually the amount of investment remains small The ML Organization Mountain ML Teams - orgs
  • 27. Full Stack Deep Learning - UC Berkeley Spring 2021 ML embedded into business / product teams ML organization archetypes 27 What it looks like • Certain product teams or business units have ML expertise along-side their software or analytics talent • ML reports up to the team’s engineering lead or tech lead Example organizations • Software / technology companies • Financial services companies Advantages • ML improvements are likely to lead to business value • Tight feedback cycle between idea and product improvement Dis- advantages • Hard to hire and develop top talent • Access to resources (data / compute) can lag • ML project cycles conflict with engineering mgmt • Long-term projects can be hard to justify The ML Organization Mountain ML Teams - orgs
  • 28. Full Stack Deep Learning - UC Berkeley Spring 2021 Independent ML Function ML organization archetypes 28 What it looks like • ML division reporting to senior leadership (often CEO) • ML PMs work with MLRs, MLEs, and customers to build ML into products • Teams sometimes publish long-term research Example organizations • Large financial services companies Advantages • Talent density allows to hire & train top practitioners • Senior leaders can marshal data / compute resources • Can invest in tooling, practices, and culture around ML development Dis- advantages • Model handoffs to lines of business can be challenging - users need to buy-in and be educated on model use • Feedback cycles can be slow The ML Organization Mountain ML Teams - orgs
  • 29. Full Stack Deep Learning - UC Berkeley Spring 2021 ML-First Organizations ML organization archetypes 29 What it looks like • CEO buy-in • ML division working on challenging, long-term projects • ML expertise in every line of business focusing on quick wins and working with central ML division Example organizations • Large tech companies • ML-focused startups Advantages • Best data access: data thinking permeates the org • Recruiting: ML team works on hardest problems • Easiest deployment: product teams understand ML Dis- advantages • Hard to implement • Challenging & expensive to recruit enough talent • Culturally difficult to embed ML thinking everywhere ML Teams - orgs
  • 30. Full Stack Deep Learning - UC Berkeley Spring 2021 ML team structures - design choices 30 Software engineering vs research Data ownership Model ownership Key questions • To what extent is the ML team responsible for building or integrating with software? • How important are SWE skills on the team? • How much control does the ML team have over data collection, warehousing, labeling, and pipelining? • Is the ML team responsible for deploying models into production? • Who maintains deployed models? ML Teams - orgs
  • 31. Full Stack Deep Learning - UC Berkeley Spring 2021 ML team structures - design choices 31 Software engineering vs research Data ownership Model ownership • Research prioritized over SWE skills • Researcher-SWE collaboration lacking ML R&D • ML team has no control over data • ML team typically will not have data engineering component • Models are rarely deployed into production ML Teams - orgs
  • 32. Full Stack Deep Learning - UC Berkeley Spring 2021 ML team structures - design choices 32 Software engineering vs research Data ownership Model ownership Embedded ML • SWE skills prioritized over research skills • Often, all researchers need strong SWE as everyone expected to deploy • ML team generally does not own data production / mgmt • Work with data engineers to build pipelines • ML engineers own the models that they deploy into production ML Teams - orgs • Research prioritized over SWE skills • Researcher-SWE collaboration lacking ML R&D • ML team has no control over data • ML team typically will not have data engineering component • Models are rarely deployed into production
  • 33. Full Stack Deep Learning - UC Berkeley Spring 2021 ML team structures - design choices 33 Software engineering vs research Data ownership Model ownership ML Function • Each team has a strong mix of SWE and research skills • SWE and researchers work closely together within team • ML team has a voice in data governance discussions • ML team has strong internal data engineering function • ML team hands off models to user, but is responsible for maintaining them ML Teams - orgs Embedded ML • Research prioritized over SWE skills • Researcher-SWE collaboration lacking ML R&D • ML team has no control over data • ML team typically will not have data engineering component • Models are rarely deployed into production • SWE skills prioritized over research skills • Often, all researchers need strong SWE as everyone expected to deploy • ML team generally does not own data production / mgmt • Work with data engineers to build pipelines • ML engineers own the models that they deploy into production
  • 34. Full Stack Deep Learning - UC Berkeley Spring 2021 ML team structures - design choices 34 Software engineering vs research Data ownership Model ownership Embedded ML • Research prioritized over SWE skills • Researcher-SWE collaboration lacking ML R&D ML Function ML First • ML team has no control over data • ML team typically will not have data engineering component • Models are rarely deployed into production • SWE skills prioritized over research skills • Often, all researchers need strong SWE as everyone expected to deploy • Each team has a strong mix of SWE and research skills • SWE and researchers work closely together within team • Different teams are more or less research oriented • Research teams collaborate closely with SWE teams • ML team generally does not own data production / mgmt • Work with data engineers to build pipelines • ML engineers own the models that they deploy into production • ML team has a voice in data governance discussions • ML team has strong internal data engineering function • ML team hands off models to user, but is responsible for maintaining them • ML team often owns company-wide data infrastructure • ML team hands off models to user, who operates and maintains them ML Teams - orgs
  • 35. Full Stack Deep Learning - UC Berkeley Spring 2021 ML team structures - design choices 35 Software engineering vs research Data ownership Model ownership Embedded ML • Research prioritized over SWE skills • Researcher-SWE collaboration lacking ML R&D ML Function ML First • ML team has no control over data • ML team typically will not have data engineering component • Models are rarely deployed into production • SWE skills prioritized over research skills • Often, all researchers need strong SWE as everyone expected to deploy • Each team has a strong mix of SWE and research skills • SWE and researchers work closely together within team • Different teams are more or less research oriented • Research teams collaborate closely with SWE teams • ML team generally does not own data production / mgmt • Work with data engineers to build pipelines • ML engineers own the models that they deploy into production • ML team has a voice in data governance discussions • ML team has strong internal data engineering function • ML team hands off models to user, but is responsible for maintaining them • ML team often owns company-wide data infrastructure • ML team hands off models to user, who operates and maintains them ML Teams - orgs
  • 36. Full Stack Deep Learning - UC Berkeley Spring 2021 Questions? 36 ML Teams - orgs
  • 37. Full Stack Deep Learning - UC Berkeley Spring 2021 Managing Module overview 37 Hiring • How to manage a ML team • How to hire ML engineers. How to get hired. Orgs • How ML teams are organized and how they fit into the broader organization Roles • ML-related roles and the skills they require ML Teams - managing
  • 38. Full Stack Deep Learning - UC Berkeley Spring 2021 Managing ML teams is challenging • It’s hard to tell in advance how hard or easy something is 38 ML Teams - managing
  • 39. Full Stack Deep Learning - UC Berkeley Spring 2021 Managing ML teams is challenging 39 https://medium.com/@l2k/why-are-machine-learning-projects-so-hard-to-manage-8e9b9cf49641 It’s hard to tell in advance how easy or hard something is ML Teams - managing
  • 40. Full Stack Deep Learning - UC Berkeley Spring 2021 Managing ML teams is challenging • It’s hard to tell in advance how hard or easy something is • ML progress is nonlinear • Very common for projects to stall for weeks or longer • In early stages, difficult to plan project because unclear what will work • As a result, estimating project timelines is extremely difficult • I.e., production ML is still somewhere between “research” and “engineering” 40 ML Teams - managing
  • 41. Full Stack Deep Learning - UC Berkeley Spring 2021 Managing ML teams is challenging • It’s hard to tell in advance how hard or easy something is • ML progress is nonlinear • There are cultural gaps between research and engineering • Different values, backgrounds, goals, norms • In toxic cultures, the two sides often don’t value one another 41 ML Teams - managing
  • 42. Full Stack Deep Learning - UC Berkeley Spring 2021 Managing ML teams is challenging • It’s hard to tell in advance how hard or easy something is • ML progress is nonlinear • There are cultural gaps between research and engineering • Leaders often don’t understand it 42 ML Teams - managing
  • 43. Full Stack Deep Learning - UC Berkeley Spring 2021 How to manage ML teams better • Do ML Project planning probabilistically 43 ML Teams - managing
  • 44. Full Stack Deep Learning - UC Berkeley Spring 2021 How to manage ML teams better • Do ML project planning probabilistically • From:
 
 
 
 
 • To:
 
 44 ML Teams - managing Task A Task C Task D Task E Task F Task G Week 1 Week 2 Week 3 Week 4
  • 45. Full Stack Deep Learning - UC Berkeley Spring 2021 How to manage ML teams better • Do ML project planning probabilistically • From:
 
 
 
 
 • To:
 
 45 ML Teams - managing Task A Task C Task D Task E Task F Task G Week 1 Week 2 Week 3 Week 4 Task A (50%) Task B (25%) Task C (50%) Week 1 Week 2 Week 3 Week 4 Task D (75%)
  • 46. Full Stack Deep Learning - UC Berkeley Spring 2021 How to manage ML teams better • Do ML project planning probabilistically • From:
 
 
 
 
 • To:
 
 46 ML Teams - managing Task A Task C Task D Task E Task F Task G Week 1 Week 2 Week 3 Week 4 Task A (50%) Task B (25%) Task C (50%) Week 1 Week 2 Week 3 Week 4 Task D (75%)
  • 47. Full Stack Deep Learning - UC Berkeley Spring 2021 How to manage ML teams better • Do ML project planning probabilistically • From:
 
 
 
 
 • To:
 
 47 ML Teams - managing Task A Task C Task D Task E Task F Task G Week 1 Week 2 Week 3 Week 4 Task A Task B Task C Week 1 Week 2 Week 3 Week 4 Task D
  • 48. Full Stack Deep Learning - UC Berkeley Spring 2021 How to manage ML teams better • Do ML project planning probabilistically • From:
 
 
 
 
 • To:
 
 48 ML Teams - managing Task A Task C Task D Task E Task F Task G Week 1 Week 2 Week 3 Week 4 Task A Task B Task C Week 1 Week 2 Week 3 Week 4 Task E (50%) Task F (50%) Task G (10%) Task D (75%)
  • 49. Full Stack Deep Learning - UC Berkeley Spring 2021 How to manage ML teams better • Do ML Project planning probabilistically • Attempt a portfolio of approaches • Measure progress based on inputs, not results • Have researchers and engineers work together • Get end-to-end pipelines together quickly to demonstrate quick wins • Educate leadership on ML timeline uncertainty 49 ML Teams - managing
  • 50. Full Stack Deep Learning - UC Berkeley Spring 2021 Resources for educating execs • https://a16z.com/2016/06/10/ai-deep-learning-machines/ • Pieter’s upcoming AI Strategy class:
 https://emeritus-executive.berkeley.edu/artificial-intelligence/ 50 ML Teams - managing
  • 51. Full Stack Deep Learning - UC Berkeley Spring 2021 Questions? 51 ML Teams - managing
  • 52. Full Stack Deep Learning - UC Berkeley Spring 2021 Managing Module overview 52 Hiring • How to manage a ML team • How to hire ML engineers. How to get hired. Orgs • How ML teams are organized and how they fit into the broader organization Roles • ML-related roles and the skills they require ML Teams - hiring
  • 53. Full Stack Deep Learning - UC Berkeley Spring 2021 Hiring for ML - outline • The AI Talent Gap • Sourcing • Interviewing • Finding a job 53 ML Teams - hiring
  • 54. Full Stack Deep Learning - UC Berkeley Spring 2021 Hiring for ML - outline • The AI Talent Gap • Sourcing • Interviewing • Finding a job 54 ML Teams - hiring
  • 55. Full Stack Deep Learning - UC Berkeley Spring 2021 The AI Talent Gap 55 How many people know how to build AI systems? 5,000 (actively publishing research [Element AI]) 10,000 (estimated num people with the right skillset [Element AI]) 22,000 (PhD-educated AI researchers [Bloomberg]) 90,000 (upper bound on number of people [Element AI]) 200,000 - 300,000 (Number of AI researcher / practitioners [Tencent]) 3.6M (Number of software developers in the US) 18.2M (Number of software developers in the world) Sources: The AI Talent Shortage (Nikolai Yakovenko) https://medium.com/@Moscow25/the-ai-talent-shortage-704d8cf0c4cc Just How Shallow is the Artificial Intelligence Talent Pool (Jeremy Kahn) 
 https://www.bloomberg.com/news/articles/2018-02-07/just-how-shallow-is-the-artificial-intelligence-talent-pool ML Teams - hiring
  • 56. Full Stack Deep Learning - UC Berkeley Spring 2021 56 The AI talent gap Fierce competition for AI talent “Everyone agrees that the competition to hire people who know how to build artificial intelligence systems is intense. It’s  turned once- staid academic conferences into frenzied meet markets for corporate recruiters and driven the salaries of the top researchers to seven- figures.” 
 (Bloomberg) Sources: The AI Talent Shortage (Nikolai Yakovenko) https://medium.com/@Moscow25/the-ai-talent-shortage-704d8cf0c4cc Just How Shallow is the Artificial Intelligence Talent Pool (Jeremy Kahn) 
 https://www.bloomberg.com/news/articles/2018-02-07/just-how-shallow-is-the-artificial-intelligence-talent-pool ML Teams - hiring
  • 57. Full Stack Deep Learning - UC Berkeley Spring 2021 57 The AI talent gap Fierce competition for AI talent “Hiring is crazy right now. ML is a young field that got popular very quickly. There’s a ton of demand and not a lot of supply.” 
 (Computer Vision Engineer at Series C startup) ML Teams - hiring
  • 58. Full Stack Deep Learning - UC Berkeley Spring 2021 58 The AI talent gap Fierce competition for AI talent “Hiring for ML is really challenging and takes way more time and effort than we expected. We have someone working on it full-time and we’re still only able to get a few people per quarter” (Startup Founder) ML Teams - hiring
  • 59. Full Stack Deep Learning - UC Berkeley Spring 2021 Hiring for ML - outline • The AI Talent Gap • Sourcing • Interviewing • Finding a job 59 ML Teams - hiring
  • 60. Full Stack Deep Learning - UC Berkeley Spring 2021 Most common ML roles • ML product manager • DevOps • Data engineer • ML engineer • ML researcher / ML scientist • Data scientist 60 ML Teams - hiring
  • 61. Full Stack Deep Learning - UC Berkeley Spring 2021 Most common ML roles • ML product manager • DevOps • Data engineer • ML engineer • ML researcher / ML scientist • Data scientist 61 Slightly different mindset required. 
 Helpful to look for demonstrated interest in AI - courses, conferences, re-implementations, etc ML Teams - hiring
  • 62. Full Stack Deep Learning - UC Berkeley Spring 2021 Most common ML roles • ML product manager • DevOps • Data engineer • ML engineer • ML researcher / ML scientist • Data scientist 62 Our focus ML Teams - hiring
  • 63. Full Stack Deep Learning - UC Berkeley Spring 2021 How to hire MLEs - the wrong way • Job Description (Unicorn Machine Learning Engineer) • Duties • Keep up with the state of the art • Implement models from scratch • Deep understanding of mathematics & ability to come up with new models • Build tooling & infrastructure for the ML team • Build data pipelines for the ML team • Deploy & monitor models into production • Requirements • PhD • At least 4 years tensorflow experience • At least 4 years as a software engineer • Publications in top ML conference • Experience building large-scale distributed systems 63 ML Teams - hiring
  • 64. Full Stack Deep Learning - UC Berkeley Spring 2021 How to hire MLEs - the right way • Hire for software engineering skills, interest in ML, and desire to learn. Train to do ML. • Go more junior. Most undergrad computer science students graduate with ML experience. • Be more specific about what you need. Not every ML engineer needs to do DevOps. 64 ML Teams - hiring
  • 65. Full Stack Deep Learning - UC Berkeley Spring 2021 How to hire MLRs • Look for quality of publications, not quantity (e.g., originality of ideas, quality of execution) • Look for researchers with an eye for working on important problems (many researchers focus on trendy problems without considering why they matter) • Look for researchers with experience outside of academia • Consider hiring talented people from adjacent fields (physics, statistics, math) • Consider hiring people without PhDs (e.g., talented undergraduate / masters students, graduates of Google/Facebook/OpenAI fellowship programs, dedicated self-studiers) 65 ML Teams - hiring
  • 66. Full Stack Deep Learning - UC Berkeley Spring 2021 How to find MLE/MLR candidates • Standard sources: LinkedIn, recruiters, on-campus recruiting, etc • Monitor arXiv and top conferences and flag first authors of papers you like • Look for good reimplementations of papers you like • Attend ML research conferences (NeurIPS, ICLR, ICML) 66 ML Teams - hiring
  • 67. Full Stack Deep Learning - UC Berkeley Spring 2021 How to attract MLR / MLE candidates 67 What do machine learning practitioners want? How to make your company stand out? • Work with cutting edge tools & techniques • Build skills / knowledge in an exciting field • Work with excellent people • Work on interesting datasets • Do work that matters • Work on research-oriented projects. Publicize them. Invest in tooling for your team & empower employees to try new tools. • Build team culture around learning (reading groups, learning days, professional development budget, conference budget) • Hire high-profile people. Help your best people build their profile through publishing blogs & papers. • Sell the uniqueness of your dataset in recruiting materials. • Sell the mission of your company and potential impact of machine learning on that mission. Work on projects that have a tangible impact today. ML Teams - hiring
  • 68. Full Stack Deep Learning - UC Berkeley Spring 2021 Hiring for ML - outline • The AI Talent Gap • Sourcing • Interviewing • Finding a job 68 ML Teams - hiring
  • 69. Full Stack Deep Learning - UC Berkeley Spring 2021 What to test in an ML interview? • Hire for strengths • Meet a minimum bar for everything else 69 ML Teams - hiring
  • 70. Full Stack Deep Learning - UC Berkeley Spring 2021 What to test in an ML interview? • Validate your hypotheses of candidate’s strengths • Researchers: make sure they can think creatively about new ML problems, probe how thoughtful they were about previous projects • Engineers: make sure they are great generalist SWEs • Make sure candidates meet a minimum bar on weaker areas • Researchers: test SWE knowledge and ability to write good code • SWEs: test ML knowledge 70 ML Teams - hiring
  • 71. Full Stack Deep Learning - UC Berkeley Spring 2021 What happens in a ML interview? • Much less well-defined than software engineering interviews • Common types of assessments: • Background & culture fit • Whiteboard coding (similar to SWE interviews) • Pair coding (similar to SWE interviews) • Pair debugging (often ML-specific code) • Math puzzles (e.g., involving linear algebra) • Take-home ML project • Applied ML (e.g., explain how you’d solve this problem with ML) • Previous ML projects (e.g., probing on what you tried, why things did / didn’t work) • ML theory (e.g., bias-variance tradeoff, overfitting, underfitting, understanding of specific algorithms) 71 ML Teams - hiring
  • 72. Full Stack Deep Learning - UC Berkeley Spring 2021 Hiring for ML - outline • The AI Talent Gap • Sourcing • Interviewing • Finding a job 72 ML Teams - hiring
  • 73. Full Stack Deep Learning - UC Berkeley Spring 2021 Where to look for a ML job? • Standard sources: LinkedIn, recruiters, on-campus recruiting, etc • ML research conferences (NeurIPS, ICLR, ICML) • Apply directly (remember, there’s a talent gap!) • This course • Those who pass the exam will get access to our recruiting database 73 ML Teams - hiring
  • 74. Full Stack Deep Learning - UC Berkeley Spring 2021 How to stand out for ML roles? • Build software engineering skills (e.g., work at a well-known software company) • Exhibit interest in ML (e.g., conference attendance, online courses taken) • Show you have broad knowledge of ML (e.g., write blog posts synthesizing a research area) • Demonstrate ability to get ML projects done (e.g., create side projects, re- implement papers) • Prove you can think creatively in ML (e.g., win Kaggle competitions, publish papers) 74 More impressive ML Teams - hiring
  • 75. Full Stack Deep Learning - UC Berkeley Spring 2021 How to prepare for the interview? • Prepare for a general SWE interview (e.g., “Cracking the Coding Interview”) • Prepare to talk in detail about your past ML projects (remember details, prepare to talk about tradeoffs and decisions you made) • Review how basic ML algorithms work (linear / logistic regression, nearest neighbor, decision trees, k-means, MLPs, ConvNets, recurrent nets, etc) • Review ML theory • Think about the problems the company you’re interviewing with may face and what ML techniques may apply to them 75 ML Teams - hiring
  • 76. Full Stack Deep Learning - UC Berkeley Spring 2021 Conclusion 76 Orgs • ML teams are becoming more standalone, hence more interdisciplinary Roles • Lots of different skills involved in production ML, so there’s an opportunity for many to contribute Hiring • Talent is scarce, so be specific about what is must-have. It can be hard to break in as an outsider - use projects to build awareness. Managing • Managing ML teams is hard. There’s no silver bullet, but shifting toward probabilistic planning can help ML Teams - hiring
  • 77. Full Stack Deep Learning - UC Berkeley Spring 2021 Thank you! 77