A workshop to demonstrate how we can apply agile and continuous delivery principles to continuously deliver value in machine learning and data science projects.
Code: https://github.com/davified/ci-workshop-app
7. TODAY’S PLAN
Share principles and practices that can make it easier for teams to iteratively deploy better ML products
Share what to strive towards, and how to strive towards it
8. ● Questions are welcome (esp. if we start speaking Greek)
● Use the stickies
○ Red: “I need help!”
○ Yellow: “You’re using too much jargon!”
● Parking lot
● Cross-talking
● Punctuality
SOME GROUND RULES
9. TODAY’S SCHEDULE
Time | Session
09.00am - 09.30am | Debugging setup (if anyone needs help)
09.30am - 10.30am | Intro to agile + continuous intelligence
10.45am - 11.00am | Learn enough Docker to be dangerous
11.00am - 12.30pm | Dojo: Hands-on exercise for continuous intelligence
12.30pm - 1.30pm | Lunch
1.30pm - 3.00pm | User experience
3.15pm - 4.45pm | Dojo: You can’t do continuous delivery without unit tests
5.00pm - 5.30pm | General discussion + Q&A
10. LEARNING CHECKLIST
❏ Sessions
❏ Environment management with Docker
❏ User experience / product thinking
❏ Continuous integration + Continuous delivery
❏ Test pyramid and unit testing
❏ General discussion
❏ Reuse data processing pipelines to reduce complexity and training-serving skew
❏ Explainability
❏ Model tracking and monitoring strategies (request shadowing)
❏ Monitor what we care about (metrics, business outcomes, fairness)
❏ Closing the data collection loop
❏ Cross-functional teams
❏ Kaizen health checklists
❏ etc.
12. ● What do we mean by agile?
● Why should we apply agile to machine learning?
● Pain points in machine learning
● How can agile + continuous delivery practices solve these pain points?
SESSION PLAN
15. AGILE VS. WATERFALL
[1] Royce, Winston. "Managing the Development of Large Software Systems", Proceedings of IEEE WESCON 26 (August): 1–9, 1970.
[2] Bell, Thomas E., and T. A. Thayer. "Software requirements: Are they really a problem?", Proceedings of the 2nd International Conference on Software Engineering. IEEE Computer Society Press, 1976.
17. A WATERFALL RELEASE
[Diagram] design → code → test → release; deployment issues, defects and product changes all surface at the end.
18. AGILE IN 1 MINUTE
Deliver value continuously through working software
Shorten feedback loops
Technical practices
19. AGILE MANIFESTO
… we have come to value:
Individuals and interactions over processes and tools
Working software over comprehensive documentation
Customer collaboration over contract negotiation
Responding to change over following a plan
That is, while there is value in the items on the right, we value the items on the left more.
24. WE GOT 99 PROBLEMS AND MACHINE LEARNING AIN’T ONE
We don’t have a machine learning problem. We have a {UX, business, data, software delivery, ML} problem.
Source: Machine Learning: The High-Interest Credit Card of Technical Debt (Google, 2015)
26. THOUGHTWORKS’ APPROACH TO MACHINE LEARNING / ARTIFICIAL INTELLIGENCE
DATA STRATEGY: Uncovering data opportunities and guiding the vision for transforming organizations to become data-led.
DATA PLATFORM ENGINEERING: Ability to design and build data platforms, collecting, streaming and managing enterprise-wide data, ready for analysis.
DATA INSIGHTS: Gain insights from data to inform decision making, including descriptive and diagnostic analytics.
MACHINE INTELLIGENCE: Leveraging machine learning techniques to exhibit intelligent behavior, and take autonomous actions from data insights.
Underpinning all of the above: SOFTWARE EXCELLENCE AND PRODUCT THINKING
27. DELIVERING VALUE - A SLICE AT A TIME
Source: The AI hierarchy of needs
28. ML PRODUCT DEVELOPMENT SHOULD BE ITERATIVE, NOT BIG BANG
[Diagram] Idea → PoC → Make it simpler → Test in lab → Deploy to prod (dark launch or A/B) → Collect, evaluate → Model iteration → Repeat! Each iteration adds value, turning “uncertain how to add value” into “clear value add”.
30. Pain points between “Dan the data scientist” and an ML model in production:
● “Works on my machine!”
● No data / Data is everywhere but nowhere
● Data stitching (web scraping, API calls, CSV files)
● Not allowed to have production data on my laptop
● Untitled 17.ipynb
● Deployment / infra work is hard
● Hard to keep track of hyperparameters & metrics
● QA is hard
● Is my model doing OK? When should I retrain / re-release?
● Training-serving skew
● Interpretability
● “I want 100% accuracy”
● Harmful models in prod
● Users not using our product
● Reproducibility
31. How can agile + continuous delivery practices solve these problems?
32. WHAT PROBLEMS DOES CONTINUOUS DELIVERY SOLVE?
(The same pain points canvas as slide 30.)
33. CONTINUOUS DELIVERY PIPELINE
[Diagram] Local env → push → Source code repository → trigger → Run unit tests → Train and evaluate model → Deploy candidate model to STAGING → Deploy model to PROD, with feedback flowing back to the local env at every stage. The pipeline reads from a data / feature repository and publishes trained models to a model repository.
Source: Continuous Delivery (Jez Humble, Dave Farley)
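As a rough illustration of what the “train and evaluate” stage might execute, here is a minimal sketch in Python. The dataset, metric, threshold and file names are illustrative assumptions, not part of the workshop repo:

# evaluate_candidate.py - hypothetical commit-stage script: train a model,
# evaluate it, persist the candidate artifact, and fail the build on regression.
import json
import sys

import joblib
from sklearn.datasets import load_diabetes
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

RMSE_THRESHOLD = 60.0  # illustrative baseline; in practice, load from config

X, y = load_diabetes(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

model = LinearRegression().fit(X_train, y_train)
rmse = mean_squared_error(y_test, model.predict(X_test)) ** 0.5

# Persist the candidate model and its metrics for downstream pipeline stages
joblib.dump(model, "model.joblib")
with open("metrics.json", "w") as f:
    json.dump({"rmse": rmse}, f)

# A non-zero exit code fails the stage, so the CI server never promotes
# a model that performs worse than the agreed baseline
if rmse > RMSE_THRESHOLD:
    print(f"RMSE {rmse:.2f} exceeds threshold {RMSE_THRESHOLD}; failing build")
    sys.exit(1)
print(f"RMSE {rmse:.2f} within threshold; candidate ready for staging")

The CI server (whichever tool you use) only needs to run this script on every push; the exit code is what gates promotion to staging.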
34. ANATOMY OF A PIPELINE
● Pipeline
○ Commit stage
■ build
■ run unit tests
■ train model
■ evaluate model → artifact
○ Deploy to staging
■ deploy
■ policy layer tests (see the sketch after this list)
■ fairness tests
■ adversarial tests
■ etc.
○ Deploy to prod
■ deploy
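To make the staging-stage checks concrete, here is a hypothetical policy-layer test in pytest style. The loan-approval scenario, model stub and function names are invented for illustration; they are not from the workshop repo:

# test_policy_layer.py - hypothetical test asserting that a hard business rule
# holds no matter what the model predicts ("don't leave critical things to
# probability").

class AlwaysApproveModel:
    """Worst-case stub: a model that approves every applicant."""
    def predict(self, rows):
        return [1 for _ in rows]

def approve_loan(model, applicant_features, applicant_age):
    # Policy layer: a deterministic rule wrapped around the model's prediction
    if applicant_age < 18:
        return False
    return bool(model.predict([applicant_features])[0])

def test_minors_are_never_approved_even_if_model_says_yes():
    model = AlwaysApproveModel()
    assert approve_loan(model, [0.5] * 10, applicant_age=17) is False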
37. CONTINUOUS DELIVERY CHECKLIST FOR ML PRODUCTS
Source control
● All code changes / model parameter changes are checked into source control
● Trunk-based development
● Code reuse
Configuration management
● Automated configuration management across environments (local, qa, prod)
Data processing
● Feature engineering is done through reusable data pipelines (see the sketch after this checklist)
● Data ingestion pipelines are mature, tested, reusable and automatable
● Regularly used features are precompiled into a feature ‘store’
● Collecting more and better training data from production, with every release (garbage in, garbage out problem)
● Create data turking systems for labelling new data (necessary for monitoring and re-training)
● Self-service data access
● Data access control
Training
● Automated infrastructure provisioning and configuration for model training
● Distributed training where necessary
Testing
● Test pyramid (unit tests, functional tests, policy layer tests, exploratory tests)
● Bias testing and fairness testing
● Adversarial testing
● Established baseline for evaluating model performance
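One hedged sketch of the “reusable data pipelines” point: packaging feature engineering together with the model (here using scikit-learn’s Pipeline; the dataset and estimator are illustrative) guarantees that training and serving run the exact same preprocessing code, removing one common source of training-serving skew:

# pipeline_example.py - feature engineering bundled with the model so the
# identical transformation runs at training time and at serving time.
import joblib
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)

# Preprocessing + model ship as one artifact: no hand-copied scaling code
# in the serving app, hence nothing to drift out of sync.
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("clf", LogisticRegression(max_iter=1000)),
])
pipeline.fit(X, y)
joblib.dump(pipeline, "pipeline.joblib")

# Serving side: load the artifact and predict on raw, unscaled features.
served = joblib.load("pipeline.joblib")
print(served.predict(X[:3]))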
38. CONTINUOUS DELIVERY CHECKLIST FOR ML PRODUCTS
Artifact versioning
● All trained models are artifacted and versioned
● Tag artifacts with relevant metadata (e.g. training data, hyperparameters, datetime); see the sketch after this checklist
Deployment
● Set up continuous delivery pipeline
● Tracer bullet: start with a simple model + features
● Single-command deployments
● Disaster recovery: (single-command) deployment of last good model in production
● Frequent deployments to production
Policy layer
● Don’t leave critical things to probability (Use rules / heuristics instead)
Monitoring
● Understand model performance in production using canary releases
● Monitor business metrics
● Monitor ML metrics (e.g. RMSE) by running tests (i) on models, (ii) using the latest prod data
● Monitor anything that helps model interpretation (e.g. confusion matrices)
● Alerts and automated retraining of model candidates when/before performance begins to slip
Workflow
● Build cross-functional teams (UX, BA, DS, DE, DEV, etc)
● Iterative development lifecycle
Regular health checks
● How much calendar time to deploy a model from staging to production?
● How much calendar time to add a new feature to the production model?
● How comfortable does your team feel about iteratively deploying models?
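A small, library-free sketch of the artifact-versioning items above (file layout and metadata fields are illustrative assumptions, not a prescribed format):

# save_artifact.py - hypothetical model versioning: store the artifact next to
# the metadata needed to reproduce and audit it.
import hashlib
import json
from datetime import datetime, timezone

def save_with_metadata(model_bytes, hyperparameters, training_data_version):
    # Content-addressed version id, so identical models get identical ids
    version = hashlib.sha256(model_bytes).hexdigest()[:12]
    with open(f"model-{version}.bin", "wb") as f:
        f.write(model_bytes)
    metadata = {
        "version": version,
        "trained_at": datetime.now(timezone.utc).isoformat(),
        "hyperparameters": hyperparameters,
        "training_data_version": training_data_version,
    }
    with open(f"model-{version}.json", "w") as f:
        json.dump(metadata, f, indent=2)
    return version

if __name__ == "__main__":
    # Example usage with dummy bytes and illustrative hyperparameters
    v = save_with_metadata(b"serialized-model", {"C": 1.0}, "2019-06-01-snapshot")
    print(f"Stored model version {v}")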
Editor’s Notes
Let me share ThoughtWorks’ approach to machine learning
We’re a software consultancy company that specialises in agile software delivery.
Check out our website. We have insights articles, podcasts, tech radar and more.
Voiceover:
We are not just implementers. We are creators, authors, bloggers and speakers who are constantly pushing the state of the art and championing the development of an ecosystem that brings competitive advantage to the enterprise.
We are proud to help shape the community by leveraging our learnings and experiences in delivering complex systems to create thought leadership.
We’ve also written over 80 books on topics ranging from programming languages to architecture, software engineering patterns and practices, continuous delivery, data management, analytics, experience design and building adaptive and responsive organisations. We wrote the book on complex delivery, literally.
Our Technology Radar is read by over 25,000 people on the day it’s released every quarter. In it, we review new commercial and open-source technologies on the horizon and discuss their applicability to the enterprise, based upon our own practical experience. We make recommendations for where you might gain competitive advantage by adopting some new technologies early, and perhaps avoid pitfalls by waiting to adopt others.
Notes:
Caveat: I’ve only been in ThoughtWorks for 2 years, so I haven’t been on many projects. But what I share is based on my experience seeing/being on data projects and, more importantly, on our collective experience as ThoughtWorks in this space. Think of it as what we know from projects around the globe - Australia, US, Singapore, etc.
Warn them that the workshop will be quite interactive
Go through schedule
What is a dojo: A school for training in Japanese arts of self-defense, such as judo and karate.
Warn them that the workshop will be quite interactive
Ask the audience: “what does it mean to be agile?”
1970: Long feedback loops – by the time we find problems, it’s too late to fix them.
“the implementation described above is risky and invites failure.”
Royce: "Figure 10 summarizes the five steps that I feel necessary to transform a risky development process into one that will provide the desired product. I would emphasize that each item costs some additional sum of money. If the relatively simpler process without the five complexities described here would work successfully, then of course the additional money is not well spent. In my experience, however, the simpler method has never worked on large software development efforts and the costs to recover far exceeded those required to finance the five-step process listed.”
“I believe in this concept, but the implementation described above is risky and invites failure.”
CALL OUT: What problems are likely with this approach?
All the problems come at the end
Agile is as the word suggests: the ability to be quick.
It’s not just rituals and ceremonies.
When users’ needs change, we should be able to change code and get it into production within days or weeks.
Agile prevents this by delivering value iteratively, rather than in big bang approaches.
Technical practices:
Automated testing, TDD, unit testing
Test pyramid
Infrastructure automation
DevOps
CI pipelines
Softer practices
Retrospectives
Feedback
Tools are important, but should never be at the expense of people.
2 types of documentation – how to use, how to maintain
Better to work together as partners
FEEDBACK LOOPS
Some of you might have seen this picture before. It’s from Google (2015). It highlights the problems that we have faced before.
All we wanted to do was to train some machine learning model and feel awesome, but then we encounter countless challenges (name some of them...)
Complexity is inevitable. That's why we need to build software that constantly manages and partitions complexity.
So that we can keep growing software that can evolve and adapt to changing requirements
This is what it feels like to be doing machine learning, most of the time. (let gif play)
We want to train a model, and we find out we need data first. To get data, we need to wait for access clearance.
All we wanted to do was to train some machine learning model and feel awesome, but then we encounter countless challenges (name some of them...)
Complexity is inevitable. That's why we need to build software that constantly manages and partitions complexity.
So that we can keep growing software that can evolve and adapt to changing requirements
We often find ourselves in a high-experimentation, low-engineering environment; while that’s good for experimentation, it’s bad for delivering high-quality software to users. The ML products we’ve delivered sit at the low-experimentation, high-engineering end of that spectrum. That allowed us to make code easy to change, maintain, etc.
(animate picture to disappear)
Stages of maturity / readiness for ML. Just as we can’t expect a baby to “just” walk, we can’t expect an organisation to just “do ML”.
ML is not a data scientists’ capability; it’s an organisational capability
There need to be prior capabilities, people, processes, infrastructure and tooling to enable this. And at ThoughtWorks, we believe that the best way is to find a problem to solve, and grow that capability organically, by building working software.
Data strategy - align intelligence to the goals, provide right org structure/incentives to get different teams to share data to unlock the value.
Go through the idea of thin slices
Underlying it all, to consider ethical consequences.
We often hear that organisations have many PoCs, but they are unsure how to derive value (HSBC 150 AI prototypes?)
With an approach to continuous intelligence, we can turn this problem on its head, because we can safely deploy experiments to production
So we start by finding the simplest thing that adds value and figure out how to deploy that - you might not have the infrastructure yet, but the thought experiment at least resolves some of the uncertainty of the pure PoC approach, and the assumption that production deployment must be costly
This thinking shows how you get on the ladder on the previous slide
So in a few minutes we will be deep diving into one of the practices that help us be agile - continuous delivery.
Such practices solve particular problems, and I think we can better appreciate the value of the practices if we know what kinds of problems / pain points it can solve
So i want to take a few minutes to do a short pain points canvassing.
(read the room, and consider doing an interactive canvassing exercise if the audience is ready)
After pain points canvas slide - so is there a way out of this pain?
3 things
Deployment pipelines
Data processing pipelines
Monitoring
Some of these problems that we face in the ML space are solved problems in the software engineering space.
Repeat the game 3x
Moral of the game: We deliver value faster if we
Continuously integrate/push small chunks of code
Reduce wait time between team members (data scientists, devs, QA, Ops, etc)
Pushing code is easy. But ensuring quality, and that the code delivers value for users, is not a given. For that, we need the CD pipeline.
I’m gonna talk about the key concepts in CD pipeline
Continuous delivery pipeline. It gives us fast feedback
30 seconds - quick overview of this.
The model goes through different stages
Each of them solves a different problem, which we’ll talk about next
Generalizable approach: we can see it working for classifiers, regression models, deep learning models, NLP models, etc.
Tracer bullet - build this pipeline as early as possible in your project, so that you can put models in production as soon as possible, and every subsequent deploy is as simple as a push and a click
Last point - as the artifact goes down this pipeline, our confidence in it increases
Share the litmus test of continuous delivery: when you can deploy from a beach
3 things
Deployment pipelines
Data processing pipelines
Monitoring