http://en.wikipedia.org/wiki/Portraits_of_Shakespeare
A/B Testing 

and the Infinite
Monkey Theorem.
Lukasz Twardowski
www.useitbetter.com
a monkey hitting keys
at random for an
infinite amount of time
will almost surely type
the complete works of
William Shakespeare.
a monkey A/B testing
for an
infinite amount of time
will almost surely
reach the conversion rate
of Amazon.
A/B testing
helps find out which
of two versions performs better
while running simultaneously.
THEORY
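Running two versions simultaneously means each visitor has to be assigned to one of them, and should keep seeing the same one. A minimal sketch of one common way to do that, hashing a user identifier into a bucket; the function name, the 50/50 split, and the experiment label are assumptions for illustration, not something the slides prescribe:

```python
import hashlib


def assign_variant(user_id: str, experiment: str) -> str:
    """Deterministically assign a user to variant 'A' or 'B'.

    The same user always gets the same variant for a given experiment,
    and different experiments split users independently.
    """
    key = f"{experiment}:{user_id}".encode("utf-8")
    digest = hashlib.sha256(key).hexdigest()
    # Interpret the first 8 hex digits as an integer; even -> A, odd -> B.
    return "A" if int(digest[:8], 16) % 2 == 0 else "B"


if __name__ == "__main__":
    # Hypothetical experiment name and user ids.
    counts = {"A": 0, "B": 0}
    for i in range(10_000):
        counts[assign_variant(f"user-{i}", "checkout-button")] += 1
    print(counts)  # the two buckets should be close to an even split
```

Because assignment depends only on the identifier, both versions run over the same days and the same traffic mix, which is the point of testing simultaneously.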
We do this
because every day is different,
unlike in the Groundhog Day movie.
Groundhog Day (1993, Dir. Harold Ramis)
http://nerds.airbnb.com/experiments-at-airbnb/
A single change, bad or good, will not change a trend.
Unless a change is A/B tested,
you won’t know its impact.
Why the monkey
metaphor?
The industry average hit rate
for A/B testing
=
14%
Just 1 out of 7 A/B tests
is successful!
http://conversionxl.com/ab-tests-fail/
Provide the benchmark:
EXERCISE 1.
King Kong (1933, Dir. Merian Cooper, Ernest Schoedsack)
How to
be the greatest
monkey in the biz
if infinity is not an
option?
Be a quick
monkey.
How to be the best monkey in the biz?
1 out of 7 tests wins
x 2 weeks per test
=
slow growth
Do the math:
EXERCISE 2.
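The arithmetic behind this exercise can be sketched in a few lines. The 1-in-7 hit rate and the 2 weeks per test come from the slides; the 52-week year and the figure of 10 parallel tests are assumptions added for illustration:

```python
def weeks_per_win(hit_rate: float, weeks_per_test: float) -> float:
    """Expected calendar weeks per winning test when tests run one at a time."""
    return weeks_per_test / hit_rate


def expected_wins_per_year(hit_rate: float, weeks_per_test: float,
                           parallel_tests: int = 1,
                           weeks_per_year: int = 52) -> float:
    """Expected number of winning tests in a year."""
    tests_per_year = parallel_tests * weeks_per_year / weeks_per_test
    return tests_per_year * hit_rate


if __name__ == "__main__":
    hit_rate = 1 / 7
    print(round(weeks_per_win(hit_rate, 2), 1))               # 14.0 weeks per win
    print(round(expected_wins_per_year(hit_rate, 2), 1))      # 3.7 wins per year
    # Assumed: 10 tests running in parallel.
    print(round(expected_wins_per_year(hit_rate, 2, 10), 1))  # 37.1 wins per year
```

One test at a time yields fewer than four wins a year, which is why the number of parallel tests matters more than anything else here.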
Unless you experiment
at scale.
The currency in which
you pay for A/B tests
is traffic. The more you
have, the more tests
you can run. Never
waste what you have.
Shop Direct
Scaled to 101
experiments a month
in two years.
100+ year old company
Etsy
25 releases a day,
most of them are 

A/B tests.
A startup launched in 2005
http://www.slideshare.net/danmckinley/design-for-continuous-experimentation(linkedin)
Zero Tests Per Month.
Here’s the test idea,
numbers and execution.
Can we proceed?
Let’s meet to
discuss. Maybe
next week?
Looks good.
Will check with Z
and get back to you.
So here’s the test idea,
numbers…
Sorry,
had other priorities.
Can we meet
next week?
Sure! (D***!)
Have you
checked with Z?
Have you…?
Have you…?
Ground rules: 

1. Test ideas are
subject to prioritization,
not approval.
evidence
x opportunity size
x strategy
=
priority
Magic formula:
EXERCISE 3.
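The formula above can be turned into a simple ranking of the test backlog. This is only a sketch: the slides do not say how each factor is scored, so the 1-to-5 scales, the `Idea` class, and the example ideas are all hypothetical:

```python
from dataclasses import dataclass


@dataclass
class Idea:
    name: str
    evidence: int     # assumed scale: 1 (a hunch) to 5 (strong data)
    opportunity: int  # assumed scale: 1 (small) to 5 (large)
    strategy: int     # assumed scale: 1 (off-strategy) to 5 (core focus)

    @property
    def priority(self) -> int:
        # priority = evidence x opportunity size x strategy
        return self.evidence * self.opportunity * self.strategy


def prioritize(ideas: list[Idea]) -> list[Idea]:
    """Order ideas from highest to lowest priority; nothing is rejected."""
    return sorted(ideas, key=lambda idea: idea.priority, reverse=True)


if __name__ == "__main__":
    backlog = [
        Idea("product video links", evidence=3, opportunity=2, strategy=1),
        Idea("checkout copy", evidence=2, opportunity=4, strategy=3),
        Idea("button color", evidence=1, opportunity=1, strategy=1),
    ]
    for idea in prioritize(backlog):
        print(idea.priority, idea.name)
```

The ranking only decides order. Every idea stays in the queue, so the lowest-scoring one still runs once resources free up.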
The worst idea gets tested
if resources are available.
101 Tests Per Month.
Ok then, we’ll
do this, this
and that test.
Others will wait.
Guys, our
strategy shifted
to checkout
optimization.
Guys, we
need to increase
basket value.
Now this
and that one…
And this…
These two
would work…
Xmas is
coming!
DO NOTHING!
…this, this
and that…
Ground rules: 

2. Accept the fact that
things will go wrong.
Cheat like
a monkey.
How to be the best monkey in the biz?
If 1 out of 7 tests
wins, what about the
other 6?
https://www.groovehq.com/blog/failed-ab-tests
What was the result of the

Button Colors Test by Groove?
EXERCISE 1.
If 1 out of 7 tests
wins, what about the
other 6? 5 of them
will be inconclusive.
Most tests are inconclusive because:
a) too few users used the changed
feature for the test to reach statistical significance.
b) the changed feature had little to do with
the metrics used to evaluate the test.
c) there were multiple changes in the same
test and they cancelled each other out.
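Reason (a) can be checked before a test is launched. The sketch below uses the standard normal-approximation sample size formula for comparing two conversion rates. The 5% significance level, 80% power, and the example baseline and lift are assumptions, not figures from the slides:

```python
from math import ceil
from statistics import NormalDist


def users_per_variant(baseline: float, relative_lift: float,
                      alpha: float = 0.05, power: float = 0.8) -> int:
    """Approximate users needed in EACH variant to detect the given lift.

    baseline      -- current conversion rate, e.g. 0.03 for 3%
    relative_lift -- smallest relative change worth detecting, e.g. 0.10
    """
    p1 = baseline
    p2 = baseline * (1 + relative_lift)
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # two-sided test
    z_beta = NormalDist().inv_cdf(power)
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return ceil((z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2)


if __name__ == "__main__":
    # Assumed example: 3% baseline conversion, 10% relative lift.
    print(users_per_variant(0.03, 0.10))
    # Halving the detectable lift roughly quadruples the traffic needed.
    print(users_per_variant(0.03, 0.05))
```

With these assumed inputs the first answer is on the order of 50,000 users per variant. Only users who actually reach the changed feature count toward that number, so a change on a rarely used feature may never get there.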
Complete the sentence:
EXERCISE 4.
A/B testing is NOT about __________. (Answer: making money.)
You do it to find out
what works and how well.
You can successfully
run tests that have no
chance of success.
… removing a feature
… slowing down the website
…
Cheat: Experiment to
test significance.
Test results show that…
… removing the feature
didn’t reduce conversion.
… we shouldn’t
waste time on that.
Cheat: One change
per test. Order matters.
Select products, produce
videos, upload, add links,
launch test
Add links
Select products
Produce videos
…
INCONCLUSIVE
… people don’t
click “watch
video” links.
Cheat: Measure against
your hypothesis.
… adding videos
had no impact on
conversion.
INCONCLUSIVE
CONCLUSIVE
Test results show that…
A great presentation by Etsy:
goo.gl/WQpY65
The benefit you get
from A/B testing is
knowledge, not
revenue. Revenue will
come as a result of
applied knowledge.
Don’t be

a monkey.
How to be the best monkey in the biz?
Don’t be a gnome either.
What about this
1 test out of 7 that
fails?
http://conversionxl.com/ab-tests-fail/
3 out of 4 companies (that are
A/B testing) make changes based on
intuition or best practices.
50%
NOT A/B testing
50%
A/B testing
collect underpants + ?
=
profit
Solve equation:
EXERCISE 5.
A/B test
is launched.
Test results come
back negative.
The idea gets killed,
next test is
launched.
A/B Testing Flow
Fail Fast Approach
One failed test doesn’t
make collecting
underpants a bad idea.
Pre-test research
is done.
A/B test
is launched.
Users are surveyed
alongside the test.
Users’ behaviors
are logged.
Test results come
back negative.
Survey responses
give a clue why.
Respondents’
logs give
another clue.
Respondents
are emailed to
clarify the issue.
The issue is solved,
the test relaunched.
Example of A/B Testing Flow at Spotify
Prepare for failure.
Courtesy of @bendressler researcher at Spotify
The real price you pay
for not researching
why tests fail is the
death of great ideas.
Evidence-Led Flow
Insight and Evidence: Qual/Quant Analytics, User Testing, Voice of Customer
Hypothesis: I predict that doing B will change X by Y% because of Z.
Hypothesis Based A/B Testing
Metrics Based Evaluation: Are Metrics Good? Accepted / Rejected
Hypothesis check: What really happened?
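The hypothesis template ("doing B will change X by Y% because of Z") can be written down before a test starts, so the result is judged against the prediction and not only against a top-line metric. A hypothetical sketch: the `Hypothesis` class, the `evaluate` rule (significant and in the predicted direction), and the example values are illustrative assumptions, not the author's tooling:

```python
from dataclasses import dataclass


@dataclass
class Hypothesis:
    change: str           # B: what is being done
    metric: str           # X: the metric expected to move
    predicted_pct: float  # Y: predicted change in percent
    reason: str           # Z: the insight behind the prediction

    def statement(self) -> str:
        return (f"I predict that {self.change} will change {self.metric} "
                f"by {self.predicted_pct:+.1f}% because {self.reason}.")


def evaluate(hypothesis: Hypothesis, observed_pct: float,
             significant: bool) -> str:
    """Classify a result as 'accepted', 'rejected' or 'inconclusive'."""
    if not significant:
        return "inconclusive"
    same_direction = observed_pct * hypothesis.predicted_pct > 0
    return "accepted" if same_direction else "rejected"


if __name__ == "__main__":
    h = Hypothesis(
        change="adding product videos",
        metric="add-to-basket rate",
        predicted_pct=5.0,
        reason="shoppers want to see the product in use",
    )
    print(h.statement())
    print(evaluate(h, observed_pct=-2.0, significant=True))  # rejected
```

A "rejected" outcome is where the "What really happened?" step starts: the recorded reason says which assumption to investigate.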
UseItBetter - The Platform for
Evidence-Led Experimentation at Scale
Collect behavioral data.
Build segmentation rules.
Explore, analyze, visualize.
Quantify an opportunity.
Translate an insight into a test.
1TB Behavioural Raw Data, 40M Unique Interactions, 41 Sets of Rules Created
(average stats per website from the last month)
An analyst researching
for an infinite amount
of time will almost
surely get you to
100% hit ratio. Which
isn’t good either.
If you are going
to A/B test:
1. Never waste your traffic.
2. Many small changes are better than one big change.
3. Even the smallest change needs an insight.
4. Prepare for failure.
5. It’s OK to fail if you know why you failed.
6. Iterate.
7. Be honest.
For the sake of this presentation, I assumed that the results of the
7 tests I referred to had been read correctly by people who
are familiar with terms like statistical significance,
confidence intervals, p-values, etc.
Otherwise, it’s likely that the one winning test was just a phantom.
Disclaimer
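To make the disclaimer concrete, here is a minimal sketch of a two-sided, two-proportion z-test, plus the chance that at least one of several no-effect tests looks significant purely by chance. The pooled z-test is one standard way to read out a conversion test, not necessarily what the cited companies used, and the visitor and conversion counts are invented for illustration:

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z_test(conv_a: int, n_a: int,
                          conv_b: int, n_b: int) -> tuple[float, float]:
    """Return (z, two-sided p-value) for the difference in conversion rates."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


def chance_of_false_positive(num_tests: int, alpha: float = 0.05) -> float:
    """Probability that at least one of num_tests independent tests with no
    real effect comes out 'significant' (in either direction) at level alpha."""
    return 1 - (1 - alpha) ** num_tests


if __name__ == "__main__":
    # Invented counts: 300/10,000 conversions for A vs 360/10,000 for B.
    z, p = two_proportion_z_test(300, 10_000, 360, 10_000)
    print(f"z = {z:.2f}, p = {p:.3f}")
    print(f"{chance_of_false_positive(7):.2f}")  # 0.30
```

With seven tests read at the 5% level, there is roughly a 30% chance that at least one looks significant even if nothing real changed, which is how a lone "winner" can turn out to be a phantom.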
Get in touch:
THE FINAL EXERCISE
Łukasz Twardowski
https://linkedin.com/in/twardowski

 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 

Último (20)

ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in college
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 

A/B Testing and the Infinite Monkey Theory

  • 1. A/B Testing and the Infinite Monkey Theorem. Lukasz Twardowski, www.useitbetter.com. Image source: http://en.wikipedia.org/wiki/Portraits_of_Shakespeare
  • 2. a monkey hitting keys at random for an infinite amount of time will almost surely type the complete works of William Shakespeare.
  • 3. A/B testing at random for an infinite amount of time will almost surely reach the conversion rate of Amazon.
  • 4. THEORY: A/B testing helps find out which of two versions performs better while both run simultaneously.
  • 5. We do this because every day is different, unlike in the Groundhog Day movie. Groundhog Day (1993, Dir. Harold Ramis)
  • 6. http://nerds.airbnb.com/experiments-at-airbnb/ A single change, bad or good, will not change a trend. Unless a change is A/B tested, you won’t know its impact.
  • 8. EXERCISE 1. Provide the benchmark: The industry average hit rate for A/B testing = ?
  • 9. EXERCISE 1. Provide the benchmark: The industry average hit rate for A/B testing = 14%. Just 1 out of 7 A/B tests is successful! http://conversionxl.com/ab-tests-fail/
  • 10. How to be the greatest monkey in the biz if infinity is not an option? Image: King Kong (1933, Dir. Merian Cooper, Ernest Schoedsack)
  • 11. How to be the best monkey in the biz? Be a quick monkey.
  • 12. EXERCISE 2. Do the math: 1 out of 7 tests wins x 2 weeks per test = slow growth. Unless you experiment at scale.
  • 13. The currency in which you pay for A/B tests is traffic.
  • 14. The currency in which you pay for A/B tests is traffic. The more you have, the more tests you can run.
  • 15. The currency in which you pay for A/B tests is traffic. The more you have, the more tests you can run. Never waste what you have.
  • 16. Shop Direct (100+ year old company): scaled to 101 experiments a month in two years. Etsy (a startup launched in 2005): 25 releases a day, most of them are A/B tests. http://www.slideshare.net/danmckinley/design-for-continuous-experimentation(linkedin)
  • 17. Zero Tests Per Month. “Here’s the test idea, numbers and execution. Can we proceed?” “Let’s meet to discuss. Maybe next week?” “Looks good. Will check with Z and get back to you.” “So here’s the test idea, numbers…” “Sorry, had other priorities. Can we meet next week?” “Sure! (D***!)” “Have you checked with Z?” “Have you…?” “Have you…?”
  • 18. Ground rules: 1. Test ideas are subject to prioritization, not approval.
  • 19. EXERCISE 3. Magic formula: evidence x opportunity size x strategy = priority. The worst idea gets tested if resources are available.
  • 20. 101 Tests Per Month. “Ok then, we’ll do this, this and that test. Others will wait.” “Guys, our strategy shifted to checkout optimization.” “Guys, we need to increase basket value.” “Now this and that one…” “And this…” “These two would work…” “Xmas is coming! DO NOTHING!” “…this, this and that…”
  • 21. Ground rules: 2. Accept the fact that things will go wrong.
  • 22. How to be the best monkey in the biz? Cheat like a monkey.
  • 23. If 1 out of 7 tests wins, what about the other 6?
  • 24. EXERCISE 1. What was the result of the Button Colors Test by Groove? https://www.groovehq.com/blog/failed-ab-tests
  • 25.
  • 26. If 1 out of 7 tests wins, what about the other 6? 5 of them will be inconclusive.
  • 27. Most tests are inconclusive because: a) too few users used the changed feature for the result to reach statistical significance; b) the changed feature had little to do with the metrics used to evaluate the test; c) there were multiple changes in the same test and their effects cancelled each other out.
  • 28. EXERCISE 4. Complete the sentence: A/B testing is NOT about __________. (Answer: making money.) You do it to find out what works and how well.
  • 29. You can successfully run tests that have no chance of success.
  • 30. Cheat: Experiment to test significance. Test results show that… removing a feature… slowing down the website… didn’t reduce conversion.
  • 31. Cheat: test significance. Test results show that… we shouldn’t waste time on that.
  • 32. Cheat: One change per test. Order matters. Select products, produce videos, upload, add links, launch test Add links Select products Produce videos … INCONCLUSIVE
  • 33. Cheat: Measure against your hypothesis. INCONCLUSIVE: Test results show that… adding videos had no impact on conversion. CONCLUSIVE: Test results show that… people don’t click “watch video” links.
  • 34. A great presentation by Etsy: goo.gl/WQpY65
  • 35. The benefit you get from A/B testing is knowledge, not revenue.
  • 36. The benefit you get from A/B testing is knowledge, not revenue. Revenue will come as a result of applied knowledge.
  • 37. How to be the best monkey in the biz? Don’t be a monkey. Don’t be a gnome either.
  • 38. What about this 1 test out of 7 that fails?
  • 39. 50% of companies are NOT A/B testing; 50% are A/B testing. 3 out of 4 companies (that are A/B testing) make changes based on intuition or best practices. http://conversionxl.com/ab-tests-fail/
  • 40. EXERCISE 5. Solve equation: collect underpants + ? = profit
  • 41. A/B Testing Flow, Fail Fast Approach: A/B test is launched. Test results come back negative. The idea gets killed, next test is launched.
  • 42. One failed test doesn’t make collecting underpants a bad idea.
  • 43. Prepare for failure. Example of A/B Testing Flow at Spotify: Pre-test research is done. A/B test is launched. Users’ behaviors are logged. Users are surveyed alongside the test. Test results come back negative. Survey responses give a clue why. Respondents’ logs give another clue. Respondents are emailed to clarify the issue. The issue is solved, the test relaunched. Courtesy of @bendressler, researcher at Spotify.
  • 44. The real price you pay for not researching why tests fail is the death of great ideas.
  • 45. Evidence-Led Flow (diagram). Inputs: User Testing, Voice of Customer, Qual/Quant Analytics. Insight and Evidence. Hypothesis: I predict that doing B will change X by Y% because of Z. Hypothesis Based A/B Testing. Metrics Based Evaluation: Are Metrics Good? Hypothesis check: Accepted or Rejected. What really happened?
  • 46. Evidence-Led Flow (diagram). Inputs: User Testing, Voice of Customer. Insight and Evidence. Hypothesis: I predict that doing B will change X by Y% because of Z. Hypothesis Based A/B Testing. Metrics Based Evaluation: Are Metrics Good? Hypothesis check: Accepted or Rejected. What really happened?
  • 47. UseItBetter - The Platform for Evidence-Led Experimentation at Scale. Collect behavioral data. Build segmentation rules. Explore, analyze, visualize. Quantify an opportunity. Translate an insight into a test. Average stats per website from the last month: 1TB Behavioural Raw Data, 40M Unique Interactions, 41 Sets of Rules Created.
  • 48. An analyst researching for an infinite amount of time will almost surely get you to 100% hit ratio. Which isn’t good either.
  • 49. If you are going to A/B test:
  • 50. 1. Never waste your traffic.
  • 51. 1. Never waste your traffic. 2. Many small changes are better than one big change.
  • 52. 1. Never waste your traffic. 2. Many small changes are better than one big change. 3. Even the smallest change needs an insight.
  • 53. 1. Never waste your traffic. 2. Many small changes are better than one big change. 3. Even the smallest change needs an insight. 4. Prepare for failure.
  • 54. 1. Never waste your traffic. 2. Many small changes are better than one big change. 3. Even the smallest change needs an insight. 4. Prepare for failure. 5. It’s OK to fail if you know why you failed.
  • 55. 1. Never waste your traffic. 2. Many small changes are better than one big change. 3. Even the smallest change needs an insight. 4. Prepare for failure. 5. It’s OK to fail if you know why you failed. 6. Iterate.
  • 56. 1. Never waste your traffic. 2. Many small changes are better than one big change. 3. Even the smallest change needs an insight. 4. Prepare for failure. 5. It’s OK to fail if you know why you failed. 6. Iterate. 7. Be honest.
  • 57. Disclaimer: For the sake of this presentation, I assumed that the results of the 7 tests I referred to had been read out correctly by people who are familiar with terms like statistical significance, confidence intervals, p-value, etc. Otherwise, it’s likely that the one winning test was just a phantom.
  • 58. THE FINAL EXERCISE. Get in touch: Łukasz Twardowski, https://linkedin.com/in/twardowski
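
The arithmetic behind slide 12 (1 out of 7 tests wins, 2 weeks per test) can be sketched in a few lines. This is only an illustration: the function name, the 52-week year, and the idea of a fixed number of concurrent tests are assumptions made here, not figures from the deck.

```python
def expected_wins_per_year(hit_rate: float, weeks_per_test: float,
                           concurrent_tests: int = 1) -> float:
    """Expected number of winning tests per year.

    Assumes every test takes the same time, every test has the same
    chance of winning, and a year has 52 weeks (all simplifications).
    """
    if not 0 <= hit_rate <= 1:
        raise ValueError("hit_rate must be between 0 and 1")
    if weeks_per_test <= 0 or concurrent_tests < 1:
        raise ValueError("weeks_per_test must be > 0 and concurrent_tests >= 1")
    tests_per_year = 52 / weeks_per_test * concurrent_tests
    return tests_per_year * hit_rate


# One test at a time: 26 tests a year, fewer than 4 expected wins.
slow = expected_wins_per_year(hit_rate=1 / 7, weeks_per_test=2)

# Ten tests running side by side (a hypothetical figure): ten times as many.
at_scale = expected_wins_per_year(hit_rate=1 / 7, weeks_per_test=2,
                                  concurrent_tests=10)

print(f"sequential: {slow:.1f} wins/year, at scale: {at_scale:.1f} wins/year")
```

Under these assumptions the expected number of wins grows linearly with the number of tests the traffic can support, which is the sense in which traffic is the currency.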
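
The magic formula from slide 19 (evidence x opportunity size x strategy = priority) can be turned into a simple backlog sort. The sketch below is hypothetical: the `Idea` class, the 1 to 5 scoring scale, and the example ideas are invented for illustration; the deck only gives the formula itself.

```python
from dataclasses import dataclass


@dataclass
class Idea:
    """A test idea scored on three factors (a 1-5 scale is assumed here)."""
    name: str
    evidence: float          # how well research supports the idea
    opportunity_size: float  # how many users or how much value it touches
    strategy: float          # how well it fits the current strategy

    @property
    def priority(self) -> float:
        # The formula from the slide: a plain product of the three factors.
        return self.evidence * self.opportunity_size * self.strategy


def prioritize(ideas: list[Idea]) -> list[Idea]:
    """Order ideas from highest to lowest priority; nothing is rejected."""
    return sorted(ideas, key=lambda idea: idea.priority, reverse=True)


backlog = prioritize([
    Idea("Bigger checkout button", evidence=2, opportunity_size=5, strategy=4),
    Idea("New footer links", evidence=1, opportunity_size=1, strategy=1),
    Idea("Shorter signup form", evidence=4, opportunity_size=3, strategy=5),
])

for idea in backlog:
    print(f"{idea.priority:5.0f}  {idea.name}")
```

Because the function only sorts and never filters, the lowest-scoring idea stays in the queue and still gets tested once resources are available, matching ground rule 1.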
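
Reason a) on slide 27, too few users for the result to reach statistical significance, can be illustrated with a standard pooled two-proportion z-test. The deck does not say which test the author used, so this is a swapped-in textbook method, and all visitor and conversion counts below are made up.

```python
import math


def two_proportion_z_test(conv_a: int, n_a: int, conv_b: int, n_b: int):
    """Return (z, two-sided p-value) for the difference in conversion rates.

    Uses the pooled standard error and the normal approximation, which is
    reasonable only when each variant has plenty of conversions.
    """
    rate_a = conv_a / n_a
    rate_b = conv_b / n_b
    pooled = (conv_a + conv_b) / (n_a + n_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if std_err == 0:
        raise ValueError("no variation in the data; the test is undefined")
    z = (rate_b - rate_a) / std_err
    # Two-sided p-value of a standard normal: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2))
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Same conversion rates (2.0% vs 2.6%), very different amounts of traffic.
_, p_small = two_proportion_z_test(conv_a=20, n_a=1_000, conv_b=26, n_b=1_000)
_, p_large = two_proportion_z_test(conv_a=200, n_a=10_000, conv_b=260, n_b=10_000)

print(f"1,000 users per variant:  p = {p_small:.3f}")
print(f"10,000 users per variant: p = {p_large:.4f}")
```

With identical observed rates, the smaller sample stays above the conventional 0.05 threshold while the larger one falls below it, so a change seen by only a small share of users can easily end up inconclusive.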
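
The disclaimer on slide 57 warns that the one winning test may be a phantom. A rough way to see why: if none of the 7 changes had any real effect and each test were judged independently at a 5% significance level (both assumptions made here, not stated in the deck), the chance of at least one false winner is already substantial.

```python
def chance_of_phantom_winner(n_tests: int, alpha: float = 0.05) -> float:
    """Probability that at least one of n_tests shows a false positive.

    Assumes the tests are independent and that no change has a real effect.
    """
    if n_tests < 0 or not 0 <= alpha <= 1:
        raise ValueError("n_tests must be >= 0 and alpha between 0 and 1")
    return 1 - (1 - alpha) ** n_tests


for n in (1, 7, 20):
    print(f"{n:2d} tests: {chance_of_phantom_winner(n):.0%} chance of a phantom winner")
```

This simple calculation does not cover peeking at results early or checking many metrics per test, both of which would push the chance higher.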