This document discusses best practices for running A/B tests, including collecting enough data over multiple weeks and business cycles, avoiding relying solely on significance levels to determine a winner, and integrating test data with web analytics. It also emphasizes formulating a clear hypothesis by outlining why a change is needed and how its impact will be measured. The presenter advocates getting familiar with key statistical concepts and learning statistics.
13. After 4 weeks
+36.2% / significance 96%
Total sample: 9396 visitors / 122 conversions
14. What we have now:
9,396
What we need:
34,304 visitors in total
What we’re missing:
24,908 visitors
73% of the required
sample size.
http://www.evanmiller.org/ab-testing/sample-size.html
23. Test duration: 2-4 weeks (2 or more business cycles)
Sample size: Pre-calculated sample size (users)
400+ conversions (goals)
Significance: 95% or higher
Power: 80% P-Value: 0%
What I look for:
25. Devices
Browsers
Channels
Country
Days of
the week
Hours of
the day
User
types
Motivation
Campaigns
Age
Gender
Gender
BrowsersDevices
Browsers
Channels
Countries
Days of
the week
Hours of
the day
User
types
Motivation
Campaigns
Age
Days of
the month
Gender
Devices
Experiment
31. Why do we think we need to make a change?
What is it that we want to change?
What impact do we expect to see?
How will we measure this impact?
32. 1. Because we saw [data/feedback]
2. We expect that [change] will cause [impact]
3. We’ll measure this using [data metric]
33. Because [exit surveys indicated that saving money is
most important to our users.]
We expect that [tweaking our value prop to reflect this]
will cause [more people to fill out our lead form.]
We’ll measure this using [form submission rate as our
key metric.]
34. Because [exit surveys indicated that saving money is
most important to our users.]
We expect that [tweaking our value prop to reflect this]
will cause [more people to fill out our lead form.]
We’ll measure this using [form submission rate as our
key metric.]
35. Because [exit surveys indicated that saving money is
most important to our users.]
We expect that [tweaking our value prop to reflect this]
will cause [more people to fill out our lead form.]
We’ll measure this using [form submission rate as our
key metric.]
36. Because [exit surveys indicated that saving money is
most important to our users.]
We expect that [tweaking our value prop to reflect this]
will cause [more people to fill out our lead form.]
We’ll measure this using [form submission rate as our
key metric.]
We expect to see reliable results in [four weeks.]