The change we are going to test: a student clicks the ‘Start free trial’ button → a pop-up window asks how much time (in hours) the student will dedicate to this course per week (the free trial screener):
1. >= 5 hours per week → go directly to the checkout page
2. < 5 hours per week → a friendly message hints that the student might not be suitable for this free trial and could access the course material directly instead → if they persist, they will be taken to the checkout page as well
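A minimal sketch of this branching logic, assuming a hypothetical `hours_per_week` answer from the pop-up and a `persists` flag for the "continue anyway" path (names are illustrative, not Udacity's actual code):

```python
SCREENER_THRESHOLD_HOURS = 5   # cutoff asked in the pop-up (hypothetical constant name)

def route_after_screener(hours_per_week: float, persists: bool) -> str:
    """Decide where the student lands after answering the screener."""
    if hours_per_week >= SCREENER_THRESHOLD_HOURS:
        return "checkout"          # option 1: enroll in the free trial
    if persists:                   # warned, but still wants the free trial
        return "checkout"
    return "course_materials"      # option 2: free access, no trial
```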
Control group: click ‘Start free trial’ → no screener, directly to the checkout page.
Experiment group: click ‘Start free trial’ → screener, which offers two options: 1. continue to enroll in the free trial; 2. access the course material without enrolling in the free trial.
Unit of diversion: cookie. A cookie uniquely identifies a user's browser session. When a user first visits the page, a cookie is generated and they are randomly assigned to either the control group (no screener) or the treatment group (with screener). One common way to implement this is sketched below.
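A minimal sketch of cookie-based diversion, assuming assignment is done by hashing the cookie id so the same browser always sees the same variant (the hashing scheme is an assumption for illustration, not Udacity's actual implementation):

```python
import hashlib

def assign_group(cookie_id: str, experiment: str = "free_trial_screener") -> str:
    """Deterministically assign a cookie to control or treatment (50/50)."""
    # Hashing the cookie together with the experiment name keeps the assignment
    # stable for a given browser and independent across different experiments.
    digest = hashlib.sha256(f"{experiment}:{cookie_id}".encode()).hexdigest()
    return "treatment" if int(digest, 16) % 2 == 0 else "control"
```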
Invariant metrics:
Number of cookies: since our unit of diversion is the cookie, this metric should be comparable between the two groups.
Number of user-ids (did not work):
● The intervention directly affects whether a user proceeds to enrollment and receives a user-id, so the number of user-ids will likely differ between the control and treatment groups.
● Because the number of user-ids is influenced by the screener, it cannot serve as a stable baseline for comparing the two groups before the intervention.
Number of clicks: the change (free trial screener) happens after students click the ‘Start free trial’ button, and since nothing changes in the user interface (e.g., button size/color), the number of clicks should be comparable between the experiment and control groups.
Click-through-probability: the number of unique cookies that click the "Start free trial" button divided by the number of unique cookies that view the course overview page. Since the screener only appears after the click, this ratio should also be unaffected by the change. A simple sanity check for these count-based invariants is sketched below.
Gross conversion (did not work): the number of user-ids that complete checkout and enroll in the free trial divided by the number of unique cookies that click the "Start free trial" button. Not invariant: the screener directly impacts the number of user-ids enrolled (the numerator). Users who click "Start free trial" in the control group automatically proceed to checkout, while those in the treatment group might be discouraged by the screener, leading to fewer enrollments in the treatment group than in the control group (which bypasses the screener). This difference isn't driven by the overall pool of users (reflected by clicks) but by the intervention itself (the screener).
Retention (did not work): the number of user-ids that remain enrolled past the 14-day boundary (and thus make at least one payment) divided by the number of user-ids that complete checkout. Not invariant (indirectly): while the screener doesn't directly affect users who have already enrolled, it might indirectly change the pool of users who enroll (the denominator), as explained above for gross conversion. This creates an uneven baseline for comparing retention rates between groups.
Net conversion (did not work): the number of user-ids that remain enrolled past the 14-day boundary (and thus make at least one payment) divided by the number of unique cookies that click the "Start free trial" button.
Selection bias: the screener introduces selection bias. Users in the treatment group who click "Start free trial" and then choose to enroll after seeing the screener are more likely to be those with higher initial interest, because the screener potentially filters out some users. This creates an uneven starting point for the two groups (control vs. treatment) when comparing net conversion rates: the treatment group might show a higher net conversion rate simply because the screener filtered out users who were less likely to convert in the first place.
Evaluation metrics:
1. Retention (did not work): since the screener might discourage users with less time commitment from enrolling in the first place, the treatment group might have a higher retention rate simply because it started with a more engaged pool of users (see the simulation sketch after this list).
2. Net conversion (did not work): selection bias happens when the group of users taking part in your A/B test doesn't accurately represent your overall audience. Here the denominator, the number of unique cookies that click the "Start free trial" button, is comparable between the control and experiment groups; these are the people who show initial interest. But:
Control group: all users who click "Start free trial" proceed directly to checkout, representing a wider range of user motivations and commitment levels.
Treatment group: users see the screener before checkout, so this group might have a higher concentration of users with strong initial interest, because the screener potentially filters out some users with less interest. The treatment group's net conversion rate might therefore seem higher than the control group's, but only because the screener discourages some users from enrolling.
3. Gross conversion (works!): its numerator is the number of user-ids that complete checkout and enroll in the free trial, and its denominator is the comparable pool of clicks, so it directly captures the screener's effect on enrollment. By focusing on gross conversion and avoiding metrics heavily influenced by the screener's selection bias (retention, net conversion), we gain a clearer picture of the screener's true impact on the free trial process.
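The selection-bias story above can be made concrete with a toy simulation: give each clicker a hidden commitment level, let the screener discourage some low-commitment users from enrolling, and keep every individual's payment behavior unchanged. All the rates below are made up for illustration; they are not Udacity data:

```python
import random

random.seed(42)

def simulate_retention(n_clicks: int, screener: bool) -> float:
    """Toy model: retention = payments / enrollments for one group."""
    enrollments = payments = 0
    for _ in range(n_clicks):
        committed = random.random() < 0.4    # 40% of clickers are high-commitment (made up)
        # The screener discourages half of the low-commitment users from enrolling.
        if screener and not committed and random.random() < 0.5:
            continue
        enrollments += 1
        # Payment probability depends only on commitment, never on the screener.
        if random.random() < (0.70 if committed else 0.15):
            payments += 1
    return payments / enrollments

print(f"control:   {simulate_retention(200_000, screener=False):.3f}")  # ~0.37
print(f"treatment: {simulate_retention(200_000, screener=True):.3f}")   # ~0.46
```

Retention rises in the treatment group even though no individual behaves differently, which is exactly why it cannot isolate the screener's effect.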
Measuring variability:
Baseline values help set a benchmark for what is "normal" or expected in the current system without any changes.
The evaluation metric I choose here is gross conversion: the number of user-ids that complete checkout and enroll in the free trial divided by the number of unique cookies that click the "Start free trial" button (dmin = 0.01).
Given 5000 cookies visiting the course overview page, the denominator is 5000 * 0.08 = 400 clicks; the probability of enrolling given a click is p = 0.20625.
Because gross conversion is a probability, its analytic standard deviation follows the binomial formula:
standard deviation = sqrt(p * (1 - p) / N) = sqrt(0.20625 * (1 - 0.20625) / 400) ≈ 0.0202
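The same arithmetic in a few lines of Python (a sketch; the binomial standard-error formula is standard, and the inputs are the baseline values above):

```python
import math

pageviews = 5000      # cookies visiting the course overview page
ctp = 0.08            # click-through-probability on "Start free trial"
p = 0.20625           # baseline gross conversion (enrollments / clicks)

n_clicks = pageviews * ctp                  # 400 clicks = the metric's denominator
sd = math.sqrt(p * (1 - p) / n_clicks)      # binomial standard error of a proportion
print(f"clicks = {n_clicks:.0f}, SD = {sd:.4f}")   # SD ≈ 0.0202
```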
Sizing:
1. Choosing number of samples given power: how many pageviews in total (across both groups) would we need to collect to adequately power the experiment?
Use an alpha of 0.05 and a beta of 0.2; the baseline rate for gross conversion is 20.625% and the minimum detectable effect is 1% (absolute). Plugging these into the online calculator gives 25835 clicks (this number is for one group) → 25835 * 2 = 51670 clicks for both groups → divide by the click-through-probability to get the pageviews we need: 25835 * 2 / 0.08 = 645875.
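The calculator's number can be approximated with the standard two-proportion sample-size formula; a sketch (the result differs slightly from 25835 because online calculators vary in the exact formula they use):

```python
import math
from statistics import NormalDist

def clicks_per_group(p_base: float, d_min: float,
                     alpha: float = 0.05, beta: float = 0.2) -> int:
    """Two-proportion sample size (pooled-variance z-test approximation)."""
    p_alt = p_base + d_min
    p_bar = (p_base + p_alt) / 2
    z_a = NormalDist().inv_cdf(1 - alpha / 2)   # 1.96 for alpha = 0.05
    z_b = NormalDist().inv_cdf(1 - beta)        # 0.84 for power = 0.8
    n = (z_a * math.sqrt(2 * p_bar * (1 - p_bar))
         + z_b * math.sqrt(p_base * (1 - p_base) + p_alt * (1 - p_alt))) ** 2 / d_min ** 2
    return math.ceil(n)

n = clicks_per_group(0.20625, 0.01)
print(n)                          # ~26156 clicks per group, close to the calculator's 25835
print(math.ceil(n * 2 / 0.08))    # total pageviews needed across both groups
```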
2. Choosing duration vs. exposure: what percentage of Udacity's traffic would we divert to this experiment (assuming there were no other experiments we wanted to run simultaneously)? Is the change risky enough that we wouldn't want to run it on all traffic?
According to our baseline estimates, 40000 unique cookies view the course overview page per day, and from our calculation we need 645875 pageviews (for both groups):
1. if we run on all traffic: 645875 / 40000 ≈ 16.15 days
2. if we run on 50% of traffic: 645875 / 20000 ≈ 32.29 days
3. if we run on 25% of traffic: 645875 / 10000 ≈ 64.59 days
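The same trade-off as a quick loop (inputs are the baseline values above):

```python
import math

pageviews_needed = 645_875   # total pageviews across both groups
daily_pageviews = 40_000     # unique cookies viewing the course overview page per day

for fraction in (1.00, 0.50, 0.25):
    days = pageviews_needed / (daily_pageviews * fraction)
    print(f"{fraction:.0%} of traffic -> {days:.2f} days (~{math.ceil(days)} days)")
```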