Google search
Popularity
Google search
Popularity
Google search
Popularity
Obama campaign
Impact
What about learning?
What's the role of A/B tests, i.e., online randomized field experiments, in the study of online learning?
What about learning?
What's the role of A/B tests, i.e., online randomized field experiments, in the study of online learning?
Nienke Ruijs / Han van der Maas / Gunter Maris / Alexander Savi
A/B tests in online learning
Coursera
Khan Academy
DuoLingo
Coursera
Khan Academy
DuoLingo
Mostly anecdotal. Exceptions: MOOCs and ASSISTments.
Why anecdotal?
Evidence-based
Ecologically valid
Double blind
Iterative
Non-invasive
Verification
Evaluating interventions in Math Garden to increase the return on investment of online learning.
"Capitalists"…
…hiding in the data
"Capitalists"…
…hiding in the data
Capitalists…
…hiding in the data
Surface learning, deep learning, strategic learning.
Status quo
Toil time
Expectations
domain_id | y.level | term | estimate | std.error | statistic | p.value |
---|---|---|---|---|---|---|
1 | 0 | (Intercept) | 6.965 | 0.004 | 512.009 | 0 |
1 | 0 | cond_1_bw | 1.290 | 0.009 | 28.320 | 0 |
1 | 0 | cond_2_bw | 1.510 | 0.010 | 39.957 | 0 |
1 | 0 | cond_3_bw | 1.500 | 0.012 | 33.195 | 0 |
1 | 1 | (Intercept) | 18.969 | 0.004 | 807.042 | 0 |
1 | 1 | cond_1_bw | 1.252 | 0.008 | 26.553 | 0 |
1 | 1 | cond_2_bw | 1.472 | 0.010 | 39.059 | 0 |
1 | 1 | cond_3_bw | 1.487 | 0.012 | 33.443 | 0 |
domain_id | term | estimate | std.error | statistic | p.value |
---|---|---|---|---|---|
1 | (Intercept) | 8544.932 | 3.168 | 2697.040 | 0 |
1 | cond_1_bw | 31.548 | 8.927 | 3.534 | 0 |
1 | cond_2_bw | 172.111 | 8.981 | 19.164 | 0 |
1 | cond_3_bw | -40.947 | 8.995 | -4.552 | 0 |
Manipulation check
answer | condition | toil_time | response_in_seconds |
---|---|---|---|
? | 3 | 9 | 8.968 |
? | 3 | 9 | 8.995 |
? | 3 | 9 | 1.147 |
? | 3 | 9 | 1.810 |
? | 3 | 9 | 8.979 |
In some browsers question mark button was greyed out but active, so we
Manifest
We used proxy measures for learning
signal-to-noise ratio; learning
Latent
signal-to-noise ratio; learning
Manifest
We used proxy measures for learning
signal-to-noise ratio; learning
Latent
signal-to-noise ratio; learning
Where to locate the learning?
Heterogeneous treatment effects
Adaptivity
Too much power?
Exploration
Do users terminate their games?
Do they start playing less frequently?
Multiple-choice versus open-ended questions? Influence on choice for difficulty level? Many possible questions to answer.
Local minima
Generalizability
Strengths: ecological validity, non-invasiveness, …
Weaknesses: large scale required, local minima, …
Threats: ?
Opportunities:
Email savi@uva.nl
Slides www.alexandersavi.nl
Savi, A. O., Ruijs, N. M., Maris, G. K. J., & van der Maas, H. L. J. (2016). The role of A/B tests in the study of large-scale online learning. Manuscript in preparation.