Lesson 11: Evaluate your experiment and make a decision

A good experimentation platform calculates results for you and displays the performance of each variant, taking care of the statistical details so you can focus on learning from the experiment. With those insights, you make a decision on how to proceed with the change you tested.

At this stage, your experiment has successfully run for a period of time and has no visible errors. Congratulations! Now it's time for the fun part. You have at least one result to interpret, but often there are more than just one. More precisely, your experiment has T x M results to interpret, where T = Number of treatment groups (excluding control) and M = Number of metrics.

For an experiment with 3 treatment groups and 4 metrics, you have 12 results to interpret.

Overall decision recommendations

A good experimentation platform provides overall decision recommendations that use the outcomes of all metrics to suggest whether a specific treatment is worth rolling out.

The shipping recommendation recommends you to ship a change if at least one success metric has moved in the desired direction with significance. Simultaneously, all guardrail metrics must be significantly non-inferior, meaning that they're all within the acceptable margin you set using the non-inferiority margin. The test must also be in a healthy state, with no significant negative changes in any of the metrics, and no sign that there is a problem with the quality of the test.

Metric results

For each metric, you see a comparison between the control group and each treatment group. You can dig deeper into the results to see metric values, confidence intervals, variances, and more. If you ran your experiment with results delivered continuously, you can also view the results over time.

Exploration

If at the end of the experiment you find things that you would like to dig deeper into, you can do exploratory analysis. Here you can add any metric and see how it performed for each of the treatment groups, and split the results by dimensions.