Who is the Confidence Bootcamp for?

The bootcamp is designed for anyone who wants to improve their experimentation skills. Courses are tailored for data scientists, analysts, engineers, product managers, and leaders — whether you are running your first A/B test or scaling an experimentation program across your organization.

Is the bootcamp free?

Yes, the Confidence Bootcamp is completely free. All 11 courses, 90+ lessons, and resources are available at no cost. You can start learning immediately without creating an account, though signing in lets you track your progress across devices.

What does the bootcamp cover?

The bootcamp covers the full experimentation lifecycle: A/B testing fundamentals, hypothesis formulation, interpreting experiment results, metrics design, sample size calculation, feature flags, and building an experimentation culture. It includes 11 courses with over 90 lessons built by the Confidence team at Spotify.

How long does the bootcamp take to complete?

The full bootcamp takes approximately 20 hours to complete across all 11 courses. Individual courses range from 30 minutes to 3 hours. You can learn at your own pace and pick the courses most relevant to your role.

Do I need prior experience with A/B testing or statistics?

No prior experience is required. The bootcamp starts with foundational courses like Intro to Experimentation and progressively covers more advanced topics like sequential testing and variance reduction. Each course clearly indicates which roles it is designed for.

Who created the Confidence Bootcamp?

The Confidence Bootcamp was created by the Confidence team at Spotify, the same team that builds the experimentation and feature flagging platform used across Spotify. The content reflects real-world experimentation practices used at one of the world's largest digital products.

Lesson 1: Multi-metric decision making

Summary

This lesson teaches you how to formalize decision-making in experiments that use both success metrics and guardrail metrics. Confidence uses a decision rule to map the results of all success and guardrail metrics to a single decision: ship or don't ship.
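
To make the idea of a decision rule concrete, here is a minimal Python sketch that collapses several metric results into one ship decision. It assumes one plausible form of the rule: ship when at least one success metric has improved significantly and every guardrail metric is significantly non-inferior. The names MetricResult and should_ship are hypothetical and are not part of Confidence; the lesson itself defines the actual rule.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class MetricResult:
    """Outcome of one metric's test (hypothetical structure for illustration)."""
    name: str
    role: str     # "success" or "guardrail"
    passed: bool  # success: significantly improved; guardrail: significantly non-inferior


def should_ship(results: Sequence[MetricResult]) -> bool:
    """Assumed rule: at least one success metric passed AND all guardrails passed."""
    success = [r for r in results if r.role == "success"]
    guardrails = [r for r in results if r.role == "guardrail"]
    any_success_improved = any(r.passed for r in success)
    all_guardrails_ok = all(r.passed for r in guardrails)
    return any_success_improved and all_guardrails_ok


if __name__ == "__main__":
    # Illustrative, made-up results; not data from a real experiment.
    results = [
        MetricResult("minutes_played", "success", True),
        MetricResult("searches", "success", False),
        MetricResult("crash_rate", "guardrail", True),
    ]
    print("ship" if should_ship(results) else "don't ship")
```

Under this assumed rule, a single failing guardrail blocks the ship decision no matter how many success metrics improved.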

Reader exercise

What is the primary purpose of guardrail metrics in experiments?

To measure the success of a product change

To ensure metrics do not move in the wrong direction due to a product change

To increase the statistical significance of success metrics

To calculate the required sample size for an experiment

Reader exercise

What is Spotify's decision rule for shipping a product change?

Ship if at least one success metric has improved significantly

Ship if all guardrail metrics have moved in the desired direction

Ship if at least one success metric has improved significantly, and all guardrail metrics are significantly non-inferior

Ship if the treatment group outperforms the control group in all metrics

Reader exercise

What is the relationship between the Non-Inferiority Margin (NIM) and sample size?

A smaller NIM requires a larger sample size

A larger NIM requires a larger sample size

NIM doesn't affect sample size

Sample size is only determined by success metrics
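
The relationship between NIM and sample size can be explored numerically. The sketch below uses the textbook normal-approximation formula for a one-sided non-inferiority test on a difference in means, n per group = 2 * sigma^2 * (z_(1-alpha) + z_power)^2 / NIM^2, and assumes the true difference between treatment and control is zero. The function name samples_per_group and its default alpha and power are assumptions for illustration, not the calculation Confidence performs.

```python
from math import ceil
from statistics import NormalDist


def samples_per_group(nim: float, sigma: float,
                      alpha: float = 0.05, power: float = 0.8) -> int:
    """Approximate per-group sample size for a one-sided non-inferiority test.

    nim   -- non-inferiority margin, in the same units as the metric
    sigma -- standard deviation of the metric (assumed equal in both groups)
    """
    if nim <= 0 or sigma <= 0:
        raise ValueError("nim and sigma must be positive")
    std_normal = NormalDist()
    z_alpha = std_normal.inv_cdf(1 - alpha)  # critical value for the one-sided test
    z_power = std_normal.inv_cdf(power)      # quantile for the desired power
    return ceil(2 * (sigma * (z_alpha + z_power) / nim) ** 2)


if __name__ == "__main__":
    # Illustrative margins only; sigma is set to 1 for simplicity.
    for nim in (0.20, 0.10, 0.05):
        print(f"NIM={nim:.2f} -> {samples_per_group(nim, sigma=1.0)} per group")
```

Because the margin appears squared in the denominator, halving the NIM roughly quadruples the required sample size in this approximation.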

Lesson 1: Multi-metric decision making

The two main types of metrics

Spotify's decision rule

Intro to guardrail metrics and non-inferiority tests

What is the primary purpose of guardrail metrics in experiments?

What is Spotify's decision rule for shipping a product change?

What is the relationship between the Non-Inferiority Margin (NIM) and sample size?

Notes for nerds
