Confidence
  • Documentation
  • Blog
  • Bootcamp
  • Status
  • Confidence Bootcamp
    • My learning
    • Intro to experimentation
      • Introduction
      • Lesson 1: Why you should experiment
      • Lesson 2: Experiment hypothesis
      • Lesson 3: Success and guardrail metrics
      • Lesson 4: Success metrics
      • Lesson 5: Set up your experiment
      • Lesson 6: Calculation frequency
      • Lesson 7: Target audience
      • Lesson 8: Sample size
      • Lesson 9: Quality assurance
      • Lesson 10: Run your experiment
      • Lesson 11: Evaluate your experiment and make a decision
      • Lesson 12: A/B tests and rollouts
      • Course wrap up
    • Intro to metrics
      • Introduction
      • Lesson 1: What is a metric?
      • Lesson 2: Metric roles
      • Lesson 3: Time considerations
      • Lesson 4: Capturing behavior
      • Lesson 5: Strategic metrics
      • Lesson 6: Interpretability
      • Lesson 7: Feasibility and sensitivity
      • Lesson 8: Variance reduction
      • Lesson 9: Select metrics
      • Lesson 10: Segment-level analysis
      • Course wrap up
    • Scientific product development
      • Introduction
      • Lesson 1: Why you should experiment
      • Lesson 2: The scientific method
      • Lesson 3: Randomized controlled trials
      • Lesson 4: Experiment hypothesis
      • Lesson 5: Case study
        • Case study
        • Answers to case study
      • Lesson 6: Why do we need statistics?
      • Lesson 7: Success metrics
      • Lesson 8: Detectable effects and sample size
      • Lesson 9: Make a decision
      • Course wrap up
    • A primer on hypothesis testing
      • Introduction
      • Lesson 1: Introduction to hypothesis testing
      • Lesson 2: True vs estimated effects
      • Lesson 3: Sampling distribution of the difference-in-means estimator
      • Lesson 4: Z-tests and how to reject the null hypothesis
      • Lesson 5: False postive rate and alpha
      • Lesson 6: True positive rate, MDE, and power
      • Course wrap up
    • Intro to Feature Flags
      • Introduction
      • Lesson 1: What is a feature flag?
      • Lesson 2: Lifecycle of a feature flag
      • Lesson 3: Clients
      • Lesson 4: Evaluation context and targeting
    • Sample size calculation - I
      • Introduction
      • Lesson 1: What is the required sample size?
      • Lesson 2: Alpha and power
      • Lesson 3: Baseline mean and variance
      • Lesson 4: Sample size playground - I
    • Sample size calculation - II
      • Introduction
      • Lesson 1: Multi-metric decision making
      • Lesson 2: Number of success metrics
      • Lesson 3: Number of guardrail metrics
      • Lesson 4: Number of comparisons
      • Lesson 5: Sample size playground - II
    • Sample size calculation - III
      • Introduction
      • Lesson 1: Binary metrics
      • Lesson 2: Treatment group proportions
      • Lesson 3: Variance reduction
      • Lesson 4: Sequential testing and sample size
      • Lesson 5: Sample size playground - III
    • Advance your experimentation
      • Introduction
      • Lesson 1: Guardrail metrics with non-inferiority margins
      • Lesson 2: Choose evaluation frequency
      • Lesson 3: Metrics' roles in experiments
      • Lesson 4: Cumulative holdback evaluations
    • Experimentation culture
      • Introduction
      • Lesson 1: Onboarding into experimentation
      • Lesson 2: Empowering experimentation champions
      • Lesson 3: Sustaining the experimentation culture
    • Videos

Lesson 10: Run Your Experiment

Summary

In this lesson, you learn how to monitor your experiment after launch, including checking exposure counts, reviewing automatic monitoring checks, and accessing your results either continuously or once the experiment ends.

After you launch your experiment, you should monitor that everything works fine. A good experimentation platform supports you in this stage with automatic monitoring checks. In this section of the course you learn what those checks are, and how to monitor that your experiment delivers your new experience.

Directly after launch

When you launch the A/B test, exposure calculations run at frequent intervals initially. This makes it possible to directly see that the experiment works as intended.

In Confidence

Confidence runs automatic monitoring checks throughout the experiment. Hover the Live status on the right sidebar to see their current status.

Monitoring Checks

If you use Confidence's flags, you can also see the real-time resolve count alongside the exposure schedule.

At any time while the experiment is live

As soon as the first exposure and metric calculation finish, you can check the status of monitoring checks for your experiment.

If you configured your experiment to deliver results continuously, the full results page is available while the experiment is live with results for all metrics that have at least some data.

When you end the experiment

If you configured your experiment to deliver results at the end of the experiment, the results page only shows the results for all metrics after you end it.

You can also use exploratory analysis to further drill down the results of metrics by splitting the results by dimensions, and add new metrics that weren't part of the initial experiment setup. You can, for example, split the results by market to see how results differ between different markets, or add a metric that measures something similar to an existing metric to make sure that an unexpected result is present also in the new metric.

You can use exploratory analysis during the experiment, but to prevent p-hacking and cherry picking of results, you should wait until the experiment has ended before exploring additional metrics.

In Confidence

Read more about how to interpret the results in Confidence. Use the Explore tab to split results by dimensions and add metrics beyond the initial setup.

Reader exercise

Which is a correct statement about monitoring and validation checks?

Was this page helpful?

PreviousLesson 9: Quality assurance
NextLesson 11: Evaluate your experiment and make a decision

© Copyright 2026. All rights reserved.

Follow us on TwitterFollow us on GitHub

On this page

  1. Directly after launch

  2. At any time while the experiment is live

  3. When you end the experiment