Feature lists tend to oversimplify experimentation offerings. Not all implementations of the same feature are alike, and a vague RFP leads to frustration and friction when the platform does not deliver what the checkmark promised. Here we describe how we would specify an RFP for various capabilities, and what to look for to ensure the implementation is worth committing to.
Topics where we have a connected implementation and can show what that looks like.
Most experimentation platforms have a sample size calculator. Almost none connect it to the analysis method the experiment will actually use. Here is what to ask instead of "do you have a calculator?"
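To make the disconnect concrete, here is a minimal sketch in Python with made-up numbers. The function name and the cuped_rho parameter are ours, not any vendor's API; the point is that a calculator that knows the analysis will apply CUPED-style variance reduction gives roughly half the answer of one that does not.

```python
from scipy.stats import norm

def required_n_per_arm(sigma, mde, alpha=0.05, power=0.8, cuped_rho=0.0):
    """Sample size per arm for a two-sided two-sample z-test on a mean,
    using the variance the analysis will actually see after CUPED."""
    z = norm.ppf(1 - alpha / 2) + norm.ppf(power)
    var = sigma**2 * (1 - cuped_rho**2)  # analysis-time variance, not raw variance
    return 2 * var * z**2 / mde**2

# Disconnected calculator: ignores that the analysis will apply CUPED.
print(round(required_n_per_arm(sigma=1.0, mde=0.05)))                 # ~6279
# Connected calculator: knows the pre-period correlation is ~0.7.
print(round(required_n_per_arm(sigma=1.0, mde=0.05, cuped_rho=0.7)))  # ~3202
```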
Every platform claims to support sequential testing. The claim is almost always incomplete. Here is what to ask instead of "do you have sequential testing?"
Every platform lets you add multiple metrics. Most will display results for each one. What they rarely tell you is what to do next. Here is what to ask.
Every platform says it corrects for multiple comparisons. Most do, partially. Here is what to ask instead of "do you correct for multiple testing?"
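A sketch of the "partially" problem, using statsmodels and hypothetical p-values for three variants and four metrics: a Benjamini-Hochberg correction run within each variant passes a result that the same correction run across all twelve tests does not.

```python
import numpy as np
from statsmodels.stats.multitest import multipletests

# Hypothetical p-values: three variants, four metrics each.
pvals = {
    "variant_a": [0.003, 0.04, 0.2, 0.6],
    "variant_b": [0.010, 0.30, 0.5, 0.9],
    "variant_c": [0.040, 0.20, 0.7, 0.8],
}

# Partial correction: Benjamini-Hochberg within each variant only.
for name, p in pvals.items():
    reject, *_ = multipletests(p, alpha=0.05, method="fdr_bh")
    print(name, reject)  # variant_b's 0.010 survives this version

# Full correction: BH across all twelve variant-by-metric tests at once.
reject, *_ = multipletests(np.concatenate(list(pvals.values())),
                           alpha=0.05, method="fdr_bh")
print("pooled", reject)  # only the 0.003 result survives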
Every platform that offers variance reduction claims it cuts runtime by 20-50%. What is usually missing is how far that reduction actually reaches. Here is what to ask.
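A minimal simulation of the reach question, on made-up data: CUPED-style adjustment removes roughly ρ² of the variance from a metric with a strong pre-period signal, and essentially nothing from a metric with no pre-period history.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100_000
pre = rng.normal(10, 3, n)                # pre-experiment covariate
post = 0.8 * pre + rng.normal(0, 2, n)    # metric with a strong pre-period signal
new = rng.normal(5, 2, n)                 # metric with no pre-period signal

def cuped(y, x):
    # Standard CUPED adjustment: subtract the covariate-explained part.
    theta = np.cov(y, x)[0, 1] / x.var()
    return y - theta * (x - x.mean())

print(post.var(), cuped(post, pre).var())  # ~9.8 -> ~4.0: variance cut ~59%
print(new.var(), cuped(new, pre).var())    # ~4.0 -> ~4.0: no reach here
```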
Revenue per user. Streams per session. Most platforms support ratio metrics. Most get the variance wrong. Here is what to ask instead of "do you support it?"
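A sketch of the two variances, on simulated per-user data: averaging each user's personal ratio both estimates a different quantity (a mean of ratios, not a ratio of means) and reports the wrong uncertainty. The delta method handles the ratio of two correlated means directly.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 50_000
sessions = rng.poisson(5, n) + 1     # per-user session counts (randomization unit: user)
streams = rng.poisson(2 * sessions)  # per-user stream counts, correlated with sessions
r = streams.sum() / sessions.sum()   # the metric: streams per session

# Wrong: treat each user's personal ratio as an i.i.d. observation.
naive_se = np.std(streams / sessions, ddof=1) / np.sqrt(n)

# Delta method: variance of a ratio of two correlated per-user means.
cov = np.cov(streams, sessions, ddof=1)
var_r = (cov[0, 0] - 2 * r * cov[0, 1] + r**2 * cov[1, 1]) / (n * sessions.mean() ** 2)
print(naive_se, np.sqrt(var_r))
```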
Every platform gives you a sample size estimate before the experiment starts. Almost none revisit it after. Here is what to ask about during-experiment power monitoring.
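A sketch of the recheck, with hypothetical numbers: the plan assumed a standard deviation of 1.0, the experiment is observing 1.4, and nobody told the dashboard.

```python
from scipy.stats import norm

def achieved_power(n_per_arm, sigma, mde, alpha=0.05):
    # Power of a two-sided two-sample z-test at the variance actually observed.
    se = sigma * (2 / n_per_arm) ** 0.5
    return norm.sf(norm.ppf(1 - alpha / 2) - mde / se)

print(achieved_power(6_279, sigma=1.0, mde=0.05))  # ~0.80, as planned
print(achieved_power(6_279, sigma=1.4, mde=0.05))  # ~0.52, quietly underpowered
```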
Every platform measures user behavior after exposure. The question most buyers never ask is: over what time period? Here is what to ask about observation windows.
Every platform lets you look at results. The question is whether the platform looks for you. Here is what to ask about monitoring and alerting.
Most platforms let you randomize by account or store. The question is what happens in the analysis after randomization. Here is what to ask.
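A minimal simulation of why the analysis step matters, with hypothetical account data: a user-level standard error after account-level randomization understates the uncertainty several-fold compared with analyzing one observation per randomization unit.

```python
import numpy as np

rng = np.random.default_rng(6)
accounts = 200
sizes = rng.poisson(50, accounts) + 1        # ~50 users per account
account_effect = rng.normal(0, 1, accounts)  # shared within-account component
values = np.concatenate([ae + rng.normal(0, 1, s)
                         for ae, s in zip(account_effect, sizes)])
labels = np.repeat(np.arange(accounts), sizes)

# Wrong: user-level SE, as if users had been randomized independently.
naive_se = values.std(ddof=1) / np.sqrt(values.size)

# Account-level analysis: one observation per randomization unit.
account_means = np.array([values[labels == a].mean() for a in range(accounts)])
cluster_se = account_means.std(ddof=1) / np.sqrt(accounts)
print(naive_se, cluster_se)  # the clustered SE is several times larger
```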
When a user generates zero events, the platform makes a choice that changes what the experiment measures. Most vendors do not document which choice they make.
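A sketch of the choice, on made-up data: fill the zeros in and you measure revenue per exposed user; drop them and you measure revenue per active user. A treatment that changes the share of active users moves the two numbers differently.

```python
import numpy as np

rng = np.random.default_rng(2)
exposed = 10_000                     # hypothetical exposed users
active = rng.random(exposed) < 0.6   # 40% never generate an event
revenue = np.where(active, rng.exponential(20, exposed), 0.0)

print(revenue.mean())          # ~12: revenue per exposed user (zeros filled in)
print(revenue[active].mean())  # ~20: revenue per active user (zeros dropped)
```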
Every platform lets you slice by dimension. The question is whether the platform controls the false positive rate when you do.
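A ten-line simulation makes the risk concrete: slice a no-effect A/A test into 20 segments at the usual 5% level, and some segment comes up significant in roughly two out of three experiments.

```python
import numpy as np

rng = np.random.default_rng(3)
sims, segments = 10_000, 20
z = rng.standard_normal((sims, segments))  # A/A test: every segment is truly null
any_hit = (np.abs(z) > 1.96).any(axis=1)   # any segment "significant" at 5%?
print(any_hit.mean())                      # ~0.64
```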
Read moreTopics where we deliberately chose not to ship the feature, and what to look for if you need it.
Most vendors support percentile metrics. Most implementations break down exactly when you need them. Here is what to ask instead of "do you support it?"
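One failure mode, sketched on simulated latency data: bootstrapping events as if they were independent understates the uncertainty of a p95 when events cluster within users; resampling whole users is slower but honest. All names and numbers here are illustrative.

```python
import numpy as np

rng = np.random.default_rng(4)
users = 2_000
effects = rng.lognormal(0, 0.5, users)  # per-user latency level
events = [e * rng.lognormal(0, 0.2, rng.integers(5, 50)) for e in effects]
latency = np.concatenate(events)

boot_iid, boot_user = [], []
for _ in range(500):
    # Shortcut: resample events as if they were independent.
    boot_iid.append(np.percentile(rng.choice(latency, latency.size), 95))
    # Honest: resample whole users, preserving within-user correlation.
    idx = rng.integers(0, users, users)
    boot_user.append(np.percentile(np.concatenate([events[i] for i in idx]), 95))

print(np.std(boot_iid), np.std(boot_user))  # the user-level SE is much larger
```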
Most experimentation programs do not need geo-lift. If you do, the question is whether the platform forces you to confront the assumptions that make or break the analysis.
Switchback experiments exist because standard A/B tests break down when users interact with each other. Only two vendors offer support. Here is what to ask.
The label "Bayesian" does not tell you what stopping rule is used. Here is what to ask instead.
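A sketch of why the label is not enough, on an A/A simulation with flat priors: a "stop when the posterior probability to beat control crosses 95%" rule, checked every 1,000 users, declares winners far more often than the nominal rate suggests. The priors, peek schedule, and horizon below are all illustrative.

```python
import numpy as np

rng = np.random.default_rng(5)

def prob_b_beats_a(conv_a, n_a, conv_b, n_b, draws=2_000):
    # Flat Beta(1,1) priors; Monte Carlo estimate of P(rate_B > rate_A).
    a = rng.beta(1 + conv_a, 1 + n_a - conv_a, draws)
    b = rng.beta(1 + conv_b, 1 + n_b - conv_b, draws)
    return (b > a).mean()

# A/A test: identical 5% conversion, peeking every 1,000 users per arm.
stops, sims = 0, 200
for _ in range(sims):
    a = rng.random(20_000) < 0.05
    b = rng.random(20_000) < 0.05
    for n in range(1_000, 20_001, 1_000):
        p = prob_b_beats_a(a[:n].sum(), n, b[:n].sum(), n)
        if p > 0.95 or p < 0.05:  # "95% probability to beat control" rule
            stops += 1
            break
print(stops / sims)  # well above the nominal 5% under continuous peeking
```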