The Experimentation RFP Series

Feature lists tend to oversimplify experimentation offerings. Implementations of the same feature differ, and a vague RFP leads to frustration and friction when the platform does not deliver what the checkmark promised. For each capability below, we describe how we would specify the RFP and what to look for to confirm the implementation is worth committing to.

Features Confidence has built

Topics where we have a connected implementation and can show what the connected version looks like.

We built this
How to Write an Experimentation Platform RFP for Sample Size Calculators

Most experimentation platforms have a sample size calculator. Almost none connect it to the analysis method the experiment will actually use. Here is what to ask instead of "do you have a calculator?"

We built this
How to Write an Experimentation Platform RFP for Sequential Testing

Every platform claims to support sequential testing. The claim is almost always incomplete. Here is what to ask instead of "do you have sequential testing?"

We built this
How to Write an Experimentation Platform RFP for Multi-Metric Decision Making

Every platform lets you add multiple metrics. Most will display results for each one. What they rarely tell you is what to do next. Here is what to ask.

We built this
How to Write an Experimentation Platform RFP for Multiple Testing Corrections

Every platform says it corrects for multiple comparisons. Most do, partially. Here is what to ask instead of "do you correct for multiple testing?"

We built this
How to Write an Experimentation Platform RFP for Variance Reduction

Every platform that offers variance reduction claims it cuts runtime by 20-50%. What is usually missing is how far that reduction actually reaches. Here is what to ask.

We built this
How to Write an Experimentation Platform RFP for Ratio Metrics

Revenue per user. Streams per session. Most platforms support ratio metrics. Most get the variance wrong. Here is what to ask instead of "do you support them?"

We built this
How to Write an Experimentation Platform RFP for Fixed-Power Designs

Every platform gives you a sample size estimate before the experiment starts. Almost none revisit it after. Here is what to ask about monitoring power while the experiment is running.

We built this
How to Write an Experimentation Platform RFP for Observation Windows and Time-in Metrics

Every platform measures user behavior after exposure. The question most buyers never ask is: over what time period? Here is what to ask about observation windows.

We built this
How to Write an Experimentation Platform RFP for Monitoring and Alerting

Every platform lets you look at results. The question is whether the platform looks for you. Here is what to ask about monitoring and alerting.

We built this
How to Write an Experimentation Platform RFP for Clustered Randomization

Most platforms let you randomize by account or store. The question is what happens in the analysis after randomization. Here is what to ask.

We built this
How to Write an Experimentation Platform RFP for Metric Zero-Handling

When a user generates zero events, the platform makes a choice that changes what the experiment measures. Most vendors do not document which choice they make.

We built this
How to Write an Experimentation Platform RFP for Exploratory Analysis and Dimensions

Every platform lets you slice by dimension. The question is whether the platform controls the false positive rate when you do.


Features we chose not to build

Topics where we deliberately chose not to ship the feature, and what to look for if you need it.

We chose not to
How to Write an Experimentation Platform RFP for Percentile Metrics

Most vendors support percentile metrics. Most implementations break down exactly when you need them. Here is what to ask instead of "do you support them?"

We chose not to
How to Write an Experimentation Platform RFP for Geo-Lift and Synthetic Control

Most experimentation programs do not need geo-lift. If you do, the question is whether the platform forces you to confront the assumptions that make or break the analysis.

We chose not to
How to Write an Experimentation Platform RFP for Switchback Experiments

Switchback experiments exist because standard A/B tests break down when users interact with each other. Only two vendors offer support. Here is what to ask.

We chose not to
How to Write an Experimentation Platform RFP for Bayesian Inference

The label "Bayesian" does not tell you which stopping rule is used. Here is what to ask instead.
