How do you design tests for a feature flag system?

Question

Accepted Answer

Test the flag's two states (on/off), every combination with the other flags it interacts with, the rollout mechanism (percentage, user targeting), the off-default fallback, and the cleanup pathway. Don't trust the flag platform: assume it can return wrong values, and make sure the system still degrades gracefully when it does.

Feature flags are both a force multiplier for testing and a bug magnet. They expand the cross-product of system state, and naive tests miss the failure modes that flag rollouts cause.

Per-flag binary states. Test each flag both on and off. If a flag is added with default off, both states need explicit coverage before promotion.

Flag interactions. Two flags A and B that both control parts of the same flow give 4 combinations. With n interacting flags there are 2^n combinations; switch to pairwise testing once n > 4.

Targeting / rollout mechanisms:

Percentage rollout: user X is in the 10% bucket; assert that they consistently get the same value across requests.
User targeting: specific users / cohorts get the fla
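
As a rough illustration of the flag-interaction point, the sketch below enumerates all 2^n assignments for a small set of flags and, for larger sets, greedily picks a subset that still covers every on/off pair for every two flags. The function names and the flag names in the usage lines are hypothetical. The greedy selection is one simple way to build a pairwise set, not a minimal one, and it still enumerates every candidate internally, so it only suits a moderate number of flags.

```python
from itertools import combinations, product


def all_combinations(flags):
    """Exhaustive 2^n on/off assignments; reasonable while n is small."""
    return [
        dict(zip(flags, values))
        for values in product([False, True], repeat=len(flags))
    ]


def pairwise_combinations(flags):
    """Greedy subset of assignments that covers every value pair of every two flags."""
    # Every (flag_a, value_a, flag_b, value_b) tuple that some test case must hit.
    uncovered = {
        (a, va, b, vb)
        for a, b in combinations(flags, 2)
        for va, vb in product([False, True], repeat=2)
    }
    candidates = all_combinations(flags)
    chosen = []
    while uncovered:
        def gain(assignment):
            # Number of still-uncovered pairs this assignment would cover.
            return sum(
                1
                for (a, va, b, vb) in uncovered
                if assignment[a] == va and assignment[b] == vb
            )

        best = max(candidates, key=gain)
        chosen.append(best)
        uncovered = {
            (a, va, b, vb)
            for (a, va, b, vb) in uncovered
            if not (best[a] == va and best[b] == vb)
        }
    return chosen


# Hypothetical flag names, for illustration only.
two_flags = ["new_checkout", "express_pay"]
six_flags = ["flag_a", "flag_b", "flag_c", "flag_d", "flag_e", "flag_f"]

exhaustive_cases = all_combinations(two_flags)      # 4 cases for 2 flags
pairwise_cases = pairwise_combinations(six_flags)   # far fewer than 2^6 = 64
```

Each dictionary in `exhaustive_cases` or `pairwise_cases` can then drive one test run of the flow under that flag assignment.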

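The percentage-rollout assertion and the "don't trust the platform" advice can be sketched as below. This assumes a hash-based bucketing scheme, which the answer does not specify, and real flag platforms may bucket differently. `bucket`, `in_rollout`, `safe_flag`, and the provider callables are all hypothetical names. The wrapper only guards against errors and non-boolean values; a wrong but well-formed boolean has to be caught by the on/off tests of the feature itself.

```python
import hashlib


def bucket(flag_key: str, user_id: str) -> int:
    """Map (flag, user) to a stable bucket in [0, 100). Hypothetical scheme."""
    digest = hashlib.sha256(f"{flag_key}:{user_id}".encode("utf-8")).hexdigest()
    return int(digest[:8], 16) % 100


def in_rollout(flag_key: str, user_id: str, percentage: int) -> bool:
    """True if the user falls inside the first `percentage` buckets."""
    return bucket(flag_key, user_id) < percentage


def safe_flag(provider, flag_key: str, user_id: str, default: bool = False) -> bool:
    """Ask the provider, falling back to `default` on errors or non-boolean values."""
    try:
        value = provider(flag_key, user_id)
    except Exception:
        return default
    return value if isinstance(value, bool) else default


def test_same_user_gets_same_value_across_requests():
    # Repeated evaluations for one user must collapse to a single answer.
    results = {in_rollout("new_checkout", "user-42", 10) for _ in range(100)}
    assert len(results) == 1


def test_broken_platform_falls_back_to_default():
    def raising_provider(flag_key, user_id):
        raise TimeoutError("flag service unavailable")

    def garbage_provider(flag_key, user_id):
        return "yes"  # wrong type on purpose

    assert safe_flag(raising_provider, "new_checkout", "user-42") is False
    assert safe_flag(garbage_provider, "new_checkout", "user-42") is False
    assert safe_flag(lambda f, u: True, "new_checkout", "user-42") is True
```

The test functions use plain `assert`, so they run under pytest or can be called directly.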