What's the difference between blue-green and canary deployments from a testing perspective?

Question

Accepted Answer

Blue-green flips 100% of traffic at once after smoke tests pass on green — easy rollback, but if smoke missed a bug, everyone hits it. Canary shifts traffic gradually (1% → 10% → 100%) while watching error rate and latency — small blast radius, but slower and needs strong observability. Blue-green runs two identical environments. Blue is current production. Green is the new version, deployed and warmed but receiving zero user traffic. You run smoke tests against green; on pass, flip the load balancer to send 100% of traffic to green. Blue stays around for instant rollback. Testing implications: Smoke tests on green need to be comprehensive — once you flip, every user sees it. Database migrations are tricky — if green needs a schema change, both versions need to coexist during the flip (expand-contract pattern). Rollback is fast (flip the LB back). One-shot risk: if smoke missed a bug or the bug only manifests under real production load, every user feels it. Canary sends a small percent

What's the difference between blue-green and canary deployments from a testing perspective?

// WHAT INTERVIEWERS LOOK FOR

// COMMON PITFALL

What's the difference between blue-green and canary deployments from a testing perspective?

Short answer

Detail

// WHAT INTERVIEWERS LOOK FOR

// COMMON PITFALL