Performance testing interview questions

// 38 QUESTIONS · UPDATED MAY 2026

Performance testing interview questions covering JMeter, k6, Gatling, load patterns, percentile analysis, and how to diagnose bottlenecks from real-world test data.

  1. What is the difference between load testing and stress testing? (Junior)

    Load testing measures behaviour at expected production traffic. Stress testing pushes well beyond that to find the breaking point and how…

  2. Why do you report p95 instead of average response time? (Mid)

    Averages hide the long tail. An endpoint that's fast for 95% of users but takes 8 seconds for the slowest 5% will look 'fine' on average bu…
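A small sketch of the point, using made-up latencies (940 requests at 0.2 s and 60 at 8 s, not data from any real test) and a nearest-rank `percentile` helper written here purely for illustration:

```python
def percentile(samples, p):
    """Nearest-rank percentile: smallest value with at least p% of samples at or below it."""
    ordered = sorted(samples)
    rank = (p * len(ordered) + 99) // 100  # integer ceil(p * n / 100); p is a whole number
    return ordered[max(rank, 1) - 1]


# Hypothetical latencies in seconds: 94% fast, 6% very slow.
latencies = [0.2] * 940 + [8.0] * 60

mean = sum(latencies) / len(latencies)
p50 = percentile(latencies, 50)
p95 = percentile(latencies, 95)

print(f"mean={mean:.3f}s p50={p50}s p95={p95}s")
```

With this sample the mean sits below 0.7 s while p95 is the full 8 s, which is why the tail has to be reported separately.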

  3. What are the key metrics in performance testing? (Junior)

    Response time (usually as percentiles: p50/p95/p99), throughput (requests or transactions per second), error rate (% of failed requests)…

  4. How would you compare JMeter, k6, and Gatling at a high level? (Junior)

    JMeter is GUI-driven Java, mature, plugin-rich, heavy on RAM. k6 is JavaScript, scriptable, CLI-first, modern, integrates well with CI. G…

  5. What's the difference between concurrency and arrival rate? (Junior)

    Concurrency (closed model) holds N users active at once, so if the system slows, fewer requests per second. Arrival rate (open model) injec…
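The two models can be compared with a little arithmetic. The sketch below assumes zero think time and steady state (Little's law); the function names and the figures of 100 users and 200 requests per second are illustrative, not taken from any tool:

```python
def closed_model_rps(users, response_time_s):
    """Closed model: each of N users sends the next request only after the previous one returns."""
    return users / response_time_s


def open_model_in_flight(arrival_rate_rps, response_time_s):
    """Open model: requests arrive at a fixed rate, so slower responses pile up in flight."""
    return arrival_rate_rps * response_time_s


for response_time in (0.5, 2.0):
    print(
        f"response={response_time}s "
        f"closed(100 users)={closed_model_rps(100, response_time):.0f} rps "
        f"open(200 rps)={open_model_in_flight(200, response_time):.0f} in flight"
    )
```

When responses slow from 0.5 s to 2 s, the closed model's throughput drops, while the open model keeps injecting at the same rate and the number of requests in flight grows instead.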

  6. Walk through designing a load test for a checkout flow. (Mid)

    Map the user journey end-to-end, parametrise with realistic data (carts, addresses, cards), mix transaction types by production ratio, mo…

  7. How do you handle dynamic data like CSRF tokens and session IDs in JMeter? (Mid)

    Extract values from prior responses with regex/JSON/CSS extractors, store as JMeter variables, then reference them in subsequent requests…

  8. What are think times and why do they matter in load tests? (Mid)

    Think time is the pause a real user makes between actions: reading a page, deciding, typing. Removing think times multiplies effective l…
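To put a rough number on that multiplier, here is a sketch for a single virtual user in a closed loop. The 0.25 s response time and 4.75 s think time are hypothetical values chosen to keep the arithmetic readable:

```python
def per_user_rps(response_time_s, think_time_s=0.0):
    """Requests per second one virtual user generates in a closed loop."""
    return 1.0 / (response_time_s + think_time_s)


# Hypothetical numbers: 0.25 s responses, 4.75 s of think time between actions.
with_think = per_user_rps(0.25, think_time_s=4.75)
without_think = per_user_rps(0.25)

print(f"with think time: {with_think:.2f} rps per user")
print(f"without think time: {without_think:.2f} rps per user")
print(f"load multiplier: {without_think / with_think:.0f}x")
```

Under these assumptions, dropping think time makes each virtual user generate 20 times the request rate, so the same user count represents a very different load.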

  9. How do you parametrise test data in k6? (Mid)

    Load CSV or JSON via SharedArray for memory efficiency across VUs, pick rows by VU/iteration index for uniqueness, pull secrets from envi…

  10. How do you set realistic SLOs for a load test? (Mid)

    Start from production data: historical p95/p99 latencies and user-impact studies. Set SLOs tighter than current performance to drive imp…

  11. What's the difference between load testing and stress testing in how you actually execute them? (Mid)

    Load runs at expected target RPS sustained for a long hold, with pass/fail against SLOs and no surprises wanted. Stress ramps past target until…

  12. How would you use k6 thresholds to fail a CI pipeline on regression? (Mid)

    Define thresholds per metric (p95 latency, error rate, custom metrics); k6 exits non-zero on breach, failing the pipeline step. Persist…

  13. How do you correlate response time with backend resource saturation? (Mid)

    Run the load test while collecting host and service metrics (CPU, memory, IO, DB pool, GC, queue depth) on synchronised timestamps. Plot…

  14. What's the difference between server-side and client-side performance metrics? (Mid)

    Server-side measures backend behaviour (request latency, DB time, throughput); load tools and APM see this. Client-side measures user ex…

  15. How do you isolate whether a slow response is the database, application, or network? (Senior)

    Use distributed tracing to break the request into spans: DB query time, app compute time, network legs. APM (Datadog, New Relic) shows p…

  16. Walk me through how you'd plan capacity testing for a Black Friday spike. (Senior)

    Model expected peak from historical data (e.g. 5x last year's peak), test 2x peak to verify headroom, run scenarios for spike, sustained,…

  17. How do you load test a system with heavy WebSocket or SSE traffic? (Senior)

    Use k6's ws module or Gatling for WebSocket: open many concurrent connections, send/receive messages, measure connection-establishment t…

  18. How do you handle warm-up periods and cold-cache effects in load tests? (Senior)

    Run an explicit warm-up stage at 10-20% of target load to populate caches, JIT-compile, and prime connection pools; discard those metric…
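One way to discard warm-up measurements after the fact is to filter samples by elapsed time. This is a minimal sketch: the `Sample` record, the 60-second window, and the latencies are all hypothetical, and real tools usually offer their own way to tag or exclude a warm-up stage:

```python
from dataclasses import dataclass


@dataclass
class Sample:
    elapsed_s: float   # seconds since the test started
    latency_ms: float


def drop_warmup(samples, warmup_s):
    """Keep only samples recorded after the warm-up window has ended."""
    return [s for s in samples if s.elapsed_s >= warmup_s]


# Hypothetical run: the first two samples hit cold caches during a 60 s warm-up.
samples = [Sample(5, 900.0), Sample(30, 450.0), Sample(75, 120.0), Sample(120, 110.0)]
steady = drop_warmup(samples, warmup_s=60)
print([s.latency_ms for s in steady])
```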

  19. How do you build a sustainable performance test suite that runs in CI without becoming a bottleneck? (Senior)

    Tier the suite: fast smoke perf on every PR (under 5 min, narrow scope), full load nightly, soak weekly. Use thresholds that fail loud o…

  20. What's your approach to soak testing memory leaks over 8+ hour runs? (Senior)

    Run sustained moderate load (30-50% of peak) for 8-24 hours while capturing heap, RSS, file descriptors, DB connection counts, and GC fre…
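Once those readings are captured, a simple trend check is one possible way to spot steady growth. The sketch below fits a least-squares line to hourly RSS readings; the readings are invented, and treating the slope as a leak signal is an assumption of this example rather than a method the answer prescribes:

```python
def slope_per_hour(hours, values):
    """Least-squares slope of values against time, in units per hour."""
    n = len(hours)
    mean_x = sum(hours) / n
    mean_y = sum(values) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(hours, values))
    variance = sum((x - mean_x) ** 2 for x in hours)
    return covariance / variance


# Hypothetical hourly RSS readings in MB over an 8-hour soak.
hours = [0, 1, 2, 3, 4, 5, 6, 7, 8]
rss_mb = [512, 530, 541, 560, 575, 590, 611, 624, 640]

growth = slope_per_hour(hours, rss_mb)
print(f"RSS grows about {growth:.1f} MB/hour")
```

A slope that stays clearly positive under constant load is worth investigating; what counts as acceptable growth depends on the service and is left open here.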

  21. How would you generate realistic test data at scale for a marketplace search load test? (Senior)

    Sanitised production query logs are gold: anonymise PII, then replay actual query distributions. For synthetic generation, model Zipfian…
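For the synthetic route, a Zipf-like popularity curve can be approximated with weighted sampling from the standard library. The query list, the exponent of 1.0, and the seed below are hypothetical; a real model would fit the exponent to production logs:

```python
import random


def zipf_weights(n_items, exponent=1.0):
    """Unnormalised Zipf weights: the item at rank r gets weight 1 / r**exponent."""
    return [1.0 / (rank ** exponent) for rank in range(1, n_items + 1)]


# Hypothetical search terms, ordered from most to least popular.
queries = ["shoes", "laptop", "headphones", "desk lamp", "yoga mat"]
weights = zipf_weights(len(queries))

rng = random.Random(42)  # fixed seed so the run is repeatable
sampled = rng.choices(queries, weights=weights, k=10_000)

for query in queries:
    print(f"{query:<12} {sampled.count(query)}")
```

The head terms dominate the sample while the tail still appears, which exercises both hot cache entries and rarely requested ones.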

  22. How do you reproduce and root-cause a flaky performance result? (Senior)

    Check noisy-neighbour effects (shared infra, CI runner contention), GC pauses, connection-pool warm-up, downstream rate limits. Re-run wi…

  23. How would you justify investment in a dedicated performance testing function? (Lead)

    Tie to incidents avoided (cost of one Black Friday outage), customer impact (latency complaints, churn correlation with p95), and enginee…

  24. What is spike testing and how does it differ from stress testing? (Junior)

    Spike testing applies a sudden, extreme burst of load for a short duration to see whether the system recovers gracefully. Stress testing…

  25. What is volume testing and when would you use it? (Junior)

    Volume testing verifies that the system behaves correctly when it handles large amounts of data: not concurrent users, but data size. Us…

  26. What are Core Web Vitals and why do they matter for web performance testing? (Junior)

    Core Web Vitals are Google's set of user-experience performance metrics: LCP (Largest Contentful Paint, load speed), INP (Interaction to…

  27. How do you design a realistic workload model for a load test? (Mid)

    A workload model defines who your virtual users are, what they do, in what proportion, and at what pace. Derive it from production access…

  28. What is a ramp-up period in a load test and why does it matter? (Mid)

    A ramp-up period gradually increases virtual users from zero to the target load over a defined time, rather than hitting full load immedi…

  29. What is a performance test baseline and how do you establish one? (Mid)

    A baseline is a measured performance profile of the system at a known, stable state, usually the current production version. It sets the…

  30. How do you detect performance regressions between releases automatically? (Mid)

    Compare each run's key metrics against the stored baseline using percentage-deviation thresholds. Fail the build when p95 or p99 exceeds…
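The comparison itself can be a few lines of code. In this sketch the metric names, the baseline and current values, and the 10% tolerance are all hypothetical; the actual thresholds belong to whoever owns the baseline:

```python
def find_regressions(baseline, current, max_increase_pct):
    """Return metrics whose current value exceeds baseline by more than the allowed percentage."""
    regressions = {}
    for metric, allowed_pct in max_increase_pct.items():
        change_pct = (current[metric] - baseline[metric]) / baseline[metric] * 100
        if change_pct > allowed_pct:
            regressions[metric] = round(change_pct, 1)
    return regressions


# Hypothetical latencies in milliseconds and hypothetical tolerances.
baseline = {"p95_ms": 400.0, "p99_ms": 800.0}
current = {"p95_ms": 460.0, "p99_ms": 820.0}
tolerance = {"p95_ms": 10.0, "p99_ms": 10.0}

failed = find_regressions(baseline, current, tolerance)
exit_code = 1 if failed else 0  # a CI wrapper would pass this to sys.exit()
print(failed, exit_code)
```

Here p95 has risen 15% against a 10% tolerance and is flagged, while the 2.5% rise in p99 passes.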

  31. How do you test the performance impact of caching (Redis, CDN, database query cache) in a load test? (Mid)

    Run the test in two modes: cold-cache (cache cleared before run) and warm-cache (cache seeded or pre-warmed). The difference in p95 betwe…

  32. How do you integrate Lighthouse into a CI pipeline to catch frontend performance regressions? (Mid)

    Use the Lighthouse CI npm package to run Lighthouse against a running preview build on every PR. Define assertions for Core Web Vitals an…

  33. How does test environment fidelity affect your confidence in performance test results? (Mid)

    A performance test run against an under-specced environment produces results that are valid only for that environment. Always document th…

  34. How do you generate distributed load when a single machine is not enough? (Senior)

    Use your tool's native distributed mode: k6 Operator on Kubernetes, JMeter in distributed mode with controller + injectors, or Gatling En…

  35. How do you test API rate limiting under load? (Senior)

    Test three scenarios: normal traffic below the limit (no 429s), burst traffic that hits the limit (429s appear at the expected threshold)…

  36. How do you test database query performance under realistic load? (Senior)

    Enable slow query logging during load tests, use EXPLAIN ANALYZE on queries that appear in the slow log, and measure query latency at bot…

  37. How would you build a performance regression detection system that scales across multiple services? (Senior)

    Standardise the test-and-report pipeline across services: a common schema for result storage, per-service baselines versioned with code,…

  38. How do you define and maintain performance SLOs across multiple services in a growing organisation? (Lead)

    Define SLOs at the service level, derived from user-facing business requirements rather than infrastructure capabilities. Review and upda…