A flaky test produces different results across runs without code changes. It passes on one execution and fails on the next, or passes only after a retry.

Quick Reference

| Location | What it shows | Link |
| --- | --- | --- |
| Dashboard → QA View | Most flaky tests in the selected period | QA Dashboard |
| Dashboard → Developer View | Flaky tests by author | Developer Dashboard |
| Analytics → Summary | Flakiness trends over time | Analytics Summary |
| Test Runs → Summary | Flaky counts by category | Test Run Summary |
| Test Case → History | Stability percentage | Test Case History |
| Specs | Flaky rate per spec file | Specs Explorer |

How is flaky test detection activated?

Flaky test detection activates automatically when retries are enabled in Playwright; no additional configuration is required. When a test fails on the first attempt but passes on retry, Playwright marks it as flaky.

playwright.config.ts

```ts
import { defineConfig } from '@playwright/test';

export default defineConfig({
  // Retry failed tests up to twice on CI; do not retry locally.
  retries: process.env.CI ? 2 : 0,
});
```

How does TestDino detect flaky tests?

TestDino identifies flaky tests in two ways:

A test that fails initially but passes on retry is marked flaky. The retry count appears in the test details.

(Image: test marked as flaky after passing on retry)

Both detection methods indicate that the test result depends on something other than your code.
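The retry-based rule can be sketched as a small classifier over a test's attempt history (a sketch with hypothetical names, not TestDino's actual implementation):

```typescript
// Classify a test from the outcomes of its attempts, the way
// retry-based flaky detection works: a pass that needed retries is flaky.
type Outcome = 'passed' | 'failed';

function classify(attempts: Outcome[]): 'passed' | 'failed' | 'flaky' {
  const last = attempts[attempts.length - 1];
  if (last === 'failed') return 'failed';        // never recovered
  return attempts.some((a) => a === 'failed')
    ? 'flaky'                                    // recovered on a retry
    : 'passed';                                  // clean pass, no retries needed
}

console.log(classify(['passed']));           // → passed
console.log(classify(['failed', 'passed'])); // → flaky
console.log(classify(['failed', 'failed'])); // → failed
```

With `retries: 2`, a test gets up to three attempts, so `['failed', 'failed', 'passed']` would also classify as flaky.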

Where to find flaky tests?

| Location | What it shows | Link |
| --- | --- | --- |
| Dashboard → QA View → Most Flaky Tests | Lists tests with the highest flaky rates in the selected period | QA Dashboard |
| Analytics → Summary → Flakiness & Test Issues | Displays flakiness trends over time with a list of affected tests | Analytics Summary |
| Test Runs → Summary | Each run shows flaky test counts with sub-categories: Timing Related, Environment Dependent, Network Dependent, Assertion Intermittent, and Other | Test Run Summary |
| Test Case → History | Shows stability percentage and execution history for a single test | Test Case History |

Flaky Test Categories

TestDino classifies flaky tests by root cause:
| Category | Description |
| --- | --- |
| Timing Related | Race conditions, order dependencies, and insufficient waits |
| Environment Dependent | Fails only in specific environments or runners |
| Network Dependent | Intermittent API or service failures |
| Assertion Intermittent | Non-deterministic data causes occasional mismatches |
| Other | Unstable for reasons outside the above |

Common reasons tests become flaky

  • Using fixed waits instead of waiting for the page to be ready
  • Missing `await`, so steps run out of order
  • Weak selectors that break when an element changes or match more than one element
  • Tests that share data and affect each other
  • Parallel runs that collide because multiple tests use the same user or record
  • Slow or unstable network or third-party APIs
  • CI setup that differs from the local run
See Prevention for detailed strategies to avoid flaky tests.

Impact of flaky tests on CI Checks

GitHub CI Checks can handle flaky tests in two ways:

Flaky tests count as failures. Use this mode for production branches where stability matters.

See GitHub CI Checks for configuration details.

Next Steps