Skip to content

Run

4 emulators 572 tests

Results as of this run. The arrow shows each target's movement since the previous run it was tested in. The suite grew this run, so a downward arrow can be the new tests biting rather than a target regressing.

Suite grew from 526 to 572 tests this run.

That's 46 new tests measured against every target. Movement below compares to the previous run, so a fall here is as likely to be the stricter suite as a real regression.

What changed in the suite this run

Suite on

Grew to 572 tests, up 46 on the first run: thirty-six more in Tier 1 and ten more in Tier 2, deepening coverage of the core and complete-feature behaviour.

  1. live (AWS) · full coverage
    100% ground truth
    Tier 1 100%
    Tier 2 100%
    Tier 3 100%
  2. - · full coverage
    100.0% 0.0pp unchanged
    Tier 1 100.0%
    Tier 2 100.0%
    Tier 3 100.0%
  3. - · full coverage
    93.5% +0.6pp rose 0.6 percentage points
    Tier 1 99.0%
    Tier 2 96.1%
    Tier 3 81.9%
  4. - · full coverage
    92.7% +0.6pp rose 0.6 percentage points
    Tier 1 99.0%
    Tier 2 91.3%
    Tier 3 81.9%
  5. - · 43 unsupported
    87.3% -0.9pp fell 0.9 percentage points
    Tier 1 98.3%
    Tier 2 16.7%
    Tier 3 92.8%