Current state and future vision for iOS & Android test automation
| Metric | ๐ iOS | ๐ค Android |
|---|---|---|
| Total Pipelines (30 days) | 1,001 | 351 |
| Overall Success Rate | 48.5% | 79.7% |
| Overall Failure Rate | 51.4% | 20.2% |
| Total Failures | 491 | 70 |
| Main Branch Success | 26.2% | 78.7% |
| Main Branch Failures | 245 / 332 | 25 / 117 |
| Release Branch Success | 76.5% | 80.0% |
| Primary Failure Source | UI Tests | Emulator.wtf |
| Top Failing Job | test:ui:draco | ui-tests:nightly |
| Coverage Tool | llvm-cov | Kover |
| Coverage Integration | โ SonarQube | โ SonarQube + Badge |
| Coverage Job | In test:unit | Dedicated job |
Unit tests run on every MR, providing immediate feedback to developers
Comprehensive coverage reports integrated with SonarQube for quality metrics
Tests run independently with no dependencies, maximizing pipeline speed
iOS UI tests are manual-only on MRs, reducing confidence before merge
Emulator.wtf runs nightly only, catching issues after they're merged
No unified dashboard for test results across both platforms
No systematic approach to identifying and fixing flaky tests
245 failures out of 332 main pipelines
3 out of 4 main builds fail
25 failures out of 117 main pipelines
Better but still concerning
P0 IMMEDIATE: 73.7% main failure (245/332). Root cause: UI tests. War room needed. This blocks ALL development.
P0 THIS WEEK: Fix test:ui:draco and test:ui:zhl. Add retries, improve assertions, reduce flakiness. 245 failures traced to this.
P1 THIS SPRINT: Reduce 21.3% to <5%. Fix emulator.wtf nightly tests. Address MR !1682 flakiness (9 failures).
P1 NEXT SPRINT: Build flakiness dashboard. Track test health. Alert on degradation. Prevent future crises.
Catch issues earlier, ship with confidence