[build-tools] Store per-attempt results for Maestro test retries by hSATAC · Pull Request #3498 · expo/eas-cli

hSATAC · 2026-03-12T08:38:13Z

Why

Maestro test retries currently overwrite JUnit reports and device logs on each attempt, so only the final attempt's results are stored. This means we lose visibility into earlier failed attempts, making it harder to diagnose flaky tests.

Depends on: expo/universe#25691 and expo/universe#25692 (must be deployed first — the new unique index on (turtle_job_run_id, path, retry_count) is required to accept multiple results per test path).

How

Three changes across build-tools:

internalMaestroTest.ts — Move outputPath and device log path inside the retry loop with an attempt-N suffix, so each attempt produces a separate
file instead of overwriting. Removed the now-unnecessary fs.promises.rm call.
maestroResultParser.ts — Extract an internal parseJUnitFile helper, then rewrite the merge logic in parseMaestroResults to group JUnit results by
flow name. When multiple files exist for the same flow, the attempt index is extracted from the filename (attempt-N). For single-file flows (backward
compat with reuse_devices: true old format), retryCount is still derived from ai-*.json occurrence count.
reportMaestroTestResults.ts — Update the duplicate-name guard to check (name, retryCount) pairs instead of just names. Same name with different
retryCount is expected (per-attempt results), while same (name, retryCount) indicates a real naming conflict.

Test Plan

Added test for per-attempt report filenames in internalMaestroTest.test.ts
Added 4 tests in maestroResultParser.test.ts: per-attempt results, reuse_devices=true per-attempt, sharding with multiple testsuites, and backward compat with old single JUnit file
Added test in reportMaestroTestResults.test.ts for same name with different retryCount being sent (not skipped)
All 49 tests across the 3 test files pass

Signed-off-by: Ash Wu <hsatac@gmail.com>

linear · 2026-03-12T08:38:16Z

ENG-19926 Store per-attempt results for Maestro test retries

github-actions · 2026-03-12T08:38:38Z

⏩ The changelog entry check has been skipped since the "no changelog" label is present.

codecov · 2026-03-12T08:43:06Z

Codecov Report

❌ Patch coverage is 98.00000% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 53.62%. Comparing base (c51ac27) to head (8620121).

Files with missing lines	Patch %	Lines
...d-tools/src/steps/functions/maestroResultParser.ts	97.73%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #3498      +/-   ##
==========================================
+ Coverage   53.56%   53.62%   +0.06%     
==========================================
  Files         815      815              
  Lines       34566    34600      +34     
  Branches     7175     7181       +6     
==========================================
+ Hits        18513    18551      +38     
+ Misses      15971    15967       -4     
  Partials       82       82

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

github-actions · 2026-03-12T08:47:57Z

Subscribed to pull request

File Patterns	Mentions
*/	@douglowder

Generated by CodeMention

Copilot

Pull request overview

This PR updates the Maestro test pipeline in build-tools to preserve per-attempt artifacts and results when retries are enabled, instead of overwriting the final attempt’s output, and adjusts parsing/reporting to support multiple results per flow.

Changes:

Generate per-attempt Maestro report and device-log filenames (e.g., *-attempt-N.*) so retries don’t overwrite prior artifacts.
Update Maestro JUnit parsing/merge logic to emit per-attempt flow results by grouping by flow name and extracting attempt index from the JUnit filename.
Update result reporting to only treat duplicates as conflicts when they share the same (name, retryCount) pair, and add/extend tests for the new behaviors.

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
packages/build-tools/src/steps/functions/internalMaestroTest.ts	Writes unique report/device-log outputs per retry attempt.
packages/build-tools/src/steps/functions/maestroResultParser.ts	Parses JUnit per file and merges results into per-attempt flow outputs.
packages/build-tools/src/steps/functions/reportMaestroTestResults.ts	Adjusts duplicate detection to allow same name across different retry attempts.
packages/build-tools/src/steps/functions/tests/internalMaestroTest.test.ts	Adds coverage for per-attempt report filename generation.
packages/build-tools/src/steps/functions/tests/maestroResultParser.test.ts	Adds coverage for per-attempt parsing (including reuse_devices + sharding shapes + backward compat).
packages/build-tools/src/steps/functions/tests/reportMaestroTestResults.test.ts	Verifies per-attempt results with same name but different retryCount are reported.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-12T13:39:33Z

packages/build-tools/src/steps/functions/maestroResultParser.ts

+      // Multiple results — per-attempt JUnit files. Sort by attempt index from filename.
+      const sorted = flowEntries
+        .map(entry => {
+          const match = entry.sourceFile.match(ATTEMPT_PATTERN);
+          const attemptIndex = match ? parseInt(match[1], 10) : 0;
+          return { ...entry, attemptIndex };
+        })
+        .sort((a, b) => a.attemptIndex - b.attemptIndex);


When multiple JUnit XML files exist for the same flow, entries without an attempt-N match are assigned attemptIndex = 0. If the directory contains multiple files for the same flow that don't follow the attempt-N naming (e.g., leftover files or a different naming scheme), all of those results will get retryCount: 0, which can lead to false conflicts and the entire report being skipped. Consider falling back to the sorted position (0..n-1) for unmatched files, or deriving retryCount from flowOccurrences when the attempt index can’t be parsed, to keep results distinct and reporting robust.

Suggested change

// Multiple results — per-attempt JUnit files. Sort by attempt index from filename.

const sorted = flowEntries

.map(entry => {

const match = entry.sourceFile.match(ATTEMPT_PATTERN);

const attemptIndex = match ? parseInt(match[1], 10) : 0;

return { ...entry, attemptIndex };

})

.sort((a, b) => a.attemptIndex - b.attemptIndex);

// Multiple results — per-attempt JUnit files.

// Prefer attempt index from filename (attempt-N); fall back to per-flow position (0..n-1)

// to keep retryCount values distinct even when filenames don't follow the pattern.

const sorted = flowEntries

.map((entry, index) => {

const match = entry.sourceFile.match(ATTEMPT_PATTERN);

const parsed = match ? parseInt(match[1], 10) : NaN;

const hasExplicitAttemptIndex = Number.isInteger(parsed) && parsed >= 0;

const attemptIndex = hasExplicitAttemptIndex ? parsed : index;

return { ...entry, attemptIndex, hasExplicitAttemptIndex };

})

.sort((a, b) => {

if (a.hasExplicitAttemptIndex && b.hasExplicitAttemptIndex) {

return a.attemptIndex - b.attemptIndex;

}

if (a.hasExplicitAttemptIndex) return -1;

if (b.hasExplicitAttemptIndex) return 1;

return a.attemptIndex - b.attemptIndex;

});

[build-tools] Store per-attempt results for Maestro test retries

8620121

Signed-off-by: Ash Wu <hsatac@gmail.com>

hSATAC added the no changelog PR that doesn't require a changelog entry label Mar 12, 2026

hSATAC marked this pull request as ready for review March 12, 2026 08:47

hSATAC requested a review from sjchmiela March 12, 2026 08:47

sjchmiela requested a review from Copilot March 12, 2026 13:33

Copilot started reviewing on behalf of sjchmiela March 12, 2026 13:35 View session

Copilot AI reviewed Mar 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[build-tools] Store per-attempt results for Maestro test retries#3498

[build-tools] Store per-attempt results for Maestro test retries#3498
hSATAC wants to merge 1 commit intomainfrom
ash/eng-19926-store-per-attempt-results-for-maestro-test-retries-client

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

-      // Multiple results — per-attempt JUnit files. Sort by attempt index from filename.
-      const sorted = flowEntries
-        .map(entry => {
-          const match = entry.sourceFile.match(ATTEMPT_PATTERN);
-          const attemptIndex = match ? parseInt(match[1], 10) : 0;
-          return { ...entry, attemptIndex };
-        })
-        .sort((a, b) => a.attemptIndex - b.attemptIndex);
+      // Multiple results — per-attempt JUnit files.
+      // Prefer attempt index from filename (attempt-N); fall back to per-flow position (0..n-1)
+      // to keep retryCount values distinct even when filenames don't follow the pattern.
+      const sorted = flowEntries
+        .map((entry, index) => {
+          const match = entry.sourceFile.match(ATTEMPT_PATTERN);
+          const parsed = match ? parseInt(match[1], 10) : NaN;
+          const hasExplicitAttemptIndex = Number.isInteger(parsed) && parsed >= 0;
+          const attemptIndex = hasExplicitAttemptIndex ? parsed : index;
+          return { ...entry, attemptIndex, hasExplicitAttemptIndex };
+        })
+        .sort((a, b) => {
+          if (a.hasExplicitAttemptIndex && b.hasExplicitAttemptIndex) {
+            return a.attemptIndex - b.attemptIndex;
+          }
+          if (a.hasExplicitAttemptIndex) return -1;
+          if (b.hasExplicitAttemptIndex) return 1;
+          return a.attemptIndex - b.attemptIndex;
+        });

Conversation

Uh oh!

Why

How

Test Plan

Uh oh!

Uh oh!

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants