Browser Harness QA Signoff
Date: 2026-03-12
Scope: Browser Harness v1, browser-only providers, judge-provider decoupling, fixture-based manual signoff
This inventory is the canonical manual/browser QA checklist for Browser Harness. It complements automated backend, frontend, and Playwright coverage and is the required input to the final playwright-interactive signoff.
Runtime Assumptions
- The app is reachable in a browser.
- Browser Harness fixtures are reachable from the same app host.
- The source dataset already exists in the current workspace.
- At least one API provider exists for judge scoring when the dataset requires judge evaluation.
User-Visible Claims To Sign Off
- The app exposes a dedicated
Browser Harnessroute and navigation entry. - The page blocks export when the selection is incompatible with v1 constraints.
ChatGPT,Gemini, andCustomflows can each generate a script, capture responses, and import a result.- Imported captures create normal completed runs instead of a parallel browser-only result type.
- Imported runs are attributed to browser-only providers rather than API providers.
- Judge-required datasets can be configured with an explicit judge provider distinct from the run provider.
- Imported Browser Harness runs remain visible in
Runsand usable inComparison. - The Browser Harness page fits desktop and mobile viewports without clipping or horizontal dependency.
- No console errors or page errors occur during the signoff flows.
Controls And States
Exploratory Checks
- Try a mismatched origin and confirm the runtime refuses to run on the wrong host.
- Use the JSON fallback import path and confirm the app still creates a normal run.
Negative Confirmations
The final signoff note must explicitly confirm:
- no viewport clipping
- no horizontal overflow
- no wrong provider attribution
- no silently unscored judge-required run
- no broken deep link from Browser Harness import into
Runs - no console errors
- no page errors
Evidence Contract
Store the evidence bundle under .artifacts/manual-qa/browser-harness/:
browser-harness-signoff.json- desktop screenshots
- mobile screenshots
- run detail screenshot
- comparison screenshot
- any console/page error capture if non-empty
If a claim is not covered, record the reason explicitly. “Not tested” is not acceptable for the three fixture flows, run attribution, judge display, and viewport fit.
