Core Alpha Signoff Inventory
Date: 2026-03-12
Scope: PX-QA-003, PX-QA-023, /leaderboard truthfulness pass, shared alpha browser signoff
This inventory is the canonical coverage list for the current alpha product surface. It defines what must be verified before the final manual $playwright-interactive signoff is considered complete.
Runtime Assumptions
- The local stack is running and reachable from the browser.
- Backend and worker can be exercised against the Docker local OpenAI-compatible test gateway and a provider-backed gateway.
- For provider-backed signoff, the default gateway is
http://host.docker.internal:1234. - The baseline model is
qwen3.5-0.8b. - If a different verified endpoint is used, record that override in the final report.
User-Visible Claims To Sign Off
- Keyboard users can reach both
main contentandnavigationthrough skip links without focus loss. - The mobile navigation dialog can be opened and closed with keyboard input, including
Escape. - Visible images in the core alpha surfaces expose alt text instead of raw unlabeled media.
- The main shell and active runs board fit desktop and mobile viewports without horizontal dependency.
- Provider smoke test feedback is readable enough to trust before launching provider-backed runs.
Settingsaccurately describes the current alpha scope instead of implying hidden configuration or future controls already exist./leaderboardis a truthful roadmap / scope page and does not masquerade as a shipped Arena leaderboard.- Arena remains a design-track capability; the real operator paths today are
RunsandComparison.
Coverage Matrix
Report Contract
The automated/manual evidence bundle should write to .artifacts/manual-qa/ and include:
active-runs-ui-audit-report.json- screenshots for each major surface listed above
- the runtime endpoint/model used for provider-backed verification
- negative confirmations for:
- no clipping
- no horizontal overflow
- no focus jumps
- no broken
Escapedismissal - no missing alt text on visible images
- no copy that claims Arena / leaderboard is already shipped
Exit Criteria
- Every row in the coverage matrix has either screenshot evidence or an explicit automated check reference.
- Manual signoff confirms desktop and mobile variants for shell, providers, runs, settings, and leaderboard scope.
- Any exclusions are recorded with a concrete reason. “Not tested” is not acceptable for the core shell and truthful-scope checks.
- The final signoff note explicitly states whether provider-backed verification used the default gateway (
http://host.docker.internal:1234) and baseline model (qwen3.5-0.8b) or an overridden verified target.
