Test Results
After a test run completes, Swarm produces several layers of analysis.UX Score
Each persona assigns a UX score from 0-100 based on their experience. The score reflects:- How easily they accomplished (or failed to accomplish) the goal
- Friction points encountered along the way
- Clarity of the interface and copy
- Error handling and recovery
- Overall satisfaction
| Score Range | Interpretation |
|---|---|
| 90-100 | Excellent — minimal friction |
| 70-89 | Good — minor issues found |
| 50-69 | Needs work — notable friction points |
| Below 50 | Poor — significant usability problems |
Per-persona results
Each persona’s run includes:- Step-by-step walkthrough — every action taken, page visited, and form filled
- Screenshots — captured at each step for visual reference
- Thoughts and reactions — what the persona was thinking at each step
- Status — whether they completed the goal, got stuck, or encountered an error
- Individual UX score — their personal rating of the experience
Synthesis report
The synthesis report aggregates findings across all personas:- Executive summary — a high-level overview of the test results
- Common friction points — issues encountered by multiple personas, ranked by severity
- Positive highlights — things that worked well across the board
- Tickets — structured issue descriptions ready for your issue tracker
Tickets
Each ticket includes:- A descriptive title
- Steps to reproduce
- Which personas encountered the issue
- Suggested severity level
Judge verdict
An independent AI judge reviews the test evidence and provides:- Verdict — pass, fail, or partial
- Confidence level — how certain the judge is in its assessment
- Evidence blocks — specific observations supporting the verdict
- Recommended actions — prioritized list of improvements
Viewing results
Results are available in two places:- CLI — streamed in real time during the test, with a summary at completion
- Dashboard — full detailed view at the live dashboard link shown when the test starts
