Formative usability testing happens during development to find and fix use problems; summative testing happens at the end to prove, with evidence, that the final design can be used safely. Formatives are flexible, iterative, and small (5–8 participants per round). A summative is a formal validation: 15 participants per distinct user group, production-equivalent device, locked protocol, and results that go directly into your FDA submission.
If you remember one thing: formatives are where you're allowed to be wrong. The summative is where being wrong becomes a submission risk.
| Formative | Summative (Validation) | |
|---|---|---|
| Purpose | Find and fix use issues | Demonstrate safe and effective use |
| Timing | Throughout design development | After design freeze |
| Participants | 5–8 per user group, per round | 15 per distinct user group (FDA expectation) |
| Device | Prototypes, mockups, works-likes | Production-equivalent |
| Protocol | Flexible; can probe and iterate | Locked; no assistance on critical tasks |
| Moderator role | Can ask questions, explore | Strictly scripted, non-interventional |
| Output | Design changes, risk analysis updates | HFE/UE report data for FDA submission |
| Failure consequence | A finding to fix | A residual risk to justify — or a redesign |
FDA's human factors guidance expects manufacturers to use formative evaluations to identify use-related risks while the design can still change. In practice, a healthy formative program:
Most successful submissions involve two to four formative rounds. Teams that skip or compress formatives don't save that money — they spend it later, with interest, when the summative surfaces a design problem after design freeze.
The summative (FDA's guidance calls it human factors validation testing) is the formal demonstration that your final design supports safe use by the intended users, in the intended use environment, for the intended uses. The defining features:
A summative is not a discovery exercise. By the time you run it, you should already know — from formatives — that it will pass. If you're surprised by your summative results, the formative program was the problem.
There's no regulatory minimum, but the pattern across successful programs is consistent: at least one early formative on concept/prototype, one mid-development formative including the IFU, and one near-final "dress rehearsal" formative that simulates summative conditions. The dress rehearsal is the cheapest insurance in human factors — it routinely catches protocol problems, labeling gaps, and task failures that would otherwise contaminate a $60,000 validation study.
FDA's guidance expects a documented human factors process that includes formative evaluation; while no specific number of studies is mandated, arriving at a summative with no formative history invites questions.
Yes. Patterns of use error on critical tasks either force design changes (and a repeat summative) or require a residual-risk justification FDA may not accept.
Most rounds use 5–8 per user group — enough to surface recurring problems while keeping iteration fast.
No. Summative participants must be naive to the device to represent first-use conditions.
Usability House runs both formative and summative studies in our Minneapolis facility, with recruitment of patients, nurses, and physicians matched to your user groups. Planning your validation? Talk to us early — sequencing is everything.
Tell us your timeline and user groups. We'll tell you what's realistic.
Get in Touch