Harness engineering — tame nondeterminism
Workflows, checks, and evals to keep LLM outputs within bounds.
Why harnesses
LLMs are not deterministic like classic software. Outputs can drift. A harness is the surrounding system: prompts, tools, validation, retries, human review, monitoring, regression sets.
Start small
- Add an acceptance checklist (format, fields, forbidden content).
- Keep good/bad examples as you iterate.