Harness engineering — tame nondeterminism

Workflows, checks, and evals to keep LLM outputs within bounds.

Why harnesses

LLMs are not deterministic like classic software. Outputs can drift. A harness is the surrounding system: prompts, tools, validation, retries, human review, monitoring, regression sets.

Start small

Add an acceptance checklist (format, fields, forbidden content).
Keep good/bad examples as you iterate.

OpenAI on harness engineering

Harness engineering — tame nondeterminism

Why harnesses

Start small

Read more