Chapter 5 — Quality & Prompt Tuning¶
You've run workflows, made lightweight variants, and used PatchWork's creative tools. By now you've also seen generations that aren't quite right — a face that doesn't match, lighting that's off, a hand pose that looks stiff. This chapter is about what to do about it.
The short answer: most of the time, the system handles quality automatically. The longer answer is what this chapter covers.
Chapter at a glance¶
flowchart TD
A[Candidate set<br/>arrives] --> B{All 4 candidates<br/>have the same issue?}
B -->|No| C[Pick the<br/>strongest take]
B -->|Yes| D[Prompt-tuning<br/>skill]
D --> E[Iteration loop:<br/>generate, judge, adjust]
E --> F[Dialed-in<br/>prompt]
F --> A
When to trust the system, when to step in, how to step in.
What you'll learn¶
- How the system handles quality automatically — what the auto-QA loop catches, what the pre-generation sanity check filters, when the Generation Runner reruns on your behalf
- When you actually need to step in — the difference between "let it retry" and "this needs the prompt-tuning skill"
- The prompt-tuning skill — what it is, when to invoke it, what to expect from the loop, what it costs
- Judging candidates — picking the strongest take from the 4 you're shown, knowing when to accept "good enough" vs. push for more
Before this chapter¶
Read Chapter 4 — PatchWork Creative Tools. You should have generated a few workflows by now and seen what good and not-so-good candidates look like in PatchWork's gallery.
Sections¶
- How quality is handled — what the system does automatically, so you know when you actually need to intervene
- Prompt tuning — the skill in practice: when to use it, how to invoke it, what comes back
- Judging candidates — picking the strongest take from a candidate set, knowing when to ship vs. iterate
Try this¶
Pick a candidate set in PatchWork — ideally one from the workflow you ran in Chapter 2.
Steps:
- Look at the 4 candidates side by side. Pick the strongest in 5 seconds, then write down (in plain text, one sentence) why you picked it (Judging candidates)
- Find one candidate set in the workflow where all 4 candidates share the same issue (lighting wrong, wardrobe drifted, pose stiff). If none have an all-4 issue, look harder — most workflows have at least one
- For that bottleneck scene, ask Claude to invoke the prompt-tuning skill: "Tune the prompt at
<path>. Reference is<reference-image>. The lighting keeps coming out too cool. Try 5 iterations." (Prompt tuning) - Watch the iteration loop. Read the candidates that come out at each step. Confirm the loop converged
If you hit:
- Candidates are bad but you can't articulate why → re-read Judging candidates, specifically the priority order (face → composition → wardrobe → pose → setting)
- Auto-QA flagged a candidate that looked fine to you → it probably caught something at zoom level you'd miss. Trust the system. See How quality is handled
- Prompt-tuning loop ran but the result is still wrong → the issue may not be prompt-fixable. Re-read the "When prompt-tuning isn't enough" section of Prompt tuning
- You're tempted to tune every scene, not just the bottleneck one → don't. See targeted investment in Judging candidates
- Garbled background text or signage is bothering you → ignore it. See "What NOT to look at" in Judging candidates
When you're ready¶
→ Next: Chapter 6 — Lvl 3-4 Variants. Now that you can judge a generation and reach for the prompt-tuning skill when you need to, you're ready for bigger variants — environment changes, camera changes, structural changes. These go through a testing phase, and the testing phase leans on what you learned here.