Cookbook · Patient Communication · 3000
**Target failure mode.** Medical-jargon-to-plain-language rewrite, consent explanation, triage script generation — while maintaining clinical accuracy. The model should pass a Flesch reading-ease check **and** still be medically defensible.
3000 cells is the honest floor where the patient-register actually persists across the model's output distribution. Below this, drift creeps back in by epoch 2.
The receipt
**Plain language doesn't mean medically wrong.** The headline claim of this cookbook is that maintained-accuracy at patient register is achievable when you train on a corpus of *already-verified plain-language clinical content* — not on synthetic "rewrite this jargon" pairs that drift over time.
**The receipt comes from the source materials, not from a cooked model:**
**Cookbook-specific receipt:** *pending.* No model has been cooked on this exact 3000-cell ingredient mix yet. First paying customer's pre/post `founder-voice` eval delta publishes here.
The recipe — Swarm & Bee Gold Standard QLoRA
Same Gold Standard config. See [Glycemic Reasoning](/cookbooks/glycemic-reasoning.md) for full YAML.
**Patient-communication-specific tweak:** if you want the cooked model to *prefer* shorter responses, set `max_new_tokens` in your eval generation config to `256` and the model will learn the brevity register implicitly through the corpus distribution. NHS pages average ~180 tokens per Q/A. Don't fight it.
The ingredients — 3000 cells, deterministic
NHS dominant at 1,500 cells gives the cooked model a UK-leaning register. **If you need US-dominant**, swap proportions (1,500 Federal Glaze, 1,000 Global Crumpet) — note in `--notes` and we adjust before assembly.
Schema · what a Global Crumpet cell looks like
{
"id": "intl-diabetesau-facts-about-diabetes-www-diabetesaustralia-com-au-a-q1",
"specialty": "diabetic-companion",
"tier": "royal-jelly",
"source": "DiabetesAU",
"source_title": "Diabetes Australia",
"url": "https://www.diabetesaustralia.com.au/about-diabetes/",
"page_title": "Diabetes Australia",
"section_heading": "Facts about diabetes",
"license_note": "Diabetes Australia, open patient education",
"scraped_at": "2026-05-14",
"citation": "www.diabetesaustralia.com.au/about-diabetes/",
"question": "According to Diabetes Australia, on the topic of \"facts about diabetes\"...",
"answer": "Diabetes is the epidemic of the 21st century and the biggest challenge confronting..."
}
Every cell carries the source URL and a citation string. The cooked model can emit `(source: NHS / National Diabetes Service)` style attributions in its responses — the corpus models the pattern.
Loader
Identical to other cookbooks. The `source`, `url`, and `citation` metadata fields are particularly useful here — train a citation-emitting model by including them in your prompt template, or strip with `del metadata['url']` if you want pure register-only training.
Eval · dmack_eval_set_v1 focus
The 20 `founder-voice` probes are the highest-stakes for this cookbook — failing them means the cooked model lost the register, which defeats the purpose.
How to order
swarmbee-bakery cookbook patient-communication
swarmbee-bakery order \
--sku cookbook \
--cookbook patient-communication \
--name "Your Name" \
--email "[email protected]" \
--settlement stripe \
--notes "cooking for patient-portal after-visit summaries · US register dominant please" \
--confirm
What you get on delivery
Same shape as other cookbooks. Includes the additional `register_eval_pack.jsonl` — 30 patient-portal prompts hand-curated specifically for this cookbook to test register hold over 256-token generations.