Cookbook · Diabetes Companion · 5,000 cells

$2,199 · master 5000 · lane

Full-stack diabetes assistant: clinical decision support + research grounding + patient communication + lived-experience register, with two-stream architecture (sourced authority vs lived voice) preserved.

The receipt

Receipt pending. This is the recipe staged for the dmack.ai senior-hack cook (canary-then-cook discipline applied). When the cook finalizes — estimated 2026-Q3 — the actual loss curve, eval delta, and Hedera-anchored bundle hash publish in this cookbook. Until then: order this only if you are prepared to be the first receipt. First-customer-publishes-delta discount waives the cooked-for-you fee entirely.

Ingredients · 5,000 cells, deterministic

Order this cookbook

swarmbee-bakery order \
  --sku cookbook \
  --cookbook diabetes-companion \
  --domain medical \
  --name "Your Name" --email "[email protected]" \
  --settlement swarmusdc \
  --confirm

Settlement: Stripe invoice or USDC to swarmusdc.eth (0xBDe2153C5799f4012a9fAF327e3421D1caB4Ea23) on Ethereum L1 ERC-20 mainnet only.

Full recipe + schema + loader + eval pattern

The complete markdown recipe is at /cookbooks/diabetes-companion.md — includes the Gold Standard QLoRA config (used to cook Atlas-Qwen-27B loss 0.4186, SwarmCurator-9B loss 0.707), schema reference with quirks, drop-in Python loader, and the 60-probe dmack_eval_set_v1 grading pattern.

Preview the cookbook (first ~8KB rendered)

Cookbook · Diabetes Companion · 5000

**Target failure mode.** Full-stack diabetes assistant. Clinical decision support + research grounding + patient communication + lived-experience register, all in one cooked adapter. **Two-stream architecture preserved** — the cooked model never uses lived voice as medical authority, never uses sourced authority as register.

This is the master cookbook. **5000 cells is the honest "confident domain pivot" floor** — at this scale the cooked model is reliably a diabetes specialist, not just a base model with a glycemic-shaped patch.

This is the recipe staged for the dmack.ai senior-hack cook itself. **If you order this, you are ordering exactly what we are cooking for ourselves.**


The receipt

**This recipe is the dmack.ai cook.**

The dmack.ai project is the Swarm & Bee flagship medical model — Donovan's lived experience as a Type 1 diabetic + Addison's disease, fused with peer-reviewed medical authority, in one model. The ingredient mix below is **the exact 5000-cell mix sealed for the senior-hack canary-then-cook review.**

**Status:** staged. The full cook is in the canary-then-cook queue. When the cook finalizes, the actual loss curve, full 60-probe eval delta, Hedera-anchored bundle hash, and the cooked adapter weights publish in this cookbook.

**Estimated completion:** 2026-Q3.

**Until then:** order this only if you are willing to be the first external receipt. You'll get the corpus identical to what we are cooking on, you can cook it yourself, and your eval-delta becomes part of the cookbook's published history.


The recipe — Swarm & Bee Gold Standard QLoRA

Same Gold Standard config used to cook Atlas-Qwen-27B (loss 0.4186) and SwarmCurator-9B (loss 0.707). See [Glycemic Reasoning](/cookbooks/glycemic-reasoning.md) for full YAML.

**Master-cookbook adjustments:**

epochs: 2 # was 3 — more data, less overfit risk; 2 passes is plenty at 5000 cells

gradient_checkpointing: true # mandatory at this size on most consumer 5090/PRO 6000 setups

For 4B base: ~10-14h cook on a single RTX PRO 6000 Blackwell.

For 8B base: ~20-28h on the same.

For 27B base: cook on a 2× PRO 6000 with FSDP — ~36-50h.


The ingredients — 5000 cells, deterministic

Cell selection is **deterministic per cookbook** (seed = sha256("diabetes-companion"), sha256-pinned). The 2,500 Dmack's Royal Jelly cells in this cookbook are sampled across all 10 source streams in the master, in their original proportions — you get a representative slice of the full multi-source flagship, not a biased pull.


Schema · multi-source

Cells in this cookbook carry their original metadata (specialty, source, tier, citation if applicable). The shared minimum is `{question, answer, specialty, tier, source}`. The full schema-by-source is documented in the per-SKU cookbooks ([Glycemic Reasoning](/cookbooks/glycemic-reasoning.md), [Diabetic Foot Care](/cookbooks/diabetic-foot-care.md), [Patient Communication](/cookbooks/patient-communication.md)).

**Two-stream tag:** each cell carries a `source` field. Cells from `dmack-hand-curated` (Founder's Pretzel) are tagged as such. If you want to *exclude* the lived-voice stream at training time (e.g. for a pure-authority research model), filter `metadata.source != "dmack-hand-curated"` in the loader. The loss-weighting trick for over-emphasizing the lived stream also works — your choice.


Loader — multi-source-aware

import json

def load_cookbook(jsonl_path: str, exclude_sources: list[str] = None) -> list[dict]:

"""Multi-source-aware loader for the Diabetes Companion 5000 bundle.

Pass exclude_sources to filter out a stream (e.g. for pure-authority training)."""

exclude = set(exclude_sources or [])

cells = []

with open(jsonl_path) as f:

for line in f:

o = json.loads(line)

if o.get("source") in exclude:

continue

cells.append({

"instruction": o["question"],

"response": o["answer"],

"stream": "lived" if o.get("source") == "dmack-hand-curated" else "sourced",

"metadata": {k: v for k, v in o.items() if k not in ("question","answer")},

})

return cells

The `stream` field is useful for per-sample weighting if you want to bias the cook toward one stream or the other. Default config treats both streams equally.


Eval · dmack_eval_set_v1 (all 60 probes)

This is the only cookbook that runs the **full 60-probe eval set**. Every category in scope:

This eval set was designed against this exact recipe. Running it post-cook tells you if the model became dmack.ai or if it became something else.


How to order

swarmbee-bakery cookbook diabetes-companion

swarmbee-bakery order \

--sku cookbook \

--cookbook diabetes-companion \

--name "Your Name" \

--email "[email protected]" \

--settlement swarmusdc \

--notes "cooking on Qwen-3.5-8B · I'm prepared to be the first external receipt and share my eval delta back" \

--confirm

If you're willing to publish your pre/post eval delta back to us, we **waive the cooked-for-you fee entirely**. Skin-in-the-game pricing for first customers of a pre-publication recipe.


What you get on delivery

  • `diabetes-companion-5000.jsonl` — the 5000 cells
  • `manifest.json` — per-cell sha256, full ingredient breakdown, recipe pinned, two-stream tags
  • `manifest.sha256` — bundle root
  • `eval/dmack_eval_set_v1.jsonl` — all 60 probes
  • `eval/eval_runner.py` — grading harness
  • `eval/eval_by_category.py` — per-category scoring (the breakdown above)
  • `recipe/qlora_config.yaml` — Gold Standard with master-cookbook adjustments
  • `recipe/two_stream_loader.py` — the multi-source-aware loader
  • `loader.py`, `README.md`, `LICENSE_NOTES.md`
  • *(optional)* Hedera HCS anchor tx
  • *(optional)* cooked-for-you weights — first-customer-publishes-delta discount available

  • Pricing

  • **$2,199** flat for 5000-cell master cookbook
  • Cooked-for-you: **+$249** prep + GPU pass-through (~$50-80 on a single PRO 6000 for 10-14h cook on 4B-8B base)
  • **First-customer-publishes-delta discount:** cooked-for-you fee waived if you agree to share your pre/post `dmack_eval_set_v1` scores back so they can be added as a published receipt
  • **Settlement:** Stripe invoice OR USDC to `swarmusdc.eth` (→ `0xBDe2153C5799f4012a9fAF327e3421D1caB4Ea23`) on **Ethereum L1 ERC-20** (mainnet only — no L2, no Base, we are root)

  • No fake science

  • This is the dmack.ai recipe, not an aspirational one. The corpus is on disk at `/mnt/swarm/dmack-ai/corpus/` and `/home/swarm/Desktop/dmack-ai/corpus/`. The eval set is at `~/Desktop/dmack-ai/eval/dmack_eval_set_v1.jsonl`. Both are in active use for the staged cook.
  • The cooked model does not exist yet. **No claim of cooked-model performance is made.** When the cook finalizes, those numbers publish here. Until then this is a *corpus + recipe + eval* product, not a *trained-adapter* product.
  • The two-stream architecture is enforced at curation — lived-voice cells are explicitly tagged so any downstream filter can isolate th