Engineering NotesThe bug that didn't stay fixed
The Claude Bridge wouldn't bind to port 5690 after a Monday reboot. The fix took twelve minutes. Four days later the same bug came back, wit…
Read the issue →We follow the strands that actually move the field. Each issue picks one and goes to the bottom of it — papers, code, prices, politics, and the people who built it.
Capability evals, scaling laws, and the strange weather of new architectures. We compare claims to traces, not benchmarks.
Workflows that actually finish. Planner-executor splits, tool budgets, recovery, and why most agent demos quietly fail at step three.
Tokens cost money. Latency costs users. We profile what production really pays, and how routing, caching, and distillation move the needle.
Red-team transcripts, refusal behavior, jailbreak archeology, and what the safety teams actually argue about behind the papers.
Long-form interviews with people shipping the systems. Less personal-brand. More how the on-call rotation looks at 3 a.m.
Essays that step back: what counts as understanding, when does scale plateau, and which old ideas just got new substrate.
Engineering NotesThe Claude Bridge wouldn't bind to port 5690 after a Monday reboot. The fix took twelve minutes. Four days later the same bug came back, wit…
Read the issue →
AI & AutomationI built an automated quality check for my own writing pipeline. Then it rejected the single best post the system produced. Four versions of …
Read →
AI & AutomationThe 70-line endpoint that connects FRED to my distillfin pipeline, what DGS10 actually looked like the first time it came back, and why 'dat…
Read →
AI & AutomationA walkthrough of the local Node bridge that lets my n8n workflows call Claude — the four flags that keep the subprocess from deadlocking, th…
Read →
AI & AutomationA first-person postmortem of an indie AI content automation stack that almost killed its own AdSense application. Three concrete failures, o…
Read →Most AI writing is a press release with adjectives. We try to do the opposite — slowly, with our names on it.
If we cite a result, we read the methodology section. If the methodology won't survive a careful read, we say so — and we name the figure.
Marketing copy doesn't ship. We test on our own eval suite, in our own runtime, and publish the prompts we used so you can break them too.
If a source can't go on record, it's context for us — not a story for you. We've left scoops on the table and we'll do it again.
Bottom of every issue: who pays us, who pays the people we cover, what we own. If we got beta access, we say what week.
Every issue has a corrections box. Every quarter has a public retro. The thing we got wrong stays on its own page.
We don't run launch coverage. We run six-week follow-ups, when the demos have met production and the receipts are in.