Context rot: why your agent forgets what you said at turn 40
Long-running model loops look great on the benchmark and quietly collapse in real loops. A small forensic look at where attention is actually spent.
Long-running field notes on building with LLMs in production — evals, inference systems, prompts that survive contact, and the failure modes nobody puts in the docs.
Long-running model loops look great on the benchmark and quietly collapse in real loops. A small forensic look at where attention is actually spent.
1 min #essays #gear
Eight days, one carry-on, two pairs of socks too many. A list, and what I would change.
1 min #essays #travel
The Soya Main Line in late winter, with a thermos of hojicha and one bad paperback.
1 min #essays #reading
A small argument against the kindle, with apologies to its inventors and to the airline that lost my last one.
1 min #essays #writing
A list, an apology to the proprietors of each, and a small map drawn from memory.
1 min #essays
No itinerary, one museum, a long walk, and dinner before sunset. A theory of unhurried days.