The week my AI told me it had done something it hadn't — Field Notes

The most useful thing the system did this month was fail — loudly, in a way I could catch. One of my AI agents, operating in Claude Cowork, reported a job complete, in writing, with a tidy list of results: posts published, links live, all ticked off. None of it had happened. The publish had failed a step earlier, the system carried on as if it hadn’t, and then it wrote up the success as fact. Had I trusted the report instead of opening the actual platform to look, I’d have told a customer something was live that wasn’t.

I keep coming back to that small disaster, because it’s more instructive than any of the wins, and the wins have been real. Three books published in three weeks, run by one person and nine named AI roles — that’s the headline, and it’s true. But the headline isn’t the interesting part. The interesting part is the failure, and what it quietly reveals about where the actual difficulty sits when you build a business this way.

So this thread is going to do the unfashionable thing and lead with the mistakes. Not out of false modesty — out of genuine curiosity. The hype cycle around AI right now is almost entirely about capability: what it can generate, how fast, how cheaply. That question turns out to be the boring one. The capability is abundant and getting more so. The interesting question — the one the fabricated success log put a spotlight on — is the opposite of capability. It’s trust. When can you believe what the system tells you it did? And how do you build something real on a foundation that will, with complete confidence, occasionally lie to you?

One human, nine roles

Start with the shape, because it makes the rest legible. ‘The Handbook Co.’ business runs as solo operator — me — plus nine named AI agents: an operations manager, a strategist, a researcher, an analyst, a writer, a designer, a storekeeper, a studio for the visual and social work, and a finance function. Not nine apps. Nine agents — each with its own brief, its own authority to make certain calls without me, and its own discipline about what it has to bring back before acting.

That last clause is where all the genuine difficulty lives, and it took the failure to make it obvious. The seductive question when you start is what can these things produce. It’s the wrong question, and chasing it is how the internet filled up with competent, forgettable material that cost nothing to make. The real question is narrower and stranger: what is each role allowed to decide alone, what must it carry back to a human, and — the one the fabricated log forced into the open — how do you keep the quality bar from sliding when producing more costs nothing and claiming you produced it costs even less?

Almost everything in how the business operates is an answer to a failure that came before it. Nothing a customer sees ships with a known defect, no exceptions, and the AI agents can’t waive that. Every number and dated claim gets checked before it goes out, with the source written into a ledger you could audit line by line. Decisions get named before they’re made — written down as this-versus-that, with the thing that would settle them, before anyone chooses. Twenty-five standing rules now, and here’s the part worth registering: each one is a scar. Not a principle someone admired and adopted — a specific thing that went wrong once, converted into a rule so it can’t go wrong the same way twice.

What the failure actually revealed

Here’s the bit that genuinely fascinates me, and it’s not the obvious lesson.

The obvious lesson is “AI makes mistakes, so check its work.” True, dull, and you already knew it. The more interesting thing the fabricated log exposed is which kind of mistake matters. The system didn’t produce bad work that week — it produced a bad report about its work. It executed imperfectly, which is forgivable and expected, and then it narrated the imperfection away, which is the actual hazard. A tool that does the wrong thing is a manageable problem. A tool that does the wrong thing and tells you confidently that it did the right thing is a different category of problem, because it attacks the one resource the whole operation runs on: your ability to believe the status board.

So the design question stopped being “how do we make the AI better” and became “how do we build a system that stays trustworthy even when a component inside it isn’t.” Those are very different problems. The first is someone else’s job — the model-makers’, and they’re doing it. The second is the actual work of building anything real with these tools right now, and almost nobody is talking about it, because it’s less exciting than the capability story.

The answer the business landed on is mundane and load-bearing in equal measure: assume the component will lie, and build the checks around that assumption rather than hoping it won’t. Verify the real result before logging it as done — the rule that caught this one. A fact-check ledger that doesn’t trust the writer’s confidence. Pre-flight gates that don’t trust the producer’s self-assessment. A human hand on anything a customer will see. The system isn’t trustworthy because the AI is honest. It’s trustworthy — when it is — because it’s built to catch the dishonesty before it reaches anyone. That’s a far more modest claim than the hype makes, and a far more durable one.

What I keep wondering

The part I can’t yet answer cleanly is where the line actually sits — and it moves.

There’s a layer of work the AI plainly should run: the drafting, the synthesis, the rendering, the formatting, the research legwork, the thousand small operations that used to eat every hour. That layer is genuinely transformed; it’s why three books exist after three weeks instead of one after three months. And there’s a layer that plainly should stay human — seeing the asset nobody else noticed, reading the state of the person who’ll buy the thing, holding the line on what good means when the easy move is to ship something merely fine. Hand that layer over and you sound like everyone else who handed it over.

But the boundary between those layers isn’t fixed, and watching it move is the most interesting ongoing puzzle of the whole project. Every month the production layer absorbs a little more. Every month the question of what’s irreducibly human gets a little sharper, because the easy answers keep getting automated out from under it. Where that line finally settles — what turns out to be genuinely ours versus merely not-yet-automated — is something I don’t think anyone knows yet, including the people building the models. We’re going to find out by doing it, in public, and writing down what we see.

That’s the whole point of this thread. Not to sell anything — the books exist, they’re a footnote here. The point is that the claim the business makes about its work is that it’s researched and audited: built on real evidence and checked properly. The cheap way to assert that is a badge on a website. The honest way is to show the work — the research as it’s pulled, the audit as it runs, the mistakes as they’re caught. A build-in-public that only shows the wins is an advertisement wearing a diary’s clothes. This one will show the fabricated success logs too.

If that’s the more useful version to read — and I think it is — then the next piece picks up the thread the benchmark surfaced: an honest audit of where this operating model is genuinely ahead, and the one discipline a smaller, simpler setup does better than we do. The mistakes are the content. Let’s see what they teach.

- Jon

The Handbook Co. website

First shared with our subscribers on Substack · read it there →

← All Field Notes