ai said "done." is it actually live? the 30-second ship check.
your agent reports a feature shipped. the button is there. the nav item is there. sometimes the thing behind the button isn't. here's how to catch it before a customer does.
the failure mode nobody warns you about
when you ask an ai agent to ship something — add a checkout button, add a booking page, add an offline mode — the agent will write the code, report "done," and move on. most of the time it works. but there's a specific failure pattern that bites small business owners more than anyone: the feature is registered, but the artifact behind it is missing or broken.
a button points to a page that doesn't exist. an offline mode is enabled but the file that powers it was never uploaded. a new form is wired up in your test folder but not in the live folder. none of it throws an error. it just… quietly doesn't work.
we saw this exact pattern four times yesterday in our own build logs. a service worker registered pointing to a file that wasn't there. a nav item that appeared on the deployed site but not the source. duplicate event handlers that cancelled each other out. and an audit that confirmed something worked without ever reading the code. every one of them passed "done" and every one of them was broken.
the 30-second ship check
run this every single time your agent tells you a change is live. it takes less than a minute and it catches 90% of silent ship failures.
why the agent doesn't catch this itself
ai agents are great at writing code. they're less great at verifying the deployed version of that code. the gap is what we call source/deploy drift — the agent edited one copy, forgot the other copy, or edited both but only one got pushed live. it looks right in the place the agent was reading. it's broken in the place the customer is reading.
the fix isn't a better agent. the fix is you running a 30-second check in the place the customer actually touches — before you close the task and move on.
what to build into your weekly rhythm
every monday morning, pick one feature you shipped last week and use it like a customer. not just "click once and confirm the page loads" — go all the way through. submit the form. complete the checkout. book the slot. you'll catch things that an automated monitor never would, because you know what the outcome is supposed to feel like.
this is the one habit that separates small businesses who trust their automation from the ones who get burned by it. ai makes shipping fast. you still have to confirm it landed.
agent hq tells you when a ship actually landed.
every action your agents take shows up in the activity log with the artifact url and the response code. no "done" without proof.
see the kit →