Illustration of a fox in judge's robes standing before a courtroom mirror with its reflection appearing more confident and holding a gavel

Why You Shouldn’t Ask Your AI to Audit Its Own Instructions

Bysteve May 5, 2026May 18, 2026

A viral post this week says you should ask Claude to audit its own instructions and delete whatever it flags as unnecessary.

I tried the opposite approach. Here’s why it works better.

I’ve been running an AI assistant full-time for 3 months. It manages my email, runs my marketing, posts to social media, sends my newsletter, manages ad campaigns. It has about 50 rules in its instruction files.

Every one of those rules exists because something went wrong.

“Never add contacts to list 13” is there because we accidentally emailed 2,800 people a welcome message they’d already received. “Always verify links before sending” is there because a broken URL went out to my entire Udemy student base. “Never let the AI calculate dates” is there because it got the math wrong in 3 different cron jobs in one week.

The viral advice says: paste your rules into Claude and ask it which ones to cut. Claude will happily tell you to remove half of them. It’ll say things like “I already do this by default” and “this is redundant with your other instructions.”

The problem? Anthropic’s own engineering team has found that models are unreliable judges of their own behavior. My AI will confidently say “I already verify links by default” — and then not verify links. The rules aren’t there because the AI needs a reminder. They’re there because without the rule, it fails silently and I find out when a customer complains.

Here’s what actually works for keeping your AI instructions clean:

Run a separate model as the auditor. Don’t ask Claude to grade Claude’s homework. Spawn a second instance with explicit criteria: find contradictions, find duplicates, find rules that fight each other. Two perspectives catch what one misses.

Review rules when you upgrade models. A rule written for Sonnet 3.5 might be unnecessary on Opus 4. But a human has to make that call, not the model that wants to believe it doesn’t need guardrails.

Test before you cut. Delete a rule, run your 3 most common tasks, and check the output carefully.

Keep a log of why each rule exists. When you know the failure that created a rule, you can judge whether the failure is still possible. When you don’t, you’re guessing.

The viral post’s core insight — that instruction bloat is real and degrades output — is correct. But “ask the AI to audit itself” is the blind leading the blind. Your instruction files are scar tissue from real failures. Treat them with respect.

OpenClaw is a self-hosted AI assistant that runs on your own server 24/7. It keeps its own memory, runs scheduled tasks, and learns your workflows over time.

I teach a class on setting up and getting the most from OpenClaw — details at themeperks.com/openclaw-course/.

AI Assistant Blog

My AI Assistant Was Burning $114 a Day. It Found the Problem Itself.
Bysteve April 24, 2026May 18, 2026

My AI assistant was burning $114 a day. I had no idea until it built its own cost tracker. I gave it access to the billing API and told it to find where the money went. It caught something I never would have: 76% of costs weren’t from actual work. They were overhead from how…

Read More My AI Assistant Was Burning $114 a Day. It Found the Problem Itself.
AI Assistant Blog

My Business Built a Nervous System One Workflow at a Time
Bysteve May 17, 2026May 18, 2026

Most business owners start their morning checking a dozen apps: email, social media, analytics. Half an hour gone before real work begins. I start mine by reading a single message. Every morning at 10 AM, my AI assistant delivers a briefing to my phone. Industry news filtered for relevance. Flagged emails needing decisions. Task reminders….

Read More My Business Built a Nervous System One Workflow at a Time
AI Assistant Blog

My AI Assistant Runs 20 Tasks a Day Without Being Asked
Bysteve May 15, 2026May 18, 2026

Last week my website went down in the middle of the night. By morning, my AI assistant had already rebooted it, added more memory to prevent a repeat, and set up monitoring that checks every five minutes. I found out from its daily status report. Which is one of about 20 tasks it runs without…

Read More My AI Assistant Runs 20 Tasks a Day Without Being Asked
AI Assistant Blog

Amazon’s New AI Agents Cost What? Here’s the $20 Alternative
Bysteve April 29, 2026May 18, 2026

Amazon announced yesterday that OpenAI’s models, Codex, and Managed Agents are coming to AWS Bedrock. The pitch: GPT-5.5 running inside your existing AWS environment, with enterprise security, governance, and billing already wired up. Plus pre-built agents that can handle multi-step workflows. Sounds great until you look at what it takes. Bedrock Managed Agents require API…

Read More Amazon’s New AI Agents Cost What? Here’s the $20 Alternative
AI Assistant Blog

22 Books Were Invisible on Amazon. My AI Fixed That in One Session.
Bysteve May 7, 2026May 18, 2026

My 29 books have been on Amazon for years. Until last week, 7 of them had ads. The other 22 were invisible. In one session, my AI assistant built 9 new ad campaigns — every remaining writing book, each with custom keyword research and negative keyword lists built from scratch. The travel writing campaign got…

Read More 22 Books Were Invisible on Amazon. My AI Fixed That in One Session.
AI Assistant Blog

Where My AI Costs Actually Come From
Bysteve May 20, 2026

Everyone assumes the expensive part of running an AI is “thinking hard.” I pulled my actual cost data and the breakdown surprised me. Context management—the AI organizing its working memory—is 75% of the bill on heavy days. All output including reasoning: 3.4%. Local work (voice synthesis, audio processing): $6/day. Deep collaborative conversation: $100/day. Conversation length…

Read More Where My AI Costs Actually Come From

Similar Posts