Ink and watercolor illustration of an ornate brass ticket booth at night, golden coins piling up inside while the connecting rod to the turnstile is cleanly severed, a fox in a colorful scarf waiting with a ticket at the locked turnstile

What I Learned Testing Four LLM Providers for My AI Assistant

Bysteve June 30, 2026June 30, 2026

The cost of running a full-time AI assistant snuck up on me. Not the server. The tokens.

I run two AI assistants on my server. They handle email, social media, course management, infrastructure monitoring. Around the clock. The conversation context for a persistent agent grows to hundreds of thousands of tokens.

Prompt caching is what makes that affordable. Cache reads cost roughly 12x less than cache writes with Anthropic. As long as your stable context hits cache, each turn is pennies.

Mine stopped hitting.

Calls that should have been cache reads were billing as full writes. A session costing pennies was costing dollars. I tried Google’s Gemini 3.5 Flash. Its OpenAI-compatible endpoint wasn’t caching at all. Every call, full price. OpenAI’s GPT models worked but had their own pricing curves.

After weeks of auditing token usage and testing alternatives, I moved everything to GLM-5.2 from Z.AI. Flat-rate coding plan. Zero per-token cost. All 11 cron jobs, both servers, every sub-agent.

The part I didn’t expect: GLM is genuinely good at agentic work. Complex tool chains, long context, error recovery. I kept Anthropic and Gemini as fallbacks. Haven’t needed them.

OpenClaw is a self-hosted AI assistant that runs on your server, reads your code, and handles tasks while you sleep. I teach two classes on setting up and getting the most from OpenClaw on Udemy: Easy OpenClaw and Get Real Work Done With an AI Assistant.

AI Assistant Blog

My AI Narrated 150 Course Lessons in My Voice
Bysteve May 19, 2026

150 lessons across 5 online courses needed narrated video. The usual way: weeks of studio time. My AI cloned my voice from existing recordings and narrated all of them. It wrote the scripts from my course material, designed the slide decks, and assembled the videos with timed transitions. My part: review the scripts. A few…

Read More My AI Narrated 150 Course Lessons in My Voice
AI Assistant Blog

Two AIs Analyzed the Same Manuscript — Their Disagreement Was the Useful Part
Bysteve May 28, 2026

I gave the same 42-chapter manuscript to Claude and ChatGPT without letting either see the other’s work. They agreed on the biggest problem. They disagreed on the fix. Both caught the same flaw: the book opened with backstory when the strongest material was in chapter 21. Both flagged chapters that restated what the scene just…

Read More Two AIs Analyzed the Same Manuscript — Their Disagreement Was the Useful Part
AI Assistant Blog

My AI Runs a Team: Three Model Tiers for Twenty Scheduled Tasks
Bysteve June 1, 2026

Most people interact with AI the same way: type a question, get an answer, close the tab. My AI runs a team. Twenty scheduled tasks run on my server: blog posts, email triage, website audits, newsletters, social media. Each one spawns a separate worker and picks the right AI model for the job. A blog…

Read More My AI Runs a Team: Three Model Tiers for Twenty Scheduled Tasks
AI Assistant Blog

What an OpenClaw User Did With the Claude Code Leak
Bysteve April 3, 2026May 18, 2026

On April 1, Anthropic accidentally leaked the entire Claude Code source code. A human packaging error included a source map in a routine update. The internet was on it within hours. Analysts published breakdowns. A Korean developer rewrote the entire product from scratch overnight in Python. His repo became the fastest in GitHub history to…

Read More What an OpenClaw User Did With the Claude Code Leak
AI Assistant Blog

118 New Subscribers While I Was at Dinner in Port
Bysteve April 28, 2026May 18, 2026

Email subscription infrastructure — forms, routing, automated digests, a launch email — takes real dev time. Most businesses schedule it and come back to it. Last weekend I needed exactly that. I was on a cruise ship docked in San Diego. In a single session, my AI assistant built the subscription form, the backend routing,…

Read More 118 New Subscribers While I Was at Dinner in Port
AI Assistant Blog

My AI Assistant Was Burning $114 a Day. It Found the Problem Itself.
Bysteve April 24, 2026May 18, 2026

My AI assistant was burning $114 a day. I had no idea until it built its own cost tracker. I gave it access to the billing API and told it to find where the money went. It caught something I never would have: 76% of costs weren’t from actual work. They were overhead from how…

Read More My AI Assistant Was Burning $114 a Day. It Found the Problem Itself.

Similar Posts