<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0">
  <channel>
    <description>Rambling Rows</description>
    <image>
      <url>https://rrows.net/uploads/2026/rrows-icon-sq-144px.png</url>
      <title>Rambling Rows</title>
      <link>https://rrows.net/</link>
    </image>
    <title>Cost on Rambling Rows</title>
    <link>https://rrows.net/categories/cost/</link>
    
    <language>en</language>
    
    <lastBuildDate>Sun, 24 May 2026 11:12:41 +1000</lastBuildDate>
    <item>
      <title>What running AI agents actually costs</title>
      <link>https://rrows.net/2026/05/24/what-running-ai-agents-actually.html?utm_source=rss&amp;utm_medium=feed&amp;utm_campaign=rrows</link>
      <pubDate>Sun, 24 May 2026 11:12:41 +1000</pubDate>
      
      <guid isPermaLink="false">http://rrows.micro.blog/2026/05/24/what-running-ai-agents-actually.html</guid>
      <description>&lt;img src=&#34;https://cdn.uploads.micro.blog/202171/2026/productive-bot-600px.jpg&#34; width=&#34;600&#34; height=&#34;338&#34; alt=&#34;&#34;&gt;
&lt;p&gt;I wanted to know if my AI subscription was earning its keep. So I repriced 30 days of real usage at pay-per-token rates and compared it to what I actually pay.&lt;/p&gt;
&lt;p&gt;The answer: $96 USD equivalent in tokens consumed. My subscription costs $100 USD a month. That&amp;rsquo;s not a rounding error - that&amp;rsquo;s a subscription running at near-full utilisation.&lt;/p&gt;
&lt;p&gt;Not a prototype. Not a weekend experiment. A system I actually depend on - morning briefings, task management, research, document work, health tracking, portfolio analysis. The Autonomi, as I call it, runs continuously and does real work.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The plan I&amp;rsquo;m on&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href=&#34;https://support.claude.com/en/articles/11049741-what-is-the-max-plan&#34;&gt;Claude Max&lt;/a&gt; is a flat-rate subscription at $100 USD a month - billed in Australian dollars at the monthly exchange rate. It gives you five times the usage capacity of Pro, which matters once you start running agents at any real volume.&lt;/p&gt;
&lt;p&gt;Flat-rate sounds simple. The trap is that you lose visibility into where the cost actually goes. When there&amp;rsquo;s no per-session invoice, it&amp;rsquo;s easy to assume you&amp;rsquo;re well inside your limits and never check. I wanted the actual picture.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The breakdown is where it gets interesting.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;73% of the equivalent token spend is Opus 4.7 output. Not input. Output.&lt;/p&gt;
&lt;p&gt;This is the thing people miss when they estimate AI costs. In a chat interface, input and output are roughly balanced - you write a paragraph, you get a paragraph back. In an agentic workflow, the ratio inverts. The model is reasoning through multi-step tasks, generating tool calls, writing structured results, checking its own work, producing long-form outputs from short prompts. You send a few hundred tokens in. Thousands come back.&lt;/p&gt;
&lt;p&gt;The model is working. And working costs more than thinking out loud.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What the 5x headroom actually buys you&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The practical benefit of the Max plan isn&amp;rsquo;t that you pay less per token. It&amp;rsquo;s that you stop rationing Opus.&lt;/p&gt;
&lt;p&gt;On a standard Pro plan, heavy Opus usage burns your quota quickly. You start making decisions at the model selection screen - is this task worth Opus, or should I use Sonnet? That&amp;rsquo;s friction. It&amp;rsquo;s also the wrong question, because some tasks genuinely need the best model and others don&amp;rsquo;t, and you don&amp;rsquo;t always know which is which until you&amp;rsquo;re in them.&lt;/p&gt;
&lt;p&gt;With 5x capacity, I use Opus when the work calls for it and don&amp;rsquo;t think about it. Investment research, meeting prep, long-form analysis - Opus. Scheduling, structured data extraction, short-form drafts - Sonnet or Haiku. The model choice gets made on merit, not quota anxiety.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;ve only hit a hard limit once in recent memory - a two-hour pause after an unusually intensive session. That was a vibe coding run where I was pushing hard. One timeout in months of daily use is a reasonable ceiling.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;What the agents are actually doing&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;My system runs a morning briefing agent, a Todoist integration layer, a scheduling system and a session tracker, plus whatever I throw at it through Cowork across the day. In a typical week that includes data munging, investment research, health data analysis, document generation and correspondence.&lt;/p&gt;
&lt;p&gt;The expensive work isn&amp;rsquo;t the mechanical stuff - structured data in, formatted output out. It&amp;rsquo;s the analysis: synthesising across multiple sources, adapting to new constraints, producing work that requires judgment. Every time Opus produces a backtest summary, a meeting prep note or a long-form post, it generates thousands of tokens on the output side.&lt;/p&gt;
&lt;p&gt;I&amp;rsquo;ve already replaced one agent that was using the top-tier model unnecessarily - the morning briefing, which was hitting context limits and breaking. A deterministic Python script now handles it for almost zero cost. The output is cleaner. The agent was solving a problem it had partly created.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The honest number&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;$96 USD equivalent consumed against a $100 flat-rate plan. In periods when I&amp;rsquo;m vibe coding hard, I&amp;rsquo;d expect that number to be higher - the subscription absorbs it. That&amp;rsquo;s exactly what a flat-rate plan is for.&lt;/p&gt;
&lt;p&gt;$100 a month for a personal AI system that does the kind of work a part-time researcher and assistant would do is, by any reasonable measure, extraordinary value. The question isn&amp;rsquo;t whether it&amp;rsquo;s expensive. It&amp;rsquo;s whether you&amp;rsquo;d pay $100 a month for the same output from a human.&lt;/p&gt;
&lt;p&gt;The answer to that one is obvious.&lt;/p&gt;</description>
    </item>
    
  </channel>
</rss>
