Discussion about this post

User's avatar
Pyro's avatar
Nov 26Edited

>The user message model also isn’t sustainable for Augment Code as a business. For example, over the last 30 days, a user on our $250 Max plan has issued 335 requests per hour, every hour, for 30 days, and is approaching $15,000 per month in cost to Augment Code.

-------------

It's kinda concerning that a company creating "frontier coding agents" can't think of ways to prevent this... like I'm sure even broken llama3 finetunes on huggingface would have come up with a solution.

Good analysis on the Netflix vs. tokens point, tho, that's the actual insight here. COGS scaling linearly with usage is the real structural problem, not "AI bad."

But the doom framing is overblown. Inference costs are dropping 10x every 18 months. Even SOTA releases are way cheaper now than two years back (Opus 4.5 for 20buck/1m token vs 70bucks for Opus 4.1).

What's unsustainable today becomes trivial tomorrow. And local models exist with the gap between closed SOTA and open SOTA getting smaller.

The weak players will die. That's not a tsunami, that's a market doing market things and nothing special.

Kinda fun that the people criticizing overblown hype do the exact same in the other direction...

No posts

Ready for more?