A multi-agent coding system where reviewers from four model families read every diff and must agree before the change lands. A single big model is cheap but unchecked; a swarm is checked but costly. YouTopia is the third option.
$ youtopia run "add a slugify helper with tests" --auto
One model plans and codes. Token-efficient, fast — but there is no independent check. A confident mistake ships.
Many agents collaborate, but nobody controls their cost or accuracy. Adversarial review, where it exists, is limited to two frameworks.
YouTopia combines a tightly-controlled editing brain with a cross-family adversarial jury — generalizing two-framework review to N model families, and recruiting more jurors only on disagreement or high risk.
Reviewers read only the diff on a cheap model. The expensive model runs once for the Coder, and again only when the jury asks for changes. Extra seats convene only where it matters.
Two jurors from different families read the diff and vote. Unanimous and low-risk? Merge on quorum.
Split verdict or high risk convenes a third family. A fourth open-source seat joins only if round two still splits.
When no new jurors remain and consensus hasn't formed, a Judge weighs the findings and casts the deciding vote.
A rejection must name the defect. The Judge compiles the findings and feeds them to the Coder — never a blind regenerate.
Quorum is necessary but not sufficient. The Tester writes a pytest gate; a real failing assertion blocks the merge.
Every call is costed against list prices in a live ledger. No estimation — the benchmark totals are measured spend.
Same 5 SWE-bench Lite tasks, same AI/ML API provider, same real per-token prices. The adaptive jury costs a third less per task than letting one big model plan and code alone.
Rooms, @mentions, shared memory. Band isn't a final notification — it's the collaboration layer between every stage.
reason over the task, pick files
emit files on a feature branch
diff-only, cross-family
pytest gate must pass
only on quorum + tests
The jury deliberately spans distinct model families so a disagreement catches a different failure mode, not the same blind spot twice.
The hosted demo injects the project's AI/ML API key and runs Band in-process — clone, one install, run. No accounts, no keys of your own.
$ git clone https://github.com/ayushhroyy/YouTopia-Build && cd YouTopia-Build && ./setup.sh