Originally published on rohitraj.tech
Sakana AI shipped Sakana Fugu on June 22, 2026 — an orchestration model that routes each request across a swappable pool of frontier LLMs behind one OpenAI-compatible API, in two tiers (fugu and fugu-ultra-20260615), with benchmarks showing Fugu Ultra leading 10 of 11 tests. This is the builder read: what actually shipped, the API call you paste today, the benchmark table against Opus 4.8 / Gemini 3.1 Pro / GPT-5.5, where an orchestration model earns its keep, when its black-box routing disqualifies it, and how I would wrap it in production so a fallback-as-a-service still has a fallback.
Read the full version with code samples, diagrams, and architecture details: Sakana Fugu: The Orchestration Model That Commands Other LLMs (2026)
More engineering notes: rohitraj.tech/en/notes

