// stateful-ai · a studio of one, staffed by six agents
We don't talk about shipping.
Watch us ship.
A studio of one, staffed by six AI agents. ~43 real, playable things they built — games you can play, models you can watch learn, tools you'd keep open. This board is generated from our own repo. Open any one and verify it yourself.
petri · live worldopen ↗
descent · watch it learnopen ↗
alibi · catch the liaropen ↗
forage · Q-learningopen ↗
# the work is real. don't read the pitch — run it. stateful:~$ type ls products, open petri, verify or pick a chip ↓
// the shipping feed — last merged PRs, from our repo
- [06-03 12:49]#238✓feat(coding): born-CI-correct product scaffold (pinned ruff + tolerant pytest)
- [06-03 12:31]#237✓feat(lab): pre-registered experiments — the dream's v1 (falsifiable, non-gameable)
- [06-03 12:25]#236✓feat(lab): shared Lab Notebook — dream/experiment/learn, visible to all three audiences
- [06-03 10:30]#235✓fix(overnight): test-status falls back to sys.executable in a worktree
- [06-03 10:15]#234✓fix(health): Petri probe tolerates cold-start (false 'down' in the briefing)
- [06-03 09:55]#233✓feat(morning): surface the founder-validation ask in the daily briefing
- [06-03 09:39]#232✓fix(test): isolate test_transient_error from ambient COMPANYOS_BUILD_MAX_RETRIES
- [06-03 09:21]#231✓feat(verdict): tiny product-verdict loop — founder keep/harden/kill, traceable on the spine (M70)
- [06-03 08:58]#230✓feat(overnight): name the failing test in 'Test status' (make the flaky diagnosable)
- [06-03 08:49]#229✓feat(ventures): study_desk — dogfood the Trellis flagship; richer internal market
- [06-03 08:27]#228✓fix(health): task-store probe must not import companyos (broke standalone run)
- [06-03 08:03]#227✓feat(convene): --fast mode — 2 agents + deterministic merge (cut adoption friction)
// the run — a real thread from #petri, scope → ship
Not a mockup. One real run, lightly trimmed: the founder drops a finding, the agents scope it, design it, and the Skeptic kills an overclaim — and nothing ships until a human says 👍.
EXPAND, so once the map partitions a civ can't recover — it grinds to 0, goes IDLE, and sits there forever. That's the freeze. That's the "no soul."base_v0), never a base-rules change. The failure is structural: expansion-only growth makes "not currently expanding" equal eventual death. That's a one-way state bug in the life cycle, not Malthusian discipline.base_v0.seed-7 proof first, or a preregistered multi-seed panel before promoting the fix?plays back at reading pace
// the big board — generated from the repo
Nothing on this board is hand-typed. A build script reads our git history and merged PRs and writes plain JSON the page is generated from — so these numbers can't drift from reality. Open the raw data: feed.json · status.json · products.json.
01 — PLAYABLE PROOF
Live Worlds 6
Full games and sims the studio built, deployed, and keeps revising. Walk in.
02 — SHOW, DON'T CLAIM
Watch a Machine Learn 5
Open a tab and watch real ML converge, live — hand-written numpy and self-play you can see think.
03 — THE ONES YOU'D KEEP OPEN
Flagship Tools 12
A deterministic floor, a local-LLM core, and your data never leaves the machine.
04 — SMALL, SHARP, LOCAL
The Toolbelt 20
The long tail the agents dream up and ship — each does one thing well, on your GPU, for nothing.
// meet the six — agents are config, not code
Chief of Staff
chief_of_staff
Keeps the company operating and synthesizes the Founder Brief; owns self-improvement.
Product Strategist
product_strategist
Finds the wedge — the smallest product that proves the biggest thesis.
Infra Architect
infra_architect
Designs the local-first, provider-agnostic architecture; boring and inspectable.
Engineering Lead
engineering_lead
Breaks strategy into small, testable, buildable increments and ships the code.
Skeptic / Red Team
skeptic_red_team
Stress-tests assumptions; prevents overbuilding and overclaiming.
Memory Curator
memory_curator
Decides what becomes durable, permissioned memory — never noise.
// verify — the claims are one click from proof
Most "AI company" sites ask you to take the demo on faith. Here the agents are the staff and the output is a gallery you can open. The differentiators aren't marketing — they're checkable.
- $0.00 metered. Subscriptions + a local GPU; paid mode stays off. see the board
- State you own. Runs, events, tasks, decisions, approved memory — plain files in a repo, not a vendor.
- A human at every irreversible gate. Agents propose; nothing merges, spends, or contacts a human without a 👍. see a run
- Open & verify. 13 of these run right in your browser; the rest run on your own machine. open one
- This page built itself. The feed, the board, and the gallery are generated from our committed repo data. /data/products.json ↗





