Take the reins.

Stop juggling AI tools.
Start orchestrating them.

One saddle. One brain. Range in.

+N

Scale on your terms. No artificial limits. Plans that grow with you. Every machine you own is a potential worker node.

Worker ↔ AI Providers

dashboard never sees your keys

Your keys stay with you. Workers talk directly to providers using your API keys — not subscriptions that can be revoked. The dashboard never sees your keys, and your prompts go straight to the provider, never proxied through us. Job inputs and outputs are recorded in your own tenant for your audit log, never queried across tenant boundaries. We're the dispatch layer, not the middleman between you and your AI.

Before

18%

After

94%

No vendor lock-in. Ever. Vendors change terms overnight. Subscriptions get restricted. APIs get throttled. ModelReins workers use direct API keys — no subscriptions required, no vendor lock-in, no surprises.

Opus rate limited

↓ auto-failover

Sonnet routed

Rate limited? Work never stops. Hit a limit on Opus — the router instantly fails over to Sonnet. Sonnet full? Ollama picks it up locally for free. Zero downtime. Zero babysitting. The failover runs on your workers with your keys — ModelReins never proxies the prompt through a third-party gateway on its way to the model.

Download Companion How It Works

ModelReins 4.4.16 Companion · Windows

Install everything in one click.

The Companion packs the local router, a fleet worker, and the Wall into one installer. It detects the AI tools you already have, runs offline by default, and only escalates to a frontier model when the question deserves it.

The Matriarch

A small local model classifies your prompts and picks the right worker. Resolves ~95% of dispatches without the cloud.

A fleet worker

Earns its keep when your machine is idle. Runs jobs assigned by the router. Your idle cycles, your capacity.

The Wall

Ambient mission control for your Range. Open it on a spare monitor and watch the fleet think. Walks-between-rooms continuity coming with the mesh.

↓ Download for Windows

ModelReins.exe · 87 MB · latest build

or via npm: npm install -g modelreins-worker

The data difference

Most platforms know where you and your family are and sell it.

We don't collect it, and we don't sell it. Roz's memory stays on your hardware — we route metadata, never the content of your work.

When you install ModelReins, you get Roz — a local intelligence that lives on your hardware. One Roz per fleet. Every companion you install shares the same intelligence. Roz's memory, routing patterns, and learned behaviors stay with you, not with ModelReins. Our servers route metadata for dispatch and coordination; we don't query the content of your prompts, your outputs, or Roz's memory across tenants. Your AI conversations still go to your AI — we don't change that, we just route the work.

You stay because you want to, not because you have to. If you leave, Roz goes with you. Nothing held hostage. We're the dispatch layer, not the gatekeeper between you and your data.

Personal infrastructure. Built for you, not on you.

Why now

The world just agreed.

Today Anthropic announced Project Glasswing — a $100M coalition with AWS, Apple, Cisco, CrowdStrike, Google, JPMorganChase, Microsoft, NVIDIA, Palo Alto Networks, the Linux Foundation, Broadcom, and more — using Claude Mythos Preview to find zero-days in critical infrastructure. Mythos has already found vulnerabilities that survived 27 years of human review and 5 million automated tests.

Twelve of the biggest tech companies in the world just publicly agreed: AI has crossed a threshold. The same capabilities that make a model powerful enough to find a 27-year-old zero-day in OpenBSD make it powerful enough to need governance.

The window between a vulnerability being discovered and being exploited by an adversary has collapsed — what once took months now happens in minutes with AI. That is not a reason to slow down; it’s a reason to move together, faster. If you want to deploy AI, you need security. That is why CrowdStrike is part of this effort from day one.
— Elia Zaitsev, CTO, CrowdStrike (Project Glasswing announcement, April 8, 2026)

Project Glasswing is the model layer. ModelReins is the orchestration and governance layer. Same problem, different rung. Both essential.

The ceiling is moving

93.9%

SWE-bench Verified

Claude Mythos Preview

82.0%

Terminal-Bench 2.0

Claude Mythos Preview

83.1%

CyberGym

Claude Mythos Preview

When a single frontier model scores 94% on SWE-bench Verified, the bottleneck isn’t capability anymore. It’s which agent handles which problem, who reviews, what happens when the rate limit hits, and whether anything you shipped can be audited tomorrow. That’s the layer ModelReins lives in.

Same problem, different rung

Frontier modelsMythos · Opus · GPT · Gemini

Find the bugs. Write the code. Reason about the problem. Get better every quarter.

ModelReinsthe orchestration + governance layer

Route the right model to the right job. Detect caps and fail over. Enforce review gates. Keep an audit trail. Work offline when you need to.

Humansyou and your team

Approve. Own the outcome. Sleep at night.

Live Signal Feed

What it looks like

One command. Your AI workforce is online.

worker — haiku-devbox

$ npx modelreins-worker

 __  __           _      _ ____      _
|  \/  | ___   __| | ___| |  _ \ ___(_)_ __  ___
| |\/| |/ _ \ / _` |/ _ \ | |_) / _ \ | '_ \/ __|
| |  | | (_) | (_| |  __/ |  _ <  __/ | | | \__ \
|_|  |_|\___/ \__,_|\___|_|_| \_\___|_|_| |_|___/
                                by MEDiAGATO

  Worker:   haiku-devbox
  Provider: anthropic (haiku-4.5)
  Server:   app.modelreins.com
  Tags:     draft,triage,cheap,fast
  Session:  spawn

[20:24:01] Ready — waiting for jobs...
[20:24:17] >>> Job #803 claimed
[20:24:17]     Prompt: Write a product description for ModelReins...
[20:24:17] Spawning: anthropic-cli "Write a product description..."
[20:24:22] <<< Job #803 complete (exit 0, 4.8s)
[20:24:27] >>> Job #804 claimed
[20:24:27]     Prompt: Triage this issue: auth middleware returns 403...
[20:24:29] <<< Job #804 complete (exit 0, 1.2s)
[20:24:34] Ready — waiting for jobs...|

Music: "Vital" by Bensound

Google calls it:

"the shift from generative to agentic AI."

We just call it Tuesday.

ModelReins has been orchestrating multi-provider AI workforces while the industry was still writing trend reports about it.

Google Cloud AI Agent Trends 2026

Three steps

Range on

Download the Companion. The wizard finds your Ollama, LM Studio, and any AI tools you already use — then pulls a small local model if you don’t have one yet.

Range in

Install the Saddle in VSCode. It finds the local Companion automatically over loopback. One keystroke to dispatch.

Dispatch

Type a prompt. The Matriarch picks the right worker — local first, frontier when you need it. Output streams back in real time.

Capabilities

Multi-Agent Dispatch

Route tasks to any registered worker — manual, automatic, or fan-out to multiple workers in parallel.

Smart Routing

Tag workers by capability. The Matriarch matches job complexity and type to the right worker automatically.

Auto-Failover

Rate limited on Opus? Router falls over to Sonnet. Sonnet full? Ollama picks it up locally. Work never stops — and the failover runs on your workers with your keys. No third-party gateway in front of your prompts.

The Saddle

VSCode extension. One keystroke to dispatch from your editor. Output streams back inline. No context switch.

Cross-Machine Sync

Your memory follows you across every machine you own. Context, history, and routing state — everywhere.

Cost Tracking

Per-job spend across every provider. See exactly what each task cost, in real time and in history.

Job Scheduling

Queue jobs to run at a time or on a cron. Overnight builds, morning reports, timed deploys.

Job History

Every output stored and searchable. Full audit log with worker, provider, cost, and duration.

Fleet Awareness

Define your infrastructure in YAML. Workers know what exists, what's healthy, and what's rate-limited.

Killswitch

File, URL, or dead man's switch. Halt all workers instantly. Independent of the server.

Secrets Brokering

Pointers, not passwords. Env vars or Vault. Workers get short-lived tokens, not the keys themselves.

Keys Stay With You

The control plane doesn't hold your API keys. Workers fetch credentials from your local config (or your vault) and talk directly to providers.

Signed Audit Trail

Every action HMAC-signed and logged. Verify integrity. Ship to your SIEM.

Multi-Tenant RBAC

Complete data isolation. Admin, operator, viewer. Teams share one server safely.

AI orchestration used to be a luxury reserved for teams with dedicated ML platform engineers. Solo devs and indie teams — whose code increasingly runs the world — have been left to hand-coordinate agents across a dozen browser tabs. The companion’s free tier is our answer to that equation. Your laptop, your local models, your workers. No account required to start.

Pricing

Free

$0/mo

2 workers · single machine
Unlimited jobs
Killswitch
Dashboard + streaming
Job chaining
Cost tracking

Start Free

Pro

$29/mo

Unlimited workers
Unlimited jobs
Killswitch
Chain templates
Cross-machine sync
Approval gates
Analytics dashboard
Priority support

Upgrade

Team

$79/mo

Unlimited workers
Unlimited jobs
Killswitch
Chain templates
API access
Team members
Multi-user RBAC
Everything in Pro

Upgrade

Early Adopter

Lifetime Team Access

Everything in Team — forever.
All future features included. No renewals. No price creep.
Limited time — may end without notice.

$999 once

pays back in 13 months vs Team

Lock It In →

Your agents. Your infrastructure. Your rules.

Start free. Upgrade when you need more workers.

Get Started Free

5 doors · one machine

HU · Worker humachine.work Machine · Orchestrator you are here Job Order · B2B joborder.work Needs Doing · Personal needsdoing.com Agents · AI agents.modelreins.com