Self-hosted AI agents: run them on a server you own.

Self-hosting an AI agent means running the agent's runtime — the loop that calls the model, executes tools, and holds state — on infrastructure you control, not inside a vendor's cloud. You own the server, the secrets, and the audit log. This is the 2026 guide: why own the box, what a production agent actually needs, and the real options from a raw VPS to an agent-native OS.

Read the code (Apache-2.0)Deploy guide →

Why self-host instead of using a managed platform

Data sovereignty

Sensitive context — customer data, credentials, proprietary code — never crosses a third-party boundary. It stays on hardware you control, in the jurisdiction you choose.

Cost predictability

A flat server bill, not per-task credits or per-seat fees that scale with your success. Model it in the calculator.

No lock-in

Open-source means you can read, fork, and run the stack forever. If the vendor disappears, your agents keep running. The repo is the proof.

What a production agent actually needs

A Python script in tmux is a prototype, not production. Six things separate "it ran once" from "it runs unattended":

Persistent memoryState that survives restarts and idle gaps — not a session that evaporates when the process dies. osmoda persists sessions to disk atomically.

Crash recoveryAgents wedge: a model call hangs, a tool errors, the heartbeat stops. You need a watchdog that restarts before you notice — 6-second median on osmoda.

Audit trailEvery tool call, every action, logged immutably. For debugging today and compliance tomorrow. osmoda uses a SHA-256 hash-chained ledger.

Secrets vaultThe agent should request a signed action, never hold the raw API key or wallet. osmoda's KEYD vault does this — keys never enter the model's context.

Scheduled jobsCron that survives reboots and recovers if a run wedges mid-task. Built into the routines daemon.

Atomic rollbackA bad deploy must be one command to undo. NixOS generations make every config change reversible.

Assembling these on a raw VPS is weeks of glue you'll maintain forever. The alternative is an agent-native OS where they're the substrate — see the self-healing servers and audit ledger deep-dives.

Your self-hosting options, honestly compared

Raw VPS + scripts

Good: Cheapest floor; total control.

Cost: You build persistence, recovery, audit, secrets, and scheduling yourself — weeks of glue, and it breaks at 3am with no one watching.

Docker Compose

Good: Reproducible service definitions; familiar.

Cost: Containers don't give you crash recovery, an audit ledger, or a secrets model for agents — those are still your problem on top.

Kubernetes

Good: Battle-tested orchestration at scale.

Cost: Helm + Argo + Istio for a handful of agents is operational overkill; you still bolt on the agent-specific layer.

Agent-native OS (osmoda)

Good: The agent substrate ships as the OS: 10 daemons, 92 tools, audit ledger, vault, watchdog, atomic rollback. Apache-2.0.

Cost: Early-beta — honest about it on the production-readiness page. Single-host today; trust-tier enforcement is v1.4.

The osmoda approach: the agent runtime IS the operating system

osmoda is a NixOS distribution built so the things a production agent needs are the substrate, not add-ons. 10 system daemons, 92 typed tools, 20 pre-built skills, the SHA-256 audit ledger, the KEYD vault, watchdog recovery, and atomic rollback all ship in the base image. You bring your own LLM key (Anthropic, OpenAI) or run a local model; everything else is already wired.

Self-host it free under Apache-2.0 on any box you own, or use the managed tier from $29/month if you'd rather not run the infrastructure. Either way the code is public and the data is yours. It's early-beta and we're honest about the edges — read the production-readiness matrix before betting a critical workload on it.

Current release: v1.3.0. Source at github.com/bolivian-peru/os-moda. Browse the full architecture in the technical spec.

FAQ

What does it mean to self-host an AI agent?

Self-hosting an AI agent means running the agent's runtime — the loop that calls the model, executes tools, and holds state — on infrastructure you control, instead of inside a vendor's managed cloud. You own the server, the filesystem, the secrets, and the audit log; the agent runs as a persistent process you can SSH into. The model inference can still be a hosted API (Anthropic, OpenAI) via your own key, or a local model — what's 'self-hosted' is the agent and everything around it.

Why self-host AI agents instead of using a managed platform?

Three reasons: data sovereignty (sensitive context never crosses a third-party boundary), cost predictability (a flat server bill instead of per-task or per-seat metering that scales with success), and no lock-in (open-source means you can read, fork, and run the stack forever). The trade-off is that you take on the ops — though an agent-native OS automates most of it. Self-hosting suits anyone with privileged data, a workload that will grow, or a requirement to prove what the agent did.

What do you actually need to run an AI agent reliably on your own server?

More than a Python script in tmux. A production agent needs: persistent memory that survives restarts, automatic crash recovery (agents wedge), an audit trail of every action (for debugging and compliance), a secrets vault so the agent never holds raw API keys, scheduled-job support that survives reboots, and atomic rollback so a bad deploy is reversible. Assembling these on a raw VPS is weeks of glue; an agent-native OS ships them as the substrate.

Can you self-host AI agents for free?

Yes — osmoda is Apache-2.0 licensed, so you can clone the repo, run it on any server you own (a $5 VPS, a homelab box, your own hardware), and pay nothing for the platform. You only pay for the machine and your own LLM tokens. The managed tier ($29–$299/mo) exists if you'd rather not run the infrastructure yourself, but self-hosting the full stack is free and the code is public at https://github.com/bolivian-peru/os-moda.

Is self-hosting AI agents secure?

It can be more secure than managed — your data never leaves infrastructure you control, secrets live in a vault the agent can't read directly, and every action is logged to a tamper-evident ledger. But an agent with system access is still a serious responsibility: today osmoda's capability declarations are audited rather than kernel-enforced (bubblewrap sandboxing is on the v1.4 roadmap), so self-host with the same care you'd give any process that has root. See the production-readiness page for the honest security posture.

Own the box. Own the agents. Own the data.

Spawn a server →Self-host free →