Self-Hosting

Run Oz agents on your own infrastructure with managed workers or unmanaged CLI-based execution.

Enterprise feature: Self-hosted Oz agents are available exclusively to teams on an Enterprise plan. To enable self-hosting for your team, contact sales.

Self-hosting lets your team run Oz agent workloads on your own infrastructure instead of Warp-managed servers. You control the execution environment, compute resources, and network access — repositories are cloned and stored only on your machines, and agents can reach services behind your VPN or firewall.

Warp uses a split architecture: execution happens on your infrastructure, while orchestration, session management, and LLM inference route through Warp's backend. This means agent interactions — including code context in session transcripts and LLM prompts — transit Warp's control plane under Zero Data Retention (ZDR) agreements. Warp does not persistently store your source code or train on your data.

Self-hosted Oz architecture showing customer-managed execution with Oz orchestration

With any self-hosted architecture:

Agent runs are tracked and steerable — View status, metadata, and session transcripts in the Oz dashboard, the Warp app, or via the API/SDK. Authorized teammates can attach to running sessions to monitor or steer agents.
Connectivity to Warp's backend is required — Agents need outbound access to Warp for orchestration, session storage, and LLM inference. No inbound ports need to be opened.
Resource limits are controlled by your infrastructure — Concurrency and compute are only limited by the machines you provision, not by Warp.

Enterprise teams that need full control over LLM inference routing can use Bring Your Own LLM (BYOLLM) to route inference through their own AWS Bedrock or other cloud provider accounts. BYOLLM currently applies to interactive (local) agents; cloud agent support is coming.

Architecture options

There are two architectures for self-hosted Oz agents:

Managed — Run the oz-agent-worker daemon on your infrastructure. The Oz platform orchestrates agents remotely, starting them in isolated Docker containers, Kubernetes Jobs, or directly on the host. Similar to a GitHub self-hosted runner.
Unmanaged — Use the oz agent run command to start agents anywhere — in CI pipelines, Kubernetes pods, VMs, or dev boxes. You control orchestration; Warp provides tracking and observability.

Comparison

Managed architecture:

✅ Full Oz orchestration — start agents from Slack, Linear, the API/SDK, Cloud Mode, or oz agent run-cloud
✅ Automatic environment setup (Docker image, repo cloning, setup commands)
✅ Task isolation via Docker containers or Kubernetes Jobs
✅ Remotely start and stop agents
✅ Runs on all Linux distributions; will support macOS and Windows in the future
✅ Three execution backends: Docker (default), Kubernetes, and Direct (no container runtime required)

Unmanaged architecture:

✅ Runs on any platform Warp supports (Linux, macOS, Windows)
✅ No Docker dependency — agents run directly on the host
✅ Drop-in replacement for other CLI agents (Claude Code, Codex CLI) in your existing orchestrator
✅ Tracked and steerable sessions
❌ No remote start via Slack, Linear, or the API (you start agents yourself)
❌ No automatic environment setup (you manage the environment)

Choosing an architecture

Use these questions to determine which architecture fits your team:

Do you want Oz to handle starting and stopping agents (from Slack, the web interface, the Warp desktop app, or the API)?
- Yes → Use the managed architecture.
- No, you have your own triggering mechanism → Use the unmanaged architecture.
Do you need agents to run on Windows or macOS?
- Yes → Use the unmanaged architecture.
Can your development environment run in a Docker container or Kubernetes pod?
- Yes, Docker → Use the managed architecture with the Docker backend.
- Yes, Kubernetes → Use the managed architecture with the Kubernetes backend.
- No (for example, complex multi-service stacks, heavy resource requirements, or container limitations) → Use the unmanaged architecture or the managed architecture with the Direct backend.
Do you want to use BYOLLM for cloud agent inference?
- Yes → Use the managed architecture. BYOLLM support for the managed architecture is coming soon.
- Not needed → Either architecture works.
Do you have your own orchestrator (CI/CD, Kubernetes, internal job scheduler) that starts agents on demand?
- Yes → Use the unmanaged architecture with oz agent run as a drop-in.
- No → Use the managed architecture.

The two architectures are not mutually exclusive. Some teams use the managed architecture for integration-triggered work (Slack, Linear) and the unmanaged architecture for CI pipelines or dev boxes.

Choosing a managed backend

The managed architecture supports three backends for task execution:

Are you deploying the worker into a Kubernetes cluster?
- Yes → Use the Kubernetes backend. Each task runs as a Kubernetes Job in your cluster. Install using the included Helm chart.
- No → Continue to the next question.
Is Docker available on your worker host?
- Yes → Use the Docker backend (default). Tasks run in isolated containers.
- No → Use the Direct backend. Tasks run directly on the host.
Do you need container-level isolation between tasks?
- Yes → Use the Docker backend or Kubernetes backend.
- No → Any backend works.
Do you need Kubernetes-native scheduling, resource management, or policy enforcement?
- Yes → Use the Kubernetes backend.
- No → The Docker or Direct backend is simpler to set up.

See the managed worker reference for backend configuration details.

Unmanaged architecture

With the unmanaged architecture, you run oz agent run inside your own orchestrator or dev environment. This works on any platform Warp supports (Linux, macOS, and Windows), with no dependency on Docker or any other sandboxing platform.

You are responsible for executing oz agent run on your infrastructure, similarly to how you would integrate Claude Code or Codex CLI. The agent runs directly on the host, which could itself be a Kubernetes pod, VM, container, or CI runner.

When to use

CI/CD pipelines — Run agents as part of your build or deployment workflow. This is how the warpdotdev/oz-agent-action GitHub Action works.
Kubernetes pods — Run agents in pods with access to your cluster's network and services.
Dev boxes and VMs — Run agents in pre-provisioned development environments, especially useful for large monorepos with long setup times.
Existing orchestrators — Drop oz agent run into any system that schedules work (Jenkins, Buildkite, internal job schedulers).

Setup

Install the Oz CLI on the machine where agents will run. See Installing the CLI for platform-specific instructions.
Authenticate using an API key (recommended for automation):

export WARP_API_KEY="your_team_api_key"

Run the agent:

oz agent run --prompt "Refactor the authentication module" --share team

The agent runs in the current working directory and has access to whatever tools and network resources the host provides.

Use --share to control session visibility:

--share — share the session with yourself (accessible on other devices or in a browser)
--share team or --share team:view — give all team members read-only access
--share team:edit — give all team members read/write access
--share [email protected] — give a specific user read-only access
--share [email protected]:edit — give a specific user read/write access

The --share flag can be repeated to combine multiple sharing targets. If you authenticate with a team API key, agents are automatically team-scoped.

Example: GitHub Actions

Warp maintains the warpdotdev/oz-agent-action for running agents in GitHub Actions. This action wraps oz agent run under the hood, making it a drop-in for CI workflows:

- name: Run Oz agent
  uses: warpdotdev/oz-agent-action@v1  # wraps `oz agent run` under the hood
  with:
    prompt: "Review the code changes on this branch"
    warp_api_key: ${{ secrets.WARP_API_KEY }}

See GitHub Actions integration for full details.

Example: Kubernetes

Run an agent inside a Kubernetes pod with access to your cluster's services:

apiVersion: batch/v1
kind: Job
metadata:
  name: oz-agent-task
spec:
  template:
    spec:
      containers:
      - name: oz-agent
        image: warpdotdev/warp-agent:latest
        command: ["agent", "run", "--prompt", "Run the test suite and report failures"]
        env:
        - name: WARP_API_KEY
          valueFrom:
            secretKeyRef:
              name: warp-credentials
              key: api-key
      restartPolicy: Never

For production deployments, pin to a specific Docker image digest (e.g., warpdotdev/warp-agent@sha256:...) instead of latest to ensure reproducible builds.

Whether Kubernetes pods provide sufficient sandboxing for agents depends on your cluster configuration and risk profile. Evaluate your pod security policies, network policies, and RBAC settings based on your organization's security requirements.

Tracking and observability

Unmanaged agents are tracked on Warp's backend. Each run creates a persistent session that your team can view in the Oz dashboard, attach to via session sharing, and query through the API/SDK.

Managed architecture

With the managed architecture, you run the oz-agent-worker daemon on your infrastructure. The daemon connects to Warp's backend, waits for agent tasks to be assigned to it, and executes those tasks on its host using one of three backends:

Docker backend (default) — Runs each task in an isolated Docker container.
Kubernetes backend — Runs each task as a Kubernetes Job in your cluster.
Direct backend — Runs each task directly on the host without a container runtime.

This model works similarly to a GitHub self-hosted runner.

The managed architecture enables full orchestration by the Oz platform — it can remotely start agents via Slack, Linear, the API/SDK, Cloud Mode, and the oz agent run-cloud command. Agents can access host resources through volume mounts (Docker backend), Kubernetes-native configuration (Kubernetes backend), and injected environment variables.

Prerequisites

Before setting up a self-hosted worker, ensure you have:

A machine to run the worker — A VM, server, or local machine running Linux (recommended for production), macOS, or Windows. For the Kubernetes backend, the worker runs as a Deployment inside your cluster.
Docker installed (Docker backend only) — The worker uses Docker to run agent tasks in isolated containers. The Docker daemon must run Linux containers (Windows containers are not supported). Verify Docker is installed and running with docker info. Skip this if you plan to use the Kubernetes or Direct backend.
A Kubernetes cluster (Kubernetes backend only) — The worker needs API access to create Jobs and Pods in a target namespace. See the Kubernetes backend reference for RBAC requirements.
Enterprise plan with self-hosting enabled — Contact sales if self-hosting is not yet enabled for your team.
A team API key — In the Warp app, go to Settings > Platform to create a team-scoped API key.

When using the Docker backend, task containers require a linux/amd64 or linux/arm64 Docker daemon. The worker host itself can be any OS — Docker Desktop on macOS and Windows runs a Linux VM that satisfies this requirement.

Install Docker

If you plan to use the Docker backend (the default), install Docker on the worker host. Skip this step if you are using the Kubernetes backend or Direct backend.

If Docker is not already installed, follow the official Docker installation guide for your platform.

Verify Docker is running:

docker info

Running the worker

The worker is open source. See the oz-agent-worker repository for source code, issues, and contribution guidelines.

There are four ways to install the worker: via Docker (recommended for Docker backend), via the Helm chart (recommended for Kubernetes backend), via go install, or by building from source.

The worker can be configured entirely via CLI flags, or via a YAML config file for more complex setups.

Set your API key

In the Warp app, go to Settings > Platform to create a team API key. Then export it as an environment variable:

export WARP_API_KEY="your_team_api_key"

Option 1: Docker (recommended)

The worker needs access to the Docker daemon to spawn task containers. Mount the host's Docker socket into the container:

docker run -v /var/run/docker.sock:/var/run/docker.sock \
  -e WARP_API_KEY="$WARP_API_KEY" \
  warpdotdev/oz-agent-worker --worker-id "my-worker"

Option 2: Go install

go install github.com/warpdotdev/oz-agent-worker@latest
oz-agent-worker --api-key "$WARP_API_KEY" --worker-id "my-worker"

Option 3: Build from source

git clone https://github.com/warpdotdev/oz-agent-worker.git
cd oz-agent-worker
go build -o oz-agent-worker
./oz-agent-worker --api-key "$WARP_API_KEY" --worker-id "my-worker"

Option 4: Helm chart (Kubernetes)

For Kubernetes deployments, use the included Helm chart. The chart deploys the worker as a long-lived Deployment that creates one Kubernetes Job per task. It does not require CRDs or cluster-scoped RBAC.

# Clone the worker repository
git clone https://github.com/warpdotdev/oz-agent-worker.git

# Create the namespace
kubectl create namespace warp-oz

# Create a Secret with your API key (if not using an existing Secret)
kubectl create secret generic oz-agent-worker \
  --from-literal=WARP_API_KEY="$WARP_API_KEY" \
  --namespace warp-oz

# Install the chart (replace the image tag with the latest release from the oz-agent-worker repository)
helm install oz-agent-worker ./oz-agent-worker/charts/oz-agent-worker \
  --namespace warp-oz \
  --set worker.workerId=oz-k8s-worker \
  --set image.tag=<version>

Set image.tag explicitly to pin the worker image. Check the oz-agent-worker releases for the latest version. Do not rely on latest.

The chart includes namespace-scoped RBAC, a ConfigMap for worker configuration, and an optional Secret for the API key. To scale, deploy multiple Helm releases with distinct worker IDs rather than increasing replicas on a single release.

For configuration options and operational notes, see the Kubernetes backend and Helm chart sections in the managed worker reference.

For the full list of worker CLI flags, Docker connectivity options, and private registry configuration, see the managed worker reference.

Routing runs to self-hosted workers

To run an Oz cloud agent on your self-hosted worker, specify the --host flag with your worker ID. The --host value must match the --worker-id of a connected worker exactly.

From the CLI

oz agent run-cloud --prompt "Refactor the authentication module" --host "my-worker"

You can combine --host with any other run-cloud flags, such as --environment, --model, --mcp, --skill, --computer-use, and --attach.

From scheduled agents

When creating or updating a schedule, specify the host:

oz schedule create --name "daily-cleanup" \
  --cron "0 9 * * *" \
  --prompt "Run dead code cleanup" \
  --environment ENV_ID \
  --host "my-worker"

oz schedule update SCHEDULE_ID --host "my-worker"

From integrations

When creating or updating an integration, specify the host:

oz integration create slack --host "my-worker" ...
oz integration update linear --host "my-worker" ...

All tasks created through that integration will be routed to your self-hosted worker.

From the API / SDKs

When creating a run via the Oz API, include worker_host in the config:

curl -X POST https://app.warp.dev/api/v1/agent/run \
  --header 'Authorization: Bearer YOUR_API_KEY' \
  --header 'Content-Type: application/json' \
  --data '{
    "prompt": "Refactor the authentication module",
    "config": {
      "environment_id": "ENV_ID",
      "worker_host": "my-worker"
    }
  }'

From the web UI

When creating a run, schedule, or integration in the Oz web app, select your self-hosted worker from the host dropdown.

Environments with self-hosted workers

Self-hosted workers fully support environments. When a task specifies an environment, the worker:

Resolves the Docker image for the task. The precedence for image selection is: Warp environment image > worker-configured default_image (Kubernetes backend only) > ubuntu:22.04.
Clones the repositories and runs setup commands as configured.
Executes the agent inside the prepared container or Kubernetes Job.

The same environment can be used for both Warp-hosted and self-hosted runs without modification. See Environments for details on creating and configuring environments.

With the Kubernetes backend, setting a default_image on the worker lets you skip creating a Warp environment when all your tasks use the same base image.

Environments work the same way across all three backends (Docker, Kubernetes, and Direct). The environment's Docker image is used as the task container image regardless of backend. With the Kubernetes backend, the image is pulled according to the cluster's image pull policy and any configured imagePullSecrets in the pod_template.

The architecture of the environment's Docker image must match the architecture of the execution node. For the Docker backend, this means matching the Docker daemon platform. For the Kubernetes backend, this means matching the node architecture where the task pod is scheduled.

Musl-based Docker images (such as Alpine Linux) are not supported as task images. The agent runtime requires glibc. Use glibc-based images like Debian, Ubuntu, or the default (non-Alpine) variants of official Docker Hub images.

Monitoring runs

Self-hosted runs have the same observability as Warp-hosted runs:

Management UI — View task status, history, and metadata in the Oz dashboard.
Session sharing — Authorized teammates can attach to running tasks to monitor progress.
APIs and SDKs — Query task history and build monitoring using the Oz API.

Network requirements

Self-hosted Oz agents do not require any network ingress. They do require outbound (egress) access to the following services:

Warp's backend (all architectures):

app.warp.dev — port 443
rtc.app.warp.dev — port 443
sessions.app.warp.dev — port 443
oz.warp.dev — port 443 (managed architecture only)

The central Docker Hub — for pulling task images (managed architecture only)

GitHub (github.com) — only with the managed architecture, when using a Warp environment with configured GitHub repositories

Linux distribution-specific package repositories — only with the managed architecture, when using a Warp environment that does not have Git pre-installed. The exact repositories depend on the package manager configuration within the environment's base image.

All traffic uses HTTPS (port 443). No inbound ports need to be opened.

Security considerations

Self-hosting uses a split architecture. Understanding which data stays on your infrastructure and which routes through Warp is critical for security evaluation:

Stored and executed only on your infrastructure:

Repository clones and source files
Build artifacts and compiled outputs
Runtime secrets and environment variables
Container filesystem state

Routes through Warp's backend (under ZDR):

Orchestration metadata (task status, lifecycle events)
Session transcripts — which include agent-generated summaries of code context, file contents the agent reads, and command output
LLM inference requests and responses — which include code context from the agent's interactions

While repositories are cloned and stored only on your infrastructure, code content appears in session transcripts and LLM prompts as part of normal agent operation. All data routed through Warp's backend is covered by ZDR agreements — Warp does not persistently store your source code or use it for model training.

Additional considerations:

Docker socket access — (Docker backend) The worker requires access to the Docker daemon to create task containers. When running the worker via Docker, this means mounting /var/run/docker.sock. Ensure appropriate access controls on the host.
Kubernetes RBAC — (Kubernetes backend) The worker needs namespaced permissions to create, get, list, watch, and delete Jobs and Pods. The Helm chart creates a minimal Role/RoleBinding scoped to a single namespace. The task namespace must allow creating Jobs with a root init container, as sidecar materialization currently depends on that pattern. Review your Pod Security Standards and admission policies accordingly.
Kubernetes service accounts — (Kubernetes backend) The worker Deployment's ServiceAccount (used by the long-lived worker process) is separate from the optional task Job serviceAccountName you may configure in pod_template. Scope each appropriately.
Network egress — See Network requirements for the full list of required endpoints. No inbound ports need to be opened.
API key management — Store your WARP_API_KEY securely (e.g., in a Kubernetes Secret or secrets manager). Avoid hardcoding it in scripts or config files. If your organization uses an external secrets manager (HashiCorp Vault, AWS Secrets Manager, GCP Secret Manager, etc.), you can inject secrets into task pods via the CSI Secrets Store Driver or a similar operator — configure the required volumes, volumeMounts, and annotations in pod_template.
Task isolation — Docker backend: each task runs in its own container. Kubernetes backend: each task runs as a separate Kubernetes Job/Pod. Direct backend: each task runs in an isolated workspace directory but shares the host OS. Containers, Jobs, and workspaces are removed after execution by default (disable with --no-cleanup for debugging).
Volume mounts — (Docker backend) If using -v / --volumes, be mindful of what host paths you expose to task containers.
Direct backend environment — The direct backend intentionally starts tasks with a minimal environment (HOME, TMPDIR, PATH only). Sensitive worker credentials like WARP_API_KEY are not passed to tasks unless explicitly configured.
LLM inference — Enterprise teams needing full inference control can use BYOLLM for interactive (local) agents; cloud agent BYOLLM support is coming.
VPN and on-prem access — Since agents run on your infrastructure, they inherit your network access. This means self-hosted agents can reach services behind VPNs, self-hosted GitLab/Bitbucket instances, and other internal resources.

Troubleshooting

The troubleshooting steps below apply to the managed architecture (oz-agent-worker daemon). For unmanaged deployments, refer to the documentation for the environment running oz agent run (e.g., GitHub Actions, Kubernetes).

Worker won't start

Docker backend — Verify Docker is running (docker info) and that the daemon platform is linux/amd64 or linux/arm64.
Kubernetes backend — The worker runs a startup preflight Job to verify cluster connectivity and permissions. If the preflight fails, check the worker logs for details. Common causes: insufficient RBAC permissions, Pod Security policies blocking the root init container, or an unreachable Kubernetes API server. If your cluster restricts image sources, set preflight_image to an allowlisted image (defaults to busybox:1.36). To pull images from a private container registry, configure imagePullSecrets in pod_template — these secrets also apply to the preflight Job.
Direct backend — Verify the Oz CLI is installed and in your PATH (or set oz_path in the config file).

Worker won't connect

Verify your API key is correct, not expired, and has team scope.
Regenerate the API key in Settings > Platform if you suspect it is invalid.
Ensure the machine has outbound internet access to Oz.
Check that no firewall rules are blocking WebSocket connections to wss://oz.warp.dev.
Increase log verbosity with --log-level debug to see connection details.

Tasks not being picked up

Confirm the worker is running and connected (check the worker logs).
Verify the --host parameter matches your --worker-id exactly (case-sensitive).
Ensure the worker's team matches the team creating the task.

Task failures

Review task logs in the Oz dashboard or via session sharing.
Use --no-cleanup to keep the container, Job, or workspace around for inspection after failure.
Use --log-level debug to see detailed execution logs.
Ensure the worker machine or cluster has sufficient resources (CPU, memory, disk).
Docker backend — Verify Docker is running (docker info). If using a custom image, confirm it is glibc-based (not Alpine/musl) and that its architecture matches the worker's Docker daemon platform.
Kubernetes backend — Check the task Job and Pod status with kubectl get jobs,pods -n <namespace>. Common issues include unschedulable pods (check node selectors, tolerations, and resource requests), image pull failures (check imagePullSecrets in pod_template), and admission policy rejections. The worker will fail a task early if its pod remains unschedulable beyond unschedulable_timeout (default: 30s).
Direct backend — Verify the Oz CLI is accessible and that the workspace root directory has write permissions.

Image pull failures

Docker backend — If using a private registry, ensure Docker credentials are available to the worker (see Private Docker registries). Try pulling the image manually with docker pull <image> on the worker host.
Kubernetes backend — Configure imagePullSecrets in the pod_template section of your worker config. Verify the secret exists in the task namespace and contains valid credentials.
Verify the image exists and the tag is correct.
Check network connectivity to the registry.

PreviousDeployment Patterns NextManaged worker reference

Last updated 4 days ago

Was this helpful?

hashtagArchitecture options

hashtagComparison

hashtagChoosing an architecture

hashtagChoosing a managed backend

hashtagUnmanaged architecture

hashtagWhen to use

hashtagSetup

hashtagExample: GitHub Actions

hashtagExample: Kubernetes

hashtagTracking and observability

hashtagManaged architecture

hashtagPrerequisites

hashtagInstall Docker

hashtagRunning the worker

hashtagSet your API key

hashtagOption 1: Docker (recommended)

hashtagOption 2: Go install

hashtagOption 3: Build from source

hashtagOption 4: Helm chart (Kubernetes)

hashtagRouting runs to self-hosted workers

hashtagFrom the CLI

hashtagFrom scheduled agents

hashtagFrom integrations

hashtagFrom the API / SDKs

hashtagFrom the web UI

hashtagEnvironments with self-hosted workers

hashtagMonitoring runs

hashtagNetwork requirements

hashtagSecurity considerations

hashtagTroubleshooting

hashtagWorker won't start

hashtagWorker won't connect

hashtagTasks not being picked up

hashtagTask failures

hashtagImage pull failures

Architecture options

Comparison

Choosing an architecture

Choosing a managed backend

Unmanaged architecture

When to use

Setup

Example: GitHub Actions

Example: Kubernetes

Tracking and observability

Managed architecture

Prerequisites

Install Docker

Running the worker

Set your API key

Option 1: Docker (recommended)

Option 2: Go install

Option 3: Build from source

Option 4: Helm chart (Kubernetes)

Routing runs to self-hosted workers

From the CLI

From scheduled agents

From integrations

From the API / SDKs

From the web UI

Environments with self-hosted workers

Monitoring runs

Network requirements

Security considerations

Troubleshooting

Worker won't start

Worker won't connect

Tasks not being picked up

Task failures

Image pull failures