Run Hermes Gateway as a persistent remote agent on RunPod.
You'll need two secrets before deploying. Generate and store them in a password manager before proceeding.
1. Hermes API Key — protects your pod endpoint from unauthorized access:
openssl rand -hex 322. OpenRouter API Key — get one at openrouter.ai/workspaces/default/keys
Store both keys in a password manager (1Password, Bitwarden, etc.) — you'll need them every time you deploy a pod.
- Go to RunPod CPU deploy
- Select 2 vCPUs as a starting point
- Name your pod or keep the generated name
- Click Change Template and select hermes_runpod_base
- Click Edit Template and set the following env vars at minimum:
HERMES_API_KEY— a secret key to protect your endpoint (generate one withopenssl rand -hex 32) — click Select Secret to use a stored secretMODEL_PROVIDER— e.g.openrouterOPENROUTER_API_KEY— your OpenRouter API key — click Select Secret to use a stored secretOPENROUTER_MODEL— e.g.anthropic/claude-sonnet-4.6(see options below)GIT_REPO_URL— your repo URL so the agent clones it on startup (optional)
- Click Deploy On-Demand
Once the pod is running, your endpoint is:
https://<pod-id>-8642.proxy.runpod.net
Image: ghcr.io/awolf81/hermes-runpod:base
Port: 8642
Disk: 10 GB minimum
Connects Hermes to any model via OpenRouter.
ENABLE_VLLM=0
MODEL_PROVIDER=openrouter
OPENROUTER_API_KEY=sk-or-...
OPENROUTER_MODEL=anthropic/claude-sonnet-4.6
HERMES_API_KEY=your-secret-keyExample models:
anthropic/claude-sonnet-4.6openai/gpt-5.3-codexgoogle/gemini-3.1-pro-previewqwen/qwen3.6-plusnvidia/nemotron-3-super-120b-a12bxiaomi/mimo-v2-pro
Connects Hermes to models via OpenCode Zen.
ENABLE_VLLM=0
MODEL_PROVIDER=opencode
OPENCODE_API_KEY=...
OPENCODE_MODEL=claude-sonnet-4
HERMES_API_KEY=your-secret-keyUse a dedicated RunPod serverless model endpoint (e.g. runpod-flash-nemotron).
ENABLE_VLLM=0
MODEL_BASE_URL=https://<endpoint-id>.api.runpod.ai/v1
HERMES_API_KEY=your-secret-keyExample models deployable as RunPod serverless endpoints:
nvidia/nemotron-3-super-120b-a12b(used in runpod-flash-nemotron)qwen/qwen3.6-plusxiaomi/mimo-v2-pro
Clone a repo into the workspace when the pod starts:
GIT_REPO_URL=https://github.com/you/your-repo
GIT_REPO_REF=main
GIT_TOKEN=ghp_... # only for private reposAdd the following provider to ~/.config/opencode/opencode.json:
{
"$schema": "https://opencode.ai/config.json",
"provider": {
"hermes-runpod": {
"npm": "@ai-sdk/openai-compatible",
"name": "Hermes on RunPod",
"options": {
"baseURL": "https://<pod-id>-8642.proxy.runpod.net/v1",
"apiKey": "{env:HERMES_API_KEY}"
},
"models": {
"hermes-agent": {
"id": "hermes-agent",
"name": "Hermes Agent (RunPod)",
"limit": {
"context": 32768,
"output": 4096
}
}
}
}
},
"model": "hermes-agent"
}Set HERMES_API_KEY in your shell and launch OpenCode:
export HERMES_API_KEY=your-secret-key
opencodecurl https://<pod-id>-8642.proxy.runpod.net/v1/models \
-H 'Authorization: Bearer your-secret-key'
Just in case the template sharing on Runpod is not working - I'm not sure if it is working. Here is the manual setup for the template:

