The honest comparison
Urfael, Hermes Agent, and OpenClaw, every win and every gap in one table. Urfael wins where it counts for a machine that lives on your desk and acts for you: the smallest blast radius, a flat bill, and the ability to prove what it did. We are smaller and newer, and we say so in the same table.
| Capability | Urfael | Hermes | OpenClaw |
|---|---|---|---|
| No inbound network port | none | varies | gateway/DMs |
| Ships an attack benchmark | yes, npm run security | no | no |
| Flat-rate cost (no per-token) | yes, subscription | no | no |
| Can prove what it did (ledger + seal) | yes | no | no |
| Live, watchable multi-agent Council | yes | opaque | opaque |
| Undoable coding turns (checkpoint + rewind) | yes, auto-checkpoint + reversible rewind | none documented | none documented |
| Skill hub that can't ship malware | scanned + sha-pinned + never run | partial | no scan gate |
| Proactive memory recall (every turn) | retrieves per turn | frozen snapshot | agent must search |
| Generative UI that can't run code | sanitized canvas | no | renders agent HTML |
| Chat-channel breadth | 19 (11 native + 8 webhook) | many | 20+ |
| Battle-tested at scale | small, and we say so | large | very large |
Where it counts for a machine on your desk: blast radius, cost, provability, and not overstating maturity. OpenClaw and Hermes are genuinely excellent at breadth and scale.
Want the caveats, not just the wins? See what's lightly tested ›