Evos

We codified decades of operational expertise.
Then we open-sourced it.

Evos are the first to translate and codify decades of operational domain expertise into agent-ready skills.

01

Every industry runs on expertise that took decades to build.

People have always acquired and held the most valuable expertise in their fields. Yet technology has never captured it: it stays locked in the heads of the people who do the work.

Compatible with

Claude
OpenClaw
OpenAI
GitHub
VS Code
Cursor
Gemini
ClawHub
SkillsMP
Amp
Roo Code
OpenCode
Letta
Mistral
Databricks
Hacker News

Five kinds of capability.

What a capability is

Benchmarked performance

Every capability is eval-tested against real operational scenarios.

Coverage spans 8 sectors across 4 industries and is growing, measured over 201 eval scenarios against a Sonnet 4 baseline with no domain capability.

Capability families

Logistics · Exceptions operator · 30 eval scenarios

Freight exception management

Works delays, damages, shortages, refused deliveries, and carrier disputes from first signal to resolution. Applies Carmack Amendment filing logic, assesses financial exposure, and routes each case down the right path — write-off, claim, or escalation.

With Evos capability: 95.0%
Sonnet 4, no domain capability: 85.2%

+9.8pp over baseline across 30 scenarios
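
The uplift is a percentage-point difference between the two scores, not a relative change. A minimal sketch of that arithmetic, assuming both figures are aggregate scores in percent over the same 30 scenarios (the function name is illustrative and not part of Evos):

```python
def uplift_pp(with_capability: float, baseline: float) -> float:
    """Percentage-point difference between two scores given in percent."""
    # Round to one decimal place to match the precision of the published figures.
    return round(with_capability - baseline, 1)


if __name__ == "__main__":
    print(uplift_pp(95.0, 85.2))
```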

What it covers

  • Carmack Amendment filing
  • Financial impact assessment
  • Carrier-specific escalation
  • Dispute resolution routing
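
To make the routing idea concrete, here is a rough sketch of how a case might be sent down one of the three paths named above: write-off, claim, or escalation. Everything in it is an assumption for illustration: the `FreightException` fields, the `route` function, and both dollar thresholds are hypothetical and do not describe how the Evos capability actually decides.

```python
from dataclasses import dataclass
from enum import Enum


class Route(Enum):
    WRITE_OFF = "write-off"
    CLAIM = "claim"
    ESCALATION = "escalation"


@dataclass(frozen=True)
class FreightException:
    kind: str               # e.g. "damage", "shortage", "refused delivery"
    exposure_usd: float     # estimated financial exposure of the case
    carrier_disputes: bool  # True if the carrier contests liability


# Hypothetical thresholds, chosen only for illustration.
WRITE_OFF_BELOW_USD = 100.0
ESCALATE_ABOVE_USD = 10_000.0


def route(case: FreightException) -> Route:
    """Pick a path from financial exposure and dispute status."""
    # Small losses are assumed not to be worth the cost of filing a claim.
    if case.exposure_usd < WRITE_OFF_BELOW_USD:
        return Route.WRITE_OFF
    # Disputed or high-exposure cases are assumed to need a human escalation.
    if case.carrier_disputes or case.exposure_usd > ESCALATE_ABOVE_USD:
        return Route.ESCALATION
    # Everything else goes through the standard claim path.
    return Route.CLAIM
```

A real routing policy would also weigh carrier-specific rules and claim filing deadlines, which this sketch leaves out.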

How it stays trustworthy

Every capability is governed.

A capable operator is not enough — it has to be one you can trust to act. Four guarantees hold for every capability in the library.

  1. Built from real operators

    Not scraped, not generated. Every capability is captured from people who have spent years doing the work, through structured elicitation drawn from cognitive and behavioural science.

  2. Named, versioned, governed

    Each capability is named and versioned, with its own guardrails — cost, rate, and the threshold above which it needs sign-off. You can see exactly what an operator can do, and when it changed.

  3. Deny-by-default

    An operator can only use the capabilities on its manifest. It calls nothing it has not been explicitly granted — no surprise actions, no scope it was never given.

  4. Monitored, self-tightening

    Every capability is monitored in production. When success on one starts to slip, the operator automatically routes that work back to human approval until it recovers.
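
The guarantees above can be sketched together in a few lines. This is a simplified illustration, not the Evos implementation: the `Capability` and `Operator` classes, the field names, the 0.9 success threshold, and the 20-call window are all assumptions made for the example. It shows a versioned capability with guardrails, a manifest lookup that denies anything not granted, and a rolling success rate that sends work back to human approval when it slips.

```python
from __future__ import annotations

from collections import deque
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Capability:
    name: str
    version: str
    max_cost_usd: float        # cost guardrail (recorded, not enforced in this sketch)
    max_calls_per_hour: int    # rate guardrail (recorded, not enforced in this sketch)
    sign_off_above_usd: float  # amounts above this need human sign-off


@dataclass
class Operator:
    manifest: dict[str, Capability]
    min_success_rate: float = 0.9  # hypothetical threshold
    window: int = 20               # hypothetical number of recent outcomes kept
    _outcomes: dict[str, deque] = field(default_factory=dict)

    def record(self, name: str, success: bool) -> None:
        """Store the outcome of one production call in a rolling window."""
        self._outcomes.setdefault(name, deque(maxlen=self.window)).append(success)

    def success_rate(self, name: str) -> float:
        outcomes = self._outcomes.get(name)
        if not outcomes:
            return 1.0  # no history yet: treated as healthy in this sketch
        return sum(outcomes) / len(outcomes)

    def decide(self, name: str, amount_usd: float) -> str:
        cap = self.manifest.get(name)
        if cap is None:
            return "deny"  # deny-by-default: not on the manifest
        if amount_usd > cap.sign_off_above_usd:
            return "needs_sign_off"  # above the sign-off threshold
        if self.success_rate(name) < self.min_success_rate:
            return "needs_sign_off"  # self-tightening: success has slipped
        return "allow"
```

For example, an operator whose manifest holds only a hypothetical `file_claim` capability would return "deny" for any other capability name, and would return "needs_sign_off" for `file_claim` once recent recorded outcomes fall below the threshold, returning to "allow" when the rolling window recovers.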

See it in context

The operators these capabilities power.

Browse the full catalogue of role areas an operator can own, across every legacy industry.

Operator catalogue

The operations workforce you don’t have to hire.