🦞Clawmark™ Verified

DAWES™ Benchmark

Domain-Aware Weighted Evaluation System — measuring real-world AI capability in insurance & engineering scenarios.

Leaderboard

v1.0 — 2026-04-13 · 13 models

Initial release: 60 domain-specific insurance & engineering (I&E) questions, evaluated on 13 models.

We welcome scrutiny. Reproduce our results, prove us wrong, or extend the benchmark. All data is open.