🦞Clawmark™ Verified
DAWES™ Benchmark
Domain-Aware Weighted Evaluation System — measuring real-world AI capability in insurance & engineering scenarios.
Leaderboard
Benchmark Archive
v1.0 — 2026-04-13 · 13 models
Initial benchmark: 60 domain-specific insurance & engineering (I&E) questions, 13 models evaluated.
Papers & Methodology
Are you a lab or researcher?
We welcome scrutiny. Reproduce our results, prove us wrong, or extend the benchmark. All data is open.
View on GitHub →