agentbench
// Benchmark your OpenClaw agent across 40 real-world tasks. Tests file creation, research, data analysis, multi-step workflows, memory, error handling, and tool efficiency. This is not a coding benchmark: it measures your agent setup and configuration.
$ git log --oneline --stat
stars: 1,933
forks: 367
updated: March 4, 2026
SKILL.md
This skill does not have a public SKILL.md file.