Claude Skill
LeoYeAI/myclaw-bench
MyClaw-Bench is the definitive benchmark for AI agents on OpenClaw, featuring 45 tasks across 4 tiers. Powered by MyClaw.ai, it enables standardized LLM-based agent evaluation.
Overview
Repository
Install this Skill
git clone https://github.com/LeoYeAI/myclaw-bench.gitRegistry
Summary
MyClaw-Bench is the definitive benchmark for evaluating AI agents on OpenClaw, featuring 45 tasks across 4 tiers. Powered by MyClaw.ai, it provides a standardized, rigorous testing framework for LLM-based agents.
OpenClaw上AI智能体的权威基准测试。涵盖4个层级共45项任务。由MyClaw.ai提供技术支持。
Key features
- 45 tasks across 4 progressive tiers
- Standardized benchmark for OpenClaw agents
- Powered by MyClaw.ai infrastructure
- Designed for LLM-based agent evaluation
- Open-source and community-driven
Use cases
- Benchmarking AI agents on OpenClaw
- Evaluating LLM-based agent performance
- Comparing agent capabilities across tiers
- Research in agent testing and evaluation
- Developing robust AI agents