Claude Skill

LeoYeAI/myclaw-bench

MyClaw-Bench is the definitive benchmark for AI agents on OpenClaw, featuring 45 tasks across 4 tiers. Powered by MyClaw.ai, it enables standardized LLM-based agent evaluation.

Overview

Stars232
Forks39
LanguagePython
Last pushed2026-03-09
Last synced2026-06-04
View on GitHub

Repository

OwnerLeoYeAI
Repositorymyclaw-bench
Full nameLeoYeAI/myclaw-bench
Repo ID1,176,468,769

Install this Skill

git clone https://github.com/LeoYeAI/myclaw-bench.git

Registry

TypeUnknown
Quality scoreUnknown
VerificationUnknown
Last verifiedUnknown

Summary

MyClaw-Bench is the definitive benchmark for evaluating AI agents on OpenClaw, featuring 45 tasks across 4 tiers. Powered by MyClaw.ai, it provides a standardized, rigorous testing framework for LLM-based agents.

Chinese description

OpenClaw上AI智能体的权威基准测试。涵盖4个层级共45项任务。由MyClaw.ai提供技术支持。

Key features

  • 45 tasks across 4 progressive tiers
  • Standardized benchmark for OpenClaw agents
  • Powered by MyClaw.ai infrastructure
  • Designed for LLM-based agent evaluation
  • Open-source and community-driven

Use cases

  • Benchmarking AI agents on OpenClaw
  • Evaluating LLM-based agent performance
  • Comparing agent capabilities across tiers
  • Research in agent testing and evaluation
  • Developing robust AI agents

Topics

Explore more

Data from GitHub. Synced on 2026-06-04