Browse Claude Skill projects under the "agentic-evaluation" topic.
An in-the-wild benchmark for AI agents in the OpenClaw Environment.