Claude Skill

AgentR1/Claw-R1

Claw-R1 empowers OpenClaw with advanced agentic reinforcement learning. Open-source Python project for adaptive, intelligent agent behavior.

Overview

Stars181
Forks9
LanguagePython
Last pushed2026-04-08
Last synced2026-06-22
View on GitHub

Repository

OwnerAgentR1
RepositoryClaw-R1
Full nameAgentR1/Claw-R1
Repo ID1,171,472,579

Install this Skill

git clone https://github.com/AgentR1/Claw-R1.git

Registry

Typeopenclaw_skill
Quality score75/100
Verificationreadme_parsed
Last verified2026-06-22
Platforms
ClaudeOpenClaw
Capabilities
pdfagentagentic-rlopenclaw
Detected files
README.mddocspyproject.toml

Summary

Claw-R1 is an open-source project that enhances OpenClaw with advanced agentic reinforcement learning (RL) capabilities, enabling intelligent, adaptive behavior in agent-based systems.

Chinese description

Claw-R1:赋予OpenClaw高级代理强化学习能力。

Key features

  • Integrates agentic RL into OpenClaw for autonomous decision-making
  • Built on Python for ease of use and extensibility
  • Designed for adaptive and learning-based agent behavior
  • Open-source with community-driven development

Use cases

  • Training intelligent agents in simulated environments
  • Enhancing OpenClaw with RL-based control policies
  • Research in agentic reinforcement learning
  • Building adaptive automation systems

README excerpt

<h1 align="center"> Claw-R1: The Data Foundation for <br> Agentic Reinforcement Learning </h1> <p align="center"> <a href="https://agentr1.github.io/"><img src="https://img.shields.io/badge/Project-Home-orange.svg" alt="Project Home"></a> <a href="https://github.com/AgentR1/Claw-R1/stargazers"><img src="https://img.shields.io/github/stars/AgentR1/Claw-R1" alt="GitHub Repo stars"></a> <a href="https://github.com/AgentR1/Claw-R1/network/members"><img src="https://img.shields.io/github/forks/AgentR1/Claw-R1" alt="GitHub forks"></a> <a href="https://agentr1.github.io/Claw-R1/"><img src="https://img.shields.io/badge/docs-latest-blue.svg" alt="Docs"></a> </p> <p align="center"><img src="./assets/logo.jpeg" width="600px" alt="Claw-R1 Logo" /></p> ## News - **[2026.06]** **Claw-R1 Demo Technical Report Updated.** We updated the Claw-R1 Demo technical report: [https://arxiv.org/abs/2606.09138](https://arxiv.org/abs/2606.09138). - **[2026.06]** **Claw-R1 Dashboard Released.** The live dashboard adds Agentic RL data lifestyle management: collection monitoring, step-level representation, curation signals, prefix-tree optimization preview, and training consumption tracking. - **[2026.04]** 🌲 **Prefix Tree Merge for Agentic RL Training.** A new algorithm that deduplicates shared prefix computation in multi-step agent training via prefix tree packing + FlexAttention. Currently under testing on the [`prefix-tree-merge`](https://github.com/AgentR1/Claw-R1/tree/prefix-tree-merge) branch. See [documentation](https://agentr1.github.io/Claw-R1/components/prefix-tree-merge/). - **[2026.04]** 📚 **RL Training Internals Tutorial.** A comprehensive tutorial covering core RL concepts (Reward / Value / Advantage / Return / Loss), PPO & GRPO algorithms, and Claw-R1's step-level agent

Topics

Explore more

Data from GitHub. Synced on 2026-06-22