Claude Skill
mrgoonie/human-mcp
Human-MCP is a TypeScript MCP server that bridges AI agents like Claude Code with human tools and services, enabling real-world task execution through a standardized protocol.
Overview
Repository
Install this Skill
git clone https://github.com/human-mcp/human-mcp.gitRegistry
git clone https://github.com/human-mcp/human-mcp.gitclaude mcp add --scope user human-mcp npx @goonnguyen/human-mcp --env GOOGLE_GEMINI_API_KEY=your_api_key_hereclaude mcp add --scope project human-mcp npx @goonnguyen/human-mcp --env GOOGLE_GEMINI_API_KEY=your_api_key_hereclaude mcp add --scope local human-mcp npx @goonnguyen/human-mcp --env GOOGLE_GEMINI_API_KEY=your_api_key_hereclaude mcp add --scope user human-mcp cmd /c npx @goonnguyen/human-mcp --env GOOGLE_GEMINI_API_KEY=your_api_key_here
Summary
Human-MCP is a TypeScript-based MCP server that bridges AI agents with human capabilities, enabling Claude Code and other AI assistants to interact with real-world tools and services through a standardized Model Context Protocol interface.
将人类能力赋予AI代理
Key features
- Standardized MCP interface for AI-human tool interaction
- Built with TypeScript for reliability and type safety
- Designed for Claude Code and other MCP-compatible agents
- Open-source project with active community support
Use cases
- Enabling Claude Code to execute real-world tasks via MCP
- Integrating human-in-the-loop capabilities into AI workflows
- Building custom AI agents that leverage human tools and services
README excerpt
# Human MCP 👁️ > Bringing Human Capabilities to AI Agents  Human MCP v2.16.0 is a comprehensive Model Context Protocol server that provides AI coding agents with human-like capabilities including visual analysis, document processing, speech generation, content creation, image editing, browser automation, and advanced reasoning for debugging, understanding, and enhancing multimodal content. ## "Human MCP" is a part of [ClaudeKit](https://claudekit.cc)  ## Features 🎯 **Visual Analysis (Eyes) - ✅ Complete (4 tools)** - **eyes_analyze**: Analyze images, videos, and GIFs for UI bugs, errors, and accessibility - **eyes_compare**: Compare two images to find visual differences - **eyes_read_document**: Extract text and data from PDF, DOCX, XLSX, PPTX, and more - **eyes_summarize_document**: Generate summaries and insights from documents ✋ **Content Generation & Image Editing (Hands) - ✅ Complete (18 tools)** - **Image Generation** (1 tool): gemini_gen_image - Generate images from text using Imagen API - **Video Generation** (2 tools): gemini_gen_video, gemini_image_to_video - Create videos with Veo 3.0 - **Music Generation** (2 tools): minimax_gen_music, elevenlabs_gen_music - Generate music with vocals - **Sound Effects** (1 tool): elevenlabs_gen_sfx - Generate sound effects from text descriptions - **AI Image Editing** (5 tools): Gemini-powered editing with inpainting, outpainting, style transfer, object manipulation, composition - **Jimp Processing** (4 tools): Local image manipulation - crop, resize, rotate, mask - **Background Removal** (1 tool): rmbg_remove_background - AI-powered background removal - **Browser Automation** (3 tools): playwright_screenshot_fullpage, playwright_screenshot_viewport, playwright_