Claude Skill
Agents365-ai/video-podcast-maker
Video Podcast Maker is an AI skill for coding agents that creates video podcasts for Bilibili & YouTube. Supports multi-language, 6 TTS engines, and 4K Remotion rendering.
Overview
Repository
Install this Skill
pip install -r skills/video-podcast-maker/requirements.txtRegistry
pip install -r skills/video-podcast-maker/requirements.txtnpx create-video@latest my-video-projectnpx remotion studio # Should open browser previewnpm install remotion @remotion/cli @remotion/player zodnpx remotion studio src/remotion/index.ts
Summary
Video Podcast Maker is an AI-powered skill for coding agents that automates video podcast creation. It supports Bilibili and YouTube, offers multi-language output (zh-CN/en-US), integrates 6 TTS engines (Edge, Azure, ElevenLabs, OpenAI, Doubao, CosyVoice), and renders in 4K via Remotion.
面向编码代理的AI驱动视频播客创作技能。支持Bilibili和YouTube,多语言(中文简体/美式英语),6种TTS引擎(Edge/Azure/ElevenLabs/OpenAI/Doubao/CosyVoice),4K Remotion渲染。
Key features
- AI-powered video podcast creation for coding agents
- Supports Bilibili & YouTube platforms
- Multi-language output: zh-CN and en-US
- 6 TTS engines: Edge, Azure, ElevenLabs, OpenAI, Doubao, CosyVoice
- 4K Remotion rendering for high-quality video
Use cases
- Automate podcast-style video production for YouTube
- Create Bilibili content with AI-generated narration
- Generate multi-language video podcasts for global audiences
- Leverage diverse TTS voices for dynamic audio tracks
- Produce 4K videos for professional publishing
README excerpt
# Video Podcast Maker [中文文档](README_CN.md) Automated pipeline to create professional video podcasts from a topic. **Supports Bilibili, YouTube, Xiaohongshu, Douyin, and WeChat Channels** with multi-language output (zh-CN, en-US). Combines research, script generation, multi-engine TTS (Edge/Azure/Doubao/CosyVoice), Remotion video rendering, and FFmpeg audio mixing. **Works with:** [Claude Code](https://claude.ai/code) · [OpenClaw](https://openclaw.ai/) (ClawHub) · [OpenCode](https://opencode.ai/) · [Codex](https://openai.com/index/introducing-codex/) — any coding agent that supports SKILL.md **Publish to:** Bilibili · YouTube · Xiaohongshu · Douyin · WeChat Channels > **No coding required!** Just describe your topic in plain language — the coding agent guides you through each step interactively. You make creative decisions, the agent handles all the technical details. Creating your first video podcast is easier than you think. > **Note:** This project is still under active development and may not be fully mature yet. We are continuously iterating and improving. Your feedback and suggestions are greatly appreciated — feel free to [open an issue](https://github.com/Agents365-ai/video-podcast-maker/issues) or reach out! ## Features - **Topic Research** - Web search and content gathering - **Script Writing** - Structured narration with section markers - **Multi-TTS** - Edge TTS (free), Azure Speech, Volcengine Doubao, CosyVoice, ElevenLabs, Google Cloud TTS, OpenAI TTS - **Remotion Video** - React-based video composition with animations - **Visual Style Editing** - Adjust colors, fonts, and layout in Remotion Studio UI - **Real-time Preview** - Remotion Studio for instant debugging before render - **Auto Timing** - Audio-video sync via `timing.json` - **BGM Mixing**