Claude Skill

Agents365-ai/video-podcast-maker

Video Podcast Maker is an AI skill for coding agents that creates video podcasts for Bilibili & YouTube. Supports multi-language, 6 TTS engines, and 4K Remotion rendering.

Overview

Stars758
Forks99
LanguagePython
Last pushed2026-04-27
Last synced2026-06-06
View on GitHub

Repository

OwnerAgents365-ai
Repositoryvideo-podcast-maker
Full nameAgents365-ai/video-podcast-maker
Repo ID1,159,261,054

Install this Skill

pip install -r skills/video-podcast-maker/requirements.txt

Registry

Typeopenclaw_skill
Quality score75/100
Verificationreadme_parsed
Last verified2026-06-06
Platforms
ClaudeOpenClawCodex
Capabilities
browsersearchimagevideoterminalworkflowagent-skillsai-videobilibiliclaude-code
Detected files
README.md
Config keys
AZURE_SPEECH_KEYVOLCENGINE_APPIDVOLCENGINE_ACCESS_TOKENDASHSCOPE_API_KEYELEVENLABS_API_KEYGOOGLE_TTS_API_KEYOPENAI_API_KEYGEMINI_API_KEY
Install methods
  • pip install -r skills/video-podcast-maker/requirements.txt
  • npx create-video@latest my-video-project
  • npx remotion studio # Should open browser preview
  • npm install remotion @remotion/cli @remotion/player zod
  • npx remotion studio src/remotion/index.ts

Summary

Video Podcast Maker is an AI-powered skill for coding agents that automates video podcast creation. It supports Bilibili and YouTube, offers multi-language output (zh-CN/en-US), integrates 6 TTS engines (Edge, Azure, ElevenLabs, OpenAI, Doubao, CosyVoice), and renders in 4K via Remotion.

Chinese description

面向编码代理的AI驱动视频播客创作技能。支持Bilibili和YouTube,多语言(中文简体/美式英语),6种TTS引擎(Edge/Azure/ElevenLabs/OpenAI/Doubao/CosyVoice),4K Remotion渲染。

Key features

  • AI-powered video podcast creation for coding agents
  • Supports Bilibili & YouTube platforms
  • Multi-language output: zh-CN and en-US
  • 6 TTS engines: Edge, Azure, ElevenLabs, OpenAI, Doubao, CosyVoice
  • 4K Remotion rendering for high-quality video

Use cases

  • Automate podcast-style video production for YouTube
  • Create Bilibili content with AI-generated narration
  • Generate multi-language video podcasts for global audiences
  • Leverage diverse TTS voices for dynamic audio tracks
  • Produce 4K videos for professional publishing

README excerpt

# Video Podcast Maker [中文文档](README_CN.md) Automated pipeline to create professional video podcasts from a topic. **Supports Bilibili, YouTube, Xiaohongshu, Douyin, and WeChat Channels** with multi-language output (zh-CN, en-US). Combines research, script generation, multi-engine TTS (Edge/Azure/Doubao/CosyVoice), Remotion video rendering, and FFmpeg audio mixing. **Works with:** [Claude Code](https://claude.ai/code) · [OpenClaw](https://openclaw.ai/) (ClawHub) · [OpenCode](https://opencode.ai/) · [Codex](https://openai.com/index/introducing-codex/) — any coding agent that supports SKILL.md **Publish to:** Bilibili · YouTube · Xiaohongshu · Douyin · WeChat Channels > **No coding required!** Just describe your topic in plain language — the coding agent guides you through each step interactively. You make creative decisions, the agent handles all the technical details. Creating your first video podcast is easier than you think. > **Note:** This project is still under active development and may not be fully mature yet. We are continuously iterating and improving. Your feedback and suggestions are greatly appreciated — feel free to [open an issue](https://github.com/Agents365-ai/video-podcast-maker/issues) or reach out! ## Features - **Topic Research** - Web search and content gathering - **Script Writing** - Structured narration with section markers - **Multi-TTS** - Edge TTS (free), Azure Speech, Volcengine Doubao, CosyVoice, ElevenLabs, Google Cloud TTS, OpenAI TTS - **Remotion Video** - React-based video composition with animations - **Visual Style Editing** - Adjust colors, fonts, and layout in Remotion Studio UI - **Real-time Preview** - Remotion Studio for instant debugging before render - **Auto Timing** - Audio-video sync via `timing.json` - **BGM Mixing**

Topics

Explore more

Data from GitHub. Synced on 2026-06-06