update: add podcast-creator skill by mvanhorn · Pull Request #4 · MiniMax-AI/skills

mvanhorn · 2026-03-22T14:07:53Z

Summary

Adds a podcast-creator skill that converts text scripts into podcast episodes using MiniMax TTS and Music APIs.

What it does

Takes a text script (plain text, Markdown, or structured JSON with chapters)
Generates narration via MiniMax TTS API (speech-2.8-hd) with configurable voice selection
Generates intro/outro music via MiniMax Music API (music-2.5+)
Assembles everything with ffmpeg into a final podcast MP3 with crossfades and ID3 tags
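
For reviewers who want a concrete picture of the assembly step, here is a minimal sketch of how an ffmpeg command with crossfades and ID3 tags could be built. It is not the code in podcast_create.py: the function name, parameters, and file names are hypothetical, and it assumes exactly three inputs (intro, narration, outro) with compatible sample rates. It uses ffmpeg's `acrossfade` filter, `-metadata`, and the MP3 muxer's `-id3v2_version` option.

```python
from typing import Dict, List, Optional


def build_ffmpeg_command(
    intro: str,
    narration: str,
    outro: str,
    output: str,
    crossfade_seconds: float = 2.0,
    tags: Optional[Dict[str, str]] = None,
) -> List[str]:
    """Return an ffmpeg argv that crossfades intro -> narration -> outro.

    Hypothetical helper for illustration; it only builds the command,
    it does not run ffmpeg.
    """
    # Chain two acrossfade filters: (intro + narration), then (+ outro).
    filter_graph = (
        f"[0:a][1:a]acrossfade=d={crossfade_seconds}[a01];"
        f"[a01][2:a]acrossfade=d={crossfade_seconds}[out]"
    )
    cmd = [
        "ffmpeg", "-y",
        "-i", intro,
        "-i", narration,
        "-i", outro,
        "-filter_complex", filter_graph,
        "-map", "[out]",
        "-c:a", "libmp3lame",
        "-id3v2_version", "3",
    ]
    # Each tag becomes a "-metadata key=value" pair, written as ID3 on MP3 output.
    for key, value in (tags or {}).items():
        cmd += ["-metadata", f"{key}={value}"]
    cmd.append(output)
    return cmd
```

The returned list can be passed to `subprocess.run(cmd, check=True)`. Building the argv as a list rather than a shell string avoids quoting problems with titles that contain spaces.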

Why

The TTS and Music APIs are currently used only inside frontend-dev, for web asset generation. This skill exposes them for audio content creation. The existing minimax_tts.py and minimax_music.py scripts serve as the foundation; the new podcast_create.py orchestrator handles chapter splitting, voice assignment, and ffmpeg assembly.
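
To illustrate the chapter splitting and voice assignment mentioned above, here is a rough sketch. The input conventions are assumptions, not the format documented in references/script-format.md: Markdown chapters are assumed to start at `## ` headings, and JSON input is assumed to look like `{"chapters": [{"title", "text", "voice"}]}`. The `Chapter` class, both function names, and the voice IDs are hypothetical.

```python
import json
from dataclasses import dataclass
from itertools import cycle
from typing import List, Optional


@dataclass
class Chapter:
    title: str
    text: str
    voice: Optional[str] = None  # filled in later if the script does not set it


def split_chapters(script: str) -> List[Chapter]:
    """Split a script into chapters (assumed formats, see lead-in)."""
    stripped = script.lstrip()
    if stripped.startswith("{"):
        # Assumed JSON shape: {"chapters": [{"title": ..., "text": ..., "voice": ...}]}
        data = json.loads(stripped)
        return [
            Chapter(c.get("title", ""), c["text"], c.get("voice"))
            for c in data["chapters"]
        ]

    chapters: List[Chapter] = []
    title, lines = "", []
    for line in script.splitlines():
        if line.startswith("## "):
            # A new heading closes the previous chapter, if it had any text.
            if any(part.strip() for part in lines):
                chapters.append(Chapter(title, "\n".join(lines).strip()))
            title, lines = line[3:].strip(), []
        else:
            lines.append(line)
    if any(part.strip() for part in lines):
        chapters.append(Chapter(title, "\n".join(lines).strip()))
    return chapters


def assign_voices(chapters: List[Chapter], voices: List[str]) -> List[Chapter]:
    """Give each chapter without an explicit voice the next one in rotation."""
    if not voices:
        raise ValueError("at least one voice ID is required")
    rotation = cycle(voices)
    for chapter in chapters:
        if chapter.voice is None:
            chapter.voice = next(rotation)
    return chapters
```

Plain text with no headings falls through the Markdown branch and comes back as a single untitled chapter, so all three input types share one code path after parsing.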

Structure

skills/podcast-creator/
  SKILL.md                          # Skill definition with 6-step workflow
  scripts/
    podcast_create.py               # Audio assembler (crossfade + concat + ID3)
    minimax_tts.py                  # TTS script (copied from frontend-dev)
    minimax_music.py                # Music script (copied from frontend-dev)
  references/
    requirements.txt                # Python deps
    script-format.md                # Input format documentation

Follows the same pattern as gif-sticker-maker: SKILL.md with mandatory workflow steps, scripts dir with self-contained Python CLIs, references dir with supplementary docs.

Test plan

Verify SKILL.md frontmatter parses correctly
Run python3 -m py_compile scripts/podcast_create.py (passes)
Test with MINIMAX_API_KEY: generate narration, music, and assemble
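
The first item in the test plan could be checked with something like the sketch below. It is a deliberately small stand-in for a real YAML parser: it only handles flat `key: value` lines between the `---` delimiters and rejects anything else. The function name is hypothetical, and which fields SKILL.md actually requires is not stated in this PR, so the example values are placeholders.

```python
from typing import Dict


def parse_frontmatter(markdown: str) -> Dict[str, str]:
    """Parse flat 'key: value' frontmatter delimited by '---' lines.

    Minimal sketch only; nested YAML or multi-line values raise ValueError.
    """
    lines = markdown.splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("missing opening '---'")
    try:
        # Index of the closing delimiter, searching after the opening line.
        end = next(
            i for i, line in enumerate(lines[1:], start=1) if line.strip() == "---"
        )
    except StopIteration:
        raise ValueError("missing closing '---'") from None

    fields: Dict[str, str] = {}
    for line in lines[1:end]:
        if not line.strip() or line.lstrip().startswith("#"):
            continue  # skip blank lines and comments
        key, sep, value = line.partition(":")
        if not sep:
            raise ValueError(f"not a 'key: value' line: {line!r}")
        fields[key.strip()] = value.strip()
    return fields
```

If the repository already depends on a YAML library, using that instead would cover multi-line descriptions and nested keys that this sketch rejects.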

This contribution was developed with AI assistance (Claude Code).

Adds a podcast-creator skill that converts text scripts into podcast episodes using MiniMax TTS and Music APIs. Supports plain text, Markdown, and structured JSON input formats. Uses ffmpeg for audio assembly with crossfading between narration and intro/outro music.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

wu335230960 · 2026-03-22T14:08:23Z

Mm, yes.

mvanhorn wants to merge 1 commit into MiniMax-AI:main from mvanhorn:osc/feat-podcast-creator-skill
