ai-podcast-creation
AI音声合成と音楽生成を組み合わせ、複数話者の会話やBGM付きのポッドキャストを自動作成するSkill。
📜 元の英語説明(参考)
Create AI-powered podcasts with text-to-speech, music, and audio editing. Tools: Kokoro TTS, DIA TTS, Chatterbox, AI music generation, media merger. Capabilities: multi-voice conversations, background music, intro/outro, full episodes. Use for: podcast production, audiobooks, voice content, audio newsletters. Triggers: podcast, ai podcast, text to speech podcast, audio content, voice over, ai audiobook, multi voice, conversation ai, notebooklm alternative, audio generation, podcast automation, ai narrator, voice content, audio newsletter, podcast maker
🇯🇵 日本人クリエイター向け解説
AI音声合成と音楽生成を組み合わせ、複数話者の会話やBGM付きのポッドキャストを自動作成するSkill。
※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。
下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。 ダウンロード → 解凍 → 配置まで全自動。
mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o ai-podcast-creation.zip https://jpskill.com/download/6168.zip && unzip -o ai-podcast-creation.zip && rm ai-podcast-creation.zip
$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/6168.zip -OutFile "$d\ai-podcast-creation.zip"; Expand-Archive "$d\ai-podcast-creation.zip" -DestinationPath $d -Force; ri "$d\ai-podcast-creation.zip"
完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。
💾 手動でダウンロードしたい(コマンドが難しい人向け)
- 1. 下の青いボタンを押して
ai-podcast-creation.zipをダウンロード - 2. ZIPファイルをダブルクリックで解凍 →
ai-podcast-creationフォルダができる - 3. そのフォルダを
C:\Users\あなたの名前\.claude\skills\(Win)または~/.claude/skills/(Mac)へ移動 - 4. Claude Code を再起動
⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。
🎯 このSkillでできること
下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。
📦 インストール方法 (3ステップ)
- 1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
- 2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
- 3. 展開してできたフォルダを、ホームフォルダの
.claude/skills/に置く- · macOS / Linux:
~/.claude/skills/ - · Windows:
%USERPROFILE%\.claude\skills\
- · macOS / Linux:
Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。
詳しい使い方ガイドを見る →- 最終更新
- 2026-05-17
- 取得日時
- 2026-05-17
- 同梱ファイル
- 1
📖 Skill本文(日本語訳)
※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。
AIポッドキャスト作成

inference.sh CLI を介して、AIを活用したポッドキャストやオーディオコンテンツを作成できます。
クイックスタート
curl -fsSL https://cli.inference.sh | sh && infsh login
# ポッドキャストセグメントを生成
infsh app run infsh/kokoro-tts --input '{
"text": "Welcome to the AI Frontiers podcast. Today we explore the latest developments in generative AI.",
"voice": "am_michael"
}'
利用可能な音声
Kokoro TTS
| Voice ID | 説明 | 最適な用途 |
|---|---|---|
af_sarah |
アメリカ人女性、温かい | ホスト、ナレーター |
af_nicole |
アメリカ人女性、プロフェッショナル | ニュース、ビジネス |
am_michael |
アメリカ人男性、権威的 | ドキュメンタリー、テクノロジー |
am_adam |
アメリカ人男性、会話的 | カジュアルなポッドキャスト |
bf_emma |
イギリス人女性、洗練された | オーディオブック |
bm_george |
イギリス人男性、クラシック | フォーマルなコンテンツ |
DIA TTS (会話型)
| Voice ID | 説明 | 最適な用途 |
|---|---|---|
dia-conversational |
自然な会話 | 対話、インタビュー |
Chatterbox
| Voice ID | 説明 | 最適な用途 |
|---|---|---|
chatterbox-default |
表現豊か | カジュアル、エンターテイメント |
ポッドキャストのワークフロー
シンプルなナレーション
# 単一音声のポッドキャストセグメント
infsh app run infsh/kokoro-tts --input '{
"text": "Your podcast script here. Make it conversational and engaging. Add natural pauses with punctuation.",
"voice": "am_michael"
}'
複数音声の会話
# ホストの紹介
infsh app run infsh/kokoro-tts --input '{
"text": "Welcome back to Tech Talk. Today I have a special guest to discuss AI developments.",
"voice": "am_michael"
}' > host_intro.json
# ゲストの返答
infsh app run infsh/kokoro-tts --input '{
"text": "Thanks for having me. I am excited to share what we have been working on.",
"voice": "af_sarah"
}' > guest_response.json
# 会話にマージ
infsh app run infsh/media-merger --input '{
"audio_files": ["<host-url>", "<guest-url>"],
"crossfade_ms": 500
}'
全エピソードのパイプライン
# 1. Claudeでスクリプトを生成
infsh app run openrouter/claude-sonnet-45 --input '{
"prompt": "Write a 5-minute podcast script about the impact of AI on creative work. Format as a two-person dialogue between HOST and GUEST. Include natural conversation, questions, and insights."
}' > script.json
# 2. イントロ音楽を生成
infsh app run infsh/ai-music --input '{
"prompt": "Podcast intro music, upbeat, modern, tech feel, 15 seconds"
}' > intro_music.json
# 3. ホストのセグメントを生成
infsh app run infsh/kokoro-tts --input '{
"text": "<host-lines>",
"voice": "am_michael"
}' > host.json
# 4. ゲストのセグメントを生成
infsh app run infsh/kokoro-tts --input '{
"text": "<guest-lines>",
"voice": "af_sarah"
}' > guest.json
# 5. アウトロ音楽を生成
infsh app run infsh/ai-music --input '{
"prompt": "Podcast outro music, matching intro style, fade out, 10 seconds"
}' > outro_music.json
# 6. 全てをマージ
infsh app run infsh/media-merger --input '{
"audio_files": [
"<intro-music>",
"<host>",
"<guest>",
"<outro-music>"
],
"crossfade_ms": 1000
}'
NotebookLMスタイルのコンテンツ
ドキュメントからポッドキャストスタイルの議論を生成します。
# 1. 主要なポイントを抽出
infsh app run openrouter/claude-sonnet-45 --input '{
"prompt": "Read this document and create a podcast script where two hosts discuss the key points in an engaging, conversational way. Include questions, insights, and natural dialogue.\n\nDocument:\n<your-document-content>"
}' > discussion_script.json
# 2. ホストAを生成
infsh app run infsh/kokoro-tts --input '{
"text": "<host-a-lines>",
"voice": "am_michael"
}' > host_a.json
# 3. ホストBを生成
infsh app run infsh/kokoro-tts --input '{
"text": "<host-b-lines>",
"voice": "af_sarah"
}' > host_b.json
# 4. 交互に配置してマージ
infsh app run infsh/media-merger --input '{
"audio_files": ["<host-a-1>", "<host-b-1>", "<host-a-2>", "<host-b-2>"],
"crossfade_ms": 300
}'
オーディオブックの章
# 長尺のナレーション
infsh app run infsh/kokoro-tts --input '{
"text": "Chapter One. It was a dark and stormy night when the first AI achieved consciousness...",
"voice": "bf_emma",
"speed": 0.9
}'
オーディオの強化
BGMを追加
# 1. ポッドキャストの音声を生成
infsh app run infsh/kokoro-tts --input '{
"text": "<podcast-script>",
"voice": "am_michael"
}' > podcast.json
# 2. 環境音楽を生成
infsh app run infsh/ai-music --input '{
"prompt": "Soft ambient background music for podcast, subtle, non-distracting, loopable"
}' > background.json
# 3. 背景音量を下げてミックス
infsh app run infsh/media-merger --input '{
"audio_files": ["<podcast-url>"],
"background_audio": "<background-url>",
"background_volume": 0.15
}'
効果音を追加
# セグメント間のトランジションサウンド
infsh app run infsh/ai-music --input '{
"prompt": "Short podcast transition sound, whoosh, 2 seconds"
}' > transition.json
スクリプト作成のヒント
Claudeへのプロンプト
infsh app run openrouter/claude-sonnet-45 --input '{
"prompt": "Write a podcast script with these requirements:
- Topic: [YOUR TOPIC]
- Duration: 5 minutes (about 750 words)
- Format: Two hosts (HOST_A and HOST_B)
- Tone: Conversational, informative, engaging
- Include: Hook intro, 3 main points, call to action
- Mark speaker changes clearly
Make it sound natural, not scripted. Add verbal fillers like \"you know\" and \"I mean\" occasionally."
}'
ポッドキャストのテンプレート
インタビュー形式
HOST: Introduction and welcome
GUEST: Thank you, happy to be here
HOST: First question about background
GUEST: Response with story
HOST: Follow-up question
GUEST: Deeper insight
... continue pattern ...
HOST: Closing question
GUEST: Fina 📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開
AI Podcast Creation

Create AI-powered podcasts and audio content via inference.sh CLI.
Quick Start
curl -fsSL https://cli.inference.sh | sh && infsh login
# Generate podcast segment
infsh app run infsh/kokoro-tts --input '{
"text": "Welcome to the AI Frontiers podcast. Today we explore the latest developments in generative AI.",
"voice": "am_michael"
}'
Available Voices
Kokoro TTS
| Voice ID | Description | Best For |
|---|---|---|
af_sarah |
American female, warm | Host, narrator |
af_nicole |
American female, professional | News, business |
am_michael |
American male, authoritative | Documentary, tech |
am_adam |
American male, conversational | Casual podcast |
bf_emma |
British female, refined | Audiobooks |
bm_george |
British male, classic | Formal content |
DIA TTS (Conversational)
| Voice ID | Description | Best For |
|---|---|---|
dia-conversational |
Natural conversation | Dialogue, interviews |
Chatterbox
| Voice ID | Description | Best For |
|---|---|---|
chatterbox-default |
Expressive | Casual, entertainment |
Podcast Workflows
Simple Narration
# Single voice podcast segment
infsh app run infsh/kokoro-tts --input '{
"text": "Your podcast script here. Make it conversational and engaging. Add natural pauses with punctuation.",
"voice": "am_michael"
}'
Multi-Voice Conversation
# Host introduction
infsh app run infsh/kokoro-tts --input '{
"text": "Welcome back to Tech Talk. Today I have a special guest to discuss AI developments.",
"voice": "am_michael"
}' > host_intro.json
# Guest response
infsh app run infsh/kokoro-tts --input '{
"text": "Thanks for having me. I am excited to share what we have been working on.",
"voice": "af_sarah"
}' > guest_response.json
# Merge into conversation
infsh app run infsh/media-merger --input '{
"audio_files": ["<host-url>", "<guest-url>"],
"crossfade_ms": 500
}'
Full Episode Pipeline
# 1. Generate script with Claude
infsh app run openrouter/claude-sonnet-45 --input '{
"prompt": "Write a 5-minute podcast script about the impact of AI on creative work. Format as a two-person dialogue between HOST and GUEST. Include natural conversation, questions, and insights."
}' > script.json
# 2. Generate intro music
infsh app run infsh/ai-music --input '{
"prompt": "Podcast intro music, upbeat, modern, tech feel, 15 seconds"
}' > intro_music.json
# 3. Generate host segments
infsh app run infsh/kokoro-tts --input '{
"text": "<host-lines>",
"voice": "am_michael"
}' > host.json
# 4. Generate guest segments
infsh app run infsh/kokoro-tts --input '{
"text": "<guest-lines>",
"voice": "af_sarah"
}' > guest.json
# 5. Generate outro music
infsh app run infsh/ai-music --input '{
"prompt": "Podcast outro music, matching intro style, fade out, 10 seconds"
}' > outro_music.json
# 6. Merge everything
infsh app run infsh/media-merger --input '{
"audio_files": [
"<intro-music>",
"<host>",
"<guest>",
"<outro-music>"
],
"crossfade_ms": 1000
}'
NotebookLM-Style Content
Generate podcast-style discussions from documents.
# 1. Extract key points
infsh app run openrouter/claude-sonnet-45 --input '{
"prompt": "Read this document and create a podcast script where two hosts discuss the key points in an engaging, conversational way. Include questions, insights, and natural dialogue.\n\nDocument:\n<your-document-content>"
}' > discussion_script.json
# 2. Generate Host A
infsh app run infsh/kokoro-tts --input '{
"text": "<host-a-lines>",
"voice": "am_michael"
}' > host_a.json
# 3. Generate Host B
infsh app run infsh/kokoro-tts --input '{
"text": "<host-b-lines>",
"voice": "af_sarah"
}' > host_b.json
# 4. Interleave and merge
infsh app run infsh/media-merger --input '{
"audio_files": ["<host-a-1>", "<host-b-1>", "<host-a-2>", "<host-b-2>"],
"crossfade_ms": 300
}'
Audiobook Chapter
# Long-form narration
infsh app run infsh/kokoro-tts --input '{
"text": "Chapter One. It was a dark and stormy night when the first AI achieved consciousness...",
"voice": "bf_emma",
"speed": 0.9
}'
Audio Enhancement
Add Background Music
# 1. Generate podcast audio
infsh app run infsh/kokoro-tts --input '{
"text": "<podcast-script>",
"voice": "am_michael"
}' > podcast.json
# 2. Generate ambient music
infsh app run infsh/ai-music --input '{
"prompt": "Soft ambient background music for podcast, subtle, non-distracting, loopable"
}' > background.json
# 3. Mix with lower background volume
infsh app run infsh/media-merger --input '{
"audio_files": ["<podcast-url>"],
"background_audio": "<background-url>",
"background_volume": 0.15
}'
Add Sound Effects
# Transition sounds between segments
infsh app run infsh/ai-music --input '{
"prompt": "Short podcast transition sound, whoosh, 2 seconds"
}' > transition.json
Script Writing Tips
Prompt for Claude
infsh app run openrouter/claude-sonnet-45 --input '{
"prompt": "Write a podcast script with these requirements:
- Topic: [YOUR TOPIC]
- Duration: 5 minutes (about 750 words)
- Format: Two hosts (HOST_A and HOST_B)
- Tone: Conversational, informative, engaging
- Include: Hook intro, 3 main points, call to action
- Mark speaker changes clearly
Make it sound natural, not scripted. Add verbal fillers like \"you know\" and \"I mean\" occasionally."
}'
Podcast Templates
Interview Format
HOST: Introduction and welcome
GUEST: Thank you, happy to be here
HOST: First question about background
GUEST: Response with story
HOST: Follow-up question
GUEST: Deeper insight
... continue pattern ...
HOST: Closing question
GUEST: Final thoughts
HOST: Thank you and outro
Solo Episode
Introduction with hook
Topic overview
Point 1 with examples
Point 2 with examples
Point 3 with examples
Summary and takeaways
Call to action
Outro
News Roundup
Intro music
Welcome and date
Story 1: headline + details
Story 2: headline + details
Story 3: headline + details
Analysis/opinion segment
Outro
Best Practices
- Natural punctuation - Use commas and periods for pacing
- Short sentences - Easier to speak and listen
- Varied voices - Different speakers prevent monotony
- Background music - Subtle, at 10-15% volume
- Crossfades - Smooth transitions between segments
- Edit scripts - Remove filler before generating
Related Skills
# Text-to-speech models
npx skills add inferencesh/skills@text-to-speech
# AI music generation
npx skills add inferencesh/skills@ai-music-generation
# LLM for scripts
npx skills add inferencesh/skills@llm-models
# Content pipelines
npx skills add inferencesh/skills@ai-content-pipeline
# Full platform skill
npx skills add inferencesh/skills@inference-sh
Browse all apps: infsh app list --category audio