🎨 画像AI コミュニティ

ai-content-pipeline

画像や動画、音声、テキストを組み合わせ、多段階のAIコンテンツ制作ワークフローを構築するSkill。

📜 元の英語説明(参考)

Build multi-step AI content creation pipelines combining image, video, audio, and text. Workflow examples: generate image -> animate -> add voiceover -> merge with music. Tools: FLUX, Veo, Kokoro TTS, OmniHuman, media merger, upscaling. Use for: YouTube videos, social media content, marketing materials, automated content. Triggers: content pipeline, ai workflow, content creation, multi-step ai, content automation, ai video workflow, generate and edit, ai content factory, automated content creation, ai production pipeline, media pipeline, content at scale

🇯🇵 日本人クリエイター向け解説

一言でいうと

画像や動画、音声、テキストを組み合わせ、多段階のAIコンテンツ制作ワークフローを構築するSkill。

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⬇ このSkillをダウンロード(.skill) 元のソースを見る ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-17
取得日時: 2026-05-17
同梱ファイル: 1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

AIコンテンツパイプライン

AI Content Pipeline

inference.sh CLI を介して、複数ステップのコンテンツ作成パイプラインを構築します。

クイックスタート

curl -fsSL https://cli.inference.sh | sh && infsh login

# シンプルなパイプライン: 画像を生成 -> 動画にアニメーション化
infsh app run falai/flux-dev --input '{"prompt": "portrait of a woman smiling"}' > image.json
infsh app run falai/wan-2-5 --input '{"image_url": "<url-from-previous>"}'

パイプラインパターン

パターン1: 画像 -> 動画 -> 音声

[FLUX Image] -> [Wan 2.5 Video] -> [Foley Sound]

パターン2: スクリプト -> 音声 -> アバター

[LLM Script] -> [Kokoro TTS] -> [OmniHuman Avatar]

パターン3: リサーチ -> コンテンツ -> 配信

[Tavily Search] -> [Claude Summary] -> [FLUX Visual] -> [Twitter Post]

完全なワークフロー

YouTubeショートパイプライン

トピックから完全なショートフォーム動画を作成します。

# 1. Claudeでスクリプトを生成
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a 30-second script about the future of AI. Make it engaging and conversational. Just the script, no stage directions."
}' > script.json

# 2. Kokoroでナレーションを生成
infsh app run infsh/kokoro-tts --input '{
  "text": "<script-text>",
  "voice": "af_sarah"
}' > voice.json

# 3. FLUXで背景画像を生成
infsh app run falai/flux-dev --input '{
  "prompt": "Futuristic city skyline at sunset, cyberpunk aesthetic, 4K wallpaper"
}' > background.json

# 4. Wanで画像を動画にアニメーション化
infsh app run falai/wan-2-5 --input '{
  "image_url": "<background-url>",
  "prompt": "slow camera pan across cityscape, subtle movement"
}' > video.json

# 5. キャプションを追加 (手動または別のツールで)

# 6. 動画と音声を結合
infsh app run infsh/media-merger --input '{
  "video_url": "<video-url>",
  "audio_url": "<voice-url>"
}'

トーキングヘッド動画パイプライン

コンテンツをプレゼンテーションするAIアバターを作成します。

# 1. スクリプトを作成
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a 1-minute explainer script about quantum computing for beginners."
}' > script.json

# 2. 音声を生成
infsh app run infsh/kokoro-tts --input '{
  "text": "<script>",
  "voice": "am_michael"
}' > speech.json

# 3. ポートレート画像を生成または使用
infsh app run falai/flux-dev --input '{
  "prompt": "Professional headshot of a friendly tech presenter, neutral background, looking at camera"
}' > portrait.json

# 4. トーキングヘッド動画を作成
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "<portrait-url>",
  "audio_url": "<speech-url>"
}' > talking_head.json

製品デモパイプライン

製品紹介動画を作成します。

# 1. 製品画像を生成
infsh app run falai/flux-dev --input '{
  "prompt": "Sleek wireless earbuds on white surface, studio lighting, product photography"
}' > product.json

# 2. 製品の登場をアニメーション化
infsh app run falai/wan-2-5 --input '{
  "image_url": "<product-url>",
  "prompt": "slow 360 rotation, smooth motion"
}' > product_video.json

# 3. 動画品質をアップスケール
infsh app run falai/topaz-video-upscaler --input '{
  "video_url": "<product-video-url>"
}' > upscaled.json

# 4. 背景音楽を追加
infsh app run infsh/media-merger --input '{
  "video_url": "<upscaled-url>",
  "audio_url": "https://your-music.mp3",
  "audio_volume": 0.3
}'

ブログから動画へのパイプライン

書かれたコンテンツを動画形式に変換します。

# 1. ブログ記事を要約
infsh app run openrouter/claude-haiku-45 --input '{
  "prompt": "Summarize this blog post into 5 key points for a video script: <blog-content>"
}' > summary.json

# 2. 各ポイントの画像を生成
for i in 1 2 3 4 5; do
  infsh app run falai/flux-dev --input "{
    \"prompt\": \"Visual representing point $i: <point-text>\"
  }" > "image_$i.json"
done

# 3. 各画像をアニメーション化
for i in 1 2 3 4 5; do
  infsh app run falai/wan-2-5 --input "{
    \"image_url\": \"<image-$i-url>\"
  }" > "video_$i.json"
done

# 4. ナレーションを生成
infsh app run infsh/kokoro-tts --input '{
  "text": "<full-script>",
  "voice": "bf_emma"
}' > narration.json

# 5. すべてのクリップを結合
infsh app run infsh/media-merger --input '{
  "videos": ["<video1>", "<video2>", "<video3>", "<video4>", "<video5>"],
  "audio_url": "<narration-url>",
  "transition": "crossfade"
}'

パイプラインの構成要素

コンテンツ生成

ステップ	アプリ	目的
スクリプト	`openrouter/claude-sonnet-45`	コンテンツの作成
リサーチ	`tavily/search-assistant`	情報収集
要約	`openrouter/claude-haiku-45`	コンテンツの要約

ビジュアルアセット

ステップ	アプリ	目的
画像	`falai/flux-dev`	画像の生成
画像	`google/imagen-3`	代替画像生成
アップスケール	`falai/topaz-image-upscaler`	品質向上

アニメーション

ステップ	アプリ	目的
I2V	`falai/wan-2-5`	画像のアニメーション化
T2V	`google/veo-3-1-fast`	テキストからの生成
アバター	`bytedance/omnihuman-1-5`	トーキングヘッド

オーディオ

ステップ	アプリ	目的
TTS	`infsh/kokoro-tts`	音声ナレーション
音楽	`infsh/ai-music`	背景音楽
フォーリー	`infsh/hunyuanvideo-foley`	効果音

ポストプロダクション

ステップ	アプリ	目的
アップスケール	`falai/topaz-video-upscaler`	動画の品質向上
マージ	`infsh/media-merger`	メディアの結合
キャプション	`infsh/caption-video`	字幕の追加

ベストプラクティス

まずパイプラインを計画する - 実行する前に各ステップをマッピングします
中間結果を保存する - 反復のために出力を保存します
適切な品質を使用する - ドラフトには高速モデル、最終版には高品質モデルを使用します
解像度を合わせる - 全体を通してアスペクト比を一貫させます
各ステップをテストする - 進む前に出力を確認します

AI Content Pipeline

Build multi-step content creation pipelines via inference.sh CLI.

Quick Start

curl -fsSL https://cli.inference.sh | sh && infsh login

# Simple pipeline: Generate image -> Animate to video
infsh app run falai/flux-dev --input '{"prompt": "portrait of a woman smiling"}' > image.json
infsh app run falai/wan-2-5 --input '{"image_url": "<url-from-previous>"}'

Pipeline Patterns

Pattern 1: Image -> Video -> Audio

[FLUX Image] -> [Wan 2.5 Video] -> [Foley Sound]

Pattern 2: Script -> Speech -> Avatar

[LLM Script] -> [Kokoro TTS] -> [OmniHuman Avatar]

Pattern 3: Research -> Content -> Distribution

[Tavily Search] -> [Claude Summary] -> [FLUX Visual] -> [Twitter Post]

Complete Workflows

YouTube Short Pipeline

Create a complete short-form video from a topic.

# 1. Generate script with Claude
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a 30-second script about the future of AI. Make it engaging and conversational. Just the script, no stage directions."
}' > script.json

# 2. Generate voiceover with Kokoro
infsh app run infsh/kokoro-tts --input '{
  "text": "<script-text>",
  "voice": "af_sarah"
}' > voice.json

# 3. Generate background image with FLUX
infsh app run falai/flux-dev --input '{
  "prompt": "Futuristic city skyline at sunset, cyberpunk aesthetic, 4K wallpaper"
}' > background.json

# 4. Animate image to video with Wan
infsh app run falai/wan-2-5 --input '{
  "image_url": "<background-url>",
  "prompt": "slow camera pan across cityscape, subtle movement"
}' > video.json

# 5. Add captions (manually or with another tool)

# 6. Merge video with audio
infsh app run infsh/media-merger --input '{
  "video_url": "<video-url>",
  "audio_url": "<voice-url>"
}'

Talking Head Video Pipeline

Create an AI avatar presenting content.

# 1. Write the script
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a 1-minute explainer script about quantum computing for beginners."
}' > script.json

# 2. Generate speech
infsh app run infsh/kokoro-tts --input '{
  "text": "<script>",
  "voice": "am_michael"
}' > speech.json

# 3. Generate or use a portrait image
infsh app run falai/flux-dev --input '{
  "prompt": "Professional headshot of a friendly tech presenter, neutral background, looking at camera"
}' > portrait.json

# 4. Create talking head video
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "<portrait-url>",
  "audio_url": "<speech-url>"
}' > talking_head.json

Product Demo Pipeline

Create a product showcase video.

# 1. Generate product image
infsh app run falai/flux-dev --input '{
  "prompt": "Sleek wireless earbuds on white surface, studio lighting, product photography"
}' > product.json

# 2. Animate product reveal
infsh app run falai/wan-2-5 --input '{
  "image_url": "<product-url>",
  "prompt": "slow 360 rotation, smooth motion"
}' > product_video.json

# 3. Upscale video quality
infsh app run falai/topaz-video-upscaler --input '{
  "video_url": "<product-video-url>"
}' > upscaled.json

# 4. Add background music
infsh app run infsh/media-merger --input '{
  "video_url": "<upscaled-url>",
  "audio_url": "https://your-music.mp3",
  "audio_volume": 0.3
}'

Blog to Video Pipeline

Convert written content to video format.

# 1. Summarize blog post
infsh app run openrouter/claude-haiku-45 --input '{
  "prompt": "Summarize this blog post into 5 key points for a video script: <blog-content>"
}' > summary.json

# 2. Generate images for each point
for i in 1 2 3 4 5; do
  infsh app run falai/flux-dev --input "{
    \"prompt\": \"Visual representing point $i: <point-text>\"
  }" > "image_$i.json"
done

# 3. Animate each image
for i in 1 2 3 4 5; do
  infsh app run falai/wan-2-5 --input "{
    \"image_url\": \"<image-$i-url>\"
  }" > "video_$i.json"
done

# 4. Generate voiceover
infsh app run infsh/kokoro-tts --input '{
  "text": "<full-script>",
  "voice": "bf_emma"
}' > narration.json

# 5. Merge all clips
infsh app run infsh/media-merger --input '{
  "videos": ["<video1>", "<video2>", "<video3>", "<video4>", "<video5>"],
  "audio_url": "<narration-url>",
  "transition": "crossfade"
}'

Pipeline Building Blocks

Content Generation

Step	App	Purpose
Script	`openrouter/claude-sonnet-45`	Write content
Research	`tavily/search-assistant`	Gather information
Summary	`openrouter/claude-haiku-45`	Condense content

Visual Assets

Step	App	Purpose
Image	`falai/flux-dev`	Generate images
Image	`google/imagen-3`	Alternative image gen
Upscale	`falai/topaz-image-upscaler`	Enhance quality

Animation

Step	App	Purpose
I2V	`falai/wan-2-5`	Animate images
T2V	`google/veo-3-1-fast`	Generate from text
Avatar	`bytedance/omnihuman-1-5`	Talking heads

Audio

Step	App	Purpose
TTS	`infsh/kokoro-tts`	Voice narration
Music	`infsh/ai-music`	Background music
Foley	`infsh/hunyuanvideo-foley`	Sound effects

Post-Production

Step	App	Purpose
Upscale	`falai/topaz-video-upscaler`	Enhance video
Merge	`infsh/media-merger`	Combine media
Caption	`infsh/caption-video`	Add subtitles

Best Practices

Plan the pipeline first - Map out each step before running
Save intermediate results - Store outputs for iteration
Use appropriate quality - Fast models for drafts, quality for finals
Match resolutions - Keep consistent aspect ratios throughout
Test each step - Verify outputs before proceeding

Related Skills

# Video generation models
npx skills add inferencesh/skills@ai-video-generation

# Image generation
npx skills add inferencesh/skills@ai-image-generation

# Text-to-speech
npx skills add inferencesh/skills@text-to-speech

# LLM models for scripts
npx skills add inferencesh/skills@llm-models

# Full platform skill
npx skills add inferencesh/skills@inference-sh

Browse all apps: infsh app list

Documentation

Content Pipeline Example - Official pipeline guide
Building Workflows - Workflow best practices

ai-content-pipeline

🇯🇵 日本人クリエイター向け解説

🎯 このSkillでできること

📦 インストール方法 (3ステップ)

📖 Skill本文(日本語訳)

AIコンテンツパイプライン

クイックスタート

パイプラインパターン

パターン1: 画像 -> 動画 -> 音声

パターン2: スクリプト -> 音声 -> アバター

パターン3: リサーチ -> コンテンツ -> 配信

完全なワークフロー

YouTubeショートパイプライン

トーキングヘッド動画パイプライン

製品デモパイプライン

ブログから動画へのパイプライン

パイプラインの構成要素

コンテンツ生成

ビジュアルアセット

アニメーション

オーディオ

ポストプロダクション

ベストプラクティス

関連スキル

AI Content Pipeline

Quick Start

Pipeline Patterns

Pattern 1: Image -> Video -> Audio

Pattern 2: Script -> Speech -> Avatar

Pattern 3: Research -> Content -> Distribution

Complete Workflows

YouTube Short Pipeline

Talking Head Video Pipeline

Product Demo Pipeline

Blog to Video Pipeline

Pipeline Building Blocks

Content Generation

Visual Assets

Animation

Audio

Post-Production

Best Practices

Related Skills

Documentation