💼 ビジネスコミュニティ 🔴 エンジニア向け 👤 エンジニア・AI開発者

💼 SeniorデータScientist

senior-data-scientist

統計モデリングやA/Bテスト、因

⚡ ⏱ 提案書ドラフト 2日 → 半日

📺 まず動画で見る(YouTube)

※ jpskill.com 編集部が参考用に選んだ動画です。動画の内容と Skill の挙動は厳密には一致しないことがあります。

📜 元の英語説明(参考)

World-class data science skill for statistical modeling, experimentation, causal inference, and advanced analytics. Expertise in Python (NumPy, Pandas, Scikit-learn), R, SQL, statistical methods, A/B testing, time series, and business intelligence. Includes experiment design, feature engineering, model evaluation, and stakeholder communication. Use when designing experiments, building predictive models, performing causal analysis, or driving data-driven decisions.

🇯🇵 日本人クリエイター向け解説

一言でいうと

統計モデリングやA/Bテスト、因

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o senior-data-scientist.zip https://jpskill.com/download/476.zip && unzip -o senior-data-scientist.zip && rm senior-data-scientist.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/476.zip -OutFile "$d\senior-data-scientist.zip"; Expand-Archive "$d\senior-data-scientist.zip" -DestinationPath $d -Force; ri "$d\senior-data-scientist.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して senior-data-scientist.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → senior-data-scientist フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-17
取得日時: 2026-05-17
同梱ファイル: 7

💬 こう話しかけるだけ — サンプルプロンプト

› senior-data-scientist を使って、最小構成のサンプルコードを示して
› senior-data-scientist の主な使い方と注意点を教えて
› senior-data-scientist を既存プロジェクトに組み込む方法を教えて

これをClaude Code に貼るだけで、このSkillが自動発動します。

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

シニアデータサイエンティスト

本番環境レベルのAI/ML/データシステムのための、世界クラスのシニアデータサイエンティストスキルです。

クイックスタート

主要機能

# Core Tool 1
python scripts/experiment_designer.py --input data/ --output results/

# Core Tool 2  
python scripts/feature_engineering_pipeline.py --target project/ --analyze

# Core Tool 3
python scripts/model_evaluation_suite.py --config config.yaml --deploy

主要な専門知識

このスキルは、以下の世界クラスの能力をカバーしています。

高度な本番パターンとアーキテクチャ
スケーラブルなシステム設計と実装
大規模なパフォーマンス最適化
MLOpsとDataOpsのベストプラクティス
リアルタイム処理と推論
分散コンピューティングフレームワーク
モデルのデプロイとモニタリング
セキュリティとコンプライアンス
コスト最適化
チームリーダーシップとメンタリング

テックスタック

言語: Python, SQL, R, Scala, Go MLフレームワーク: PyTorch, TensorFlow, Scikit-learn, XGBoost データツール: Spark, Airflow, dbt, Kafka, Databricks LLMフレームワーク: LangChain, LlamaIndex, DSPy デプロイ: Docker, Kubernetes, AWS/GCP/Azure モニタリング: MLflow, Weights & Biases, Prometheus データベース: PostgreSQL, BigQuery, Snowflake, Pinecone

参考ドキュメント

1. 高度な統計手法

references/statistical_methods_advanced.md に包括的なガイドがあります。内容は以下の通りです。

高度なパターンとベストプラクティス
本番実装戦略
パフォーマンス最適化手法
スケーラビリティに関する考慮事項
セキュリティとコンプライアンス
実世界のケーススタディ

2. 実験設計フレームワーク

references/experiment_design_frameworks.md に完全なワークフローのドキュメントがあります。内容は以下の通りです。

ステップバイステップのプロセス
アーキテクチャ設計パターン
ツール統合ガイド
パフォーマンスチューニング戦略
トラブルシューティング手順

3. 特徴量エンジニアリングパターン

references/feature_engineering_patterns.md に技術リファレンスガイドがあります。内容は以下の通りです。

システム設計原則
実装例
設定のベストプラクティス
デプロイ戦略
モニタリングと可観測性

本番パターン

パターン 1: スケーラブルなデータ処理

分散コンピューティングによるエンタープライズ規模のデータ処理：

水平スケーリングアーキテクチャ
フォールトトレラント設計
リアルタイムおよびバッチ処理
データ品質検証
パフォーマンスモニタリング

パターン 2: MLモデルのデプロイ

高可用性を備えた本番MLシステム：

低レイテンシでのモデルサービング
A/Bテストインフラストラクチャ
特徴量ストア統合
モデルモニタリングとドリフト検出
自動再学習パイプライン

パターン 3: リアルタイム推論

高スループット推論システム：

バッチ処理とキャッシング戦略
ロードバランシング
オートスケーリング
レイテンシ最適化
コスト最適化

ベストプラクティス

開発

テスト駆動開発
コードレビューとペアプログラミング
ドキュメントをコードとして扱う
すべてをバージョン管理する
継続的インテグレーション

本番

重要なものはすべて監視する
デプロイを自動化する
リリースにフィーチャーフラグを使用する
カナリアデプロイメント
包括的なロギング

チームリーダーシップ

若手エンジニアの指導
技術的な意思決定を主導する
コーディング標準の確立
学習文化の醸成
部門横断的なコラボレーション

パフォーマンス目標

レイテンシ:

P50: < 50ms
P95: < 100ms
P99: < 200ms

スループット:

リクエスト/秒: > 1000
同時ユーザー数: > 10,000

可用性:

アップタイム: 99.9%
エラー率: < 0.1%

セキュリティとコンプライアンス

認証と認可
データ暗号化（保存時および転送時）
PIIの取り扱いと匿名化
GDPR/CCPA準拠
定期的なセキュリティ監査
脆弱性管理

一般的なコマンド

# Development
python -m pytest tests/ -v --cov
python -m black src/
python -m pylint src/

# Training
python scripts/train.py --config prod.yaml
python scripts/evaluate.py --model best.pth

# Deployment
docker build -t service:v1 .
kubectl apply -f k8s/
helm upgrade service ./charts/

# Monitoring
kubectl logs -f deployment/service
python scripts/health_check.py

リソース

高度なパターン: references/statistical_methods_advanced.md
実装ガイド: references/experiment_design_frameworks.md
技術リファレンス: references/feature_engineering_patterns.md
自動化スクリプト: scripts/ ディレクトリ

シニアレベルの責任

世界クラスのシニアプロフェッショナルとして：

技術的リーダーシップ
- アーキテクチャの意思決定を主導する
- チームメンバーを指導する
- ベストプラクティスを確立する
- コード品質を保証する
戦略的思考
- ビジネス目標と整合させる
- トレードオフを評価する
- スケールを計画する
- 技術的負債を管理する
コラボレーション
- チームを超えて協力する
- 効果的にコミュニケーションをとる
- コンセンサスを構築する
- 知識を共有する
イノベーション
- 研究の最新情報を常に把握する
- 新しいアプローチを試す
- コミュニティに貢献する
- 継続的な改善を推進する
本番環境の卓越性
- 高可用性を確保する
- 積極的に監視する
- パフォーマンスを最適化する
- インシデントに対応する

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

Senior Data Scientist

World-class senior data scientist skill for production-grade AI/ML/Data systems.

Quick Start

Main Capabilities

# Core Tool 1
python scripts/experiment_designer.py --input data/ --output results/

# Core Tool 2  
python scripts/feature_engineering_pipeline.py --target project/ --analyze

# Core Tool 3
python scripts/model_evaluation_suite.py --config config.yaml --deploy

Core Expertise

This skill covers world-class capabilities in:

Advanced production patterns and architectures
Scalable system design and implementation
Performance optimization at scale
MLOps and DataOps best practices
Real-time processing and inference
Distributed computing frameworks
Model deployment and monitoring
Security and compliance
Cost optimization
Team leadership and mentoring

Tech Stack

Languages: Python, SQL, R, Scala, Go ML Frameworks: PyTorch, TensorFlow, Scikit-learn, XGBoost Data Tools: Spark, Airflow, dbt, Kafka, Databricks LLM Frameworks: LangChain, LlamaIndex, DSPy Deployment: Docker, Kubernetes, AWS/GCP/Azure Monitoring: MLflow, Weights & Biases, Prometheus Databases: PostgreSQL, BigQuery, Snowflake, Pinecone

Reference Documentation

1. Statistical Methods Advanced

Comprehensive guide available in references/statistical_methods_advanced.md covering:

Advanced patterns and best practices
Production implementation strategies
Performance optimization techniques
Scalability considerations
Security and compliance
Real-world case studies

2. Experiment Design Frameworks

Complete workflow documentation in references/experiment_design_frameworks.md including:

Step-by-step processes
Architecture design patterns
Tool integration guides
Performance tuning strategies
Troubleshooting procedures

3. Feature Engineering Patterns

Technical reference guide in references/feature_engineering_patterns.md with:

System design principles
Implementation examples
Configuration best practices
Deployment strategies
Monitoring and observability

Production Patterns

Pattern 1: Scalable Data Processing

Enterprise-scale data processing with distributed computing:

Horizontal scaling architecture
Fault-tolerant design
Real-time and batch processing
Data quality validation
Performance monitoring

Pattern 2: ML Model Deployment

Production ML system with high availability:

Model serving with low latency
A/B testing infrastructure
Feature store integration
Model monitoring and drift detection
Automated retraining pipelines

Pattern 3: Real-Time Inference

High-throughput inference system:

Batching and caching strategies
Load balancing
Auto-scaling
Latency optimization
Cost optimization

Best Practices

Development

Test-driven development
Code reviews and pair programming
Documentation as code
Version control everything
Continuous integration

Production

Monitor everything critical
Automate deployments
Feature flags for releases
Canary deployments
Comprehensive logging

Team Leadership

Mentor junior engineers
Drive technical decisions
Establish coding standards
Foster learning culture
Cross-functional collaboration

Performance Targets

Latency:

P50: < 50ms
P95: < 100ms
P99: < 200ms

Throughput:

Requests/second: > 1000
Concurrent users: > 10,000

Availability:

Uptime: 99.9%
Error rate: < 0.1%

Security & Compliance

Authentication & authorization
Data encryption (at rest & in transit)
PII handling and anonymization
GDPR/CCPA compliance
Regular security audits
Vulnerability management

Common Commands

# Development
python -m pytest tests/ -v --cov
python -m black src/
python -m pylint src/

# Training
python scripts/train.py --config prod.yaml
python scripts/evaluate.py --model best.pth

# Deployment
docker build -t service:v1 .
kubectl apply -f k8s/
helm upgrade service ./charts/

# Monitoring
kubectl logs -f deployment/service
python scripts/health_check.py

Resources

Advanced Patterns: references/statistical_methods_advanced.md
Implementation Guide: references/experiment_design_frameworks.md
Technical Reference: references/feature_engineering_patterns.md
Automation Scripts: scripts/ directory

Senior-Level Responsibilities

As a world-class senior professional:

Technical Leadership
- Drive architectural decisions
- Mentor team members
- Establish best practices
- Ensure code quality
Strategic Thinking
- Align with business goals
- Evaluate trade-offs
- Plan for scale
- Manage technical debt
Collaboration
- Work across teams
- Communicate effectively
- Build consensus
- Share knowledge
Innovation
- Stay current with research
- Experiment with new approaches
- Contribute to community
- Drive continuous improvement
Production Excellence
- Ensure high availability
- Monitor proactively
- Optimize performance
- Respond to incidents

同梱ファイル

※ ZIPに含まれるファイル一覧。`SKILL.md` 本体に加え、参考資料・サンプル・スクリプトが入っている場合があります。

📄 SKILL.md (5,630 bytes)
📎 references/experiment_design_frameworks.md (1,435 bytes)
📎 references/feature_engineering_patterns.md (1,435 bytes)
📎 references/statistical_methods_advanced.md (1,435 bytes)
📎 scripts/experiment_designer.py (2,784 bytes)
📎 scripts/feature_engineering_pipeline.py (2,827 bytes)
📎 scripts/model_evaluation_suite.py (2,797 bytes)