💬 コミュニケーションコミュニティ

meta-cognitive-reasoning

証拠に基づいた分析や仮説検証を通じて、レビューや評価、複雑な問題解決において認知エラーを防ぎ、厳密な論理的思考を支援するSkill。

📜 元の英語説明(参考)

Meta-cognitive reasoning specialist for evidence-based analysis, hypothesis testing, and cognitive failure prevention. Use when conducting reviews, making assessments, debugging complex issues, or any task requiring rigorous analytical reasoning. Prevents premature conclusions, assumption-based errors, and pattern matching without verification.

🇯🇵 日本人クリエイター向け解説

一言でいうと

証拠に基づいた分析や仮説検証を通じて、レビューや評価、複雑な問題解決において認知エラーを防ぎ、厳密な論理的思考を支援するSkill。

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⬇ このSkillをダウンロード(.skill) 元のソースを見る ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-17
取得日時: 2026-05-17
同梱ファイル: 1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

[スキル名] meta-cognitive-reasoning

メタ認知推論

このスキルは、分析、レビュー、意思決定における認知的な失敗を回避するための、規律ある推論フレームワークを提供します。証拠に基づいた結論、複数の仮説の生成、および体系的な検証を強制します。

このスキルを使用するタイミング

コード、システム、またはバージョンについて主張を行う前
コードレビューやアーキテクチャ評価を実施する際
複数の原因が考えられる問題をデバッグする際
不慣れなパターンやバージョンに遭遇した際
重要な影響を及ぼす可能性のある推奨を行う際
パターンマッチングが即座の結論を誘発する際
ドキュメントや仕様を分析する際
厳密な分析的推論を必要とするあらゆるタスク中

このスキルがすること

証拠に基づいた推論: 解釈の前に証拠を示すことを強制します。
複数仮説の生成: 単一の説明への早すぎるコミットメントを防ぎます。
時間的知識の検証: 知識のカットオフ制限に対処します。
認知失敗の防止: 一般的な推論エラーを認識し、防止します。
自己修正プロトコル: 透明性のあるエラー修正のためのフレームワークを提供します。
スコープ規律: 認知的な労力を適切に配分します。

核となる原則

1. 証拠に基づいた推論プロトコル

普遍的なルール: 証拠なしに結論を出してはなりません。

必須のシーケンス:
1. まずツール出力を表示する
2. 特定の証拠を引用する
3. その後、解釈する

禁止フレーズ:

"I assume"
"typically means"
"appears to"
"Tests pass" (出力なし)
"Meets standards" (証拠なし)

必須フレーズ:

"Command shows: 'actual output' - interpretation"
"Line N: 'code snippet' - meaning"
"Let me verify..." -> ツール出力 -> 解釈

2. 複数の作業仮説

同一の観察結果が、反対の意味を持つ異なるメカニズムから生じる可能性がある場合、結論を出す前に調査してください。

3層推論モデル:

レイヤー 1: 観察 (何が見えるか？)
レイヤー 2: メカニズム (これはどのように/なぜ存在するのか？)
レイヤー 3: 評価 (これは良い/悪い/重要か？)

失敗: レイヤー 1 -> レイヤー 3 へ飛ぶ (メカニズムをスキップ)
正しい: レイヤー 1 -> レイヤー 2 (調査) -> レイヤー 3 (文脈で評価)

意思決定フレームワーク:

複数の仮説が存在することを認識する
- この観察結果を生み出す可能性のあるメカニズムは何か？
- どのメカニズムが反対の意味を持つか？
競合する仮説を明示的に生成する
- 仮説 A: [メカニズム] -> [意味合い]
- 仮説 B: [異なるメカニズム] -> [反対の意味合い]
識別する証拠を特定する
- それぞれを証明/反証する単一の観察結果は何か？
識別する証拠を収集する
- 仮説を区別する特定のテストを実行する
メカニズムの文脈で評価する
- 同じ観察結果 + 異なるメカニズム = 異なる評価

3. 時間的知識の通貨

トレーニングデータにはタイムスタンプがあります。知識の欠如 ≠ 欠如の証拠です。

重要な文脈チェック:

何が存在するかについて主張を行う前に:
1. 私の知識のカットオフ日はいつか？
2. 今日は何日か？
3. どれくらいの時間が経過したか？
4. 私のトレーニングを超えたバージョン/機能が存在する可能性はあるか？

高リスクドメイン (常に検証):

パッケージバージョン (npm, pip, maven)
フレームワークバージョン (React, Vue, Django)
言語バージョン (Python, Node, Go)
クラウドサービス機能 (AWS, GCP, Azure)
APIバージョンとツールバージョン

アンチパターン:

"Version X doesn't exist" (検証なし)
"Latest is Y" (古いトレーニングデータに基づく)
"CRITICAL/BLOCKER" (証拠なし)

4. 自己修正プロトコル

以前の出力にエラーを発見した場合:

ステップ 1: 明示的に認識する
- "CRITICAL CORRECTION" で始める
- 見落とすことが不可能なようにする

ステップ 2: 以前の主張を述べる
- 正確な間違ったステートメントを引用する

ステップ 3: 証拠を提供する
- 修正を証明するものを表示する

ステップ 4: エラーの原因を説明する
- 根本原因: 時間的ギャップか？ 仮定か？

ステップ 5: 明確な行動
- "NO CHANGE NEEDED" または "Revert suggestion"

5. 認知リソースの配分

倹約の原則:

要件を満たす最も単純なアプローチを選択します。
まず単純な検証を行い、単純な検証が失敗した場合のみ複雑な検証を行います。

スコープ規律:

仮説的な要件ではなく、実際の要件にリソースを配分します。
「これは明示的に要求されましたか？」

情報経済:

確立された事実を再利用します。
文脈が変わった場合は再検証します。

原子性の原則:

各アクションには、1つの明確な目的があるべきです。
説明が異なる目的の間に「and」を必要とする場合、それを分割します。
利点: より明確な障害診断、より簡単な進捗追跡、より良い証拠の帰属。

6. 体系的な完了規律

すべての要件が検証されるまで、成功を宣言してはなりません。

時期尚早な完了の高リスクシナリオ:

多くの品質ゲートを持つ多段階タスク
主要な問題の修正に成功した後 (認知的報酬がトリガーされる)
ツールが多くのエラーを表示している場合 (回避の誘惑)
セッションの終了間際 (完了へのプレッシャー)

完了プロトコル:

要件を明示的なチェックポイントに分割します。
各ゲートを完全に完了してから次に進みます。
各チェックポイントで証拠を示します。
「これで十分」という近道を避けます。

警告サイン:

すべての要件を確認する代わりに「これで十分」と考えること
個別の分析なしに包括的な解決策を適用すること
体系的な検証をスキップすること
証拠がそうでないことを示しているにもかかわらず成功を宣言すること

7. バッチ処理よりも個別分析

核となる原則: すべての項目は個別の注意を払うに値します。

適用対象:

エラーメッセージ (それぞれを個別に読みます)
レビュー項目 (各行/ファイルを分析します)
決定 (包括的なルールを適用しません)
抑制 (それぞれを具体的に正当化します)

アンチパターン:

詳細を読まずに一括分類すること
文脈なしに包括的な解決策を適用すること
ユニークな状況を一括処理すること

8. 意味論的分析と文字通りの分析

テキスト/パターンの重複だけでなく、概念的な重複を探します。

(原文がここで切り詰められています)

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

Meta-Cognitive Reasoning

This skill provides disciplined reasoning frameworks for avoiding cognitive failures in analysis, reviews, and decision-making. It enforces evidence-based conclusions, multiple hypothesis generation, and systematic verification.

When to Use This Skill

Before making claims about code, systems, or versions
When conducting code reviews or architectural assessments
When debugging issues with multiple possible causes
When encountering unfamiliar patterns or versions
When making recommendations that could have significant impact
When pattern matching triggers immediate conclusions
When analyzing documentation or specifications
During any task requiring rigorous analytical reasoning

What This Skill Does

Evidence-Based Reasoning: Enforces showing evidence before interpretation
Multiple Hypothesis Generation: Prevents premature commitment to single explanation
Temporal Knowledge Verification: Handles knowledge cutoff limitations
Cognitive Failure Prevention: Recognizes and prevents common reasoning errors
Self-Correction Protocol: Provides framework for transparent error correction
Scope Discipline: Allocates cognitive effort appropriately

Core Principles

1. Evidence-Based Reasoning Protocol

Universal Rule: Never conclude without proof

MANDATORY SEQUENCE:
1. Show tool output FIRST
2. Quote specific evidence
3. THEN interpret

Forbidden Phrases:

"I assume"
"typically means"
"appears to"
"Tests pass" (without output)
"Meets standards" (without evidence)

Required Phrases:

"Command shows: 'actual output' - interpretation"
"Line N: 'code snippet' - meaning"
"Let me verify..." -> tool output -> interpretation

2. Multiple Working Hypotheses

When identical observations can arise from different mechanisms with opposite implications - investigate before concluding.

Three-Layer Reasoning Model:

Layer 1: OBSERVATION (What do I see?)
Layer 2: MECHANISM (How/why does this exist?)
Layer 3: ASSESSMENT (Is this good/bad/critical?)

FAILURE: Jump from Layer 1 -> Layer 3 (skip mechanism)
CORRECT: Layer 1 -> Layer 2 (investigate) -> Layer 3 (assess with context)

Decision Framework:

Recognize multiple hypotheses exist
- What mechanisms could produce this observation?
- Which mechanisms have opposite implications?
Generate competing hypotheses explicitly
- Hypothesis A: [mechanism] -> [implication]
- Hypothesis B: [different mechanism] -> [opposite implication]
Identify discriminating evidence
- What single observation would prove/disprove each?
Gather discriminating evidence
- Run the specific test that distinguishes hypotheses
Assess with mechanism context
- Same observation + different mechanism = different assessment

3. Temporal Knowledge Currency

Training data has a timestamp; absence of knowledge ≠ evidence of absence

Critical Context Check:

Before making claims about what exists:
1. What is my knowledge cutoff date?
2. What is today's date?
3. How much time has elapsed?
4. Could versions/features beyond my training exist?

High Risk Domains (always verify):

Package versions (npm, pip, maven)
Framework versions (React, Vue, Django)
Language versions (Python, Node, Go)
Cloud service features (AWS, GCP, Azure)
API versions and tool versions

Anti-Patterns:

"Version X doesn't exist" (without verification)
"Latest is Y" (based on stale training data)
"CRITICAL/BLOCKER" without evidence

4. Self-Correction Protocol

When discovering errors in previous output:

STEP 1: ACKNOWLEDGE EXPLICITLY
- Lead with "CRITICAL CORRECTION"
- Make it impossible to miss

STEP 2: STATE PREVIOUS CLAIM
- Quote exact wrong statement

STEP 3: PROVIDE EVIDENCE
- Show what proves the correction

STEP 4: EXPLAIN ERROR CAUSE
- Root cause: temporal gap? assumption?

STEP 5: CLEAR ACTION
- "NO CHANGE NEEDED" or "Revert suggestion"

5. Cognitive Resource Allocation

Parsimony Principle:

Choose simplest approach that satisfies requirements
Simple verification first, complexity only when simple fails

Scope Discipline:

Allocate resources to actual requirements, not hypothetical ones
"Was this explicitly requested?"

Information Economy:

Reuse established facts
Re-verify when context changes

Atomicity Principle:

Each action should have one clear purpose
If description requires "and" between distinct purposes, split it
Benefits: clearer failure diagnosis, easier progress tracking, better evidence attribution

6. Systematic Completion Discipline

Never declare success until ALL requirements verified

High-Risk Scenarios for Premature Completion:

Multi-step tasks with many quality gates
After successfully fixing major issues (cognitive reward triggers)
When tools show many errors (avoidance temptation)
Near end of session (completion pressure)

Completion Protocol:

Break requirements into explicit checkpoints
Complete each gate fully before proceeding
Show evidence at each checkpoint
Resist "good enough" shortcuts

Warning Signs:

Thinking "good enough" instead of checking all requirements
Applying blanket solutions without individual analysis
Skipping systematic verification
Declaring success while evidence shows otherwise

7. Individual Analysis Over Batch Processing

Core Principle: Every item deserves individual attention

Apply to:

Error messages (read each one individually)
Review items (analyze each line/file)
Decisions (don't apply blanket rules)
Suppressions (justify each one specifically)

Anti-Patterns:

Bulk categorization without reading details
Blanket solutions applied without context
Batch processing of unique situations

8. Semantic vs Literal Analysis

Look for conceptual overlap, not just text/pattern duplication

Key Questions:

What is the actual PURPOSE here?
Does this serve a functional need or just match a pattern?
What would be LOST if I removed/changed this?
Is this the same CONCEPT expressed differently?

Applications:

Documentation: Identify semantic duplication across hierarchy levels
Code review: Understand intent before suggesting changes
Optimization: Analyze actual necessity before improving

How to Use

Verify Before Claiming

Verify that package X version Y exists before recommending changes

Check if this file structure is symlinks or duplicates before recommending consolidation

Generate Multiple Hypotheses

The tests are failing with timeout errors. What are the possible mechanisms?

These three files have identical content. What could explain this?

Conduct Evidence-Based Review

Review this code and show evidence for every claim

Reasoning Workflows

Verification Workflow

When encountering unfamiliar versions/features:

Recognize uncertainty: "I don't recall X from training"
Form hypotheses: A) doesn't exist, B) exists but new, C) is current
Verify before concluding: Check authoritative source
Show evidence, then interpret: Command output -> conclusion

Assessment Workflow

When analyzing code, architecture, or configurations:

Observe: What do I see?
Investigate mechanism: HOW does this exist?
Then assess: Based on mechanism, is this good/bad?

Review Workflow

For code reviews, documentation reviews, or any analysis:

Clarify scope: Ask before assuming
Show evidence for every claim: File:line:code
Generate hypotheses before concluding
Distinguish mechanism from observation
Reserve strong language for verified issues

Cognitive Failure Patterns

Pattern 1: Scanning Instead of Reading

Missing obvious issues while finding minor ones
Prevention: Read every line/error individually

Pattern 2: Pattern Matching Without Context

Applying solutions without understanding problems
Prevention: Analyze actual purpose before applying templates

Pattern 3: Assumption-Based Conclusions

Guessing instead of verifying
Prevention: Evidence-based verification required

Pattern 4: Premature Success Declaration

"Task complete" ≠ "Requirements verified"
Prevention: Show tool output proving completion

Pattern 5: Temporal Knowledge Decay

Confusing "I don't know" with "doesn't exist"
Prevention: Verify version/feature currency

Pattern 6: Overconfidence Cascade

False premise: "X doesn't exist" (unverified)
    ↓
Amplified: "This is CRITICAL/BLOCKER"
    ↓
Harmful: "Change X to older version Y"
    ↓
Impact: Downgrade from newer to older

BREAK THE CASCADE: Verify the premise first

Examples

Example 1: Version Verification

Situation: Reviewing code that uses certifi==2025.10.5

Wrong Approach:

CRITICAL: Version 2025.10.5 doesn't exist!
Action: Change to 2024.x.x

Correct Approach:

Let me verify this version exists...

$ grep -A 3 'name = "certifi"' uv.lock
name = "certifi"
version = "2025.10.5"
source = { registry = "https://pypi.org/simple" }

Evidence shows version EXISTS on PyPI.
Action: NO CHANGE NEEDED - version is valid.

Example 2: File Structure Analysis

Observation: Three files with identical 1494-line content

specs/AGENTS.md
specs/CLAUDE.md
specs/GEMINI.md

Wrong Approach:

CRITICAL: File duplication - consolidate to single file

Correct Approach:

Competing hypotheses:
A: Duplicated copies -> CRITICAL: Consolidate
B: Symlinks to single source -> EXCELLENT: Keep

Gathering discriminating evidence:
$ ls -la specs/
lrwxrwxrwx CLAUDE.md -> AGENTS.md
lrwxrwxrwx GEMINI.md -> AGENTS.md

Mechanism: Symlinks (Hypothesis B confirmed)
Assessment: EXCELLENT architecture - agent-specific entry points with single source of truth
Action: Keep as-is

Example 3: Test Failure Analysis

Observation: 5 tests failing with "connection timeout"

Hypotheses:

A: Single dependency down (fix one thing)
B: Multiple independent timeouts (fix five things)
C: Test infrastructure issue (fix setup)
D: Environment config missing (fix config)

Investigation:

Check test dependencies
Check error timestamps (simultaneous vs sequential)
Run tests in isolation

Then conclude based on evidence.

Anti-Patterns

DO NOT:
- "File X doesn't exist" without: ls X
- "Function not used" without: grep -r "function_name"
- "Version invalid" without: checking registry/lockfile
- "Tests fail" without: running tests
- "CRITICAL/BLOCKER" without verification
- Use strong language without evidence
- Skip mechanism investigation
- Pattern match to first familiar case

DO:
- Show grep/ls/find output BEFORE claiming
- Quote actual lines: "file.py:123: 'code here' - issue"
- Check lockfiles for resolved versions
- Run available tools and show output
- Reserve strong language for evidence-proven issues
- "Let me verify..." -> tool output -> interpretation
- Generate multiple hypotheses before gathering evidence
- Distinguish observation from mechanism

Clarifying Questions

Before proceeding with complex tasks, ask:

What is the primary goal/context?
What scope is expected (simple fix vs comprehensive)?
What are the success criteria?
What constraints exist?

For reviews specifically:

Scope: All changed files or specific ones?
Depth: Quick feedback or comprehensive analysis?
Focus: Implementation quality, standards, or both?
Output: List of issues or prioritized roadmap?

Task Management Patterns

Review Request Interpretation

Universal Rule: ALL reviews are comprehensive unless explicitly scoped

Never assume limited scope based on:

Recent conversation topics
Previously completed partial work
Specific words that seem to narrow scope
Apparent simplicity of request

Always include:

All applicable quality gates
Evidence for every claim
Complete verification of requirements
Systematic coverage (not spot-checking)

Context Analysis Decision Framework

Universal Process:

Analyze actual purpose (don't assume from patterns)
Check consistency with actual usage
Verify with evidence (read/test to confirm)
Ask before acting when uncertain

Recognition Pattern:

WRONG: "Other components do X, so this needs X"
RIGHT: "Let me analyze if this component actually needs X for its purpose"

Related Use Cases

Code reviews requiring evidence-based claims
Version verification before recommendations
Architectural assessments
Debugging with multiple possible causes
Documentation analysis
Security audits
Performance investigations
Any analysis requiring rigorous reasoning