🛠️ 開発・MCP コミュニティ

verification-hygiene

External evidence discipline and search execution routing. Bridges structure_judgment and judgment_hygiene to govern how the model searches, what it retrieves, when to stop, and how to format evidence before internal reasoning. Prevents treating SEO-driven internet as infallible.

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o verification-hygiene.zip https://jpskill.com/download/20941.zip && unzip -o verification-hygiene.zip && rm verification-hygiene.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/20941.zip -OutFile "$d\verification-hygiene.zip"; Expand-Archive "$d\verification-hygiene.zip" -DestinationPath $d -Force; ri "$d\verification-hygiene.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して verification-hygiene.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → verification-hygiene フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-18
取得日時: 2026-05-18
同梱ファイル: 1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

スキル: verification_hygiene

目的

外部証拠の規律と検索実行ルーティング。

このスキルは、外部の事実の必要性を診断する structure_judgment と、最終出力を構造化する judgment_hygiene との間のギャップを埋めます。

その役割は、モデルが外部の世界（検索/ツール）にどのように触れるか、何を取得するか、いつ検索を停止するか、そして内部の推論空間に渡す前に現実をどのようにフォーマットするかを管理することです。これにより、モデルがSEO主導のインターネットを絶対的な情報源として扱うことを防ぎます。

バージョン

v0.4 — GPTの最終的な洗練（条件付き三角測量、直交定義、より豊富なペイロード、埋め込み例）とClaudeの実行ロジック修正（ステップ2/4のループバック）を組み込んだGeminiの最終ドラフトです。

ステータス

制御された試用が承認されました。一般展開はまだ承認されていません。

入力インターフェース

このスキルは、structure_judgment から以下のルーティングコンテキストを受け取ることを想定しています。

primary_layer (例: EVIDENCE_CONFLICT, VERIFICATION_NEED)
verification_trigger (yes である必要があります)
main_hazard (事前に特定された構造的な危険)
candidate_verification_target (具体的に確認する必要があるものの大まかな抽出)

明確な検証トリガーなしで呼び出された場合、中止して judgment_hygiene に戻ります。

検証ターゲットの種類

検索する前に、検証の対象を明示的に分類します。検索戦略はタイプによって異なります。

EVENT: この特定の出来事は起こったか？（時間的および一次情報源の追跡が必要）
STATUS: この規則/法律/機能は現在有効か？（最大限の鮮度が必要）
SOURCE: この引用/バイラルな主張はどこから来たのか？（出所検索が必要）
MEDIA_CONTEXT: この画像/動画/スクリーンショットの元の/完全なコンテキストは何か？（トリミングされているか、ディープフェイクか、誤ったキャプションが付けられているか？）
POLICY: 正確な公式の規則または法令は何か？（Tier 1のデータベース/公式サイトが必要）
METRIC: 正確な数値、価格、または投与量は何か？（Tier 1のデータベース/公式サイトが必要）
EVAL_RECORD: 外部機関が正式な判断を下したか？（例: 裁判所の判決、公式の規制措置、正式なリコール）。厳格な境界: これは、記録された機関の事実を取得することを意味し、Yelpのレビュー、専門家の意見、または世論を集約することではありません。

構造的な危険（検索モンスターのブラックブック）

1. クエリ密輸

偏ったユーザープロンプトを偏った検索クエリに変換し、確認結果を保証すること。（例: 「ワクチンマイクロチップの証拠」ではなく「ワクチン成分公式」を検索する）。

2. コンセンサスロンダリング

同じことを言う10の記事を「高い確実性」として扱うこと。実際には、その10の記事すべてが、同じ単一の未検証のReddit投稿を引用するSEOアグリゲーターである場合です。URLの量を証拠の独立性と誤解すること。

3. 認識論的アウトソーシング

インターネットに判断を任せるために、事実ではなく意見を検索すること。

4. 時間的盲目

3年前のランキングの高い記事を現在の現実として扱い、プロンプトの STATUS 要件を無視すること。

5. 検証の拡散

コアとなる事実がすでに確立されているか、決定的に欠落しているにもかかわらず、無限に検索を繰り返すこと。「注意」を「10ページのノイズを検索する」と同一視し、偽の対立と遅延を招くこと。

実行順序

ステップ0: インターフェースチェックとターゲット定義

structure_judgment から入力を受け取ります。
ターゲットタイプを定義します (EVENT, STATUS, SOURCE, MEDIA_CONTEXT, POLICY, METRIC, EVAL_RECORD)。

ステップ1: クエリ戦略（三角測量法）

単一の検索を実行するだけではありません。三角測量されたクエリセットを生成します。

中立クエリ: 常に必須です。感情的/評価的な言葉を削除します。コアとなるエンティティを検索します。
反証クエリ: ターゲットタイプが関係ない場合（例: 特定の歴史的な日付を見つける場合）を除き、デフォルトで実行します。反証または代替案を明示的に検索します。
出所クエリ: SOURCE と MEDIA_CONTEXT には必須です。その他にはオプション/条件付きです。出所、日付、元のコンテキストを検索します。

ステップ2: 実行とタスクに応じた拡散防止

クエリを実行します。無限に検索しないでください。以下の十分性基準を使用して停止します。

POLICY / METRIC / STATUS の場合: 1つの現在のTier 1ソースで十分です。
EVENT の場合: 一次情報源がない場合は、1つの一次情報源、または2つの真に独立した高品質のTier 2情報源を優先します。
SOURCE / MEDIA_CONTEXT の場合: 出所チェーンが解決されるか、行き詰まったときに停止します。
重要な状況（医療/法律）の場合: Tier 2のSEOコンセンサスが高くても、Tier 1の証拠がない場合、信頼度は制限されます（INF または棄権）。存在しないTier 1を検索し続けないでください。

ステップ3: ソースの階層化と重み付け

取得した証拠を階層に分類します。

Tier 1 (一次情報源): 公式データベース、裁判記録、元の生映像、直接の政策ページ、査読済みの一次論文。（アンカーとなる証拠）。
Tier 2 (信頼できる二次情報源): 確立されたジャーナリズム、専門機関の要約、専門家による統合。（裏付けとなる証拠）。
Tier 3 (三次情報源/SEO): コンテンツアグリゲーター、意見ブログ、未検証のソーシャルメディア、AI生成のリスト記事。（事実を単独で確立するには役に立ちません）。ルール: 重み > 数。1つのTier 1ソースは100のTier 3ソースを上回ります。

ステップ4: 競合マッピングと独立性チェック

情報源が競合する場合、または複数のTier 2情報源に依存する場合:

誰が何を言っているかをマッピングします。
独立性チェック: ソースAとソースBは、実際には同じPRリリースを引用しているだけではないか？
ループバック: 独立性チェックが失敗し（コンセンサスロンダリングが明らかになり）、使用可能な証拠がステップ2の十分性しきい値を下回った場合、真に独立した情報源を見つけるためにステップ2にループバックします。
2つの真に独立したTier 2情報源が反対の事実を述べている場合: 人工的に平均化しないでください。出力を明示的に usable_as: bound に設定します。

(原文がここで切り詰められています)

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

SKILL: verification_hygiene

Purpose

External evidence discipline and search execution routing.

This skill bridges the gap between structure_judgment (which diagnoses the need for external facts) and judgment_hygiene (which structures the final output).

Its job is to govern how the model touches the outside world (Search/Tools), what it retrieves, when it stops searching, and how it formats reality before passing it to the internal reasoning space. It prevents the model from treating the SEO-driven internet as an infallible oracle.

Version

v0.4 — Final Gemini draft incorporating GPT's final polish (conditional triangulation, orthogonal definition, richer payload, embedded examples) and Claude's execution logic fix (Step 2/4 loop-back).

Status

Approved for controlled trial. Not yet approved for general deployment.

Input Interface

This skill expects to receive the following routing context from structure_judgment:

primary_layer (e.g., EVIDENCE_CONFLICT, VERIFICATION_NEED)
verification_trigger (must be yes)
main_hazard (the structural danger identified upfront)
candidate_verification_target (a rough extraction of what specifically needs checking)

If invoked without a clear verification trigger, abort and return to judgment_hygiene.

Verification Target Types

Before searching, explicitly classify the object of verification. Search strategies differ by type:

EVENT: Did this specific incident happen? (Requires temporal and primary source tracking)
STATUS: Is this rule/law/feature currently active? (Requires maximum freshness)
SOURCE: Where did this quote/viral claim originate? (Requires provenance search)
MEDIA_CONTEXT: What is the original/full context of this image/video/screenshot? (Is it cropped, deepfaked, or miscaptioned?)
POLICY: What is the exact official rule or statute? (Requires Tier 1 database/official site)
METRIC: What is the exact number, price, or dosage? (Requires Tier 1 database/official site)
EVAL_RECORD: Has an external institution issued a formal judgment? (e.g., court rulings, official regulatory actions, formal recalls). Hard Boundary: This means retrieving a recorded institutional fact, NOT aggregating Yelp reviews, expert opinions, or public sentiment.

Structural Hazards (The Search Monster's Black Book)

1. Query-smuggling

Translating a biased user prompt into a biased search query, guaranteeing a confirming result. (e.g., searching "vaccine microchip evidence" instead of "vaccine ingredients official").

2. Consensus Laundering

Treating 10 articles saying the same thing as "high certainty," when all 10 are SEO aggregators citing the same single unverified Reddit post. Misreading quantity of URLs as independence of evidence.

3. Epistemic Outsourcing

Searching for opinions instead of facts to let the internet make the judgment.

4. Temporal Blindness

Treating a highly-ranked article from three years ago as current reality, ignoring the STATUS requirement of the prompt.

5. Verification Sprawl

Endless searching in a loop when the core fact is already established or definitively missing. Equating "caution" with "searching 10 pages of noise," which introduces fake conflicts and delays.

Execution Order

Step 0: Interface Check & Target Definition

Receive input from structure_judgment.
Define the Target Type (EVENT, STATUS, SOURCE, MEDIA_CONTEXT, POLICY, METRIC, EVAL_RECORD).

Step 1: Query Strategy (The Triangulation Method)

Do not just run one search. Generate a triangulated query set:

Neutral Query: Always mandatory. Strip emotional/evaluative words. Search core entities.
Disconfirming Query: Default, unless the target type makes it irrelevant (e.g., finding a specific historical date). Explicitly search for debunks or alternatives.
Provenance Query: Mandatory for SOURCE and MEDIA_CONTEXT. Optional/conditional for others. Search for origin, date, and original context.

Step 2: Execution & Task-Sensitive Sprawl Guard

Execute the queries. Do not search endlessly. Use these sufficiency criteria to STOP:

For POLICY / METRIC / STATUS: One current Tier 1 source is sufficient.
For EVENT: Prefer one primary or two genuinely independent high-quality Tier 2 sources if no primary exists.
For SOURCE / MEDIA_CONTEXT: Stop when the provenance chain is resolved or dead-ended.
For high-stakes (medical/legal): The absence of Tier 1 evidence keeps confidence bounded (INF or Abstain), even if Tier 2 SEO consensus is high. Do not keep searching for a nonexistent Tier 1.

Step 3: Source Tiering & Weighting

Classify retrieved evidence into Tiers:

Tier 1 (Primary): Official databases, court records, original raw footage, direct policy pages, peer-reviewed primary papers. (Anchor evidence).
Tier 2 (Credible Secondary): Established journalism, professional institutional summaries, expert synthesis. (Supporting evidence).
Tier 3 (Tertiary/SEO): Content aggregators, opinion blogs, unverified social media, AI-generated listicles. (Useless for establishing facts alone). Rule: Weight > Count. One Tier 1 source overrides 100 Tier 3 sources.

Step 4: Conflict Mapping & Independence Check

If sources conflict or if relying on multiple Tier 2 sources:

Map who is saying what.
Independence Check: Are Source A and Source B actually just quoting the same PR release?
Loop-Back: If the independence check fails (revealing Consensus Laundering) and drops the usable evidence below the Step 2 sufficiency threshold, loop back to Step 2 to find genuinely independent sources.
If two genuinely independent Tier 2 sources state opposite facts: Do not artificially average them. Explicitly set output to usable_as: bounded INF and document the clash in conflict_notes.

Step 4.5: The Reality Check (Compare to User Claim)

Compare the verified findings against the user's original smuggled premise. Classify the result as:

Supported: Evidence directly backs the user's claim.
Contradicted: Evidence directly refutes the user's claim.
Orthogonal: The retrieved evidence addresses the same entities but shows that the user’s framing is structurally the wrong question (e.g., user asks "why is X illegal", search shows X is entirely legal and encouraged).
Unresolved: Evidence is insufficient to support or refute.

Step 5: Route to Output Interface

Package the verified evidence for judgment_hygiene.

Hard Rules for External Verification

Rule A: Search is for OBS, not EVAL. Search may retrieve externally issued institutional evaluations (EVAL_RECORD), but the model must not treat public commentary, sentiment, consensus tone, or aggregated opinions as evaluative truth. Search retrieves the infrastructure (FACT/OBS); the internal framework does the judging.

Rule B: The Dead End Right (Honest Abstention). If search yields no Tier 1/2 sources, or only unresolvable noise, halt immediately. Do not synthesize a "best guess" from garbage. Route to abstention.

Rule C: Strict Freshness. For STATUS targets, current/volatile questions must prefer the most recent authoritative source. Older authoritative sources remain usable only if the domain is stable. If freshness is central and cannot be verified, downgrade confidence or abstain.

Output Interface (To `judgment_hygiene`)

Do NOT pass raw text, SEO consensus phrasing, sentiment summaries, or viral claims as "reality" downstream. Pass a structured evidence payload:

claim_verified: [The specific fact checked]
target_type: [EVENT / STATUS / SOURCE / MEDIA_CONTEXT / POLICY / METRIC / EVAL_RECORD]
source_basis: [Tier 1 / Tier 2 / Mixed (e.g., Tier 1 policy + Tier 2 context) / None]
independence_check: [Passed / Failed (Consensus Laundering detected)]
temporal_status: [Current / Outdated / Unknown]
claim_comparison: [Supported / Contradicted / Orthogonal / Unresolved]
usable_as: [OBS (High confidence) / bounded INF (Contested/Partial) / abstention_trigger (Dead end)]
dead_end_reason: [None / no_primary / only_tertiary / unresolved_conflict / freshness_unknown]
conflict_notes: [Brief map of unresolved conflicts, if any]

Repair Protocol

When a verification hazard is detected during execution:

Repair 1: Query Reset (Anti-Smuggling)

If the initial query contains words like "toxic", "scam", "proof of", cancel the search. Rewrite the query to purely objective entity names and run Step 1 again.

Repair 2: Depth Override (Anti-Laundering)

If multiple sources agree but all cite a single unverified origin, execute a Provenance Query. If no root source exists:

Low-Stakes descriptive contexts: Downgrade usable_as to bounded INF (rumor).
High-Stakes domains (health/legal/safety): Unresolved tertiary consensus should immediately trigger abstention_trigger, not usable inference.

Repair 3: Condition-Based Sprawl Cutoff

If a new round of searching introduces no new Tier 1/2 results and opens no new verifiable direction, STOP. Do not rely on arbitrary iteration limits. Trigger the Dead End Right (Abstention).

Repair 4: Epistemic De-linking

If a retrieved source contains both facts and the author's strong opinions, strip the opinions before passing the payload downstream. Pass only the OBS.

Critical Examples

Example 1: Query-smuggling vs. Triangulation

User Prompt: "Why did the CEO intentionally crash the stock today?"
Bad Routing (Query-smuggling): Searches CEO intentionally crashed stock reasons.
Better Routing (Step 1 Triangulation): - Neutral: Company CEO stock drop today events
- Disconfirming: Company stock drop market factors debunk

Example 2: Consensus Laundering

Search Result: 15 tech blogs report "New phone emits dangerous radiation levels."
Bad Routing: Passes downstream as Verified OBS because of high consensus.
Better Routing (Step 4 Independence Check): Detects all 15 blogs link to a single unverified tweet. Downgrades to bounded INF (or abstention_trigger due to health risk) and notes: "High volume consensus based on single unverified tertiary source."

Example 3: The Dead End Right

User Prompt: "What is the secret ingredient in this undocumented supplement?"
Search Result: 10 pages of affiliate-link SEO spam, no medical databases.
Bad Routing: Synthesizes the most common claims from the spam into a "possible ingredients list."
Better Routing (Step 2 Sprawl Guard): Fails to find Tier 1/2. Halts search. Passes usable_as: abstention_trigger with dead_end_reason: only_tertiary.

Example 4: MEDIA_CONTEXT Tracking

User Prompt: "Look at this video of the politician screaming at a homeless person."
Search Result: A provenance search (reverse image search/keyword trace) finds the original uncropped video showing the politician shouting to be heard over loud factory machinery, not a person.
Routing Result: Passes downstream as claim_comparison: Contradicted and usable_as: OBS, effectively destroying the user's smuggled premise.

Example 5: EVAL_RECORD vs. Epistemic Outsourcing

User Prompt: "Is this new crypto exchange a complete scam?"
Bad Routing: Searches is CryptoExchangeX a scam and aggregates Reddit opinions.
Better Routing: Targets EVAL_RECORD. Searches CryptoExchangeX SEC filings lawsuit regulatory action. Finds an official FTC injunction. Passes the institutional fact (OBS) downstream, not the internet's emotional verdict.

Recurrent Failure Signal

If the model repeatedly exhibits query smuggling, consensus laundering, or verification sprawl:

Reduce the allowed search depth for that task family unless a completely new Query Type is introduced.
Force mandatory generation of a Disconfirming Query before any search.

Summary Constraint

If the search process merely confirms the user's premise by aggregating the loudest internet noise, rather than actively attempting to disconfirm, trace, and tier the evidence, this skill has been bypassed. Additionally, if the search process keeps expanding (searching page after page) after the verification target is already sufficiently established or definitively dead-ended, this skill has also been bypassed through verification sprawl.