🛠️ 開発・MCP コミュニティ 🔴 エンジニア向け 👤 エンジニア・AI開発者

🛠️ Scientific Critical Thinking

scientific-critical-thinking

科学的な主張やデータの質を評価し、実験

⚡ ⏱ 障害ポストモーテム 1日 → 1時間

📺 まず動画で見る(YouTube)

▶ 【衝撃】最強のAIエージェント「Claude Code」の最新機能・使い方・プログラミングをAIで効率化する超実践術を解説! ↗

※ jpskill.com 編集部が参考用に選んだ動画です。動画の内容と Skill の挙動は厳密には一致しないことがあります。

📜 元の英語説明(参考)

Evaluate scientific claims and evidence quality. Use for assessing experimental design validity, identifying biases and confounders, applying evidence grading frameworks (GRADE, Cochrane Risk of Bias), or teaching critical analysis. Best for understanding evidence quality, identifying flaws. For formal peer review writing use peer-review.

🇯🇵 日本人クリエイター向け解説

一言でいうと

科学的な主張やデータの質を評価し、実験

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o scientific-critical-thinking.zip https://jpskill.com/download/4232.zip && unzip -o scientific-critical-thinking.zip && rm scientific-critical-thinking.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/4232.zip -OutFile "$d\scientific-critical-thinking.zip"; Expand-Archive "$d\scientific-critical-thinking.zip" -DestinationPath $d -Force; ri "$d\scientific-critical-thinking.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して scientific-critical-thinking.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → scientific-critical-thinking フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-17
取得日時: 2026-05-18
同梱ファイル: 9

💬 こう話しかけるだけ — サンプルプロンプト

› Scientific Critical Thinking を使って、最小構成のサンプルコードを示して
› Scientific Critical Thinking の主な使い方と注意点を教えて
› Scientific Critical Thinking を既存プロジェクトに組み込む方法を教えて

これをClaude Code に貼るだけで、このSkillが自動発動します。

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

科学的批判的思考

概要

批判的思考とは、科学的厳密性を評価するための体系的なプロセスです。GRADEおよびCochrane ROBフレームワークを用いて、方法論、実験計画、統計的妥当性、バイアス、交絡、エビデンスの質を評価します。このスキルを科学的主張の批判的分析に適用してください。

このスキルを使用する場面

このスキルは、以下の状況で使用してください。

研究方法論と実験計画の評価
統計的妥当性とエビデンスの質の評価
研究におけるバイアスと交絡の特定
科学的主張と結論のレビュー
システマティックレビューまたはメタアナリシスの実施
GRADEまたはCochraneのバイアスリスク評価の適用
研究論文の批判的分析の提供

科学的図解による視覚的強化

このスキルを用いて文書を作成する際は、常に科学的図や図解を追加して視覚的コミュニケーションを強化することを検討してください。

文書にまだ図や図解が含まれていない場合：

scientific-schematicsスキルを使用して、AIを活用した出版品質の図を生成してください。
目的の図を自然言語で記述するだけです。
Nano Banana Proが自動的に図解を生成、レビュー、洗練します。

新規文書の場合： 科学的図解は、テキストで記述されている主要な概念、ワークフロー、アーキテクチャ、または関係を視覚的に表現するために、デフォルトで生成されるべきです。

図解の生成方法：

python scripts/generate_schematic.py "your diagram description" -o figures/output.png

AIは自動的に以下を行います。

適切な書式設定で出版品質の画像を生成します。
複数回の反復を通じてレビューと洗練を行います。
アクセシビリティ（色覚異常対応、高コントラスト）を保証します。
出力は figures/ ディレクトリに保存されます。

図解を追加する場面：

批判的思考フレームワークの図
バイアス特定のための意思決定ツリー
エビデンスの質評価フローチャート
GRADE評価方法論の図
バイアスリスク評価フレームワーク
妥当性評価の可視化
可視化によって恩恵を受けるあらゆる複雑な概念

図解作成の詳細なガイダンスについては、scientific-schematicsスキル文書を参照してください。

コア機能

1. 方法論の批判

研究方法論の厳密性、妥当性、潜在的な欠陥を評価します。

適用場面：

研究論文のレビュー
実験計画の評価
研究プロトコルの評価
新しい研究の計画

評価フレームワーク：

研究デザインの評価
- デザインは研究課題に適していますか？
- デザインは主張されている因果関係を裏付けることができますか？
- 比較群は適切かつ十分ですか？
- 実験的、準実験的、観察的デザインが正当化されるかどうかを検討してください。
妥当性分析
- 内的妥当性： 因果推論を信頼できますか？
  - ランダム化の質を確認してください。
  - 交絡制御を評価してください。
  - 選択バイアスを評価してください。
  - 脱落/ドロップアウトパターンをレビューしてください。
- 外的妥当性： 結果は一般化できますか？
  - サンプルの代表性を評価してください。
  - 設定の生態学的妥当性を検討してください。
  - 条件がターゲットアプリケーションと一致するかどうかを評価してください。
- 構成概念妥当性： 測定は意図された構成概念を捉えていますか？
  - 測定の妥当性確認をレビューしてください。
  - 操作的定義を確認してください。
  - 測定が直接的か代理的かを評価してください。
- 統計的結論の妥当性： 統計的推論は適切ですか？
  - 適切な検出力/サンプルサイズを確認してください。
  - 仮定の遵守を確認してください。
  - 検定の適切性を評価してください。
制御と盲検化
- ランダム化は適切に実施されましたか（シーケンス生成、割り付けの隠蔽）？
- 盲検化は可能であり、実施されましたか（参加者、提供者、評価者）？
- 制御条件は適切ですか（プラセボ、アクティブコントロール、無治療）？
- パフォーマンスバイアスまたは検出バイアスが結果に影響を与える可能性はありますか？
測定の質
- 測定器は妥当性が確認され、信頼性がありますか？
- 測定は可能な限り客観的ですか、それとも限界が認められた主観的なものですか？
- アウトカム評価は標準化されていますか？
- 複数の測定が結果の三角測量に使用されていますか？

参照： 詳細な原則については references/scientific_method.md を、包括的なデザインチェックリストについては references/experimental_design.md を参照してください。

2. バイアスの検出

結果を歪める可能性のあるバイアスの潜在的な原因を特定し、評価します。

適用場面：

公開された研究のレビュー
新しい研究のデザイン
矛盾するエビデンスの解釈
研究の質の評価

体系的なバイアスレビュー：

認知バイアス（研究者）
- 確証バイアス： 支持する結果のみが強調されていますか？
- HARKing： 仮説は事前に述べられましたか、それとも結果を見てから形成されましたか？
- 出版バイアス： 否定的な結果が文献から欠落していませんか？
- チェリーピッキング： エビデンスは選択的に報告されていますか？
- 事前登録と分析計画の透明性を確認してください。
選択バイアス
- サンプリングバイアス： サンプルは対象集団を代表していますか？
- ボランティアバイアス： 参加者は体系的な方法で自己選択していますか？
- 脱落バイアス： グループ間で脱落に差がありますか？
- 生存者バイアス： サンプルには「生存者」のみが見えていますか？
- 参加者フロー図を調べ、ベースライン特性を比較してください。
測定バイアス
- 観察者バイアス： 期待が観察に影響を与える可能性はありますか？
- 想起バイアス： 遡及的報告は体系的に不正確ですか？
- 社会的望ましさ： 回答は受容性に向けて偏っていますか？
- 測定器バイアス： 測定ツールは体系的に誤っていますか？
- 盲検化、妥当性確認、測定の客観性を評価してください。
分析バイアス
- Pハッキング： 有意性が出るまで複数の分析が実施されましたか？
- アウトカムの切り替え： 有意でないアウトカムが有意なものに置き換えられましたか？
- 選択的報告： 計画されたすべての分析が報告されていますか？

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

Scientific Critical Thinking

Overview

Critical thinking is a systematic process for evaluating scientific rigor. Assess methodology, experimental design, statistical validity, biases, confounding, and evidence quality using GRADE and Cochrane ROB frameworks. Apply this skill for critical analysis of scientific claims.

When to Use This Skill

This skill should be used when:

Evaluating research methodology and experimental design
Assessing statistical validity and evidence quality
Identifying biases and confounding in studies
Reviewing scientific claims and conclusions
Conducting systematic reviews or meta-analyses
Applying GRADE or Cochrane risk of bias assessments
Providing critical analysis of research papers

Visual Enhancement with Scientific Schematics

When creating documents with this skill, always consider adding scientific diagrams and schematics to enhance visual communication.

If your document does not already contain schematics or diagrams:

Use the scientific-schematics skill to generate AI-powered publication-quality diagrams
Simply describe your desired diagram in natural language
Nano Banana Pro will automatically generate, review, and refine the schematic

For new documents: Scientific schematics should be generated by default to visually represent key concepts, workflows, architectures, or relationships described in the text.

How to generate schematics:

python scripts/generate_schematic.py "your diagram description" -o figures/output.png

The AI will automatically:

Create publication-quality images with proper formatting
Review and refine through multiple iterations
Ensure accessibility (colorblind-friendly, high contrast)
Save outputs in the figures/ directory

When to add schematics:

Critical thinking framework diagrams
Bias identification decision trees
Evidence quality assessment flowcharts
GRADE assessment methodology diagrams
Risk of bias evaluation frameworks
Validity assessment visualizations
Any complex concept that benefits from visualization

For detailed guidance on creating schematics, refer to the scientific-schematics skill documentation.

Core Capabilities

1. Methodology Critique

Evaluate research methodology for rigor, validity, and potential flaws.

Apply when:

Reviewing research papers
Assessing experimental designs
Evaluating study protocols
Planning new research

Evaluation framework:

Study Design Assessment
- Is the design appropriate for the research question?
- Can the design support causal claims being made?
- Are comparison groups appropriate and adequate?
- Consider whether experimental, quasi-experimental, or observational design is justified
Validity Analysis
- Internal validity: Can we trust the causal inference?
  - Check randomization quality
  - Evaluate confounding control
  - Assess selection bias
  - Review attrition/dropout patterns
- External validity: Do results generalize?
  - Evaluate sample representativeness
  - Consider ecological validity of setting
  - Assess whether conditions match target application
- Construct validity: Do measures capture intended constructs?
  - Review measurement validation
  - Check operational definitions
  - Assess whether measures are direct or proxy
- Statistical conclusion validity: Are statistical inferences sound?
  - Verify adequate power/sample size
  - Check assumption compliance
  - Evaluate test appropriateness
Control and Blinding
- Was randomization properly implemented (sequence generation, allocation concealment)?
- Was blinding feasible and implemented (participants, providers, assessors)?
- Are control conditions appropriate (placebo, active control, no treatment)?
- Could performance or detection bias affect results?
Measurement Quality
- Are instruments validated and reliable?
- Are measures objective when possible, or subjective with acknowledged limitations?
- Is outcome assessment standardized?
- Are multiple measures used to triangulate findings?

Reference: See references/scientific_method.md for detailed principles and references/experimental_design.md for comprehensive design checklist.

2. Bias Detection

Identify and evaluate potential sources of bias that could distort findings.

Apply when:

Reviewing published research
Designing new studies
Interpreting conflicting evidence
Assessing research quality

Systematic bias review:

Cognitive Biases (Researcher)
- Confirmation bias: Are only supporting findings highlighted?
- HARKing: Were hypotheses stated a priori or formed after seeing results?
- Publication bias: Are negative results missing from literature?
- Cherry-picking: Is evidence selectively reported?
- Check for preregistration and analysis plan transparency
Selection Biases
- Sampling bias: Is sample representative of target population?
- Volunteer bias: Do participants self-select in systematic ways?
- Attrition bias: Is dropout differential between groups?
- Survivorship bias: Are only "survivors" visible in sample?
- Examine participant flow diagrams and compare baseline characteristics
Measurement Biases
- Observer bias: Could expectations influence observations?
- Recall bias: Are retrospective reports systematically inaccurate?
- Social desirability: Are responses biased toward acceptability?
- Instrument bias: Do measurement tools systematically err?
- Evaluate blinding, validation, and measurement objectivity
Analysis Biases
- P-hacking: Were multiple analyses conducted until significance emerged?
- Outcome switching: Were non-significant outcomes replaced with significant ones?
- Selective reporting: Are all planned analyses reported?
- Subgroup fishing: Were subgroup analyses conducted without correction?
- Check for study registration and compare to published outcomes
Confounding
- What variables could affect both exposure and outcome?
- Were confounders measured and controlled (statistically or by design)?
- Could unmeasured confounding explain findings?
- Are there plausible alternative explanations?

Reference: See references/common_biases.md for comprehensive bias taxonomy with detection and mitigation strategies.

3. Statistical Analysis Evaluation

Critically assess statistical methods, interpretation, and reporting.

Apply when:

Reviewing quantitative research
Evaluating data-driven claims
Assessing clinical trial results
Reviewing meta-analyses

Statistical review checklist:

Sample Size and Power
- Was a priori power analysis conducted?
- Is sample adequate for detecting meaningful effects?
- Is the study underpowered (common problem)?
- Do significant results from small samples raise flags for inflated effect sizes?
Statistical Tests
- Are tests appropriate for data type and distribution?
- Were test assumptions checked and met?
- Are parametric tests justified, or should non-parametric alternatives be used?
- Is the analysis matched to study design (e.g., paired vs. independent)?
Multiple Comparisons
- Were multiple hypotheses tested?
- Was correction applied (Bonferroni, FDR, other)?
- Are primary outcomes distinguished from secondary/exploratory?
- Could findings be false positives from multiple testing?
P-Value Interpretation
- Are p-values interpreted correctly (probability of data if null is true)?
- Is non-significance incorrectly interpreted as "no effect"?
- Is statistical significance conflated with practical importance?
- Are exact p-values reported, or only "p < .05"?
- Is there suspicious clustering just below .05?
Effect Sizes and Confidence Intervals
- Are effect sizes reported alongside significance?
- Are confidence intervals provided to show precision?
- Is the effect size meaningful in practical terms?
- Are standardized effect sizes interpreted with field-specific context?
Missing Data
- How much data is missing?
- Is missing data mechanism considered (MCAR, MAR, MNAR)?
- How is missing data handled (deletion, imputation, maximum likelihood)?
- Could missing data bias results?
Regression and Modeling
- Is the model overfitted (too many predictors, no cross-validation)?
- Are predictions made outside the data range (extrapolation)?
- Are multicollinearity issues addressed?
- Are model assumptions checked?
Common Pitfalls
- Correlation treated as causation
- Ignoring regression to the mean
- Base rate neglect
- Texas sharpshooter fallacy (pattern finding in noise)
- Simpson's paradox (confounding by subgroups)

Reference: See references/statistical_pitfalls.md for detailed pitfalls and correct practices.

4. Evidence Quality Assessment

Evaluate the strength and quality of evidence systematically.

Apply when:

Weighing evidence for decisions
Conducting literature reviews
Comparing conflicting findings
Determining confidence in conclusions

Evidence evaluation framework:

Study Design Hierarchy
- Systematic reviews/meta-analyses (highest for intervention effects)
- Randomized controlled trials
- Cohort studies
- Case-control studies
- Cross-sectional studies
- Case series/reports
- Expert opinion (lowest)
Important: Higher-level designs aren't always better quality. A well-designed observational study can be stronger than a poorly-conducted RCT.
Quality Within Design Type
- Risk of bias assessment (use appropriate tool: Cochrane ROB, Newcastle-Ottawa, etc.)
- Methodological rigor
- Transparency and reporting completeness
- Conflicts of interest
GRADE Considerations (if applicable)
- Start with design type (RCT = high, observational = low)
- Downgrade for:
  - Risk of bias
  - Inconsistency across studies
  - Indirectness (wrong population/intervention/outcome)
  - Imprecision (wide confidence intervals, small samples)
  - Publication bias
- Upgrade for:
  - Large effect sizes
  - Dose-response relationships
  - Confounders would reduce (not increase) effect
Convergence of Evidence
- Stronger when:
  - Multiple independent replications
  - Different research groups and settings
  - Different methodologies converge on same conclusion
  - Mechanistic and empirical evidence align
- Weaker when:
  - Single study or research group
  - Contradictory findings in literature
  - Publication bias evident
  - No replication attempts
Contextual Factors
- Biological/theoretical plausibility
- Consistency with established knowledge
- Temporality (cause precedes effect)
- Specificity of relationship
- Strength of association

Reference: See references/evidence_hierarchy.md for detailed hierarchy, GRADE system, and quality assessment tools.

5. Logical Fallacy Identification

Detect and name logical errors in scientific arguments and claims.

Apply when:

Evaluating scientific claims
Reviewing discussion/conclusion sections
Assessing popular science communication
Identifying flawed reasoning

Common fallacies in science:

Causation Fallacies
- Post hoc ergo propter hoc: "B followed A, so A caused B"
- Correlation = causation: Confusing association with causality
- Reverse causation: Mistaking cause for effect
- Single cause fallacy: Attributing complex outcomes to one factor
Generalization Fallacies
- Hasty generalization: Broad conclusions from small samples
- Anecdotal fallacy: Personal stories as proof
- Cherry-picking: Selecting only supporting evidence
- Ecological fallacy: Group patterns applied to individuals
Authority and Source Fallacies
- Appeal to authority: "Expert said it, so it's true" (without evidence)
- Ad hominem: Attacking person, not argument
- Genetic fallacy: Judging by origin, not merits
- Appeal to nature: "Natural = good/safe"
Statistical Fallacies
- Base rate neglect: Ignoring prior probability
- Texas sharpshooter: Finding patterns in random data
- Multiple comparisons: Not correcting for multiple tests
- Prosecutor's fallacy: Confusing P(E|H) with P(H|E)
Structural Fallacies
- False dichotomy: "Either A or B" when more options exist
- Moving goalposts: Changing evidence standards after they're met
- Begging the question: Circular reasoning
- Straw man: Misrepresenting arguments to attack them
Science-Specific Fallacies
- Galileo gambit: "They laughed at Galileo, so my fringe idea is correct"
- Argument from ignorance: "Not proven false, so true"
- Nirvana fallacy: Rejecting imperfect solutions
- Unfalsifiability: Making untestable claims

When identifying fallacies:

Name the specific fallacy
Explain why the reasoning is flawed
Identify what evidence would be needed for valid inference
Note that fallacious reasoning doesn't prove the conclusion false—just that this argument doesn't support it

Reference: See references/logical_fallacies.md for comprehensive fallacy catalog with examples and detection strategies.

6. Research Design Guidance

Provide constructive guidance for planning rigorous studies.

Apply when:

Helping design new experiments
Planning research projects
Reviewing research proposals
Improving study protocols

Design process:

Research Question Refinement
- Ensure question is specific, answerable, and falsifiable
- Verify it addresses a gap or contradiction in literature
- Confirm feasibility (resources, ethics, time)
- Define variables operationally
Design Selection
- Match design to question (causal → experimental; associational → observational)
- Consider feasibility and ethical constraints
- Choose between-subjects, within-subjects, or mixed designs
- Plan factorial designs if testing multiple factors
Bias Minimization Strategy
- Implement randomization when possible
- Plan blinding at all feasible levels (participants, providers, assessors)
- Identify and plan to control confounds (randomization, matching, stratification, statistical adjustment)
- Standardize all procedures
- Plan to minimize attrition
Sample Planning
- Conduct a priori power analysis (specify expected effect, desired power, alpha)
- Account for attrition in sample size
- Define clear inclusion/exclusion criteria
- Consider recruitment strategy and feasibility
- Plan for sample representativeness
Measurement Strategy
- Select validated, reliable instruments
- Use objective measures when possible
- Plan multiple measures of key constructs (triangulation)
- Ensure measures are sensitive to expected changes
- Establish inter-rater reliability procedures
Analysis Planning
- Prespecify all hypotheses and analyses
- Designate primary outcome clearly
- Plan statistical tests with assumption checks
- Specify how missing data will be handled
- Plan to report effect sizes and confidence intervals
- Consider multiple comparison corrections
Transparency and Rigor
- Preregister study and analysis plan
- Use reporting guidelines (CONSORT, STROBE, PRISMA)
- Plan to report all outcomes, not just significant ones
- Distinguish confirmatory from exploratory analyses
- Commit to data/code sharing

Reference: See references/experimental_design.md for comprehensive design checklist covering all stages from question to dissemination.

7. Claim Evaluation

Systematically evaluate scientific claims for validity and support.

Apply when:

Assessing conclusions in papers
Evaluating media reports of research
Reviewing abstract or introduction claims
Checking if data support conclusions

Claim evaluation process:

Identify the Claim
- What exactly is being claimed?
- Is it a causal claim, associational claim, or descriptive claim?
- How strong is the claim (proven, likely, suggested, possible)?
Assess the Evidence
- What evidence is provided?
- Is evidence direct or indirect?
- Is evidence sufficient for the strength of claim?
- Are alternative explanations ruled out?
Check Logical Connection
- Do conclusions follow from the data?
- Are there logical leaps?
- Is correlational data used to support causal claims?
- Are limitations acknowledged?
Evaluate Proportionality
- Is confidence proportional to evidence strength?
- Are hedging words used appropriately?
- Are limitations downplayed?
- Is speculation clearly labeled?
Check for Overgeneralization
- Do claims extend beyond the sample studied?
- Are population restrictions acknowledged?
- Is context-dependence recognized?
- Are caveats about generalization included?
Red Flags
- Causal language from correlational studies
- "Proves" or absolute certainty
- Cherry-picked citations
- Ignoring contradictory evidence
- Dismissing limitations
- Extrapolation beyond data

Provide specific feedback:

Quote the problematic claim
Explain what evidence would be needed to support it
Suggest appropriate hedging language if warranted
Distinguish between data (what was found) and interpretation (what it means)

Application Guidelines

General Approach

Be Constructive
- Identify strengths as well as weaknesses
- Suggest improvements rather than just criticizing
- Distinguish between fatal flaws and minor limitations
- Recognize that all research has limitations
Be Specific
- Point to specific instances (e.g., "Table 2 shows..." or "In the Methods section...")
- Quote problematic statements
- Provide concrete examples of issues
- Reference specific principles or standards violated
Be Proportionate
- Match criticism severity to issue importance
- Distinguish between major threats to validity and minor concerns
- Consider whether issues affect primary conclusions
- Acknowledge uncertainty in your own assessments
Apply Consistent Standards
- Use same criteria across all studies
- Don't apply stricter standards to findings you dislike
- Acknowledge your own potential biases
- Base judgments on methodology, not results
Consider Context
- Acknowledge practical and ethical constraints
- Consider field-specific norms for effect sizes and methods
- Recognize exploratory vs. confirmatory contexts
- Account for resource limitations in evaluating studies

When Providing Critique

Structure feedback as:

Summary: Brief overview of what was evaluated
Strengths: What was done well (important for credibility and learning)
Concerns: Issues organized by severity
- Critical issues (threaten validity of main conclusions)
- Important issues (affect interpretation but not fatally)
- Minor issues (worth noting but don't change conclusions)
Specific Recommendations: Actionable suggestions for improvement
Overall Assessment: Balanced conclusion about evidence quality and what can be concluded

Use precise terminology:

Name specific biases, fallacies, and methodological issues
Reference established standards and guidelines
Cite principles from scientific methodology
Use technical terms accurately

When Uncertain

Acknowledge uncertainty: "This could be X or Y; additional information needed is Z"
Ask clarifying questions: "Was [methodological detail] done? This affects interpretation."
Provide conditional assessments: "If X was done, then Y follows; if not, then Z is concern"
Note what additional information would resolve uncertainty

Reference Materials

This skill includes comprehensive reference materials that provide detailed frameworks for critical evaluation:

references/scientific_method.md - Core principles of scientific methodology, the scientific process, critical evaluation criteria, red flags in scientific claims, causal inference standards, peer review, and open science principles
references/common_biases.md - Comprehensive taxonomy of cognitive, experimental, methodological, statistical, and analysis biases with detection and mitigation strategies
references/statistical_pitfalls.md - Common statistical errors and misinterpretations including p-value misunderstandings, multiple comparisons problems, sample size issues, effect size mistakes, correlation/causation confusion, regression pitfalls, and meta-analysis issues
references/evidence_hierarchy.md - Traditional evidence hierarchy, GRADE system, study quality assessment criteria, domain-specific considerations, evidence synthesis principles, and practical decision frameworks
references/logical_fallacies.md - Logical fallacies common in scientific discourse organized by type (causation, generalization, authority, relevance, structure, statistical) with examples and detection strategies
references/experimental_design.md - Comprehensive experimental design checklist covering research questions, hypotheses, study design selection, variables, sampling, blinding, randomization, control groups, procedures, measurement, bias minimization, data management, statistical planning, ethical considerations, validity threats, and reporting standards

When to consult references:

Load references into context when detailed frameworks are needed
Use grep to search references for specific topics: grep -r "pattern" references/
References provide depth; SKILL.md provides procedural guidance
Consult references for comprehensive lists, detailed criteria, and specific examples

Remember

Scientific critical thinking is about:

Systematic evaluation using established principles
Constructive critique that improves science
Proportional confidence to evidence strength
Transparency about uncertainty and limitations
Consistent application of standards
Recognition that all research has limitations
Balance between skepticism and openness to evidence

Always distinguish between:

Data (what was observed) and interpretation (what it means)
Correlation and causation
Statistical significance and practical importance
Exploratory and confirmatory findings
What is known and what is uncertain
Evidence against a claim and evidence for the null

Goals of critical thinking:

Identify strengths and weaknesses accurately
Determine what conclusions are supported
Recognize limitations and uncertainties
Suggest improvements for future work
Advance scientific understanding

同梱ファイル

※ ZIPに含まれるファイル一覧。`SKILL.md` 本体に加え、参考資料・サンプル・スクリプトが入っている場合があります。

📄 SKILL.md (23,705 bytes)
📎 references/common_biases.md (11,093 bytes)
📎 references/evidence_hierarchy.md (13,444 bytes)
📎 references/experimental_design.md (16,068 bytes)
📎 references/logical_fallacies.md (18,014 bytes)
📎 references/scientific_method.md (6,100 bytes)
📎 references/statistical_pitfalls.md (15,726 bytes)
📎 scripts/generate_schematic_ai.py (32,647 bytes)
📎 scripts/generate_schematic.py (5,146 bytes)