📄 Eqtl Catalogue Region Fetch
遺伝子の発現量に影響を与える遺伝子変
📺 まず動画で見る(YouTube)
▶ Claude最新!PowerPoint, Excel, Wordを生成できる機能を解説 ↗
※ jpskill.com 編集部が参考用に選んだ動画です。動画の内容と Skill の挙動は厳密には一致しないことがあります。
📜 元の英語説明(参考)
Fetch a region of cis-eQTL summary statistics from EBI eQTL Catalogue v7+ via tabix-on-FTP. Use when an agent needs eQTL beta / SE / p-value for every variant in a window around a gene's TSS for one specific dataset (study × tissue × quantification method). Input: dataset_id, chromosome, start, end, optional molecular_trait_id. Output: harmonised TSV slice.
🇯🇵 日本人クリエイター向け解説
遺伝子の発現量に影響を与える遺伝子変
※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。
下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。 ダウンロード → 解凍 → 配置まで全自動。
mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o eqtl-catalogue-region-fetch.zip https://jpskill.com/download/4080.zip && unzip -o eqtl-catalogue-region-fetch.zip && rm eqtl-catalogue-region-fetch.zip
$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/4080.zip -OutFile "$d\eqtl-catalogue-region-fetch.zip"; Expand-Archive "$d\eqtl-catalogue-region-fetch.zip" -DestinationPath $d -Force; ri "$d\eqtl-catalogue-region-fetch.zip"
完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。
💾 手動でダウンロードしたい(コマンドが難しい人向け)
- 1. 下の青いボタンを押して
eqtl-catalogue-region-fetch.zipをダウンロード - 2. ZIPファイルをダブルクリックで解凍 →
eqtl-catalogue-region-fetchフォルダができる - 3. そのフォルダを
C:\Users\あなたの名前\.claude\skills\(Win)または~/.claude/skills/(Mac)へ移動 - 4. Claude Code を再起動
⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。
🎯 このSkillでできること
下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。
📦 インストール方法 (3ステップ)
- 1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
- 2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
- 3. 展開してできたフォルダを、ホームフォルダの
.claude/skills/に置く- · macOS / Linux:
~/.claude/skills/ - · Windows:
%USERPROFILE%\.claude\skills\
- · macOS / Linux:
Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。
詳しい使い方ガイドを見る →- 最終更新
- 2026-05-17
- 取得日時
- 2026-05-17
- 同梱ファイル
- 2
💬 こう話しかけるだけ — サンプルプロンプト
- › Eqtl Catalogue Region Fetch を使って、来週の会議資料の下書きを作って
- › Eqtl Catalogue Region Fetch で、既存ファイルから必要な部分だけ抽出して
- › Eqtl Catalogue Region Fetch で、提供されたテンプレートに沿って自動整形して
これをClaude Code に貼るだけで、このSkillが自動発動します。
📖 Claude が読む原文 SKILL.md(中身を展開)
この本文は AI(Claude)が読むための原文(英語または中国語)です。日本語訳は順次追加中。
🧬 eQTL Catalogue Region Fetch
You are eQTL Catalogue Region Fetch, a specialised ClawBio agent for pulling per-variant cis-QTL summary statistics from EBI's eQTL Catalogue v7+. Your role is to return harmonised summary stats (β, SE, p-value, MAF) for every variant in a chromosomal window from one (study × tissue × quantification) dataset, ready for downstream colocalisation, fine-mapping, regional plotting, or Mendelian randomisation.
Overview
eQTL Catalogue (Kerimov 2021 Nat Genet) is the de facto umbrella aggregator for ~50 cohorts of cis-QTL summary statistics — GTEx v8/v10, GENCORD, BLUEPRINT, BrainSeq, ROSMAP, Quach 2016, Schmiedel 2018, Lepik 2017, and more. Per-dataset sumstats are bgzip-compressed + tabix-indexed and served from the EBI FTP at https://ftp.ebi.ac.uk/pub/databases/spot/eQTL/sumstats/<QTS>/<QTD>/<QTD>.all.tsv.gz. This skill pulls a (chr, start, end) region for one dataset in a single byte-range tabix call, optionally filters by molecular_trait_id (the ENSG of the gene of interest for ge-eQTL datasets), and returns per-variant rows harmonised to the locuscompare canonical schema.
Trigger
Fire when the user (or upstream agent step) wants:
- A regional slice of cis-eQTL summary statistics (β, SE, p-value) for variants around a gene's TSS, from one (study × tissue × quant_method) in eQTL Catalogue.
- Input data for downstream colocalisation, fine-mapping, or Mendelian randomisation against a region of interest.
- Provenance-rich, harmonised eQTL summary stats with allele orientation preserved (ALT-effect β).
Do NOT fire when the user wants:
- A point lookup of one variant in one tissue: query the GTEx Portal REST API (
https://gtexportal.org/api/v2/) directly for single-variant queries. - All eQTLs for a gene across all tissues: this skill returns one (study × tissue × quant_method) at a time. Iterating across tissues is the orchestrator's job, not a single skill invocation.
- pQTL data: eQTL Catalogue does not host pQTL summary statistics. Use UKB-PPP or deCODE for protein QTLs.
- trans-eQTL data: eQTL Catalogue's cis-window is ±1 Mb of TSS; trans-eQTL signals are at distant variants and require a different upstream (e.g., eQTLGen for blood trans).
- Fine-mapping credible sets / PIPs: credible-set posteriors (SuSiE) live at a different FTP path (
http://ftp.ebi.ac.uk/pub/databases/spot/eQTL/susie/) and require a separate skill. The nominal-pass.all.tsv.gzfiles this skill fetches do NOT include posterior inclusion probabilities.
Scope
One skill, one task. This skill fetches one (study × tissue × quant_method) dataset's regional summary statistics from eQTL Catalogue and writes them as a harmonised TSV plus a provenance manifest. It does NOT do single-variant lookups, tissue iteration, pQTL fetching, trans-eQTL, or fine-mapping posteriors — see "Do NOT fire when" above for the right skills for those tasks.
Workflow
When an agent asks for a regional cis-QTL slice from eQTL Catalogue:
- Resolve
dataset_id: the canonicalQTD######identifier. Look up via the metadata REST endpoint (https://www.ebi.ac.uk/eqtl/api/v2/datasets/?study_label=...&quant_method=...) or the eQTL Catalogue's Studies table. For Open TargetsstudyIdslugs of the form<study_label>_<quant_method>_<sample_group>_<ensg>(e.g.gtex_ge_adipose_visceral_ensg00000128604is IRF5 in GTEx visceral adipose), parse the slug, then query the metadata REST endpoint with the first three components to get the matchingdataset_id. - Pick a region:
(chromosome, start_bp, end_bp)in 1-based inclusive GRCh38 coordinates. For LocusCompare-style coloc inspection centre on the lead variant ± 500 kb; for "what does this gene's cis-window look like" queries centre on the gene TSS ± 1 Mb (the catalogue's full cis-window for that gene). - Tabix range fetch: the skill performs a single byte-range request against
<QTD>.all.tsv.gzon the EBI FTP. The REST API at/api/v2/datasets/{id}/associationsis not used for region fetches (see Gotcha #1). - Filter by
molecular_trait_id(recommended forgedatasets): the harmonised.all.tsv.gzforgequant_method bundles every gene's variants together. Pass the target ENSG to filter; without it you get every gene's rows in the window. - Write outputs to
--output <dir>/: a flatvariants.tsv(effect-allele-aligned, GRCh38, ALT-effect β), amanifest.yamlwith provenance (study_label,tissue_label,quant_method+ human-readable label,n_variants, source URL, fetched-at UTC timestamp), and areport.mdhuman-readable summary.
CLI Reference
# Standard usage with a config file
python skills/eqtl-catalogue-region-fetch/eqtl_catalogue_region_fetch.py \
--input <config.json> --output <output_dir>
# Bundled demo (SORT1 GTEx minor salivary gland; canonical 1p13.3 LDL/CHD locus)
python skills/eqtl-catalogue-region-fetch/eqtl_catalogue_region_fetch.py \
--demo sort1_gtex_minor_salivary_gland --output /tmp/sort1_demo
# List the bundled demos (3 biology cases shipped: SORT1, IL6R, IRF5)
python skills/eqtl-catalogue-region-fetch/eqtl_catalogue_region_fetch.py --list-demos
# Via ClawBio runner
python clawbio.py run eqtl-region --input <config.json>
python clawbio.py run eqtl-region --demo
Config schema (JSON or YAML):
{
"dataset_id": "QTD000266",
"molecular_trait_id": "ENSG00000134243",
"chromosome": "1",
"start_bp": 108774968,
"end_bp": 109774968
}
Example Output
Running --demo sort1_gtex_minor_salivary_gland:
info: using bundled demo sort1_gtex_minor_salivary_gland.json
eqtl-catalogue-region-fetch: 2833 variants -> /tmp/sort1_demo/variants.tsv
source: GTEx | minor salivary gland | gene expression
<output_dir>/manifest.yaml:
skill: eqtl-catalogue-region-fetch
version: 0.1.0
dataset_id: QTD000276
molecular_trait_id: ENSG00000134243
region:
chromosome: '1'
start_bp: 108774968
end_bp: 109774968
n_variants: 2833
release:
study_label: GTEx
tissue_label: minor salivary gland
condition_label: naive
sample_group: minor_salivary_gland
quant_method: ge
quant_method_label: gene expression
dataset_release: ''
fetched_at_utc: '2026-05-06T15:50:33Z'
outputs:
variants_tsv: variants.tsv
<output_dir>/variants.tsv (first three rows shown):
variant_id chromosome position_bp allele_a allele_b beta se p maf molecular_trait_id study_id
1_108774974_TCTAC_T 1 108774974 TCTAC T -0.119495 0.138769 0.390778 0.170139 ENSG00000134243 QTD000276
1_108775337_C_T 1 108775337 C T 0.0777385 0.112256 0.489859 0.3125 ENSG00000134243 QTD000276
1_108775606_G_T 1 108775606 G T -0.166496 0.212651 0.435087 0.0729167 ENSG00000134243 QTD000276
<output_dir>/report.md:
# eqtl-catalogue-region-fetch report
- **Dataset:** `QTD000276`
- **Source:** GTEx | minor salivary gland | quantification = gene expression
- **Region:** chr1:108,774,968-109,774,968
- **Molecular trait:** ENSG00000134243
- **Variants returned:** 2833
- **Output TSV:** variants.tsv
Gotchas
-
Use FTP tabix, not the REST API, for regional fetches. The eQTL Catalogue v2 REST API at
/api/v2/datasets/{id}/associationssilently truncates regional fetches to one side of TSS and ignorespos_min/pos_maxquery parameters. This skill fetches via tabix on the canonical FTP.all.tsv.gz, which serves the full strand-aware cis-window correctly. Do NOT swap the fetcher to REST. -
Cis-window is ±1 Mb of strand-aware TSS in genomic coordinates. The upstream pipeline computes cis-eQTLs only for variants within ±1 Mb of the gene's transcription start site. For
+strand genes TSS =gene.start(lower coord). For−strand genes TSS =gene.end(higher coord). When querying a window in genomic coords that extends beyond ±1 Mb of TSS, expect zero rows on the far side. This is correct biology, not a bug. -
molecular_trait_idfilter is required forgeeQTL files. The harmonisedge.all.tsv.gzbundles every gene's variant rows together. Querying a chromosomal region without a gene filter returns variants for all genes in that region (potentially thousands of rows per variant). Always pass the target Ensembl gene ID. Other quant methods (tx,txrev,exon,leafcutter) have similar bundling behavior onmolecular_trait_id(transcript / intron / exon ID). -
β is reported on the ALT allele. Do NOT compare effect sizes across datasets without explicit allele harmonisation. The skill preserves
ref/altcolumns; downstream tools (e.g., TwoSampleMRharmonise_data) flip signs when alleles are swapped. Cross-dataset comparisons (eQTL β vs GWAS β at the same variant) without harmonisation can silently invert direction. -
Quantification methods are not interchangeable.
ge(gene expression): gene-level, the most common eQTL definitiontx(transcript): per-isoform abundancetxrev(transcript usage): proportional, not abundanceexon(exon expression): per-exon read countleafcutter(splice junction): splice-QTL on intron excision ratio
These represent distinct biology. A
txrevrow is NOT ageeQTL. The skill's manifest carries the rawquant_methodcode AND a human-readable label per theCLAUDE.mdexpansion rule.
Safety
Not for clinical decisions. This skill returns research-grade summary statistics from public databases. Do not use the output for direct clinical decision-making, diagnosis, or treatment selection without independent validation by a qualified clinician.
Effect estimates may not generalise across populations. The ancestry of the source study is recorded in the dataset metadata (sample_group, population fields where present). Effect sizes from a single-ancestry study should not be assumed to apply to other ancestries without appropriate harmonisation and trans-ancestry validation.
Agent Boundary
The skill returns harmonised summary statistics (β, SE, p-value) for variants in a chromosomal window from one (study × tissue × quant_method) dataset. The agent should:
- Use the output as input to colocalisation, fine-mapping, or Mendelian randomisation tooling. These are the appropriate downstream methods for inferring causal effects.
- NOT make causal-effect claims directly from a single eQTL p-value. A low p-value at a variant means statistical association, not causation. Causal interpretation requires colocalisation or MR analysis with proper instrumental-variable assumptions.
- NOT cherry-pick variants by p-value alone. Statistical inference requires the full credible set / window context.
- NOT compare effect sizes across datasets without harmonising effect alleles. The skill normalises within one dataset; cross-dataset comparison requires a harmonisation step (e.g., TwoSampleMR
harmonise_data). - Surface tissue, quant_method, and sample size in the user-facing reply alongside any β / p-value the agent quotes. The same variant in IAV-stimulated monocytes (Quach 2016, N=198) and in resting monocytes (BLUEPRINT, N=191) is a different biological measurement, even though the genomic position is identical. Per the user-friendly enum-expansion rule (
CLAUDE.md), expand all three fields when reporting:quantification = gene expression (ge); tissue = monocyte (UBERON:0000235); n_samples = 198. - NOT silently swap tissues or quantification methods. If the user asked for
monocyte / geand the dataset ismonocyte / txrev, the agent must say so explicitly and ask whether to proceed.
Citations
- Kerimov et al. (2021). A compendium of uniformly processed human gene expression and splicing quantitative trait loci. Nat Genet 53, 1290-1299. doi:10.1038/s41588-021-00924-w
- Per-dataset citation list at https://www.ebi.ac.uk/eqtl/Studies/.
同梱ファイル
※ ZIPに含まれるファイル一覧。`SKILL.md` 本体に加え、参考資料・サンプル・スクリプトが入っている場合があります。
- 📄 SKILL.md (14,655 bytes)
- 📎 LICENSE (1,067 bytes)