数据与复现药物发现与化学信息学FreedomIntelligence/OpenClaw-Medical-Skills数据与复现
CL

claw-ancestry-pca

维护者 FreedomIntelligence · 最近更新 2026年4月1日

Ancestry decomposition PCA against the Simons Genome Diversity Project.

OpenClawNanoClaw分析处理复现实验claw-ancestry-pca⚙️ clawbio pipelinesgenomics, ancestry & pharmacogenomicsancestry

原始来源

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/claw-ancestry-pca

维护者
FreedomIntelligence
许可
MIT
最近更新
2026年4月1日

技能摘要

来自 SKILL.md 的关键信息

2 min

核心说明

  • Place your study cohort in global genetic context by computing joint PCA against Simons Genome Diversity Project (SGDP) — 345 samples ,面向 164 populations spanning every inhabited continent。
  • Panel:PC1 vs PC2 — main population structure of your cohort。
  • Panel B:PC3 vs PC2 ,支持 regional groupings 、 confidence ellipses。
  • Panel C:PC3 vs PC1 ,支持 language/cultural groupings。
  • Panel D:Global context — your samples (circles) vs SGDP (triangles)。

原始文档

SKILL.md 摘录

Why this exists

If you ask ChatGPT to "run a PCA against a global reference panel," it will:

  • Not know which reference panel to use
  • Hallucinate PLINK flags for merging datasets with different variant sets
  • Skip IBD removal (related individuals distort PCA)
  • Not normalise contig names between your VCF and the reference
  • Produce a single scatter plot with no population labels

This skill encodes the correct methodological decisions:

  • Uses SGDP (the gold-standard reference for global diversity)
  • Handles contig normalisation (chr1 vs 1)
  • Filters to common biallelic SNPs shared between datasets
  • Removes related individuals via IBD checks
  • Produces publication-quality multi-panel figures with confidence ellipses
  • Differentiates your samples (circles) from reference (triangles)

Reference Panel

The skill bundles the SGDP v4 dataset (Mallick et al., 2016, Nature):

  • 345 samples from 164 populations
  • Whole-genome sequencing at high coverage
  • MAF > 0.1% filter applied
  • Populations span: Africa, Americas, Central/South Asia, East Asia, Europe, Middle East, Oceania

Demo (works out of the box)

The demo uses pre-computed PCA results from the Peruvian Genome Project (736 samples, 28 populations) and generates the full 4-panel figure instantly.

适用场景

  • Use claw-ancestry-pca ,用于 medicinal chemistry 、 drug-discovery work。
  • Apply claw-ancestry-pca to compound,target,或 screening workflows。

不适用场景

  • Do not rely on this catalog entry alone ,用于 installation 或 maintenance details。

相关技能

相关技能

返回目录
AA
数据与复现药物发现与化学信息学

aav-vector-design-agent

AAV vector design: capsid selection, promoter optimization, payload capacity.

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
AG
数据与复现药物发现与化学信息学

agentd-drug-discovery

AgentD autonomous drug discovery: target identification, hit finding, ADMET optimization.

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现药物发现与化学信息学

bio-clinical-databases-hla-typing

Call HLA alleles from NGS data using OptiType, HLA-HD, or arcasHLA for immunogenomics applications. Use when determining…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现药物发现与化学信息学

bio-clinical-databases-pharmacogenomics

Query PharmGKB and CPIC for drug-gene interactions, pharmacogenomic annotations, and dosing guidelines. Use when predict…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看