Data & ReproDrug Discovery & CheminformaticsFreedomIntelligence/OpenClaw-Medical-SkillsData & Reproduction

claw-metagenomics

Maintainer FreedomIntelligence · Last updated April 1, 2026

Shotgun metagenomics profiling — taxonomy, resistome, and functional pathways.

OpenClawNanoClawAnalysisReproductionclaw-metagenomics⚙️ clawbio pipelinesgenomics, ancestry & pharmacogenomicsshotgun

Original source

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/claw-metagenomics

Skill Snapshot

2 min

Comprehensive shotgun metagenomics analysis combining taxonomic classification, antimicrobial resistance gene detection, and functional pathway profiling from paired-end FASTQ files.
python metagenomics_profiler.py \ --r1 sample_R1.fastq.gz \ --r2 sample_R2.fastq.gz \ --output metagenomics_report.

Source Doc

Takes paired-end FASTQ files (R1, R2) or a single concatenated FASTQ as input
Runs Kraken2 taxonomic classification against a standard database (e.g., Standard-8, PlusPF)
Refines abundances with Bracken at species level (read re-estimation)
Detects antimicrobial resistance genes with RGI against the CARD database
Classifies detected ARGs by WHO critical priority pathogen association
Optionally runs HUMAnN3 for functional pathway profiling (MetaCyc + UniRef)
Generates three publication-quality figures:
- Figure 1: Taxonomy bar chart — top 20 species by relative abundance
- Figure 2: Resistome heatmap — ARG families by drug class with abundance
- Figure 3: WHO-critical ARG summary — priority-tier breakdown of detected resistance genes
Produces a full reproducibility bundle (commands.sh, environment.yml, checksums.sha256)

If you ask a general AI to "analyse a metagenome," it will:

This skill encodes the correct methodological decisions:

Kraken2 confidence threshold of 0.2 (reduces false positives in environmental samples)
Bracken re-estimation at species level with minimum 10 reads
RGI MAIN with "Perfect" and "Strict" hit criteria only (no "Loose" hits)
WHO Critical Priority Pathogen list mapped to detected ARG families
HUMAnN3 with MetaCyc stratification for pathway-level functional context
Thread count auto-detected from available CPUs
Full reproducibility bundle for every run

The skill works with any shotgun metagenome but has been validated on:

Peru sewage metagenomics study (6 samples, 3 collection sites: Lima, Cusco, Iquitos)
Environmental sewage samples with mixed microbial communities
Read depths ranging from 2M to 15M paired-end reads per sample