Data & ReproStatistics & Data AnalysisFreedomIntelligence/OpenClaw-Medical-SkillsData & Reproduction
TO

tooluniverse-single-cell

Maintainer FreedomIntelligence · Last updated April 1, 2026

Production-ready single-cell and expression matrix analysis using scanpy, anndata, and scipy. Performs scRNA-seq QC, normalization, PCA, UMAP, Leiden/Louvain clustering, differential expression (Wilcoxon, t-test, DESeq2), cell type annotation, per-cell-type statistical analysis, gene-expression correlation, batch correction (Harmony), trajectory inference, and cell-cell communication analysis. NEW: Analyzes ligand-r….

OpenClawNanoClawAnalysisReproductiontooluniverse-single-cell🏥 medical & clinicalmedical toolsproduction

Original source

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/tooluniverse-single-cell

Maintainer
FreedomIntelligence
License
MIT
Last updated
April 1, 2026

Skill Snapshot

Key Details From SKILL.md

2 min

Key Notes

  • Comprehensive single-cell RNA-seq analysis and expression matrix processing using scanpy, anndata, scipy, and ToolUniverse. Designed for both full scRNA-seq workflows (raw counts to annotated cell types) and targeted expression-level analyses (per-cell-type DE, correlation, ANOVA, clustering).
  • IMPORTANT: This skill handles complex multi-workflow analysis. Most implementation details have been moved to references/ for progressive disclosure. This document focuses on high-level decision-making and workflow orchestration.
  • import scanpy as sc import anndata as ad import pandas as pd import numpy as np from scipy import stats from scipy.cluster.hierarchy import linkage, fcluster, dendrogram from scipy.spatial.distance import pdist from sklearn.decomposition import PCA from sklearn.preprocessing import StandardScaler from statsmodels.stats.multitest import multipletests.

Source Doc

Excerpt From SKILL.md

When to Use This Skill

Apply when users:

  • Have scRNA-seq data (h5ad, 10X, CSV count matrices) and want analysis
  • Ask about cell type identification, clustering, or annotation
  • Need differential expression analysis by cell type or condition
  • Want gene-expression correlation analysis (e.g., gene length vs expression by cell type)
  • Ask about PCA, UMAP, t-SNE for expression data
  • Need Leiden/Louvain clustering on expression matrices
  • Want statistical comparisons between cell types (t-test, ANOVA, fold change)
  • Ask about marker genes for cell populations
  • Need batch correction (Harmony, combat)
  • Want trajectory or pseudotime analysis
  • Ask about cell-cell communication (ligand-receptor interactions)
  • Questions mention "single-cell", "scRNA-seq", "cell type", "h5ad"
  • Questions involve immune cell types (CD4, CD8, CD14, CD19, monocytes, etc.)

BixBench Coverage: 18+ questions across 5 projects (bix-22, bix-27, bix-31, bix-33, bix-36)

NOT for (use other skills instead):

  • Bulk RNA-seq DESeq2 analysis only → Use tooluniverse-rnaseq-deseq2
  • Gene enrichment only (no expression data) → Use tooluniverse-gene-enrichment
  • VCF/variant analysis → Use tooluniverse-variant-analysis
  • Statistical modeling (regression, survival) → Use tooluniverse-statistical-modeling

Core Principles

  1. Data-first approach - Load, inspect, and validate data before any analysis
  2. AnnData-centric - All data flows through anndata objects for consistency
  3. Cell type awareness - Many questions require per-cell-type subsetting and analysis
  4. Statistical rigor - Proper normalization, multiple testing correction, effect sizes
  5. Scanpy standard pipeline - Follow established best practices for scRNA-seq
  6. Flexible input - Handle h5ad, 10X, CSV/TSV, pre-processed and raw data
  7. Question-driven - Parse what the user is actually asking and extract the specific answer
  8. Enrichment integration - Chain DE results into GO/KEGG/Reactome enrichment when requested
  9. Large dataset support - Efficient handling of datasets with >100k cells

Optional

import harmonypy # batch correction bash pip install scanpy anndata leidenalg umap-learn harmonypy gseapy pandas numpy scipy scikit-learn statsmodels


---

Use cases

  • Have scRNA-seq data (h5ad, 10X, CSV count matrices) and want analysis.
  • Ask about cell type identification, clustering, or annotation.
  • Need differential expression analysis by cell type or condition.
  • Want gene-expression correlation analysis (e.g., gene length vs expression by cell type).

Not for

  • Do not rely on this catalog entry alone for installation or maintenance details.

Related skills

Related skills

Back to directory
AR
Data & ReproStatistics & Data Analysis

arxiv-database

This skill provides Python tools for searching and retrieving preprints from arXiv.org via its public Atom API. It supports keyword search,…

Claude CodeAnalysis
K-Dense-AI/claude-scientific-skillsView
BA
Data & ReproStatistics & Data Analysis

bayesian-optimizer

Bayesian optimization for experimental design and hyperparameter tuning in biomedical research.

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView
BI
Data & ReproStatistics & Data Analysis

bio-alignment-files-bam-statistics

Compute alignment statistics: flagstat, idxstats, coverage depth.

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView
BI
Data & ReproStatistics & Data Analysis

bio-alignment-msa-statistics

Calculate alignment statistics including sequence identity, conservation scores, substitution matrices, and similarity metrics. Use when com…

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView