Data & ReproStatistics & Data AnalysisFreedomIntelligence/OpenClaw-Medical-SkillsData & Reproduction

tooluniverse-single-cell

Maintainer FreedomIntelligence · Last updated March 31, 2026

Production-ready single-cell and expression matrix analysis using scanpy, anndata, and scipy. Performs scRNA-seq QC, normalization, PCA, UMAP, Leiden/Louvain clustering, differential expression (Wilcoxon, t-test, DESeq2), cell type annotation, per-cell-type statistical analysis, gene-expression correlation, batch correction (Harmony), trajectory inference, and cell-cell communication analysis. NEW: Analyzes ligand-r….

OpenClawNanoClawAnalysisReproductiontooluniverse-single-cell🏥 medical & clinicalmedical toolsproduction

Original source

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/tooluniverse-single-cell

Maintainer: FreedomIntelligence
License: MIT
Last updated: March 31, 2026

Skill Snapshot

Key Details From SKILL.md

2 min

Key Notes

Comprehensive single-cell RNA-seq analysis and expression matrix processing using scanpy, anndata, scipy, and ToolUniverse. Designed for both full scRNA-seq workflows (raw counts to annotated cell types) and targeted expression-level analyses (per-cell-type DE, correlation, ANOVA, clustering).
IMPORTANT: This skill handles complex multi-workflow analysis. Most implementation details have been moved to references/ for progressive disclosure. This document focuses on high-level decision-making and workflow orchestration.
import scanpy as sc import anndata as ad import pandas as pd import numpy as np from scipy import stats from scipy.cluster.hierarchy import linkage, fcluster, dendrogram from scipy.spatial.distance import pdist from sklearn.decomposition import PCA from sklearn.preprocessing import StandardScaler from statsmodels.stats.multitest import multipletests.

Source Doc

Excerpt From SKILL.md

When to Use This Skill

Apply when users:

Have scRNA-seq data (h5ad, 10X, CSV count matrices) and want analysis
Ask about cell type identification, clustering, or annotation
Need differential expression analysis by cell type or condition
Want gene-expression correlation analysis (e.g., gene length vs expression by cell type)
Ask about PCA, UMAP, t-SNE for expression data
Need Leiden/Louvain clustering on expression matrices
Want statistical comparisons between cell types (t-test, ANOVA, fold change)
Ask about marker genes for cell populations
Need batch correction (Harmony, combat)
Want trajectory or pseudotime analysis
Ask about cell-cell communication (ligand-receptor interactions)
Questions mention "single-cell", "scRNA-seq", "cell type", "h5ad"
Questions involve immune cell types (CD4, CD8, CD14, CD19, monocytes, etc.)

BixBench Coverage: 18+ questions across 5 projects (bix-22, bix-27, bix-31, bix-33, bix-36)

NOT for (use other skills instead):

Bulk RNA-seq DESeq2 analysis only → Use tooluniverse-rnaseq-deseq2
Gene enrichment only (no expression data) → Use tooluniverse-gene-enrichment
VCF/variant analysis → Use tooluniverse-variant-analysis
Statistical modeling (regression, survival) → Use tooluniverse-statistical-modeling

Core Principles

Data-first approach - Load, inspect, and validate data before any analysis
AnnData-centric - All data flows through anndata objects for consistency
Cell type awareness - Many questions require per-cell-type subsetting and analysis
Statistical rigor - Proper normalization, multiple testing correction, effect sizes
Scanpy standard pipeline - Follow established best practices for scRNA-seq
Flexible input - Handle h5ad, 10X, CSV/TSV, pre-processed and raw data
Question-driven - Parse what the user is actually asking and extract the specific answer
Enrichment integration - Chain DE results into GO/KEGG/Reactome enrichment when requested
Large dataset support - Efficient handling of datasets with >100k cells

Optional

import harmonypy # batch correction bash pip install scanpy anndata leidenalg umap-learn harmonypy gseapy pandas numpy scipy scikit-learn statsmodels

---

Use cases

Have scRNA-seq data (h5ad, 10X, CSV count matrices) and want analysis.
Ask about cell type identification, clustering, or annotation.
Need differential expression analysis by cell type or condition.
Want gene-expression correlation analysis (e.g., gene length vs expression by cell type).

Not for

Do not rely on this catalog entry alone for installation or maintenance details.

Related skills

Back to directory

Data & ReproStatistics & Data Analysis

arxiv-database

This skill provides Python tools for searching and retrieving preprints from arXiv.org via its public Atom API. It supports keyword search,…

Claude CodeAnalysis

K-Dense-AI/claude-scientific-skillsView

Data & ReproStatistics & Data Analysis

bayesian-optimizer

Bayesian optimization for experimental design and hyperparameter tuning in biomedical research.

OpenClawNanoClawAnalysis

FreedomIntelligence/OpenClaw-Medical-SkillsView

Data & ReproStatistics & Data Analysis

bio-alignment-files-bam-statistics

Compute alignment statistics: flagstat, idxstats, coverage depth.

OpenClawNanoClawAnalysis

FreedomIntelligence/OpenClaw-Medical-SkillsView

Data & ReproStatistics & Data Analysis

bio-alignment-msa-statistics

Calculate alignment statistics including sequence identity, conservation scores, substitution matrices, and similarity metrics. Use when com…

OpenClawNanoClawAnalysis

FreedomIntelligence/OpenClaw-Medical-SkillsView