数据与复现单细胞与空间组学FreedomIntelligence/OpenClaw-Medical-Skills数据与复现
SI

single-cell-multi-omics-integration

维护者 FreedomIntelligence · 最近更新 2026年4月1日

Quick-reference sheet for OmicVerse tutorials spanning MOFA, GLUE pairing, SIMBA integration, TOSICA transfer, and StaVIA cartography.

OpenClawNanoClaw分析处理复现实验single-multiomics🔬 omics & computational biologysingle-cell & spatial omicsquick

原始来源

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/single-multiomics

维护者
FreedomIntelligence
许可
MIT
最近更新
2026年4月1日

技能摘要

来自 SKILL.md 的关键信息

2 min

核心说明

  • This skill walk-through summarizes OmicVerse notebooks that cover paired 、 unpaired multi-omic integration,multi-batch embedding,reference transfer,、 trajectory cartography。
  • Data preparation:Load preprocessed AnnData objects ,用于 RNA (rna_p_n_raw.h5ad) 、 ATAC (atac_p_n_raw.h5ad) ,支持 ov.utils.read,、 initialise pyMOFA ,支持 matching omics 、 omics_name lists。
  • Model training:Call mofa_preprocess() to select highly variable features 、 run factor model ,支持 mofa_run(outfile=...),which exports learned MOFA+ factors to HDF5 model file。
  • Result inspection:Reload downstream AnnData,append factor scores ,通过 ov.single.factor_exact,、 explore factor–cluster associations ,使用 factor_correlation,get_weights,、 plotting helpers in pyMOFAART (plot_r2,plot_cor,plot_factor,plot_weights,etc.)。
  • Export workflow:Persist factors 、 weights through MOFA HDF5 artifact 、 reuse them by instantiating pyMOFAART(model_path=...) ,用于 later annotation 或 visualisation sessions。

原始文档

SKILL.md 摘录

MOFA after GLUE pairing (t_mofa_glue.ipynb)

  • Data preparation: Start from GLUE-derived embeddings (rna-emb.h5ad, atac.emb.h5ad), build a GLUE_pair object, and run correlation() to align unpaired cells before subsetting to highly variable features.
  • Model training: Instantiate pyMOFA with the aligned AnnData objects, run mofa_preprocess(), and save the joint factors through mofa_run(outfile='models/chen_rna_atac.hdf5').
  • Result inspection: Use pyMOFAART plus AnnData that now contains the GLUE embeddings to compute factors (get_factors) and visualise variance explained, factor–cluster correlations, and ranked feature weights.
  • Export workflow: Reuse the saved MOFA HDF5 model for downstream inspection; GLUE embeddings can be embedded with scvi.model.utils.mde (GPU-accelerated MDE is optional, sc.tl.umap works on CPU).
  • Dependencies & hardware: Requires both mofapy2 and the GLUE tooling (scglue, scvi-tools, pymde); GPU acceleration only affects optional MDE visualisation.

SIMBA batch integration (t_simba.ipynb)

  • Data preparation: Fetch the concatenated AnnData (simba_adata_raw.h5ad) derived from multiple pancreas studies and pass it, alongside a results directory, to pySIMBA.
  • Model training: Execute preprocess(...) to bin features and build a SIMBA-compatible graph, then call gen_graph() followed by train(num_workers=...) to launch PyTorch-BigGraph optimisation (can scale with CPU workers) and load(...) to resume trained checkpoints.
  • Result inspection: Apply batch_correction() to obtain the harmonised AnnData with SIMBA embeddings (X_simba) and visualise using mde/sc.tl.umap coloured by cell type or batch.
  • Export workflow: Training outputs reside in the workdir (e.g., result_human_pancreas/pbg/graph0); reuse them with simba_object.load(...) for later analyses.
  • Dependencies & hardware: Requires installing simba and simba_pbg (PyTorch BigGraph backend). GPU is optional; make sure adequate CPU threads and memory are available for graph training.

适用场景

  • Use single-cell-multi-omics-integration ,用于 single-cell 或 spatial omics analysis。
  • Apply single-cell-multi-omics-integration to 聚类,integration,或 trajectory workflows。

不适用场景

  • Do not rely on this catalog entry alone ,用于 installation 或 maintenance details。

相关技能

相关技能

返回目录
AN
数据与复现单细胞与空间组学

AnnData

AnnData is a Python package for handling annotated data matrices, storing experimental measurements (X) alongside observ…

Claude CodeOpenClaw分析处理
K-Dense-AI/claude-scientific-skills查看
BI
数据与复现单细胞与空间组学

bio-imaging-mass-cytometry-cell-segmentation

Cell segmentation from multiplexed tissue images. Covers deep learning (Cellpose, Mesmer) and classical approaches for n…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现单细胞与空间组学

bio-read-qc-umi-processing

Extract, process, and deduplicate reads using Unique Molecular Identifiers (UMIs) with umi_tools. Use when library prep…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现单细胞与空间组学

bio-single-cell-batch-integration

Integrate multiple scRNA-seq samples/batches using Harmony, scVI, Seurat anchors, and fastMNN. Remove technical variatio…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看