AnnData
AnnData is a Python package for handling annotated data matrices, storing experimental measurements (X) alongside observ…
维护者 FreedomIntelligence · 最近更新 2026年4月1日
Detect and remove doublets (multiple cells captured in one droplet) from single-cell RNA-seq data. Uses Scrublet (Python), DoubletFinder (R), and scDblFinder (R). Essential QC step before clustering to avoid artificial cell populations. Use when identifying and removing doublets from scRNA-seq data.
原始来源
https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-single-cell-doublet-detection
技能摘要
原始文档
Goal: Detect and score doublets in scRNA-seq data using simulated doublet profiles.
Approach: Simulate artificial doublets by combining random cell pairs, embed real and simulated cells together, and score each cell's similarity to simulated doublets.
"Remove doublets from my data" → Identify droplets containing multiple cells by comparing each cell's profile to computationally simulated doublets, then filter flagged cells.
import matplotlib.pyplot as plt
scrub.plot_histogram()
plt.savefig('doublet_histogram.pdf')
## DoubletFinder (R)
**Goal:** Detect doublets in Seurat objects using DoubletFinder's pANN-based classification.
**Approach:** Optimize the pK neighborhood parameter via parameter sweep, compute artificial nearest neighbor proportions, and classify cells as singlets or doublets.
相关技能
AnnData is a Python package for handling annotated data matrices, storing experimental measurements (X) alongside observ…
Arboreto is a computational library for inferring gene regulatory networks (GRNs) from gene expression data using parall…
Cell segmentation from multiplexed tissue images. Covers deep learning (Cellpose, Mesmer) and classical approaches for n…
Extract, process, and deduplicate reads using Unique Molecular Identifiers (UMIs) with umi_tools. Use when library prep…