
bio-single-cell-doublet-detection

Maintainer FreedomIntelligence · Last updated April 1, 2026

Detect and remove doublets (multiple cells captured in one droplet) from single-cell RNA-seq data. Uses Scrublet (Python), DoubletFinder (R), and scDblFinder (R). Essential QC step before clustering to avoid artificial cell populations. Use when identifying and removing doublets from scRNA-seq data.

Tags: OpenClaw · NanoClaw · Analysis · Reproduction · bio-single-cell-doublet-detection · 🧬 bioinformatics (gptomics bio-* suite) · bioinformatics: single-cell & spatial omics · detect

Original source

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-single-cell-doublet-detection

Maintainer: FreedomIntelligence
License: MIT
Last updated: April 1, 2026

Skill Snapshot

Key Details From SKILL.md


Key Notes

  • Doublets are droplets containing two or more cells. They appear as artificial intermediate cell populations and must be removed before analysis.
  • Scanpy processing sequence: sc.pp.normalize_total(adata, target_sum=1e4); sc.pp.log1p(adata); sc.pp.highly_variable_genes(adata); sc.pp.pca(adata); sc.pp.neighbors(adata); sc.tl.umap(adata)
  • Plot doublet scores and doublet calls on the UMAP: sc.pl.umap(adata, color=['doublet_score', 'predicted_doublet'], save='_doublets.pdf')

Source Doc

Excerpt From SKILL.md

Scrublet (Python)

Goal: Detect and score doublets in scRNA-seq data using simulated doublet profiles.

Approach: Simulate artificial doublets by combining random cell pairs, embed real and simulated cells together, and score each cell's similarity to simulated doublets.

"Remove doublets from my data" → Identify droplets containing multiple cells by comparing each cell's profile to computationally simulated doublets, then filter flagged cells.
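The approach above can be illustrated with a small standard-library sketch. This is not Scrublet's implementation: Scrublet works on a PCA embedding of normalized counts and applies further corrections, whereas this toy version scores cells directly in normalized gene space. The function names (`simulate_doublets`, `normalize`, `doublet_scores`), the `sim_ratio` and `k` parameters, and the toy count vectors are all assumptions made for illustration.

```python
import math
import random


def simulate_doublets(cells, n_doublets, rng):
    """Build artificial doublets by summing the counts of random pairs of cells."""
    doublets = []
    for _ in range(n_doublets):
        a, b = rng.sample(range(len(cells)), 2)
        doublets.append([x + y for x, y in zip(cells[a], cells[b])])
    return doublets


def normalize(profile):
    """Scale counts to fractions so library size does not dominate distances."""
    total = sum(profile) or 1
    return [x / total for x in profile]


def doublet_scores(cells, k=5, sim_ratio=2, seed=0):
    """Score each observed cell by the fraction of simulated doublets
    among its k nearest neighbours in the combined pool."""
    rng = random.Random(seed)
    simulated = simulate_doublets(cells, sim_ratio * len(cells), rng)
    # Observed cells come first in the pool, simulated doublets after them.
    pool = [(normalize(c), False) for c in cells]
    pool += [(normalize(d), True) for d in simulated]
    scores = []
    for i in range(len(cells)):
        query = pool[i][0]
        neighbours = sorted(
            ((math.dist(query, p), is_sim)
             for j, (p, is_sim) in enumerate(pool) if j != i),
            key=lambda pair: pair[0],
        )[:k]
        scores.append(sum(is_sim for _, is_sim in neighbours) / k)
    return scores


# Toy data (hypothetical): 4 genes, two cell types, one mixed profile appended last.
type_a = [[10 + i % 3, 9, 0, 1] for i in range(10)]
type_b = [[0, 1, 10 + i % 3, 9] for i in range(10)]
mixed = [[10, 9, 10, 9]]
scores = doublet_scores(type_a + type_b + mixed)
print(scores[-1], max(scores[:-1]))
```

In this toy setup the mixed profile sits among simulated A+B pairs and receives the highest score. Pure cell types still receive nonzero scores because simulated A+A and B+B pairs resemble singlets after normalization, which is why this style of scoring mainly flags doublets formed from different cell types.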

Visualize Doublet Scores

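import matplotlib.pyplot as plt

# Assumes `scrub` is a Scrublet object on which scrub_doublets() has already been run
scrub.plot_histogram()
# Save the current figure (the histogram drawn above)
plt.savefig('doublet_histogram.pdf')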

DoubletFinder (R)

Goal: Detect doublets in Seurat objects using DoubletFinder's pANN-based classification.

Approach: Optimize the pK neighborhood parameter via parameter sweep, compute artificial nearest neighbor proportions, and classify cells as singlets or doublets.
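The arithmetic behind the last two steps can be sketched in a few lines. DoubletFinder itself is an R package; the Python below is only an illustration of the idea, not its code. It assumes pN is the fraction of the merged dataset that is artificial, pK is the neighborhood size as a fraction of the merged dataset, and that the cells with the highest pANN are labeled doublets up to a user-supplied expected count. The function names and the example numbers are hypothetical, and the pK sweep itself is not shown.

```python
def neighborhood_size(n_real, p_n, p_k):
    """Convert the pN and pK proportions into counts.

    Returns (number of artificial doublets to generate, neighbourhood size k).
    """
    n_artificial = round(n_real / (1 - p_n) - n_real)
    k = max(round((n_real + n_artificial) * p_k), 1)
    return n_artificial, k


def classify_by_pann(pann, n_expected):
    """Label the n_expected cells with the highest pANN as doublets."""
    ranked = sorted(range(len(pann)), key=lambda i: pann[i], reverse=True)
    doublet_idx = set(ranked[:n_expected])
    return ["Doublet" if i in doublet_idx else "Singlet" for i in range(len(pann))]


# Hypothetical numbers: 3000 real cells, pN = 0.25, pK = 0.09.
n_artificial, k = neighborhood_size(3000, 0.25, 0.09)

# Hypothetical pANN values for five cells and an assumed 40% doublet rate.
pann = [0.1, 0.8, 0.3, 0.9, 0.2]
labels = classify_by_pann(pann, n_expected=round(0.4 * len(pann)))
print(n_artificial, k, labels)
```

Because the threshold is a count rather than a score cutoff, the expected doublet number drives how many cells are removed, so it should come from the loading density of the experiment rather than from a default.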

Use cases

  • Use when identifying and removing doublets from scRNA-seq data.

Not for

  • Do not rely on this catalog entry alone for installation or maintenance details.

Upstream Related Skills

  • preprocessing - QC before doublet detection
  • clustering - Run after filtering doublets
  • data-io - Load data before processing

Related skills



AnnData

AnnData is a Python package for handling annotated data matrices, storing experimental measurements (X) alongside observation metadata (obs)…

Claude Code · OpenClaw · Analysis
K-Dense-AI/claude-scientific-skills

Arboreto

Arboreto is a computational library for inferring gene regulatory networks (GRNs) from gene expression data using paralleli.

Claude Code · OpenClaw · Analysis
K-Dense-AI/claude-scientific-skills

bio-imaging-mass-cytometry-cell-segmentation

Cell segmentation from multiplexed tissue images. Covers deep learning (Cellpose, Mesmer) and classical approaches for nuclear and whole-cel…

OpenClaw · NanoClaw · Analysis
FreedomIntelligence/OpenClaw-Medical-Skills

bio-read-qc-umi-processing

Extract, process, and deduplicate reads using Unique Molecular Identifiers (UMIs) with umi_tools. Use when library prep includes UMIs and ac…

OpenClaw · NanoClaw · Analysis
FreedomIntelligence/OpenClaw-Medical-Skills