数据与复现科研绘图与可视化FreedomIntelligence/OpenClaw-Medical-Skills数据与复现
BI

bio-workflows-crispr-screen-pipeline

维护者 FreedomIntelligence · 最近更新 2026年4月1日

bio-workflows-crispr-screen-pipeline:CRISPR screen:guide counting → MAGeCK → hit calling → visualization。

OpenClawNanoClaw分析处理写作整理bio-workflows-crispr-screen-pipeline🧠 bioos extended suitebioos extended bioinformatics suitecrispr

原始来源

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-workflows-crispr-screen-pipeline

维护者
FreedomIntelligence
许可
MIT
最近更新
2026年4月1日

技能摘要

来自 SKILL.md 的关键信息

2 min

核心说明

  • mageck count \ -l 库.csv \ -n experiment \ --sample-label Day0,Day14_Rep1,Day14_Rep2,Day14_Rep3 \ --fastq Day0.fastq.gz Day14_Rep1.fastq.gz Day14_Rep2.fastq.gz Day14_Rep3.fastq.gz \ --trim-5 0 \ --pdf-report。
  • def calculate_gini(x):x = np.sort(x[x > 0]) n = len(x) cumsum = np.cumsum(x) return (2 np.sum((np.arange(1,n+1) x)) - (n + 1) cumsum[-1])、(n cumsum[-1])。

原始文档

SKILL.md 摘录

Step 2: Quality Control

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

counts = pd.read_csv('experiment.count.txt', sep='\t', index_col=0)
counts_numeric = counts.iloc[:, 1:]

qc_stats = {}
for col in counts_numeric.columns:
    total = counts_numeric[col].sum()
    zeros = (counts_numeric[col] == 0).sum()
    gini = calculate_gini(counts_numeric[col].values)
    qc_stats[col] = {'total_reads': total, 'zero_count_guides': zeros, 'gini': gini}

qc_df = pd.DataFrame(qc_stats).T
print('QC Summary:')
print(qc_df)

## QC thresholds

assert qc_df['zero_count_guides'].max() < len(counts) * 0.2, 'Too many zero-count guides'
assert qc_df['gini'].max() < 0.4, 'Gini index too high (uneven distribution)'
print('QC passed!')

For dropout/negative selection screens

mageck test
-k experiment.count.txt
-t Day14_Rep1,Day14_Rep2,Day14_Rep3
-c Day0
-n negative_screen
--pdf-report
--gene-lfc-method alphamedian

适用场景

  • Use bio-workflows-crispr-screen-pipeline to prepare 论文级图表。
  • Apply bio-workflows-crispr-screen-pipeline when results need clear visual communication。

不适用场景

  • Do not rely on this catalog entry alone ,用于 installation 或 maintenance details。

上游相关技能

  • crispr-screens/screen-qc - Detailed QC metrics
  • crispr-screens/mageck-analysis - MAGeCK parameters
  • crispr-screens/hit-calling - Hit calling methods
  • crispr-screens/crispresso-editing - Individual editing analysis
  • crispr-screens/library-design - sgRNA selection and library design
  • crispr-screens/batch-correction - Multi-batch normalization

相关技能

相关技能

返回目录
BI
数据与复现科研绘图与可视化

bio-chipseq-visualization

bio-chipseq-visualization:可视化 ChIP-seq data ,使用 deepTools,Gviz,、 ChIPseeker。 创建 heatmaps,profile plots,、 genome browser…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现科研绘图与可视化

bio-consensus-sequences

bio-consensus-sequences:生成 consensus FASTA sequences by applying VCF variants to reference ,使用 bcftools consensus。 适合在cr…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现科研绘图与可视化

bio-copy-number-cnv-visualization

bio-copy-number-cnv-visualization:可视化 copy number profiles,segments,、 compare across samples。 创建 publication-quality plo…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现科研绘图与可视化

bio-data-visualization-circos-plots

bio-data-visualization-circos-plots:Circular genome visualization ,支持 Circos 或 pycirclize。

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看