arxiv-database
arxiv-database:This skill provides Python tools ,用于 searching 、 retrieving preprints ,面向 arXiv.org ,通过 its public Atom A…
维护者 FreedomIntelligence · 最近更新 2026年4月1日
bio-sequence-statistics:Calculate sequence statistics (N50,length distribution,GC content,summary reports) ,使用 Biopython。 适合在analyzing sequence 数据集s,generating QC reports,或 comparing assemblies时使用。
原始来源
https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-sequence-statistics
技能摘要
原始文档
from collections import Counter
lengths = [len(r.seq) for r in SeqIO.parse('sequences.fasta', 'fasta')]
## Comprehensive Summary Report
**Goal:** Generate a complete QC summary (counts, lengths, N50, GC) for any FASTA file.
**Approach:** Load all records, compute length and GC arrays, derive N50/L50 from cumulative sorted lengths, and package into a dictionary.
**Reference (BioPython 1.83+):**
## Compare Multiple Assemblies
**Goal:** Generate a side-by-side comparison table of key metrics across multiple assembly files.
**Approach:** Run `sequence_summary` on each file and format results into an aligned table.
**Reference (BioPython 1.83+):**
相关技能
arxiv-database:This skill provides Python tools ,用于 searching 、 retrieving preprints ,面向 arXiv.org ,通过 its public Atom A…
bayesian-optimizer:Bayesian optimization ,用于 experimental design 、 hyperparameter tuning in biomedical research。
bio-alignment-files-bam-statistics:Compute alignment statistics:flagstat,idxstats,coverage depth。
bio-alignment-msa-statistics:Calculate alignment statistics ,涵盖 sequence identity,conservation scores,substitution matri…