数据与复现蛋白质组与代谢组FreedomIntelligence/OpenClaw-Medical-Skills数据与复现
BI

bio-proteomics-peptide-identification

维护者 FreedomIntelligence · 最近更新 2026年4月1日

Peptide-spectrum matching and protein identification from MS/MS data. Use when identifying peptides from tandem mass spectra. Covers database searching, spectral library matching, and FDR estimation using target-decoy approaches.

OpenClawNanoClaw分析处理复现实验bio-proteomics-peptide-identification🧬 bioinformatics (gptomics bio-* suite)bioinformatics — proteomics & metabolomicspeptide

原始来源

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-proteomics-peptide-identification

维护者
FreedomIntelligence
许可
MIT
最近更新
2026年4月1日

技能摘要

来自 SKILL.md 的关键信息

2 min

核心说明

  • Python:pyopenms ,用于 in-memory database search 、 PSM handling。
  • CLI:comet,MSFragger,X!Tandem ,用于 high-throughput database searching。
  • R:MSnbase::readMSData() ,用于 importing search results。
  • Identify peptides ,面向 my MS/MS spectra" → Match tandem mass spectra against protein database to identify peptide sequences,then control false discovery rate ,使用 target-decoy competition. Python:pyopenms ,用于 in-memory database search 、 PSM handling CLI:comet,MSFragger,X!Tandem ,用于 high-throughput database searching R:MSnbase::readMSData() ,用于 importing search results。
  • fasta_entries = [] FASTAFile().load('uniprot_human.fasta',fasta_entries)。

原始文档

SKILL.md 摘录

Database Search with pyOpenMS

Goal: Identify peptide sequences from tandem mass spectra by matching against a protein database.

Approach: Load a FASTA database, perform in-silico tryptic digestion to generate theoretical peptides, then match experimental spectra against theoretical fragment ion patterns to identify peptide-spectrum matches (PSMs).

from pyopenms import MSExperiment, MzMLFile, FASTAFile, ProteaseDigestion
from pyopenms import ModificationsDB, AASequence

## In-silico digestion

digestion = ProteaseDigestion()
digestion.setEnzyme('Trypsin')
digestion.setMissedCleavages(2)

peptides = []
for entry in fasta_entries:
    seq = AASequence.fromString(entry.sequence)
    result = []
    digestion.digest(seq, result)
    peptides.extend([(entry.identifier, str(p)) for p in result])

Read mzIdentML results

psms <- readMzIdData('results.mzid')

适用场景

  • 适合在identifying peptides ,面向 tandem mass spectra时使用。

不适用场景

  • Do not rely on this catalog entry alone ,用于 installation 或 maintenance details。

上游相关技能

  • data-import - Load raw MS data before identification
  • protein-inference - Group peptides to proteins
  • ptm-analysis - Identify modified peptides

相关技能

相关技能

返回目录
BI
数据与复现蛋白质组与代谢组

bio-metabolomics-lipidomics

Specialized lipidomics analysis for lipid identification, quantification, and pathway interpretation. Covers LC-MS lipid…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现蛋白质组与代谢组

bio-metabolomics-metabolite-annotation

Metabolite identification from m/z and retention time. Covers database matching, MS/MS spectral matching, and confidence…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现蛋白质组与代谢组

bio-metabolomics-msdial-preprocessing

MS-DIAL-based metabolomics preprocessing as alternative to XCMS. Covers peak detection, alignment, annotation, and expor…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现蛋白质组与代谢组

bio-metabolomics-pathway-mapping

Map metabolites to biological pathways using KEGG, Reactome, and MetaboAnalyst. Perform pathway enrichment and topology…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看