数据与复现蛋白质组与代谢组FreedomIntelligence/OpenClaw-Medical-Skills数据与复现

bio-proteomics-peptide-identification

维护者 FreedomIntelligence · 最近更新 2026年4月1日

Peptide-spectrum matching and protein identification from MS/MS data. Use when identifying peptides from tandem mass spectra. Covers database searching, spectral library matching, and FDR estimation using target-decoy approaches.

OpenClawNanoClaw分析处理复现实验bio-proteomics-peptide-identification🧬 bioinformatics (gptomics bio-* suite)bioinformatics — proteomics & metabolomicspeptide

原始来源

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-proteomics-peptide-identification

维护者: FreedomIntelligence
许可: MIT
最近更新: 2026年4月1日

技能摘要

来自 SKILL.md 的关键信息

2 min

核心说明

Python：pyopenms ，用于 in-memory database search 、 PSM handling。
CLI：comet，MSFragger，X!Tandem ，用于 high-throughput database searching。
R：MSnbase：：readMSData() ，用于 importing search results。
Identify peptides ，面向 my MS/MS spectra" → Match tandem mass spectra against protein database to identify peptide sequences，then control false discovery rate ，使用 target-decoy competition. Python：pyopenms ，用于 in-memory database search 、 PSM handling CLI：comet，MSFragger，X!Tandem ，用于 high-throughput database searching R：MSnbase：：readMSData() ，用于 importing search results。
fasta_entries = [] FASTAFile().load('uniprot_human.fasta'，fasta_entries)。

原始文档

SKILL.md 摘录

Database Search with pyOpenMS

Goal: Identify peptide sequences from tandem mass spectra by matching against a protein database.

Approach: Load a FASTA database, perform in-silico tryptic digestion to generate theoretical peptides, then match experimental spectra against theoretical fragment ion patterns to identify peptide-spectrum matches (PSMs).

from pyopenms import MSExperiment, MzMLFile, FASTAFile, ProteaseDigestion
from pyopenms import ModificationsDB, AASequence

## In-silico digestion

digestion = ProteaseDigestion()
digestion.setEnzyme('Trypsin')
digestion.setMissedCleavages(2)

peptides = []
for entry in fasta_entries:
    seq = AASequence.fromString(entry.sequence)
    result = []
    digestion.digest(seq, result)
    peptides.extend([(entry.identifier, str(p)) for p in result])

Read mzIdentML results

psms <- readMzIdData('results.mzid')

适用场景

适合在identifying peptides ，面向 tandem mass spectra时使用。

不适用场景

Do not rely on this catalog entry alone ，用于 installation 或 maintenance details。

上游相关技能

data-import - Load raw MS data before identification
protein-inference - Group peptides to proteins
ptm-analysis - Identify modified peptides

bio-metabolomics-lipidomics

Specialized lipidomics analysis for lipid identification, quantification, and pathway interpretation. Covers LC-MS lipid…

OpenClawNanoClaw分析处理

FreedomIntelligence/OpenClaw-Medical-Skills查看

数据与复现蛋白质组与代谢组

bio-metabolomics-metabolite-annotation

Metabolite identification from m/z and retention time. Covers database matching, MS/MS spectral matching, and confidence…

OpenClawNanoClaw分析处理

FreedomIntelligence/OpenClaw-Medical-Skills查看

数据与复现蛋白质组与代谢组

bio-metabolomics-msdial-preprocessing

MS-DIAL-based metabolomics preprocessing as alternative to XCMS. Covers peak detection, alignment, annotation, and expor…

OpenClawNanoClaw分析处理

FreedomIntelligence/OpenClaw-Medical-Skills查看

数据与复现蛋白质组与代谢组

bio-metabolomics-pathway-mapping

Map metabolites to biological pathways using KEGG, Reactome, and MetaboAnalyst. Perform pathway enrichment and topology…

OpenClawNanoClaw分析处理

FreedomIntelligence/OpenClaw-Medical-Skills查看