数据与复现蛋白质组与代谢组FreedomIntelligence/OpenClaw-Medical-Skills数据与复现

bio-proteomics-proteomics-qc

维护者 FreedomIntelligence · 最近更新 2026年4月1日

Quality control and assessment for proteomics data. Use when evaluating proteomics data quality before downstream analysis. Covers sample metrics, missing value patterns, replicate correlation, batch effects, and intensity distributions.

OpenClawNanoClaw分析处理复现实验bio-proteomics-proteomics-qc🧬 bioinformatics (gptomics bio-* suite)bioinformatics — proteomics & metabolomicsquality

原始来源

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-proteomics-proteomics-qc

维护者: FreedomIntelligence
许可: MIT
最近更新: 2026年4月1日

技能摘要

来自 SKILL.md 的关键信息

2 min

核心说明

Python：pandas + matplotlib/seaborn ，用于 QC metrics 、 visualization。
R：limma：：plotMDS()，correlation heatmaps，CV distributions。
Check quality of my proteomics data" → Assess data quality through identification rates，missing value patterns，replicate correlation，intensity distributions，、 batch effect detection before downstream analysis. Python：pandas + matplotlib/seaborn ，用于 QC metrics 、 visualization R：limma：：plotMDS()，correlation heatmaps，CV distributions。
sns.clustermap(intensity_matrix.corr()，cmap='RdBu_r'，center=0，vmin=-1，vmax=1，figsize=(10，10)，annot=False) plt.savefig('correlation_heatmap.pdf')。

原始文档

SKILL.md 摘录

Replicate Correlation

import seaborn as sns
import matplotlib.pyplot as plt
from scipy.stats import pearsonr

def replicate_correlation(intensity_matrix, sample_groups):
    '''Calculate within-group correlations'''
    corr_matrix = intensity_matrix.corr(method='pearson')

    # Mask for within-group comparisons
    results = []
    for group in sample_groups.unique():
        group_samples = sample_groups[sample_groups == group].index
        for i, s1 in enumerate(group_samples):
            for s2 in group_samples[i+1:]:
                r = corr_matrix.loc[s1, s2]
                results.append({'group': group, 'sample1': s1, 'sample2': s2, 'correlation': r})

    return pd.DataFrame(results)

## Batch Effect Detection with PCA

**Goal:** Detect batch effects in proteomics data by testing whether processing batches explain significant variance in the principal components.

**Approach:** Impute missing values, scale the intensity matrix, run PCA, then test the association of each top PC with batch labels using one-way ANOVA.

## R: QC with limma

```r
library(limma)
library(ggplot2)

适用场景

适合在evaluating proteomics data quality before downstream analysis时使用。

不适用场景

Do not rely on this catalog entry alone ，用于 installation 或 maintenance details。

上游相关技能

data-import - Load data before QC
quantification - Normalization after QC
differential-abundance - Analysis after QC passes
data-visualization/heatmaps-clustering - QC heatmaps

bio-metabolomics-lipidomics

Specialized lipidomics analysis for lipid identification, quantification, and pathway interpretation. Covers LC-MS lipid…

OpenClawNanoClaw分析处理

FreedomIntelligence/OpenClaw-Medical-Skills查看

数据与复现蛋白质组与代谢组

bio-metabolomics-metabolite-annotation

Metabolite identification from m/z and retention time. Covers database matching, MS/MS spectral matching, and confidence…

OpenClawNanoClaw分析处理

FreedomIntelligence/OpenClaw-Medical-Skills查看

数据与复现蛋白质组与代谢组

bio-metabolomics-msdial-preprocessing

MS-DIAL-based metabolomics preprocessing as alternative to XCMS. Covers peak detection, alignment, annotation, and expor…

OpenClawNanoClaw分析处理

FreedomIntelligence/OpenClaw-Medical-Skills查看

数据与复现蛋白质组与代谢组

bio-metabolomics-pathway-mapping

Map metabolites to biological pathways using KEGG, Reactome, and MetaboAnalyst. Perform pathway enrichment and topology…

OpenClawNanoClaw分析处理

FreedomIntelligence/OpenClaw-Medical-Skills查看