Specialized lipidomics analysis for lipid identification, quantification, and pathway interpretation. Covers LC-MS lipid…
bio-proteomics-proteomics-qc
维护者 FreedomIntelligence · 最近更新 2026年4月1日
Quality control and assessment for proteomics data. Use when evaluating proteomics data quality before downstream analysis. Covers sample metrics, missing value patterns, replicate correlation, batch effects, and intensity distributions.
原始来源
FreedomIntelligence/OpenClaw-Medical-Skills
https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-proteomics-proteomics-qc
- 维护者
- FreedomIntelligence
- 许可
- MIT
- 最近更新
- 2026年4月1日
技能摘要
来自 SKILL.md 的关键信息
核心说明
- Python:pandas + matplotlib/seaborn ,用于 QC metrics 、 visualization。
- R:limma::plotMDS(),correlation heatmaps,CV distributions。
- Check quality of my proteomics data" → Assess data quality through identification rates,missing value patterns,replicate correlation,intensity distributions,、 batch effect detection before downstream analysis. Python:pandas + matplotlib/seaborn ,用于 QC metrics 、 visualization R:limma::plotMDS(),correlation heatmaps,CV distributions。
- sns.clustermap(intensity_matrix.corr(),cmap='RdBu_r',center=0,vmin=-1,vmax=1,figsize=(10,10),annot=False) plt.savefig('correlation_heatmap.pdf')。
原始文档
SKILL.md 摘录
Replicate Correlation
import seaborn as sns
import matplotlib.pyplot as plt
from scipy.stats import pearsonr
def replicate_correlation(intensity_matrix, sample_groups):
'''Calculate within-group correlations'''
corr_matrix = intensity_matrix.corr(method='pearson')
# Mask for within-group comparisons
results = []
for group in sample_groups.unique():
group_samples = sample_groups[sample_groups == group].index
for i, s1 in enumerate(group_samples):
for s2 in group_samples[i+1:]:
r = corr_matrix.loc[s1, s2]
results.append({'group': group, 'sample1': s1, 'sample2': s2, 'correlation': r})
return pd.DataFrame(results)
## Batch Effect Detection with PCA
**Goal:** Detect batch effects in proteomics data by testing whether processing batches explain significant variance in the principal components.
**Approach:** Impute missing values, scale the intensity matrix, run PCA, then test the association of each top PC with batch labels using one-way ANOVA.
## R: QC with limma
```r
library(limma)
library(ggplot2)
适用场景
- 适合在evaluating proteomics data quality before downstream analysis时使用。
不适用场景
- Do not rely on this catalog entry alone ,用于 installation 或 maintenance details。
上游相关技能
- data-import - Load data before QC
- quantification - Normalization after QC
- differential-abundance - Analysis after QC passes
- data-visualization/heatmaps-clustering - QC heatmaps
相关技能
相关技能
Metabolite identification from m/z and retention time. Covers database matching, MS/MS spectral matching, and confidence…
MS-DIAL-based metabolomics preprocessing as alternative to XCMS. Covers peak detection, alignment, annotation, and expor…
Map metabolites to biological pathways using KEGG, Reactome, and MetaboAnalyst. Perform pathway enrichment and topology…