Data & ReproStatistics & Data AnalysisFreedomIntelligence/OpenClaw-Medical-SkillsData & Reproduction
BI

bio-metabolomics-normalization-qc

Maintainer FreedomIntelligence · Last updated April 1, 2026

Quality control and normalization for metabolomics data. Covers QC-based correction, batch effect removal, and data transformation methods. Use when correcting technical variation in metabolomics data before statistical analysis.

OpenClawNanoClawAnalysisReproductionbio-metabolomics-normalization-qc🧬 bioinformatics (gptomics bio-* suite)bioinformatics — proteomics & metabolomicsquality

Original source

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-metabolomics-normalization-qc

Maintainer
FreedomIntelligence
License
MIT
Last updated
April 1, 2026

Skill Snapshot

Key Details From SKILL.md

2 min

Key Notes

  • data <- read.csv('feature_table.csv', row.names = 1) sample_info <- read.csv('sample_info.csv').
  • qc_samples <- sample_info$sample_name[sample_info$sample_type == 'QC'] bio_samples <- sample_info$sample_name[sample_info$sample_type!= 'QC'].
  • data_qc <- data[qc_samples, ] data_bio <- data[bio_samples, ].

Source Doc

Excerpt From SKILL.md

Load and Inspect Data

Goal: Load the feature table and sample metadata, separating QC and biological samples for downstream processing.

Approach: Read CSV files, partition by sample type, and assess missing value prevalence.

"Normalize my metabolomics data and correct for batch effects" → Apply QC-based signal correction, handle missing values, transform intensities, and assess normalization quality via RSD and PCA.

library(tidyverse)
library(pcaMethods)

## Missing value summary

missing_pct <- colMeans(is.na(data)) * 100
cat('Features with >50% missing:', sum(missing_pct > 50), '\n')

QC-Based Normalization (QC-RSC)

Goal: Remove injection-order-dependent signal drift using QC sample trends.

Approach: Fit a LOESS curve to QC sample intensities over injection order, then correct all samples by dividing by the predicted drift and rescaling to the QC median.

Use cases

  • Use when correcting technical variation in metabolomics data before statistical analysis.

Not for

  • Do not rely on this catalog entry alone for installation or maintenance details.

Upstream Related Skills

  • xcms-preprocessing - Generate feature table
  • statistical-analysis - Downstream analysis
  • differential-expression/batch-correction - Similar concepts

Related skills

Related skills

Back to directory
AR
Data & ReproStatistics & Data Analysis

arxiv-database

This skill provides Python tools for searching and retrieving preprints from arXiv.org via its public Atom API. It supports keyword search,…

Claude CodeAnalysis
K-Dense-AI/claude-scientific-skillsView
BA
Data & ReproStatistics & Data Analysis

bayesian-optimizer

Bayesian optimization for experimental design and hyperparameter tuning in biomedical research.

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView
BI
Data & ReproStatistics & Data Analysis

bio-alignment-files-bam-statistics

Compute alignment statistics: flagstat, idxstats, coverage depth.

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView
BI
Data & ReproStatistics & Data Analysis

bio-alignment-msa-statistics

Calculate alignment statistics including sequence identity, conservation scores, substitution matrices, and similarity metrics. Use when com…

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView