bio-workflows-biomarker-pipeline

Maintainer FreedomIntelligence · Last updated April 1, 2026

Biomarker discovery: omics → feature selection → validation → report.

Tags: OpenClaw, NanoClaw, Analysis, Reproduction, bio-workflows-biomarker-pipeline, bioos extended suite, bioos extended bioinformatics suite, biomarker

Original source

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-workflows-biomarker-pipeline

License
MIT

Skill Snapshot

Key Details From SKILL.md

Key Notes

  • Complete pipeline from expression data to a validated biomarker panel with a trained classifier.
  • The train/test split is stratified 80/20 with a fixed seed: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, stratify=y, random_state=42).

Source Doc

Excerpt From SKILL.md

Step 1: Data Preparation

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

expr = pd.read_csv('expression.csv', index_col=0)
meta = pd.read_csv('metadata.csv', index_col=0)

X = expr.T  # samples x genes
y = meta.loc[X.index, 'condition'].values

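# Stratified 80/20 split with a fixed seed for reproducibility
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# Fit scaler on training only to prevent data leakage
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)  # learn mean/std from training samples
X_test_scaled = scaler.transform(X_test)        # reuse training mean/std on the test set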

QC Checkpoint 1: Check class balance and sample counts per group

  • At least 10 samples per class recommended
  • Classes should be reasonably balanced (majority-to-minority ratio below 3:1)
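
The two thresholds in this checkpoint can be checked programmatically before any modelling. The following is a minimal sketch, not taken from SKILL.md: the function name check_class_balance, its parameters, and the example labels are hypothetical, and only the standard library is used.

```python
from collections import Counter


def check_class_balance(labels, min_per_class=10, max_ratio=3.0):
    """Summarise class counts against the QC Checkpoint 1 thresholds.

    Hypothetical helper: the name and defaults are illustrative assumptions
    based on the thresholds listed above (>= 10 samples per class, ratio < 3:1).
    """
    counts = Counter(labels)
    if not counts:
        raise ValueError("labels is empty")
    smallest = min(counts.values())
    largest = max(counts.values())
    ratio = largest / smallest  # majority-to-minority ratio
    return {
        "counts": dict(counts),
        "ratio": ratio,
        "enough_samples": smallest >= min_per_class,
        "balanced": ratio < max_ratio,
    }


# Example with made-up labels: 24 tumor samples and 12 normal samples
labels = ["tumor"] * 24 + ["normal"] * 12
report = check_class_balance(labels)
print(report)
```

With the y array from Step 1, the call would be check_class_balance(list(y)). A failed check is a prompt to collect more samples or consider resampling, not a hard stop.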

Option A: Boruta (All-Relevant Selection)

from boruta import BorutaPy
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
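
The excerpt stops at the imports, so the upstream Boruta code is not reproduced here. Because SelectKBest and f_classif are among those imports, a univariate ANOVA F-test filter is presumably one of the selection options. The sketch below illustrates that filter only, on synthetic data; it is not the Boruta procedure and not the SKILL.md implementation. The helper select_top_features, the value of k, and the synthetic matrix are all assumptions made for illustration.

```python
import numpy as np
from sklearn.feature_selection import SelectKBest, f_classif


def select_top_features(X_train, y_train, feature_names, k=10):
    """Return the names of the k features with the highest ANOVA F-score.

    Hypothetical helper; fit on training data only so the test set
    does not influence which features are chosen.
    """
    k = min(k, X_train.shape[1])
    selector = SelectKBest(score_func=f_classif, k=k)
    selector.fit(X_train, y_train)
    mask = selector.get_support()  # boolean mask over columns
    return [name for name, keep in zip(feature_names, mask) if keep]


# Synthetic stand-in for a scaled training matrix (samples x genes)
rng = np.random.default_rng(0)
n_samples, n_genes = 60, 50
X_demo = rng.normal(size=(n_samples, n_genes))
y_demo = np.array([0] * 30 + [1] * 30)
X_demo[y_demo == 1, :3] += 3.0  # make the first three genes informative

gene_names = [f"gene_{i}" for i in range(n_genes)]
selected = select_top_features(X_demo, y_demo, gene_names, k=3)
print(selected)
```

In the pipeline itself the inputs would be X_train_scaled, y_train, and X.columns from Step 1. Fitting the selector on the training split only follows the same leakage rule as the scaler.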

Use cases

  • Use bio-workflows-biomarker-pipeline for genomics and bioinformatics workflows.
  • Apply bio-workflows-biomarker-pipeline to sequencing, variant, or omics analysis tasks.

Not for

  • Do not rely on this catalog entry alone for installation or maintenance details.

Upstream Related Skills

  • machine-learning/biomarker-discovery - Detailed feature selection methods
  • machine-learning/model-validation - Nested CV implementation details
  • machine-learning/omics-classifiers - Classifier options and tuning
  • machine-learning/prediction-explanation - SHAP and LIME interpretation
  • differential-expression/de-results - Pre-filter with DE genes
  • pathway-analysis/go-enrichment - Functional enrichment of biomarkers
