Data & ReproBioinformatics & GenomicsFreedomIntelligence/OpenClaw-Medical-SkillsData & Reproduction
BI

bio-machine-learning-biomarker-discovery

Maintainer FreedomIntelligence · Last updated April 1, 2026

Identify biomarkers from omics data with LASSO, elastic net, SHAP.

OpenClawNanoClawAnalysisReproductionbio-machine-learning-biomarker-discovery🧠 bioos extended suitebioos extended bioinformatics suiteidentify

Original source

FreedomIntelligence/OpenClaw-Medical-Skills

https://github.com/FreedomIntelligence/OpenClaw-Medical-Skills/tree/main/skills/bio-machine-learning-biomarker-discovery

Maintainer
FreedomIntelligence
License
MIT
Last updated
April 1, 2026

Skill Snapshot

Key Details From SKILL.md

2 min

Key Notes

  • boruta = BorutaPy(rf, n_estimators='auto', max_iter=100, random_state=42, verbose=0) boruta.fit(X.values, y).
  • selected = X.columns[boruta.support_] tentative = X.columns[boruta.support_weak_] print(f'Selected: {len(selected)}, Tentative: {len(tentative)}').
  • feature_ranks = pd.DataFrame({ 'feature': X.columns, 'rank': boruta.ranking_, 'selected': boruta.support_ }).sort_values('rank').
  • selected_features = mrmr_classif(X=X, y=pd.Series(y), K=50) X_selected = X[selected_features].

Source Doc

Excerpt From SKILL.md

Boruta All-Relevant Selection

Identifies all features that are significantly better than random (shadow features).

from boruta import BorutaPy
from sklearn.ensemble import RandomForestClassifier
import pandas as pd
import numpy as np

rf = RandomForestClassifier(n_estimators=100, n_jobs=-1, random_state=42)

## mRMR (Minimum Redundancy Maximum Relevance)

Selects features that are individually relevant but minimally redundant with each other.

```python
from mrmr import mrmr_classif

## LASSO Feature Selection

L1 regularization drives irrelevant coefficients to zero.

```python
from sklearn.linear_model import LassoCV
from sklearn.preprocessing import StandardScaler

scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

Use cases

  • Use bio-machine-learning-biomarker-discovery for genomics and bioinformatics workflows.
  • Apply bio-machine-learning-biomarker-discovery to sequencing, variant, or omics analysis tasks.

Not for

  • Do not rely on this catalog entry alone for installation or maintenance details.

Upstream Related Skills

  • differential-expression/de-results - Pre-filter with DE genes
  • pathway-analysis/go-enrichment - Functional enrichment of selected features
  • machine-learning/omics-classifiers - Use selected features for prediction

Related skills

Related skills

Back to directory
AG
Data & ReproBioinformatics & Genomics

agent-browser

Browse the web for any task — research topics, read articles, interact with web apps, fill forms, take screenshots, extract data, and test w…

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView
AL
Data & ReproBioinformatics & Genomics

alpha-vantage

Access 20+ years of global financial data: equities, options, forex, crypto, commodities, economic indicators, and 50+ technical indicators.

Claude CodeAnalysis
K-Dense-AI/claude-scientific-skillsView
BI
Data & ReproBioinformatics & Genomics

bio-alignment-filtering

Filter alignments by flag, quality, region, or paired status.

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView
BI
Data & ReproBioinformatics & Genomics

bio-alignment-indexing

Index BAM/CRAM files with samtools index for random access.

OpenClawNanoClawAnalysis
FreedomIntelligence/OpenClaw-Medical-SkillsView