AnnData
AnnData is a Python package for handling annotated data matrices, storing experimental measurements (X) alongside observation metadata (obs)…
Maintainer K-Dense Inc. · Last updated April 1, 2026
Geniml is a Python package for building machine learning models on genomic interval data from BED files. It provides unsupervised methods for learning embeddings of genomic regions, single cells, and metadata labels, enabling similarity searches, clustering, and downstream ML tasks.
Original source
https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/geniml
Skill Snapshot
Source Doc
Install geniml using uv:
For ML dependencies (PyTorch, etc.):
Development version from GitHub:
Geniml provides five primary capabilities, each detailed in dedicated reference files:
Train unsupervised embeddings of genomic regions using word2vec-style learning.
Use for: Dimensionality reduction of BED files, region similarity analysis, feature vectors for downstream ML.
Workflow:
Reference: See references/region2vec.md for detailed workflow, parameters, and examples.
Related skills
AnnData is a Python package for handling annotated data matrices, storing experimental measurements (X) alongside observation metadata (obs)…
Arboreto is a computational library for inferring gene regulatory networks (GRNs) from gene expression data using paralleli.
Cell segmentation from multiplexed tissue images. Covers deep learning (Cellpose, Mesmer) and classical approaches for nuclear and whole-cel…
Extract, process, and deduplicate reads using Unique Molecular Identifiers (UMIs) with umi_tools. Use when library prep includes UMIs and ac…