tiledbvcf

Maintainer Jeremy Leipzig · Last updated April 1, 2026

TileDB-VCF is a high-performance C++ library with Python and CLI interfaces for efficient storage and retrieval of genomic variant-call data. Built on TileDB's sparse array technology, it enables scalable ingestion of VCF/BCF files, incremental sample addition without expensive merging operations, and efficient parallel queries of variant data stored locally or in the cloud.

Tags: Claude Code, OpenClaw, NanoClaw, Analysis, Reproduction, tiledbvcf, bioinformatics, package, bioinformatics & genomics

Original source

K-Dense-AI/claude-scientific-skills

https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/tiledbvcf

Maintainer: Jeremy Leipzig
License: MIT license
Last updated: April 1, 2026

Skill Snapshot

Key Details From SKILL.md

Key Notes

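  • To target conda's osx-64 (Intel) platform, as is typically done on Apple Silicon Macs: set CONDA_SUBDIR=osx-64 when creating the environment, then run conda config --env --set subdir osx-64 in the activated environment.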

Source Doc

Excerpt From SKILL.md

When to Use This Skill

This skill should be used when:

  • Learning TileDB-VCF concepts and workflows
  • Prototyping genomics analyses and pipelines
  • Working with small-to-medium datasets (< 1000 samples)
  • Adding new samples incrementally to existing datasets
  • Querying specific genomic regions efficiently across many samples
  • Working with cloud-stored variant data (S3, Azure, GCS)
  • Exporting subsets of large VCF datasets
  • Building variant databases for cohort studies
  • Running educational projects and developing methods
  • Handling variant data operations where performance is critical
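
As a rough illustration of the ingestion, incremental-addition, and region-query workflows listed above, here is a minimal sketch using the tiledbvcf Python package. The dataset path and file names in the usage comments are hypothetical, the helper names (format_region, ingest, query) are invented for this example, and the 1-based inclusive region convention is an assumption worth checking against the TileDB-VCF documentation. The tiledbvcf import is deferred so the region helper can be used without the package installed.

```python
from typing import List, Optional


def format_region(contig: str, start: int, end: int) -> str:
    """Build a region string such as 'chr1:1000-2000' (assumed 1-based, inclusive)."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return f"{contig}:{start}-{end}"


def ingest(dataset_uri: str, vcf_uris: List[str], create: bool = False) -> None:
    """Optionally create the dataset, then ingest the given VCF/BCF files.

    Calling this again later with new files adds samples to the same
    dataset, with no merge of the existing data.
    """
    import tiledbvcf  # deferred import: only needed when actually ingesting

    ds = tiledbvcf.Dataset(dataset_uri, mode="w")
    if create:
        ds.create_dataset()
    ds.ingest_samples(sample_uris=vcf_uris)


def query(dataset_uri: str, regions: List[str], samples: Optional[List[str]] = None):
    """Read variants overlapping the regions; returns a pandas DataFrame."""
    import tiledbvcf  # deferred import: only needed when actually querying

    ds = tiledbvcf.Dataset(dataset_uri, mode="r")
    return ds.read(
        attrs=["sample_name", "contig", "pos_start", "pos_end", "alleles"],
        regions=regions,
        samples=samples,
    )


# Hypothetical usage (paths and sample files are placeholders):
# ingest("my_dataset", ["sample1.vcf.gz", "sample2.vcf.gz"], create=True)
# ingest("my_dataset", ["sample3.vcf.gz"])  # incremental addition
# df = query("my_dataset", [format_region("chr1", 1000, 2000)])
```

The dataset URI can be a local path or, per the description above, a cloud location such as an S3 URI; credentials and configuration for cloud storage are outside this sketch.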

Installation

Preferred Method: Conda/Mamba


# Create the conda environment
conda create -n tiledb-vcf "python<3.10"
conda activate tiledb-vcf
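
After activating the environment, a quick sanity check can confirm that the interpreter matches the "python<3.10" pin above and whether the tiledbvcf module is visible. This is only a sketch: the function names are made up for illustration, and because the excerpt ends before any package install step, the module check may report False until that step has been done.

```python
import importlib.util
import sys


def python_is_supported(version=None) -> bool:
    """True when the interpreter satisfies the "python<3.10" pin."""
    major_minor = tuple((version or sys.version_info)[:2])
    return major_minor < (3, 10)


def tiledbvcf_is_installed() -> bool:
    """True when the tiledbvcf module can be located (without importing it)."""
    return importlib.util.find_spec("tiledbvcf") is not None


if __name__ == "__main__":
    print("Python version OK:", python_is_supported())
    print("tiledbvcf importable:", tiledbvcf_is_installed())
```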

Use cases

  • Learning TileDB-VCF concepts and workflows.
  • Prototyping genomics analyses and pipelines.
  • Working with small-to-medium datasets (< 1000 samples).
  • Adding new samples incrementally to existing datasets.

Not for

  • Do not rely on this catalog entry alone for installation or maintenance details.
  • Do not assume this entry replaces the original database documentation or API notes.

Related skills

  • alpha-vantage (K-Dense-AI/claude-scientific-skills): Access 20+ years of global financial data: equities, options, forex, crypto, commodities, economic indicators, and 50+ technical indicators.
  • bio-alignment-msa-parsing (FreedomIntelligence/OpenClaw-Medical-Skills): Parse and analyze multiple sequence alignments using Biopython. Extract sequences, identify conserved regions, analyze gaps, work with annot…
  • bio-alignment-validation (FreedomIntelligence/OpenClaw-Medical-Skills): Validate alignment file integrity and detect truncated/corrupt files.
  • bio-atac-seq-atac-peak-calling (FreedomIntelligence/OpenClaw-Medical-Skills): Call accessible chromatin regions from ATAC-seq data using MACS3 with ATAC-specific parameters. Use when identifying open chromatin regions…