tiledbvcf

Maintainer: Jeremy Leipzig · Last updated: April 1, 2026

TileDB-VCF is a high-performance C++ library with Python and CLI interfaces for efficient storage and retrieval of genomic variant-call data. Built on TileDB's sparse array technology, it enables scalable ingestion of VCF/BCF files, incremental sample addition without expensive merging operations, and efficient parallel queries of variant data stored locally or in the cloud.

Tags: Claude Code, OpenClaw, NanoClaw, analysis and processing, reproducible experiments, tiledbvcf, bioinformatics, package, bioinformatics & genomics

Original source

K-Dense-AI/claude-scientific-skills

https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/tiledbvcf

Maintainer: Jeremy Leipzig
License: MIT license
Last updated: April 1, 2026

Skill summary

Key information from SKILL.md

Core notes

  • TileDB-VCF is a high-performance C++ library with Python and CLI interfaces for efficient storage and retrieval of genomic variant-call data. Built on TileDB's sparse array technology, it enables scalable ingestion of VCF/BCF files, incremental sample addition without expensive merging operations, and efficient parallel queries of variant data stored locally or in the cloud.
  • CONDA_SUBDIR=osx-64 conda config --env --set subdir osx-64
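The incremental ingestion described in the core notes can be sketched roughly as follows. This is a minimal illustration, not the skill's own code: the helper names `batch_samples` and `ingest_incrementally`, the batch size, and any paths passed in are hypothetical, and the sketch assumes the `tiledbvcf` Python package is installed and that the input files are compressed, indexed VCF/BCF files.

```python
from typing import Iterable, List


def batch_samples(sample_uris: Iterable[str], batch_size: int) -> List[List[str]]:
    """Split VCF/BCF paths into fixed-size batches (hypothetical helper)."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    uris = list(sample_uris)
    return [uris[i:i + batch_size] for i in range(0, len(uris), batch_size)]


def ingest_incrementally(dataset_uri: str, sample_uris: Iterable[str],
                         batch_size: int = 10, create: bool = False) -> int:
    """Append samples to a TileDB-VCF dataset batch by batch.

    Returns the number of batches ingested. Assumes the tiledbvcf package
    is available; it is imported lazily so batch_samples works without it.
    """
    import tiledbvcf

    ds = tiledbvcf.Dataset(uri=dataset_uri, mode="w")
    if create:
        # Only needed the first time the dataset is written.
        ds.create_dataset()
    batches = batch_samples(sample_uris, batch_size)
    for batch in batches:
        # Each call adds samples to what is already stored; no merge step.
        ds.ingest_samples(sample_uris=batch)
    return len(batches)
```

A later run can call `ingest_incrementally` again on the same dataset URI with `create=False` to add newly sequenced samples without rewriting the existing ones.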

Original documentation

SKILL.md excerpt

When to Use This Skill

This skill should be used when:

  • Learning TileDB-VCF concepts and workflows
  • Prototyping genomics analyses and pipelines
  • Working with small-to-medium datasets (< 1000 samples)
  • Adding new samples incrementally to existing datasets
  • Querying specific genomic regions efficiently across many samples
  • Working with cloud-stored variant data (S3, Azure, GCS)
  • Exporting subsets of large VCF datasets
  • Building variant databases for cohort studies
  • Educational projects and method development
  • Running variant data operations where performance is critical
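As a rough illustration of the region-query use case in the list above, the sketch below reads selected attributes for one or more genomic regions into a DataFrame. The helper names `format_region` and `query_regions` and the default attribute list are assumptions made for this example; it also assumes the `tiledbvcf` Python package is installed and that the dataset URI (a local path or a cloud URI) points at an existing dataset.

```python
from typing import Optional, Sequence


def format_region(contig: str, start: int, end: int) -> str:
    """Build a 'contig:start-end' region string (hypothetical helper)."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return f"{contig}:{start}-{end}"


def query_regions(dataset_uri: str,
                  regions: Sequence[str],
                  samples: Optional[Sequence[str]] = None,
                  attrs: Sequence[str] = ("sample_name", "contig",
                                          "pos_start", "pos_end", "alleles")):
    """Read variants overlapping the given regions.

    Passing samples=None leaves the sample filter unset. Assumes the
    tiledbvcf package is available; it is imported lazily so that
    format_region can be used without it.
    """
    import tiledbvcf

    ds = tiledbvcf.Dataset(uri=dataset_uri, mode="r")
    return ds.read(
        attrs=list(attrs),
        regions=list(regions),
        samples=list(samples) if samples is not None else None,
    )
```

For example, `query_regions("my_dataset", [format_region("chr1", 1000, 2000)])` would query a hypothetical local dataset named `my_dataset`; the resulting DataFrame can then be filtered or written out as a subset.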

Installation

Preferred Method: Conda/Mamba


# Create the conda environment
conda create -n tiledb-vcf "python<3.10"
conda activate tiledb-vcf
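Once the environment is active, a quick sanity check can confirm that the interpreter matches the `python<3.10` constraint and whether the package can be imported. This is only a sketch: `check_environment` is a hypothetical helper, and because this excerpt stops before the package installation step, `package_found` may be reported as False until that step has been completed.

```python
import importlib.util
import sys


def check_environment(max_python=(3, 10), package="tiledbvcf"):
    """Report whether the interpreter and package match the setup above."""
    version = sys.version_info[:2]
    return {
        "python_version": "%d.%d" % version,
        # True when the running interpreter is older than max_python.
        "python_ok": version < max_python,
        # True when the package can be located on the import path.
        "package_found": importlib.util.find_spec(package) is not None,
    }


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```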

When to use

  • Learning TileDB-VCF concepts and workflows
  • Prototyping genomics analyses and pipelines
  • Working with small-to-medium datasets (< 1000 samples)
  • Adding new samples incrementally to existing datasets

When not to use

  • Do not rely on this catalog entry alone for installation or maintenance details
  • Do not assume this entry replaces the original database documentation or API notes
