数据与复现统计与数据分析K-Dense-AI/claude-scientific-skills数据与复现
VA

Vaex

维护者 K-Dense Inc. · 最近更新 2026年4月1日

Vaex是一个high-performance Python 库 designed ,用于 la。

Claude CodeOpenClawNanoClaw分析处理复现实验vaexdata-analysispackagedata analysis & visualization

原始来源

K-Dense-AI/claude-scientific-skills

https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/vaex

维护者
K-Dense Inc.
许可
MIT license
最近更新
2026年4月1日

技能摘要

来自 SKILL.md 的关键信息

2 min

核心说明

  • Vaex是一个high-performance Python 库 designed ,用于 lazy,out-of-core DataFrames to process 、 visualize tabular 数据集s that are too large to fit into RAM. Vaex can process over billion rows per second,enabling interactive data exploration 、 analysis on 数据集s ,支持 billions of rows。
  • df = vaex.open('large_file.hdf5') # 或.csv,.arrow,.parquet。

原始文档

SKILL.md 摘录

When to Use This Skill

Use Vaex when:

  • Processing tabular datasets larger than available RAM (gigabytes to terabytes)
  • Performing fast statistical aggregations on massive datasets
  • Creating visualizations and heatmaps of large datasets
  • Building machine learning pipelines on big data
  • Converting between data formats (CSV, HDF5, Arrow, Parquet)
  • Needing lazy evaluation and virtual columns to avoid memory overhead
  • Working with astronomical data, financial time series, or other large-scale scientific datasets

Core Capabilities

Vaex provides six primary capability areas, each documented in detail in the references directory:

1. DataFrames and Data Loading

Load and create Vaex DataFrames from various sources including files (HDF5, CSV, Arrow, Parquet), pandas DataFrames, NumPy arrays, and dictionaries. Reference references/core_dataframes.md for:

  • Opening large files efficiently
  • Converting from pandas/NumPy/Arrow
  • Working with example datasets
  • Understanding DataFrame structure

适用场景

  • Processing tabular 数据集s larger than available RAM (gigabytes to terabytes)。
  • Performing fast statistical aggregations on massive 数据集s。
  • Creating visuali。

不适用场景

  • Do not rely on this catalog entry alone ,用于 installation 或 maintenance details。

相关技能

相关技能

返回目录
AE
数据与复现统计与数据分析

aeon

aeon:Aeon是一个兼容 scikit-learn Python 工具包 ,用于 时序机器学习。 It provides state-of- -art algorithms ,用于 分类,回归,聚类,预测,异常检测,分割,、 相似性检索…

Claude CodeOpenClaw分析处理
K-Dense-AI/claude-scientific-skills查看
AR
数据与复现统计与数据分析

arxiv-database

arxiv-database:This skill provides Python tools ,用于 searching 、 retrieving preprints ,面向 arXiv.org ,通过 its public Atom A…

Claude Code分析处理
K-Dense-AI/claude-scientific-skills查看
BI
数据与复现统计与数据分析

bio-causal-genomics-fine-mapping

bio-causal-genomics-fine-mapping:Fine-mapping narrows GWAS association signals to identify likely causal variants。 Key o…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看
BI
数据与复现统计与数据分析

bio-crispr-screens-base-editing-analysis

bio-crispr-screens-base-editing-analysis:分析 base editing 、 prime editing outcomes ,涵盖 editing efficiency,bystander edits…

OpenClawNanoClaw分析处理
FreedomIntelligence/OpenClaw-Medical-Skills查看