Dask
Dask is a Python library for parallel and distributed computing that enables three critical capabilities: - **Larger-than-memory execution**…
Maintainer K-Dense Inc. · Last updated March 31, 2026
Perform comprehensive exploratory data analysis (EDA) on scientific data files across multiple domains. This skill provides automated file type detection, format-specific analysis, data quality assessment, and generates detailed markdown reports suitable for documentation and downstream analysis planning. **Key Capabilities:** - Automatic detection and analysis of 200+ scientific file formats - Comprehensive format-….
Original source
https://github.com/K-Dense-AI/claude-scientific-skills/tree/main/scientific-skills/exploratory-data-analysis
Skill Snapshot
Source Doc
Use this skill when:
The skill has comprehensive coverage of scientific file formats organized into six major categories:
Structure files, computational chemistry outputs, molecular dynamics trajectories, and chemical databases.
File types include: .pdb, .cif, .mol, .mol2, .sdf, .xyz, .smi, .gro, .log, .fchk, .cube, .dcd, .xtc, .trr, .prmtop, .psf, and more.
Reference file: references/chemistry_molecular_formats.md
Related skills
Dask is a Python library for parallel and distributed computing that enables three critical capabilities: - **Larger-than-memory execution**…
Comprehensive geospatial science skill covering GIS, remote sensing, spatial analysis, and ML for Earth observation across 70+ topics with 5…
GeoPandas extends pandas to enable spatial operations on geometric types. It combines the capabilities of pandas and shapely for geospatial…
NetworkX is a Python package for creating, manipulating, and analy.